Platform Extension Framework — High-Performance Federated Data Access
Read and write Text, Avro, JSON, Parquet, ORC, and SequenceFile data in HDFS. Query Hive tables and HBase.
Access data on AWS S3, Azure Blob Storage, Google Cloud Storage, and MinIO with the same SQL interface.
Federated queries against PostgreSQL, MySQL, Oracle, Trino, and any JDBC-compatible database.
Parallel data access across all Cloudberry segments. Filter pushdown and column projection minimize data transfer.
Build custom connectors with the PXF plugin API. Register your own profiles to access any data source.
Kerberos authentication, user impersonation, configurable memory and thread management.