gptkbp:feature
|
gptkb:Catalyst_query_optimizer
gptkb:Data_source_API
gptkb:Tungsten_execution_engine
Columnar storage support
Distributed SQL query execution
In-memory computation
Integration with BI tools
Schema inference
Streaming support via Structured Streaming
Support for user-defined functions (UDFs)
Unified DataFrame and Dataset API
Support for JSON, Avro, Parquet, ORC, CSV, JDBC data sources
|