Apache Spark SQL

GPTKB entity

Statements (50)
Predicate Object
gptkbp:instanceOf gptkb:software
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:feature gptkb:Catalyst_optimizer
gptkb:Tungsten_execution_engine
Columnar storage support
Cost-based optimization
Integration with BI tools
Schema inference
Streaming support via Structured Streaming
Support for ACID transactions via Delta Lake
Support for ANSI SQL
Support for Avro, JSON, CSV, and other formats
Support for UDFs
Support for aggregations
Support for data source connectors
Support for joins
Support for partitioning
Support for subqueries
Support for user-defined table functions (UDTFs)
Support for window functions
Unified DataFrame and Dataset API
Support for user-defined aggregate functions (UDAFs)
gptkbp:firstReleased 2014
https://www.w3.org/2000/01/rdf-schema#label Apache Spark SQL
gptkbp:integratesWith gptkb:JDBC
gptkb:ODBC
gptkb:Apache_Hive
gptkb:Apache_Parquet
gptkb:Apache_ORC
gptkb:Thrift_Server
gptkbp:latestReleaseVersion 3.5.0
gptkbp:license gptkb:Apache_License_2.0
gptkbp:operatingSystem Cross-platform
gptkbp:partOf gptkb:Apache_Spark
gptkbp:programmingLanguage gptkb:Java
gptkb:Python
gptkb:Scala
R
SQL
gptkbp:supports gptkb:DataFrame_API
SQL queries
Hive compatibility
structured data processing
gptkbp:usedFor big data analytics
data warehousing
ETL
interactive querying
gptkbp:website https://spark.apache.org/sql/
gptkbp:bfsParent gptkb:Reynold_Xin
gptkbp:bfsLayer 6