Spark SQL

GPTKB entity

Statements (38)
Predicate Object
gptkbp:instanceOf gptkb:software
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:feature gptkb:Catalyst_query_optimizer
gptkb:Data_source_API
gptkb:Tungsten_execution_engine
Columnar storage support
Distributed SQL query execution
In-memory computation
Integration with BI tools
Schema inference
Streaming support via Structured Streaming
Support for user-defined functions (UDFs)
Unified DataFrame and Dataset API
Support for JSON, Avro, Parquet, ORC, CSV, JDBC data sources
https://www.w3.org/2000/01/rdf-schema#label Spark SQL
gptkbp:integratesWith gptkb:JDBC
gptkb:ODBC
gptkb:Apache_Hive
gptkb:Apache_Parquet
gptkb:Apache_ORC
gptkbp:latestReleaseVersion 3.5.1
gptkbp:license gptkb:Apache_License_2.0
gptkbp:operatingSystem Cross-platform
gptkbp:partOf gptkb:Apache_Spark
gptkbp:programmingLanguage gptkb:Java
gptkb:Python
gptkb:Scala
R
gptkbp:releaseYear 2014
gptkbp:supports gptkb:HiveQL
SQL
Datasets
DataFrames
gptkbp:usedFor big data processing
structured data analysis
gptkbp:website https://spark.apache.org/sql/
gptkbp:bfsParent gptkb:Hive
gptkbp:bfsLayer 5