Apache Spark SQL

GPTKB entity

Properties (55)
Predicate Object
gptkbp:instanceOf gptkb:API
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:hasFeature Subqueries
Joins
Data definition language
User-defined functions
Aggregation functions
Data control language
Data manipulation language
Schema inference
Window functions
Data source API
gptkbp:hasOccupation Active user community
Contributors
User groups
Documentation and tutorials
Meetups and conferences
https://www.w3.org/2000/01/rdf-schema#label Apache Spark SQL
gptkbp:integratesWith gptkb:Apache_Hive
gptkb:Apache_Kafka
Apache Parquet
Apache Avro
gptkbp:isCompatibleWith gptkb:Hadoop
NoSQL databases
Relational databases
gptkbp:isPartOf Apache Spark ecosystem
gptkbp:isPopularIn Cloud computing
Data analysis
Machine learning
Real-time analytics
Data engineering
gptkbp:isUsedFor Graph processing libraries
Streaming libraries
Machine learning libraries
gptkbp:isUsedIn Data science
Business intelligence
Data warehousing
Big data analytics
ETL processes
gptkbp:language gptkb:Java
Python
Scala
R
gptkbp:provides Batch processing
In-memory computing
Real-time stream processing
Unified data processing
Interactive queries
gptkbp:releaseDate 2014
gptkbp:supports DataFrames
Datasets
Hive Query Language
gptkbp:uses gptkb:SQL
gptkb:Catalyst_optimizer
Tungsten execution engine