Properties (55)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:API |
gptkbp:developedBy | gptkb:Apache_Software_Foundation |
gptkbp:hasFeature | Subqueries, Joins, Data definition language, User-defined functions, Aggregation functions, Data control language, Data manipulation language, Schema inference, Window functions, Data_source_API |
gptkbp:hasOccupation | Active user community, Contributors, User groups, Documentation and tutorials, Meetups and conferences |
https://www.w3.org/2000/01/rdf-schema#label | Apache Spark SQL |
gptkbp:integratesWith | gptkb:Apache_Hive, gptkb:Apache_Kafka, Apache Parquet, Apache Avro |
gptkbp:isCompatibleWith | gptkb:Hadoop, NoSQL databases, Relational databases |
gptkbp:isPartOf | Apache_Spark_ecosystem |
gptkbp:isPopularIn | Cloud computing, Data analysis, Machine learning, Real-time analytics, Data engineering |
gptkbp:isUsedFor | Graph processing libraries, Streaming libraries, Machine_Learning_libraries |
gptkbp:isUsedIn | Data science, Business intelligence, Data warehousing, Big data analytics, ETL_processes |
gptkbp:language | gptkb:Java, Python, Scala, R |
gptkbp:provides | Batch processing, In-memory computing, Real-time stream processing, Unified Data Processing, Interactive queries |
gptkbp:releaseDate | 2014 |
gptkbp:supports | DataFrames, Datasets, Hive_Query_Language |
gptkbp:uses | gptkb:SQL, gptkb:Catalyst_optimizer, Tungsten execution engine |