Statements (42)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:REST_API
|
gptkbp:canReadFrom |
gptkb:ORC
gptkb:JDBC gptkb:JSON gptkb:Hive CSV Parquet |
gptkbp:canWriteTo |
gptkb:ORC
gptkb:JDBC gptkb:JSON gptkb:Hive CSV Parquet |
gptkbp:documentation |
https://spark.apache.org/docs/latest/sql-programming-guide.html#dataframes
|
https://www.w3.org/2000/01/rdf-schema#label |
Spark DataFrame API
|
gptkbp:introducedIn |
Spark 1.3
|
gptkbp:partOf |
gptkb:Apache_Spark
|
gptkbp:provides |
gptkb:transformation
caching filtering grouping lazy evaluation data manipulation window functions schema enforcement serialization aggregation joins data input/output distributed data processing SQL-like operations |
gptkbp:relatedTo |
gptkb:Spark_SQL
gptkb:Spark_Dataset_API Spark RDD API |
gptkbp:supportsLanguage |
gptkb:Java
gptkb:Python gptkb:Scala R |
gptkbp:usedFor |
big data processing
structured data analysis |
gptkbp:bfsParent |
gptkb:Tungsten_execution_engine
|
gptkbp:bfsLayer |
7
|