Statements (42)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:REST_API |
| gptkbp:canReadFrom | gptkb:ORC, gptkb:JDBC, gptkb:JSON, gptkb:Hive, CSV, Parquet |
| gptkbp:canWriteTo | gptkb:ORC, gptkb:JDBC, gptkb:JSON, gptkb:Hive, CSV, Parquet |
| gptkbp:documentation | https://spark.apache.org/docs/latest/sql-programming-guide.html#dataframes |
| gptkbp:introducedIn | Spark 1.3 |
| gptkbp:partOf | gptkb:Apache_Spark |
| gptkbp:provides | gptkb:transformation, caching, filtering, grouping, lazy evaluation, data manipulation, window functions, schema enforcement, serialization, aggregation, joins, data input/output, distributed data processing, SQL-like operations |
| gptkbp:relatedTo | gptkb:Spark_SQL, gptkb:Spark_Dataset_API, Spark RDD API |
| gptkbp:supportsLanguage | gptkb:Java, gptkb:Python, gptkb:Scala, R |
| gptkbp:usedFor | big data processing, structured data analysis |
| gptkbp:bfsParent | gptkb:Tungsten_execution_engine |
| gptkbp:bfsLayer | 8 |
| https://www.w3.org/2000/01/rdf-schema#label | Spark DataFrame API |
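
The `gptkbp:provides` entries include lazy evaluation alongside filtering, grouping, and aggregation. As a rough illustration of what lazy evaluation means, the pure-Python sketch below records transformations and runs them only when an action is called. `LazyFrame`, `group_sum`, and the sample `sales` rows are hypothetical names invented for this illustration; this is a conceptual model, not Spark's API, and it says nothing about how Spark plans or distributes work.

```python
from collections import defaultdict


class LazyFrame:
    """Toy stand-in for a DataFrame (hypothetical, not Spark's API).

    Rows are plain dicts; transformations are recorded and deferred.
    """

    def __init__(self, rows, plan=()):
        self._rows = rows   # source data, never mutated
        self._plan = plan   # tuple of pending steps: list of rows -> list of rows

    def filter(self, predicate):
        # Transformation: only recorded here, nothing is computed yet.
        def step(rows):
            return [r for r in rows if predicate(r)]
        return LazyFrame(self._rows, self._plan + (step,))

    def group_sum(self, key, value):
        # Transformation: group by `key` and sum `value`, also deferred.
        def step(rows):
            totals = defaultdict(int)
            for r in rows:
                totals[r[key]] += r[value]
            return [{key: k, value: v} for k, v in sorted(totals.items())]
        return LazyFrame(self._rows, self._plan + (step,))

    def collect(self):
        # Action: runs every recorded step in order and returns the rows.
        rows = list(self._rows)
        for step in self._plan:
            rows = step(rows)
        return rows


calls = []  # records each time the predicate actually runs


def is_large(row):
    calls.append(row["amount"])
    return row["amount"] >= 10


sales = LazyFrame([
    {"region": "east", "amount": 5},
    {"region": "east", "amount": 20},
    {"region": "west", "amount": 15},
    {"region": "west", "amount": 10},
])

pipeline = sales.filter(is_large).group_sum("region", "amount")
evaluated_before_action = len(calls)  # predicate has not run at this point
result = pipeline.collect()           # the action triggers execution
```

Building `pipeline` does not invoke `is_large`; the predicate runs only inside `collect()`. In Spark the analogous split is between transformations, which describe a computation, and actions, which trigger it.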