Spark 3.1

GPTKB entity

Statements (62)
Predicate Object
gptkbp:instance_of gptkb:park
gptkbp:bfsLayer 4
gptkbp:bfsParent gptkb:Matei_Zaharia
gptkbp:developed_by gptkb:software_framework
gptkbp:enhances gptkb:Graph_X
SQL functions
Structured Streaming
MLlib
DataFrame API
gptkbp:has_feature SQL support
Fault tolerance
In-memory computing
Lazy evaluation
Distributed processing
Integration with Hadoop
Integration with JDBC
DataFrame API
Support for Aggregations
Support for Joins
RDD API
Graph processing library
Integration with Avro
Integration with Cassandra
Integration with Hive
Integration with Kafka
Integration with ORC
Integration with Parquet
Machine Learning library
Streaming library
Support for Arrow
Support for DataFrame actions
Support for DataFrame caching
Support for DataFrame persistence
Support for DataFrame transformations
Support for Data Sources API
Support for Delta Lake
Support for Koalas
Support for MLflow
Support for SQL on DataFrames
Support for User-defined functions (UDFs)
Support for Window functions
https://www.w3.org/2000/01/rdf-schema#label Spark 3.1
gptkbp:improves gptkb:benchmark
Kubernetes support
gptkbp:introduced Adaptive Query Execution
Dynamic Partition Pruning
New Pandas API on Spark
gptkbp:is_compatible_with gptkb:Spark_2.x
gptkbp:is_used_in gptkb:Company
gptkb:software_framework
Big Data processing
Stream Processing
gptkbp:latest_version 3.1.0
gptkbp:provides Better error messages
Improved UI
gptkbp:release_date February 2021
gptkbp:supports gptkb:Scala_2.12
gptkb:Java_8
gptkb:Python_3.8
gptkbp:written_in gptkb:Java
gptkb:Library
gptkb:Skrull