Spark 3.8

GPTKB entity

Statements (74)
Predicate Object
gptkbp:instance_of gptkb:park
gptkbp:bfsLayer 4
gptkbp:bfsParent gptkb:Matei_Zaharia
gptkbp:category gptkb:software_framework
gptkb:Big_Data
Data Processing
Stream Processing
gptkbp:dependency gptkb:Scala_2.12
Java 8+
Hadoop 3.2+
gptkbp:developer gptkb:software_framework
gptkbp:features Improved performance
Support for Kubernetes
Better integration with Delta Lake
Enhanced machine learning library
Improved UI
New SQL functions
Support for Python 3.10
https://www.w3.org/2000/01/rdf-schema#label Spark 3.8
gptkbp:language gptkb:Java
gptkb:R
gptkb:Library
gptkb:Skrull
gptkbp:license Apache License 2.0
gptkbp:notable_feature Enhanced security features
Improved error messages
Performance optimizations
Improved resource management
Support for data warehousing
Improved documentation
Support for user-defined functions
Support for complex data types
Support for data analytics
Support for new file formats
Support for multi-tenancy
Support for data archiving
Support for data lineage
Support for data quality checks
Support for data transformation
Support for machine learning pipelines
Support for real-time analytics
Support for data sharing
Improved caching mechanisms
Support for new data sources
Improved support for data governance
Improved support for data science
Improved support for data visualization
Support for data compliance
Support for data engineering
Adaptive Query Execution
Columnar Data Processing
Dynamic Partition Pruning
Improved Data Frame API
Improved integration with BI tools
Improved performance for joins
Improved support for data collaboration
Improved support for data exploration
Improved support for data integration
Improved support for data lakes
Improved support for data management
Improved support for data migration
Improved support for data standards
Improved support for graph processing
Improved support for window functions
New M Llib algorithms
Support for Arrow
Support for SQL on streaming data
Support for batch and streaming workloads
Support for custom aggregations
Support for data governance frameworks
gptkbp:predecessor Spark 3.6
gptkbp:release_date 2023-08-01
gptkbp:successor Spark 3.7
gptkbp:website https://spark.apache.org/