Apache Delta Lake

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf open-source storage layer
gptkbp:category gptkb:software
big data
cloud computing
data management
data engineering
gptkbp:compatibleWith gptkb:Snowflake
gptkb:Apache_Flink
gptkb:Apache_Spark
gptkb:Google_BigQuery
gptkb:Presto
gptkb:Trino
gptkb:Hive
gptkb:Amazon_Athena
gptkb:Microsoft_Synapse
gptkbp:developedBy gptkb:Databricks
gptkbp:feature data compaction
data lakehouse architecture
data reliability
data vacuuming
concurrent reads and writes
scalable metadata handling
streaming and batch unification
unified batch and streaming
gptkbp:format Parquet
gptkbp:governedBy gptkb:Linux_Foundation
https://www.w3.org/2000/01/rdf-schema#label Apache Delta Lake
gptkbp:integratesWith gptkb:Amazon_S3
gptkb:Google_Cloud_Storage
gptkb:HDFS
gptkb:Azure_Data_Lake_Storage
gptkbp:latestReleaseVersion 2024-05-15
3.1.0
gptkbp:license gptkb:Apache_License_2.0
gptkbp:openSource 2019
gptkbp:repository https://github.com/delta-io/delta
gptkbp:supports time travel
ACID transactions
data versioning
schema enforcement
gptkbp:usedFor gptkb:machine_learning
big data analytics
ETL pipelines
data lakes
gptkbp:website https://delta.io/
gptkbp:writtenBy gptkb:Java
gptkb:Python
gptkb:Scala
SQL
gptkbp:bfsParent gptkb:Databricks_Lakehouse_Platform
gptkbp:bfsLayer 6