Delta Lake

GPTKB entity

Properties (56)
Predicate Object
gptkbp:instanceOf open-source storage layer
gptkbp:developedBy Databricks
gptkbp:enables batch and streaming data processing
https://www.w3.org/2000/01/rdf-schema#label Delta Lake
gptkbp:integratesWith gptkb:Apache_Spark
gptkbp:isAttendedBy enterprises
gptkbp:isAvailableIn gptkb:AWS
gptkb:Google_Cloud_Platform
Azure
gptkbp:isCompatibleWith gptkb:SQL
gptkb:Apache_Hive
data science tools
gptkbp:isConsidered data management solution
gptkbp:isDesignedFor data lakes
gptkbp:isDocumentedIn official documentation
gptkbp:isEnhancedBy optimizations
gptkbp:isEvaluatedBy analysts
gptkbp:isIntegratedWith gptkb:Apache_Airflow
gptkb:Apache_Kafka
MLflow
gptkbp:isOpenTo true
gptkbp:isOptimizedFor cloud storage
data lakes
gptkbp:isPartOf data ecosystem
data architecture
Lakehouse architecture
gptkbp:isPromotedBy cloud providers
gptkbp:isRated Apache Parquet
gptkbp:isRecognizedFor open-source project
gptkbp:isSupportedBy community contributions
user community
Databricks Runtime
gptkbp:isTestedFor unit tests
gptkbp:isUsedBy data engineers
gptkbp:isUsedFor data integration
real-time analytics
data warehousing
gptkbp:isUsedIn machine learning
big data analytics
ETL processes
gptkbp:isUtilizedIn data pipelines
gptkbp:language Scala
gptkbp:offers data versioning
gptkbp:provides data lineage
time travel
data reliability
data caching
schema enforcement
gptkbp:releaseDate 2019
gptkbp:supports data governance
data sharing
multi-version concurrency control
data lakes
streaming analytics
ACID transactions
gptkbp:uses Parquet format