gptkbp:instance_of
|
gptkb:cloud_storage
|
gptkbp:built
|
gptkb:Apache_Spark
|
gptkbp:can_be_used_with
|
gptkb:AWS_Glue
gptkb:Azure_Databricks
|
gptkbp:developed_by
|
gptkb:Databricks
|
gptkbp:enables
|
data sharing
time travel
|
https://www.w3.org/2000/01/rdf-schema#label
|
Delta Lake
|
gptkbp:integrates_with
|
gptkb:Apache_Spark
|
gptkbp:is_accessible_by
|
gptkb:Apache_Hive
|
gptkbp:is_adopted_by
|
enterprises
|
gptkbp:is_available_on
|
gptkb:Git_Hub
|
gptkbp:is_compatible_with
|
gptkb:Jupyter_Notebooks
gptkb:SQL
data formats
data lakes
|
gptkbp:is_designed_for
|
cloud environments
|
gptkbp:is_documented_in
|
official documentation
|
gptkbp:is_effective_against
|
data processing
|
gptkbp:is_integrated_with
|
BI tools
|
gptkbp:is_open_source
|
gptkb:true
|
gptkbp:is_optimized_for
|
big data analytics
data lakes
|
gptkbp:is_part_of
|
data ecosystem
data architecture
open-source ecosystem
Lakehouse architecture
|
gptkbp:is_scalable
|
gptkb:true
|
gptkbp:is_supported_by
|
gptkb:Databricks_Runtime
community contributions
data scientists
data engineers
|
gptkbp:is_used_for
|
data quality
data integration
data transformation
real-time analytics
data warehousing
data storage solutions
|
gptkbp:is_used_in
|
gptkb:machine_learning
ETL processes
data analytics
data visualization
data science
data engineering
|
gptkbp:language
|
gptkb:Scala
|
gptkbp:offers
|
data versioning
|
gptkbp:provides
|
data lineage
data reliability
schema enforcement
|
gptkbp:release_date
|
gptkb:2019
|
gptkbp:supports
|
ACID transactions
data governance
multi-version concurrency control
data pipelines
batch and streaming data
|
gptkbp:uses
|
Parquet format
|
gptkbp:bfsParent
|
gptkb:Data
|
gptkbp:bfsLayer
|
3
|