gptkbp:instanceOf
|
open-source data management framework
|
gptkbp:alternativeTo
|
gptkb:Apache_Iceberg
gptkb:Delta_Lake
|
gptkbp:category
|
analysis
Cloud Computing
Big Data
Data Management
Data Lake
Open Source Software
Data Engineering
|
gptkbp:compatibleWith
|
gptkb:Apache_Flink
gptkb:Apache_Hive
gptkb:Apache_Spark
gptkb:Presto
gptkb:Trino
gptkb:Amazon_Athena
|
gptkbp:developedBy
|
gptkb:Apache_Software_Foundation
|
https://www.w3.org/2000/01/rdf-schema#label
|
Apache Hudi
|
gptkbp:integratesWith
|
gptkb:AWS_Glue
gptkb:Amazon_S3
gptkb:Google_Cloud_Storage
gptkb:Databricks
gptkb:Apache_Iceberg
gptkb:HDFS
gptkb:Azure_Data_Lake_Storage
gptkb:Delta_Lake
|
gptkbp:latestReleaseVersion
|
0.14.0
|
gptkbp:license
|
gptkb:Apache_License_2.0
|
gptkbp:officialWebsite
|
https://hudi.apache.org/
|
gptkbp:releaseDate
|
2017
|
gptkbp:repository
|
https://github.com/apache/hudi
|
gptkbp:supports
|
streaming data ingestion
batch data ingestion
incremental data processing
|
gptkbp:supportsFormat
|
gptkb:Avro
gptkb:ORC
Parquet
|
gptkbp:usedFor
|
data migration
data ingestion
data versioning
data compaction
data deletes
data lake management
data lakehouse architecture
data replication
data upserts
data clustering
data rollback
managing large analytical datasets
|
gptkbp:writtenBy
|
gptkb:Java
gptkb:Scala
|
gptkbp:bfsParent
|
gptkb:Hadoop
gptkb:Apache_Software_Foundation_projects
|
gptkbp:bfsLayer
|
6
|