Parquet format

GPTKB entity

Statements (41)
Predicate Object
gptkbp:instanceOf columnar storage file format
gptkbp:basedOn gptkb:Google_Dremel_paper
gptkbp:compatibleWith gptkb:Apache_Hadoop
gptkb:Apache_Hive
gptkb:Apache_Impala
gptkb:Apache_Spark
gptkb:Google_BigQuery
gptkb:Presto
gptkb:Amazon_Athena
gptkb:Apache_Drill
gptkb:Microsoft_Azure_Data_Lake
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:fileExtension .parquet
gptkbp:firstReleased 2013
https://www.w3.org/2000/01/rdf-schema#label Parquet format
gptkbp:openSource true
gptkbp:standardizedBy gptkb:Apache_Incubator
gptkbp:supports compression
nested data structures
schema evolution
dictionary encoding
run-length encoding
complex data types
gzip compression
predicate pushdown
splitting
bit packing
brotli compression
column pruning
lz4 compression
row group
snappy compression
zstd compression
gptkbp:usedFor data storage
data interchange
gptkbp:usedIn big data processing
gptkbp:website https://parquet.apache.org/
gptkbp:writtenBy gptkb:Java
gptkb:C++
gptkbp:bfsParent gptkb:RCFile_format
gptkbp:bfsLayer 7