Apache ORC

GPTKB entity

Statements (61)
Predicate Object
gptkbp:instance_of gptkb:cloud_storage
gptkbp:designed_for big data analytics
gptkbp:developed_by gptkb:Apache_Software_Foundation
gptkbp:file_extension .orc
gptkbp:first_released gptkb:2013
gptkbp:has gptkb:Documentation
community support
data types
metadata storage
indexing capabilities
block compression
data block
file footer
https://www.w3.org/2000/01/rdf-schema#label Apache ORC
gptkbp:integrates_with gptkb:Apache_Flink
gptkb:Hadoop
gptkb:Apache_Drill
gptkbp:is gptkb:open-source_software
highly scalable
gptkbp:is_compatible_with gptkb:Apache_Impala
gptkb:Apache_Hive
gptkb:Apache_Spark
gptkbp:is_optimized_for gptkb:Hadoop_ecosystem
gptkbp:provides data integrity
data analysis tools
data retrieval
data serialization
efficient storage
query optimization
schema validation
data locality
predicate pushdown
efficient compression
lightweight compression
fast read performance
row-level compression
gptkbp:supports data encryption
data transformation
data visualization
user-defined types
data aggregation
data lineage
schema evolution
distributed processing
data partitioning
streaming data
columnar storage
complex data types
data skipping
multi-language access
gptkbp:used_by business analysts
data scientists
data engineers
gptkbp:used_for ETL processes
storing large datasets
gptkbp:used_in data lakes
data warehouses
gptkbp:uses Apache Avro for schema
gptkbp:written_in gptkb:Java
gptkbp:bfsParent gptkb:Apache_Arrow
gptkbp:bfsLayer 5