Statements (61)
Predicate | Object |
---|---|
gptkbp:instance_of | gptkb:cloud_storage |
gptkbp:designed_for | big data analytics |
gptkbp:developed_by | gptkb:Apache_Software_Foundation |
gptkbp:file_extension | .orc |
gptkbp:first_released | gptkb:2013 |
gptkbp:has | gptkb:Documentation, community support, data types, metadata storage, indexing capabilities, block compression, data block, file footer |
https://www.w3.org/2000/01/rdf-schema#label | Apache ORC |
gptkbp:integrates_with | gptkb:Apache_Flink, gptkb:Hadoop, gptkb:Apache_Drill |
gptkbp:is | gptkb:open-source_software, highly scalable |
gptkbp:is_compatible_with | gptkb:Apache_Impala, gptkb:Apache_Hive, gptkb:Apache_Spark |
gptkbp:is_optimized_for | gptkb:Hadoop_ecosystem |
gptkbp:provides | data integrity, data analysis tools, data retrieval, data serialization, efficient storage, query optimization, schema validation, data locality, predicate pushdown, efficient compression, lightweight compression, fast read performance, row-level compression |
gptkbp:supports | data encryption, data transformation, data visualization, user-defined types, data aggregation, data lineage, schema evolution, distributed processing, data partitioning, streaming data, columnar storage, complex data types, data skipping, multi-language access |
gptkbp:used_by | business analysts, data scientists, data engineers |
gptkbp:used_for | ETL processes, storing large datasets |
gptkbp:used_in | data lakes, data warehouses |
gptkbp:uses | Apache Avro for schema |
gptkbp:written_in | gptkb:Java |
gptkbp:bfsParent | gptkb:Apache_Arrow |
gptkbp:bfsLayer | 5 |
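
The statements above list columnar storage, data skipping, and predicate pushdown without showing how they fit together. The following is a minimal conceptual sketch in plain Python, not ORC's actual file layout or any ORC library API. The names `Stripe`, `build_stripes`, and `scan` are hypothetical and exist only for this illustration. It assumes that rows are grouped into fixed-size stripes, that each stripe stores its values column by column, and that each stripe keeps min/max statistics per column so a range predicate can rule out whole stripes before any values are read.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Stripe:
    """Hypothetical stand-in for a stripe: column values plus min/max stats."""
    columns: Dict[str, List[int]]
    stats: Dict[str, Tuple[int, int]]


def build_stripes(rows: List[Dict[str, int]], stripe_size: int) -> List[Stripe]:
    """Pivot row dicts into column lists, one Stripe per `stripe_size` rows."""
    stripes = []
    for start in range(0, len(rows), stripe_size):
        chunk = rows[start:start + stripe_size]
        # Columnar layout: one list of values per column name.
        columns = {name: [row[name] for row in chunk] for name in chunk[0]}
        # Per-column min/max statistics, computed once at write time.
        stats = {name: (min(vals), max(vals)) for name, vals in columns.items()}
        stripes.append(Stripe(columns, stats))
    return stripes


def scan(stripes: List[Stripe], column: str, low: int, high: int,
         project: str) -> Tuple[List[int], int]:
    """Return (projected values where low <= column <= high, stripes read)."""
    out, stripes_read = [], 0
    for stripe in stripes:
        col_min, col_max = stripe.stats[column]
        # Data skipping: the statistics prove no row in this stripe can match.
        if col_max < low or col_min > high:
            continue
        stripes_read += 1
        # Only the filter column and the projected column are touched.
        for i, value in enumerate(stripe.columns[column]):
            if low <= value <= high:
                out.append(stripe.columns[project][i])
    return out, stripes_read


rows = [{"id": i, "amount": i * 10} for i in range(1, 10)]  # ids 1..9
stripes = build_stripes(rows, stripe_size=3)
values, stripes_read = scan(stripes, "id", low=4, high=5, project="amount")
print(values, stripes_read)  # [40, 50] 1
```

In this sketch, only the middle stripe is read, because the statistics of the first and last stripes exclude the range 4 to 5. A real reader would apply the same idea to statistics stored in the file itself, and this toy model ignores compression, encodings, and nested types entirely.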