Statements (58)
Predicate | Object |
---|---|
gptkbp:instanceOf | data storage format |
gptkbp:availableFormats | .orc |
gptkbp:compatibleWith | gptkb:Apache_Hive, gptkb:Apache_Spark |
gptkbp:designedFor | large datasets, high performance |
gptkbp:developedBy | gptkb:Apache_Software_Foundation |
gptkbp:enables | fast data processing |
gptkbp:features | predicate pushdown, lightweight compression, multi-level indexing, support for complex data types |
gptkbp:firstPublished | 2013 |
gptkbp:hasVersion | 1.7.0 |
https://www.w3.org/2000/01/rdf-schema#label | Apache ORC |
gptkbp:provides | data integrity, data interoperability, low latency, data accessibility, data serialization format, high throughput, metadata storage, data retrieval efficiency, efficient compression, data skipping, schema evolution capabilities |
gptkbp:publishedIn | gptkb:Java, gptkb:C++ |
gptkbp:supports | gptkb:Hadoop_ecosystem, data governance, data serialization, data transformation, user-defined types, compression algorithms, data compression, data lineage, schema evolution, data partitioning, data archiving, nested data structures, data analytics frameworks, ACID_transactions |
gptkbp:usedBy | data lakes, data warehouses |
gptkbp:usedFor | gptkb:Apache_Flink, gptkb:Apache_Drill, gptkb:Hadoop_MapReduce, Presto, columnar storage |
gptkbp:usedIn | machine learning, business intelligence, data migration, data visualization, real-time analytics, data science, big data analytics, cloud storage solutions, ETL_processes |
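
Two of the features listed above, predicate pushdown and data skipping, can be illustrated with a toy model. The sketch below is not the ORC file format or any ORC library API. It assumes only the general idea that a columnar file keeps min/max statistics per block of rows, so a reader can skip blocks that cannot match a filter. The names `Stripe`, `build_stripes`, and `scan_greater_than` are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Stripe:
    """Toy stand-in for a block of column values plus its min/max statistics."""
    values: List[int]
    min_value: int
    max_value: int


def build_stripes(values: List[int], stripe_size: int) -> List[Stripe]:
    """Split values into fixed-size blocks and record min/max for each block."""
    stripes = []
    for start in range(0, len(values), stripe_size):
        chunk = values[start:start + stripe_size]
        stripes.append(Stripe(chunk, min(chunk), max(chunk)))
    return stripes


def scan_greater_than(stripes: List[Stripe], threshold: int) -> Tuple[List[int], int]:
    """Return the matching values and the number of blocks actually read."""
    matches: List[int] = []
    stripes_read = 0
    for stripe in stripes:
        # Data skipping: if the block's maximum cannot satisfy the predicate,
        # none of its values can, so the block is never scanned.
        if stripe.max_value <= threshold:
            continue
        stripes_read += 1
        matches.extend(v for v in stripe.values if v > threshold)
    return matches, stripes_read


stripes = build_stripes(list(range(100)), stripe_size=10)
matches, stripes_read = scan_greater_than(stripes, threshold=84)
print(len(matches), "matches from", stripes_read, "of", len(stripes), "stripes")
# prints: 15 matches from 2 of 10 stripes
```

In this toy run the filter `value > 84` is checked against the statistics first, so eight of the ten blocks are skipped without reading their values. Sorted or clustered data makes such statistics far more selective than randomly ordered data.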