Statements (119)
Predicate | Object |
---|---|
gptkbp:instance_of |
gptkb:software
gptkb:Data |
gptkbp:api |
gptkb:API
REST API Python API C++ API |
gptkbp:architectural_style |
Master-slave architecture
Tablet servers Metadata servers |
gptkbp:architecture |
flow-based architecture
|
gptkbp:designed_for |
Big Data Analytics
|
gptkbp:developed_by |
gptkb:Apache_Software_Foundation
|
gptkbp:features |
Scalability
Low latency real-time data processing web-based interface Fault tolerance High throughput data provenance flow-based programming |
gptkbp:has_community |
Open source community
|
gptkbp:has_component |
gptkb:user_interface
data sources configuration management data lineage data integration tools data pipelines data workflows templates connections data provenance tracking data sinks data processors data connectors data orchestration tools flow templates processors reporting tasks controller services data queues data routing rules data transformation rules dataflows flowfiles process groups site-to-site protocol |
gptkbp:has_documentation |
API reference
User guide Developer guide |
gptkbp:has_performance |
High performance
Efficient storage Fast data access Optimized for large datasets Low latency reads and writes |
https://www.w3.org/2000/01/rdf-schema#label |
Apache Ni Fi
|
gptkbp:integrates_with |
gptkb:Apache_Spark
gptkb:Hadoop |
gptkbp:is_available_on |
gptkb:Docker_Hub
gptkb:Git_Hub Cloud platforms Apache Software Foundation website |
gptkbp:is_compatible_with |
gptkb:Apache_Impala
gptkb:Apache_Hive gptkb:Apache_Flink |
gptkbp:is_optimized_for |
Analytical workloads
Transactional workloads |
gptkbp:is_part_of |
Apache Software Foundation projects
|
gptkbp:is_used_by |
Data scientists
Data engineers |
gptkbp:language |
gptkb:C++
|
gptkbp:latest_version |
1.16.0
|
gptkbp:license |
Apache License 2.0
|
gptkbp:operating_system |
cross-platform
|
gptkbp:programming_language |
gptkb:Java
|
gptkbp:provides |
Real-time analytics
Data processing Data storage data enrichment data transformation data visualization monitoring tools data aggregation alerting capabilities Data ingestion data routing |
gptkbp:release_date |
gptkb:2014
2014-02-12 |
gptkbp:repository |
gptkb:Git_Hub
|
gptkbp:supports |
gptkb:SQL
gptkb:SSL/_TLS REST API Data replication user authentication data ingestion data security multi-tenancy user authorization Partitioning backpressure data prioritization Multi-user concurrency clustered deployment flow versioning |
gptkbp:use_case |
Machine learning
Data warehousing Log analytics Io T data processing Real-time analytics applications |
gptkbp:uses |
gptkb:Apache_Kafka
gptkb:Apache_Spark gptkb:Hadoop Columnar storage format dataflow |
gptkbp:written_in |
gptkb:Java
gptkb:C++ |
gptkbp:bfsParent |
gptkb:Apache
gptkb:Apache_Software_Foundation gptkb:Hadoop |
gptkbp:bfsLayer |
4
|