gptkbp:instance_of
|
gptkb:Java_ecosystem
|
gptkbp:api
|
gptkb:API
gptkb:R_API
REST API
Python API
|
gptkbp:designed_for
|
big data
|
gptkbp:developed_by
|
gptkb:Apache_Software_Foundation
|
gptkbp:has_component
|
gptkb:Apache_Ambari
gptkb:Apache_Ranger
gptkb:Apache_Knox
gptkb:Hadoop_Common
gptkb:YARN
gptkb:Apache_Ni_Fi
Hadoop Streaming
|
https://www.w3.org/2000/01/rdf-schema#label
|
Hadoop ecosystem
|
gptkbp:includes
|
gptkb:Apache_Pig
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Kafka
gptkb:Apache_Oozie
gptkb:Apache_Spark
gptkb:Apache_Zookeeper
gptkb:Apache_Mahout
gptkb:Apache_Flume
gptkb:Map_Reduce
gptkb:Hadoop_Distributed_File_System_(HDFS)
gptkb:Apache_Sqoop
|
gptkbp:is_compatible_with
|
cloud platforms
on-premises systems
|
gptkbp:is_documented_in
|
tutorials
community forums
user guides
Apache documentation
|
gptkbp:is_open_source
|
gptkb:true
|
gptkbp:is_popular_among
|
gptkb:developers
IT professionals
business analysts
data scientists
data engineers
|
gptkbp:is_scalable
|
gptkb:true
|
gptkbp:is_supported_by
|
community contributions
commercial vendors
|
gptkbp:is_used_in
|
gptkb:machine_learning
real-time analytics
data mining
data lakes
data warehousing
batch processing
log processing
|
gptkbp:provides
|
gptkb:cloud_storage
data analysis
data processing
|
gptkbp:strategic_goals
|
gptkb:true
|
gptkbp:supports
|
unstructured data
structured data
semi-structured data
|
gptkbp:uses
|
distributed computing
parallel processing
|
gptkbp:written_in
|
gptkb:Java
|
gptkbp:bfsParent
|
gptkb:Joint_Task_Force
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Spark
|
gptkbp:bfsLayer
|
4
|