Hadoop ecosystem

GPTKB entity

Statements (64)
Predicate Object
gptkbp:instance_of gptkb:Java_ecosystem
gptkbp:api gptkb:API
gptkb:R_API
REST API
Python API
gptkbp:designed_for big data
gptkbp:developed_by gptkb:Apache_Software_Foundation
gptkbp:has_component gptkb:Apache_Ambari
gptkb:Apache_Ranger
gptkb:Apache_Knox
gptkb:Hadoop_Common
gptkb:YARN
gptkb:Apache_Ni_Fi
Hadoop Streaming
https://www.w3.org/2000/01/rdf-schema#label Hadoop ecosystem
gptkbp:includes gptkb:Apache_Pig
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Kafka
gptkb:Apache_Oozie
gptkb:Apache_Spark
gptkb:Apache_Zookeeper
gptkb:Apache_Mahout
gptkb:Apache_Flume
gptkb:Map_Reduce
gptkb:Hadoop_Distributed_File_System_(HDFS)
gptkb:Apache_Sqoop
gptkbp:is_compatible_with cloud platforms
on-premises systems
gptkbp:is_documented_in tutorials
community forums
user guides
Apache documentation
gptkbp:is_open_source gptkb:true
gptkbp:is_popular_among gptkb:developers
IT professionals
business analysts
data scientists
data engineers
gptkbp:is_scalable gptkb:true
gptkbp:is_supported_by community contributions
commercial vendors
gptkbp:is_used_in gptkb:machine_learning
real-time analytics
data mining
data lakes
data warehousing
batch processing
log processing
gptkbp:provides gptkb:cloud_storage
data analysis
data processing
gptkbp:strategic_goals gptkb:true
gptkbp:supports unstructured data
structured data
semi-structured data
gptkbp:uses distributed computing
parallel processing
gptkbp:written_in gptkb:Java
gptkbp:bfsParent gptkb:Joint_Task_Force
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Spark
gptkbp:bfsLayer 4