Apache Hadoop

GPTKB entity

Properties (63)
Predicate Object
gptkbp:instanceOf software framework
gptkbp:architect master/slave
gptkbp:availableFormats gptkb:ORC
text
Parquet
binary
Avro
sequence
gptkbp:community conferences
meetups
open source
user groups
active contributors
gptkbp:component gptkb:Hadoop_YARN
gptkb:Hadoop_MapReduce
gptkb:Hadoop_Distributed_File_System_(HDFS)
Hadoop Common
gptkbp:deployedTo on-premises
cloud
hybrid
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:ecosystem gptkb:Apache_Pig
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Kafka
gptkb:Apache_Oozie
gptkb:Apache_Storm
gptkb:Apache_ZooKeeper
gptkb:Apache_Spark
gptkb:Apache_Mahout
gptkb:Apache_Flume
Apache Sqoop
gptkbp:hasVersion 3.3.1
https://www.w3.org/2000/01/rdf-schema#label Apache Hadoop
gptkbp:license Apache License 2.0
gptkbp:performance low latency
high throughput
stream processing
batch processing
gptkbp:provides distributed processing
distributed storage
gptkbp:publishedIn gptkb:Java
gptkbp:relatedPatent machine learning
data mining
data warehousing
data archiving
log processing
gptkbp:releasedIn April 1, 2006
gptkbp:security access control
audit logging
data encryption
Kerberos_authentication
gptkbp:supports scalability
big data applications
fault_tolerance
gptkbp:usedBy gptkb:Yahoo
Facebook
LinkedIn
Netflix
Twitter
gptkbp:uses MapReduce
HDFS
data locality