gptkbp:instanceOf
|
software framework
|
gptkbp:architect
|
master/slave
|
gptkbp:availableFormats
|
gptkb:ORC
text
Parquet
binary
Avro
sequence
|
gptkbp:community
|
conferences
meetups
open source
user groups
active contributors
|
gptkbp:component
|
gptkb:Hadoop_YARN
gptkb:Hadoop_MapReduce
gptkb:Hadoop_Distributed_File_System_(HDFS)
Hadoop Common
|
gptkbp:deployedTo
|
on-premises
cloud
hybrid
|
gptkbp:developedBy
|
gptkb:Apache_Software_Foundation
|
gptkbp:ecosystem
|
gptkb:Apache_Pig
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Kafka
gptkb:Apache_Oozie
gptkb:Apache_Storm
gptkb:Apache_ZooKeeper
gptkb:Apache_Spark
gptkb:Apache_Mahout
gptkb:Apache_Flume
Apache Sqoop
|
gptkbp:hasVersion
|
3.3.1
|
https://www.w3.org/2000/01/rdf-schema#label
|
Apache Hadoop
|
gptkbp:license
|
Apache License 2.0
|
gptkbp:performance
|
low latency
high throughput
stream processing
batch processing
|
gptkbp:provides
|
distributed processing
distributed storage
|
gptkbp:publishedIn
|
gptkb:Java
|
gptkbp:relatedPatent
|
machine learning
data mining
data warehousing
data archiving
log processing
|
gptkbp:releasedIn
|
April 1, 2006
|
gptkbp:security
|
access control
audit logging
data encryption
Kerberos_authentication
|
gptkbp:supports
|
scalability
big data applications
fault_tolerance
|
gptkbp:usedBy
|
gptkb:Yahoo
Facebook
LinkedIn
Netflix
Twitter
|
gptkbp:uses
|
MapReduce
HDFS
data locality
|