Hadoop 2.x

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf gptkb:Company
gptkbp:architect Master-slave architecture
gptkbp:bornIn Widely adopted in industry
gptkbp:component gptkb:Hadoop_YARN
gptkb:Hadoop_MapReduce
gptkb:Hadoop_Distributed_File_System_(HDFS)
Hadoop Common
gptkbp:developer gptkb:Apache_Software_Foundation
gptkbp:ecosystem gptkb:Apache_Pig
gptkb:Apache_Ambari
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Apache_Oozie
gptkb:Apache_ZooKeeper
gptkb:Apache_Knox
gptkb:Apache_Spark
gptkb:Apache_Mahout
gptkb:Apache_Flume
Apache Sqoop
gptkbp:features YARN
HDFS improvements
MapReduce v2
gptkbp:firstPublished 2013-07-28
2.10.1
gptkbp:hasVersion 2.0.0
https://www.w3.org/2000/01/rdf-schema#label Hadoop 2.x
gptkbp:language gptkb:Java
gptkbp:license Apache License 2.0
gptkbp:publishedIn gptkb:Java
gptkbp:relatedPatent Data analysis
Machine learning
Real-time analytics
Business intelligence
Data integration
Data warehousing
Data archiving
Log processing
Data lake
ETL_processes
gptkbp:releaseDate July 2013
gptkbp:successor gptkb:Hadoop_3.x
gptkbp:supports Scalability
Resource management
Fault tolerance
Multi-tenancy
Data locality
Job scheduling
gptkbp:uses Big data processing
Distributed storage
Commodity hardware
gptkbp:website https://hadoop.apache.org/