Hadoop Distributed File System (HDFS)

GPTKB entity

Statements (52)
Predicate Object
gptkbp:instanceOf file format
gptkbp:accessibleBy gptkb:Hadoop_MapReduce
gptkb:Apache_Hive
gptkb:Apache_Pig
gptkbp:alternativeTo gptkb:Amazon_S3
gptkb:Google_Cloud_Storage
gptkb:Azure_Data_Lake_Storage
gptkbp:component gptkb:DataNode
gptkb:NameNode
gptkbp:DataNodeRole data storage
gptkbp:defaultBlockSize 128 MB
gptkbp:defaultReplicationFactor 3
gptkbp:designedFor large-scale data storage
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:faultToleranceBy data replication
gptkbp:firstReleased 2006
https://www.w3.org/2000/01/rdf-schema#label Hadoop Distributed File System (HDFS)
gptkbp:inspiredBy gptkb:Google_File_System_(GFS)
gptkbp:license gptkb:Apache_License_2.0
gptkbp:NameNodeRole metadata management
gptkbp:numberOfLocations large files
gptkbp:openSource true
gptkbp:partOf gptkb:Apache_Hadoop
gptkbp:storesDataAs blocks
gptkbp:supports gptkb:government
gptkb:Kerberos_authentication
access control lists (ACLs)
fault tolerance
replication
high availability
encryption at rest
high throughput
snapshots
horizontal scaling
commodity hardware
POSIX-like permissions
rack awareness
streaming data access
gptkbp:supportsProtocol HDFS protocol
gptkbp:usedBy gptkb:Facebook
gptkb:LinkedIn
gptkb:Twitter
gptkb:Yahoo!
gptkbp:usedFor big data analytics
gptkbp:usedIn cloud environments
on-premises clusters
gptkbp:uses master-slave architecture
gptkbp:website https://hadoop.apache.org/
gptkbp:writtenBy gptkb:Java
gptkbp:bfsParent gptkb:Distributed_File_System_(DFS)
gptkb:Microsoft_Azure_Data_Lake
gptkbp:bfsLayer 7