DistCp

GPTKB entity

Statements (29)
Predicate Object
gptkbp:instanceOf gptkb:software
gptkbp:commandLineTool true
gptkbp:developedBy Apache Hadoop project
gptkbp:documentation https://hadoop.apache.org/docs/current/hadoop-distcp/DistCp.html
gptkbp:feature fault tolerance
gptkbp:feature data filtering
gptkbp:feature customizable copy options
gptkbp:feature incremental copy
gptkbp:feature parallel copying
gptkbp:feature preserves file attributes
gptkbp:feature resumable copy
gptkbp:fullName Distributed Copy
https://www.w3.org/2000/01/rdf-schema#label DistCp
gptkbp:maintainedBy gptkb:Apache_Software_Foundation
gptkbp:openSource true
gptkbp:purpose large inter/intra-cluster copying
gptkbp:runsOn gptkb:Hadoop_MapReduce
gptkbp:supports gptkb:Google_Cloud_Storage
gptkbp:supports gptkb:Azure_Blob_Storage
gptkbp:supports gptkb:HDFS
gptkbp:supports cloud storage
gptkbp:supports S3
gptkbp:usedFor backup of HDFS data
gptkbp:usedFor copying large amounts of data
gptkbp:usedFor replicating data between Hadoop clusters
gptkbp:writtenBy gptkb:Java
gptkbp:bfsParent gptkb:Apache_Oozie
gptkbp:bfsParent gptkb:Oozie
gptkbp:bfsLayer 6
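
Example (illustrative sketch, not part of the entity data). Several of the features listed above (incremental copy, preserved file attributes, parallel copying, data filtering) correspond to options of the `hadoop distcp` command line: `-update`, `-p`, `-m`, and `-filters`. The Python helper below only assembles an argument list and does not run anything. The function name `build_distcp_command`, its parameters, and the NameNode addresses and paths are hypothetical placeholders chosen for illustration.

```python
from typing import List, Optional


def build_distcp_command(
    source: str,
    target: str,
    update: bool = False,
    preserve: Optional[str] = None,
    max_maps: Optional[int] = None,
    filters_file: Optional[str] = None,
) -> List[str]:
    """Assemble an argv list for `hadoop distcp` without executing it."""
    argv = ["hadoop", "distcp"]
    if update:
        # Incremental copy: only copy files that are missing or differ at the target.
        argv.append("-update")
    if preserve is not None:
        # Preserve file attributes, e.g. "ugp" for user, group, permission.
        argv.append("-p" + preserve)
    if max_maps is not None:
        # Upper bound on the number of simultaneous map tasks (parallel copies).
        argv += ["-m", str(max_maps)]
    if filters_file is not None:
        # File containing regex patterns for paths to exclude from the copy.
        argv += ["-filters", filters_file]
    # Source and target URIs come last.
    argv += [source, target]
    return argv


if __name__ == "__main__":
    # Hypothetical cluster addresses and paths, for illustration only.
    cmd = build_distcp_command(
        "hdfs://nn1:8020/data/logs",
        "hdfs://nn2:8020/backup/logs",
        update=True,
        preserve="ugp",
        max_maps=20,
    )
    print(" ".join(cmd))
```

On a machine with a configured Hadoop client, the resulting list could be passed to `subprocess.run(cmd, check=True)`. Building the list separately makes it possible to inspect the command before launching a MapReduce job.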