Dataflow Shuffle

GPTKB entity

Statements (15)
Predicate Object
gptkbp:instanceOf data processing technique
gptkbp:alternativeTo Batch Shuffle
gptkbp:alternativeTo Streaming Shuffle
gptkbp:enables parallel processing
https://www.w3.org/2000/01/rdf-schema#label Dataflow Shuffle
gptkbp:improves scalability
gptkbp:improves fault tolerance
gptkbp:purpose redistribute data between workers
gptkbp:relatedTo gptkb:Apache_Beam
gptkbp:usedFor group by operations
gptkbp:usedFor aggregation operations
gptkbp:usedFor join operations
gptkbp:usedIn gptkb:Google_Cloud_Dataflow
gptkbp:bfsParent gptkb:Dataflow
gptkbp:bfsLayer 6
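
Illustrative sketch (not part of the GPTKB statements)

The stated purpose, redistributing data between workers so that group by, aggregation, and join operations can run in parallel, can be illustrated with a minimal in-memory sketch. This is not how Google Cloud Dataflow or Apache Beam implement shuffle. The function names (partition_for, shuffle, group_by_key, shuffled_group_by) and the choice of CRC32 hash partitioning are assumptions made purely for illustration.

```python
import zlib
from collections import defaultdict


def partition_for(key: str, num_workers: int) -> int:
    """Map a key to a destination worker with a stable hash (assumed scheme)."""
    return zlib.crc32(key.encode("utf-8")) % num_workers


def shuffle(worker_outputs, num_workers):
    """Redistribute (key, value) records so each key lands on exactly one worker."""
    inboxes = [[] for _ in range(num_workers)]
    for records in worker_outputs:
        for key, value in records:
            inboxes[partition_for(key, num_workers)].append((key, value))
    return inboxes


def group_by_key(inbox):
    """Group the values that one destination worker received, per key."""
    groups = defaultdict(list)
    for key, value in inbox:
        groups[key].append(value)
    return dict(groups)


def shuffled_group_by(worker_outputs, num_workers):
    """Shuffle first, then let every destination worker group its own records."""
    return [group_by_key(inbox) for inbox in shuffle(worker_outputs, num_workers)]


if __name__ == "__main__":
    # Two hypothetical source workers, each holding a slice of the input.
    outputs = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)]]
    for worker_id, groups in enumerate(shuffled_group_by(outputs, 3)):
        print(worker_id, groups)
```

Because every record with the same key is routed to the same destination, each worker can group or aggregate its keys without consulting the others. That independence is what makes the group by step parallel.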