Spark Streaming

GPTKB entity

Statements (56)
Predicate Object
gptkbp:instance_of gptkb:streaming_service
gptkbp:can batch data
live data streams
gptkbp:developed_by gptkb:Apache_Software_Foundation
gptkbp:has_feature checkpointing
dynamic allocation
backpressure handling
output operations
custom receivers
streaming SQL
streaming aggregations
streaming joins
streaming window operations
https://www.w3.org/2000/01/rdf-schema#label Spark Streaming
gptkbp:integrates_with gptkb:Amazon_S3
gptkb:Cassandra
gptkb:Kafka
gptkb:Flume
gptkb:HDFS
gptkb:HBase
gptkbp:is_available_on gptkb:2014
gptkbp:is_compatible_with gptkb:Apache_Pig
gptkb:SQL
gptkb:Apache_Hive
gptkbp:is_part_of big data ecosystem
gptkbp:is_used_by gptkb:developers
data scientists
data engineers
gptkbp:is_used_in gptkb:machine_learning
data ingestion
data integration
real-time analytics
log processing
gptkbp:language gptkb:Java
gptkb:Python
gptkb:Scala
gptkbp:provides fault tolerance
real-time data processing
windowed computations
stateful processing
streaming analytics
gptkbp:runs_through gptkb:Kubernetes
gptkb:YARN
Mesos
gptkbp:supports micro-batch processing
watermarking
event time processing
gptkbp:uses gptkb:Apache_Spark
DStream API
Structured Streaming
gptkbp:written_in gptkb:Java
gptkb:Python
gptkb:Scala
gptkbp:bfsParent gptkb:Apache_Spark
gptkb:MLlib
gptkbp:bfsLayer 4