gptkbp:instance_of
|
gptkb:streaming_service
|
gptkbp:can
|
batch data
live data streams
|
gptkbp:developed_by
|
gptkb:Apache_Software_Foundation
|
gptkbp:has_feature
|
checkpointing
dynamic allocation
backpressure handling
output operations
custom receivers
streaming SQL
streaming aggregations
streaming joins
streaming window operations
|
https://www.w3.org/2000/01/rdf-schema#label
|
Spark Streaming
|
gptkbp:integrates_with
|
gptkb:Amazon_S3
gptkb:Cassandra
gptkb:Kafka
gptkb:Flume
gptkb:HDFS
gptkb:HBase
|
gptkbp:is_available_on
|
gptkb:2014
|
gptkbp:is_compatible_with
|
gptkb:Apache_Pig
gptkb:SQL
gptkb:Apache_Hive
|
gptkbp:is_part_of
|
big data ecosystem
|
gptkbp:is_used_by
|
gptkb:developers
data scientists
data engineers
|
gptkbp:is_used_in
|
gptkb:machine_learning
data ingestion
data integration
real-time analytics
log processing
|
gptkbp:language
|
gptkb:Java
gptkb:Python
gptkb:Scala
|
gptkbp:provides
|
fault tolerance
real-time data processing
windowed computations
stateful processing
streaming analytics
|
gptkbp:runs_through
|
gptkb:Kubernetes
gptkb:YARN
Mesos
|
gptkbp:supports
|
micro-batch processing
watermarking
event time processing
|
gptkbp:uses
|
gptkb:Apache_Spark
DStream API
Structured Streaming
|
gptkbp:written_in
|
gptkb:Java
gptkb:Python
gptkb:Scala
|
gptkbp:bfsParent
|
gptkb:Apache_Spark
gptkb:MLlib
|
gptkbp:bfsLayer
|
4
|