Spark Structured Streaming

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf stream processing engine
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:feature scalability
fault tolerance
watermarking
checkpointing
integration with batch processing
late data handling
output modes
trigger intervals
gptkbp:firstReleased 2016
https://www.w3.org/2000/01/rdf-schema#label Spark Structured Streaming
gptkbp:input gptkb:File
gptkb:Kafka
gptkb:Socket
gptkb:Amazon_Kinesis
gptkb:Azure_Event_Hubs
gptkbp:latestReleaseVersion 3.5.1
gptkbp:license gptkb:Apache_License_2.0
gptkbp:output gptkb:Amazon_S3
gptkb:File
gptkb:Kafka
gptkb:Memory
gptkb:Console
gptkb:Azure_Blob_Storage
gptkb:Delta_Lake
Foreach
gptkbp:outputMode complete
update
append
gptkbp:partOf gptkb:Apache_Spark
gptkbp:processor real-time data streams
gptkbp:supports end-to-end exactly-once semantics (with replayable sources and idempotent sinks)
stateful operations
windowed aggregations
event-time processing
continuous processing mode (experimental)
micro-batch processing mode
gptkbp:supportsLanguage gptkb:Java
gptkb:Python
gptkb:Scala
R
gptkbp:uses gptkb:Dataset_API
gptkb:DataFrame_API
gptkbp:website https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
gptkbp:writtenBy gptkb:Java
gptkb:Python
gptkb:Scala
R
gptkbp:bfsParent gptkb:Cloudflow
gptkbp:bfsLayer 7
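
Illustrative sketch (not part of the GPTKB statements): the watermarking, late data handling, windowed aggregation, and micro-batch entries above can be pictured with a small plain-Python model. It does not use Spark's API. The function name `run_batches`, the integer event times, and the rule that the watermark is recomputed once per micro-batch as "maximum event time seen minus an allowed delay" are simplifying assumptions chosen for this example; the real engine's behavior is documented in the programming guide linked above.

```python
from collections import defaultdict


def run_batches(batches, window=10, delay=5):
    """Count events per tumbling event-time window across micro-batches.

    Hypothetical model: `batches` is a list of lists of integer event times.
    Returns (emitted, dropped), where `emitted` holds finalized
    (window_start, count) pairs in append style and `dropped` holds events
    that arrived after their window had already passed the watermark.
    """
    counts = defaultdict(int)  # window start -> running count (the state)
    watermark = 0              # event-time threshold for "too late"
    max_seen = 0               # largest event time observed so far
    emitted = []
    dropped = []

    for batch in batches:
        for t in batch:
            start = (t // window) * window
            # A window that ends at or before the watermark is closed.
            if start + window <= watermark:
                dropped.append(t)
                continue
            counts[start] += 1
            max_seen = max(max_seen, t)

        # Assumption: the watermark only advances between micro-batches.
        watermark = max(watermark, max_seen - delay)

        # Finalize every window the watermark has passed and free its state.
        for start in sorted(counts):
            if start + window <= watermark:
                emitted.append((start, counts.pop(start)))

    return emitted, dropped


if __name__ == "__main__":
    emitted, dropped = run_batches([[1, 4, 12], [3, 17], [2, 26]])
    print(emitted)  # [(0, 3), (10, 2)]
    print(dropped)  # [2]
```

In this run the event at time 3 arrives late but is still counted, because its window [0, 10) has not yet passed the watermark. The event at time 2 arrives after that window was finalized and is dropped. The window [20, 30) stays in state because the watermark has not reached 30.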