Apache Spark Streaming

GPTKB entity

Properties (54)
Predicate Object
gptkbp:instanceOf gptkb:Streaming_service
gptkbp:canBe scaled horizontally
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:hasAccreditation data to various sinks
gptkbp:hasFeature dynamic scaling
checkpointing
backpressure handling
support for various data formats
integration with SQL
gptkbp:hasOccupation active user community
https://www.w3.org/2000/01/rdf-schema#label Apache Spark Streaming
gptkbp:integratesWith gptkb:Amazon_S3
gptkb:Apache_Kafka
gptkb:Apache_Flume
HDFS
gptkbp:isAvailableIn gptkb:Maven_Central
GitHub
gptkbp:isCompatibleWith gptkb:Spark_GraphX
gptkb:Spark_SQL
Spark MLlib
gptkbp:isDocumentedIn Apache Spark documentation
gptkbp:isFacilitatedBy high-throughput data streams
gptkbp:isInvolvedIn October 2013
gptkbp:isOptimizedFor low-latency processing
gptkbp:isPartOf big data processing frameworks
Apache Spark ecosystem
gptkbp:isUsedBy data scientists
software developers
data engineers
gptkbp:isUsedFor cloud platforms
data enrichment
data ingestion
data transformation
real-time monitoring
on-premises servers
recommendation systems
event detection
alerting systems
machine learning libraries
log processing
graph processing libraries
streaming ETL
gptkbp:isUsedIn big data applications
gptkbp:mayHave data from various sources
gptkbp:provides micro-batch processing
real-time dashboards
streaming analytics
fault tolerance
gptkbp:publishedIn Scala
gptkbp:supports real-time data processing
windowed computations
stateful processing
gptkbp:uses RDDs
DStream API
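
Illustrative sketch

The micro-batch processing and windowed computations listed above can be illustrated without a Spark cluster. The following is a minimal plain-Python sketch of the idea only, not the DStream API: the names `micro_batches` and `windowed_counts`, the `(timestamp, key)` event shape, and the sample data are all assumptions made for illustration.

```python
from collections import Counter, deque
from typing import Deque, Iterable, Iterator, List, Tuple

# Hypothetical event shape: (timestamp in seconds, key).
Event = Tuple[float, str]


def micro_batches(events: Iterable[Event], batch_interval: float) -> Iterator[List[str]]:
    """Group time-ordered events into consecutive fixed-length batches."""
    batch: List[str] = []
    batch_end = batch_interval
    for ts, key in events:
        # Emit every batch (possibly empty) that closed before this event.
        while ts >= batch_end:
            yield batch
            batch = []
            batch_end += batch_interval
        batch.append(key)
    yield batch  # flush the final, possibly partial, batch


def windowed_counts(batches: Iterable[List[str]], window_batches: int) -> Iterator[Counter]:
    """After each batch, yield key counts over the last `window_batches` batches."""
    window: Deque[List[str]] = deque(maxlen=window_batches)
    for batch in batches:
        window.append(batch)  # the oldest batch falls out automatically
        counts: Counter = Counter()
        for b in window:
            counts.update(b)
        yield counts


if __name__ == "__main__":
    # Made-up sample data: five events over roughly three seconds.
    sample = [(0.2, "a"), (0.7, "b"), (1.1, "a"), (2.5, "a"), (3.0, "b")]
    for counts in windowed_counts(micro_batches(sample, 1.0), window_batches=2):
        print(dict(counts))
```

Each yielded `Counter` plays the role of one window's result: the stream is cut into one-second batches, and the count is recomputed over a sliding window of the two most recent batches. A real deployment would distribute each batch across a cluster and add checkpointing for fault tolerance, which this sketch leaves out.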