Apache Spark pools

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf cloud computing service feature
gptkbp:allows batch processing
interactive analytics
machine learning workflows
data engineering workflows
parallel data processing
gptkbp:canBe library management
node size
number of nodes
auto-pause settings
gptkbp:enables gptkb:Apache_Spark_workloads
https://www.w3.org/2000/01/rdf-schema#label Apache Spark pools
gptkbp:integratesWith gptkb:Azure_Machine_Learning
gptkb:Azure_Blob_Storage
gptkb:Azure_Synapse_Studio
gptkb:Azure_Data_Lake_Storage
gptkbp:offeredBy gptkb:Microsoft_Azure
gptkbp:partOf gptkb:Azure_Synapse_Analytics
gptkbp:provides on-demand Spark clusters
gptkbp:supports gptkb:transformation
gptkb:data_visualization
gptkb:Spark_MLlib
gptkb:GraphX
gptkb:Spark_SQL
gptkb:Apache_Spark_3.x
gptkb:Synapse_pipelines
gptkb:Delta_Lake
gptkb:Spark_Streaming
data exploration
role-based access control
job scheduling
autoscaling
integration with Power BI
data lake analytics
monitoring and logging
custom libraries
integration with Azure Active Directory
integration with Azure Event Hubs
integration with Azure Cosmos DB
integration with Azure Key Vault
integration with Azure Data Factory
integration with Azure SQL Database
notebook development
gptkbp:supportsLanguage gptkb:Python
gptkb:Scala
gptkb:.NET
SQL
gptkbp:usedFor big data analytics
distributed data processing
gptkbp:bfsParent gptkb:Microsoft_Azure_Synapse_Analytics
gptkbp:bfsLayer 6