Apache Spark pools

URI: https://gptkb.org/entity/Apache_Spark_pools

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:cloud_computing_service_feature
gptkbp:allows	batch processing, interactive analytics, machine learning workflows, data engineering workflows, parallel data processing
gptkbp:canBe	library management, node size, number of nodes, auto-pause settings
gptkbp:enables	gptkb:Apache_Spark_workloads
gptkbp:integratesWith	gptkb:Azure_Machine_Learning, gptkb:Azure_Blob_Storage, gptkb:Azure_Synapse_Studio, gptkb:Azure_Data_Lake_Storage
gptkbp:offeredBy	gptkb:Microsoft_Azure
gptkbp:partOf	gptkb:Azure_Synapse_Analytics
gptkbp:provides	on-demand Spark clusters
gptkbp:supports	gptkb:transformation, gptkb:data_visualization, gptkb:Spark_MLlib, gptkb:GraphX, gptkb:Spark_SQL, gptkb:Apache_Spark_3.x, gptkb:Synapse_pipelines, gptkb:Delta_Lake, gptkb:Spark_Streaming, data exploration, role-based access control, job scheduling, autoscaling, integration with Power BI, data lake analytics, monitoring and logging, custom libraries, integration with Azure Active Directory, integration with Azure Event Hubs, integration with Azure Cosmos DB, integration with Azure Key Vault, integration with Azure Data Factory, integration with Azure SQL Database, notebook development
gptkbp:supportsLanguage	gptkb:Python, gptkb:Scala, gptkb:.NET, SQL
gptkbp:usedFor	big data analytics, distributed data processing
gptkbp:bfsParent	gptkb:Microsoft_Azure_Synapse_Analytics
gptkbp:bfsLayer	6
http://www.w3.org/2000/01/rdf-schema#label	Apache Spark pools