| gptkbp:instanceOf | gptkb:cloud_service |
| gptkbp:billingModel | per-second billing |
| gptkbp:developedBy | gptkb:Google |
| gptkbp:documentation | https://cloud.google.com/dataproc/docs |
| gptkbp:enables | managed clusters, scalable data processing |
| gptkbp:firstReleased | 2016 |
| gptkbp:hasFeature | gptkb:mobile_application, API access, encryption in transit, job scheduling, encryption at rest, autoscaling, integration with Vertex AI, diagnostic tools, IAM integration, custom images, preemptible VMs, secure cluster connectivity, workflow templates, serverless Spark jobs, cluster resizing, custom initialization actions, integration with Dataproc Hub, integration with VPC, single-node clusters |
| gptkbp:integratesWith | gptkb:Google_Cloud_Storage, gptkb:BigQuery, gptkb:Cloud_Logging, gptkb:Cloud_Monitoring, gptkb:Cloud_Dataproc_Metastore |
| gptkbp:offeredBy | gptkb:Google_Cloud_Platform |
| gptkbp:regionAvailability | multiple Google Cloud regions |
| gptkbp:supports | gptkb:Java, gptkb:Python, gptkb:Scala, gptkb:Jupyter_Notebooks, gptkb:Apache_Hadoop, gptkb:Apache_Hive, gptkb:Apache_Spark, gptkb:Apache_Pig |
| gptkbp:url | https://console.cloud.google.com/dataproc |
| gptkbp:usedFor | data analytics, big data processing, ETL workflows |
| gptkbp:bfsParent | gptkb:Cloud_Spanner, gptkb:Vertex_AI |
| gptkbp:bfsLayer | 6 |
| https://www.w3.org/2000/01/rdf-schema#label | Dataproc |