Dataproc

URI: https://gptkb.org/entity/Dataproc

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:cloud_service
gptkbp:billingModel	per-second billing
gptkbp:developedBy	gptkb:Google
gptkbp:documentation	https://cloud.google.com/dataproc/docs
gptkbp:enables	managed clusters scalable data processing
gptkbp:firstReleased	2016
gptkbp:hasFeature	gptkb:mobile_application API access encryption in transit job scheduling encryption at rest autoscaling integration with Vertex AI diagnostic tools IAM integration custom images preemptible VMs secure cluster connectivity workflow templates serverless Spark jobs cluster resizing custom initialization actions integration with Dataproc Hub integration with VPC single-node clusters
gptkbp:integratesWith	gptkb:Google_Cloud_Storage gptkb:BigQuery gptkb:Cloud_Logging gptkb:Cloud_Monitoring gptkb:Cloud_Dataproc_Metastore
gptkbp:offeredBy	gptkb:Google_Cloud_Platform
gptkbp:regionAvailability	multiple Google Cloud regions
gptkbp:supports	gptkb:Java gptkb:Python gptkb:Scala gptkb:Jupyter_Notebooks gptkb:Apache_Hadoop gptkb:Apache_Hive gptkb:Apache_Spark gptkb:Apache_Pig
gptkbp:url	https://console.cloud.google.com/dataproc
gptkbp:usedFor	data analytics big data processing ETL workflows
gptkbp:bfsParent	gptkb:Cloud_Spanner gptkb:Vertex_AI
gptkbp:bfsLayer	6
http://www.w3.org/2000/01/rdf-schema#label	Dataproc