Properties (57)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:API
|
gptkbp:allows |
cluster management
monitoring of jobs cluster creation job submission |
gptkbp:compatibleWith |
gptkb:Apache_Pig
gptkb:Apache_HBase gptkb:Apache_Hive gptkb:Apache_Flink |
gptkbp:enables |
data analysis
data processing data transformation |
gptkbp:engineConfiguration |
high availability
auto-scaling |
https://www.w3.org/2000/01/rdf-schema#label |
Google Cloud Dataproc API
|
gptkbp:integratesWith |
gptkb:Google_Cloud_Pub/Sub
gptkb:Google_BigQuery gptkb:Google_Cloud_Storage |
gptkbp:is_accessible_by |
client libraries
REST_API gcloud_command-line_tool |
gptkbp:is_available_in |
multiple regions
multiple zones |
gptkbp:is_designed_to |
big data analytics
data lakes data pipelines |
gptkbp:is_part_of |
gptkb:Google_Cloud_Platform
|
gptkbp:is_used_in |
data scientists
data visualization developers report generation data engineers data exploration |
gptkbp:maintainedBy |
Google_Cloud_Team
|
gptkbp:offers |
flexibility
managed services scalability cost efficiency |
gptkbp:provides |
security features
logging capabilities RESTful interface job monitoring features managed_Spark_and_Hadoop_services |
gptkbp:suitableFor |
machine learning workloads
data warehousing ETL_processes |
gptkbp:supports |
gptkb:Java
Python Scala R big data processing batch processing custom images streaming processing preemptible VMs |
gptkbp:uses |
gptkb:Apache_Hadoop
gptkb:Apache_Spark |