OpenStack Sahara

GPTKB entity

Statements (59)
Predicate Object
gptkbp:instanceOf OpenStack data processing service
gptkbp:allows data scientists to create clusters
gptkbp:can_be cluster provisioning
gptkbp:compatibleWith gptkb:OpenStack
Kubernetes
gptkbp:deployedTo private clouds
public clouds
Hadoop clusters
Storm clusters
Spark clusters
gptkbp:documentedIn OpenStack documentation
gptkbp:enables data analytics
gptkbp:features data visualization
job scheduling
gptkbp:has_a users
developers
https://www.w3.org/2000/01/rdf-schema#label OpenStack Sahara
gptkbp:integration gptkb:Cassandra
MongoDB
HDFS
gptkbp:introduced 2013
gptkbp:is_a_time_for big data challenges
gptkbp:is_available_in Apache License 2.0
gptkbp:is_designed_to serve data analysts
serve data engineers
reduce operational complexity
simplify big data deployments
gptkbp:is_featured_in gptkb:OpenStack_Summit
gptkbp:is_integrated_with data lakes
gptkbp:is_known_for flexibility
scalability
cost-effectiveness
gptkbp:is_part_of gptkb:OpenStack_Big_Data_project
gptkb:OpenStack_ecosystem
OpenStack release cycle
gptkbp:is_recognized_for plugins
gptkbp:is_supported_by various vendors
gptkbp:is_used_in big data processing
cloud environments
real-time data processing
research institutions
enterprises
machine learning workloads
ETL processes
gptkbp:isFacilitatedBy data pipelines
gptkbp:maintainedBy gptkb:OpenStack_Foundation
gptkbp:offers user interface
REST API
gptkbp:produces gptkb:OpenStack_community
gptkbp:provides monitoring tools
cluster management
data processing frameworks
data processing services
gptkbp:supports gptkb:Hadoop
Spark
multi-tenancy
multiple data sources
Storm
gptkbp:wrote Python