SQL-on-Hadoop

GPTKB entity

Statements (62)
Predicate Object
gptkbp:instanceOf gptkb:technology
gptkbp:competitor traditional RDBMS
NoSQL-on-Hadoop
gptkbp:enables business intelligence
data warehousing
ad hoc querying
ETL operations
SQL queries on Hadoop data
gptkbp:example gptkb:Apache_Hive
gptkb:Presto
gptkb:Apache_Drill
gptkb:Spark_SQL
gptkb:Apache_Phoenix
Apache Impala (formerly Cloudera Impala)
Google BigQuery (not Hadoop-based; built on Dremel, which inspired Apache Drill and Impala)
IBM Big SQL
gptkbp:feature gptkb:security
batch processing
user authentication
scalability
fault tolerance
role-based access control
metadata management
parallel execution
integration with BI tools
data federation
interactive querying
support for large datasets
cost-based optimization
distributed query processing
support for multiple data formats
open source and commercial options
schema-on-read
support for ANSI SQL
gptkbp:field big data
distributed computing
data analytics
gptkbp:goal bridge gap between Hadoop and relational databases
make Hadoop accessible to SQL users
https://www.w3.org/2000/01/rdf-schema#label SQL-on-Hadoop
gptkbp:integratesWith gptkb:Impala
gptkb:Phoenix
gptkb:Apache_Hadoop
gptkb:Presto
gptkb:HBase
gptkb:Hive
gptkb:HDFS
gptkb:Spark_SQL
Apache Drill
IBM Big SQL
gptkbp:notableDriver adoption of Hadoop in enterprises
demand for self-service analytics
need for SQL access to big data
gptkbp:originatedIn 2010s
gptkbp:supports structured data analysis
gptkbp:usedBy data analysts
business analysts
data scientists
data engineers
gptkbp:bfsParent gptkb:Apache_Tajo
gptkb:Apache_Drill
gptkbp:bfsLayer 7