Py Spark

GPTKB entity

Statements (106)
Predicate Object
gptkbp:instance_of gptkb:Library
gptkb:park
gptkbp:bfsLayer 5
gptkbp:bfsParent gptkb:Spark_3.3
gptkb:Spark_4.0
gptkbp:based_on gptkb:park
gptkbp:can_be_used_with gptkb:Graph_X
gptkb:Jupyter_Notebooks
gptkb:SQL
gptkb:Sultan
gptkb:park
gptkb:Apache_Zeppelin
M Llib
gptkbp:developed_by gptkb:software_framework
gptkbp:has active community
extensive documentation
https://www.w3.org/2000/01/rdf-schema#label Py Spark
gptkbp:integrates_with gptkb:park
gptkbp:is_accessible_by pip
conda
gptkbp:is_compatible_with gptkb:S3
gptkb:Google_Cloud_Dataproc
gptkb:Azure_Blob_Storage
gptkb:Cloud_Computing_Service
gptkb:computer
gptkb:Databricks
gptkb:HDFS
gptkb:Apache_Zeppelin
Azure HD Insight
AWSEMR
gptkbp:is_part_of Apache Spark ecosystem
gptkbp:is_popular_in data science
big data analytics
data engineering
gptkbp:is_used_by gptkb:anime
gptkb:musician
gptkb:streaming_service
gptkb:Airbnb
gptkb:Alibaba
gptkb:Door_Dash
gptkb:Linked_In
gptkb:Lyft
gptkb:Pinterest
gptkb:Quora
gptkb:Reddit
gptkb:Shopify
gptkb:Slack
gptkb:Snapchat
gptkb:Uber
gptkb:Foursquare
gptkb:Kickstarter
gptkb:Author
gptkb:Bloomberg
gptkb:Microsoft
gptkb:The_Guardian
gptkb:The_New_York_Times
gptkb:CERN
gptkb:Instacart
gptkb:Yelp
gptkb:Yahoo
gptkb:CEO
gptkb:album
gptkb:NASA
gptkb:beach
gptkb:Twitter_account
gptkb:Twitch
gptkb:tank
gptkb:Mozilla
gptkb:Zillow
gptkb:collection
gptkbp:is_used_for gptkb:software_framework
data analysis
ETL processes
big data processing
data visualization
real-time analytics
data science
data engineering
stream processing
batch processing
graph processing
gptkbp:language gptkb:Library
gptkbp:offers streaming capabilities
graph processing capabilities
machine learning libraries
gptkbp:passes_through gptkb:fortification
gptkb:Monarch
gptkb:YARN
Standalone cluster
gptkbp:provides machine learning capabilities
SQL querying capabilities
streaming capabilities
graph processing capabilities
Data Frame API
gptkbp:released gptkb:2014
gptkbp:supports gptkb:Graph_X
gptkb:Spark_SQL
SQL queries
Python 2 and 3
M Llib
RDDAPI
dataframe API
Python UD Fs
gptkbp:uses Datasets
RDD (Resilient Distributed Dataset)
Data Frames