Statements (94)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:REST_API, gptkb:software |
gptkbp:canBe | gptkb:Hadoop, gptkb:Azure_HDInsight, gptkb:Jupyter_Notebook, gptkb:Hive, gptkb:AWS_EMR, gptkb:Google_Dataproc |
gptkbp:category | gptkb:Machine_Learning, gptkb:Data_Science, Big Data, Distributed Systems |
gptkbp:class | pyspark.SparkContext, pyspark.ml, pyspark.sql.Column, pyspark.sql.DataFrame, pyspark.sql.DataFrameReader, pyspark.sql.DataFrameWriter, pyspark.sql.GroupedData, pyspark.sql.Row, pyspark.sql.SparkSession, pyspark.sql.SparkSession.Builder, pyspark.sql.Window, pyspark.sql.catalog, pyspark.sql.catalog.Catalog, pyspark.sql.functions, pyspark.sql.functions.add_months, pyspark.sql.functions.array, pyspark.sql.functions.avg, pyspark.sql.functions.col, pyspark.sql.functions.concat, pyspark.sql.functions.count, pyspark.sql.functions.current_date, pyspark.sql.functions.current_timestamp, pyspark.sql.functions.date_add, pyspark.sql.functions.date_format, pyspark.sql.functions.date_sub, pyspark.sql.functions.datediff, pyspark.sql.functions.dayofmonth, pyspark.sql.functions.explode, pyspark.sql.functions.expr, pyspark.sql.functions.from_unixtime, pyspark.sql.functions.hour, pyspark.sql.functions.last_day, pyspark.sql.functions.lit, pyspark.sql.functions.max, pyspark.sql.functions.min, pyspark.sql.functions.minute, pyspark.sql.functions.month, pyspark.sql.functions.months_between, pyspark.sql.functions.next_day, pyspark.sql.functions.pandas_udf, pyspark.sql.functions.round, pyspark.sql.functions.second, pyspark.sql.functions.split, pyspark.sql.functions.struct, pyspark.sql.functions.sum, pyspark.sql.functions.to_date, pyspark.sql.functions.trunc, pyspark.sql.functions.udf, pyspark.sql.functions.unix_timestamp, pyspark.sql.functions.when, pyspark.sql.functions.window, pyspark.sql.functions.year, pyspark.sql.streaming, pyspark.sql.types, pyspark.sql.udf, pyspark.streaming |
gptkbp:developedBy | gptkb:Apache_Software_Foundation |
gptkbp:documentation | https://spark.apache.org/docs/latest/api/python/ |
gptkbp:firstReleased | 2012 |
https://www.w3.org/2000/01/rdf-schema#label | Python (PySpark) |
gptkbp:latestReleaseVersion | 3.4.1 |
gptkbp:license | gptkb:Apache_License_2.0 |
gptkbp:openSource | true |
gptkbp:operatingSystem | Cross-platform |
gptkbp:partOf | gptkb:Apache_Spark |
gptkbp:programmingLanguage | gptkb:Python |
gptkbp:repository | https://github.com/apache/spark |
gptkbp:requires | gptkb:Java, gptkb:Python_3.x |
gptkbp:supports | gptkb:RDD_API, gptkb:Spark_MLlib, gptkb:Spark_Streaming, gptkb:GraphX_(limited), gptkb:DataFrame_API, SQL queries |
gptkbp:usedFor | gptkb:machine_learning, data analysis, distributed computing, big data processing |
gptkbp:website | https://spark.apache.org/docs/latest/api/python/ |
gptkbp:bfsParent | gptkb:AWS_Glue_Studio |
gptkbp:bfsLayer | 6 |