Python (PySpark)

GPTKB entity

Statements (94)
Predicate Object
gptkbp:instanceOf gptkb:REST_API
gptkb:software
gptkbp:canBe gptkb:Hadoop
gptkb:Azure_HDInsight
gptkb:Jupyter_Notebook
gptkb:Hive
gptkb:AWS_EMR
gptkb:Google_Dataproc
gptkbp:category gptkb:Machine_Learning
gptkb:Data_Science
Big Data
Distributed Systems
gptkbp:class pyspark.SparkContext
pyspark.ml
pyspark.sql.Column
pyspark.sql.DataFrame
pyspark.sql.DataFrameReader
pyspark.sql.DataFrameWriter
pyspark.sql.GroupedData
pyspark.sql.Row
pyspark.sql.SparkSession
pyspark.sql.SparkSession.Builder
pyspark.sql.Window
pyspark.sql.catalog
pyspark.sql.catalog.Catalog
pyspark.sql.functions
pyspark.sql.functions.add_months
pyspark.sql.functions.array
pyspark.sql.functions.avg
pyspark.sql.functions.col
pyspark.sql.functions.concat
pyspark.sql.functions.count
pyspark.sql.functions.current_date
pyspark.sql.functions.current_timestamp
pyspark.sql.functions.date_add
pyspark.sql.functions.date_format
pyspark.sql.functions.date_sub
pyspark.sql.functions.datediff
pyspark.sql.functions.dayofmonth
pyspark.sql.functions.explode
pyspark.sql.functions.expr
pyspark.sql.functions.from_unixtime
pyspark.sql.functions.hour
pyspark.sql.functions.last_day
pyspark.sql.functions.lit
pyspark.sql.functions.max
pyspark.sql.functions.min
pyspark.sql.functions.minute
pyspark.sql.functions.month
pyspark.sql.functions.months_between
pyspark.sql.functions.next_day
pyspark.sql.functions.pandas_udf
pyspark.sql.functions.round
pyspark.sql.functions.second
pyspark.sql.functions.split
pyspark.sql.functions.struct
pyspark.sql.functions.sum
pyspark.sql.functions.to_date
pyspark.sql.functions.trunc
pyspark.sql.functions.udf
pyspark.sql.functions.unix_timestamp
pyspark.sql.functions.when
pyspark.sql.functions.window
pyspark.sql.functions.year
pyspark.sql.streaming
pyspark.sql.types
pyspark.sql.udf
pyspark.streaming
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:documentation https://spark.apache.org/docs/latest/api/python/
gptkbp:firstReleased 2012
https://www.w3.org/2000/01/rdf-schema#label Python (PySpark)
gptkbp:latestReleaseVersion 3.4.1
gptkbp:license gptkb:Apache_License_2.0
gptkbp:openSource true
gptkbp:operatingSystem Cross-platform
gptkbp:partOf gptkb:Apache_Spark
gptkbp:programmingLanguage gptkb:Python
gptkbp:repository https://github.com/apache/spark
gptkbp:requires gptkb:Java
gptkb:Python_3.x
gptkbp:supports gptkb:RDD_API
gptkb:Spark_MLlib
gptkb:Spark_Streaming
gptkb:GraphX_(limited)
gptkb:DataFrame_API
SQL queries
gptkbp:usedFor gptkb:machine_learning
data analysis
distributed computing
big data processing
gptkbp:website https://spark.apache.org/docs/latest/api/python/
gptkbp:bfsParent gptkb:AWS_Glue_Studio
gptkbp:bfsLayer 6