Apache Spark MLlib

GPTKB entity

Statements (55)
Predicate Object
gptkbp:instanceOf gptkb:model
gptkbp:developedBy gptkb:Apache_Software_Foundation
gptkbp:firstReleased 2014
https://www.w3.org/2000/01/rdf-schema#label Apache Spark MLlib
gptkbp:integratesWith gptkb:GraphX
gptkb:Spark_SQL
gptkb:Spark_Streaming
gptkbp:latestReleaseVersion 3.5.0
gptkbp:license gptkb:Apache_License_2.0
gptkbp:openSource true
gptkbp:partOf gptkb:Apache_Spark
gptkbp:platform Cross-platform
gptkbp:provides DataFrame-based API
RDD-based API
gptkbp:replacedBy gptkb:Spark_MLlib_RDD_API_(deprecated)
gptkbp:supports gptkb:Naive_Bayes
gptkb:TF-IDF
gptkb:dictionary
gptkb:principal_component_analysis
gptkb:word2vec
gptkb:ALS_(Alternating_Least_Squares)
distributed computing
regression
clustering
cross-validation
decision trees
dimensionality reduction
feature extraction
hyperparameter tuning
k-means clustering
linear regression
logistic regression
random forests
support vector machines
model selection
collaborative filtering
model evaluation
pipelines
parallel processing
feature scaling
feature transformation
count vectorizer
gradient-boosted trees
multilayer perceptron classifier
one-hot encoding
vector assembler
gptkbp:usedFor large-scale machine learning
gptkbp:website https://spark.apache.org/mllib/
gptkbp:writtenBy gptkb:Java
gptkb:Python
gptkb:Scala
R
gptkbp:bfsParent gptkb:Databricks_Runtime_13.x
gptkb:MLlib_RDD-based_API
gptkbp:bfsLayer 7