gptkbp:instanceOf
|
machine learning library
|
gptkbp:developedBy
|
gptkb:Apache_Software_Foundation
|
gptkbp:firstReleased
|
2013
|
https://www.w3.org/2000/01/rdf-schema#label
|
Apache Spark MLlib
|
gptkbp:integratesWith
|
gptkb:GraphX
gptkb:Spark_SQL
gptkb:Spark_Streaming
|
gptkbp:latestReleaseVersion
|
3.5.0
|
gptkbp:license
|
gptkb:Apache_License_2.0
|
gptkbp:openSource
|
true
|
gptkbp:partOf
|
gptkb:Apache_Spark
|
gptkbp:platform
|
Cross-platform
|
gptkbp:provides
|
DataFrame-based API
RDD-based API
|
gptkbp:replacedBy
|
DataFrame-based API (spark.ml) as the primary API; the RDD-based API (spark.mllib) has been in maintenance mode since Spark 2.0
|
gptkbp:supports
|
gptkb:Naive_Bayes
gptkb:TF-IDF
gptkb:dictionary
gptkb:principal_component_analysis
gptkb:word2vec
gptkb:ALS_(Alternating_Least_Squares)
distributed computing
regression
clustering
cross-validation
decision trees
dimensionality reduction
feature extraction
hyperparameter tuning
k-means clustering
linear regression
logistic regression
random forests
support vector machines
model selection
collaborative filtering
model evaluation
pipelines
parallel processing
feature scaling
feature transformation
count vectorizer
gradient-boosted trees
multilayer perceptron classifier
one-hot encoding
vector assembler
|
gptkbp:usedFor
|
large-scale machine learning
|
gptkbp:website
|
https://spark.apache.org/mllib/
|
gptkbp:writtenBy
|
gptkb:Java
gptkb:Python
gptkb:Scala
R
|