gptkbp:instanceOf
|
machine learning library
|
gptkbp:developedBy
|
gptkb:Apache_Software_Foundation
|
gptkbp:firstReleased
|
2013
|
https://www.w3.org/2000/01/rdf-schema#label
|
Spark MLlib
|
gptkbp:integratesWith
|
gptkb:Amazon_S3
gptkb:Avro
gptkb:Hadoop
gptkb:ORC
gptkb:JDBC
gptkb:TensorFlow
gptkb:XGBoost
gptkb:Cassandra
gptkb:Kubernetes
gptkb:scikit-learn
gptkb:HBase
gptkb:Hive
gptkb:HDFS
gptkb:YARN
gptkb:ONNX
gptkb:PMML
gptkb:Delta_Lake
gptkb:MLflow
Parquet
|
gptkbp:latestReleaseVersion
|
3.5.1
|
gptkbp:license
|
gptkb:Apache_License_2.0
|
gptkbp:openSource
|
true
|
gptkbp:partOf
|
gptkb:Apache_Spark
|
gptkbp:provides
|
classification
regression
clustering
dimensionality reduction
feature extraction
collaborative filtering
model evaluation
model tuning
feature transformation
pipeline API
|
gptkbp:supports
|
gptkb:RDD_API
gptkb:DataFrame_API
distributed computing
batch processing
cross-validation
feature engineering
hyperparameter tuning
streaming data
large-scale data processing
model persistence
|
gptkbp:usedFor
|
big data analytics
machine learning workflows
|
gptkbp:website
|
https://spark.apache.org/mllib/
|
gptkbp:writtenBy
|
gptkb:Java
gptkb:Python
gptkb:Scala
R
|