StreamSets Data Collector

GPTKB entity

Statements (52)
Predicate Object
gptkbp:instanceOf data integration tool
gptkbp:deployment gptkb:cloud_service
hybrid
on-premises
gptkbp:developedBy gptkb:StreamSets_Inc.
gptkbp:documentation https://docs.streamsets.com/portal/#datacollector
gptkbp:feature role-based access control
data lineage tracking
data masking
error handling
pipeline monitoring
pipeline versioning
data drift detection
data preview
containerized deployment
REST API for automation
drag-and-drop pipeline design
edge data collection
integration with StreamSets Control Hub
schema evolution support
gptkbp:firstReleased 2015
https://www.w3.org/2000/01/rdf-schema#label StreamSets Data Collector
gptkbp:integratesWith gptkb:Amazon_S3
gptkb:Google_Cloud_Storage
gptkb:Hadoop
gptkb:Snowflake
gptkb:Apache_Kafka
gptkb:Amazon_Kinesis
gptkb:Elasticsearch
gptkb:MongoDB
gptkb:Amazon_Redshift
gptkb:Apache_HBase
gptkb:Apache_Hive
gptkb:Google_BigQuery
gptkb:Azure_Data_Lake
gptkb:Azure_Synapse
gptkb:HTTP/REST_APIs
JDBC databases
FTP/SFTP
gptkbp:latestReleaseVersion 2024-03-13
5.8.0
gptkbp:license gptkb:Apache_License_2.0
gptkbp:platform Cross-platform
gptkbp:programmingLanguage gptkb:Java
gptkbp:supports real-time data ingestion
batch data ingestion
gptkbp:type gptkb:software
gptkbp:usedFor ETL (Extract, Transform, Load)
data pipeline automation
gptkbp:website https://streamsets.com/products/data-collector/
gptkbp:bfsParent gptkb:StreamSets
gptkbp:bfsLayer 7