gptkbp:instanceOf
|
AWS Service Feature
|
gptkbp:broadcastOn
|
Yes
|
gptkbp:canBe
|
Schedule
Crawl policies
Custom classifiers
Include/exclude patterns
Output location
|
gptkbp:created
|
Partitions
Schema versions
Table definitions
|
gptkbp:detects
|
File formats
Schema changes
|
gptkbp:documentation
|
https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html
|
gptkbp:hasVersion
|
Yes
|
https://www.w3.org/2000/01/rdf-schema#label
|
Glue Crawlers
|
gptkbp:integratesWith
|
gptkb:AWS_DataBrew
gptkb:AWS_Lake_Formation
gptkb:AWS_Athena
AWS Glue ETL Jobs
AWS Redshift Spectrum
|
gptkbp:launched
|
2017
|
gptkbp:monitors
|
gptkb:CloudWatch
|
gptkbp:output
|
Metadata
Catalog tables
Partition information
|
gptkbp:partOf
|
gptkb:AWS_Glue
|
gptkbp:provides
|
gptkb:Amazon_Web_Services
|
gptkbp:purpose
|
Discover data schema
Populate AWS Glue Data Catalog
|
gptkbp:supports
|
gptkb:Redshift
gptkb:Kafka
gptkb:JDBC
gptkb:MongoDB
gptkb:DynamoDB
gptkb:DocumentDB
S3
Data Lake Storage
|
gptkbp:supportsIncrementalCrawling
|
Yes
|
gptkbp:triggeredBy
|
gptkb:EventBridge
On-demand
|
gptkbp:uses
|
Built-in classifiers
Classifiers
Custom classifiers
|
gptkbp:bfsParent
|
gptkb:Amazon_Glue
|
gptkbp:bfsLayer
|
5
|