Glue Crawlers

GPTKB entity

Statements (45)
Predicate Object
gptkbp:instanceOf AWS Service Feature
gptkbp:broadcastOn Yes
gptkbp:canBe Schedule
Crawl policies
Custom classifiers
Include/exclude patterns
Output location
gptkbp:created Partitions
Schema versions
Table definitions
gptkbp:detects File formats
Schema changes
gptkbp:documentation https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html
gptkbp:hasVersion Yes
https://www.w3.org/2000/01/rdf-schema#label Glue Crawlers
gptkbp:integratesWith gptkb:AWS_DataBrew
gptkb:AWS_Lake_Formation
gptkb:AWS_Athena
AWS Glue ETL Jobs
AWS Redshift Spectrum
gptkbp:launched 2017
gptkbp:monitors gptkb:CloudWatch
gptkbp:output Metadata
Catalog tables
Partition information
gptkbp:partOf gptkb:AWS_Glue
gptkbp:provides gptkb:Amazon_Web_Services
gptkbp:purpose Discover data schema
Populate AWS Glue Data Catalog
gptkbp:supports gptkb:Redshift
gptkb:Kafka
gptkb:JDBC
gptkb:MongoDB
gptkb:DynamoDB
gptkb:DocumentDB
S3
Data Lake Storage
gptkbp:supportsIncrementalCrawling Yes
gptkbp:triggeredBy gptkb:EventBridge
On-demand
gptkbp:uses Built-in classifiers
Classifiers
Custom classifiers
gptkbp:bfsParent gptkb:Amazon_Glue
gptkbp:bfsLayer 5