NAME
    Paws::Glue::CreateCrawler - Arguments for method CreateCrawler on Paws::Glue

DESCRIPTION
    This class represents the parameters used for calling the method CreateCrawler on the AWS Glue service. Use the attributes of this class as arguments to method CreateCrawler.

    You shouldn't make instances of this class. Each attribute should be used as a named argument in the call to CreateCrawler.

SYNOPSIS
    my $glue = Paws->service('Glue');
    my $CreateCrawlerResponse = $glue->CreateCrawler(
      Name    => 'MyNameString',
      Role    => 'MyRole',
      Targets => {
        CatalogTargets => [
          {
            DatabaseName => 'MyNameString',    # min: 1, max: 255
            Tables       => [
              'MyNameString', ...              # min: 1, max: 255
            ],                                 # min: 1
          },
          ...
        ],    # OPTIONAL
        DynamoDBTargets => [
          {
            Path     => 'MyPath',    # OPTIONAL
            ScanAll  => 1,           # OPTIONAL
            ScanRate => 1,           # OPTIONAL
          },
          ...
        ],    # OPTIONAL
        JdbcTargets => [
          {
            ConnectionName => 'MyConnectionName',    # OPTIONAL
            Exclusions     => [
              'MyPath', ...                          # OPTIONAL
            ],                                       # OPTIONAL
            Path => 'MyPath',                        # OPTIONAL
          },
          ...
        ],    # OPTIONAL
        MongoDBTargets => [
          {
            ConnectionName => 'MyConnectionName',    # OPTIONAL
            Path           => 'MyPath',              # OPTIONAL
            ScanAll        => 1,                     # OPTIONAL
          },
          ...
        ],    # OPTIONAL
        S3Targets => [
          {
            ConnectionName => 'MyConnectionName',    # OPTIONAL
            Exclusions     => [
              'MyPath', ...                          # OPTIONAL
            ],                                       # OPTIONAL
            Path       => 'MyPath',                  # OPTIONAL
            SampleSize => 1,                         # OPTIONAL
          },
          ...
        ],    # OPTIONAL
      },
      Classifiers => [
        'MyNameString', ...    # min: 1, max: 255
      ],    # OPTIONAL
      Configuration                => 'MyCrawlerConfiguration',            # OPTIONAL
      CrawlerSecurityConfiguration => 'MyCrawlerSecurityConfiguration',    # OPTIONAL
      DatabaseName                 => 'MyDatabaseName',                    # OPTIONAL
      Description                  => 'MyDescriptionString',               # OPTIONAL
      LineageConfiguration         => {
        CrawlerLineageSettings => 'ENABLE',    # values: ENABLE, DISABLE; OPTIONAL
      },    # OPTIONAL
      RecrawlPolicy => {
        RecrawlBehavior => 'CRAWL_EVERYTHING',    # values: CRAWL_EVERYTHING, CRAWL_NEW_FOLDERS_ONLY; OPTIONAL
      },    # OPTIONAL
      Schedule           => 'MyCronExpression',    # OPTIONAL
      SchemaChangePolicy => {
        DeleteBehavior => 'LOG',    # values: LOG, DELETE_FROM_DATABASE, DEPRECATE_IN_DATABASE; OPTIONAL
        UpdateBehavior => 'LOG',    # values: LOG, UPDATE_IN_DATABASE; OPTIONAL
      },    # OPTIONAL
      TablePrefix => 'MyTablePrefix',    # OPTIONAL
      Tags        => {
        'MyTagKey' => 'MyTagValue',    # key: min: 1, max: 128, value: max: 256
      },    # OPTIONAL
    );
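The listing above shows every argument at once. As a smaller illustration, the sketch below passes only the three arguments this class marks as REQUIRED (Name, Role, Targets) and traps a failed call. The helper names `minimal_crawler_args` and `create_crawler`, the option keys, and all sample values are hypothetical and not part of Paws. The error handling assumes that Paws reports failures by throwing an exception, which may be an object with a `message` method.

```perl
use strict;
use warnings;
use Scalar::Util qw(blessed);

# Hypothetical helper: build the named arguments for a crawler over a
# single S3 path. Only Name, Role and Targets are marked REQUIRED.
sub minimal_crawler_args {
    my (%opt) = @_;
    for my $key (qw(name role s3_path)) {
        die "missing option: $key\n" unless defined $opt{$key};
    }
    return (
        Name    => $opt{name},
        Role    => $opt{role},
        Targets => { S3Targets => [ { Path => $opt{s3_path} } ] },
    );
}

# Hypothetical helper: call CreateCrawler on the given client and trap
# any exception. Returns (1, undef) on success or (0, $message) on failure.
sub create_crawler {
    my ($glue, %args) = @_;
    my $ok = eval { $glue->CreateCrawler(%args); 1 };
    return (1, undef) if $ok;
    my $err = $@;
    # Assumption: the exception may be an object exposing message();
    # otherwise fall back to plain stringification.
    my $message = (blessed($err) && $err->can('message'))
        ? $err->message
        : "$err";
    return (0, $message);
}

# Typical use (needs Paws and AWS credentials; all values are placeholders):
#   use Paws;
#   my $glue = Paws->service('Glue', region => 'us-east-1');
#   my ($ok, $err) = create_crawler(
#       $glue,
#       minimal_crawler_args(
#           name    => 'my-crawler',
#           role    => 'MyGlueRole',
#           s3_path => 's3://my-bucket/data/',
#       ),
#   );
#   warn "CreateCrawler failed: $err" unless $ok;
```

Keeping argument construction separate from the call makes it easy to inspect or log the arguments before anything is sent to the service.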
    Values for attributes that are native types (Int, String, Float, etc.) can be passed as-is (scalar values).

    Values for complex types (objects) can be passed as a HashRef. The keys and values of the HashRef will be used to instantiate the underlying object. For the AWS API documentation, see <https://docs.aws.amazon.com/goto/WebAPI/glue/CreateCrawler>

ATTRIBUTES
  Classifiers => ArrayRef[Str|Undef]
    A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

  Configuration => Str
    Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Configuring a Crawler (https://docs.aws.amazon.com/glue/latest/dg/crawler-configuration.html).

  CrawlerSecurityConfiguration => Str
    The name of the "SecurityConfiguration" structure to be used by this crawler.

  DatabaseName => Str
    The Glue database where results are written, such as: "arn:aws:daylight:us-east-1::database/sometable/*".

  Description => Str
    A description of the new crawler.

  LineageConfiguration => Paws::Glue::LineageConfiguration
    Specifies data lineage configuration settings for the crawler.

  REQUIRED Name => Str
    Name of the new crawler.

  RecrawlPolicy => Paws::Glue::RecrawlPolicy
    A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.

  REQUIRED Role => Str
    The IAM role or Amazon Resource Name (ARN) of an IAM role used by the new crawler to access customer resources.

  Schedule => Str
    A "cron" expression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers (https://docs.aws.amazon.com/glue/latest/dg/monitor-data-warehouse-schedule.html)). For example, to run something every day at 12:15 UTC, you would specify: "cron(15 12 * * ? *)".
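Configuration and Schedule are both passed as plain strings, so it can help to build and check them before making the call. The sketch below is illustrative only: the helper names `daily_cron` and `validated_configuration` are hypothetical, and the sample configuration keys (`CrawlerOutput`, `Partitions`, `AddOrUpdateBehavior`) are an assumption based on the Configuring a Crawler guide linked above, so confirm them there before relying on them. JSON::PP ships with Perl and is used here only to catch malformed JSON locally.

```perl
use strict;
use warnings;
use JSON::PP ();

# Hypothetical helper: format a Schedule string for a crawler that runs
# once a day at the given UTC hour and minute.
sub daily_cron {
    my ($hour, $minute) = @_;
    die "hour must be an integer from 0 to 23\n"
        unless defined $hour && $hour =~ /\A\d+\z/ && $hour <= 23;
    die "minute must be an integer from 0 to 59\n"
        unless defined $minute && $minute =~ /\A\d+\z/ && $minute <= 59;
    return sprintf 'cron(%d %d * * ? *)', $minute, $hour;
}

# Hypothetical helper: check that a Configuration string is well-formed
# JSON holding an object with a Version key, then return it unchanged.
# Only the syntax is checked; the service decides which keys are valid.
sub validated_configuration {
    my ($json_text) = @_;
    my $data = JSON::PP->new->decode($json_text);    # dies on malformed JSON
    die "Configuration must be a JSON object with a Version key\n"
        unless ref $data eq 'HASH' && exists $data->{Version};
    return $json_text;
}

# Every day at 12:15 UTC, matching the example in the Schedule text.
my $schedule = daily_cron(12, 15);    # cron(15 12 * * ? *)

# Sample configuration; the keys below are an assumption (see lead-in).
my $configuration = validated_configuration(
    '{"Version":1.0,"CrawlerOutput":{"Partitions":{"AddOrUpdateBehavior":"InheritFromTable"}}}'
);

# Both values can then be passed as named arguments:
#   $glue->CreateCrawler(..., Schedule => $schedule, Configuration => $configuration);
```

The configuration is kept as literal text and only validated, not re-encoded, so the string sent to the service is exactly the one written in the source.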
  SchemaChangePolicy => Paws::Glue::SchemaChangePolicy
    The policy for the crawler's update and deletion behavior.

  TablePrefix => Str
    The table prefix used for catalog tables that are created.

  Tags => Paws::Glue::TagsMap
    The tags to use with this crawler request. You may use tags to limit access to the crawler. For more information about tags in Glue, see Amazon Web Services Tags in Glue (https://docs.aws.amazon.com/glue/latest/dg/monitor-tags.html) in the developer guide.

  REQUIRED Targets => Paws::Glue::CrawlerTargets
    A collection of targets to crawl.

SEE ALSO
    This class forms part of Paws, documenting arguments for method CreateCrawler in Paws::Glue.

BUGS and CONTRIBUTIONS
    The source code is located here: <https://github.com/pplu/aws-sdk-perl>

    Please report bugs to: <https://github.com/pplu/aws-sdk-perl/issues>