NAME

Paws::Glue::UpdateCrawler - Arguments for method UpdateCrawler on Paws::Glue

DESCRIPTION

This class represents the parameters used for calling the method UpdateCrawler on the AWS Glue service. Use the attributes of this class as arguments to method UpdateCrawler.

You shouldn't make instances of this class. Each attribute should be used as a named argument in the call to UpdateCrawler.

SYNOPSIS

my $glue = Paws->service('Glue');
my $UpdateCrawlerResponse = $glue->UpdateCrawler(
  Name        => 'MyNameString',
  Classifiers => [
    'MyNameString', ...    # min: 1, max: 255
  ],    # OPTIONAL
  Configuration                => 'MyCrawlerConfiguration',    # OPTIONAL
  CrawlerSecurityConfiguration =>
    'MyCrawlerSecurityConfiguration',                          # OPTIONAL
  DatabaseName         => 'MyDatabaseName',                    # OPTIONAL
  Description          => 'MyDescriptionStringRemovable',      # OPTIONAL
  LineageConfiguration => {
    CrawlerLineageSettings => 'ENABLE',  # values: ENABLE, DISABLE; OPTIONAL
  },    # OPTIONAL
  RecrawlPolicy => {
    RecrawlBehavior => 'CRAWL_EVERYTHING'
    ,    # values: CRAWL_EVERYTHING, CRAWL_NEW_FOLDERS_ONLY; OPTIONAL
  },    # OPTIONAL
  Role               => 'MyRole',              # OPTIONAL
  Schedule           => 'MyCronExpression',    # OPTIONAL
  SchemaChangePolicy => {
    DeleteBehavior => 'LOG'
    ,   # values: LOG, DELETE_FROM_DATABASE, DEPRECATE_IN_DATABASE; OPTIONAL
    UpdateBehavior => 'LOG',    # values: LOG, UPDATE_IN_DATABASE; OPTIONAL
  },    # OPTIONAL
  TablePrefix => 'MyTablePrefix',    # OPTIONAL
  Targets     => {
    CatalogTargets => [
      {
        DatabaseName => 'MyNameString',    # min: 1, max: 255
        Tables       => [
          'MyNameString', ...              # min: 1, max: 255
        ],    # min: 1

      },
      ...
    ],    # OPTIONAL
    DynamoDBTargets => [
      {
        Path     => 'MyPath',    # OPTIONAL
        ScanAll  => 1,           # OPTIONAL
        ScanRate => 1,           # OPTIONAL
      },
      ...
    ],    # OPTIONAL
    JdbcTargets => [
      {
        ConnectionName => 'MyConnectionName',    # OPTIONAL
        Exclusions     => [
          'MyPath', ...                          # OPTIONAL
        ],    # OPTIONAL
        Path => 'MyPath',    # OPTIONAL
      },
      ...
    ],    # OPTIONAL
    MongoDBTargets => [
      {
        ConnectionName => 'MyConnectionName',    # OPTIONAL
        Path           => 'MyPath',              # OPTIONAL
        ScanAll        => 1,                     # OPTIONAL
      },
      ...
    ],    # OPTIONAL
    S3Targets => [
      {
        ConnectionName => 'MyConnectionName',    # OPTIONAL
        Exclusions     => [
          'MyPath', ...                          # OPTIONAL
        ],    # OPTIONAL
        Path       => 'MyPath',    # OPTIONAL
        SampleSize => 1,           # OPTIONAL
      },
      ...
    ],    # OPTIONAL
  },    # OPTIONAL
);

Values for attributes that are native types (Int, String, Float, etc) can passed as-is (scalar values). Values for complex Types (objects) can be passed as a HashRef. The keys and values of the hashref will be used to instance the underlying object. For the AWS API documentation, see https://docs.aws.amazon.com/goto/WebAPI/glue/UpdateCrawler

ATTRIBUTES

Classifiers => ArrayRef[Str|Undef]

A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

Configuration => Str

Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Configuring a Crawler (https://docs.aws.amazon.com/glue/latest/dg/crawler-configuration.html).

CrawlerSecurityConfiguration => Str

The name of the SecurityConfiguration structure to be used by this crawler.

DatabaseName => Str

The Glue database where results are stored, such as: arn:aws:daylight:us-east-1::database/sometable/*.

Description => Str

A description of the new crawler.

LineageConfiguration => Paws::Glue::LineageConfiguration

Specifies data lineage configuration settings for the crawler.

REQUIRED Name => Str

Name of the new crawler.

RecrawlPolicy => Paws::Glue::RecrawlPolicy

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.

Role => Str

The IAM role or Amazon Resource Name (ARN) of an IAM role that is used by the new crawler to access customer resources.

Schedule => Str

A cron expression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers (https://docs.aws.amazon.com/glue/latest/dg/monitor-data-warehouse-schedule.html). For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *).

SchemaChangePolicy => Paws::Glue::SchemaChangePolicy

The policy for the crawler's update and deletion behavior.

TablePrefix => Str

The table prefix used for catalog tables that are created.

Targets => Paws::Glue::CrawlerTargets

A list of targets to crawl.

SEE ALSO

This class forms part of Paws, documenting arguments for method UpdateCrawler in Paws::Glue

BUGS and CONTRIBUTIONS

The source code is located here: https://github.com/pplu/aws-sdk-perl

Please report bugs to: https://github.com/pplu/aws-sdk-perl/issues