processing_jobs
Creates, updates, deletes or gets a processing_job resource or lists processing_jobs in a region
Overview
| Name | processing_jobs |
| Type | Resource |
| Description | Resource Type definition for AWS::SageMaker::ProcessingJob |
| Id | awscc.sagemaker.processing_jobs |
Fields
| Name | Datatype | Description |
|---|---|---|
app_specification | object | Configures the processing job to run a specified Docker container image. |
environment | object | Sets the environment variables in the Docker container |
experiment_config | object | Associates a SageMaker job as a trial component with an experiment and trial. |
network_config | object | Networking options for a job, such as network traffic encryption between containers, whether to allow inbound and outbound network calls to and from containers, and the VPC subnets and security groups to use for VPC-enabled jobs. |
processing_inputs | array | An array of inputs configuring the data to download into the processing container. |
processing_job_name | string | The name of the processing job. The name must be unique within an AWS Region in the AWS account. |
processing_output_config | object | Configuration for uploading output from the processing container. |
processing_resources | object | Identifies the resources, ML compute instances, and ML storage volumes to deploy for a processing job. In distributed training, you specify more than one instance. |
role_arn | string | The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf. |
stopping_condition | object | Configures conditions under which the processing job should be stopped, such as how long the processing job has been running. After the condition is met, the processing job is stopped. |
tags | array | (Optional) An array of key-value pairs. For more information, see Using Cost Allocation Tags(https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/cost-alloc-tags.html#allocation-whatURL) in the AWS Billing and Cost Management User Guide. |
processing_job_arn | string | The Amazon Resource Name (ARN) of the processing job. |
auto_ml_job_arn | string | The ARN of an AutoML job associated with this processing job. |
exit_message | string | An optional string, up to one KB in size, that contains metadata from the processing container when the processing job exits. |
failure_reason | string | A string, up to one KB in size, that contains the reason a processing job failed, if it failed. |
monitoring_schedule_arn | string | The ARN of a monitoring schedule for an endpoint associated with this processing job. |
training_job_arn | string | The ARN of a training job associated with this processing job |
processing_job_status | string | Provides the status of a processing job. |
creation_time | string | The time at which the processing job was created. |
last_modified_time | string | The time at which the processing job was last modified. |
processing_start_time | string | The time at which the processing job started. |
processing_end_time | string | The time at which the processing job completed. |
region | string | AWS region. |
For more information, see AWS::SageMaker::ProcessingJob.
Methods
| Name | Accessible by | Required Params |
|---|---|---|
create_resource | INSERT | AppSpecification, ProcessingResources, RoleArn, region |
delete_resource | DELETE | data__Identifier, region |
list_resources | SELECT | region |
get_resource | SELECT | data__Identifier, region |
SELECT examples
Gets all properties from an individual processing_job.
SELECT
region,
app_specification,
environment,
experiment_config,
network_config,
processing_inputs,
processing_job_name,
processing_output_config,
processing_resources,
role_arn,
stopping_condition,
tags,
processing_job_arn,
auto_ml_job_arn,
exit_message,
failure_reason,
monitoring_schedule_arn,
training_job_arn,
processing_job_status,
creation_time,
last_modified_time,
processing_start_time,
processing_end_time
FROM awscc.sagemaker.processing_jobs
WHERE region = 'us-east-1' AND data__Identifier = '<ProcessingJobArn>';
INSERT example
Use the following StackQL query and manifest file to create a new processing_job resource, using stack-deploy.
- Required Properties
- All Properties
- Manifest
/*+ create */
INSERT INTO awscc.sagemaker.processing_jobs (
AppSpecification,
ProcessingResources,
RoleArn,
region
)
SELECT
'{{ AppSpecification }}',
'{{ ProcessingResources }}',
'{{ RoleArn }}',
'{{ region }}';
/*+ create */
INSERT INTO awscc.sagemaker.processing_jobs (
AppSpecification,
Environment,
ExperimentConfig,
NetworkConfig,
ProcessingInputs,
ProcessingJobName,
ProcessingOutputConfig,
ProcessingResources,
RoleArn,
StoppingCondition,
Tags,
region
)
SELECT
'{{ AppSpecification }}',
'{{ Environment }}',
'{{ ExperimentConfig }}',
'{{ NetworkConfig }}',
'{{ ProcessingInputs }}',
'{{ ProcessingJobName }}',
'{{ ProcessingOutputConfig }}',
'{{ ProcessingResources }}',
'{{ RoleArn }}',
'{{ StoppingCondition }}',
'{{ Tags }}',
'{{ region }}';
version: 1
name: stack name
description: stack description
providers:
- aws
globals:
- name: region
value: '{{ vars.AWS_REGION }}'
resources:
- name: processing_job
props:
- name: AppSpecification
value:
ContainerArguments:
- '{{ ContainerArguments[0] }}'
ContainerEntrypoint:
- '{{ ContainerEntrypoint[0] }}'
ImageUri: '{{ ImageUri }}'
- name: Environment
value: {}
- name: ExperimentConfig
value:
ExperimentName: '{{ ExperimentName }}'
TrialName: '{{ TrialName }}'
TrialComponentDisplayName: '{{ TrialComponentDisplayName }}'
RunName: '{{ RunName }}'
- name: NetworkConfig
value:
EnableInterContainerTrafficEncryption: '{{ EnableInterContainerTrafficEncryption }}'
EnableNetworkIsolation: '{{ EnableNetworkIsolation }}'
VpcConfig:
SecurityGroupIds:
- '{{ SecurityGroupIds[0] }}'
Subnets:
- '{{ Subnets[0] }}'
- name: ProcessingInputs
value:
- S3Input:
LocalPath: '{{ LocalPath }}'
S3CompressionType: '{{ S3CompressionType }}'
S3DataDistributionType: '{{ S3DataDistributionType }}'
S3DataType: '{{ S3DataType }}'
S3InputMode: '{{ S3InputMode }}'
S3Uri: '{{ S3Uri }}'
DatasetDefinition:
AthenaDatasetDefinition:
Catalog: '{{ Catalog }}'
Database: '{{ Database }}'
OutputS3Uri: '{{ OutputS3Uri }}'
QueryString: '{{ QueryString }}'
WorkGroup: '{{ WorkGroup }}'
OutputFormat: '{{ OutputFormat }}'
KmsKeyId: '{{ KmsKeyId }}'
OutputCompression: '{{ OutputCompression }}'
RedshiftDatasetDefinition:
Database: '{{ Database }}'
DbUser: '{{ DbUser }}'
QueryString: '{{ QueryString }}'
ClusterId: '{{ ClusterId }}'
ClusterRoleArn: '{{ ClusterRoleArn }}'
OutputS3Uri: '{{ OutputS3Uri }}'
OutputFormat: '{{ OutputFormat }}'
KmsKeyId: '{{ KmsKeyId }}'
OutputCompression: '{{ OutputCompression }}'
DataDistributionType: '{{ DataDistributionType }}'
InputMode: '{{ InputMode }}'
LocalPath: '{{ LocalPath }}'
InputName: '{{ InputName }}'
AppManaged: '{{ AppManaged }}'
- name: ProcessingJobName
value: '{{ ProcessingJobName }}'
- name: ProcessingOutputConfig
value:
KmsKeyId: '{{ KmsKeyId }}'
Outputs:
- OutputName: '{{ OutputName }}'
AppManaged: '{{ AppManaged }}'
S3Output:
LocalPath: '{{ LocalPath }}'
S3UploadMode: '{{ S3UploadMode }}'
S3Uri: '{{ S3Uri }}'
FeatureStoreOutput:
FeatureGroupName: '{{ FeatureGroupName }}'
- name: ProcessingResources
value:
ClusterConfig:
InstanceCount: '{{ InstanceCount }}'
InstanceType: '{{ InstanceType }}'
VolumeSizeInGB: '{{ VolumeSizeInGB }}'
VolumeKmsKeyId: '{{ VolumeKmsKeyId }}'
- name: RoleArn
value: '{{ RoleArn }}'
- name: StoppingCondition
value:
MaxRuntimeInSeconds: '{{ MaxRuntimeInSeconds }}'
- name: Tags
value:
- Value: '{{ Value }}'
Key: '{{ Key }}'
DELETE example
/*+ delete */
DELETE FROM awscc.sagemaker.processing_jobs
WHERE data__Identifier = '<ProcessingJobArn>'
AND region = 'us-east-1';
Permissions
To operate on the processing_jobs resource, the following permissions are required:
Create
ec2:CreateNetworkInterface,
ec2:CreateNetworkInterfacePermission,
ec2:DeleteNetworkInterface,
ec2:DeleteNetworkInterfacePermission,
ec2:DescribeNetworkInterfaces,
ec2:DescribeVpcs,
ec2:DescribeDhcpOptions,
ec2:DescribeSubnets,
ec2:DescribeSecurityGroups,
kms:CreateGrant,
kms:Decrypt,
kms:DescribeKey,
sagemaker:CreateProcessingJob,
sagemaker:DescribeProcessingJob,
sagemaker:AddTags,
sagemaker:ListTags,
iam:PassRole
Read
sagemaker:DescribeProcessingJob,
sagemaker:ListTags
Delete
ec2:DeleteNetworkInterface,
ec2:DeleteNetworkInterfacePermission,
ec2:DescribeNetworkInterfaces,
ec2:DescribeVpcs,
ec2:DescribeSubnets,
ec2:DescribeSecurityGroups,
sagemaker:StopProcessingJob,
sagemaker:DescribeProcessingJob,
sagemaker:ListTags,
sagemaker:DeleteTags
List
sagemaker:ListProcessingJobs,
sagemaker:DescribeProcessingJob,
sagemaker:ListTags