Skip to main content

clusters

Creates, updates, deletes or gets a cluster resource or lists clusters in a region

Overview

Nameclusters
TypeResource
DescriptionResource Type definition for AWS::SageMaker::Cluster
Idawscc.sagemaker.clusters

Fields

NameDatatypeDescription
cluster_arnstringThe Amazon Resource Name (ARN) of the HyperPod Cluster.
vpc_configobjectSpecifies an Amazon Virtual Private Cloud (VPC) that your SageMaker jobs, hosted models, and compute resources have access to. You can control access to and from your resources by configuring a VPC. For more information, see https://docs.aws.amazon.com/sagemaker/latest/dg/infrastructure-give-access.html
node_recoverystringIf node auto-recovery is set to true, faulty nodes will be replaced or rebooted when a failure is detected. If set to false, nodes will be labelled when a fault is detected.
instance_groupsarrayThe instance groups of the SageMaker HyperPod cluster.
restricted_instance_groupsarrayThe restricted instance groups of the SageMaker HyperPod cluster.
orchestratorobjectSpecifies parameter(s) specific to the orchestrator, e.g. specify the EKS cluster.
cluster_rolestringThe cluster role for the autoscaler to assume.
node_provisioning_modestringDetermines the scaling strategy for the SageMaker HyperPod cluster. When set to 'Continuous', enables continuous scaling which dynamically manages node provisioning. If the parameter is omitted, uses the standard scaling approach in previous release.
creation_timestringThe time at which the HyperPod cluster was created.
cluster_namestringThe name of the HyperPod Cluster.
failure_messagestringThe failure message of the HyperPod Cluster.
auto_scalingobjectConfiguration for cluster auto-scaling
cluster_statusstringThe status of the HyperPod Cluster.
tagsarrayCustom tags for managing the SageMaker HyperPod cluster as an AWS resource. You can add tags to your cluster in the same way you add them in other AWS services that support tagging.
regionstringAWS region.

For more information, see AWS::SageMaker::Cluster.

Methods

NameResourceAccessible byRequired Params
create_resourceclustersINSERT, region
delete_resourceclustersDELETEIdentifier, region
update_resourceclustersUPDATEIdentifier, PatchDocument, region
list_resourcesclusters_list_onlySELECTregion
get_resourceclustersSELECTIdentifier, region

SELECT examples

Gets all properties from an individual cluster.

SELECT
region,
cluster_arn,
vpc_config,
node_recovery,
instance_groups,
restricted_instance_groups,
orchestrator,
cluster_role,
node_provisioning_mode,
creation_time,
cluster_name,
failure_message,
auto_scaling,
cluster_status,
tags
FROM awscc.sagemaker.clusters
WHERE
region = '{{ region }}' AND
Identifier = '{{ cluster_arn }}';

INSERT example

Use the following StackQL query and manifest file to create a new cluster resource, using stack-deploy.

/*+ create */
INSERT INTO awscc.sagemaker.clusters (
,
region
)
SELECT
'{{ }}',
'{{ region }}'
RETURNING
ErrorCode,
EventTime,
Identifier,
Operation,
OperationStatus,
RequestToken,
ResourceModel,
RetryAfter,
StatusMessage,
TypeName
;

UPDATE example

Use the following StackQL query and manifest file to update a cluster resource, using stack-deploy.

/*+ update */
UPDATE awscc.sagemaker.clusters
SET PatchDocument = string('{{ {
"NodeRecovery": node_recovery,
"ClusterRole": cluster_role,
"NodeProvisioningMode": node_provisioning_mode,
"AutoScaling": auto_scaling,
"Tags": tags
} | generate_patch_document }}')
WHERE
region = '{{ region }}' AND
Identifier = '{{ cluster_arn }}'
RETURNING
ErrorCode,
EventTime,
Identifier,
Operation,
OperationStatus,
RequestToken,
ResourceModel,
RetryAfter,
StatusMessage,
TypeName
;

DELETE example

/*+ delete */
DELETE FROM awscc.sagemaker.clusters
WHERE
Identifier = '{{ cluster_arn }}' AND
region = '{{ region }}'
RETURNING
ErrorCode,
EventTime,
Identifier,
Operation,
OperationStatus,
RequestToken,
ResourceModel,
RetryAfter,
StatusMessage,
TypeName
;

Additional Parameters

Mutable resources in the Cloud Control provider support additional optional parameters which can be supplied with INSERT, UPDATE, or DELETE operations. These include:

ParameterDescription
ClientToken
A unique identifier to ensure the idempotency of the resource request.This allows the provider to accurately distinguish between retries and new requests.
A client token is valid for 36 hours once used.
After that, a resource request with the same client token is treated as a new request.
If you do not specify a client token, one is generated for inclusion in the request.
RoleArn
The ARN of the IAM role used to perform this resource operation.The role specified must have the permissions required for this operation.
If you do not specify a role, a temporary session is created using your AWS user credentials.
TypeVersionId
For private resource types, the type version to use in this resource operation.If you do not specify a resource version, the default version is used.

Permissions

To operate on the clusters resource, the following permissions are required:

sagemaker:DescribeCluster,
sagemaker:ListTags