You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm a software engineer at AWS that is experiencing this bug in SageMaker Studio (JumpStart). My alias is @evakravi
Describe the bug
If you attempt to deploy a model and are successful in a) uploading the script/deps to s3, (b) creating a model, (c) creating an endpoint config, but d) FAIL to create the endpoint (which could happen if an account does not have GPU instances allocated), then if you attempt to re-deploy the same model with parameters such that the endpoint creation succeeds, the deployment will still fail.
To reproduce
Deploy a model on a disallowed instance type, then deploy the identical model to a permitted instance type.
Expected behavior
The deployment to an allowed instance type should be successful, but it ends up failing.
System information
A description of your system. Please provide:
SageMaker Python SDK version: 2.21.0
Python version: Python 3.7.10
CPU or GPU: SageMaker Studio Jupyter Server
Custom Docker image (Y/N): SageMaker Studio Jupyter Server
Additional context
While this issue is related to SageMaker JumpStart ModelHub, the sagemaker api issue can be reproduced outside of SageMaker Studio.
The text was updated successfully, but these errors were encountered:
Something went wrong
We encountered an error while preparing to deploy your endpoint. You can get more details below.
operation deploy failed: An error occurred (ResourceLimitExceeded) when calling the CreateEndpoint operation: The account-level service limit 'ml.g4dn.xlarge for endpoint usage' is 0 Instances, with current utilization of 0 Instances and a request delta of 1 Instances. Please contact AWS support to request an increase for this limit.
I'm a software engineer at AWS that is experiencing this bug in SageMaker Studio (JumpStart). My alias is @evakravi
Describe the bug
If you attempt to deploy a model and are successful in a) uploading the script/deps to s3, (b) creating a model, (c) creating an endpoint config, but d) FAIL to create the endpoint (which could happen if an account does not have GPU instances allocated), then if you attempt to re-deploy the same model with parameters such that the endpoint creation succeeds, the deployment will still fail.
To reproduce
Deploy a model on a disallowed instance type, then deploy the identical model to a permitted instance type.
Expected behavior
The deployment to an allowed instance type should be successful, but it ends up failing.
System information
A description of your system. Please provide:
Additional context
While this issue is related to SageMaker JumpStart ModelHub, the sagemaker api issue can be reproduced outside of SageMaker Studio.
The text was updated successfully, but these errors were encountered: