-
Notifications
You must be signed in to change notification settings - Fork 1.2k
SageMaker Bring Your Own Container on local mode - ProcessingOutput is not linked to local filesystem #3083
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @idanmoradarthas Thank you for your feedback! We will bring this to the team and will work on discussing and prioritizing this enhancement as part of our roadmap. Regards, |
Hi @shreyapandit I do want to emphasize that Outseer, my company, will benefit very much from a local mode that is completely local, without internet connection, As we use the local mode for our testing suite. |
Hi @shreyapandit. Would you be able to share any insights on when this could be resolved? Specifically, I'm speaking about the fact that
and this is because it still requires |
the core of this issue seems to be the default_bucket definition in the local session, even though it is specified when creating the session, sagemaker sdk still does the whole _create_s3_bucket_if_it_does_not_exist, which requires internet and credentials set up, which blocks solutions such as localstack for mocking s3 |
this is marked as fixed here but is not actually fixed |
Without the ability to map outputs back to the local filesytem - local mode is basicly unusable - it can't actually even be that hard - there is a volume in the docker-compose for the output directory - its just thrown away on completion... |
Describe the feature you'd like
During the work with SageMaker BYOC on local mode (with Python SDK), we encountered the situation where the outputs of the container are staged into the SageMaker default artifact bucket. Then the SDK does not download those artifacts into the local file system.
How would this feature be used? Please describe.
We want that SDK will download the artifacts created automatically into the local file system.
Describe alternatives you've considered

We had to create a mechanism to download those files by ourselves:
In the following snippet of code, we used https://github.com/aws-samples/amazon-sagemaker-local-mode/blob/main/scikit_learn_bring_your_own_container_local_processing/scikit_learn_bring_your_own_container_local_processing.py as a reference (also used the output_config dictionary)
The text was updated successfully, but these errors were encountered: