-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Repack always uses sklearn image #3143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @jponf , thanks for using Sagemaker! Can you share us the code snippet to reproduce the issue? I'm wondering why we install the dependencies listed in |
Hi @qidewenwhen, Please find attached a small example with a dummy training step and a register step. The expected outcome is that, after training, the repack step will fail because inside the I'll be away for a week or so, in the meantime if you need anything else you can ask my colleague @Guillem96. |
Thanks for the code samples! It helps a lot to quickly reproduce the issue. The repack step is invoking a TrainingJob under the hood to simply repack custom dependencies and code into an existing model TAR archive. I've added this in our backlog and will raise this to my team for discussion. Will get back to you once we figure out the next steps. |
When registering a pytorch model, using python 3.8, it fails because the requirements.txt includes dependencies (numpy is the one failing in this case, but may be others) that cannot be installed during the repack step.
We've found that the repack image cannot be changed and that right now it is an image using Python 3.7, a Python version not supported by our numpy version.
sagemaker-python-sdk/src/sagemaker/workflow/_utils.py
Line 40 in 56452f1
With numpy we can probably remove it from the requirement.txt but there might be other libraries that we cannot ignore this way.
System information
A description of your system. Please provide:
The text was updated successfully, but these errors were encountered: