-
Notifications
You must be signed in to change notification settings - Fork 72
doc: pytorch serving eia #52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -25,6 +25,7 @@ Table of Contents | |
|
||
#. `Getting Started <#getting-started>`__ | ||
#. `Building your Image <#building-your-image>`__ | ||
#. `Amazon Elastic Inference with PyTorch in SageMaker <#elastic-inference>`__ | ||
#. `Running the tests <#running-the-tests>`__ | ||
|
||
Getting Started | ||
|
@@ -142,6 +143,36 @@ If you want to build "final" Docker images, then use: | |
# GPU | ||
docker build -t preprod-pytorch:1.0.0-gpu-py3 -f docker/1.0.0/final/Dockerfile.gpu --build-arg py_version=3 . | ||
|
||
Amazon Elastic Inference with PyTorch in SageMaker | ||
-------------------------------------------------- | ||
`Amazon Elastic Inference <https://aws.amazon.com/machine-learning/elastic-inference/>`__ allows you to to attach | ||
low-cost GPU-powered acceleration to Amazon EC2 and Amazon SageMaker instances to reduce the cost of running deep | ||
learning inference by up to 75%. Currently, Amazon Elastic Inference supports TensorFlow, Apache MXNet, PyTorch, | ||
and ONNX models, with more frameworks coming soon. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. "with more frameworks coming soon"...if this was originally intended to refer to PyTorch, maybe it should be removed now. |
||
|
||
Support for using PyTorch with Amazon Elastic Inference in SageMaker is supported in the public SageMaker PyTorch serving containers. | ||
|
||
* For information on how to use the Python SDK to create an endpoint with Amazon Elastic Inference and PyTorch in SageMaker, see `Deploying PyTorch Models<https://sagemaker.readthedocs.io/en/stable/using_pytorch.html#deploy-pytorch-models>`__. | ||
* For information on how Amazon Elastic Inference works, see `How EI Works <https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html#ei-how-it-works>`__. | ||
* For more information in regards to using Amazon Elastic Inference in SageMaker, see `Amazon SageMaker Elastic Inference <https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html>`__. | ||
|
||
Building the SageMaker Elastic Inference PyTorch Serving container | ||
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ||
Amazon Elastic Inference is designed to be used with AWS enhanced versions of TensorFlow serving, Apache MXNet or PyTorch serving. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. "PyTorch serving"? |
||
The SageMaker PyTorch containers with Amazon Elastic Inference support were built utilizing the | ||
same instructions listed `above <https://github.com/aws/sagemaker-pytorch-serving-container#building-your-image>`__ with the | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. nit: Should this link just be |
||
EIA Dockerfiles, which are all named ``Dockerfile.eia``, and can be found in the same ``docker/`` directory. | ||
|
||
Example: | ||
|
||
:: | ||
|
||
# PyTorch 1.3.1, Python 3, EI | ||
$ cp dist/sagemaker_pytorch_inference-*.tar.gz dist/sagemaker_pytorch_inference.tar.gz | ||
$ docker build -t preprod-pytorch-serving-eia:1.3.1-cpu-py3 -f docker/1.3.1/py3/Dockerfile.eia . | ||
|
||
|
||
* Currently, only PyTorch serving 1.3.1 is supported for Elastic Inference. | ||
|
||
Running the tests | ||
----------------- | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
#elastic-inference
doesn't work as a link. It should be#amazon-elastic-inference-with-pytorch-in-sagemaker