
Commit a5d38f6

update smdmp change log, archive api doc for 1.4.0 and 1.5.0
1 parent 2beb91e commit a5d38f6

9 files changed: 2435 additions and 6 deletions
doc/api/training/smd_model_parallel_release_notes/smd_model_parallel_change_log.rst

Lines changed: 69 additions & 6 deletions
@@ -1,6 +1,67 @@
-Sagemaker Distributed Model Parallel 1.4.0 Release Notes
+Sagemaker Distributed Model Parallel 1.6.0 Release Notes
 ========================================================
 
+*Date: December. 20. 2021*
+
+**New Features**
+
+- **PyTorch**
+
+  - Added extended memory-saving features for PyTorch 1.8.1:
+
+    - Tensor parallelism
+    - Optimizer state sharding
+    - Activation checkpointing
+    - Activation offloading
+
+    For more information, see the following documentation:
+
+    - `SageMaker distributed model parallel developer guide <https://docs.aws.amazon.com/sagemaker/latest/dg/model-parallel-extended-features-pytorch.html>`_
+    - `SageMaker distributed model parallel API documentation for v1.6.0 <https://sagemaker.readthedocs.io/en/stable/api/training/smp_versions/latest.html>`_
+
+**Migration to AWS Deep Learning Containers**
+
+This version passed benchmark testing and is migrated to the following
+AWS Deep Learning Container(s):
+
+- Deep Learning Container for PyTorch 1.8.1:
+
+  .. code::
+
+    763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.8.1-gpu-py36-cu111-ubuntu18.04
+
+----
+
+Release History
+===============
+
+Sagemaker Distributed Model Parallel 1.5.0 Release Notes
+--------------------------------------------------------
+
+*Date: November. 03. 2021*
+
+**New Features**
+
+- **PyTorch**
+
+  - Currency update for PyTorch 1.10.0
+
+**Migration to AWS Deep Learning Containers**
+
+This version passed benchmark testing and is migrated to the following
+AWS Deep Learning Containers:
+
+- Deep Learning Container for PyTorch 1.10.0:
+
+  .. code::
+
+    763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.10.0-gpu-py38-cu113-ubuntu20.04-sagemaker
+
+----
+
+Sagemaker Distributed Model Parallel 1.4.0 Release Notes
+--------------------------------------------------------
+
 *Date: June. 29. 2021*
 
 **New Features**
@@ -15,17 +76,19 @@ Sagemaker Distributed Model Parallel 1.4.0 Release Notes
 This version passed benchmark testing and is migrated to the following
 AWS Deep Learning Containers:
 
-- TensorFlow 2.5.0 DLC release: `v1.0-tf-2.5.0-tr-py37
-  <https://github.com/aws/deep-learning-containers/releases/tag/v1.0-tf-2.5.0-tr-py37>`__
+- Deep Learning Container for TensorFlow 2.5.0:
 
   .. code::
 
     763104351884.dkr.ecr.<region>.amazonaws.com/tensorflow-training:2.5.0-gpu-py37-cu112-ubuntu18.04-v1.0
 
-----
+- Deep Learning Container for PyTorch 1.9.1:
 
-Release History
-===============
+  .. code::
+
+    763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.9.1-gpu-py38-cu111-ubuntu20.04
+
+----
 
 Sagemaker Distributed Model Parallel 1.3.1 Release Notes
 --------------------------------------------------------
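
Note on the image URIs in the change log: each one carries a ``<region>`` placeholder that has to be filled in before the image can be pulled. A minimal sketch of that substitution follows. The helper name ``dlc_image_uri`` and the example region ``us-west-2`` are hypothetical; the account ID, host pattern, repository, and tag are copied from the change log entries. Some AWS regions may use a different account ID or domain suffix, so treat the default as an assumption to check against the AWS list of available images.

```python
def dlc_image_uri(region, repository, tag, account="763104351884"):
    """Fill the <region> placeholder of a Deep Learning Container image URI.

    Hypothetical helper: the URI layout mirrors the change log entries;
    the default account ID is the one shown there and may not apply
    to every region.
    """
    if not region:
        raise ValueError("region must be a non-empty string")
    return f"{account}.dkr.ecr.{region}.amazonaws.com/{repository}:{tag}"


# Example: the PyTorch 1.8.1 image listed for v1.6.0, in an assumed region.
uri = dlc_image_uri(
    "us-west-2",
    "pytorch-training",
    "1.8.1-gpu-py36-cu111-ubuntu18.04",
)
print(uri)
```

The same call covers the other entries by swapping the repository and tag, for example ``tensorflow-training`` with ``2.5.0-gpu-py37-cu112-ubuntu18.04-v1.0``.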
Lines changed: 10 additions & 0 deletions
@@ -0,0 +1,10 @@
+#!/usr/bin/env python
+# add_no_index2.py
+import fileinput
+import sys
+
+for line in fileinput.input(inplace=True):
+    if '.. class::' in line or '.. function::' in line or '.. data::' in line or '.. _' in line:
+        sys.stdout.write(line + ' :noindex:\n')
+    else:
+        sys.stdout.write(line)
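
Note on ``add_no_index2.py``: the script rewrites each file named on the command line in place, and because every ``line`` still ends with its newline, the ``:noindex:`` text lands on a new line directly below the matching directive. A minimal sketch of that behavior on a throwaway file follows. The wrapper function ``add_noindex``, the ``MARKERS`` tuple, and the sample RST text are hypothetical; the matching rule and the appended string are copied from the script as shown.

```python
import fileinput
import os
import sys
import tempfile

# Same substrings the script checks for, gathered in one tuple.
MARKERS = ('.. class::', '.. function::', '.. data::', '.. _')


def add_noindex(path):
    """Apply the script's line-by-line rewrite to one file, in place."""
    with fileinput.input(files=[path], inplace=True) as stream:
        for line in stream:
            # With inplace=True, sys.stdout is redirected into the file.
            if any(marker in line for marker in MARKERS):
                sys.stdout.write(line + ' :noindex:\n')
            else:
                sys.stdout.write(line)


# Hypothetical sample input, written to a temporary .rst file.
sample = (
    "Some prose.\n"
    ".. function:: smp.init()\n"
    "\n"
    "   Initialize the library.\n"
)
with tempfile.NamedTemporaryFile('w', suffix='.rst', delete=False) as handle:
    handle.write(sample)
    sample_path = handle.name

add_noindex(sample_path)
with open(sample_path) as handle:
    result = handle.read()
os.remove(sample_path)
print(result)
```

Running the rewrite twice on the same file would add a second ``:noindex:`` line under each directive, since the script does not check whether the option is already present.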
