Commit 468653f

documentation: smddp v1.7.0 release note
1 parent 5cf3e44 commit 468653f

2 files changed: 44 additions, 8 deletions

doc/api/training/sdp_versions/latest.rst

Lines changed: 2 additions & 2 deletions
@@ -26,8 +26,8 @@ depending on the version of the library you use.
 <https://docs.aws.amazon.com/sagemaker/latest/dg/data-parallel-use-api.html#data-parallel-use-python-skd-api>`_
 for more information.
 
-Version 1.4.0, 1.4.1, 1.5.0, 1.6.0 (Latest)
-===========================================
+For versions between 1.4.0 and 1.7.0 (Latest)
+=============================================
 
 .. toctree::
    :maxdepth: 1

doc/api/training/smd_data_parallel_release_notes/smd_data_parallel_change_log.rst

Lines changed: 42 additions & 6 deletions
@@ -7,9 +7,50 @@ Release Notes
 New features, bug fixes, and improvements are regularly made to the SageMaker
 distributed data parallel library.
 
-SageMaker Distributed Data Parallel 1.6.0 Release Notes
+SageMaker Distributed Data Parallel 1.7.0 Release Notes
 =======================================================
 
+*Date: Feb. 10. 2023*
+
+**New Features**
+
+* Added support for PyTorch 1.13.1.
+
+**Improvements**
+
+* SMDDP now reports timeout attribution, a more descriptive message about what caused a timeout error.
+
+**Bug Fixes**
+
+* Improved tests for large model collectives (LMC) such as AllGather and ReduceScatter.
+* Fixed an issue where Estimator arguments (specifically, hyperparameters) were missing when launching a distributed
+  training job with a shell script as the entry point and the pytorchddp distribution strategy.
+
+**Migration to AWS Deep Learning Containers**
+
+This version passed benchmark testing and has been migrated to the following AWS Deep Learning Containers (DLC):
+
+* PyTorch 1.13.1 DLC
+
+.. code::
+
+   763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:1.13.1-gpu-py39-cu117-ubuntu20.04-sagemaker
+
+Binary file of this version of the library for custom container users:
+
+.. code::
+
+   https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.13.1/cu117/2023-01-09/smdistributed_dataparallel-1.7.0-cp39-cp39-linux_x86_64.whl
+
+
+----
+
+Release History
+===============
+
+SageMaker Distributed Data Parallel 1.6.0 Release Notes
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
 *Date: Dec. 15. 2022*
 
 **New Features**
@@ -44,11 +85,6 @@ Binary file of this version of the library for `custom container
    https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.12.1/cu113/2022-12-05/smdistributed_dataparallel-1.6.0-cp38-cp38-linux_x86_64.whl
 
 
-----
-
-Release History
-===============
-
 SageMaker Distributed Data Parallel 1.5.0 Release Notes
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

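The 1.7.0 note lists a DLC image URI and a wheel URL for custom container users. The sketch below is one possible way to check that the two describe the same PyTorch, CUDA, and Python versions before installing the wheel into a custom image. The helper names (`parse_wheel_url`, `parse_image_tag`, `wheel_matches_image`) are hypothetical, and the URL path and image tag layouts are assumptions inferred only from the two strings in this release note, not a documented naming scheme.

```python
from urllib.parse import urlparse

# Both strings are copied verbatim from the 1.7.0 release note in this commit.
IMAGE_URI = (
    "763104351884.dkr.ecr.us-east-1.amazonaws.com/"
    "pytorch-training:1.13.1-gpu-py39-cu117-ubuntu20.04-sagemaker"
)
WHEEL_URL = (
    "https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.13.1/cu117/"
    "2023-01-09/smdistributed_dataparallel-1.7.0-cp39-cp39-linux_x86_64.whl"
)


def parse_wheel_url(url):
    """Split the wheel URL into framework, CUDA, and wheel filename tags."""
    # Assumed layout: binary/<framework>/<fw_version>/<cuda>/<date>/<wheel file>
    _, framework, fw_version, cuda, build_date, filename = (
        urlparse(url).path.strip("/").split("/")
    )
    # Wheel filenames (without the optional build tag) have the form
    # <name>-<version>-<python tag>-<abi tag>-<platform tag>.whl
    name, lib_version, python_tag, abi_tag, platform_tag = (
        filename[: -len(".whl")].split("-")
    )
    return {
        "framework": framework,
        "fw_version": fw_version,
        "cuda": cuda,
        "build_date": build_date,
        "name": name,
        "lib_version": lib_version,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


def parse_image_tag(uri):
    """Split the image tag that follows the last ':' in the image URI."""
    # Assumed layout: <fw_version>-<device>-<python>-<cuda>-<os>-sagemaker
    tag = uri.rsplit(":", 1)[1]
    fw_version, device, python, cuda, os_name, _ = tag.split("-")
    return {
        "fw_version": fw_version,
        "device": device,
        "python": python,
        "cuda": cuda,
        "os": os_name,
    }


def wheel_matches_image(wheel, image):
    """Return True when framework, CUDA, and Python versions all agree."""
    return (
        wheel["fw_version"] == image["fw_version"]
        and wheel["cuda"] == image["cuda"]
        # "cp39" (wheel) and "py39" (image) both encode Python 3.9.
        and wheel["python_tag"][2:] == image["python"][2:]
    )


if __name__ == "__main__":
    wheel = parse_wheel_url(WHEEL_URL)
    image = parse_image_tag(IMAGE_URI)
    print(wheel["lib_version"], wheel_matches_image(wheel, image))
```

Both parsers use strict tuple unpacking, so a URL or tag that does not follow the assumed layout raises a `ValueError` instead of silently producing a wrong match.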