@@ -7,9 +7,50 @@ Release Notes
7
7
New features, bug fixes, and improvements are regularly made to the SageMaker
8
8
distributed data parallel library.
9
9
10
- SageMaker Distributed Data Parallel 1.6 .0 Release Notes
10
+ SageMaker Distributed Data Parallel 1.7 .0 Release Notes
11
11
=======================================================
12
12
13
+ *Date: Feb. 10. 2022 *
14
+
15
+ **New Features **
16
+
17
+ * Added support for PyTorch 1.13.1.
18
+
19
+ **Improvements **
20
+
21
+ * SMDDP throws timeout attribution that provides a more descriptive message about what causes timeout error.
22
+
23
+ **Bug Fixes **
24
+
25
+ * Improved tests for large model collectives (LMC) such as AllGather and ReduceScatter.
26
+ * Fixed the missing Estimator arguments (hyperparameters, specifically) issue when launching a distributed
27
+ training job with a shell script as the entry point and with the pytorchddp distribution strategy.
28
+
29
+ **Migration to AWS Deep Learning Containers **
30
+
31
+ This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers (DLC):
32
+
33
+ * PyTorch 1.13.1 DLC
34
+
35
+ .. code ::
36
+
37
+ 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:1.13.1-gpu-py39-cu117-ubuntu20.04-sagemaker
38
+
39
+ Binary file of this version of the library for custom container users:
40
+
41
+ .. code ::
42
+
43
+ https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.13.1/cu117/2023-01-09/smdistributed_dataparallel-1.7.0-cp39-cp39-linux_x86_64.whl
44
+
45
+
46
+ ----
47
+
48
+ Release History
49
+ ===============
50
+
51
+ SageMaker Distributed Data Parallel 1.6.0 Release Notes
52
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
53
+
13
54
*Date: Dec. 15. 2022 *
14
55
15
56
**New Features **
@@ -44,11 +85,6 @@ Binary file of this version of the library for `custom container
44
85
https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.12.1/cu113/2022-12-05/smdistributed_dataparallel-1.6.0-cp38-cp38-linux_x86_64.whl
45
86
46
87
47
- ----
48
-
49
- Release History
50
- ===============
51
-
52
88
SageMaker Distributed Data Parallel 1.5.0 Release Notes
53
89
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
54
90
0 commit comments