@@ -6,9 +6,60 @@ New features, bug fixes, and improvements are regularly made to the SageMaker
6
6
distributed model parallel library.
7
7
8
8
9
- SageMaker Distributed Model Parallel 1.11 .0 Release Notes
9
+ SageMaker Distributed Model Parallel 1.13 .0 Release Notes
10
10
=========================================================
11
11
12
+ *Date: Dec. 15. 2022 *
13
+
14
+ **New Features **
15
+
16
+ * Sharded data parallelism now supports a new backend for collectives called *SMDDP Collectives *.
17
+ For supported scenarios, SMDDP Collectives are on by default for the AllGather operation.
18
+ For more information, see
19
+ `Sharded data parallelism with SMDDP Collectives
20
+ <https://docs.aws.amazon.com/sagemaker/latest/dg/model-parallel-extended-features-pytorch-sharded-data-parallelism.html#model-parallel-extended-features-pytorch-sharded-data-parallelism-smddp-collectives> `_
21
+ in the *Amazon SageMaker Developer Guide *.
22
+ * Introduced FlashAttention for DistributedTransformer to improve memory usage and computational
23
+ performance of models such as GPT2, GPTNeo, GPTJ, GPTNeoX, BERT, and RoBERTa.
24
+
25
+ **Bug Fixes **
26
+
27
+ * Fixed initialization of ``lm_head `` in DistributedTransformer to use a provided range
28
+ for initialization, when weights are not tied with the embeddings.
29
+
30
+ **Improvements **
31
+
32
+ * When a module has no parameters, we have introduced an optimization to execute
33
+ such a module on the same rank as its parent during pipeline parallelism.
34
+
35
+ **Migration to AWS Deep Learning Containers **
36
+
37
+ This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers (DLC):
38
+
39
+ - SageMaker training container for PyTorch v1.12.1
40
+
41
+ .. code ::
42
+
43
+ 763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.12.1-gpu-py38-cu113-ubuntu20.04-sagemaker
44
+
45
+
46
+ Binary file of this version of the library for `custom container
47
+ <https://docs.aws.amazon.com/sagemaker/latest/dg/model-parallel-sm-sdk.html#model-parallel-bring-your-own-container> `_ users:
48
+
49
+ - For PyTorch 1.12.0
50
+
51
+ .. code ::
52
+
53
+ https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.12.1/build-artifacts/2022-12-08-21-34/smdistributed_modelparallel-1.13.0-cp38-cp38-linux_x86_64.whl
54
+
55
+ ----
56
+
57
+ Release History
58
+ ===============
59
+
60
+ SageMaker Distributed Model Parallel 1.11.0 Release Notes
61
+ ---------------------------------------------------------
62
+
12
63
*Date: August. 17. 2022 *
13
64
14
65
**New Features **
@@ -41,12 +92,7 @@ Binary file of this version of the library for `custom container
41
92
42
93
.. code ::
43
94
44
- https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.12.0/build-artifacts/2022-08-12-16-58/smdistributed_modelparallel-1.11.0-cp38-cp38-linux_x86_64.whl
45
-
46
- ----
47
-
48
- Release History
49
- ===============
95
+ https://sagemaker-distribu
50
96
51
97
SageMaker Distributed Model Parallel 1.10.1 Release Notes
52
98
---------------------------------------------------------
0 commit comments