Skip to content

Commit 5f6ca0a

Browse files
mchoi8739aaronmarkham
authored andcommitted
documentation: add more info about volume_size (aws#3289)
Co-authored-by: Aaron Markham <[email protected]>
1 parent 0fd9b54 commit 5f6ca0a

File tree

1 file changed

+77
-9
lines changed

1 file changed

+77
-9
lines changed

src/sagemaker/estimator.py

Lines changed: 77 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -166,10 +166,44 @@ def __init__(
166166
instance_type (str): Type of EC2 instance to use for training,
167167
for example, ``'ml.c4.xlarge'``. Required if instance_groups is
168168
not set.
169-
volume_size (int): Size in GB of the EBS volume to use for
170-
storing input data during training (default: 30). Must be large
171-
enough to store training data if File Mode is used (which is the
172-
default).
169+
volume_size (int): Size in GB of the storage volume to use for
170+
storing input and output data during training (default: 30).
171+
172+
Must be large enough to store training data if File mode is
173+
used, which is the default mode.
174+
175+
When you use an ML instance with the EBS-only storage option
176+
such as ``ml.c5`` and ``ml.p2``,
177+
you must define the size of the EBS
178+
volume through the ``volume_size`` parameter in the estimator class.
179+
180+
.. note::
181+
182+
When you use an ML instance with `NVMe SSD volumes
183+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_
184+
such as ``ml.p4d``, ``ml.g4dn``, and ``ml.g5``,
185+
do not include this parameter in the estimator configuration.
186+
If you use one of those ML instance types,
187+
SageMaker doesn't provision Amazon EBS General Purpose SSD
188+
(gp2) storage nor take this parameter to adjust the NVMe instance storage.
189+
Available storage is fixed to the NVMe instance storage
190+
capacity. SageMaker configures storage paths for training
191+
datasets, checkpoints, model artifacts, and outputs to use the
192+
entire capacity of the instance storage.
193+
194+
Note that if you include this parameter and specify a number that
195+
exceeds the size of the NVMe volume attached to the instance type,
196+
SageMaker returns an ``Invalid VolumeSizeInGB`` error.
197+
198+
To look up instance types and their instance storage types
199+
and volumes, see `Amazon EC2 Instance Types
200+
<http://aws.amazon.com/ec2/instance-types/>`_.
201+
202+
To find the default local paths defined by the SageMaker
203+
training platform, see `Amazon SageMaker Training Storage
204+
Folders for Training Datasets, Checkpoints, Model Artifacts,
205+
and Outputs
206+
<https://docs.aws.amazon.com/sagemaker/latest/dg/model-train-storage.html>`_.
173207
volume_kms_key (str): Optional. KMS key ID for encrypting EBS
174208
volume attached to the training instance (default: None).
175209
max_run (int): Timeout in seconds for training (default: 24 *
@@ -2232,12 +2266,46 @@ def __init__(
22322266
instance_count (int): Number of Amazon EC2 instances to use
22332267
for training. Required if instance_groups is not set.
22342268
instance_type (str): Type of EC2 instance to use for training,
2235-
for example, 'ml.c4.xlarge'. Required if instance_groups is
2269+
for example, ``'ml.c4.xlarge'``. Required if instance_groups is
22362270
not set.
2237-
volume_size (int): Size in GB of the EBS volume to use for
2238-
storing input data during training (default: 30). Must be large
2239-
enough to store training data if File Mode is used (which is the
2240-
default).
2271+
volume_size (int): Size in GB of the storage volume to use for
2272+
storing input and output data during training (default: 30).
2273+
2274+
Must be large enough to store training data if File mode is
2275+
used, which is the default mode.
2276+
2277+
When you use an ML instance with the EBS-only storage option
2278+
such as ``ml.c5`` and ``ml.p2``,
2279+
you must define the size of the EBS
2280+
volume through the ``volume_size`` parameter in the estimator class.
2281+
2282+
.. note::
2283+
2284+
When you use an ML instance with `NVMe SSD volumes
2285+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_
2286+
such as ``ml.p4d``, ``ml.g4dn``, and ``ml.g5``,
2287+
do not include this parameter in the estimator configuration.
2288+
If you use one of those ML instance types,
2289+
SageMaker doesn't provision Amazon EBS General Purpose SSD
2290+
(gp2) storage nor take this parameter to adjust the NVMe instance storage.
2291+
Available storage is fixed to the NVMe instance storage
2292+
capacity. SageMaker configures storage paths for training
2293+
datasets, checkpoints, model artifacts, and outputs to use the
2294+
entire capacity of the instance storage.
2295+
2296+
Note that if you include this parameter and specify a number that
2297+
exceeds the size of the NVMe volume attached to the instance type,
2298+
SageMaker returns an ``Invalid VolumeSizeInGB`` error.
2299+
2300+
To look up instance types and their instance storage types
2301+
and volumes, see `Amazon EC2 Instance Types
2302+
<http://aws.amazon.com/ec2/instance-types/>`_.
2303+
2304+
To find the default local paths defined by the SageMaker
2305+
training platform, see `Amazon SageMaker Training Storage
2306+
Folders for Training Datasets, Checkpoints, Model Artifacts,
2307+
and Outputs
2308+
<https://docs.aws.amazon.com/sagemaker/latest/dg/model-train-storage.html>`_.
22412309
volume_kms_key (str): Optional. KMS key ID for encrypting EBS
22422310
volume attached to the training instance (default: None).
22432311
max_run (int): Timeout in seconds for training (default: 24 *

0 commit comments

Comments
 (0)