Skip to content

Commit 8c8092c

Browse files
committed
add more info about volume_size
1 parent 284ddbe commit 8c8092c

File tree

1 file changed

+72
-8
lines changed

1 file changed

+72
-8
lines changed

src/sagemaker/estimator.py

Lines changed: 72 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -165,10 +165,42 @@ def __init__(
165165
instance_type (str): Type of EC2 instance to use for training,
166166
for example, ``'ml.c4.xlarge'``. Required if instance_groups is
167167
not set.
168-
volume_size (int): Size in GB of the EBS volume to use for
169-
storing input data during training (default: 30). Must be large
170-
enough to store training data if File Mode is used (which is the
171-
default).
168+
volume_size (int): Size in GB of the storage volume to use for
169+
storing input and output data during training (default: 30).
170+
171+
Must be large enough to store training data if File mode is
172+
used, which is the default mode.
173+
174+
When using an ML instance with the EBS-only storage option and
175+
without instance storage such as `NVMe SSD volumes
176+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_,
177+
you must define the size of EBS
178+
volume through the ``volume_size`` parameter in the estimator class.
179+
For example, ML instance families that use EBS volumes include
180+
``ml.c5`` and ``ml.p2``.
181+
182+
.. note::
183+
184+
When using an ML instance with `NVMe SSD volumes
185+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_,
186+
SageMaker doesn't provision Amazon EBS General Purpose SSD
187+
(gp2) storage.
188+
Available storage is fixed to the NVMe-type instance's storage
189+
capacity. SageMaker configures storage paths for training
190+
datasets, checkpoints, model artifacts, and outputs to use the
191+
entire capacity of the instance storage. For example, ML
192+
instance families with the NVMe-type instance storage include
193+
``ml.p4d``, ``ml.g4dn``, and ``ml.g5``.
194+
195+
To look up instance types and their instance storage types
196+
and volumes, see `Amazon EC2 Instance Types
197+
<http://aws.amazon.com/ec2/instance-types/>`_.
198+
199+
To find the default local paths defined by the SageMaker
200+
training platform, see `Amazon SageMaker Training Storage
201+
Folders for Training Datasets, Checkpoints, Model Artifacts,
202+
and Outputs
203+
<https://docs.aws.amazon.com/sagemaker/latest/dg/model-train-storage.html>`_.
172204
volume_kms_key (str): Optional. KMS key ID for encrypting EBS
173205
volume attached to the training instance (default: None).
174206
max_run (int): Timeout in seconds for training (default: 24 *
@@ -2196,10 +2228,42 @@ def __init__(
21962228
instance_type (str): Type of EC2 instance to use for training,
21972229
for example, 'ml.c4.xlarge'. Required if instance_groups is
21982230
not set.
2199-
volume_size (int): Size in GB of the EBS volume to use for
2200-
storing input data during training (default: 30). Must be large
2201-
enough to store training data if File Mode is used (which is the
2202-
default).
2231+
volume_size (int): Size in GB of the storage volume to use for
2232+
storing input and output data during training (default: 30).
2233+
2234+
Must be large enough to store training data if File mode is
2235+
used, which is the default mode.
2236+
2237+
When using an ML instance with the EBS-only storage option and
2238+
without instance storage such as `NVMe SSD volumes
2239+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_,
2240+
you must define the size of EBS
2241+
volume through the ``volume_size`` parameter in the estimator class.
2242+
For example, ML instance families that use EBS volumes include
2243+
``ml.c5`` and ``ml.p2``.
2244+
2245+
.. note::
2246+
2247+
When using an ML instance with `NVMe SSD volumes
2248+
<https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html#nvme-ssd-volumes>`_,
2249+
SageMaker doesn't provision Amazon EBS General Purpose SSD
2250+
(gp2) storage.
2251+
Available storage is fixed to the NVMe-type instance's storage
2252+
capacity. SageMaker configures storage paths for training
2253+
datasets, checkpoints, model artifacts, and outputs to use the
2254+
entire capacity of the instance storage. For example, ML
2255+
instance families with the NVMe-type instance storage include
2256+
``ml.p4d``, ``ml.g4dn``, and ``ml.g5``.
2257+
2258+
To look up instance types and their instance storage types
2259+
and volumes, see `Amazon EC2 Instance Types
2260+
<http://aws.amazon.com/ec2/instance-types/>`_.
2261+
2262+
To find the default local paths defined by the SageMaker
2263+
training platform, see `Amazon SageMaker Training Storage
2264+
Folders for Training Datasets, Checkpoints, Model Artifacts,
2265+
and Outputs
2266+
<https://docs.aws.amazon.com/sagemaker/latest/dg/model-train-storage.html>`_.
22032267
volume_kms_key (str): Optional. KMS key ID for encrypting EBS
22042268
volume attached to the training instance (default: None).
22052269
max_run (int): Timeout in seconds for training (default: 24 *

0 commit comments

Comments
 (0)