Skip to content

Commit a35f36a

Browse files
Merge branch 'master' into fix_async_config
2 parents 6dfd75e + 0e9c10e commit a35f36a

File tree

303 files changed

+23331
-2796
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

303 files changed

+23331
-2796
lines changed

.githooks/pre-push

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,5 +12,5 @@ start_time=`date +%s`
1212
tox -e sphinx,doc8 --parallel all
1313
./ci-scripts/displaytime.sh 'sphinx,doc8' $start_time
1414
start_time=`date +%s`
15-
tox -e py36,py37,py38,py39 --parallel all -- tests/unit
16-
./ci-scripts/displaytime.sh 'py36,py37,py38,py39 unit' $start_time
15+
tox -e py37,py38,py39 --parallel all -- tests/unit
16+
./ci-scripts/displaytime.sh 'py37,py38,py39 unit' $start_time

CHANGELOG.md

Lines changed: 237 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,242 @@
11
# Changelog
22

3+
## v2.104.0 (2022-08-17)
4+
5+
### Features
6+
7+
* local mode executor implementation
8+
* Pipelines local mode setup
9+
* Add PT 1.12 support
10+
* added _AnalysisConfigGenerator for clarify
11+
12+
### Bug Fixes and Other Changes
13+
14+
* yaml safe_load sagemaker config
15+
* pipelines local mode minor bug fixes
16+
* add local mode integ tests
17+
* implement local JsonGet function
18+
* Add Pipeline annotation in model base class and tensorflow estimator
19+
* Allow users to customize trial component display names for pipeline launched jobs
20+
* Update localmode code to decode urllib response as UTF8
21+
22+
### Documentation Changes
23+
24+
* New content for Pipelines local mode
25+
* Correct documentation error
26+
27+
## v2.103.0 (2022-08-05)
28+
29+
### Features
30+
31+
* AutoGluon 0.4.3 and 0.5.2 image_uris
32+
33+
### Bug Fixes and Other Changes
34+
35+
* Revert "change: add a check to prevent launching a modelparallel job on CPU only instances"
36+
* Add gpu capability to local
37+
* Link PyTorch 1.11 to 1.11.0
38+
39+
## v2.102.0 (2022-08-04)
40+
41+
### Features
42+
43+
* add warnings for xgboost specific rules in debugger rules
44+
* Add PyTorch DDP distribution support
45+
* Add test for profiler enablement with debugger_hook false
46+
47+
### Bug Fixes and Other Changes
48+
49+
* Two letter language code must be supported
50+
* add a check to prevent launching a modelparallel job on CPU only instances
51+
* Allow StepCollection added in ConditionStep to be depended on
52+
* Add PipelineVariable annotation in framework models
53+
* skip managed spot training mxnet nb
54+
55+
### Documentation Changes
56+
57+
* smdistributed libraries currency updates
58+
59+
## v2.101.1 (2022-07-28)
60+
61+
### Bug Fixes and Other Changes
62+
63+
* added more ml frameworks supported by SageMaker Workflows
64+
* test: Vspecinteg2
65+
* Add PipelineVariable annotation in amazon models
66+
67+
## v2.101.0 (2022-07-27)
68+
69+
### Features
70+
71+
* Algorithms region launch on CGK
72+
* enhance-bucket-override-support
73+
* infer framework and version
74+
* support clarify bias detection when facets not included
75+
* Add CGK region to frameworks by DLC
76+
77+
### Bug Fixes and Other Changes
78+
79+
* Make repack step output path align with model repack path
80+
* Support parameterized source code input for TrainingStep
81+
82+
### Documentation Changes
83+
84+
* heterogeneous cluster api doc fix
85+
* smdmp v1.10 release note
86+
87+
## v2.100.0 (2022-07-18)
88+
89+
### Features
90+
91+
* upgrade to support python 3.10
92+
* Add target_model to support multi-model endpoints
93+
* Added support for feature group schema change and feature parameters
94+
95+
### Bug Fixes and Other Changes
96+
97+
* enable model.register without 'inference' & 'transform' instances
98+
* rename RegisterModel inner steps to prevent duplicate step names
99+
* remove primitive_or_expr() from conditions
100+
* support pipeline variables for spark processors run arguments
101+
* make 'ModelInput' field optional for inference recommendation
102+
* Fix processing image uri param
103+
* fix: neo inferentia as compilation target not using framework ver
104+
105+
### Documentation Changes
106+
107+
* SageMaker model parallel library v1.10.0 documentation
108+
* add detail & links to clarify docstrings
109+
110+
## v2.99.0 (2022-07-08)
111+
112+
### Features
113+
114+
* heterogeneous cluster set up in distribution config
115+
* support heterogeneous cluster for training
116+
* include fields to work with inference recommender
117+
118+
### Bug Fixes and Other Changes
119+
120+
* Moving the newly added field instance_group to the end of method
121+
* image_uri does not need to be specified with instance_groups
122+
* Loosen version of attrs dependency
123+
* Add PipelineVariable annotation in estimatory, processing, tuner, transformer base classes
124+
* model table link
125+
126+
### Documentation Changes
127+
128+
* documentation for heterogeneous cluster
129+
130+
## v2.98.0 (2022-07-05)
131+
132+
### Features
133+
134+
* Adding deepar image
135+
136+
### Documentation Changes
137+
138+
* edit to clarify how to use inference.py
139+
140+
## v2.97.0 (2022-06-28)
141+
142+
### Deprecations and Removals
143+
144+
* remove support for python 3.6
145+
146+
### Features
147+
148+
* update prebuilt models documentation
149+
150+
### Bug Fixes and Other Changes
151+
152+
* Skipping test_candidate_estimator_default_rerun_and_deploy
153+
* Update model name from 'compiled.pt' to 'model.pth' for neo
154+
* update pytest, skip hf integ temp
155+
* Add override_pipeline_parameter_var decorator to give grace period to update invalid pipeline var args
156+
157+
## v2.96.0 (2022-06-20)
158+
159+
### Features
160+
161+
* Add helper method to generate pipeline adjacency list
162+
163+
### Bug Fixes and Other Changes
164+
165+
* changing trcomp integ tests to be able to run in all regions
166+
167+
## v2.95.0 (2022-06-16)
168+
169+
### Features
170+
171+
* Adding Training Compiler support for TensorFlow estimator starting TF 2.9
172+
* Add support for TF 2.9 training
173+
174+
### Bug Fixes and Other Changes
175+
176+
* integs fallback from p3 to p2 instance
177+
* bucket exists check for session.default_bucket
178+
* make instance type fields as optional
179+
180+
### Documentation Changes
181+
182+
* improvements on the docstring of ModelStep
183+
* Add XGBoostProcessor
184+
185+
## v2.94.0 (2022-06-07)
186+
187+
### Features
188+
189+
* AutoGluon 0.4.2 image_uris support
190+
191+
## v2.93.1 (2022-06-06)
192+
193+
### Bug Fixes and Other Changes
194+
195+
* add input parameterization tests for workflow job steps
196+
* add parameterized tests to transformer
197+
198+
## v2.93.0 (2022-06-03)
199+
200+
### Features
201+
202+
* MxNet 1.9 support
203+
204+
### Bug Fixes and Other Changes
205+
206+
* bump importlib-metadata version upperbound to support TF2.9
207+
* fix pipeline doc code example where process.run only accepts argument
208+
* Fix Tensorflow default model_dir generation when output_path is pipeline variable
209+
* Support transformer data parameterization
210+
211+
## v2.92.2 (2022-05-31)
212+
213+
### Bug Fixes and Other Changes
214+
215+
* turn off Pipeline Parameter inheritance from python primitives
216+
* Add more validations for pipeline step new interfaces
217+
* Changed method description per AWS request
218+
219+
## v2.92.1 (2022-05-26)
220+
221+
### Bug Fixes and Other Changes
222+
223+
* pin protobuf to < 4.0 to fix breaking change
224+
225+
## v2.92.0 (2022-05-26)
226+
227+
### Features
228+
229+
* add 'Domain' property to RegisterModel step
230+
231+
### Bug Fixes and Other Changes
232+
233+
* support estimator output path parameterization
234+
* Add back Prevent passing PipelineVariable object into image_uris.retrieve
235+
* jumpstart amt tracking
236+
* fix missing register method params for framework models
237+
* fix docstring for decorated functions
238+
* Documents: add sagemaker model building pipeline readthedocs
239+
3240
## v2.91.1 (2022-05-19)
4241

5242
### Bug Fixes and Other Changes

README.rst

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -87,7 +87,6 @@ Supported Python Versions
8787

8888
SageMaker Python SDK is tested on:
8989

90-
- Python 3.6
9190
- Python 3.7
9291
- Python 3.8
9392
- Python 3.9

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
2.91.2.dev0
1+
2.104.1.dev0

doc/Makefile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33

44
# You can set these variables from the command line.
55
SPHINXOPTS = -W
6-
SPHINXBUILD = python -msphinx
6+
SPHINXBUILD = python3 -msphinx
77
SPHINXPROJ = sagemaker
88
SOURCEDIR = .
99
BUILDDIR = _build

doc/algorithms/index.rst

Lines changed: 7 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,20 +1,15 @@
11
######################
2-
First-Party Algorithms
2+
Built-in Algorithms
33
######################
44

55
Amazon SageMaker provides implementations of some common machine learning algorithms optimized for GPU architecture and massive datasets.
66

77
.. toctree::
88
:maxdepth: 2
99

10-
sagemaker.amazon.amazon_estimator
11-
factorization_machines
12-
ipinsights
13-
kmeans
14-
knn
15-
lda
16-
linear_learner
17-
ntm
18-
object2vec
19-
pca
20-
randomcutforest
10+
tabular/index
11+
text/index
12+
time_series/index
13+
unsupervised/index
14+
vision/index
15+
other/index

doc/algorithms/other/index.rst

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
######################
2+
Other
3+
######################
4+
5+
:ref:`All Pre-trained Models <all-pretrained-models>`
6+
7+
.. toctree::
8+
:maxdepth: 2
9+
10+
sagemaker.amazon.amazon_estimator

doc/algorithms/tabular/autogluon.rst

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
############
2+
AutoGluon
3+
############
4+
5+
`AutoGluon-Tabular <https://auto.gluon.ai/stable/index.html>`__ is a popular open-source AutoML framework that trains highly accurate machine learning models on an unprocessed tabular dataset.
6+
Unlike existing AutoML frameworks that primarily focus on model and hyperparameter selection, AutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers.
7+
8+
9+
The following table outlines a variety of sample notebooks that address different use cases of Amazon SageMaker AutoGluon-Tabular algorithm.
10+
11+
.. list-table::
12+
:widths: 25 25
13+
:header-rows: 1
14+
15+
* - Notebook Title
16+
- Description
17+
* - `Tabular classification with Amazon SageMaker AutoGluon-Tabular algorithm <https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_amazon_algorithms/autogluon_tabular/Amazon_Tabular_Classification_AutoGluon.ipynb>`__
18+
- This notebook demonstrates the use of the Amazon SageMaker AutoGluon-Tabular algorithm to train and host a tabular classification model.
19+
* - `Tabular regression with Amazon SageMaker AutoGluon-Tabular algorithm <https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_amazon_algorithms/autogluon_tabular/Amazon_Tabular_Regression_AutoGluon.ipynb>`__
20+
- This notebook demonstrates the use of the Amazon SageMaker AutoGluon-Tabular algorithm to train and host a tabular regression model.
21+
22+
23+
For instructions on how to create and access Jupyter notebook instances that you can use to run the example in SageMaker, see
24+
`Use Amazon SageMaker Notebook Instances <https://docs.aws.amazon.com/sagemaker/latest/dg/nbi.html>`__. After you have created a notebook
25+
instance and opened it, choose the SageMaker Examples tab to see a list of all of the SageMaker samples. To open a notebook, choose its
26+
Use tab and choose Create copy.
27+
28+
For detailed documentation, please refer to the `Sagemaker AutoGluon-Tabular Algorithm <https://docs.aws.amazon.com/sagemaker/latest/dg/autogluon-tabular.html>`__.

doc/algorithms/tabular/catboost.rst

Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,37 @@
1+
############
2+
CatBoost
3+
############
4+
5+
6+
`CatBoost <https://catboost.ai/>`__ is a popular and high-performance open-source implementation of the Gradient Boosting Decision Tree (GBDT)
7+
algorithm. GBDT is a supervised learning algorithm that attempts to accurately predict a target variable by combining an ensemble of
8+
estimates from a set of simpler and weaker models.
9+
10+
CatBoost introduces two critical algorithmic advances to GBDT:
11+
12+
* The implementation of ordered boosting, a permutation-driven alternative to the classic algorithm
13+
14+
* An innovative algorithm for processing categorical features
15+
16+
Both techniques were created to fight a prediction shift caused by a special kind of target leakage present in all currently existing
17+
implementations of gradient boosting algorithms.
18+
19+
The following table outlines a variety of sample notebooks that address different use cases of Amazon SageMaker CatBoost algorithm.
20+
21+
.. list-table::
22+
:widths: 25 25
23+
:header-rows: 1
24+
25+
* - Notebook Title
26+
- Description
27+
* - `Tabular classification with Amazon SageMaker LightGBM and CatBoost algorithm <https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_amazon_algorithms/lightgbm_catboost_tabular/Amazon_Tabular_Classification_LightGBM_CatBoost.ipynb>`__
28+
- This notebook demonstrates the use of the Amazon SageMaker CatBoost algorithm to train and host a tabular classification model.
29+
* - `Tabular regression with Amazon SageMaker LightGBM and CatBoost algorithm <https://github.com/aws/amazon-sagemaker-examples/blob/main/introduction_to_amazon_algorithms/lightgbm_catboost_tabular/Amazon_Tabular_Regression_LightGBM_CatBoost.ipynb>`__
30+
- This notebook demonstrates the use of the Amazon SageMaker CatBoost algorithm to train and host a tabular regression model.
31+
32+
For instructions on how to create and access Jupyter notebook instances that you can use to run the example in SageMaker, see
33+
`Use Amazon SageMaker Notebook Instances <https://docs.aws.amazon.com/sagemaker/latest/dg/nbi.html>`__. After you have created a notebook
34+
instance and opened it, choose the SageMaker Examples tab to see a list of all of the SageMaker samples. To open a notebook, choose its
35+
Use tab and choose Create copy.
36+
37+
For detailed documentation, please refer to the `Sagemaker CatBoost Algorithm <https://docs.aws.amazon.com/sagemaker/latest/dg/catboost.html>`__.

doc/algorithms/factorization_machines.rst renamed to doc/algorithms/tabular/factorization_machines.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
FactorizationMachines
1+
Factorization Machines
22
-------------------------
33

44
The Amazon SageMaker Factorization Machines algorithm.

0 commit comments

Comments
 (0)