Add hyperparameter tuning support #207


Merged
merged 13 commits into aws:master from hyperparameter-tuning-support on Jun 5, 2018

Conversation

laurenyu
Contributor

Description of changes:
Add support for hyperparameter tuning jobs.

This introduces a few key features:

  • a new class, HyperparameterTuner, which behaves like an estimator with fit(), deploy(), and attach(), except that it creates hyperparameter tuning jobs instead of regular training jobs (see the usage sketch below)
  • a new method for estimators, _prepare_for_training(), which should set all values needed before training
  • new analytics classes for training and hyperparameter tuning jobs

This PR also bumps the SDK version to 1.4.0.
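
For readers trying the branch, here is a rough usage sketch of the new tuner and analytics flow. The image URI, role ARN, S3 paths, metric name, and hyperparameter ranges below are placeholders, and exact argument names may differ slightly from this PR's final API:

from sagemaker.estimator import Estimator
from sagemaker.tuner import HyperparameterTuner, ContinuousParameter, IntegerParameter

# A regular estimator; image, role, and instance settings are placeholders.
estimator = Estimator(
    image_name='123456789012.dkr.ecr.us-west-2.amazonaws.com/my-image:latest',
    role='arn:aws:iam::123456789012:role/SageMakerRole',
    train_instance_count=1,
    train_instance_type='ml.m4.xlarge')

# Wrap it in a tuner: the ranges to search, the objective metric, and job limits.
tuner = HyperparameterTuner(
    estimator=estimator,
    objective_metric_name='validation:accuracy',
    hyperparameter_ranges={
        'learning_rate': ContinuousParameter(0.001, 0.1),
        'mini_batch_size': IntegerParameter(32, 256)},
    metric_definitions=[{'Name': 'validation:accuracy',
                         'Regex': 'accuracy=([0-9\\.]+)'}],
    max_jobs=20,
    max_parallel_jobs=2)

# fit() starts a hyperparameter tuning job instead of a single training job.
tuner.fit({'train': 's3://my-bucket/train', 'validation': 's3://my-bucket/validation'})

# deploy() creates an endpoint from the best training job found by the tuner,
# and attach() reconstructs a tuner from an existing tuning job name.
predictor = tuner.deploy(initial_instance_count=1, instance_type='ml.m4.xlarge')
tuner = HyperparameterTuner.attach('my-tuning-job-name')

# The new analytics classes expose job results, e.g. as a pandas DataFrame.
results_df = tuner.analytics().dataframe()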

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

  • I have read the CONTRIBUTING doc
  • I have added tests that prove my fix is effective or that my feature works (if appropriate)
  • I have updated the changelog with a description of my changes (if appropriate)
  • I have updated any necessary documentation (if appropriate)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@laurenyu laurenyu requested a review from owen-t May 31, 2018 18:36
owen-t
owen-t previously approved these changes May 31, 2018
Contributor

@owen-t owen-t left a comment


One small thing - but happy to have this fixed post-release.

Quoted diff context:

based on the training image name and current timestamp.
**kwargs: Other arguments
"""
if isinstance(inputs, list) or isinstance(inputs, RecordSet):
Contributor


This is much better:

kwargs = dict(kwargs)
kwargs['job_name'] = job_name
self._prepare_for_training(**kwargs)

Contributor


Basically, replace lines 140 to 145 with that block.

Contributor Author


_prepare_for_training() still needs records for 1P estimators but not for the others, though. Instead, it'd end up looking like:

        kwargs = dict(kwargs)
        kwargs['job_name'] = job_name
        if isinstance(inputs, list) or isinstance(inputs, RecordSet):
            kwargs['records'] = inputs
        self.estimator._prepare_for_training(**kwargs)
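
For context, the reason records is forwarded only for 1P (first-party algorithm) estimators is that their _prepare_for_training() takes the RecordSet inputs, while the generic estimator's version does not. A simplified sketch of that difference (the class stubs and bodies below are illustrative assumptions, not the PR's actual implementation):

class EstimatorBase:
    """Simplified stand-in for the SDK's estimator base class."""


class Estimator(EstimatorBase):
    def _prepare_for_training(self, job_name=None):
        # Generic estimators only need a job name; when one is not supplied,
        # it is generated from the training image name and current timestamp.
        self._current_job_name = job_name or 'generated-job-name-placeholder'


class AmazonAlgorithmEstimatorBase(EstimatorBase):
    def _prepare_for_training(self, records=None, mini_batch_size=None, job_name=None):
        # 1P algorithm estimators also take the RecordSet inputs up front,
        # which is why the tuner forwards `records` only when `inputs` is a
        # list or a RecordSet.
        self._current_job_name = job_name or 'generated-job-name-placeholder'
        self.mini_batch_size = mini_batch_size
        self.records = records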

@codecov-io

codecov-io commented Jun 4, 2018

Codecov Report

Merging #207 into master will increase coverage by 0.73%.
The diff coverage is 93.58%.


@@            Coverage Diff             @@
##           master     #207      +/-   ##
==========================================
+ Coverage   90.76%   91.49%   +0.73%     
==========================================
  Files          42       45       +3     
  Lines        2717     3162     +445     
==========================================
+ Hits         2466     2893     +427     
- Misses        251      269      +18
Impacted Files                              Coverage Δ
src/sagemaker/amazon/hyperparameter.py      97.22% <ø> (ø) ⬆️
src/sagemaker/amazon/kmeans.py              100% <100%> (ø) ⬆️
src/sagemaker/amazon/ntm.py                 100% <100%> (ø) ⬆️
src/sagemaker/amazon/lda.py                 100% <100%> (ø) ⬆️
src/sagemaker/__init__.py                   100% <100%> (ø) ⬆️
src/sagemaker/utils.py                      90.9% <100%> (+3.03%) ⬆️
src/sagemaker/amazon/randomcutforest.py     100% <100%> (ø) ⬆️
src/sagemaker/amazon/linear_learner.py      100% <100%> (ø) ⬆️
src/sagemaker/amazon/pca.py                 100% <100%> (ø) ⬆️
src/sagemaker/session.py                    86.7% <78.57%> (-1.14%) ⬇️
... and 10 more

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 731641c...2c16ae8. Read the comment docs.

@ChoiByungWook ChoiByungWook force-pushed the hyperparameter-tuning-support branch from a7576b4 to 2c16ae8 on June 4, 2018 23:14
@ChoiByungWook ChoiByungWook merged commit 502d6eb into aws:master Jun 5, 2018
@laurenyu laurenyu deleted the hyperparameter-tuning-support branch June 6, 2018 16:17