
Support multi-part uploads #45

Merged: 3 commits merged into aws:master on Jan 22, 2018

Conversation

jbencook (Author) commented:

For large datasets, the current Session.upload_data method fails. This PR switches the call to Object.upload_file, which can perform multi-part uploads. The unit tests have also been updated.
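For reference, a minimal sketch of that approach using boto3 directly: Object.upload_file streams the file and switches to multi-part uploads above a size threshold, unlike a single put of the whole body. The bucket and key names, the directory walk, and the TransferConfig tuning below are illustrative, not the SDK's actual implementation.

```python
import os

import boto3
from boto3.s3.transfer import TransferConfig

s3 = boto3.resource("s3")

# upload_file handles multi-part uploads automatically; the threshold below
# is boto3's default (8 MB), spelled out explicitly for clarity.
transfer_config = TransferConfig(multipart_threshold=8 * 1024 * 1024)


def upload_data(path, bucket, key_prefix):
    """Upload every file under `path` to s3://<bucket>/<key_prefix>/...

    Illustrative helper only -- the argument names mirror the description
    above, not the SDK's real signature.
    """
    for dirpath, _, filenames in os.walk(path):
        for filename in filenames:
            local_file = os.path.join(dirpath, filename)
            key = "{}/{}".format(key_prefix, os.path.relpath(local_file, start=path))
            s3.Object(bucket, key).upload_file(local_file, Config=transfer_config)
```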

owen-t previously approved these changes on Jan 18, 2018
@@ -20,3 +20,4 @@ examples/tensorflow/distributed_mnist/data
 doc/_build
 **/.DS_Store
 venv/
+*~
Contributor:

Can this make it into the commit message as well?

jbencook (Author):

Yeah, I accidentally added an Emacs file on our fork. You mean you want a new commit in this PR with a message about that change?

Contributor:

It's a good addition, and I'm happy for it to be part of this PR. If we leave it in this PR, can the commit message just reference the addition? Something like:

"Upload files to S3 using multipart uploads.

Emacs temporary files covered in .gitignore"

jbencook (Author):

Yeah, I'm happy to add it. Any idea if amending my commit message and force pushing to our fork will do it?

Contributor:

Yeah, that should be fine.

jbencook (Author):

OK, that worked.

owen-t (Contributor) commented on Jan 18, 2018:

Thanks for your submission!

Ignore Emacs backup files
lukmis merged commit 05d4b0b into aws:master on Jan 22, 2018
ragavvenkatesan pushed a commit that referenced this pull request on Jan 30, 2018:
* add sagemaker cli (#32)

* add sagemaker cli

* remove unnecessary close

* address PR comments

* tidy up imports

* fix imports, flake8 errors

* improve help message for bucket-name

* remove default role name

* fix log-level and py3 tests, add copyright

* update cli example scripts

* Add documentation about BYO Models (#47)

* Add test for BYO estimator using Factorization Machines algorithm as an example. (#50)

* Support multi-part uploads (#45)

* Update TensorFlow examples following API change (#44)

* Add data_type to hyperparameters (#54)

When we describe a training job, the data type of the hyperparameters is
lost because we use a dict[str, str]. This adds a new field to
Hyperparameter so that we can convert the data types at runtime.

Instead of validating with isinstance(), we cast the hp value to the type it
is meant to be. This enforces a "strongly typed" value and also makes the
string responses we deserialize from the API easier to deal with.
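As a rough illustration of the casting approach that commit message describes, a descriptor can cast on assignment. The class and attribute names below are hypothetical, not necessarily the SDK's actual Hyperparameter implementation.

```python
class Hyperparameter(object):
    """Descriptor that casts assigned values to a declared data type.

    Hypothetical sketch: casting (instead of isinstance() validation) means
    string values deserialized from API responses are converted to the
    intended type on assignment, and bad values fail with ValueError/TypeError.
    """

    def __init__(self, name, data_type=str):
        self.name = name
        self.data_type = data_type

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self
        return obj.__dict__.get(self.name)

    def __set__(self, obj, value):
        # Cast rather than validate the incoming value.
        obj.__dict__[self.name] = self.data_type(value)


class Estimator(object):
    epochs = Hyperparameter("epochs", data_type=int)
    learning_rate = Hyperparameter("learning_rate", data_type=float)


est = Estimator()
est.epochs = "10"              # string from a describe-training-job response
assert est.epochs == 10        # cast to int on assignment
est.learning_rate = "0.1"
assert est.learning_rate == 0.1
```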
ragavvenkatesan mentioned this pull request on Jan 30, 2018
laurenyu pushed a commit to laurenyu/sagemaker-python-sdk that referenced this pull request on May 31, 2018
apacker pushed a commit to apacker/sagemaker-python-sdk that referenced this pull request on Nov 15, 2018