Skip to content

Support processing datasets from S3 directly which have already been processed in the desired format #42

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
ragavvenkatesan opened this issue Jan 12, 2018 · 2 comments

Comments

@ragavvenkatesan
Copy link

ragavvenkatesan commented Jan 12, 2018

Hi,

We are trying to support SM algorithms, where the user has multiple channels of processed data in S3 that is ready. The plan is to create a AmazonS3BaseEstimator class that will inherit from AmazonBaseEstimator class and will overwrite the record_set method. In this method, we will return a list of RecordSet objects. The _TrainingJob class already works with a dictionary of multiple channels of data. Therefore, we will overwrite the fit method in our class with data element being a dictionary of multiple channels of record objects.

User will have to supply in their fit call, a list of S3 uri's.

Thanks,
R

@ragavvenkatesan
Copy link
Author

@ragavvenkatesan ragavvenkatesan changed the title RecordSet modification required for SM 1P algorithms to support multiple channels directly from S3 Support processing datasets from S3 directly which have already been processed in the desired format Jan 12, 2018
@owen-t
Copy link
Contributor

owen-t commented Jan 16, 2018

Hi Ragav,

I'd be happy to help here - it's not clear what your question is.

Please arrange a call or meeting with me.

Thanks,

Owen.

@owen-t owen-t closed this as completed Jan 18, 2018
laurenyu added a commit to laurenyu/sagemaker-python-sdk that referenced this issue May 31, 2018
apacker pushed a commit to apacker/sagemaker-python-sdk that referenced this issue Nov 15, 2018
Synced image classification notebooks based on bash
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants