Skip to content

CSV Problem with Scientific Notation #141

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
hantianz opened this issue Apr 11, 2018 · 2 comments
Closed

CSV Problem with Scientific Notation #141

hantianz opened this issue Apr 11, 2018 · 2 comments

Comments

@hantianz
Copy link

hantianz commented Apr 11, 2018

When I tried to train an xgboost model, I used a csv file without header line that is output of an all number pandas data-frame but got error: Alphabet found in the header line.
Then after a while I realized that it was because there is scientific notation in the csv file, e.g., 1e-10
It would be better if it can automatically convert scientific notation to a number. Also this problem is hard to debug, because I am sure that my csv file only contains numbers but the error basically says you have non-numbers.

Best,
Hantian

@iquintero
Copy link
Contributor

HI @hantianz thanks for using SageMaker and pointing this out.

We look at all enhancement / feature requests and consider them as part of our backlog and are constantly looking at our prioirities. I agree providing a better error message or even just converting it would be great!

We will keep this issue open to track it.

apacker pushed a commit to apacker/sagemaker-python-sdk that referenced this issue Nov 15, 2018
new notebook -- predicting product success when review data is available
@trungleduc
Copy link
Collaborator

Closing as it's not related to the SDK, please feel free to re-open if needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants