CSV Problem with Scientific Notation #141

hantianz · 2018-04-11T18:36:22Z

When I tried to train an XGBoost model, I used a CSV file with no header line, written from an all-numeric pandas DataFrame, but got the error: Alphabet found in the header line.
After a while I realized the cause was scientific notation in the CSV file, e.g., 1e-10.
It would be better if scientific notation were converted to a number automatically. The problem is also hard to debug: I am sure my CSV file contains only numbers, but the error essentially says it contains non-numbers.

Best,
Hantian
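
For anyone debugging the same error, a small scan of the CSV can point at the offending cells. This is only a sketch: the helper name `find_non_plain_cells` and the pattern `PLAIN_NUMBER` are made up for illustration, and the idea that the parser rejects any cell containing a letter is an assumption based on the error message above, not on the container's source.

```python
import csv
import io
import re

# Plain decimal number without an exponent, e.g. -3, 0.5, .25
# (assumption: this is the only numeric form the parser accepts).
PLAIN_NUMBER = re.compile(r"^[+-]?(\d+\.?\d*|\.\d+)$")


def find_non_plain_cells(csv_text, limit=10):
    """Return up to `limit` (row, column, value) tuples for cells that are
    not plain decimal numbers. Empty cells are reported as well."""
    hits = []
    for row_idx, row in enumerate(csv.reader(io.StringIO(csv_text))):
        for col_idx, cell in enumerate(row):
            if not PLAIN_NUMBER.match(cell.strip()):
                hits.append((row_idx, col_idx, cell))
                if len(hits) >= limit:
                    return hits
    return hits


# Example: the second row contains a value in scientific notation.
sample = "1,0.5\n0,1e-10\n"
offending = find_non_plain_cells(sample)
```

Row and column indices are zero-based, so `offending` here points at the second row, second column.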

iquintero · 2018-04-16T22:29:53Z

Hi @hantianz, thanks for using SageMaker and pointing this out.

We look at all enhancement and feature requests, consider them as part of our backlog, and are constantly reviewing our priorities. I agree that providing a better error message, or even just converting the value, would be great!

We will keep this issue open to track it.
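
In the meantime, one possible workaround is on the pandas side: `DataFrame.to_csv` takes a `float_format` argument, so floats can be written as fixed-point decimals instead of scientific notation. The sketch below assumes the training container accepts fixed-point values; the helper name `to_plain_csv` and the default of 12 decimal places are arbitrary choices for illustration.

```python
import io

import pandas as pd


def to_plain_csv(df, decimals=12):
    """Serialize `df` as headerless CSV text with fixed-point floats.

    `decimals` is an illustrative default: values smaller in magnitude
    than 10**-decimals are rounded to zero, so pick it to fit your data.
    """
    buf = io.StringIO()
    # float_format applies only to float columns; integer columns are
    # written unchanged.
    df.to_csv(buf, header=False, index=False, float_format=f"%.{decimals}f")
    return buf.getvalue()


# Example frame: an integer label column and a float feature column.
df = pd.DataFrame({"label": [1, 0], "feature": [1e-10, 2.5]})
csv_text = to_plain_csv(df)
```

The trade-off is precision and file size: a fixed number of decimals pads every float and silently truncates very small values, so this fits data with a known, bounded range better than arbitrary floats.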

trungleduc · 2023-12-20T20:31:49Z

Closing as it's not related to the SDK, please feel free to re-open if needed.

iquintero added the type: feature request label Apr 16, 2018

apacker pushed a commit to apacker/sagemaker-python-sdk that referenced this issue Nov 15, 2018

Merge pull request aws#141 from rabowskyb/video-game-sales

988f6b4

new notebook -- predicting product success when review data is available

trungleduc added the XGBoost label Sep 26, 2023

martinRenou assigned trungleduc Dec 14, 2023

trungleduc closed this as completed Dec 20, 2023
