read_csv example tricks parser dtypes #4692

hayd · 2013-08-27T17:43:59Z

The poster has an example that tricks read_csv into inferring column a as an int, then feeds it lots of strings (after which it reads the remaining 1s as strings).

import pandas as pd
# Column a: digits, then strings, then digits again
df = pd.DataFrame({'a':['1']*100000 + ['X']*100000 + ['1']*100000, 'b':['b']*300000})
df.to_csv('test', sep='\t', index=False, na_rep='NA')
df2 = pd.read_csv('test', sep='\t')
# Shows which values (and types) ended up in column a
print(df2['a'].unique())

http://stackoverflow.com/questions/18471859/pandas-read-csv-dtype-inference-issue

I think this is rather an edge case tbh. :)
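For anyone hitting the same thing, here is a minimal sketch of one way around it: tell read_csv the dtype of column a up front, so no inference happens for that column. It uses the same data as the example above, but reads from an in-memory buffer instead of the 'test' file; that swap and the variable names (csv_text, df_str, values) are my own choices for illustration, not something from the linked question.

```python
import io

import pandas as pd

# Same shape of data as the example: digits, then strings, then digits again.
df = pd.DataFrame({'a': ['1'] * 100000 + ['X'] * 100000 + ['1'] * 100000,
                   'b': ['b'] * 300000})

# With no path given, to_csv returns the CSV text instead of writing a file.
csv_text = df.to_csv(sep='\t', index=False)

# Declaring the dtype for column a means the parser does not guess it.
df_str = pd.read_csv(io.StringIO(csv_text), sep='\t', dtype={'a': str})

# Collect the distinct values that ended up in column a.
values = sorted(set(df_str['a']))
print(values)
```

With the dtype declared, every value in column a should come back as a string, so the 1s before and after the X block are treated the same way.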

TomAugspurger · 2013-08-27T17:55:10Z

He (or someone) beat you to it. #4691

hayd · 2013-08-27T17:56:09Z

I did not do a very good search. #4691

hayd closed this as completed Aug 27, 2013
