Skip to content

read_table and encoding #1975

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
jseabold opened this issue Sep 26, 2012 · 3 comments
Closed

read_table and encoding #1975

jseabold opened this issue Sep 26, 2012 · 3 comments
Labels
Bug IO Data IO issues that don't fit into a more specific label
Milestone

Comments

@jseabold
Copy link
Contributor

What in the world is going on here? Shouldn't these both give unicode?

import pandas
from StringIO import StringIO
fin = StringIO("Łaski, Jan;1")
df1 = pandas.read_table(fin, sep=";", encoding="utf-8", header=None)
fin.seek(0)
df2 = pandas.read_csv(fin, sep=";", encoding="utf-8", header=None)

type(df1["X.1"].values[0])
type(df2["X.1"].values[0])
@wesm
Copy link
Member

wesm commented Sep 26, 2012

That's not cool. I'll try to figure out what's up

changhiskhan added a commit that referenced this issue Sep 26, 2012
@changhiskhan
Copy link
Contributor

read_table was specifically setting encoding=None

@jseabold
Copy link
Contributor Author

That's what I figured. Must've read over it in the code. Thanks.

yarikoptic added a commit to neurodebian/pandas that referenced this issue Sep 27, 2012
Version 0.9.0 Release Candidate 2

* tag 'v0.9.0rc2':
  DOC: release notes, bump to RC2
  DOC: missed a few for release notes 0.9
  DOC: add a few more notes on bug fixes in release.rst
  BUG: repr fix for all-NA index level. close pandas-dev#1971
  BLD: don't link against math library on windows
  TST: kludge around test failure on win64 python 3.2.2
  BLD: link against math library explicitly. close pandas-dev#1955
  DOC: Add line about resetting to default index
  DOC: Adding details on normalization for variance functions.
  DOC: Specify default merge behavior for on = None
  BUG: PeriodIndex slicing by datetime fails when either end out-of-bounds pandas-dev#1977
  BUG: read_table unicode bug pandas-dev#1975
  BUG: BlockManager.iget fails with non-unique MultiIndex pandas-dev#1970
  Better error message for DataFrame.apply if axis is not 0 or 1
  TST: fix up tzlocal test cases
  DOC: add level option in Series.reset_index to release notes
  ENH: level parameter for Series.reset_index
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug IO Data IO issues that don't fit into a more specific label
Projects
None yet
Development

No branches or pull requests

3 participants