-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
DOC, BUG: Clarify and Standardize Whitespace Delimiter Behaviour with Custom Line Terminator #12912
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
this should just raise an error I think. It doesn't make sense, these are 2 conflicting options. |
>>> from pandas import read_csv
>>> from pandas.compat import StringIO
>>> data = """a b c~1 2 3~4 5 6~7 8 9"""
>>> df = read_csv(StringIO(data), lineterminator='~', delimiter=' ')
>>> df
a b c
0 1 2 3
1 4 5 6
2 7 8 9 |
ok, I guess accepting |
also docs could be updated as well. |
Yep, that's what I was thinking of doing as well, as there aren't any for |
there is a tiny comment in |
Ah, okay. I'll add use that documentation in |
@gfyoung regarding your example of |
@jorisvandenbossche : Whitespace in |
I don't know about the c internals of the parser code, but in any case here: https://github.com/pydata/pandas/blob/master/pandas/io/parsers.py#L679 the |
To be clear, I am not saying that this is not a bug, I was just pointing out that your example of |
@jorisvandenbossche : I did understand that they were not taking the same code path, though I did not see that override there that you pointed out. Nevertheless, I think we can agree then that the documentation definitely needs some explanation as what goes on with |
@gfyoung Certainly! |
Addresses DOC issue part of pandas-devgh-12912.
Addresses BUG issue part of pandas-devgh-12912. Closes pandas-devgh-12912.
expected:
Note that this bug is only for the C engine, as the Python engine does not yet support
delim_whitespace
.The text was updated successfully, but these errors were encountered: