BUG: datetime parsing: error message indicating position of conflicting string is wrong for larger data #55345

jorisvandenbossche · 2023-10-01T17:28:07Z

Using the latest pandas main (and also happens on released version 2.1.1):

In [1]: pd.to_datetime(["2012-01-01"] * 49 + ["2012-01-02 09"])
...
ValueError: unconverted data remains when parsing with format "%Y-%m-%d": " 09", at position 49. You might want to try:
    - passing `format` if your strings have a consistent format;
    - passing `format='ISO8601'` if your strings are all ISO8601 but not necessarily in exactly the same format;
    - passing `format='mixed'`, and the format will be inferred for each element individually. You might want to use `dayfirst` alongside this.

In [2]: pd.to_datetime(["2012-01-01"] * 50 + ["2012-01-02 09"])
...
ValueError: unconverted data remains when parsing with format "%Y-%m-%d": " 09", at position 1. You might want to try:
...

In the first case, it correctly says "position 49", while in the second case (n > 50), it confusingly says "position 1".

The text was updated successfully, but these errors were encountered:

KartikeyBartwal · 2023-10-01T19:09:18Z

starting to brawl with this issue

KartikeyBartwal · 2023-10-01T19:13:02Z

no issues on my machine:

paulreece · 2023-10-01T19:48:20Z

I can confirm this occurs on the main development branch:

>>> pd.to_datetime(["2012-01-01"] * 50 + ["2012-01-02 09"])
Traceback (most recent call last):
...
ValueError: unconverted data remains when parsing with format "%Y-%m-%d": " 09", at position 1. You might want to try:
...

>>> pd.to_datetime(["2012-01-01"] * 49 + ["2012-01-02 09"])
Traceback (most recent call last):
...
ValueError: unconverted data remains when parsing with format "%Y-%m-%d": " 09", at position 49. You might want to try:
    ...
``

KartikeyBartwal · 2023-10-02T03:51:18Z

Might be clashing with some other package. Could you share your requirements.txt content files?

jorisvandenbossche · 2023-10-02T06:22:02Z

@KartikeyBartwal my guess is that you are using an older version of pandas (starting with pandas 2.0, the datetime parsing got stricter, and we now parse all values using the same format by default, see https://pandas.pydata.org/pdeps/0004-consistent-to-datetime-parsing.html)

rsm-23 · 2023-10-05T10:05:55Z

take

Kartikey-Bartwal · 2023-10-05T10:32:39Z

@KartikeyBartwal my guess is that you are using an older version of pandas (starting with pandas 2.0, the datetime parsing got stricter, and we now parse all values using the same format by default, see https://pandas.pydata.org/pdeps/0004-consistent-to-datetime-parsing.html)

You got it right! My version was '1.3.4'

jorisvandenbossche added Bug Datetime Datetime data dtype Error Reporting Incorrect or improved errors from pandas labels Oct 1, 2023

github-actions bot assigned rsm-23 Oct 5, 2023

rsm-23 mentioned this issue Oct 5, 2023

BUG: fix for datetime parse error for more than 50 rows #55411

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: datetime parsing: error message indicating position of conflicting string is wrong for larger data #55345

BUG: datetime parsing: error message indicating position of conflicting string is wrong for larger data #55345

jorisvandenbossche commented Oct 1, 2023

KartikeyBartwal commented Oct 1, 2023

KartikeyBartwal commented Oct 1, 2023

paulreece commented Oct 1, 2023

KartikeyBartwal commented Oct 2, 2023

jorisvandenbossche commented Oct 2, 2023

rsm-23 commented Oct 5, 2023

Kartikey-Bartwal commented Oct 5, 2023

BUG: datetime parsing: error message indicating position of conflicting string is wrong for larger data #55345

BUG: datetime parsing: error message indicating position of conflicting string is wrong for larger data #55345

Comments

jorisvandenbossche commented Oct 1, 2023

KartikeyBartwal commented Oct 1, 2023

KartikeyBartwal commented Oct 1, 2023

paulreece commented Oct 1, 2023

KartikeyBartwal commented Oct 2, 2023

jorisvandenbossche commented Oct 2, 2023

rsm-23 commented Oct 5, 2023

Kartikey-Bartwal commented Oct 5, 2023