BUG: Pandas 1.4.0 - pd.NaT can not be replaced. #45836

xmatthias · 2022-02-05T15:06:50Z

Pandas version checks

I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.

Reproducible Example

import pandas as pd

pd.DataFrame([pd.NaT, pd.NaT]).replace({pd.NaT: None, pd.np.NaN: None})

# Either pd.NaT or pd.np.NaN work in 1.3.5

Issue Description

In Pandas 1.3.5, the above example returns

  pd.DataFrame([pd.NaT, pd.NaT]).replace({pd.NaT: None, pd.np.NaN: None})
Out[1]: 
      0
0  None
1  None

in pandas 1.4.0, this no longer works

  pd.DataFrame([pd.NaT, pd.NaT]).replace({pd.NaT: None, pd.np.NaN: None})
Out[5]: 
    0
0 NaT
1 NaT

Expected Behavior

.replace() should correctly replace pd.NaT to whatever is specified.

even if we assume pd.NaT is no longer np.NaN - this should still work as pd.NaT is explicitly given.

Installed Versions

Version 1.4.0:

INSTALLED VERSIONS

commit : bb1f651
python : 3.9.7.final.0
python-bits : 64
OS : Linux
OS-release : 5.16.5-arch1-1
Version : #1 SMP PREEMPT Tue, 01 Feb 2022 21:42:50 +0000
machine : x86_64
processor :
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8
pandas : 1.4.0
numpy : 1.22.1
pytz : 2021.3
dateutil : 2.8.2
pip : 22.0.3
setuptools : 57.4.0
Cython : None
pytest : 6.2.5
hypothesis : None
sphinx : None
blosc : 1.10.6
feather : None
xlsxwriter : None
lxml.etree : None
html5lib : None
pymysql : 1.0.2
psycopg2 : 2.9.3
jinja2 : 3.0.3
IPython : 7.29.0
pandas_datareader: None
bs4 : None
bottleneck : None
fastparquet : None
fsspec : None
gcsfs : None
matplotlib : None
numba : None
numexpr : 2.7.3
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : None
pyreadstat : None
pyxlsb : None
s3fs : None
scipy : 1.7.3
sqlalchemy : 1.4.31
tables : 3.7.0
tabulate : 0.8.9
xarray : None
xlrd : None
xlwt : None
zstandard : None

Version 1.3.5 (where it's still working):

INSTALLED VERSIONS

commit : 66e3805
python : 3.9.7.final.0
python-bits : 64
OS : Linux
OS-release : 5.16.5-arch1-1
Version : #1 SMP PREEMPT Tue, 01 Feb 2022 21:42:50 +0000
machine : x86_64
processor :
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8

pandas : 1.3.5
numpy : 1.22.1
pytz : 2021.3
dateutil : 2.8.2
pip : 22.0.3
setuptools : 57.4.0
Cython : None
pytest : 6.2.5
hypothesis : None
sphinx : None
blosc : 1.10.6
feather : None
xlsxwriter : None
lxml.etree : None
html5lib : None
pymysql : 1.0.2
psycopg2 : 2.9.3 (dt dec pq3 ext lo64)
jinja2 : 3.0.3
IPython : 7.29.0
pandas_datareader: None
bs4 : None
bottleneck : None
fsspec : None
fastparquet : None
gcsfs : None
matplotlib : None
numexpr : 2.7.3
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : None
pyxlsb : None
s3fs : None
scipy : 1.7.3
sqlalchemy : 1.4.31
tables : 3.7.0
tabulate : 0.8.9
xarray : None
xlrd : None
xlwt : None
numba : None

The text was updated successfully, but these errors were encountered:

phofl · 2022-02-06T14:48:38Z

Looks like a duplicate/closely aligned with #45601

jorisvandenbossche · 2022-02-07T16:51:21Z

It's indeed the same issue as #45601 (and I suppose also caused by the same code change, but didn't verify that).

But this is certainly a regression, so let's maybe keep this open to track for 1.4.1 (the other issue is about pd.NA, which is strictly speaking still experimental, and we don't necessarily have to fix it for 1.4, although it might be easier/better to fix both)

jorisvandenbossche · 2022-02-07T16:54:42Z

cc @jbrockmendel (probably caused by #44940, based on the comments in #45601)

jbrockmendel · 2022-02-07T19:36:48Z

Yah, one option is to say "manually convert to object dtype". Another is we could check for if both to_replace and value are NA and in that case infer that the user doesn't want them treated as equivalent. i think we do something similar in replace_list

simonjayhawkins · 2022-02-09T13:10:53Z

cc @jbrockmendel (probably caused by #44940, based on the comments in #45601)

can confirm first bad commit: [9cd1c6f] BUG: nullable dtypes not preserved in Series.replace (#44940)

simonjayhawkins · 2022-02-10T19:14:52Z

It's indeed the same issue as #45601 (and I suppose also caused by the same code change, but didn't verify that).

ran git bisect on #45601, can confirm same PR #44940

xmatthias added Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Feb 5, 2022

xmatthias changed the title ~~BUG: Pandas 1.4.0 pd.NaT can not be replaced.~~ BUG: Pandas 1.4.0 - pd.NaT can not be replaced. Feb 5, 2022

xmatthias mentioned this issue Feb 5, 2022

freqUI: fix can't import backtest with missing datetime data freqtrade/freqtrade#6340

Merged

jorisvandenbossche added Regression Functionality that used to work in a prior pandas version replace replace method Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate and removed Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Feb 7, 2022

jorisvandenbossche added this to the 1.4.1 milestone Feb 7, 2022

simonjayhawkins added a commit to simonjayhawkins/pandas that referenced this issue Feb 9, 2022

code sample for pandas-dev#45836

ab9f0d7

jbrockmendel mentioned this issue Feb 11, 2022

BUG: Replacing pd.NA by None has no effect #45601

Closed

3 tasks

simonjayhawkins modified the milestones: 1.4.1, 1.4.2 Feb 11, 2022

This was referenced Mar 16, 2022

REGR: only convert at end for Block.replace_list #46393

Closed

BUG: RecursionError when attempting to replace np.nan values (#45725) #45749

Merged

REGR: DataFrame.replace when the replacement value was explicitly None #46404

Merged

jreback closed this as completed in #46404 Mar 19, 2022

jbrockmendel mentioned this issue Mar 21, 2022

REGR: RecursionError when attempting to replace np.nan values #46443

Closed

4 tasks

simonjayhawkins mentioned this issue Apr 12, 2022

BUG: dtype not being preserved for replace on a CategoricalDtype #46672

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Pandas 1.4.0 - pd.NaT can not be replaced. #45836

BUG: Pandas 1.4.0 - pd.NaT can not be replaced. #45836

xmatthias commented Feb 5, 2022

INSTALLED VERSIONS

INSTALLED VERSIONS

phofl commented Feb 6, 2022

jorisvandenbossche commented Feb 7, 2022

jorisvandenbossche commented Feb 7, 2022 •

edited

Loading

jbrockmendel commented Feb 7, 2022

simonjayhawkins commented Feb 9, 2022

simonjayhawkins commented Feb 10, 2022

BUG: Pandas 1.4.0 - pd.NaT can not be replaced. #45836

BUG: Pandas 1.4.0 - pd.NaT can not be replaced. #45836

Comments

xmatthias commented Feb 5, 2022

Pandas version checks

Reproducible Example

Issue Description

Expected Behavior

Installed Versions

INSTALLED VERSIONS

INSTALLED VERSIONS

phofl commented Feb 6, 2022

jorisvandenbossche commented Feb 7, 2022

jorisvandenbossche commented Feb 7, 2022 • edited Loading

jbrockmendel commented Feb 7, 2022

simonjayhawkins commented Feb 9, 2022

simonjayhawkins commented Feb 10, 2022

jorisvandenbossche commented Feb 7, 2022 •

edited

Loading