BUG: DataFrame.pivot(index=None) raises KeyError #52436

douglas-raillard-arm · 2023-04-05T08:49:34Z

Pandas version checks

I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.

Reproducible Example

import pandas as pd

df = pd.DataFrame(dict(a=[0,1,1,0], b=[42,43,44,45]), index=[10,20,30,40])
pivoted = df.pivot(index=None, columns='a', values='b')
print(pivoted)

Issue Description

DataFrame.pivot() documentation states:

index: str or object or a list of str, optional

    Column to use to make new frame’s index. If None, uses existing index.

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html

In pandas 1.5, index=None provides the expected result. In pandas 2.0.0, it raises a KeyError exception.

Expected Behavior

index=None should use the existing index as per documentation, or documentation should be modified and 2.0.0 breaking changes list updated to add that new behavior.

Installed Versions

INSTALLED VERSIONS
------------------
commit : 478d340
python : 3.8.10.final.0
python-bits : 64
OS : Linux
OS-release : 5.15.0-60-generic
Version : #66~20.04.1-Ubuntu SMP Wed Jan 25 09:41:30 UTC 2023
machine : x86_64
processor : x86_64
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8

pandas : 2.0.0
numpy : 1.24.2
pytz : 2023.3
dateutil : 2.8.2
setuptools : 67.6.1
pip : 23.0.1
Cython : None
pytest : 7.2.2
hypothesis : None
sphinx : 6.1.3
blosc : None
feather : None
xlsxwriter : None
lxml.etree : 4.9.2
html5lib : None
pymysql : None
psycopg2 : None
jinja2 : 3.1.2
IPython : 8.12.0
pandas_datareader: None
bs4 : 4.12.0
bottleneck : None
brotli : None
fastparquet : None
fsspec : None
gcsfs : None
matplotlib : 3.7.1
numba : None
numexpr : None
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : 11.0.0
pyreadstat : None
pyxlsb : None
s3fs : None
scipy : 1.10.1
snappy : None
sqlalchemy : None
tables : None
tabulate : None
xarray : None
xlrd : None
zstandard : None
tzdata : 2023.3
qtpy : None
pyqt5 : None

DataFrame.pivot(index=None) seems to have changed behavior: pandas-dev/pandas#52436 Remove index=None so that it works across pandas versions.

MarcoGorelli · 2023-04-05T10:59:02Z

Thanks @douglas-raillard-arm for the report

This was caused by #48293 cc @phofl

https://www.kaggle.com/code/marcogorelli/pandas-regression-example?scriptVersionId=124667072

MarcoGorelli · 2023-04-05T11:01:14Z

Looks like

df.pivot(columns='a', values='b')

gives you your expected output, so maybe the docs just need to be updated to say "Column to use to make new frame’s index. If not given, uses existing index."
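For readers following along, here is a minimal sketch of that workaround, reusing the frame from the report. It assumes the only change needed is dropping the index argument, and that all arguments are passed by keyword so the same call works on both pandas 1.5 and 2.x.

```python
import pandas as pd

# Same frame as in the reproducible example above.
df = pd.DataFrame(
    dict(a=[0, 1, 1, 0], b=[42, 43, 44, 45]),
    index=[10, 20, 30, 40],
)

# Omit `index` entirely instead of passing index=None, so the
# existing index (10, 20, 30, 40) is reused for the pivoted frame.
pivoted = df.pivot(columns="a", values="b")
print(pivoted)
```

Cells with no matching row come back as NaN, so the values are floats even though column b holds integers.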

phofl · 2023-04-05T11:01:15Z

I'd say that this behaves as intended and needs a doc update.

srkds · 2023-04-05T15:12:36Z

take

srkds · 2023-04-05T15:49:20Z

Hi,
I understand the problem and have reproduced the issue.
As described by Marco and Patrick, we just need a doc update.

Just to confirm: is this the line in the docs that needs to change? Can anyone help me here? I would like to contribute.

pandas/pandas/core/frame.py

Line 8677 in 3ce07cb

Column to use to make new frame's index. If None, uses

Thanks!

MarcoGorelli · 2023-04-05T15:56:28Z

yup

srkds · 2023-04-05T16:00:00Z

yup

Thanks, Marco for confirming 👍

douglas-raillard-arm · 2023-04-05T16:58:43Z

Please also make sure to include it in the breaking change list on pandas 2.0.0:
https://pandas.pydata.org/docs/dev/whatsnew/v2.0.0.html

srkds · 2023-04-06T03:31:15Z

Please also make sure to include it in the breaking change list on pandas 2.0.0: https://pandas.pydata.org/docs/dev/whatsnew/v2.0.0.html

I think it's already written here
https://github.com/pandas-dev/pandas/blob/main/doc/source/whatsnew/v2.0.0.rst#reshaping

Bug Fixes / reshaping
👉 Bug in :meth:DataFrame.pivot not respecting None as column name (:issue:48293)

Or should I put
"Disallow passing index=None argument to DataFrame.pivot() if not given, uses existing index."
under the removal-of-prior-version-deprecations-changes section?

douglas-raillard-arm · 2023-04-06T07:27:25Z

I see how it is considered a bug fix, but for the vast majority of people, who use strings as column names, it will be a breaking change: the working default value has changed from None to no_default.
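To illustrate why that change of default matters, here is a toy sketch of the sentinel pattern in plain Python. It is not the pandas implementation: `_NO_DEFAULT` and `choose_index` are hypothetical names, and the column store is just a dict. The point is only that once a dedicated sentinel marks "argument omitted", an explicit None is treated as a column label like any other.

```python
# Hypothetical sentinel, standing in for the "no_default" mentioned above.
_NO_DEFAULT = object()


def choose_index(columns, existing_index, index=_NO_DEFAULT):
    """Toy model: return the values a pivot-like function would index by."""
    if index is _NO_DEFAULT:
        # Argument omitted: fall back to the existing index.
        return existing_index
    # Any explicit value, including None, is looked up as a column label.
    if index not in columns:
        raise KeyError(index)
    return columns[index]


columns = {"a": [0, 1, 1, 0], "b": [42, 43, 44, 45]}
existing = [10, 20, 30, 40]

# Omitting the argument reuses the existing index.
print(choose_index(columns, existing))

# Passing None explicitly is a lookup for a column named None.
try:
    choose_index(columns, existing, index=None)
except KeyError as exc:
    print("KeyError:", exc)
```

Under this model, a frame that really has a column labelled None can be pivoted on it, which is the case the bug fix targeted, while callers who passed None to mean "use the existing index" now get a KeyError.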

douglas-raillard-arm added Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Apr 5, 2023

douglas-raillard-arm changed the title ~~BUG:~~ BUG: DataFrame.pivot(index=None) raises KeyError Apr 5, 2023

douglas-raillard-arm mentioned this issue Apr 5, 2023

lisa.tests.starting.utilclamp: Fix for pandas 2.0.0 ARM-software/lisa#1966

Merged

MarcoGorelli added Regression Functionality that used to work in a prior pandas version Reshaping Concat, Merge/Join, Stack/Unstack, Explode and removed Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Apr 5, 2023

MarcoGorelli added this to the 2.0.1 milestone Apr 5, 2023

MarcoGorelli added Docs good first issue labels Apr 5, 2023

github-actions bot assigned srkds Apr 5, 2023

srkds mentioned this issue Apr 5, 2023

DOC: update pivot docs for version pandas 2.x #52453

Merged

5 tasks

mroeschke closed this as completed in #52453 Apr 6, 2023
