-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
BUG/API: Indexes on empty frames/series should be RangeIndex, are Index[object] #49572
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Sir I looked in the series/frame.py files to find the places to be changed but I was unsuccesful so can you please guide |
Hi @ramvikrams. Thanks for taking a look at it:-) I didn’t know where to change the code when I wrote the issue, it’s just going head first in and grinding till I figured it out :-). You can see the results of my own work in the linked PR #49637. However it turns out that adjusting the test suite is quite a lot of work, so if you want to help out, I’d absolutely appreciate it. For example, you could take on some subset of the test suite and make it pass. I’ve already done tests/frame, tests/series and tests/indexing, but there are several others with failing tests in #49637 that you could take on. I think you should be able to just push your changes to my branch and I’ll review and merge it when ok. Maybe you will want to see if my question in #49637 are answered positively before starting just in case the other devs think differently about this… |
thanks @topper-123 sir. I will surely take a look at the linked pr and try to add try to add the tests . Thank you again sir for your help. |
Sir it will take some time for writing tests have some college work this weekend. |
Alright no problem. If you got the time you're welcome to join back in (or do other separate PRs if that is more to your liking). |
i'll join back sir |
+1, cc @jbrockmendel @phofl |
Pandas version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.
Reproducible Example
Issue Description
When users don't specify indexes, pandas infers the index type for them. It infers to be a
RangeIndex
, except when the frame/series is emtpy, then the index is inferred to be aIndex[object]
.The type of index should be consistently RangeIndex, when users don't declare the index. Same for columns.
Expected Behavior
Empty frames/series should have index/columns like
RangeIndex(0)
instead an empty object index.Found working on #49560.
Installed Versions
NSTALLED VERSIONS
commit : 527be72
python : 3.9.7.final.0
python-bits : 64
OS : Darwin
OS-release : 21.6.0
Version : Darwin Kernel Version 21.6.0: Wed Aug 10 14:28:35 PDT 2022; root:xnu-8020.141.5~2/RELEASE_ARM64_T8101
machine : arm64
processor : arm
byteorder : little
LC_ALL : None
LANG : None
LOCALE : None.UTF-8
pandas : 2.0.0.dev0+593.g527be722a2
numpy : 1.23.3
pytz : 2022.1
dateutil : 2.8.2
setuptools : 65.5.0
pip : 22.2.2
Cython : 0.29.32
pytest : 7.1.2
hypothesis : 6.37.0
sphinx : 5.0.2
blosc : 1.10.6
feather : None
xlsxwriter : None
lxml.etree : 4.9.1
html5lib : None
pymysql : 1.0.2
psycopg2 : 2.9.3
jinja2 : 3.1.2
IPython : 7.29.0
pandas_datareader: None
bs4 : 4.11.1
bottleneck : None
brotli :
fastparquet : 0.8.3
fsspec : 2022.10.0
gcsfs : None
matplotlib : 3.5.2
numba : 0.56.4
numexpr : 2.8.3
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : 10.0.0
pyreadstat : None
pyxlsb : None
s3fs : 2022.10.0
scipy : 1.9.1
snappy : None
sqlalchemy : None
tables : 3.7.0
tabulate : 0.9.0
xarray : None
xlrd : None
zstandard : 0.18.0
tzdata : None
The text was updated successfully, but these errors were encountered: