-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
Major Performance regression of df.groupby(..).indices #38495
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@bordingj thanks a lot for the report. |
I think this is caused by #36911 (which changed the implementation of Grouping.indices, and cc @mroeschke @phofl |
|
@jorisvandenbossche if this is fixed on master (and 1.2.0rc0) and we are not currently planning another release in the 1.1.x series, can this issue be closed? |
I think we are using the function causing the performance regression in other places. Don't know how the impact is there, but not sure if we can close this. |
Have to correct myself here. Looks like we are only using the method in pandas/pandas/tests/groupby/test_grouping.py Line 855 in 03e1c89
which seems to be more or less a dummy test. Ran related tests locally. Not quite sure what to do with this method? Is the code snipped from the test intended to be public available? |
I think this can be closed now that #38649 is merged? |
yea i think so |
I'm experiencing major performance regressions with pandas=1.1.5 versus 1.1.3
Version 1.1.3:
Version 1.1.5:
The text was updated successfully, but these errors were encountered: