BUG: groupby.sum with timedelta64 and NaT #44658
jbrockmendel commented Nov 28, 2021
- closes #42659: BUG: dataframe.groupby('some_column').timedelta.sum() results wrong when timedelta contains NaT (pandas=1.3.0)
- tests added / passed
- Ensure all linting tests pass, see here for how to run them
- whatsnew entry
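For context, a minimal sketch of the kind of case the linked issue describes: a grouped sum over a timedelta64 column where one group contains `NaT`. The frame, column names, and values are hypothetical and not taken from the PR or its tests. The sketch assumes the intended behavior is that `NaT` is skipped, as `Series.sum` does by default, so that the grouped result matches a per-group `Series.sum`.

```python
import pandas as pd

# Hypothetical data: group "a" holds a NaT alongside two valid timedeltas.
df = pd.DataFrame(
    {
        "key": ["a", "a", "a", "b", "b"],
        "td": pd.Series(
            [
                pd.Timedelta(days=1),
                pd.NaT,
                pd.Timedelta(days=2),
                pd.Timedelta(hours=3),
                pd.Timedelta(hours=4),
            ],
            dtype="timedelta64[ns]",
        ),
    }
)

# Grouped sum: the code path the issue reports as giving wrong results.
result = df.groupby("key")["td"].sum()

# Reference values: sum each group's Series directly (NaT skipped by default).
reference = {key: grp.sum() for key, grp in df.groupby("key")["td"]}

for key, value in reference.items():
    print(key, result[key], value, result[key] == value)
```

On a pandas version that includes the fix, each line printed should end in `True`; on an affected version such as 1.3.0, the group containing `NaT` may differ.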
if this is for backport, can you add a whatsnew for 1.3.5?
@meeseeksdev backport 1.3.x
cc @jbrockmendel as suspected, needs a manual backport.
having a hard time with the cherrypick. flake8 complains about the cython syntax at the
cc @simonjayhawkins any idea how to handle the cherrypick problem?
didn't have any. PS. Thanks for the PR.
At first glance this appears to cause a slowdown of up to 2x for several benchmarks, but including the cython 0.29.21 results it appears the reported regressions on the 1.3.x tracker could be false positives?
Looking at the diff, the only non-invasive optimization I see is doing
yah those look pretty darn noisy. is "wait a couple days for more data" a viable option?
this is not worth holding up 1.3.x