Skip to content

Fix Cython 3.0 regression with time_loc_dups #55915

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Nov 10, 2023

Conversation

WillAyd
Copy link
Member

@WillAyd WillAyd commented Nov 10, 2023

re #55179

From discussion in cython/cython#1807 (comment) it looks like Cython prior to 3.0 would always use the sequence protocol for indexing with an integral value. However, Python prefers the object protocol first if available, and Cython switched to match that logic with 3.0

NumPy arrays implement both the sequence and the mapping protocol. In cases where we have untyped arrays that fall back to Python calls we will see a performance regression since this will now route through the mapping space

The changes in this PR are not meant to be an exhaustive review of the codebase, rather just a quick POC to reset the time_loc_dups benchmark

@jbrockmendel
Copy link
Member

LGTM, bummer that its necessary though.

@mroeschke mroeschke added this to the 2.2 milestone Nov 10, 2023
@mroeschke mroeschke added Performance Memory or execution speed performance Internals Related to non-user accessible pandas implementation labels Nov 10, 2023
@mroeschke mroeschke merged commit d650212 into pandas-dev:main Nov 10, 2023
@mroeschke
Copy link
Member

Thanks @WillAyd

@WillAyd WillAyd deleted the fix-dup-perf branch November 11, 2023 00:07
@rhshadrach rhshadrach mentioned this pull request Nov 12, 2023
5 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Internals Related to non-user accessible pandas implementation Performance Memory or execution speed performance
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants