PERF: tighter cython declarations, faster iter #43872

jbrockmendel · 2021-10-04T03:48:30Z

closes #xxxx
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

mzeitlin11

LGTM! Are the ndim=1 additions for readability or performance? Have always just assumed (but never checked) that cython infers it. If it helps perf we should do it everywhere - a quick regex shows lots of places we don't use ndim=1

jreback

lgtm

jreback · 2021-10-04T12:09:48Z

can you post benchmark changes in the top

jbrockmendel · 2021-10-04T17:09:08Z

Are the ndim=1 additions for readability or performance? Have always just assumed (but never checked) that cython infers it.

When I was working on this I convinced myself that it made a difference, but I can no longer remember why. Maybe @da-woods can weigh in?

can you post benchmark changes in the top

It's all going to be micro-perf that asvs don't pick up well

da-woods · 2021-10-04T17:26:46Z

Are the ndim=1 additions for readability or performance? Have always just assumed (but never checked) that cython infers it.

When I was working on this I convinced myself that it made a difference, but I can no longer remember why. Maybe @da-woods can weigh in?

https://cython.readthedocs.io/en/latest/src/tutorial/numpy.html#efficient-indexing

“ndim” keyword-only argument, if not provided then one-dimensional is assumed

I had a quick test of a simple example (looking at the annotated source) and the behaviour looks to match what the docs say. So I think there's no performance advantage to specifying ndim=1 but you may prefer to be explicit for readability reasons.

jbrockmendel · 2021-10-04T18:09:47Z

Thanks @da-woods.

If the ndim=1 is extraneous, I'm OK with either the more concise or more explicit versions.

PERF: tighter cython declarations, faster __iter__

b37ccab

mzeitlin11 approved these changes Oct 4, 2021

View reviewed changes

mzeitlin11 added Internals Related to non-user accessible pandas implementation Performance Memory or execution speed performance labels Oct 4, 2021

mzeitlin11 added this to the 1.4 milestone Oct 4, 2021

jreback approved these changes Oct 4, 2021

View reviewed changes

jreback merged commit 6599834 into pandas-dev:master Oct 5, 2021

jbrockmendel deleted the perf-cy branch October 5, 2021 01:39

gasparitiago pushed a commit to gasparitiago/pandas that referenced this pull request Oct 9, 2021

PERF: tighter cython declarations, faster __iter__ (pandas-dev#43872)

4cd4b79

rhshadrach pushed a commit to rhshadrach/pandas that referenced this pull request Oct 10, 2021

PERF: tighter cython declarations, faster __iter__ (pandas-dev#43872)

6021c06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

PERF: tighter cython declarations, faster iter #43872

PERF: tighter cython declarations, faster iter #43872

Uh oh!

jbrockmendel commented Oct 4, 2021

Uh oh!

mzeitlin11 left a comment

Uh oh!

jreback left a comment

Uh oh!

jreback commented Oct 4, 2021

Uh oh!

jbrockmendel commented Oct 4, 2021

Uh oh!

da-woods commented Oct 4, 2021

Uh oh!

jbrockmendel commented Oct 4, 2021

Uh oh!

Uh oh!

Uh oh!

PERF: tighter cython declarations, faster __iter__ #43872

PERF: tighter cython declarations, faster __iter__ #43872

Uh oh!

Conversation

jbrockmendel commented Oct 4, 2021

Uh oh!

mzeitlin11 left a comment

Choose a reason for hiding this comment

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

jreback commented Oct 4, 2021

Uh oh!

jbrockmendel commented Oct 4, 2021

Uh oh!

da-woods commented Oct 4, 2021

Uh oh!

jbrockmendel commented Oct 4, 2021

Uh oh!

Uh oh!

PERF: tighter cython declarations, faster iter #43872

PERF: tighter cython declarations, faster iter #43872