Use take_1d in take_nd #40405

jorisvandenbossche · 2021-03-12T18:31:39Z

xref #40300 (comment)

jbrockmendel · 2021-03-15T17:08:54Z

can remove comment in take_1d about being only for ArrayManager

Update: ill just do this in one of my cleanup branches

jbrockmendel · 2021-03-15T23:29:55Z

test failure is unrelated (and the flaky test is fixed on master). LGTM

jreback · 2021-03-16T15:37:30Z

cool, does this allow refactor of other code?

jbrockmendel · 2021-03-19T16:45:08Z

can this be moved out of Draft mode?

jbrockmendel · 2021-03-26T00:48:01Z

@jorisvandenbossche gentle ping

jbrockmendel · 2021-04-07T17:35:59Z

@jorisvandenbossche gentle ping to see if we can move this out of draft mode

jbrockmendel · 2021-04-21T15:46:38Z

@jorisvandenbossche gentle ping

jorisvandenbossche · 2021-04-22T12:17:02Z

Updated the PR.
But personally not sure if it is worth it, though:

In [1]: from pandas.core.array_algos.take import take_1d, take_nd

In [2]: dtype = 'int'
   ...: m = 100
   ...: n = 1000

In [3]: values = np.arange(m * n)

In [4]: index = np.arange(m)

# this is not affected, just including as reference
In [5]: %timeit take_1d(values, index)
5.67 µs ± 27 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  <-- master
5.59 µs ± 50.6 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  <-- PR

In [6]: %timeit take_nd(values, index)
10.5 µs ± 104 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  <-- master
8.86 µs ± 68.3 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  <-- PR

So it does give a small improvement, but when increasing the size of the array, the improvement almost completely disappears.

jbrockmendel · 2021-04-22T15:55:35Z

3 lines for a 15% improvement for a fairly common case seems like a good deal to me

jbrockmendel · 2021-04-22T15:55:55Z

pandas/core/array_algos/take.py

@@ -89,6 +89,10 @@ def take_nd(
    if fill_value is lib.no_default:
        fill_value = na_value_for_dtype(arr.dtype, compat=False)

+    if arr.ndim == 1 and axis == 0 and indexer is not None:
+        indexer = ensure_platform_int(indexer)


comment to the effect of "fastpath"

jreback · 2021-04-22T19:24:55Z

agree this can't hurt (and you know me about not liking special cases)

github-actions · 2021-05-24T00:04:07Z

This pull request is stale because it has been open for thirty days with no activity. Please update or respond to this comment if you're still interested in working on this.

simonjayhawkins · 2021-06-08T18:20:59Z

Thanks @jorisvandenbossche for the PR. closing as stale. re-open when ready.

Use take_1d in take_nd

ae1f919

jorisvandenbossche mentioned this pull request Mar 12, 2021

PERF: further improve take (reindex/unstack) for ArrayManager #40300

Merged

simonjayhawkins added the Refactor Internal refactoring of code label Mar 14, 2021

jbrockmendel approved these changes Mar 15, 2021

View reviewed changes

jorisvandenbossche added 2 commits April 22, 2021 14:02

Merge remote-tracking branch 'upstream/master' into take_nd-dispatch-1d

3609eb9

fixup

c5f079d

jbrockmendel reviewed Apr 22, 2021

View reviewed changes

github-actions bot added the Stale label May 24, 2021

simonjayhawkins closed this Jun 8, 2021

jorisvandenbossche deleted the take_nd-dispatch-1d branch November 11, 2021 14:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use take_1d in take_nd #40405

Use take_1d in take_nd #40405

jorisvandenbossche commented Mar 12, 2021

jbrockmendel commented Mar 15, 2021 •

edited

Loading

jbrockmendel commented Mar 15, 2021

jreback commented Mar 16, 2021

jbrockmendel commented Mar 19, 2021

jbrockmendel commented Mar 26, 2021

jbrockmendel commented Apr 7, 2021

jbrockmendel commented Apr 21, 2021

jorisvandenbossche commented Apr 22, 2021

jbrockmendel commented Apr 22, 2021

jbrockmendel Apr 22, 2021

jreback commented Apr 22, 2021

github-actions bot commented May 24, 2021

simonjayhawkins commented Jun 8, 2021

Use take_1d in take_nd #40405

Use take_1d in take_nd #40405

Conversation

jorisvandenbossche commented Mar 12, 2021

jbrockmendel commented Mar 15, 2021 • edited Loading

jbrockmendel commented Mar 15, 2021

jreback commented Mar 16, 2021

jbrockmendel commented Mar 19, 2021

jbrockmendel commented Mar 26, 2021

jbrockmendel commented Apr 7, 2021

jbrockmendel commented Apr 21, 2021

jorisvandenbossche commented Apr 22, 2021

jbrockmendel commented Apr 22, 2021

jbrockmendel Apr 22, 2021

Choose a reason for hiding this comment

jreback commented Apr 22, 2021

github-actions bot commented May 24, 2021

simonjayhawkins commented Jun 8, 2021

jbrockmendel commented Mar 15, 2021 •

edited

Loading