PERF: use fastpath=True in Index methods (delete/drop/insert/etc)? #6933

immerrr · 2014-04-22T20:02:27Z

Been hit by this when optimizing index-oblivious Blocks. In my case,

is_deleted = np.zeros(len(index), dtype=np.bool_)
is_deleted[deleted_loc] = True
index = index[~is_deleted]

was still a lot faster than index.delete(deleted_loc).

It appears that delete doesn't pass fastpath=True to the constructor, which triggers type inference for string (object) indices. Plenty of other methods seem to do it the same way and are slow for the same reason. Can we do something about that?
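For reference, the two approaches compared above can be put side by side as in the sketch below. The helper name delete_via_mask and the sample data are made up for illustration; the sketch only checks that both paths return the same labels and makes no claim about timings, which depend on the pandas version and the size of the index.

```python
import numpy as np
import pandas as pd


def delete_via_mask(index, deleted_loc):
    """Drop positions from an Index with a boolean mask (the workaround above)."""
    is_deleted = np.zeros(len(index), dtype=np.bool_)
    is_deleted[deleted_loc] = True
    # Boolean indexing returns a new Index holding the kept labels.
    return index[~is_deleted]


# Illustrative object-dtype index of strings.
index = pd.Index(["a", "b", "c", "d", "e"], dtype=object)
deleted_loc = [1, 3]

masked = delete_via_mask(index, deleted_loc)
deleted = index.delete(deleted_loc)

# Both approaches should keep the same labels in the same order.
assert masked.equals(deleted)
```

To compare speed, both calls can be wrapped in timeit on a large object-dtype index.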

jreback · 2014-04-22T20:19:22Z

sure
want to do a quick pr ?

vbenches a plus but not that necessary
maybe spot check ok

jtratner · 2014-04-24T07:57:24Z

so what's the downside here?

immerrr · 2014-04-24T08:35:27Z

@jtratner it's the same as one of my earlier PRs about __getitem__: we drop the type inference that might have changed the index type after deletion of non-matching elements. That is, if we go with the "trivial" solution.

If the trivial solution is not required, dtype inference can be preserved without traversing each element of the dtype=O array. If one kept an int8 array of object class codes (or int16 if 255 classes is not enough) alongside the main array, then instead of enumerating all entries each time it would suffice to do a single np.bincount(x.ravel()).nonzero() to find out which classes are present. The memory overhead of that int8 array should be acceptable, but it may take some non-trivial effort to keep the two arrays in sync.
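A minimal sketch of that idea, using NumPy only, might look like the following. The class table, the classify helper, and the function names are all hypothetical and not part of pandas; the point is just that once the codes exist, finding the classes present after a deletion is a single np.bincount over small integers rather than a pass over the Python objects.

```python
import numpy as np

# Hypothetical class table: position in the tuple is the int8 code.
CLASS_NAMES = ("str", "int", "float", "other")


def classify(value):
    """Map a Python object to a small integer class code (illustrative only)."""
    if isinstance(value, str):
        return 0
    if isinstance(value, int) and not isinstance(value, bool):
        return 1
    if isinstance(value, float):
        return 2
    return 3


def build_codes(values):
    """One-time pass over the object array to build the companion int8 array."""
    return np.fromiter((classify(v) for v in values), dtype=np.int8, count=len(values))


def classes_present(codes):
    """Find which classes occur, using a single bincount over the codes."""
    counts = np.bincount(codes.ravel(), minlength=len(CLASS_NAMES))
    return [CLASS_NAMES[i] for i in counts.nonzero()[0]]


values = np.array(["a", 1, "b", 2.5], dtype=object)
codes = build_codes(values)
before = classes_present(codes)

# Delete positions 1 and 3; the same mask must be applied to both arrays
# so that values and codes stay in sync.
keep = np.ones(len(values), dtype=np.bool_)
keep[[1, 3]] = False
values, codes = values[keep], codes[keep]
after = classes_present(codes)
```

Here only strings remain after the deletion, so a caller could pick a narrower index type without touching the objects themselves. The cost is that every operation that changes values has to update codes the same way.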

jreback added the Performance label Apr 22, 2014

immerrr mentioned this issue May 5, 2014

PERF: optimize Index.delete for dtype=object #7040

Merged

jreback added this to the 0.14.0 milestone May 5, 2014

jreback added the Indexing label May 5, 2014

jreback closed this as completed in #7040 May 5, 2014
