You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
closes#13904
Creates an efficient MultiIndexHashTable in cython.
This allows us to efficiently store a multi-index for fast indexing
(.get_loc() and .get_indexer()), with the current tuple-based
(and gil holding) use of the PyObject Hash Table.
This uses the pandas.tools.hashing routines to hash each of the 'values' of a MI
to a single uint64. So this makes MI more memory friendly and much
more efficient. You get these speedups, because the creation of the
hashtable is now much more efficient.
Author: Jeff Reback <[email protected]>
Closes#15245 from jreback/mi and squashes the following commits:
7df6c34 [Jeff Reback] PERF: high memory in MI
Copy file name to clipboardExpand all lines: doc/source/whatsnew/v0.20.0.txt
+3-1
Original file line number
Diff line number
Diff line change
@@ -472,7 +472,7 @@ Performance Improvements
472
472
- Improved performance of timeseries plotting with an irregular DatetimeIndex
473
473
(or with ``compat_x=True``) (:issue:`15073`).
474
474
- Improved performance of ``groupby().cummin()`` and ``groupby().cummax()`` (:issue:`15048`, :issue:`15109`)
475
-
475
+
- Improved performance and reduced memory when indexing with a ``MultiIndex`` (:issue:`15245`)
476
476
- When reading buffer object in ``read_sas()`` method without specified format, filepath string is inferred rather than buffer object.
477
477
478
478
@@ -502,6 +502,8 @@ Bug Fixes
502
502
- Bug in ``DataFrame.loc`` with indexing a ``MultiIndex`` with a ``Series`` indexer (:issue:`14730`)
503
503
504
504
505
+
506
+
- Bug in the display of ``.info()`` where a qualifier (+) would always be displayed with a ``MultiIndex`` that contains only non-strings (:issue:`15245`)
505
507
- Bug in ``pd.read_msgpack()`` in which ``Series`` categoricals were being improperly processed (:issue:`14901`)
506
508
- Bug in ``Series.ffill()`` with mixed dtypes containing tz-aware datetimes. (:issue:`14956`)
0 commit comments