MultiIndex reindex should behave like Index. #7895

rockg · 2014-08-01T00:51:23Z

I'm having trouble understanding how exactly reindex is supposed to work in the case of a MultiIndex. What I would like is the same behavior as in the Index case, where values can be filled or padded along some level of the MultiIndex. I can't work out what reindex is doing in the MultiIndex case. Below is an example where I would like to go from a monthly frame to a daily frame, filling each new date with the prior value.

import pandas as pd
d1 = pd.date_range('1/1/2012', '3/1/2012', freq='MS')
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]}, index=d1)
dfu = df.unstack()

dfu
Out[12]: 
A  2012-01-01    1
   2012-02-01    2
   2012-03-01    3
B  2012-01-01    4
   2012-02-01    5
   2012-03-01    6
dtype: int64

d2 = pd.date_range('1/1/2012', '3/31/2012', freq='D')

dfu.reindex(d2, level=1) # What is this supposed to do?
Out[17]: 
A  2012-01-01    1
   2012-02-01    2
   2012-03-01    3
B  2012-01-01    4
   2012-02-01    5
   2012-03-01    6
dtype: int64

df.reindex(d2, method='pad').unstack() #Ideally something like this, but without the trickery
Out[19]: 
A  2012-01-01    1
   2012-01-02    1
   2012-01-03    1
   2012-01-04    1
   2012-01-05    1
   2012-01-06    1
   2012-01-07    1
   2012-01-08    1
   2012-01-09    1
   2012-01-10    1
   2012-01-11    1
   2012-01-12    1
   2012-01-13    1
   2012-01-14    1
   2012-01-15    1
...
B  2012-03-17    6
   2012-03-18    6
   2012-03-19    6
   2012-03-20    6
   2012-03-21    6
   2012-03-22    6
   2012-03-23    6
   2012-03-24    6
   2012-03-25    6
   2012-03-26    6
   2012-03-27    6
   2012-03-28    6
   2012-03-29    6
   2012-03-30    6
   2012-03-31    6
Length: 182, dtype: int64


jreback · 2014-08-01T00:54:35Z

we were just discussing this today
see #7886

rockg · 2014-08-01T00:58:15Z

That seemed kind of different to me, but maybe I'm not looking closely enough.

jreback · 2014-08-01T00:59:57Z

see also comments in #7867

jreback · 2014-08-01T01:02:52Z

you can do this easily with a groupby

something like

df.groupby(level=0).apply(lambda x: x.reindex(d2, method='pad'))

we could probably make reindex a groupby method directly (to make this syntactically nicer),
or maybe have reindex do this under the hood

jreback · 2014-08-01T12:58:06Z

Ya, I guess this is a bit awkward

In [29]: dfu.reset_index().groupby('level_0').apply(lambda x: x.set_index('level_1').reindex(d2,method='pad'))
Out[29]: 
                   level_0  0
level_0                      
A       2012-01-01       A  1
        2012-01-02       A  1
        2012-01-03       A  1
        2012-01-04       A  1
        2012-01-05       A  1
        2012-01-06       A  1
        2012-01-07       A  1
        2012-01-08       A  1
        2012-01-09       A  1
        2012-01-10       A  1
        2012-01-11       A  1
        2012-01-12       A  1
        2012-01-13       A  1
        2012-01-14       A  1
        2012-01-15       A  1
        2012-01-16       A  1
        2012-01-17       A  1
        2012-01-18       A  1
        2012-01-19       A  1
        2012-01-20       A  1
        2012-01-21       A  1
        2012-01-22       A  1
        2012-01-23       A  1
        2012-01-24       A  1
        2012-01-25       A  1
        2012-01-26       A  1
        2012-01-27       A  1
        2012-01-28       A  1
        2012-01-29       A  1
        2012-01-30       A  1
...                    ... ..
B       2012-03-02       B  6
        2012-03-03       B  6
        2012-03-04       B  6
        2012-03-05       B  6
        2012-03-06       B  6
        2012-03-07       B  6
        2012-03-08       B  6
        2012-03-09       B  6
        2012-03-10       B  6
        2012-03-11       B  6
        2012-03-12       B  6
        2012-03-13       B  6
        2012-03-14       B  6
        2012-03-15       B  6
        2012-03-16       B  6
        2012-03-17       B  6
        2012-03-18       B  6
        2012-03-19       B  6
        2012-03-20       B  6
        2012-03-21       B  6
        2012-03-22       B  6
        2012-03-23       B  6
        2012-03-24       B  6
        2012-03-25       B  6
        2012-03-26       B  6
        2012-03-27       B  6
        2012-03-28       B  6
        2012-03-29       B  6
        2012-03-30       B  6
        2012-03-31       B  6

[182 rows x 2 columns]
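
For readers who want a Series back rather than the two-column frame above, here is a minimal sketch of the same per-group idea. It is a variation, not the code from the comment: it iterates over the groups explicitly, drops the outer level so that each piece has a plain DatetimeIndex, pads it onto d2, and glues the pieces back together with pd.concat. The name `pieces` is introduced only for illustration.

```python
import pandas as pd

# Rebuild the data from the original post.
d1 = pd.date_range('1/1/2012', '3/1/2012', freq='MS')
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]}, index=d1)
dfu = df.unstack()
d2 = pd.date_range('1/1/2012', '3/31/2012', freq='D')

# For each label of the outer level, drop that level so the piece has a
# plain DatetimeIndex, then pad it onto the daily range.
pieces = {
    key: group.droplevel(0).reindex(d2, method='pad')
    for key, group in dfu.groupby(level=0)
}

# Concatenating a dict restores the outer level from the dict keys.
result = pd.concat(pieces)
print(result.head())
```

Because each piece is reindexed on an ordinary Index, `method='pad'` behaves exactly as it does in the single-Index case, which is the behavior the original post asks for.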

markb-trustifi · 2018-06-22T19:49:25Z

In my case the solution has been to create a full multiindex by:
midx = pd.MultiIndex.from_product([pd.date_range(df['date'].min(), df['date'].max()), range(24)])
and after that to apply it to the original data frame:
df = df.reindex(midx)
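
A minimal sketch of the same full-MultiIndex idea, applied to the dfu series from the original post rather than to the frame with a 'date' column described in this comment. The forward fill within each outer label via groupby is an addition for illustration, since reindexing against the full product only introduces NaN for the new rows; the names `full_index`, `expanded`, and `filled` are hypothetical.

```python
import pandas as pd

# Rebuild the data from the original post.
d1 = pd.date_range('1/1/2012', '3/1/2012', freq='MS')
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]}, index=d1)
dfu = df.unstack()
d2 = pd.date_range('1/1/2012', '3/31/2012', freq='D')

# Every (letter, day) combination: outer labels crossed with the daily range.
full_index = pd.MultiIndex.from_product([dfu.index.levels[0], d2])

# Reindexing on the full MultiIndex adds the missing rows as NaN.
expanded = dfu.reindex(full_index)

# Forward fill separately within each outer label so that values from 'A'
# never leak into 'B'.
filled = expanded.groupby(level=0).ffill()
print(filled.head())
```

Note that the values become floats because NaN is introduced before filling; cast back with astype(int) if integer output matters.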

ms7463 · 2021-12-15T09:45:39Z

Could create a utility like this.

def mi_reindex(df, idx, axis=0):
    if axis == 1:
        df = df.T
    if not isinstance(idx, pd.Index):
        raise ValueError('idx must be an index object')
    if (not all(idx.names)) or (not all(df.index.names)):
        raise ValueError('All indexes must have non-null names')
    if set(idx.names) - set(df.index.names):
        raise ValueError('idx names must be a subset of df index names')

    meta = df.index.to_frame(index=False)
    idx_df = idx.to_frame(index=False)
    new_meta = meta.drop(idx_df.columns, axis=1).drop_duplicates().merge(idx_df, how='cross')[meta.columns]
    df = df.reindex(pd.MultiIndex.from_frame(new_meta))
    if axis == 1:
        df = df.T
    return df

dfu = dfu.rename_axis(['Letters', 'Dates'])
d2 = d2.rename('Dates')

mi_reindex(dfu, d2).ffill()

@jreback - does there exist a "blessed" library for pandas utils that don't quite belong in core, but show up often enough to warrant being cataloged somewhere?

jreback added API Design labels Aug 1, 2014

jreback added this to the 0.15.1 milestone Aug 1, 2014

rockg mentioned this issue Sep 19, 2014

Bloomberg Hackathon #8323

Closed

jreback modified the milestones: 0.16.0, Next Major Release Mar 6, 2015

jreback mentioned this issue Jun 13, 2015

limiting reindex with MultiIndex ffill/bfill within levels. #10347

Closed

jreback added Difficulty Advanced labels Jun 13, 2015

jreback mentioned this issue Oct 12, 2015

PERF: groupby-fillna perf, implement in cython #11296

Closed

jreback mentioned this issue Feb 13, 2016

API: should reindex on a level introduce NaNs for missing entries per label of other levels? #12319

Open

chris-b1 mentioned this issue Feb 13, 2017

reindex() doesn't work with MultiIndex #15384

Closed

jreback added the Prio-medium label May 6, 2017

jreback mentioned this issue May 6, 2017

ENH: resample - upsampling on level #16267

Open

jreback modified the milestones: Next Major Release, Interesting Issues May 6, 2017

jreback modified the milestones: Interesting Issues, Next Major Release Nov 26, 2017

jreback mentioned this issue Mar 8, 2018

reindex() does not reindex a MultiIndex #20048

Closed

matthewgilbert mentioned this issue May 29, 2018

reindex() inconsistency between Index and MultiIndex #21247

Closed

jbrockmendel removed Effort Medium labels Oct 21, 2019

mroeschke removed the API Design label Apr 11, 2021

jreback mentioned this issue Dec 2, 2021

ENH: Custom date boundaries for resamplers? #44712

Closed

simonjayhawkins mentioned this issue Jun 2, 2022

Passing level to reindex on a multi-index doesn't reindex the desired level #25460

Closed

mroeschke removed this from the Contributions Welcome milestone Oct 13, 2022
