Bug related to multilevel index series ? #2706

halleygithub · 2013-01-19T03:39:00Z

I am upgrading Pandas from 0.8.1 to 0.10.1.dev-f7f7e13 . My environment is Window XP with below: Python: 2.7.3 Numpy: 1.6.2 MPL: 1.1.1 Pandas: 0.10.1.dev-f7f7e13.

Then OK application on 0.8.1 now meets errors. I trace the root cause to filtering the duplicated index of Series. Detail in : http://stackoverflow.com/questions/14395678/how-to-drop-extra-copy-of-duplicate-index-of-pandas-series

simply put: below snippet has two issues :

import pandas as pd
idx_tp = [('600809', '20061231'), ('600809', '20070331'), ('600809', '20070630'), ('600809', '20070331')]
dt = ['demo','demo','demo','demo']
idx = pd.MultiIndex.from_tuples(idx_tp,names = ['STK_ID','RPT_Date'])
s = pd.Series(dt,index=idx)

Issue 1: s[s.index.unique()] works well on 0.8.1 but not 0.10.1

Issue 2: s.groupby(s.index).first() will crash on my machine

wesm · 2013-01-19T17:51:47Z

thanks will have a look

wesm · 2013-01-20T21:37:43Z

Fixed the second issue. I'm surprised the first ever worked, going to have a look at 0.8.1

wesm · 2013-01-20T21:59:52Z

The first is not a supported API and only worked by accident before. Please do something like:

In [16]: s[-Series(s.index.values, s.index).duplicated()]
Out[16]: 
STK_ID  RPT_Date
600809  20061231    demo
        20070331    demo
        20070630    demo

I need to add a top level function duplicated.

halleygithub · 2013-01-21T02:25:40Z

Thanks. but "s[s.index.unique()]" looks elegant than "s[-Series(s.index.values, s.index).duplicated()]" .

ghost assigned changhiskhan and wesm Jan 20, 2013

wesm added a commit that referenced this issue Jan 20, 2013

BUG: fix groupby segfault on MultiIndex issue #2706

4022a03

wesm closed this as completed Jan 20, 2013

wesm mentioned this issue Jan 20, 2013

Add duplicated/drop_duplicates top-level array functions #2715

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug related to multilevel index series ? #2706

Bug related to multilevel index series ? #2706

halleygithub commented Jan 19, 2013

wesm commented Jan 19, 2013

wesm commented Jan 20, 2013

wesm commented Jan 20, 2013

halleygithub commented Jan 21, 2013

Bug related to multilevel index series ? #2706

Bug related to multilevel index series ? #2706

Comments

halleygithub commented Jan 19, 2013

Issue 1: s[s.index.unique()] works well on 0.8.1 but not 0.10.1

Issue 2: s.groupby(s.index).first() will crash on my machine

wesm commented Jan 19, 2013

wesm commented Jan 20, 2013

wesm commented Jan 20, 2013

halleygithub commented Jan 21, 2013