Not duplicates in multiIndex columns with duplicates not indexed properly when selected #4146

hayd · 2013-07-06T11:39:54Z

This appears to be a regression since 0.11 in handling duplicates in MultiIndex columns:

In [11]: df
Out[11]:
  h1 main  h3 sub  h5
0  a    A   1  A1   1
1  b    B   2  B1   2
2  c    B   3  A1   3
3  d    A   4  B2   4
4  e    A   5  B2   5
5  f    B   6  A2   6

In [12]: df2 = df.set_index(['main', 'sub']).T.sort_index(1)

In [13]: df2
Out[13]:
main  A        B
sub  A1 B2 B2 A1 A2 B1
h1    a  d  e  c  f  b
h3    1  4  5  3  6  2
h5    1  4  5  3  6  2

If we grab out successively we get an unexpected result for the non-duplicate:

In [14]: df2['A']
Out[14]:
sub A1 B2 B2
h1   a  d  e
h3   1  4  5
h5   1  4  5

In [15]: df2['A']['B2']
Out[15]:
sub B2 B2
h1   d  e
h3   4  5
h5   4  5

In [16]: df2['A']['A1']  # this worked in 0.11
Out[16]:
   0
0  a
1  1
2  1

In [21]: df2['A']['A1']  # pandas 0.11
Out[21]:
h1    a
h3    1
h5    1
Name: A1, dtype: object

FWIW never like how this can return different a type...

The text was updated successfully, but these errors were encountered:

hayd mentioned this issue Jul 6, 2013

DataFrame MultiIndex column access (and pop) #4145

Closed

jreback mentioned this issue Jul 6, 2013

BUG: (GH4145/4146) Fixed bugs in multi-index selection with column multi index duplicates #4148

Merged

hayd closed this as completed Jul 6, 2013

jreback mentioned this issue Jul 6, 2013

TST: additional test case for GH4146 #4149

Merged

hfactor13 mentioned this issue Jul 14, 2024

DOC: Added a missing docstring to pandas/conftest.py. #59244

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not duplicates in multiIndex columns with duplicates not indexed properly when selected #4146

Not duplicates in multiIndex columns with duplicates not indexed properly when selected #4146

hayd commented Jul 6, 2013

Not duplicates in multiIndex columns with duplicates not indexed properly when selected #4146

Not duplicates in multiIndex columns with duplicates not indexed properly when selected #4146

Comments

hayd commented Jul 6, 2013