Modifying DataFrame first row produces wrong result #29917

t6s4 · 2019-11-28T14:17:03Z

# Reproducing faulty behavior:
import pandas as pd

dfs = pd.DataFrame([[1, 'a1', 31], [2, 'a2', 32], [3, 'a3', 33]], index=['a', 'b', 'c'], columns=['c1', 'c2', 'c3'])
# Assign a dict to the first row by position
dfs.iloc[0] = {'c1': 101, 'c2': 'A1', 'c3': 111}
assert dfs.iloc[0, 0] == 101     # FAILED
assert dfs.iloc[0, 1] == 'A1'    # FAILED
assert dfs.iloc[0, 2] == 111     # FAILED

Problem description

Assigning a dict to the first row with iloc writes the column labels (the dict keys) into the row instead of the dict values.

jorisvandenbossche · 2019-11-28T19:05:02Z

Thanks for the report!

According to our docs this should indeed work (https://dev.pandas.io/docs/user_guide/indexing.html#attribute-access, see the "You can also assign a dict to a row of a DataFrame:" a bit below the linked title). The example from there:

In [24]: x = pd.DataFrame({'x': [1, 2, 3], 'y': [3, 4, 5]})

In [25]: x.iloc[1] = {'x': 9, 'y': 99}

In [26]: x
Out[26]: 
   x   y
0  1   3
1  9  99
2  3   5

Now, in practice, this only seems to work like that if your dataframe has uniform types. E.g., if I change the second column to float in the above example, it no longer works:

In [63]: x = pd.DataFrame({'x': [1, 2, 3], 'y': [0.3, 0.4, 0.5]})  

In [64]: x.iloc[1] = {'x': 9, 'y': 99}                         

In [65]: x   
Out[65]: 
   x    y
0  1  0.3
1  x    y
2  3  0.5
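To tell in advance whether a frame falls into the "uniform types" case described above, one rough check is to count the distinct column dtypes. This is only a sketch: `has_uniform_dtypes` is a hypothetical helper name, and treating "one distinct dtype" as equivalent to "uniform types" is an assumption, not something the pandas docs state.

```python
import pandas as pd


def has_uniform_dtypes(df):
    """Hypothetical helper: True if every column of df shares one dtype."""
    return bool(df.dtypes.nunique() == 1)


# Both columns are integers, as in the docs example.
uniform = pd.DataFrame({'x': [1, 2, 3], 'y': [3, 4, 5]})
# Second column changed to float, as in the failing example.
mixed = pd.DataFrame({'x': [1, 2, 3], 'y': [0.3, 0.4, 0.5]})

print(has_uniform_dtypes(uniform))  # True
print(has_uniform_dtypes(mixed))    # False
```

The frame from the original report (int, object, int columns) also counts as mixed under this check.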

Now, it seems to have been this way for a very long time (tested 0.21 and 0.18, which show the same), so it is certainly not a recent regression (if it ever worked).

BTW, if you want to have this work reliably, you can simply wrap the dict in a Series:

In [66]: x = pd.DataFrame({'x': [1, 2, 3], 'y': [0.3, 0.4, 0.5]}) 

In [67]: x.iloc[1] = pd.Series({'x': 9, 'y': 99}) 

In [68]: x 
Out[68]: 
   x     y
0  1   0.3
1  9  99.0
2  3   0.5
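Another option that does not depend on how the row setter interprets a dict is to assign each value to its column individually. The following is a minimal sketch, not a pandas API: `set_row_from_dict` is a hypothetical helper, and it assumes the column labels are unique so that `Index.get_loc` returns a single integer position.

```python
import pandas as pd


def set_row_from_dict(df, pos, values):
    """Hypothetical helper: write dict `values` into the row at integer
    position `pos`, one column at a time (assumes unique column labels)."""
    for col, val in values.items():
        # get_loc raises KeyError if `col` is not a column of df.
        df.iloc[pos, df.columns.get_loc(col)] = val


# The mixed-dtype frame from the example above.
x = pd.DataFrame({'x': [1, 2, 3], 'y': [0.3, 0.4, 0.5]})
set_row_from_dict(x, 1, {'x': 9, 'y': 99})

# The frame from the original report.
dfs = pd.DataFrame([[1, 'a1', 31], [2, 'a2', 32], [3, 'a3', 33]],
                   index=['a', 'b', 'c'], columns=['c1', 'c2', 'c3'])
set_row_from_dict(dfs, 0, {'c1': 101, 'c2': 'A1', 'c3': 111})
```

Setting scalars column by column is slower than a single row assignment, but each value is matched to its column by label, so the order of the dict keys does not matter.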

jorisvandenbossche · 2019-11-28T19:07:11Z

See #16724 for a similar issue (and we came to the same conclusion about the dtypes then :)).
Closing this as a duplicate.

jorisvandenbossche · 2019-11-28T19:07:25Z

Duplicate of #16724

jorisvandenbossche added the Indexing and Needs Discussion labels Nov 28, 2019

jorisvandenbossche marked this as a duplicate of #16724 Nov 28, 2019

jorisvandenbossche closed this as completed Nov 28, 2019

jorisvandenbossche added the Duplicate Report label and removed the Needs Discussion label Nov 28, 2019

jorisvandenbossche added this to the No action milestone Nov 28, 2019

jorisvandenbossche mentioned this issue Nov 28, 2019

Unexpected result when setting a row by a dict #16724

Closed
