Strange behavior assigning values to elements #3970

phobson · 2013-06-20T15:56:19Z

Basically, if you create a DataFrame and run the following:

import pandas as pd

df = pd.DataFrame({ "aa":range(5), "bb":[2.2]*5})

df["cc"] = 0.0 # remove this line, it works

ck = [True]*len(df)

df["bb"].iloc[0] = .13
df_tmp = df.iloc[ck] # or remove this line and it'll work
df["bb"].iloc[0] = .15 # doesn't happen

print(df)

which gives:

   aa    bb  cc
0   0  0.13   0
1   1  2.20   0
2   2  2.20   0
3   3  2.20   0
4   4  2.20   0

Very strange.

Specs:

pd.version.version
Out[5]: '0.11.0'

np.version.version
Out[7]: '1.7.1'

jreback · 2013-06-20T16:00:14Z

I responded to your e-mail; reproducing here.

This is not a bug; you are modifying a copy (which is caused by the dtype change).
Use loc like this:

http://pandas.pydata.org/pandas-docs/dev/indexing.html#indexing-view-versus-copy

In [11]: df.loc[0,'bb'] = .13

In [12]: df_tmp = df[ck]

In [13]: df.loc[0,'bb'] = .15

In [14]: df
Out[14]: 
   aa    bb
0   0  0.15
1   1  2.20
2   2  2.20
3   3  2.20
4   4  2.20
5   5  2.20
6   6  2.20
7   7  2.20
8   8  2.20
9   9  2.20

phobson · 2013-06-20T16:30:55Z

I'm confused. Where does the dtype change?

I do agree that this behaves as expected

import pandas as pd
df = pd.DataFrame({ "aa":range(5), "bb":[2.2]*5})
df["cc"] = 0.0 
ck = [True]*len(df)
df.loc[0, 'bb'] = .13
df_tmp = df.iloc[ck] 
df.loc[0, 'bb'] = .15 
print(df)

But how is this not inconsistent behavior?

import pandas as pd
df = pd.DataFrame({ "aa":range(5), "bb":[2.2]*5})
df["cc"] = 0.0 
ck = [True]*len(df)
df["bb"].loc[0] = .13 # works
df_tmp = df.iloc[ck] 
df["bb"].loc[0] = .15 # doesn't work
print(df)

In other words, why can I change an element once, but not a second time?

jreback · 2013-06-20T16:51:29Z

Sorry, I was thinking about another issue; dtype is not the problem here.

This:
df["bb"].loc[0] = .13

creates a copy (so you assign to the copy rather than the frame).
Note that this does not always create a copy (it depends on whether the result is a view or not,
and unfortunately that is defined by numpy).

This is why a multi-axes assignment should use all axes in a single loc/iloc.
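
A minimal sketch of the single-step form recommended above, reusing the frame from the original report. It assumes a current pandas running on Python 3 rather than the 0.11.0 setup in the report, so it only illustrates the recommended pattern and does not reproduce the reported bug. The name first_bb is introduced here purely for illustration.

```python
import pandas as pd

# Same frame as in the original report.
df = pd.DataFrame({"aa": range(5), "bb": [2.2] * 5})
df["cc"] = 0.0
ck = [True] * len(df)

# Row and column are given in one .loc call, so the assignment
# targets the frame itself rather than an intermediate Series.
df.loc[0, "bb"] = 0.13

# Boolean-mask selection returns a new object with its own data.
df_tmp = df[ck]

# The second single-step assignment also lands in the frame.
df.loc[0, "bb"] = 0.15

first_bb = df.loc[0, "bb"]
print(df)
```

Since df_tmp was taken before the second assignment and holds its own data, it should still show 0.13 in that cell while df shows 0.15.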

jreback · 2013-06-20T16:53:47Z

df_tmp = df.iloc[ck] triggers a copy of the underlying data (so it is no longer a view); not exactly sure why.

cpcloud · 2013-06-20T17:58:22Z

Doesn't indexing with a sequence (or any other non-slice, non-tuple object that is a valid index) trigger a copy? It does with numpy.

jreback · 2013-06-20T18:04:29Z

The old advanced/fancy vs. basic indexing distinction; could be.

cpcloud · 2013-06-20T18:24:24Z

Argh, there's no explanation for why fancy indexing requires a copy. Maybe it's obvious, but I don't see it.

jreback · 2013-06-20T18:31:30Z

I think it's really related to how memory is laid out, what you are doing, whether it can be done in an efficient manner, and so forth; basically implementation dependent.

cpcloud · 2013-06-20T18:36:44Z

It's probably because, in general, fancy indexing must follow the rule that any operation leading to an irregularly strided array must return a copy. In some cases (ones that are equivalent to slicing) you could have views, but there's the overhead of checking whether there's an equivalent slice to the passed numpy array.
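
The view-versus-copy distinction discussed above can be checked directly in numpy. This is a small sketch, assuming a numpy recent enough to provide np.shares_memory; the array and variable names are made up for illustration.

```python
import numpy as np

a = np.arange(5, dtype=float)

# Basic slicing: the result is a view onto a's buffer.
view = a[1:4]

# Fancy indexing with an integer sequence or a boolean mask:
# the result is a new array, even though it picks the same elements.
fancy = a[[1, 2, 3]]
masked = a[np.array([False, True, True, True, False])]

slice_shares = np.shares_memory(a, view)
fancy_shares = np.shares_memory(a, fancy)
mask_shares = np.shares_memory(a, masked)

view[0] = -1.0   # writes through to a[1]
fancy[0] = -2.0  # only changes the copy

print(slice_shares, fancy_shares, mask_shares)
print(a)
```

All three selections pick elements 1 through 3, but only the slice shares memory with a, which is why a write through it is visible in the original array.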

wesm · 2013-06-20T20:31:37Z

Looks buggy to me, marked as such and labeled for 0.11.1

jreback · 2013-06-20T20:40:57Z

I'll take a look.

johannh-zz · 2013-06-21T13:08:45Z

Here's another good one. If I look at df["bb"] after this, I see the change (on 0.11):

In [21]: print df
   aa    bb  cc
0   0  0.13   0
1   1  2.20   0
2   2  2.20   0
...

In [22]: df["bb"]
Out[22]:
0    0.15
1    2.20
2    2.20
3    2.20
...

jreback · 2013-06-28T18:41:53Z

closed by #4077

jreback mentioned this issue Jun 21, 2013

BUG: Possibly invalidate the item_cache when numpy implicty converts a v... #3977

Closed

wesm added a commit to wesm/pandas that referenced this issue Jun 28, 2013

BUG: fix item cache invalidation bug in pandas-dev#3970 caused by is_…

2041d72

…mixed_type silently consolidating (hurf). also fix stable sorting bug presenting on my machine

jreback closed this as completed Jun 28, 2013

This was referenced Jun 29, 2013

BUG: GH4080, int conversion of underlying series to float needs updating in parent frame #4081

Closed

BUG: series update #4080

Closed
