ENH: Allow inplace arithmetic operations #5104

jtratner · 2013-10-04T00:34:19Z

In current master (and before the initial arithmetic refactor), __iadd__ doesn't do anything, it's just a synonym for __add__ (or not defined). (just for reference, v0.12.0 had __iadd__ = __add__ on Series and no __add__ defined on DataFrame). It would be great to support in place ops on += and friends. Is this possible to do?

The text was updated successfully, but these errors were encountered:

jreback · 2013-10-04T00:42:37Z

with dtype conversion....seems ok?

In [19]: s = Series([1,2,3])

In [20]: s += 1.5

In [21]: s
Out[21]: 
0    2.5
1    3.5
2    4.5
dtype: float64

jtratner · 2013-10-04T00:45:46Z

It works but it's not actually in place (with += python uses __add__ if __iadd__ isn't defined):

In [1]: import pandas

In [2]: from pandas import *

In [3]: s = Series([1, 2, 3])

In [4]: s2 = s

In [5]: s += 1.5

In [6]: s
Out[6]:
0    2.5
1    3.5
2    4.5
dtype: float64

In [7]: s2
Out[7]:
0    1
1    2
2    3
dtype: int64

In [8]:

jreback · 2013-10-04T00:47:55Z

oh...i c....prob don't have tests for that I thought was ok (did it work before?)

jtratner · 2013-10-04T00:48:30Z

as I said, hasn't worked that way since at least v0.12, not sure if earlier.

jtratner · 2013-10-04T00:48:37Z

I'll take a look

jtratner · 2013-10-04T00:50:03Z

Looked back, hasn't (truly) supported inplace ops since at least 0.9 (at least not for DataFrame, and Series appears to have overridden inplace ops)

jreback · 2013-10-04T11:59:35Z

ok....should be straightforward in any event

BrenBarn · 2013-11-08T07:53:15Z

Would also be nice to have in-place versions of div, add, etc., with the axis argument allowing in-place operations on any axis.

jtratner · 2013-11-10T03:52:41Z

@BrenBarn even if you had inplace ops, pandas isn't actually going to do it inplace (i.e., it'll still allocate new memory for the final arrays). It would be relatively easy to add an inplace keyword argument, because we now have this _update_inplace method.

My problem is that it's a total lie to the end user: you think that you're doing an operation inplace and that this will mean you are saving on memory, but all you would be doing is a convenience function that just updates the wrapper.

BrenBarn · 2013-11-11T19:04:04Z

I see. So you mean it's simply not possible to update Pandas structures in-place at all? That is unfortunate.

jreback · 2013-11-11T19:19:51Z

@BrenBarn the structures CAN be updated in some cases in-place (a lot depends on what kind of operation it is), but for example, the operation would have to be 2 structures that are alignable w/o copy, and the operation itself would then be done in-place. e.g. arithmetic is straightforward. Any type of reshape or dtype change would invalidate this.

Would welcome some tests for this; that is the main issue. I don't think making is work all that hard.

BrenBarn · 2013-11-11T19:23:53Z

But this whole issue is about arithmetic operations, right? I'm not saying every possible manipulation should be available in-place, just basic ones that can already be done in-place on numpy arrays (e.g., add, div, etc.). Probably there could be a check to see that the data have the same indexes and same dtype and an error raised if the operation can't succeed.

jreback · 2013-11-11T19:40:54Z

@BrenBarn this could be done when the op doesn't change dtype, however, I think this might actually be more confusing, because their would be a subset of ops that are actually in-place, while most would not actually modify the underlying data, but replace it (as happens now in all cases). This would be deterministic, but IMHO confusing to the user.

A simple case of when its not possible is changes, e.g.

In [1]: s = Series([1,2,3])

In [2]: id(s.values)
Out[2]: 58659888

In [3]: s += 1.5

In [4]: id(s.values)
Out[4]: 58634416

Numpy keeps the same object, but just does a wrong thing IMHO

In [5]: x = np.array([1,2,3])

In [10]: id(x)
Out[10]: 61601360

In [11]: x += 1.5

In [12]: id(x)
Out[12]: 61601360

In [13]: x
Out[13]: array([2, 3, 4])

jtratner · 2013-11-11T22:05:49Z

just to echo @jreback - I did not mean it's impossible to update inplace,
it's just that generally because of pandas' flexibility a number of ops
cause copies.

shoyer · 2014-08-09T03:04:12Z

In-place operations have unfortunate complications when coupled with automatic index-based alignment, as I discovered when I tried to implement this for xray:
pydata/xarray#184

The problem is that you can end up with (int, int) operations giving you complete garbage if the second argument needs to be realigned and hence ends up with a bunch of NaN values.

Numpy does not catch in-place integer operations with non-scalar NaN values (even though it probably should raise)... so you can end up with complete garage, not just bad rounding:

>>> x = np.array([0])
>>> x += np.array([np.nan])
>>> x
array([-9223372036854775808])

jreback modified the milestones: 0.15.0, 0.14.0 Mar 28, 2014

jreback modified the milestones: 0.15.0, 0.16.0 Oct 9, 2014

This was referenced Oct 9, 2014

BUG: Bug in inplace operations with column sub-selections on the lhs (GH8511) #8520

Merged

Indexing problem with in-place operator #8511

Closed

jreback closed this as completed in #8520 Oct 9, 2014

TomAugspurger mentioned this issue Mar 4, 2015

Change in the way DataFrames are assigned / copied #9587

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Allow inplace arithmetic operations #5104

ENH: Allow inplace arithmetic operations #5104

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

jtratner commented Oct 4, 2013

jtratner commented Oct 4, 2013

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

BrenBarn commented Nov 8, 2013

jtratner commented Nov 10, 2013

BrenBarn commented Nov 11, 2013

jreback commented Nov 11, 2013

BrenBarn commented Nov 11, 2013

jreback commented Nov 11, 2013

jtratner commented Nov 11, 2013

shoyer commented Aug 9, 2014

ENH: Allow inplace arithmetic operations #5104

ENH: Allow inplace arithmetic operations #5104

Comments

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

jtratner commented Oct 4, 2013

jtratner commented Oct 4, 2013

jtratner commented Oct 4, 2013

jreback commented Oct 4, 2013

BrenBarn commented Nov 8, 2013

jtratner commented Nov 10, 2013

BrenBarn commented Nov 11, 2013

jreback commented Nov 11, 2013

BrenBarn commented Nov 11, 2013

jreback commented Nov 11, 2013

jtratner commented Nov 11, 2013

shoyer commented Aug 9, 2014