BUG: ufunc is not applied to sparse.fill_value #13853

sinhrks · 2016-07-30T22:01:42Z

tests added / passed
passes git diff upstream/master | flake8 --diff
whatsnew entry

When ufunc is applied to sparse, it is not applied to fill_value. Thus results are incorrect.

on current master:

np.abs(pd.SparseArray([1, -2, -1], fill_value=-2))
# [1.0, -2, 1.0]
# Fill: -2
# IntIndex
# Indices: array([0, 2], dtype=int32)

np.add(pd.SparseArray([1, -2, -1], fill_value=-2), 1)
# [2.0, -2, 0.0]
# Fill: -2
# IntIndex
# Indices: array([0, 2], dtype=int32)

cc @gfyoung

codecov-io · 2016-07-30T22:49:01Z

Current coverage is 85.27% (diff: 93.33%)

Merging #13853 into master will decrease coverage by <.01%

@@             master     #13853   diff @@
==========================================
  Files           139        139          
  Lines         50020      50031    +11   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits          42657      42666     +9   
- Misses         7363       7365     +2   
  Partials          0          0

Powered by Codecov. Last update 97de42a...a14f573

gfyoung · 2016-07-31T03:04:35Z

pandas/sparse/tests/test_array.py

+        tm.assert_sp_array_equal(np.abs(sparse), result)
+
+        sparse = SparseArray([1, -1, 2, -2], fill_value=1)
+        result = SparseArray([1, 2, 2], sparse_index=sparse.sp_index,


Can you explain why this result is correct?

Because assert_sp_array_equal compares sparse internal representation, it is for prepare correct internal repr. You can see the result is correct from its dense repr.

# test case sparse = pd.SparseArray([1, -1, 2, -2], fill_value=1) abs(sparse).to_dense() # array([ 1., 1., 2., 2.])

# result pd.SparseArray([1, 2, 2], sparse_index=sparse.sp_index, fill_value=1).to_dense() # array([ 1., 1., 2., 2.])

Okay, good to know. I wasn't 100% clear on how the sparse comparison worked. Thanks!

gfyoung · 2016-08-01T03:59:23Z

LGTM

cc @jreback

jreback · 2016-08-01T10:35:43Z

pandas/sparse/array.py

@@ -213,6 +213,17 @@ def kind(self):
        elif isinstance(self.sp_index, IntIndex):
            return 'integer'

+    def __array_wrap__(self, out_arr, context=None):
+        if isinstance(context, tuple) and len(context) == 3:


can you put a comment here of what this is doing

jreback · 2016-08-01T10:43:18Z

I understand why this is needed, but it feels a tad unnatural. I am not sure a user will be expecting that the fill value will have the ufunc be applied here. Can we add a section to the docs showing this?

sinhrks · 2016-08-01T13:17:26Z

Sure, added small section.

sinhrks added Bug Sparse Sparse Data Type Compat pandas objects compatability with Numpy or Python functions labels Jul 30, 2016

sinhrks added this to the 0.19.0 milestone Jul 30, 2016

gfyoung reviewed Jul 31, 2016
View reviewed changes

sinhrks force-pushed the sparse_ufunc branch from 1603188 to 985594f Compare July 31, 2016 22:52

jreback reviewed Aug 1, 2016
View reviewed changes

sinhrks force-pushed the sparse_ufunc branch from 985594f to 9787625 Compare August 1, 2016 13:16

sinhrks force-pushed the sparse_ufunc branch from 9787625 to c46beed Compare August 1, 2016 13:18

BUG: ufunc is not applied to sparse.fill_value

a14f573

sinhrks force-pushed the sparse_ufunc branch from c46beed to a14f573 Compare August 3, 2016 13:31

jreback closed this in 5f47608 Aug 3, 2016

sinhrks deleted the sparse_ufunc branch August 3, 2016 23:06

sinhrks mentioned this pull request Sep 7, 2016

SparseSeries.__array__ only returns non-fills #14167

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: ufunc is not applied to sparse.fill_value #13853

BUG: ufunc is not applied to sparse.fill_value #13853

sinhrks commented Jul 30, 2016

codecov-io commented Jul 30, 2016 •

edited

Loading

gfyoung Jul 31, 2016

sinhrks Jul 31, 2016

gfyoung Aug 1, 2016

gfyoung commented Aug 1, 2016

jreback Aug 1, 2016

jreback commented Aug 1, 2016

sinhrks commented Aug 1, 2016

BUG: ufunc is not applied to sparse.fill_value #13853

BUG: ufunc is not applied to sparse.fill_value #13853

Conversation

sinhrks commented Jul 30, 2016

codecov-io commented Jul 30, 2016 • edited Loading

Current coverage is 85.27% (diff: 93.33%)

gfyoung Jul 31, 2016

Choose a reason for hiding this comment

sinhrks Jul 31, 2016

Choose a reason for hiding this comment

gfyoung Aug 1, 2016

Choose a reason for hiding this comment

gfyoung commented Aug 1, 2016

jreback Aug 1, 2016

Choose a reason for hiding this comment

jreback commented Aug 1, 2016

sinhrks commented Aug 1, 2016

codecov-io commented Jul 30, 2016 •

edited

Loading