Additional tests for ufunc(Series) #26951

TomAugspurger · 2019-06-19T18:02:20Z

This adds a set of tests for ufuncs on Series. The goal is to
establish the correct behavior prior to implementing Series.__array_ufunc__.

There are two kinds of xfails right now

Series[Sparse] fails because Series.__array_ufunc__ doesn't
yet dispatch to Series.array.__array_ufunc__
ufunc(series, series) when the two series are unaligned. It's
been determined that these should align, but isn't currently
implemented.

LMK if there's anything I can do to make these tests clearer. The heavy parameterization is a bit unfortunate, but I think the best option.

xref #23293

This adds a set of tests for ufuncs on Series. The goal is to establish the correct behavior prior to implementing `Series.__array_ufunc__`. There are two kinds of xfails right now 1. Series[Sparse] fails because `Series.__array_ufunc__` doesn't yet dispatch to `Series.array.__array_ufunc__` 2. `ufunc(series, series)` when the two series are unaligned. It's been determined that these should align, but isn't currently implemented.

pandas/tests/series/test_ufunc.py

jbrockmendel · 2019-06-19T18:49:55Z

Should DTA[tz] be included along with Sparse?

codecov · 2019-06-19T18:56:33Z

Codecov Report

Merging #26951 into master will increase coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #26951      +/-   ##
==========================================
+ Coverage   91.98%   91.98%   +<.01%     
==========================================
  Files         180      180              
  Lines       50772    50774       +2     
==========================================
+ Hits        46704    46706       +2     
  Misses       4068     4068

Flag	Coverage Δ
#multiple	`90.62% <ø> (+0.05%)`	⬆️
#single	`41.85% <ø> (-0.06%)`	⬇️

Impacted Files	Coverage Δ
pandas/io/gbq.py	`88.88% <0%> (-11.12%)`	⬇️
pandas/core/frame.py	`96.89% <0%> (-0.12%)`	⬇️
pandas/io/pytables.py	`90.3% <0%> (ø)`	⬆️
pandas/core/arrays/sparse.py	`94.19% <0%> (+0.46%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2243629...d1788b0. Read the comment docs.

TomAugspurger · 2019-06-19T19:00:50Z

Should DTA[tz] be included along with Sparse?

IMO, the intention of these tests is to verify the correctness of the future Series.__array_ufunc__. I parametrize over Series[int] and Series[Sparse[int]] to verify the correctness of the dispatch to array logic. In theory the dispatch logic will handle all arrays the same. I'll add a small sanity test, separate from the rest, since the set of ufuncs valid for datetime64 is different.

jbrockmendel · 2019-06-19T19:03:41Z

OK, many of the DTA[tz] ufuncs are currently broken, so its OK by me if you want to leave them out.

TomAugspurger · 2019-06-20T16:47:03Z

0b1e745 adds another xfail for ufunc(Index, Series), e.g. np.add(index, series). Right now that returns an Index

In [1]: import pandas as pd; import numpy as np

In [2]: a = pd.Series([1, 2])

In [3]: b = pd.Index([1, 2])

In [4]: np.add(b, a)
Out[4]: Int64Index([2, 4], dtype='int64')

I think that's a bug. It's inconsistent with the binop

In [5]: b + a
Out[5]:
0    2
1    4
dtype: int64

So the rule is that Index.__array_ufunc__ should check the types for Series. If any are found then it should return NotImplemented, so that Series.__array_ufunc__ can take over.

(cherry picked from commit ba5cb55)

TomAugspurger · 2019-06-20T20:05:33Z

775c2ef fixes an issue with one of the tests. The test wasn't correct for ufunc(a, b) when we shuffle b (to force alignment). It was passing earlier, since we xfail shuffle=True for now.

I think this PR should be ready to go. I'd like to merge Joris' array_ufunc branch on top of it to actually start fixing the xfails.

jreback

seem reasonble, just some readability comments

pandas/tests/series/test_ufunc.py

jreback · 2019-06-21T01:06:24Z

pandas/tests/series/test_ufunc.py

+def test_binary_ufunc(ufunc, sparse, shuffle, box_other,
+                      flip,
+                      arrays_for_binary_ufunc):
+    # Check the invariant that


can you give a little expl of what the parametizations do if not obvious, e.g. flip & shuffle actually are not immediately obvious

I may split those out to separate tests. It'll be a bit more code, but much clearer.

jreback · 2019-06-21T01:06:39Z

pandas/tests/series/test_ufunc.py

+    # Check the invariant that
+    #   ufunc(Series(a), Series(b)) == Series(ufunc(a, b))
+    #   with alignment.
+    a1, a2 = arrays_for_binary_ufunc


can you call these: left_array, right_array

I know its a bit longer, but more readable IMHO

jreback · 2019-06-21T01:07:34Z

pandas/tests/series/test_ufunc.py

+        a2 = pd.SparseArray(a2, dtype=pd.SparseDtype('int', 0))
+
+    name = "name"
+    s1 = pd.Series(a1, name=name)


left_series and right_series

jreback · 2019-06-21T01:08:44Z

pandas/tests/series/test_ufunc.py

+            a2 = a2.take(idx)
+
+    a, b = s1, s2
+    c, d = a1, a2


I find this very confusing, can you use the same terms and not use a, b, c, d?

jreback · 2019-06-21T01:09:07Z

pandas/tests/series/test_ufunc.py

+    series = pd.Series(array, name="name")
+
+    a, b = series, other
+    c, d = array, other


same as above if you can make this more clear

pandas/tests/series/test_ufunc.py

jreback · 2019-06-21T01:10:12Z

pandas/tests/series/test_ufunc.py

+        array = pd.SparseArray(array)
+
+    series = pd.Series(array, name="name")
+    result = np.modf(series)


ok I c you are doing this here

jreback · 2019-06-21T01:11:16Z

do we have an issue for this? (or maybe just xref the actual ufunc issue at the top of the PR)

TomAugspurger · 2019-06-21T15:17:43Z

Changes

split the binop ufunc test into three: with array, with index, with Series. Simplified things quite a bit I think
Cleaned up the names. Hopefully array_args and series_args make things much clearer that a, b, c, d.

jreback · 2019-06-21T15:26:10Z

lgtm.

jreback · 2019-06-21T15:58:34Z

thanks @TomAugspurger

TomAugspurger added 2 commits June 19, 2019 12:57

fixup release note

8f46391

TomAugspurger added Numeric Operations Arithmetic, Comparison, and Logical operations Testing pandas testing functions or related to the test suite labels Jun 19, 2019

TomAugspurger added this to the 0.25.0 milestone Jun 19, 2019

TomAugspurger commented Jun 19, 2019

View reviewed changes

pandas/tests/series/test_ufunc.py Outdated Show resolved Hide resolved

pandas/tests/series/test_ufunc.py Outdated Show resolved Hide resolved

pandas/tests/series/test_ufunc.py Outdated Show resolved Hide resolved

TomAugspurger added 3 commits June 19, 2019 14:33

fixups

44e3c7e

remove stale comment

e179913

xfail ufunc(series, index)

0b1e745

TomAugspurger added 2 commits June 20, 2019 14:00

32-bit compat

9be1dff

(cherry picked from commit ba5cb55)

fixup

775c2ef

jreback requested changes Jun 21, 2019

View reviewed changes

TomAugspurger added 3 commits June 21, 2019 09:16

Merge remote-tracking branch 'upstream/master' into series-array-ufunc

bbbf269

more

64d8908

lint

d1788b0

jreback approved these changes Jun 21, 2019

View reviewed changes

jreback merged commit ba69f95 into pandas-dev:master Jun 21, 2019

Uh oh!

Additional tests for ufunc(Series) #26951

Additional tests for ufunc(Series) #26951

Uh oh!

Conversation

TomAugspurger commented Jun 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jbrockmendel commented Jun 19, 2019

Uh oh!

codecov bot commented Jun 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

TomAugspurger commented Jun 19, 2019

Uh oh!

jbrockmendel commented Jun 19, 2019

Uh oh!

TomAugspurger commented Jun 20, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TomAugspurger commented Jun 20, 2019

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

TomAugspurger Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jreback Jun 21, 2019

Choose a reason for hiding this comment

Uh oh!

jreback commented Jun 21, 2019

Uh oh!

TomAugspurger commented Jun 21, 2019

Uh oh!

jreback commented Jun 21, 2019

Uh oh!

jreback commented Jun 21, 2019

Uh oh!

Uh oh!

TomAugspurger commented Jun 19, 2019 •

edited

Loading

codecov bot commented Jun 19, 2019 •

edited

Loading

TomAugspurger commented Jun 20, 2019 •

edited

Loading