TST: add tests to validate margin results for pivot (#25815) #27245

peterpanmj · 2019-07-05T08:27:48Z

[x] closes #25815 (wrong aggregated results when pivot_table index is ordered categorical data)
[x] tests added / passed
[x] passes git diff upstream/master -u -- "*.py" | flake8 --diff
[ ] whatsnew entry

WillAyd

I think this was fixed as part of #27071. I didn't see a specific test for it at first glance, but if you can double-check, that would be great!

WillAyd · 2019-07-05T15:26:41Z

pandas/tests/reshape/test_pivot.py

+                                   aggfunc='sum', margins=True)
+
+        result = pivot_tab['All']
+        expected = pivot_tab.iloc[:, :-1].sum(axis=1)


Can you just construct this manually? I think generating the result and the expected values from the same variable can let bugs pass through undetected.
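To illustrate the concern, here is a minimal sketch with made-up data. The `buggy` frame, its labels and the `passes` helper are hypothetical and do not come from the PR. The frame stands in for the output of a pivot whose cells and margins are all wrong in the same way, so an expectation derived from that table agrees with it, while a hand-written expectation does not.

```python
import pandas as pd


def passes(result, expected):
    """Return True if assert_series_equal accepts the pair, else False."""
    try:
        pd.testing.assert_series_equal(result, expected)
    except AssertionError:
        return False
    return True


index = pd.Index(["x", "y", "All"], name="C")

# Hypothetical output of a buggy pivot: every cell, margins included, is doubled.
# The correct row totals would be 3, 7 and 10.
buggy = pd.DataFrame(
    {"a": [2, 6, 8], "b": [4, 8, 12], "All": [6, 14, 20]}, index=index
)

result = buggy["All"]

# Expected value derived from the same table: it inherits the bug.
derived = buggy.iloc[:, :-1].sum(axis=1).rename("All")
derived_check = passes(result, derived)

# Expected value written out by hand: independent of the code under test.
manual = pd.Series([3, 7, 10], index=index, name="All")
manual_check = passes(result, manual)

print(derived_check, manual_check)
```

The derived comparison passes and the manual one fails, which is the situation a hand-constructed expected value is meant to catch.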

WillAyd · 2019-07-05T15:26:55Z

pandas/tests/reshape/test_pivot.py

+
+        result = pivot_tab['All']
+        expected = pivot_tab.iloc[:, :-1].sum(axis=1)
+        tm.assert_series_equal(result, expected, check_dtype=False,


Can you remove check_dtype and check_names?
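For context, a small sketch of what these two flags relax. The series below are invented for illustration. With `check_dtype=False` and `check_names=False`, `assert_series_equal` accepts series that differ in dtype and name; with the defaults, the same pair is rejected.

```python
import pandas as pd

left = pd.Series([3, 7, 10], name="All", dtype="int64")
# Same values, but float64 and a different name.
right = pd.Series([3.0, 7.0, 10.0], name="total")

# Relaxed comparison: dtype and name differences are ignored.
try:
    pd.testing.assert_series_equal(
        left, right, check_dtype=False, check_names=False
    )
    relaxed_ok = True
except AssertionError:
    relaxed_ok = False

# Default comparison: dtype and name are both checked.
try:
    pd.testing.assert_series_equal(left, right)
    strict_ok = True
except AssertionError:
    strict_ok = False

print(relaxed_ok, strict_ok)
```

Dropping the flags therefore makes the test also pin down the dtype and name of the margin column.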

simonjayhawkins · 2019-07-06T11:27:27Z

See #27233 for details on the black code style; applying it should resolve the CI failure.

simonjayhawkins

@peterpanmj changes look good. Just a couple more and I think we're done. @WillAyd

simonjayhawkins · 2019-07-08T10:47:43Z

pandas/tests/reshape/test_pivot.py

+        ordered_cat = pd.IntervalIndex.from_arrays([0, 0, 1, 1], [1, 1, 2, 2])
+        df = pd.DataFrame(
+            {
+                "A": np.arange(4, 0, -1).astype('int32'),


Suggested change

-                "A": np.arange(4, 0, -1).astype('int32'),
+                "A": np.arange(4, 0, -1, dtype=np.intp),
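As a quick illustration of the difference between the two spellings: both produce the values 4, 3, 2, 1, but the first pins the column to 32-bit integers while the second uses NumPy's pointer-sized integer, whose width depends on the platform. The variable names below are illustrative only.

```python
import numpy as np

# Original spelling: build with the default integer type, then cast to int32.
fixed = np.arange(4, 0, -1).astype("int32")

# Suggested spelling: request the platform's pointer-sized integer directly.
native = np.arange(4, 0, -1, dtype=np.intp)

same_values = np.array_equal(fixed, native)
print(fixed.dtype, native.dtype, same_values)
```

This is only a reading of the suggestion; the thread itself does not state the motivation beyond the later commit message "force to use same dtype in tests".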

simonjayhawkins · 2019-07-08T10:48:45Z

pandas/tests/reshape/test_pivot.py

+        )
+
+        result = pivot_tab["All"]
+        expected = pd.Series([3, 7, 10], index=result.index, name="All", dtype="int32")


Suggested change

-        expected = pd.Series([3, 7, 10], index=result.index, name="All", dtype="int32")
+        expected = Series(
+            [3, 7, 10],
+            index=Index([pd.Interval(0, 1), pd.Interval(1, 2), "All"], name="C"),
+            name="All",
+        )

peterpanmj · 2019-07-09

Why not use the index from result? That seems easier.

simonjayhawkins · 2019-07-09

I'm not sure that would be in the spirit of constructing the expected result explicitly, as suggested in #27245 (comment).
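Putting the fragments from this thread together, a test along these lines might look like the sketch below. Only the `"A"` column, the `IntervalIndex`, the `aggfunc`/`margins` arguments and the expected values and index come from the quoted hunks. The `"B"` column, the way `"C"` is built from `ordered_cat`, the explicit `observed=False`, and `dtype=np.intp` on the expected series are assumptions made to get a runnable example.

```python
import numpy as np
import pandas as pd

ordered_cat = pd.IntervalIndex.from_arrays([0, 0, 1, 1], [1, 1, 2, 2])
df = pd.DataFrame(
    {
        "A": np.arange(4, 0, -1, dtype=np.intp),
        # Assumed column used for the pivot columns.
        "B": ["a", "b", "a", "b"],
        # Assumed construction: an ordered categorical stored in reverse order.
        "C": pd.Categorical(ordered_cat, ordered=True).sort_values(ascending=False),
    }
)

pivot_tab = pd.pivot_table(
    df,
    index="C",
    columns="B",
    values="A",
    aggfunc="sum",
    margins=True,
    observed=False,  # assumption: spelled out to avoid relying on the default
)

result = pivot_tab["All"]

# Expected margin built by hand, without reading anything back from `result`.
expected = pd.Series(
    [3, 7, 10],
    index=pd.Index([pd.Interval(0, 1), pd.Interval(1, 2), "All"], name="C"),
    name="All",
    dtype=np.intp,
)

pd.testing.assert_series_equal(result, expected)
```

Because the index is spelled out, the comparison also checks that each margin value sits next to the right interval, which is what the original report in #25815 was about.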

peterpanmj force-pushed the add_pivot_test branch from 532860e to afe8ab8 Compare July 5, 2019 09:37

TST: add tests to validate margin results for pivot (pandas-dev#25815)

c5f4df6

peterpanmj force-pushed the add_pivot_test branch from afe8ab8 to c5f4df6 Compare July 5, 2019 10:12

WillAyd requested changes Jul 5, 2019

WillAyd added the Categorical and Testing labels Jul 5, 2019

simonjayhawkins added the Reshaping label Jul 5, 2019

simonjayhawkins reviewed Jul 5, 2019

peterpanmj added 2 commits July 8, 2019 17:27

fix style errors, construct expected results manually

7547903

force to use same dtype in tests

0c8a66c

simonjayhawkins added this to the 0.25.0 milestone Jul 8, 2019

simonjayhawkins requested changes Jul 8, 2019

fix style errors in pivot test

c536e78

simonjayhawkins approved these changes Jul 9, 2019

jreback merged commit dc5a848 into pandas-dev:master Jul 9, 2019

peterpanmj deleted the add_pivot_test branch January 31, 2023 05:15
