BUG: any/all not returning booleans for object type #41102
Conversation
@@ -486,6 +486,12 @@ def nanany(
    False
    """
    values, _, _, _, _ = _get_values(values, skipna, fill_value=False, mask=mask)

    # For object type, any won't necessarily return
can you do this in _get_values?
Right now _get_values is unaware of the nanop it is being used for, so it would need slight refactoring to allow that. While the current solution duplicates the workaround, I like it better than adding it in _get_values, since the workaround only affects any/all. I think keeping this weird condition out of a path that all other nanops hit makes sense.
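For context, a minimal sketch of the behavior being worked around, assuming the object-dtype reduction semantics this PR targets (the coercion shown is a simplification, not the PR's exact code):

```python
import numpy as np

values = np.array([0, "", "x"], dtype=object)

# On the NumPy versions this targets, any() over an object array can hand
# back the reduced element itself ("x") rather than a boolean, because
# logical_or falls back to Python `or` semantics for object dtype.
raw = values.any()
print(raw, type(raw))

# The workaround kept inside nanany/nanall is conceptually just a
# dtype-conditional cast of the result:
result = values.any(axis=None)
if values.dtype == object:
    result = bool(result)  # simplified stand-in for the actual coercion
print(result, type(result))
```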
        assert_bool_op_api(
            opname, bool_frame_with_na, float_string_frame, has_bool_only=True
        )

    @pytest.mark.xfail(reason="GH12863: numpy result won't match for object type")
    @pytest.mark.parametrize("opname", ["any", "all"])
so this is trying to match numpy? do we need this anymore?
I doubt the original purpose was matching numpy, probably more for adding tests without having to hardcode an expected result. For now I modified this slightly to remove the need for the xfail, but probably could be removed (there's coverage, but it's scattered).
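A rough illustration of the mismatch (hypothetical snippet, not taken from the test; exact values and the NumPy behavior shown are assumptions that may vary by version):

```python
import pandas as pd

ser = pd.Series(["a", "b"], dtype=object)

# With this fix, the pandas reduction coerces to a boolean...
pandas_result = ser.any()

# ...while the raw NumPy reduction over object dtype may return an element
# ("a") instead of True, so a direct comparison against numpy can't pass.
numpy_result = ser.to_numpy().any()

print(pandas_result, numpy_result)
```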
@@ -1108,6 +1112,23 @@ def test_any_all_extra(self):
        result = df[["C"]].all(axis=None).item()
        assert result is True

    @pytest.mark.parametrize("axis", [0, 1])
do we have testing of the skipna=True case?
Some scattered testing elsewhere; added skipna parameterization in this test.
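As a sketch, something along these lines (the test name, frame contents, and assertions are illustrative, not the PR's actual test):

```python
import numpy as np
import pandas as pd
import pytest


@pytest.mark.parametrize("axis", [0, 1])
@pytest.mark.parametrize("skipna", [True, False])
@pytest.mark.parametrize("opname", ["any", "all"])
def test_any_all_object_dtype(axis, skipna, opname):
    df = pd.DataFrame(np.ones((3, 3), dtype=object), columns=list("abc"))
    result = getattr(df, opname)(axis=axis, skipna=skipna)
    # The behavior under discussion: object-dtype reductions should come
    # back as booleans for both skipna paths.
    assert result.dtype == bool
    assert result.all()
```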
@@ -270,6 +270,7 @@ def _badobj_wrap(self, value, func, allow_complex=True, **kwargs):
        value = value.astype("f8")
        return func(value, **kwargs)

    @pytest.mark.xfail(reason="GH12863: numpy result won't match for object type")
do we need this? (same reason as above)
Similar response - I think it was more for coverage than for matching numpy. Since the nanop gets indirectly tested through all other any/all reductions, I think it could be removed.
thanks @mzeitlin11
Also allows removing some of the any/all specific boolean conversion logic in frame reduction code, which (hopefully) also fixes part of #41079.
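For reference, a sketch of the user-facing behavior being fixed (the example frame and expected outputs are assumptions based on the issue title, not taken from the PR):

```python
import pandas as pd

df = pd.DataFrame({"a": ["x", "y"], "b": [1, 0]}, dtype=object)

# With the fix, any/all over object columns should return booleans
# rather than the underlying objects.
print(df.any())        # expected: a True, b True
print(df.all())        # expected: a True, b False
print(df.any().dtype)  # expected: bool
```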