DEPR: dropping nuisance columns in rolling aggregations #42834

jbrockmendel · 2021-07-31T19:14:21Z

closes DEPR: dropping nuisance columns in BaseWindow._apply #42738
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

we did this in 1.3 for DataFrame reductions and GroupBy reductions. AFAIK this is the last place we use this pattern.

mroeschke · 2021-08-01T05:04:05Z

pandas/tests/window/test_ewm.py

-    expected = df.ewm(halflife=1.0, min_periods=min_periods).mean()
+    with tm.assert_produces_warning(FutureWarning, match="nuisance columns"):
+        # GH#42738
+        result = df.ewm(halflife=halflife, min_periods=min_periods, times=times).mean()


Also should probably add a deprecation here too that allows times to be a string of a column label (which is already kinda flawed because a column label isn't always a string). A column of times will always be a nuisance column I think

pandas/pandas/core/window/ewm.py

Line 317 in a5f8c9a

if isinstance(self.times, str):

can this be done separately? this is less obvious to me

I think so, yes.

mroeschke · 2021-08-01T05:18:53Z

pandas/tests/window/test_numba.py

-        result = ewm.mean(engine="numba", engine_kwargs=engine_kwargs)
-        expected = ewm.mean(engine="cython")
+
+        # TODO: why only in these cases?


Because column "A" is string dtype and in the non-group case there's an attempt to agg over the column, while in the groupby case, the grouping column should be dropped internally before hand

mroeschke · 2021-08-01T05:19:59Z

doc/source/whatsnew/v1.4.0.rst

@@ -159,6 +159,7 @@ Deprecations
 - Deprecated treating ``numpy.datetime64`` objects as UTC times when passed to the :class:`Timestamp` constructor along with a timezone. In a future version, these will be treated as wall-times. To retain the old behavior, use ``Timestamp(dt64).tz_localize("UTC").tz_convert(tz)`` (:issue:`24559`)
 - Deprecated ignoring missing labels when indexing with a sequence of labels on a level of a MultiIndex (:issue:`42351`)
 - Creating an empty Series without a dtype will now raise a more visible ``FutureWarning`` instead of a ``DeprecationWarning`` (:issue:`30017`)
+- Deprecated dropping of nuisance columns in :class:`Rolling` aggregations (:issue:`42738`)


Good to include Expanding and EWM as well.

jreback

on board. @mroeschke suggestions + rebase

jbrockmendel · 2021-08-08T21:29:34Z

comments addressed + rebased + green

jreback · 2021-08-08T23:12:07Z

pandas/core/window/rolling.py

+        if 0 != len(new_mgr.items) != len(mgr.items):
+            # GH#42738 ignore_failures dropped nuisance columns
+            warnings.warn(
+                "Dropping of nuisance columns in rolling operations "


can you list the columns that are problematic in the error message?

updated + green

… into depr-window-nuisance

…2834)

jbrockmendel added 2 commits July 31, 2021 12:09

DEPR: dropping nuisance columns in rolling methods

121e2df

whatsnew

0c8f332

jbrockmendel added Nuisance Columns Identifying/Dropping nuisance columns in reductions, groupby.add, DataFrame.apply Window rolling, ewma, expanding labels Jul 31, 2021

mroeschke reviewed Aug 1, 2021

View reviewed changes

jreback added this to the 1.4 milestone Aug 5, 2021

jreback mentioned this pull request Aug 5, 2021

DEPR: log of deprecations in 1.x (to be removed in 2.0) #30228

Closed

jreback requested changes Aug 5, 2021

View reviewed changes

Merge branch 'master' into depr-window-nuisance

eb52049

Merge branch 'master' into depr-window-nuisance

ff82b73

jreback reviewed Aug 8, 2021

View reviewed changes

jbrockmendel added 2 commits August 8, 2021 21:03

add dropped columns to warning message

083d21b

Merge branch 'depr-window-nuisance' of github.com:jbrockmendel/pandas…

4bcbd40

… into depr-window-nuisance

mroeschke approved these changes Aug 10, 2021

View reviewed changes

jreback approved these changes Aug 10, 2021

View reviewed changes

jreback merged commit 93af2f0 into pandas-dev:master Aug 10, 2021

jbrockmendel deleted the depr-window-nuisance branch August 10, 2021 19:57

mroeschke mentioned this pull request Aug 27, 2021

DEPR: Passing in a string column label for DataFrame.ewm(times=...) #43265

Merged

4 tasks

feefladder pushed a commit to feefladder/pandas that referenced this pull request Sep 7, 2021

DEPR: dropping nuisance columns in rolling aggregations (pandas-dev#4…

e13f770

…2834)

jbrockmendel mentioned this pull request Oct 6, 2021

API/DEPR: numeric_only kwarg for apply/reductions #28900

Closed

mroeschke mentioned this pull request Jan 4, 2023

DEPR: Remove silent dropping of nuisance columns in window ops #50576

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DEPR: dropping nuisance columns in rolling aggregations #42834

DEPR: dropping nuisance columns in rolling aggregations #42834

jbrockmendel commented Jul 31, 2021

mroeschke Aug 1, 2021

jbrockmendel Aug 8, 2021

mroeschke Aug 8, 2021

mroeschke Aug 1, 2021

mroeschke Aug 1, 2021

jreback left a comment

jbrockmendel commented Aug 8, 2021

jreback Aug 8, 2021

jbrockmendel Aug 10, 2021

DEPR: dropping nuisance columns in rolling aggregations #42834

DEPR: dropping nuisance columns in rolling aggregations #42834

Conversation

jbrockmendel commented Jul 31, 2021

mroeschke Aug 1, 2021

Choose a reason for hiding this comment

jbrockmendel Aug 8, 2021

Choose a reason for hiding this comment

mroeschke Aug 8, 2021

Choose a reason for hiding this comment

mroeschke Aug 1, 2021

Choose a reason for hiding this comment

mroeschke Aug 1, 2021

Choose a reason for hiding this comment

jreback left a comment

Choose a reason for hiding this comment

jbrockmendel commented Aug 8, 2021

jreback Aug 8, 2021

Choose a reason for hiding this comment

jbrockmendel Aug 10, 2021

Choose a reason for hiding this comment