BUG: RollingGroupby ignored as_index=False #40789

mroeschke · 2021-04-05T18:18:30Z

closes BUG: groupby.rolling: Originial index is not being preserved when using date_part of DatetimeIndex and as_index key word seems to have no effect #39433
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

…ng_as_index

phofl · 2021-04-06T11:40:11Z

Does this fix #31007? Haven't looked closely but seen this in the past

…ng_as_index

mroeschke · 2021-04-06T15:40:10Z

@phofl unfortunately doesnt look like it

In [1]: import pandas as pd
   ...:
   ...: data = {
   ...:     'groupby_col': ['A', 'A', 'A', 'A', 'A', 'B', 'B', 'B', 'B', 'B', ],
   ...:     'agg_col': [1, 1, 0, 1, 0, 0, 0, 0, 1, 0],
   ...: }
   ...: df = pd.DataFrame(data)
   ...: df.groupby(['groupby_col'], as_index=False).rolling(4).agg({'agg_col': 'mean'})
Out[1]:
               agg_col
groupby_col
A           0      NaN
            1      NaN
            2      NaN
            3     0.75
            4     0.50
B           5      NaN
            6      NaN
            7      NaN
            8     0.25
            9     0.25

agg still goes partially through groupby code I think to calculate the index result.

phofl · 2021-04-06T15:44:31Z

Pitty, thanks for checking

rhshadrach · 2021-04-08T02:37:03Z

pandas/core/window/rolling.py

@@ -619,6 +621,8 @@ def _apply(
        )

        result.index = result_index
+        if not self._as_index:
+            result = result.reset_index(level=list(range(len(groupby_keys))))


What happens here when the groupby is on an explicit list, e.g. in your test use groupby(["A", "A", "B", "B"]) instead. What is groupby_keys in this case?

Here's the result

In [4]: df.groupby(["A", "A", "B", "B"], as_index=False).rolling(window=2, min_periods=1).mean() > /Users/matthewroeschke/pandas-mroeschke/pandas/core/window/rolling.py(580)_apply() -> result_index_names = groupby_keys + grouped_index_name (Pdb) groupby_keys [None] (Pdb) c Out[4]: level_0 num date 2018-01-01 A 100.0 2018-01-02 A 150.0 2018-01-01 B 150.0 2018-01-02 B 200.0 In [5]: df.groupby(["A", "A", "B", "B"], as_index=False).mean() Out[5]: num 0 150.0 1 200.0

Not sure if the normal groupby has the expected result but appears that groupby.rolling brings the list into the dataframe as a column

…ng_as_index

jreback · 2021-04-08T14:39:12Z

pandas/tests/window/test_groupby.py

@@ -732,6 +732,42 @@ def test_groupby_level(self):
        )
        tm.assert_series_equal(result, expected)

+    def test_as_index_false(self):


can you add the multi-key groupby here as well (parameterize if you can)

df_res2 = df.groupby([df.id, df.index.weekday], as_index=False).rolling(window=2, min_periods=1).mean() df_res3 = df.groupby([df.id]).rolling(window=2, min_periods=1).mean() df_res4 = df.groupby([df.id], as_index=False).rolling(window=2, min_periods=1).mean()

e.g. 2 & 4 (we likley have 1 & 3 covered, but wouldn't object to those included as well)

Hmm all these tested (except the as_index=True cases which are tested everywhere else). So you just want it parameterized?

if they are elsewhere then don't (just the cases that is covering in this PR are fine). parameterize if you can.

…ng_as_index

jreback · 2021-04-09T19:42:51Z

thanks @mroeschke

mroeschke added 4 commits April 4, 2021 00:03

Add as_index before 40701 is merged

90498f7

Merge remote-tracking branch 'upstream/master' into bug/groupby_rolli…

76f9202

…ng_as_index

Add test for as_index behavior

37e53f3

Specifically mention False

0626125

mroeschke added Bug Groupby Window rolling, ewma, expanding labels Apr 5, 2021

mroeschke added this to the 1.3 milestone Apr 5, 2021

Merge remote-tracking branch 'upstream/master' into bug/groupby_rolli…

9a99b5b

…ng_as_index

rhshadrach reviewed Apr 8, 2021

View reviewed changes

Merge remote-tracking branch 'upstream/master' into bug/groupby_rolli…

d5f272f

…ng_as_index

jreback reviewed Apr 8, 2021

View reviewed changes

mroeschke added 2 commits April 8, 2021 13:27

Parameterize

3fa8101

Merge remote-tracking branch 'upstream/master' into bug/groupby_rolli…

f70d381

…ng_as_index

jreback merged commit cd2aaa3 into pandas-dev:master Apr 9, 2021

mroeschke deleted the bug/groupby_rolling_as_index branch April 10, 2021 05:24

JulianWgs pushed a commit to JulianWgs/pandas that referenced this pull request Jul 3, 2021

BUG: RollingGroupby ignored as_index=False (pandas-dev#40789)

343e4ba

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: RollingGroupby ignored as_index=False #40789

BUG: RollingGroupby ignored as_index=False #40789

mroeschke commented Apr 5, 2021

phofl commented Apr 6, 2021

mroeschke commented Apr 6, 2021

phofl commented Apr 6, 2021

rhshadrach Apr 8, 2021

mroeschke Apr 8, 2021 •

edited

Loading

jreback Apr 8, 2021

mroeschke Apr 8, 2021

jreback Apr 8, 2021

jreback commented Apr 9, 2021

BUG: RollingGroupby ignored as_index=False #40789

BUG: RollingGroupby ignored as_index=False #40789

Conversation

mroeschke commented Apr 5, 2021

phofl commented Apr 6, 2021

mroeschke commented Apr 6, 2021

phofl commented Apr 6, 2021

rhshadrach Apr 8, 2021

Choose a reason for hiding this comment

mroeschke Apr 8, 2021 • edited Loading

Choose a reason for hiding this comment

jreback Apr 8, 2021

Choose a reason for hiding this comment

mroeschke Apr 8, 2021

Choose a reason for hiding this comment

jreback Apr 8, 2021

Choose a reason for hiding this comment

jreback commented Apr 9, 2021

mroeschke Apr 8, 2021 •

edited

Loading