BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618) #42640

neelmraman · 2021-07-20T22:58:32Z

closes BUG: SeriesGroupBy.value_counts() throws IndexError if there is only one group #42618
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

Was introduced in v1.3.0 here. I couldn't find a reason why len(lchanges) should be used and the unit tests still passed with len(val) instead.

…pandas-dev#42618)

rhshadrach

Thanks for the PR! Code change lgtm, some minor requests. As you identified (thanks for that!), this is a regression so should be fixed in 1.3.1 if we can get it in (otherwise, will be 1.3.2).

rhshadrach · 2021-07-23T03:14:05Z

doc/source/whatsnew/v1.4.0.rst

@@ -258,7 +258,7 @@ Groupby/resample/rolling
 ^^^^^^^^^^^^^^^^^^^^^^^^
 - Fixed bug in :meth:`SeriesGroupBy.apply` where passing an unrecognized string argument failed to raise ``TypeError`` when the underlying ``Series`` is empty (:issue:`42021`)
 - Bug in :meth:`Series.rolling.apply`, :meth:`DataFrame.rolling.apply`, :meth:`Series.expanding.apply` and :meth:`DataFrame.expanding.apply` with ``engine="numba"`` where ``*args`` were being cached with the user passed function (:issue:`42287`)
-
+- Fixed bug in :meth:`SeriesGroupBy.value_counts` when the DataFrame/Series you are grouping has one row (:issue:`42618`)


Only Series, I think. Even if you start with a DataFrame/DataFrameGroupBy object and then subset, this method is only acting on a Series.

Also, move to 1.3.1

rhshadrach · 2021-07-23T03:16:50Z

pandas/tests/groupby/test_value_counts.py

+    result = dfg["B"].value_counts()
+    expected = df.value_counts()
+
+    tm.assert_series_equal(result, expected, check_names=False)


Can you rename to the expected value instead of not checking

rhshadrach · 2021-07-23T03:18:34Z

pandas/tests/groupby/test_value_counts.py

+
+    tm.assert_series_equal(result, expected, check_names=False)
+
+    df = DataFrame([[1, 2, 3]], columns=["A", "B", "C"])


Parametrize the test instead, e.g.

pytest.mark.parametrize("columns", [["A", "B"], ["A", "B", "C"]])

then use data=range(len(columns)), and groupby(columns)[:-1]

rhshadrach

Clicked the wrong button... :)

rhshadrach

lgtm

…nts when DataFrame has one row (pandas-dev#42618)

…ataFrame has one row (#42618) (#42696) Co-authored-by: neelmraman <[email protected]>

…pandas-dev#42618) (pandas-dev#42640)

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (…

ff73cec

…pandas-dev#42618)

rhshadrach added the Regression Functionality that used to work in a prior pandas version label Jul 23, 2021

rhshadrach approved these changes Jul 23, 2021

View reviewed changes

rhshadrach requested changes Jul 23, 2021

View reviewed changes

simonjayhawkins added this to the 1.3.1 milestone Jul 23, 2021

add whatsnew entry (pandas-dev#42618)

0a10eec

neelmraman force-pushed the groupby_value_counts_42618 branch from 8206ca7 to 0a10eec Compare July 24, 2021 02:04

Merge branch 'master' into groupby_value_counts_42618

bbf362a

rhshadrach approved these changes Jul 24, 2021

View reviewed changes

rhshadrach added Groupby Bug labels Jul 24, 2021

rhshadrach merged commit baf9e4b into pandas-dev:master Jul 24, 2021

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Jul 24, 2021

Backport PR pandas-dev#42640: BUG: Fix bug in SeriesGroupBy.value_cou…

0c129a8

…nts when DataFrame has one row (pandas-dev#42618)

meeseeksmachine mentioned this pull request Jul 24, 2021

Backport PR #42640 on branch 1.3.x (BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618)) #42696

Merged

simonjayhawkins pushed a commit that referenced this pull request Jul 24, 2021

Backport PR #42640: BUG: Fix bug in SeriesGroupBy.value_counts when D…

176e8d3

…ataFrame has one row (#42618) (#42696) Co-authored-by: neelmraman <[email protected]>

CGe0516 pushed a commit to CGe0516/pandas that referenced this pull request Jul 29, 2021

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (…

e5b2692

…pandas-dev#42618) (pandas-dev#42640)

feefladder pushed a commit to feefladder/pandas that referenced this pull request Sep 7, 2021

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (…

f8f281d

…pandas-dev#42618) (pandas-dev#42640)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618) #42640

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618) #42640

Uh oh!

neelmraman commented Jul 20, 2021 •

edited

Loading

Uh oh!

rhshadrach left a comment

Uh oh!

rhshadrach Jul 23, 2021

Uh oh!

rhshadrach Jul 23, 2021

Uh oh!

rhshadrach Jul 23, 2021

Uh oh!

rhshadrach Jul 23, 2021

Uh oh!

rhshadrach left a comment

Uh oh!

rhshadrach left a comment

Uh oh!

Uh oh!


		tm.assert_series_equal(result, expected, check_names=False)

		df = DataFrame([[1, 2, 3]], columns=["A", "B", "C"])

Uh oh!

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618) #42640

BUG: Fix bug in SeriesGroupBy.value_counts when DataFrame has one row (#42618) #42640

Uh oh!

Conversation

neelmraman commented Jul 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rhshadrach left a comment

Choose a reason for hiding this comment

Uh oh!

rhshadrach Jul 23, 2021

Choose a reason for hiding this comment

Uh oh!

rhshadrach Jul 23, 2021

Choose a reason for hiding this comment

Uh oh!

rhshadrach Jul 23, 2021

Choose a reason for hiding this comment

Uh oh!

rhshadrach Jul 23, 2021

Choose a reason for hiding this comment

Uh oh!

rhshadrach left a comment

Choose a reason for hiding this comment

Uh oh!

rhshadrach left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

neelmraman commented Jul 20, 2021 •

edited

Loading