BUG: SeriesGroupBy.value_counts sorts when sort=False #50482

rhshadrach · 2022-12-29T05:10:04Z

In the 2nd example, the column gender is reverse-lexicographically sorted (male then female).

df = DataFrame(
    {
        "gender": ["male", "male", "female", "male", "female", "male"],
        "education": ["low", "medium", "high", "low", "high", "low"],
        "country": ["US", "FR", "US", "FR", "FR", "FR"],
    }
)
gb = df.groupby(["country", "gender"], as_index=True, sort=False)
result = gb.value_counts(sort=False)
print(result)
# country  gender  education
# US       male    low          1
# FR       male    medium       1
# US       female  high         1
# FR       male    low          2
#          female  high         1
# dtype: int64

gb2 = df.groupby(["country", "gender"], as_index=True, sort=False)["education"]
result2 = gb2.value_counts(sort=False)
print(result2)
# country  gender  education
# US       male    low          1
# FR       male    low          2
#                  medium       1
# US       female  high         1
# FR       female  high         1
# Name: education, dtype: int64

The text was updated successfully, but these errors were encountered:

rhshadrach added Bug Groupby Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff labels Dec 29, 2022

rhshadrach self-assigned this Dec 29, 2022

rhshadrach mentioned this issue Jan 4, 2023

BUG/PERF: SeriesGroupBy.value_counts sorting bug and categorical performance #50548

Merged

6 tasks

rhshadrach added this to the 2.0 milestone Jan 11, 2023

rhshadrach closed this as completed in #50548 Jan 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: SeriesGroupBy.value_counts sorts when sort=False #50482

BUG: SeriesGroupBy.value_counts sorts when sort=False #50482

rhshadrach commented Dec 29, 2022

BUG: SeriesGroupBy.value_counts sorts when sort=False #50482

BUG: SeriesGroupBy.value_counts sorts when sort=False #50482

Comments

rhshadrach commented Dec 29, 2022