DOC: Adding examples to DataFrameGroupBy.rank #38972 #42402

debnathshoham · 2021-07-06T12:12:25Z

closes pandas.core.groupby.DataFrameGroupBy.rank() dense_rank does not work #38972
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

pep8speaks · 2021-07-06T12:12:28Z

Hello @debnathshoham! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-07-11 19:41:09 UTC

mzeitlin11

Thanks for the pr @debnathshoham! Generally looks like a great improvement, some comments. Can you also post a screenshot of the new built docs to make sure everything looks as expected (see https://pandas.pydata.org/docs/development/contributing_documentation.html#how-to-build-the-pandas-documentation)

mzeitlin11 · 2021-07-06T15:12:13Z

pandas/core/groupby/groupby.py

+
+        See Also
+        --------
+        Series.groupby : Apply a function groupby to a Series.


I know that other groupby docs include Series/DataFrame .groupby in the See Also, but IMO they're not helpful (especially since they don't link to anything).

Have opened #42406 for this

mzeitlin11 · 2021-07-06T15:13:01Z

pandas/core/groupby/groupby.py

+        6     b   0.21
+        7     b   0.40
+        8     b   0.01
+        9     a   0.20


I think the example would be easier to see how different groups are treated if groups are contiguous, eg a, a, a, a...b, b, b, b

Also think it would be clearer to have fewer distinct values (and maybe use ints instead of floats, with values that are easy to tell at a glance what is smallest, largest, etc

mzeitlin11 · 2021-07-06T15:14:57Z

pandas/core/groupby/groupby.py

+        >>> df['min_rank'] = df.groupby('group')['value'].rank('min')
+        >>> df['max_rank'] = df.groupby('group')['value'].rank('max')
+        >>> df['dense_rank'] = df.groupby('group')['value'].rank('dense')
+        >>> df['first_rank'] = df.groupby('group')['value'].rank('first')


This might be clearer as is, but could be written more concisely along the lines of

for method in ["average", ..., "first"]: df[f"{method}_rank"] = df.groupby("group")["value"].rank(method)

debnathshoham · 2021-07-06T16:04:01Z

Hi @mzeitlin11 , just made the suggested changes. I have also attached how it looks now. Please let me know if this looks fine.

debnathshoham · 2021-07-08T14:37:41Z

please review

mzeitlin11 · 2021-07-09T21:02:23Z

pandas/core/groupby/groupby.py

+        DataFrame.groupby : Apply a function groupby
+            to each row or column of a DataFrame.
+        Series.rank : Apply a function rank to a Series.
+        DataFrame.rank : Apply a function rank


Based on the screenshot you posted, looks like this doesn't render as a link, so not that useful in current form. I think best to keep scope small and remove changes to the See Also (which could then be tackled as part of #42406 if you're interested!).

reverted the change on See Also

mzeitlin11 · 2021-07-09T21:03:34Z

pandas/core/groupby/groupby.py

+        --------
+        >>> df = pd.DataFrame({'group': ['a', 'a', 'a', 'a',
+        ...                              'a', 'b', 'b', 'b', 'b', 'b'],
+        ...                    'value': [2, 4, 2, 3, 5, 1, 2, 4, 1, 5]})


The formatting of this looks a bit awkward, can you use black on this (or at least would be clearer if a's were all on same line

updated with black

This is how it looks now.

mzeitlin11 · 2021-07-09T21:04:12Z

please review

Sorry for later review, pandas is volunteer, so review may sometimes take a while :) Left some small comments, otherwise LGTM!

… groupbyrank-doc

mroeschke · 2021-07-11T19:24:09Z

@debnathshoham of note: The docstring validation is failing

Error: /home/runner/work/pandas/pandas/pandas/core/groupby/groupby.py:2638:GL07:pandas.core.groupby.DataFrameGroupBy.rank:Sections are in the wrong order. Correct order is: Parameters, Returns, See Also, Examples

debnathshoham · 2021-07-11T19:42:35Z

thanks @mroeschke . I believe it would be fixed now.

mzeitlin11

LGTM, thanks @debnathshoham!

mroeschke · 2021-07-12T19:32:05Z

Thanks @debnathshoham

DOC: Adding examples to DataFrameGroupBy.rank pandas-dev#38972

adbdcb1

DOC: Adding examples to DataFrameGroupBy.rank pandas-dev#38972

ff71390

mzeitlin11 reviewed Jul 6, 2021

View reviewed changes

mzeitlin11 added Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Docs Groupby labels Jul 6, 2021

DOC: made the suggested changes

ab42746

mzeitlin11 reviewed Jul 9, 2021

View reviewed changes

debnathshoham added 5 commits July 11, 2021 12:26

Merge branch 'master' of https://github.com/debnathshoham/pandas into…

76e4df1

… groupbyrank-doc

DOC: changed as suggested

1b2dde5

DOC: changed as suggested

81000b7

DOC: updating with black output

98b90c6

DOC: updating with black output

ed08832

DOC: corrected docstring order

0032ae4

debnathshoham requested a review from mzeitlin11 July 12, 2021 18:04

mzeitlin11 approved these changes Jul 12, 2021

View reviewed changes

mroeschke added this to the 1.4 milestone Jul 12, 2021

mroeschke approved these changes Jul 12, 2021

View reviewed changes

mroeschke merged commit 4c90215 into pandas-dev:master Jul 12, 2021

debnathshoham deleted the groupbyrank-doc branch July 12, 2021 19:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: Adding examples to DataFrameGroupBy.rank #38972 #42402

DOC: Adding examples to DataFrameGroupBy.rank #38972 #42402

debnathshoham commented Jul 6, 2021 •

edited

Loading

pep8speaks commented Jul 6, 2021 •

edited

Loading

mzeitlin11 left a comment

mzeitlin11 Jul 6, 2021

mzeitlin11 Jul 6, 2021

mzeitlin11 Jul 6, 2021

mzeitlin11 Jul 6, 2021

mzeitlin11 Jul 6, 2021

debnathshoham commented Jul 6, 2021

debnathshoham commented Jul 8, 2021

mzeitlin11 Jul 9, 2021

debnathshoham Jul 11, 2021

mzeitlin11 Jul 9, 2021

debnathshoham Jul 11, 2021

debnathshoham Jul 11, 2021

mzeitlin11 commented Jul 9, 2021 •

edited

Loading

mroeschke commented Jul 11, 2021

debnathshoham commented Jul 11, 2021

mzeitlin11 left a comment

mroeschke commented Jul 12, 2021

+b   0.21
+b   0.40
+b   0.01
+a   0.20

DOC: Adding examples to DataFrameGroupBy.rank #38972 #42402

DOC: Adding examples to DataFrameGroupBy.rank #38972 #42402

Conversation

debnathshoham commented Jul 6, 2021 • edited Loading

pep8speaks commented Jul 6, 2021 • edited Loading

Comment last updated at 2021-07-11 19:41:09 UTC

mzeitlin11 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

debnathshoham commented Jul 6, 2021

debnathshoham commented Jul 8, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mzeitlin11 commented Jul 9, 2021 • edited Loading

mroeschke commented Jul 11, 2021

debnathshoham commented Jul 11, 2021

mzeitlin11 left a comment

Choose a reason for hiding this comment

mroeschke commented Jul 12, 2021

debnathshoham commented Jul 6, 2021 •

edited

Loading

pep8speaks commented Jul 6, 2021 •

edited

Loading

mzeitlin11 commented Jul 9, 2021 •

edited

Loading