DOC: GH5456 Adding workaround info on NA / NaT handling for groupby #47337

aamnv · 2022-06-13T20:45:41Z

closes DOC: update groupby NA group handing / workaround #5456
[NA] Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
[NA] Added type annotations to new arguments/methods/functions.
[NA] Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

Took a stab at this by parsing through the initial issue and related topics - let me know if the intent was different, happy to edit.

mroeschke · 2022-06-13T20:58:01Z

doc/source/user_guide/groupby.rst

@@ -1268,6 +1268,11 @@ automatically excluded. In other words, there will never be an "NA group" or
 generally discarding the NA group anyway (and supporting it was an
 implementation headache).

+.. note::
+   If you need to include NaN or NaT values in your grouping, you can workaround the automatic exclusion by replacing
+   the NaN / NaT values with a placeholder string. For example, you can use ``df.fillna("default", inplace=True)`` to


Thanks for the PR!

Could you demonstrate this by including a .. ipython:: python example (like above)

In the example, could you exclude inplace=True? We are trying to discourage that keyword generally

Can do on both!

Do you think it makes sense to pull it out of a note and just include it in line with the rest of the text if we're including an ipython?

Yeah sure that sounds reasonable

phofl

I might be missing something here, but these days you can simply set dropna=False, which would include the NaNs



df = pd.DataFrame({"a": [1, np.nan, 2], "b": 3})

print(df.groupby("a", dropna=False).sum())

returns

     b
a     
1.0  3
2.0  3
NaN  3

mroeschke · 2022-06-14T16:35:44Z

Good point @phofl. Looks we also have an example in the groupby docstring demonstrating this behavior so I don't think this workaround explanation is needed.

Thanks for the PR @aamnv but going to close since I think this old issue is not actionable anymore, but feel free to address any of the other good first issue issues!

DOC: GH5456 Adding workaround info on NA / NaT handling for groupby

5a77eba

mroeschke reviewed Jun 13, 2022

View reviewed changes

phofl reviewed Jun 14, 2022

View reviewed changes

mroeschke closed this Jun 14, 2022

mroeschke mentioned this pull request Jun 14, 2022

DOC: update groupby NA group handing / workaround #5456

Closed

aamnv deleted the doc-update-groupby-na-handling branch June 14, 2022 18:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: GH5456 Adding workaround info on NA / NaT handling for groupby #47337

DOC: GH5456 Adding workaround info on NA / NaT handling for groupby #47337

aamnv commented Jun 13, 2022

mroeschke Jun 13, 2022

aamnv Jun 13, 2022

mroeschke Jun 13, 2022

phofl left a comment

mroeschke commented Jun 14, 2022

DOC: GH5456 Adding workaround info on NA / NaT handling for groupby #47337

DOC: GH5456 Adding workaround info on NA / NaT handling for groupby #47337

Conversation

aamnv commented Jun 13, 2022

mroeschke Jun 13, 2022

Choose a reason for hiding this comment

aamnv Jun 13, 2022

Choose a reason for hiding this comment

mroeschke Jun 13, 2022

Choose a reason for hiding this comment

phofl left a comment

Choose a reason for hiding this comment

mroeschke commented Jun 14, 2022