BUG: GroupBy.quantile fails with pd.NA #43150

debnathshoham · 2021-08-21T07:48:39Z

closes BUG: GroupBy's quantile incompatible with pd.NA #42849
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

jreback

small comment, ping on green

pandas/core/groupby/groupby.py

pandas/tests/groupby/test_quantile.py

debnathshoham · 2021-08-21T18:23:02Z

@jreback @jbrockmendel Green

pandas/core/groupby/groupby.py

debnathshoham · 2021-08-30T19:46:05Z

Hi @jreback , @jbrockmendel - could you please take a look if this looks fine

pandas/core/groupby/groupby.py

jbrockmendel · 2021-08-31T21:49:22Z

pandas/tests/groupby/test_quantile.py

+
+def test_groupby_quantile_allNA_column():
+    # GH#42849
+    df = DataFrame({"x": [1, 1], "y": [pd.NA] * 2}, dtype="Float64")


any reason for this to be Float64 instead of any_foo_dtype?

not sure what any_foo_dtype means.
In case it means why the explicit dtype? then without explicitly defining with nullable dtypes, the columns becomes object, and quantile fails (maybe I am wrong, but I believe this is the expected behaviour). Below on master.

In [3]: import pandas as pd In [4]: pd.__version__ Out[4]: '1.4.0.dev0+540.ga826be1f61' In [5]: DataFrame({"x": [1, 1], "y": [pd.NA] * 2}).dtypes Out[5]: x int64 y object dtype: object In [6]: DataFrame({"x": [1, 1], "y": [pd.NA] * 2}, dtype=float).dtypes <ipython-input-6-4731f064b6c6>:1: FutureWarning: Could not cast to float64, falling back to object. This behavior is deprecated. In a future version, when a dtype is passed to 'DataFrame', either all columns will be cast to that dtype, or a TypeError will be raised. DataFrame({"x": [1, 1], "y": [pd.NA] * 2}, dtype=float).dtypes Out[6]: x float64 y object dtype: object

jbrockmendel · 2021-08-31T21:50:28Z

pandas/tests/groupby/test_quantile.py

+def test_groupby_quantile_NA_float(any_float_dtype):
+    # GH#42849
+    df = DataFrame({"x": [1, 1], "y": [0.2, np.nan]}, dtype=any_float_dtype)
+    result = df.groupby("x")["y"].quantile(0.5)


can you do a case with a listlike qs e.g. [0.5, 0.75]

added below

pandas/tests/groupby/test_quantile.py

…o gh42849

debnathshoham · 2021-09-04T02:37:07Z

Hi @jbrockmendel could you pls take a quick look if this is fine now?

jbrockmendel · 2021-09-04T20:30:22Z

pandas/tests/groupby/test_quantile.py

+    tm.assert_series_equal(expected, result)
+
+    result = df.groupby("x").quantile(0.5)
+    expected = DataFrame({"y": 3.5}, index=Index([1], name="x"))


nitpick: clearer to just do expected = expected["y"] next time

jbrockmendel · 2021-09-04T20:32:00Z

LGTM

wouldnt surprise me if some of the tests could be condensed/parametrized, OK for follow-up

jreback · 2021-09-04T23:29:45Z

thanks @debnathshoham (followup to parameterize tests here would be great, that can be on master)

…pd.NA) (#43417)

BUG: GroupBy.quantile fails with pd.NA

b94ee25

jreback requested changes Aug 21, 2021

View reviewed changes

pandas/core/groupby/groupby.py Outdated Show resolved Hide resolved

jreback added this to the 1.3.3 milestone Aug 21, 2021

jreback added Groupby Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate labels Aug 21, 2021

suggested change

5712bf8

jreback reviewed Aug 21, 2021

View reviewed changes

pandas/tests/groupby/test_quantile.py Outdated Show resolved Hide resolved

added fixture; colocated near dtype tests

105abf4

jbrockmendel reviewed Aug 21, 2021

View reviewed changes

pandas/tests/groupby/test_quantile.py Outdated Show resolved Hide resolved

added tests for int_EA & allNA

485402c

debnathshoham requested a review from jreback August 21, 2021 19:55

jbrockmendel reviewed Aug 23, 2021

View reviewed changes

pandas/core/groupby/groupby.py Show resolved Hide resolved

Merge branch 'master' into gh42849

e64964f

jbrockmendel reviewed Aug 31, 2021

View reviewed changes

pandas/core/groupby/groupby.py Show resolved Hide resolved

jbrockmendel reviewed Aug 31, 2021

View reviewed changes

pandas/tests/groupby/test_quantile.py Show resolved Hide resolved

debnathshoham added 3 commits September 1, 2021 12:36

added suggested tests

8a57188

Merge branch 'gh42849' of https://github.com/debnathshoham/pandas int…

2443008

…o gh42849

Merge branch 'master' into gh42849

75aa693

Merge branch 'master' into gh42849

7c15bb2

jreback approved these changes Sep 4, 2021

View reviewed changes

jbrockmendel reviewed Sep 4, 2021

View reviewed changes

jreback merged commit 4ec87eb into pandas-dev:master Sep 4, 2021

meeseeksmachine mentioned this pull request Sep 4, 2021

Backport PR #43150 on branch 1.3.x (BUG: GroupBy.quantile fails with pd.NA) #43408

Closed

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Sep 4, 2021

Backport PR pandas-dev#43150: BUG: GroupBy.quantile fails with pd.NA

bdf34c6

feefladder pushed a commit to feefladder/pandas that referenced this pull request Sep 7, 2021

BUG: GroupBy.quantile fails with pd.NA (pandas-dev#43150)

13ee2eb

simonjayhawkins mentioned this pull request Sep 7, 2021

Backport PR #43150 on branch 1.3.x (BUG: GroupBy.quantile fails with pd.NA) #43417

Merged

jreback pushed a commit that referenced this pull request Sep 9, 2021

Backport PR #43150 on branch 1.3.x (BUG: GroupBy.quantile fails with …

5d6e352

…pd.NA) (#43417)

debnathshoham deleted the gh42849 branch September 12, 2021 17:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: GroupBy.quantile fails with pd.NA #43150

BUG: GroupBy.quantile fails with pd.NA #43150

debnathshoham commented Aug 21, 2021

jreback left a comment

debnathshoham commented Aug 21, 2021

debnathshoham commented Aug 30, 2021

jbrockmendel Aug 31, 2021

debnathshoham Sep 1, 2021

jbrockmendel Aug 31, 2021

debnathshoham Sep 1, 2021

debnathshoham commented Sep 4, 2021

jbrockmendel Sep 4, 2021

jbrockmendel commented Sep 4, 2021

jreback commented Sep 4, 2021

BUG: GroupBy.quantile fails with pd.NA #43150

BUG: GroupBy.quantile fails with pd.NA #43150

Conversation

debnathshoham commented Aug 21, 2021

jreback left a comment

Choose a reason for hiding this comment

debnathshoham commented Aug 21, 2021

debnathshoham commented Aug 30, 2021

jbrockmendel Aug 31, 2021

Choose a reason for hiding this comment

debnathshoham Sep 1, 2021

Choose a reason for hiding this comment

jbrockmendel Aug 31, 2021

Choose a reason for hiding this comment

debnathshoham Sep 1, 2021

Choose a reason for hiding this comment

debnathshoham commented Sep 4, 2021

jbrockmendel Sep 4, 2021

Choose a reason for hiding this comment

jbrockmendel commented Sep 4, 2021

jreback commented Sep 4, 2021