SeriesGroupBy.transform cannot handle empty series #26208

HHest · 2019-04-25T09:27:53Z

Code Sample, a copy-pastable example if possible

import pandas as pd

d = pd.DataFrame({1: [], 2: []})  # empty frame: columns 1 and 2, no rows
g = d.groupby(1)
g[2].transform(lambda x: x)  # raises ValueError on pandas 0.24.2

Traceback (most recent call last):
  File "<input>", line 1, in <module>
  File "C:\python37\lib\site-packages\pandas\core\groupby\generic.py", line 945, in transform
    result = concat(results).sort_index()
  File "C:\python37\lib\site-packages\pandas\core\reshape\concat.py", line 228, in concat
    copy=copy, sort=sort)
  File "C:\python37\lib\site-packages\pandas\core\reshape\concat.py", line 262, in __init__
    raise ValueError('No objects to concatenate')
ValueError: No objects to concatenate

Problem description

`SeriesGroupBy.transform` crashes when the SeriesGroupBy object has zero length, here because it came from an empty DataFrame. It would be nicer if pandas handled this case without raising an error, for example by returning an empty Series. Thanks.

Expected Output

Series([], Name: 2, dtype: float64)
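On pandas versions that raise here, one possible caller-side workaround is to skip the groupby entirely when the frame has no rows. The sketch below is only an illustration: `transform_or_empty` is a hypothetical helper, not a pandas API, and it assumes that an empty copy of the selected column is an acceptable result.

```python
import pandas as pd


def transform_or_empty(frame, key, column, func):
    """Hypothetical helper: group-wise transform that tolerates an empty frame."""
    if frame.empty:
        # Nothing to group: return the (already empty) column, keeping its name.
        return frame[column].copy()
    return frame.groupby(key)[column].transform(func)


# Usage with the empty frame from the report.
d = pd.DataFrame({1: [], 2: []})
result = transform_or_empty(d, 1, 2, lambda x: x)
print(len(result), result.name)
```

The dtype of the returned Series is whatever the column already had, so it may differ from the `float64` shown in the expected output depending on the pandas version.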

Output of `pd.show_versions()`

INSTALLED VERSIONS
------------------
commit: None
python: 3.7.3.final.0
python-bits: 64
OS: Windows
OS-release: 10
machine: AMD64
processor: Intel64 Family 6 Model 142 Stepping 9, GenuineIntel
byteorder: little
LC_ALL: None
LANG: None
LOCALE: None.None
pandas: 0.24.2
pytest: 4.4.1
pip: 19.0.3
setuptools: 41.0.1
Cython: None
numpy: 1.15.4
scipy: 1.1.0
pyarrow: None
xarray: None
IPython: None
sphinx: None
patsy: 0.5.1
dateutil: 2.7.5
pytz: 2018.7
blosc: None
bottleneck: None
tables: None
numexpr: None
feather: None
matplotlib: 3.0.2
openpyxl: 2.5.12
xlrd: 1.2.0
xlwt: 1.3.0
xlsxwriter: None
lxml.etree: None
bs4: None
html5lib: None
sqlalchemy: None
pymysql: None
psycopg2: None
jinja2: None
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: None
gcsfs: None


WillAyd · 2019-04-25T15:32:07Z

I suppose that would make this consistent with apply and agg:

>>> g[2].apply(lambda x: x)
Series([], Name: 2, dtype: float64)
>>> g[2].agg(lambda x: x)
Series([], Name: 2, dtype: float64)

If you want to take a look and have a simple way of making it work, we would take a PR.
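The traceback in the report shows `concat(results)` being reached with nothing to concatenate. A minimal sketch of guarding that kind of call, written outside pandas internals: `concat_or_empty` is a hypothetical helper for illustration, it is not necessarily how any submitted patch does it, and the `float64` dtype is an assumption chosen to mirror the expected output in the report.

```python
import pandas as pd


def concat_or_empty(results, name=None):
    """Hypothetical helper: concatenate per-group results, or return an empty Series."""
    if results:
        # Normal path: stitch the per-group pieces back together in index order.
        return pd.concat(results).sort_index()
    # No groups at all: pd.concat([]) would raise ValueError, so short-circuit.
    return pd.Series([], name=name, dtype="float64")


# With no per-group results, the helper returns an empty Series instead of raising.
empty_result = concat_or_empty([], name=2)
print(len(empty_result), empty_result.name)
```

The check is on the list of results rather than on the input data, so it covers any path that ends up with zero groups.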

WillAyd · 2019-04-25T15:34:16Z

Related to #17093

HHest · 2019-04-26T11:59:24Z

Okay, thanks. I see a possible patch. I need to read over the contribution guidelines and set up a working environment to create a PR.

HHest · 2019-04-27T19:53:23Z

The PR I just submitted doesn't fix #17093

WillAyd added the Groupby label Apr 25, 2019

WillAyd added this to the Contributions Welcome milestone Apr 25, 2019

HHest mentioned this issue Apr 29, 2019

Fix .transform crash when SeriesGroupBy is empty (#26208) #26228

Merged

4 tasks

jreback modified the milestones: Contributions Welcome, 0.25.0 May 14, 2019

jreback closed this as completed in #26228 May 15, 2019

jreback pushed a commit that referenced this issue May 15, 2019

Fix .transform crash when SeriesGroupBy is empty (#26208) (#26228)

ff4437e

mroeschke mentioned this issue Aug 16, 2020

BUG: DataFrame.groupby(., dropna=True, axis=0) incorrectly throws ShapeError #35751

Merged

5 tasks
