BUG: incorrect output of first('1M') in case if first index is the last day of the month #29623

vfilimonov · 2019-11-14T22:17:40Z

Code Sample

x = pd.Series(1, index=pd.bdate_range('2010-03-31', periods=100))
print(x.first('1M'))  # Returns March and April
print(x.first('2M'))  # Returns March, April and May
# etc...

Problem description

In case if the first index of the series falls on the last day of the month, Series.first('1M') returns two first months (this one day + the next month).

If the first value is not on the last day of the month - result is correct:

x = pd.Series(1, index=pd.bdate_range('2010-03-30', periods=100))
print(x.first('1M'))  # Returns only March
print(x.first('2M'))  # Returns March and April
# etc...

Output of `pd.show_versions()`

INSTALLED VERSIONS

commit : None
python : 3.7.1.final.0
python-bits : 64
OS : Darwin
OS-release : 18.7.0
machine : x86_64
processor : i386
byteorder : little
LC_ALL : en_US.UTF-8
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8

pandas : 0.25.3
numpy : 1.17.2
pytz : 2019.3
dateutil : 2.8.0
pip : 19.3.1
setuptools : 41.4.0
Cython : None
pytest : 5.2.1
hypothesis : None
sphinx : 2.2.0
blosc : None
feather : None
xlsxwriter : None
lxml.etree : 4.4.1
html5lib : None
pymysql : 0.9.3
psycopg2 : None
jinja2 : 2.10.1
IPython : 7.8.0
pandas_datareader: None
bs4 : 4.6.3
bottleneck : None
fastparquet : 0.1.6
gcsfs : None
lxml.etree : 4.4.1
matplotlib : 3.0.1
numexpr : 2.7.0
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : 0.12.1
pytables : None
s3fs : 0.3.4
scipy : 1.3.1
sqlalchemy : 1.3.8
tables : 3.4.4
xarray : None
xlrd : 1.1.0
xlwt : None
xlsxwriter : None

The text was updated successfully, but these errors were encountered:

rachel-frenkel · 2019-11-18T02:34:04Z

take

datapythonista · 2019-11-19T00:30:38Z

@jbrockmendel mind taking a look? That looks like a bug to me too, but may be we're missing something.

@rachel-frenkel let us know if you need help.

jbrockmendel · 2019-11-19T01:01:33Z

It looks like in first before setting end_date = end = self.index[0] + offset there needs to be a check for offset.onOffset(index[0]), in which case we should return self.loc[:index[0]]

simonjayhawkins · 2020-12-13T15:46:43Z

#38331 reverted #38448

github-actions bot assigned rachel-frenkel Nov 18, 2019

datapythonista added Datetime Datetime data dtype Bug labels Nov 19, 2019

Franklinluo17 mentioned this issue Dec 8, 2019

BUG: incorrect output of first('1M') in case if first index is the last day of the month (#29623) #30138

Closed

5 tasks

jreback added the Frequency DateOffsets label Dec 8, 2019

phofl mentioned this issue Dec 6, 2020

BUG: first("1M") returning two months when first day is last day of month #38331

Merged

5 tasks

jreback modified the milestones: 1.3, 1.2 Dec 12, 2020

jreback closed this as completed in #38331 Dec 12, 2020

phofl mentioned this issue Dec 13, 2020

BUG: first("2M") returning incorrect results #38446

Merged

4 tasks

simonjayhawkins removed this from the 1.2 milestone Dec 13, 2020

simonjayhawkins reopened this Dec 13, 2020

jreback added this to the 1.3 milestone Dec 13, 2020

jreback closed this as completed in #38446 Dec 19, 2020

DeaMariaLeon mentioned this issue Apr 24, 2023

BUG fixing module first() and DateOffset #52487

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: incorrect output of first('1M') in case if first index is the last day of the month #29623

BUG: incorrect output of first('1M') in case if first index is the last day of the month #29623

vfilimonov commented Nov 14, 2019

INSTALLED VERSIONS

rachel-frenkel commented Nov 18, 2019

datapythonista commented Nov 19, 2019

jbrockmendel commented Nov 19, 2019

simonjayhawkins commented Dec 13, 2020

BUG: incorrect output of first('1M') in case if first index is the last day of the month #29623

BUG: incorrect output of first('1M') in case if first index is the last day of the month #29623

Comments

vfilimonov commented Nov 14, 2019

Code Sample

Problem description

Output of pd.show_versions()

INSTALLED VERSIONS

rachel-frenkel commented Nov 18, 2019

datapythonista commented Nov 19, 2019

jbrockmendel commented Nov 19, 2019

simonjayhawkins commented Dec 13, 2020

Output of `pd.show_versions()`