DateFrame.truncate is not equivalent to slicing as mentioned in docs #17763

tdpetrou · 2017-10-03T13:48:44Z

Code Sample, a copy-pastable example if possible

>>> index = pd.date_range('2016-1-1', '2016-2-1', freq='s')
>>> df = pd.DataFrame(index=d, data={'a':1})  
>>> df.loc['2016-1-5 05':'2016-1-10']                  
                     a
2016-01-05 00:00:00  1
2016-01-05 00:00:01  1
2016-01-05 00:00:02  1
...
2016-01-10 23:59:57  1
2016-01-10 23:59:58  1
2016-01-10 23:59:59  1

>>> df.truncate('2016-1-5', '2016-1-10')
                     a
2016-01-05 00:00:00  1
2016-01-05 00:00:01  1
2016-01-05 00:00:02  1
...
2016-01-09 23:59:58  1
2016-01-09 23:59:59  1
2016-01-10 00:00:00  1

Problem description

The documentation for truncate has misleading information. truncate and slicing are not the same. Slicing returns all partially matching dates while truncate stops at an exact datetime - it assumes 0's for any unspecified date component.

Also, truncate can take datetime and partial string datetimes which, to me, isn't apparent in the docstrings.

Expected Output

Output of `pd.show_versions()`

INSTALLED VERSIONS

commit: None
python: 3.6.1.final.0
python-bits: 64
OS: Darwin
OS-release: 15.6.0
machine: x86_64
processor: i386
byteorder: little
LC_ALL: None
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8

pandas: 0.20.3
pytest: 3.0.7
pip: 9.0.1
setuptools: 35.0.2
Cython: 0.25.2
numpy: 1.13.1
scipy: 0.19.0
xarray: None
IPython: 6.0.0
sphinx: 1.5.5
patsy: 0.4.1
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: 1.2.0
tables: 3.4.2
numexpr: 2.6.2
feather: None
matplotlib: 2.0.2
openpyxl: 2.4.7
xlrd: 1.0.0
xlwt: 1.2.0
xlsxwriter: 0.9.6
lxml: 3.7.3
bs4: 4.6.0
html5lib: 0.9999999
sqlalchemy: 1.1.9
pymysql: None
psycopg2: None
jinja2: 2.9.6
s3fs: None
pandas_gbq: None
pandas_datareader: 0.3.0.post

The text was updated successfully, but these errors were encountered:

jorisvandenbossche · 2017-10-03T14:15:10Z

The docstring can indeed use some love (and examples). PR's welcome.

That said, we could also question the need for this function. Is it that useful given you can do the same with indexing? (if you are aware of the difference in how the labels are interpreted)

Just as a question (I never use it myself), do you use this function a lot? And if so, would it be hard to convert to using slicing?

tdpetrou · 2017-10-03T14:33:26Z

I've never used it and it seems to rarely come up on stackoverflow.

I suppose it could be useful to add parameters include_start and include_end like between_time but even then I'm not sure there is an example that would necessitate it over slicing.

At a more macro-scale, I find it strange that there are several DataFrame/Series methods that only work with DatetimeIndex - first, last, truncate, between_time, at_time, as_freq, asof and probably more that I can't think of at the moment.

The naming of some of these methods is completely non-intuitive. Especially something like first/last which have groupby methods of the same name (that are intuitive).

It would take a major break in API to refactor these methods out somewhere else.

jorisvandenbossche added the Docs label Oct 3, 2017

jreback added the Datetime Datetime data dtype label Oct 3, 2017

reidy-p added a commit to reidy-p/pandas that referenced this issue Oct 19, 2017

DOC: Improve truncate docstring (pandas-dev#17763)

83942e5

reidy-p added a commit to reidy-p/pandas that referenced this issue Oct 19, 2017

DOC: Improve truncate docstring (pandas-dev#17763)

ccd3ca0

reidy-p added a commit to reidy-p/pandas that referenced this issue Oct 19, 2017

DOC: Improve truncate docstring (pandas-dev#17763)

f5930af

reidy-p mentioned this issue Oct 19, 2017

DOC: Improve truncate docstring (#17763) #17925

Merged

1 task

jreback closed this as completed in #17925 Oct 21, 2017

jreback added this to the 0.21.0 milestone Oct 21, 2017

jreback pushed a commit that referenced this issue Oct 21, 2017

DOC: Improve truncate docstring (#17763) (#17925)

95b422c

peterpanmj pushed a commit to peterpanmj/pandas that referenced this issue Oct 31, 2017

DOC: Improve truncate docstring (pandas-dev#17763) (pandas-dev#17925)

6f8f684

alanbato pushed a commit to alanbato/pandas that referenced this issue Nov 10, 2017

DOC: Improve truncate docstring (pandas-dev#17763) (pandas-dev#17925)

0d78812

No-Stream pushed a commit to No-Stream/pandas that referenced this issue Nov 28, 2017

DOC: Improve truncate docstring (pandas-dev#17763) (pandas-dev#17925)

1d616a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DateFrame.truncate is not equivalent to slicing as mentioned in docs #17763

DateFrame.truncate is not equivalent to slicing as mentioned in docs #17763

tdpetrou commented Oct 3, 2017 •

edited

Loading

INSTALLED VERSIONS

jorisvandenbossche commented Oct 3, 2017

tdpetrou commented Oct 3, 2017

DateFrame.truncate is not equivalent to slicing as mentioned in docs #17763

DateFrame.truncate is not equivalent to slicing as mentioned in docs #17763

Comments

tdpetrou commented Oct 3, 2017 • edited Loading

Code Sample, a copy-pastable example if possible

Problem description

Expected Output

Output of pd.show_versions()

INSTALLED VERSIONS

jorisvandenbossche commented Oct 3, 2017

tdpetrou commented Oct 3, 2017

tdpetrou commented Oct 3, 2017 •

edited

Loading

Output of `pd.show_versions()`