BUG: `date_range` `inclusive` parameter behavior doesn't match interval notation #55319

PiotrekB416 · 2023-09-28T21:34:01Z

closes BUG: date_range inclusive parameter behavior doesn't match interval notation when start == end #55293
closes BUG: the behavior of "date_range" function with "periods & inclusive" arguments #46331
Added an entry in the latest doc/source/whatsnew/v2.1.3.rst file if fixing a bug or adding a new feature.

This pr fixes the issues with inclusive in date_range

Got rid of the problematic expression which seems to be i8values[] == start/end_i8, also the and in line 498 should have been an or. However after those fixes the if-statement was over complicated, so I simplified it.

github-actions · 2023-10-31T00:05:21Z

This pull request is stale because it has been open for thirty days with no activity. Please update and respond to this comment if you're still interested in working on this.

PiotrekB416 · 2023-10-31T10:19:28Z

This pr is ready for review

datapythonista

Thanks for the contribution @PiotrekB416, and sorry it too so long to review, seems like this PR was forgotten in the long list of PRs we have.

It's a bit difficult to understand all possible cases reading the diff, and I'm worried that by fixing the bug we may be introducing new ones.

What I'd like to see in general is that you keep existing tests as they are (in general we shouldn't have wrong cases as tests), and that you add one new test that it reproduces the fixed issue.

In this case the test seems a bit cumbersome, was it asserting cases with the behavior we consider a bug in this PR?

@MarcoGorelli do you have an opinion here?

PiotrekB416 · 2023-11-18T16:26:22Z

In this case the test seems a bit cumbersome, was it asserting cases with the behavior we consider a bug in this PR?

Yes, as far as I could tell the tests had a few mistakes in them, also they were written in a really strange and unnecessarily complex way.
For example this line:

expected = date_range(bday_start, bday_end, freq="D")

should get the expected range, but the inclusive argument is not passed to it

datapythonista · 2023-11-19T08:54:34Z

Thanks for the feedback and the work on this @PiotrekB416

Marco will be able to provide better feedback when he has the time to review, but what makes sense to me is:

Leave the existing tests as they are in this PR
Add a clearer brand new test that hits the specific bug you are fixing. If it makes sense we can test more on it
Only if there is any specific part of the existing tests that fail after your changes, remove that part, in a ver controlled way of what's the behavior we were testing to be true, that we now want to be changed
In a follow up, we can do a proper refactoring of the tests of this functionality

While what you are doing here seems reasonable, it feels strange that the complex logic you are making much simpler here wasn't implemented for a reason, and that we may be missing some use case where your changes fix a bug, but introduce a new one. Changing too much here makes difficult to see if that will be the case. I feel like the approach I'm proposing should minimize the chances of it happening. What do you think?

MarcoGorelli · 2023-12-17T09:17:23Z

pandas/tests/indexes/datetimes/test_date_range.py

-        begintz = Timestamp("2011/1/1", tz="US/Eastern")
-        endtz = Timestamp("2014/1/1", tz="US/Eastern")
+        # begintz = Timestamp("2011/1/1", tz="US/Eastern")
+        # endtz = Timestamp("2014/1/1", tz="US/Eastern")


why is this part of the test comment out?

The shortest answer would be that those lines are not needed and can be safely deleted. I just commented them out when I was testing and then forgot to remove them.

And as for why they were there in the first place. They were used by _get_expected_range, but they are not needed for the function to work

ok thanks - if they're not needed, then to simplify reviews, could you please first open a precursor PR in which you remove anything that's not needed, and then we keep this PR focused on the logic changes (+ relevant tests)? thanks 🙏

I'm afraid that might not be possible. The code is complicated because of its logic errors, thus I can't simplify it without making logic changes.
For example:
in test_range_closed_boundary

expected_right = both_boundary[1:] expected_left = both_boundary[:-1] expected_both = both_boundary[:] expected_neither = both_boundary[1:-1]

this simplification doesn't work if there are no logic changes in the _generate_range function, because then the expected values are compared to incorrectly generated ranges.

mroeschke · 2024-04-15T17:28:33Z

Thanks for the pull request, but it appears to have gone stale. If interested in continuing, please merge in the main branch, address any review comments and/or failing tests, and we can reopen.

PiotrekB416 and others added 8 commits September 28, 2023 23:10

FIX date_range inclusive

c054c3d

Merge branch 'pandas-dev:main' into FIX-date_range-inclusive

3bc308b

FIX tests

5aa1ca5

Update datetimes.py

0137349

Update test_date_range.py

1edced5

Update test_date_range.py

a2d8ae6

Fixed random paste

048e00f

Merge branch 'main' into FIX-date_range-inclusive

bf5cfe1

github-actions bot added the Stale label Oct 31, 2023

PiotrekB416 and others added 3 commits October 31, 2023 05:34

Merge branch 'main' into FIX-date_range-inclusive

f20d078

Update whatsnew

09d5b94

Merge branch 'main' into FIX-date_range-inclusive

dcb4707

PiotrekB416 added 2 commits October 31, 2023 21:09

Merge branch 'main' into FIX-date_range-inclusive

a3f2097

Merge branch 'main' into FIX-date_range-inclusive

23ef653

datapythonista added Bug Datetime Datetime data dtype and removed Stale labels Nov 17, 2023

datapythonista reviewed Nov 17, 2023

View reviewed changes

MarcoGorelli self-requested a review November 17, 2023 14:59

PiotrekB416 and others added 3 commits November 18, 2023 17:12

Minor refactoring in tests

fed521d

Merge branch 'main' into FIX-date_range-inclusive

2cfea14

merge main into branch

06cdc3c

PiotrekB416 and others added 5 commits November 18, 2023 17:31

Update Dockerfile

dc339f4

Update .devcontainer.json

24856e7

Update test_date_range.py

a2bcae5

fix formatting

584dc2f

fixed Y is deprected use YE instead

94f8039

MarcoGorelli reviewed Dec 17, 2023

View reviewed changes

Merge branch 'pandas-dev:main' into FIX-date_range-inclusive

476ced9

mroeschke closed this Apr 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: `date_range` `inclusive` parameter behavior doesn't match interval notation #55319

BUG: `date_range` `inclusive` parameter behavior doesn't match interval notation #55319

PiotrekB416 commented Sep 28, 2023 •

edited

Loading

github-actions bot commented Oct 31, 2023

PiotrekB416 commented Oct 31, 2023

datapythonista left a comment

PiotrekB416 commented Nov 18, 2023

datapythonista commented Nov 19, 2023

MarcoGorelli Dec 17, 2023

PiotrekB416 Dec 17, 2023

MarcoGorelli Dec 17, 2023

PiotrekB416 Dec 17, 2023

mroeschke commented Apr 15, 2024

BUG: date_range inclusive parameter behavior doesn't match interval notation #55319

BUG: date_range inclusive parameter behavior doesn't match interval notation #55319

Conversation

PiotrekB416 commented Sep 28, 2023 • edited Loading

github-actions bot commented Oct 31, 2023

PiotrekB416 commented Oct 31, 2023

datapythonista left a comment

Choose a reason for hiding this comment

PiotrekB416 commented Nov 18, 2023

datapythonista commented Nov 19, 2023

MarcoGorelli Dec 17, 2023

Choose a reason for hiding this comment

PiotrekB416 Dec 17, 2023

Choose a reason for hiding this comment

MarcoGorelli Dec 17, 2023

Choose a reason for hiding this comment

PiotrekB416 Dec 17, 2023

Choose a reason for hiding this comment

mroeschke commented Apr 15, 2024

BUG: `date_range` `inclusive` parameter behavior doesn't match interval notation #55319

BUG: `date_range` `inclusive` parameter behavior doesn't match interval notation #55319

PiotrekB416 commented Sep 28, 2023 •

edited

Loading