BUG: Unexpected behaviour when inserting timestamps into Series #57596 #57628

wleong1 · 2024-02-26T00:25:00Z

Resolution info:

Resolution.RESO_SEC - 3 for seconds
Resolution.RESO_MIN - 4 for minutes
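
As a rough illustration of how the two entries discussed below end up with different resolution values, here is a minimal standard-library sketch. `Reso` and `infer_reso` are hypothetical stand-ins written for this example, not pandas objects; only the numeric values 3 and 4 come from the list above.

```python
from datetime import datetime
from enum import IntEnum


class Reso(IntEnum):
    """Hypothetical stand-in mirroring the two values quoted above."""
    SEC = 3
    MIN = 4


def infer_reso(text: str) -> Reso:
    """Guess the resolution of a timestamp string from the finest field it spells out."""
    # Try the finer format first so a string with seconds is not read as minutes.
    for fmt, reso in (("%Y-%m-%d %H:%M:%S", Reso.SEC), ("%Y-%m-%d %H:%M", Reso.MIN)):
        try:
            datetime.strptime(text, fmt)
        except ValueError:
            continue
        return reso
    raise ValueError(f"unsupported timestamp string: {text!r}")


minute_key = infer_reso("2024-02-24 10:08")      # no seconds given
second_key = infer_reso("2024-02-24 10:2:30")    # seconds given
print(minute_key.name, second_key.name, minute_key > second_key)
```

A larger value therefore means a coarser key: the minute-level string names a whole minute rather than a single instant.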

Observation:
With the previous code, when the current entry (resolution: 4, entry: '2024-02-24 10:08') has a greater resolution value than the smallest among the previous entries (in this case resolution: 3, entry: '2024-02-24 10:2:30'), the key is reported as already present among the keys of the Series even though it is not. This does not happen when an entry with a smaller or equal resolution value is inserted into the Series.

Thought process:
With the previous code, if the current entry's resolution value is greater than the smallest resolution value among the previous entries, the _LocIndexer._convert_to_indexer method returns only an empty list (the output of labels.get_loc(key)). This may be why entries with larger resolution values are not inserted into the Series correctly.

Potential solution:
I have changed the method _LocIndexer._convert_to_indexer to return the key ('2024-02-24 10:08') instead of an empty list when the key's resolution value is greater than the smallest resolution value among the existing entries.
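
The behaviour described above can be modelled with a small toy class. This is only a sketch under stated assumptions, not the pandas implementation: `ToySeries`, `parse`, and the `fixed` flag are hypothetical names, and treating a coarser key as a one-minute window is one reading of the description. It shows why an empty indexer makes the assignment a silent no-op, and how returning the key instead lets the entry be appended.

```python
from datetime import datetime, timedelta

SEC, MIN = 3, 4  # resolution values quoted in the description


def parse(text):
    """Return (start_datetime, resolution) for a timestamp string."""
    for fmt, reso in (("%Y-%m-%d %H:%M:%S", SEC), ("%Y-%m-%d %H:%M", MIN)):
        try:
            return datetime.strptime(text, fmt), reso
        except ValueError:
            pass
    raise ValueError(text)


class ToySeries:
    """Hypothetical model of setting values by timestamp string."""

    def __init__(self, fixed=False):
        self.keys = []      # list of (datetime, resolution)
        self.values = []
        self.fixed = fixed  # True models the proposed change

    def _convert_to_indexer(self, text):
        start, reso = parse(text)
        finest = min((r for _, r in self.keys), default=reso)
        if reso > finest:
            # Coarser key: look for existing keys inside the minute it names.
            end = start + timedelta(minutes=1)
            hits = [i for i, (k, _) in enumerate(self.keys) if start <= k < end]
            if hits or not self.fixed:
                return hits  # may be empty, as in the previous code
            return text      # proposed change: hand back the key itself
        hits = [i for i, (k, _) in enumerate(self.keys) if k == start]
        return hits if hits else text  # unknown key: signal "append"

    def __setitem__(self, text, value):
        indexer = self._convert_to_indexer(text)
        if isinstance(indexer, str):
            self.keys.append(parse(indexer))
            self.values.append(value)
        else:
            for i in indexer:  # nothing happens when the list is empty
                self.values[i] = value


old, new = ToySeries(), ToySeries(fixed=True)
for s in (old, new):
    s["2024-02-24 10:2:30"] = 1
    s["2024-02-24 10:08"] = 2
print(len(old.keys), len(new.keys))  # 1 2
```

In the toy model the second assignment is dropped without any error unless `fixed=True`, which matches the symptom reported in the observation.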

closes BUG: Unexpected behaviour when inserting timestamps into Series #57596
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/v2.2.1.rst file if fixing a bug or adding a new feature.


datapythonista · 2024-02-27T14:41:58Z

Thanks for the contribution @wleong1. Seems like something is wrong with this PR. I think you may want to clone pandas again and, before implementing your changes, create a branch in git. If you work in the main branch, things get mixed with other changes, as I think is the case here.

I'll close this, since you'll have to create a new PR if you work in a different branch. Please let us know if you need help.

mroeschke and others added 30 commits January 29, 2024 00:01

BUG: np.matmul with Index raising TypeError (#57079)

6de74dd

BUG: fix Series.value_counts with sort=False returns result sorte…

46163c5

…d on values for Series with string dtype (#57116)

TST: Interchange implementation for timestamp[ns][pyarrow] (#57026)

56b5979

* TST: Interchange implementation for timestamp[ns][pyarrow] * Check timestamp explicitly * Use pytest.importorskip --------- Co-authored-by: Marco Edward Gorelli <[email protected]>

DOC: fix PR02 errors in docstrings - pandas.Series.interpolate, panda…

820ad77

…s.read_hdf, pandas.HDFStore.append (#57114) * Resolve PR02 errors in docstrings: pandas.Series.interpolate, pandas.read_hdf, pandas.HDFStore.append * Update code_checks.sh

Fix typo in missing_data.rst (#57131)

a3ca439

Update missing_data.rst Fix typo

TYP: misc return values (#57108)

b0b1685

DOC: fix PR02 errors in docstrings - pandas.describe_option, pandas.g…

42e4e4c

…et_option, pandas.reset_option (#57117) * Copy the signature from the implementation * updated code_checs.sh

REGR: Index.join raising TypeError when joining an empty index to a m…

f5bb775

…ixed type index (#57101) * fix regression in join with empty * mypy * move empty check

CI: autouse add_doctest_imports (#57122)

3eea5fd

* CI: autouse add_doctest_imports * add ref

CLN: The smallest typo correction, "engine" lacked an "n". (#57134)

1f8b763

The smallest typo correction, "engine" lacked an "n".

ENH: Add skipna to groupby.first and groupby.last (#57102)

ab3d4bf

* ENH: Add skipna to groupby.first and groupby.last * resample & tests * Improve test * Fixups * fixup test * Rework na_value determination

REGR: non-unique, masked dtype index raising IndexError (#57061)

a302b1b

* fix masked indexing regression * fix test * fix test * dedup resizing logic * add types

BUG: Index(Series) makes array read only for object dtype (#57139)

b6fb905

DOC: Updated community slack link (#57146)

f636d7d

CI: Fix _get_dst_hours for numpy 2.0 change (#57144)

9b95e45

BUG: Fix to_dict with datelike types and orient=list (#57157)

b41ea09

CLN: to_dict (#57159)

c811353

* Make to_dict lazier * Remove some extra looping and indexing * Add erroneous ignore

DEPR: removed deprecated argument obj from GroupBy get_group (#57136)

db11e25

CI: Use new documentation previewer (#57112)

2c32439

Fix docstring (#57167)

27b6996

* fix PR02 error in clip function * fix PR02 error in argsort function

BUG: Interchange protocol implementation handles empty dataframes inc…

dc16177

…orrectly (#57175) fix from_dataframe for empty dataframes

REGR: DataFrame.sort_index not producing stable sort (#57169)

9760b64

DataFrame.sort_index not producing stable sort

Timestamp 'freq' is removed in GH 14146, not moved to kw-only arg. Re…

89b633d

…moval is … (#57162) 'freq' is removed in GH 14146), not moved to kw-only arg. Removal is already captured under 'Removal of prior version deprecations/changes'

CI: avoid DeprecationWarning in validate_min_versions_in_sync script (#…

0ec21fa

…57127)

CI: Remove redundant check (#57156)

a3e4391

Remove redundant code style check

DEPR: Enforce deprecation of removing axis from all groupby ops (#57109)

366691e

* DEPR: Enforce deprecation of removing axis from all groupby ops * Add note on fillna and cleanup * Doc fixups * Remove corrwith axis=1 test * Remove corrwith axis=1 test * Skip corrwith docstring validation

DEP: Loosely pin Cython to 3.0 (#56993)

1785fdc

Document some more categorical methods (#56928)

29a3682

* Document some more categorical methods * Render pages for Categorical methods

mroeschke and others added 20 commits February 22, 2024 15:39

PERF: Switch conditions in RangeIndex._shallow_copy (#57560)

1debaf3

DOC: fix ES01 for pandas.Flags (#57563)

83fec18

fix ES01 for pandas.Flags

CLN: Remove pickle support pre-pandas 1.0 (#57555)

ada009a

* Centeralize methods to class * Cleanups * CLN: Remove pickle support pre-pandas 1.0 * Typing * clean:

DOC: Add release date for 2.2.1 (#57576)

02765b4

Updating timeseries.offset_aliases links to use :ref: format (#57581)

a305c1f

DOC: Add contributors for 2.2.1 (#57582)

7bb427e

CLN: Assorted khash-python cleanups (#57575)

6ef44f2

* CLN: Assort khash-python cleanups * add static inline to memory tracers * revert mistake with hashdouble * try less

BUG: Interchange object data buffer has the wrong dtype / from_datafr…

5f87a2d

…ame incorrect (#57570) string

ENH: Allow performance warnings to be disabled (#56921)

a730486

* ENH: Allow performance warnings to be disabled * ENH: Allow performance warnings to be disabled * Add tests * fixup

PERF: groupby(...).__len__ (#57595)

1d70500

* PERF: groupby(...).__len__ * GH#

CLN: Use group_info less (#57598)

e103a4c

REF: Move compute to BinGrouper.result_index_and_ids (#57599)

87dd2ee

CoW: Deprecate copy keyword from first set of methods (#57347)

3f05c4f

* CoW: Remove a few copy=False statements * Cow: Deprecate copy keyword from first set of methods * Fixup * Update * Update * Update

TST: Clean contexts (#57571)

9530851

* Remove use_numexpr * remove return_filelike in ensure_clean * Start using temp_file * Use more unique name?' * Fix failure * remove from all

Fixed BUG: Unexpected behaviour when inserting timestamps into Series #…

3065fce

…57596

wleong1 requested review from attack68, rhshadrach, MarcoGorelli, Dr-Irv, WillAyd, noatamir, datapythonista and mroeschke as code owners February 26, 2024 00:25

datapythonista closed this Feb 27, 2024
