REF: use check_setitem_lengths in DTA.setitem #36339

jbrockmendel · 2020-09-13T16:16:57Z

Turning this into a two-line check will make it easy to add to PandasArray.__setitem__ and Categorical.__setitem__, at which point we'll be able to move these methods up to NDArrayBackedExtensionArray.__setitem__

…eck-empty-setitem

jorisvandenbossche

Looks good! Only a comment regarding the todo

pandas/core/indexers.py

jorisvandenbossche · 2020-09-13T19:30:56Z

pandas/core/indexers.py

+                        "different length than the value"
+                    )
+            else:
+                # TODO: dont we still need lengths to match?


I think yes?
But can you do that here, since the original code (I think?) did it.

the existing code does not check for length-match in the len==0 case. ill take a look at adding that check and getting rid of this comment

It actually is checked right now, because of the len(key) != len(value) before checking elif not len(key): (but it's clearly not tested ..)

As example:

In [1]: arr = pd.date_range("2012", periods=3)._data In [2]: arr Out[2]: <DatetimeArray> ['2012-01-01 00:00:00', '2012-01-02 00:00:00', '2012-01-03 00:00:00'] Length: 3, dtype: datetime64[ns] In [3]: arr[[]] = [pd.Timestamp("2012")] --------------------------------------------------------------------------- ValueError Traceback (most recent call last) <ipython-input-3-d4b75e105bf5> in <module> ----> 1 arr[[]] = [pd.Timestamp("2012")] ~/scipy/pandas/pandas/core/arrays/datetimelike.py in __setitem__(self, key, value) 620 f"'{len(value)}'." 621 ) --> 622 raise ValueError(msg) 623 elif not len(key): 624 return ValueError: shape mismatch: value array of length '0' does not match indexing result of length '1'.

Good catch; i was comparing against the existing check_setitem_lengths

jreback · 2020-09-13T20:26:32Z

pandas/core/indexers.py

            if len(value) != length_of_indexer(indexer, values):
                raise ValueError(
                    "cannot set using a slice indexer with a "
                    "different length than the value"
                )
+            if len(value) == 0:


use not len(value) (and same as above)

Personally I find if len(value) == 0: more readable / clearer about the intent ...

its not idiomatic python (though i tend to agree), either way as long as we are consistent, and unfortunately this is not consistent with the rest of the code base

jorisvandenbossche · 2020-09-13T20:37:41Z

@jbrockmendel can you add a test for the setitem case with zero-length indexer and non-zero length value (as it is clearly untested ..)

jbrockmendel · 2020-09-14T01:45:14Z

updated per suggestions

jorisvandenbossche · 2020-09-14T06:28:44Z

Thanks!

jbrockmendel added 4 commits September 10, 2020 19:58

dummy to see if pre-commit fails

fba224d

Merge branch 'master' of https://github.com/pandas-dev/pandas into ch…

4bf12b1

…eck-empty-setitem

REF: use check_setitem_lengths in DatetimeArray.__setitem__

cb86e9e

Merge branch 'master' of https://github.com/pandas-dev/pandas into ch…

e6e5e48

…eck-empty-setitem

jorisvandenbossche requested changes Sep 13, 2020

View reviewed changes

suggested edits

8bcd902

jreback added the Indexing Related to indexing on series/frames, not to indexes themselves label Sep 13, 2020

jreback requested changes Sep 13, 2020

View reviewed changes

sty, tests

a8c0494

jorisvandenbossche approved these changes Sep 14, 2020

View reviewed changes

jorisvandenbossche merged commit 8df0218 into pandas-dev:master Sep 14, 2020

jorisvandenbossche added this to the 1.2 milestone Sep 14, 2020

jbrockmendel deleted the check-empty-setitem branch September 14, 2020 15:58

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

REF: use check_setitem_lengths in DTA.__setitem__ (pandas-dev#36339)

e4b0670

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REF: use check_setitem_lengths in DTA.setitem #36339

REF: use check_setitem_lengths in DTA.setitem #36339

jbrockmendel commented Sep 13, 2020

jorisvandenbossche left a comment

jorisvandenbossche Sep 13, 2020

jbrockmendel Sep 13, 2020

jorisvandenbossche Sep 13, 2020

jbrockmendel Sep 13, 2020

jreback Sep 13, 2020

jorisvandenbossche Sep 13, 2020

jreback Sep 13, 2020

jorisvandenbossche commented Sep 13, 2020

jbrockmendel commented Sep 14, 2020

jorisvandenbossche commented Sep 14, 2020

REF: use check_setitem_lengths in DTA.__setitem__ #36339

REF: use check_setitem_lengths in DTA.__setitem__ #36339

Conversation

jbrockmendel commented Sep 13, 2020

jorisvandenbossche left a comment

Choose a reason for hiding this comment

jorisvandenbossche Sep 13, 2020

Choose a reason for hiding this comment

jbrockmendel Sep 13, 2020

Choose a reason for hiding this comment

jorisvandenbossche Sep 13, 2020

Choose a reason for hiding this comment

jbrockmendel Sep 13, 2020

Choose a reason for hiding this comment

jreback Sep 13, 2020

Choose a reason for hiding this comment

jorisvandenbossche Sep 13, 2020

Choose a reason for hiding this comment

jreback Sep 13, 2020

Choose a reason for hiding this comment

jorisvandenbossche commented Sep 13, 2020

jbrockmendel commented Sep 14, 2020

jorisvandenbossche commented Sep 14, 2020

REF: use check_setitem_lengths in DTA.setitem #36339

REF: use check_setitem_lengths in DTA.setitem #36339