API/BUG: make .at raise same exceptions as .loc #31724

jbrockmendel · 2020-02-05T21:51:48Z

closes API/BUG: Inconsistent errors/msgs between loc vs at #31722
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

This also (very) indirectly addresses #31683 which in turn will let us get rid of CategoricalIndex.get_value altogether.

pandas/tests/indexing/test_scalar.py

WillAyd · 2020-02-06T01:10:06Z

doc/source/whatsnew/v1.1.0.rst

@@ -63,7 +63,8 @@ Backwards incompatible API changes
 - :meth:`DataFrameGroupby.mean` and :meth:`SeriesGroupby.mean` (and similarly for :meth:`~DataFrameGroupby.median`, :meth:`~DataFrameGroupby.std`` and :meth:`~DataFrameGroupby.var``)
  now raise a  ``TypeError`` if a not-accepted keyword argument is passed into it.
  Previously a ``UnsupportedFunctionCall`` was raised (``AssertionError`` if ``min_count`` passed into :meth:`~DataFrameGroupby.median``) (:issue:`31485`)
-
+- :meth:`DataFrame.at` and :meth:`Series.at` will raise a ``TypeError`` instead of a ``ValueError`` if an incompatible key is passed, matching the behavior of ``.loc[]`` (:issue:`31722`)


Should mention the KeyError here as well?

…-errors

jorisvandenbossche

Can you check some timings for at? (as this is supposed to be the fast one)

How does this indirectly also address #31683 ? And if it does, shouldn't there be tests involving categorical / a whatsnew?

jorisvandenbossche · 2020-02-06T08:40:12Z

pandas/tests/indexing/test_scalar.py

        msg = (
-            "At based indexing on an non-integer index can only have "
-            "non-integer indexers"
+            "cannot do label indexing on <class 'pandas.core.indexes.base.Index'> "


I have to say that I found the previous error message more readable ..

I generally agree, that suggests we should improve the existing messages for "loc"

jbrockmendel · 2020-02-06T16:53:25Z

Can you check some timings for at? (as this is supposed to be the fast one)

Yah we take a hit

In [2]: ser = pd.Series(range(10000))                                           
In [3]: %timeit ser.at[501]                                                     
4.54 µs ± 139 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <-- master
7.8 µs ± 75.2 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <-- PR

In [4]: df = ser.to_frame("A")                                                  
In [5]: %timeit df.at[501, "A"]                                                 
3.9 µs ± 25.4 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <-- master
7.71 µs ± 99.4 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <-- PR

We might get some of that back from #31709.

jbrockmendel · 2020-02-06T16:56:07Z

How does this indirectly also address #31683 ? And if it does, shouldn't there be tests involving categorical / a whatsnew?

With this in place, all the places from which index.get_value are called are preceeded by maybe_cast_scalar_indexer (except one in which the key is a tuple). That makes the call inside CategoricalIndex.get_value unnecessary, which addresses the motivating issue for #31683. The "indexing on CategoricalIndex is hard" problem is still a problem.

jreback

consistency is a huge + here, makes the code much better.

I am not sure if @WillAyd or @jorisvandenbossche have comments, but lgtm.

WillAyd

lgtm. parametrization will be a nice follow on

jreback · 2020-02-06T23:58:11Z

thanks @jbrockmendel

jorisvandenbossche · 2020-02-07T06:25:42Z

@jreback maybe you didn't notice, but this time I had a "request changes" open, as you asked previous time to indicate that you shouldn't merge without me having the chance to answer back.
If there is still discussion going on, please give it some time.

jorisvandenbossche · 2020-02-07T06:26:37Z

@jbrockmendel can you do a follow up with:

fixing the performance regression
fixing the usability regression in the error message

jorisvandenbossche · 2020-02-07T08:49:26Z

fixing the usability regression in the error message

Looking back at it, it's actually mainly the long fully qualified class name that is annoying, so that is an easy fix -> #31769

API/BUG: make .at raise same exceptions as .loc

37c0246

jbrockmendel added the Indexing Related to indexing on series/frames, not to indexes themselves label Feb 5, 2020

WillAyd requested changes Feb 6, 2020

View reviewed changes

jbrockmendel added 3 commits February 5, 2020 17:49

Merge branch 'master' of https://github.com/pandas-dev/pandas into at…

b67d998

…-errors

add KeyError to whatsnew

630b874

targetd tests

37feaf6

jorisvandenbossche requested changes Feb 6, 2020

View reviewed changes

jreback added this to the 1.1 milestone Feb 6, 2020

jreback approved these changes Feb 6, 2020

View reviewed changes

WillAyd approved these changes Feb 6, 2020

View reviewed changes

jreback merged commit e1ca66b into pandas-dev:master Feb 6, 2020

jbrockmendel deleted the at-errors branch February 7, 2020 00:12

This was referenced Feb 7, 2020

Why does CategoricalIndex.get_value call _convert_scalar_indexer? #31683

Closed

REF: Remove CategoricalIndex.get_value #31765

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API/BUG: make .at raise same exceptions as .loc #31724

API/BUG: make .at raise same exceptions as .loc #31724

jbrockmendel commented Feb 5, 2020 •

edited

Loading

WillAyd Feb 6, 2020

jorisvandenbossche left a comment

jorisvandenbossche Feb 6, 2020

jbrockmendel Feb 6, 2020

jbrockmendel commented Feb 6, 2020

jbrockmendel commented Feb 6, 2020

jreback left a comment

WillAyd left a comment

jreback commented Feb 6, 2020

jorisvandenbossche commented Feb 7, 2020

jorisvandenbossche commented Feb 7, 2020 •

edited

Loading

jorisvandenbossche commented Feb 7, 2020

API/BUG: make .at raise same exceptions as .loc #31724

API/BUG: make .at raise same exceptions as .loc #31724

Conversation

jbrockmendel commented Feb 5, 2020 • edited Loading

WillAyd Feb 6, 2020

Choose a reason for hiding this comment

jorisvandenbossche left a comment

Choose a reason for hiding this comment

jorisvandenbossche Feb 6, 2020

Choose a reason for hiding this comment

jbrockmendel Feb 6, 2020

Choose a reason for hiding this comment

jbrockmendel commented Feb 6, 2020

jbrockmendel commented Feb 6, 2020

jreback left a comment

Choose a reason for hiding this comment

WillAyd left a comment

Choose a reason for hiding this comment

jreback commented Feb 6, 2020

jorisvandenbossche commented Feb 7, 2020

jorisvandenbossche commented Feb 7, 2020 • edited Loading

jorisvandenbossche commented Feb 7, 2020

jbrockmendel commented Feb 5, 2020 •

edited

Loading

jorisvandenbossche commented Feb 7, 2020 •

edited

Loading