Extend Scalar sub-type of Index to the __iter__() method #367

gandhis1 · 2022-10-06T05:10:11Z

Closes DataFrame columns should return a type containing Hashable (or Scalar?) elements #365
Tests added: Please use assert_type() to assert the type of any return value

As mentioned in that issue, I am a little confused about what the type of a column or index label is. It feels like in some places we decided it is a Hashable and in other places we decided it is a Scalar. To stay consistent with the way this was already annotated, I continued to use Scalar.
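To make the Hashable-versus-Scalar question concrete, here is a minimal runtime sketch. `SCALAR_TYPES` and `is_simple_scalar` are hypothetical, simplified stand-ins invented for illustration; the actual `Scalar` alias in the stubs covers more types than shown here.

```python
from collections.abc import Hashable

# Hypothetical, simplified stand-in for a "scalar" label (assumption: the
# real Scalar alias in the stubs is wider than this tuple of types).
SCALAR_TYPES = (str, bytes, bool, int, float, complex)


def is_simple_scalar(label: object) -> bool:
    """Return True if the label is one of the simplified scalar types."""
    return isinstance(label, SCALAR_TYPES)


# Candidate labels: plain scalars, a tuple, and a frozenset.
labels = ["a", 1, 2.5, ("a", 1), frozenset({1})]
for label in labels:
    # Print the label, whether it is hashable, and whether it is a "scalar".
    print(repr(label), isinstance(label, Hashable), is_simple_scalar(label))
```

Every label above is hashable, but only the first three count as scalars under this simplified definition, so an annotation based on Hashable accepts strictly more values than one based on Scalar.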

pandas-stubs/core/indexes/base.pyi

tests/test_frame.py

pandas-stubs/core/frame.pyi

gandhis1 · 2022-11-06T03:34:38Z

pandas-stubs/core/frame.pyi

@@ -497,8 +497,7 @@ class DataFrame(NDFrame, OpsMixin):
    @overload
    def __getitem__(
        self,
-        idx: tuple
-        | Series[_bool]
+        idx: Series[_bool]


x = pd.DataFrame([[1, 2, 3], [4, 5, 6], [7, 8, 9]], index=["a", "b", "c"])

       0  1  2
    a  1  2  3
    b  4  5  6
    c  7  8  9

x[[0, 1]] works

x[(0, 1)] does not, because the tuple is interpreted as a single column label
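The difference can be checked at runtime with a short sketch. It reuses the frame from the comment above; the variable names `subset`, `tuple_lookup_failed`, and `y` are introduced here for illustration only.

```python
import pandas as pd

x = pd.DataFrame([[1, 2, 3], [4, 5, 6], [7, 8, 9]], index=["a", "b", "c"])

# A list selects several columns and returns a DataFrame.
subset = x[[0, 1]]
print(list(subset.columns))

# A tuple is looked up as one column label; no column is named (0, 1) here.
try:
    x[(0, 1)]
    tuple_lookup_failed = False
except KeyError:
    tuple_lookup_failed = True
print(tuple_lookup_failed)

# With MultiIndex columns, a tuple is a valid label and selects one column.
y = pd.DataFrame(
    [[1, 2]], columns=pd.MultiIndex.from_tuples([("a", "x"), ("a", "y")])
)
print(type(y[("a", "x")]).__name__)
```

So a bare `tuple` in the multi-column overload does not match what pandas does, while a tuple of scalars is still a legitimate single label when the columns form a MultiIndex.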

gandhis1 · 2022-11-06T05:39:16Z

So it appears NDArray is being treated as a Hashable, which creates overlapping overloads according to mypy. I don't see any way to fix this currently.

So I took a different approach: I went back to annotating with Scalar, but augmented that Union with a Tuple[Scalar, ...]. I added some tests to illustrate some of the edge cases. One old test is failing and I am not sure what the best course of action is there, so I added a note.
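A rough sketch of the "Scalar plus Tuple[Scalar, ...]" idea is shown below. `ToyFrame`, the simplified `Scalar` alias, and `Label` are hypothetical names made up for this example; this is not the code in pandas-stubs, only an illustration of how the two overloads stay disjoint.

```python
from typing import Dict, List, Tuple, Union, overload

# Hypothetical simplified aliases (assumption: the real Scalar is wider).
Scalar = Union[str, int, float, bool]
Label = Union[Scalar, Tuple[Scalar, ...]]


class ToyFrame:
    """Toy column container; only mimics the shape of DataFrame.__getitem__."""

    def __init__(self, data: Dict[Label, List[int]]) -> None:
        self._data = data

    # A single label (scalar or tuple of scalars) returns one column.
    @overload
    def __getitem__(self, idx: Label) -> List[int]: ...

    # A list of labels returns a new frame with only those columns.
    @overload
    def __getitem__(self, idx: List[Label]) -> "ToyFrame": ...

    def __getitem__(self, idx):
        if isinstance(idx, list):
            return ToyFrame({key: self._data[key] for key in idx})
        return self._data[idx]


frame = ToyFrame({"a": [1, 2], ("b", 0): [3, 4]})
print(frame["a"])
print(frame[("b", 0)])
print(type(frame[["a", ("b", 0)]]).__name__)
```

Because a list is never a `Label` in this sketch, the two overloads do not overlap, which is the property that the Hashable-based annotation could not provide.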

tests/test_frame.py

pandas-stubs/core/frame.pyi

Dr-Irv

thanks @gandhis1

Dr-Irv reviewed Oct 6, 2022

pandas-stubs/core/indexes/base.pyi

gandhis1 commented Oct 12, 2022

tests/test_frame.py

Dr-Irv requested changes Oct 12, 2022

pandas-stubs/core/frame.pyi

gandhis1 marked this pull request as draft October 20, 2022 02:47

gandhis1 force-pushed the index branch 2 times, most recently from 3190539 to 0f5c899, November 6, 2022 03:30

gandhis1 commented Nov 6, 2022

gandhis1 force-pushed the index branch from 0f5c899 to 5426990, November 6, 2022 05:40

gandhis1 commented Nov 6, 2022

tests/test_frame.py

Dr-Irv reviewed Nov 8, 2022

pandas-stubs/core/frame.pyi

gandhis1 force-pushed the index branch 2 times, most recently from ce62af9 to 4337b8d, November 26, 2022 17:22

gandhis1 marked this pull request as ready for review December 3, 2022 22:31

Fix return type of __iter__() and tuple-handling for __getitem__()

1797a49

gandhis1 force-pushed the index branch from 4337b8d to 1797a49, December 3, 2022 22:33

Dr-Irv approved these changes Dec 5, 2022

Dr-Irv merged commit bc61543 into pandas-dev:main Dec 5, 2022

Dr-Irv mentioned this pull request Feb 10, 2023

remove types from Index.__iter__() #532

Merged
