Support for pyright 1.1.310 #716

Dr-Irv · 2023-05-29T19:57:10Z

Closes pyright above 1.1.308 has failures on our tests #711
Tests added: None

See discussion in microsoft/pyright#5178

Recommendation was to return Self instead of Series[S1] . But if you do that when we don't know the type, it breaks mypy. So if we don't know the type of the Series, we return Series and that only broke a test related to pandera sublcassing, but only for pyright, so I modified that test to make pyright happy.

Dr-Irv · 2023-05-29T19:59:00Z

pandas-stubs/_typing.pyi

-    Interval[Timestamp],
-    Interval[Timedelta],
-    CategoricalDtype,
+    bound=str


See point #3 at microsoft/pyright#5178 (comment) , Recommended to use a Union rather than a constrained type var.

Does this also affect other TypeVars (could be done later)?

Can we use S1 = TypeVar("S1", bound=Scalar)? You replaced S1 with Scalar in some places

Does this also affect other TypeVars (could be done later)?

Yes, and I created an issue for that #719

Dr-Irv · 2023-05-29T20:00:57Z

pandas-stubs/core/series.pyi

+        name: Hashable | None = ...,
+        copy: bool = ...,
+        fastpath: bool = ...,
+    ) -> Self: ...


This overload was previously connected to the next one in terms of the data argument, but I split it into two to have it return Self when S1 is known, and to return Series if not.

Can the two Self overloads be combined?

edit: Should we relax S1 as we explicitly allow object (which is correct)?

edit2: The following replaces the last three overloads and passes all tests with pyright (but not mypy):

@overload def __new__( cls, data: Series[S1] | dict[int, S1] | dict[_str, S1] | object= ..., index: Axes | None = ..., dtype: type[S1] | type | str | None=..., name: Hashable = ..., copy: bool = ..., fastpath: bool = ..., ) -> Self: ...

About mypy errors: since they now have more frequent releases, I wouldn't mind making changes that break mypy-compatibility IF 1) there is a long-standing mypy issue, 2) after informing them that we will break compatibility, and 3) if it helps clean pandas-stubs up.

Can the two Self overloads be combined?

The reason I separated them is that we want to allow people to have an untyped data and specify dtype = type[S1], OR specify a typed data (where S1 can be inferred) and not specify dtype. If we combine them, then it allows any object along with dtype: str and that is an untyped Series. That's where mypy has trouble.

edit: Should we relax S1 as we explicitly allow object (which is correct)?

No. I want to make it that if we know the type and can do something special with it (e.g., Series[Timestamp].__add__(Timedelta)) we differentiate that. object is too wide, so that will correspond to Series[Any] (or in pyright terms, Series[Unknown])

edit2: The following replaces the last three overloads and passes all tests with pyright (but not mypy):

@overload def __new__( cls, data: Series[S1] | dict[int, S1] | dict[_str, S1] | object= ..., index: Axes | None = ..., dtype: type[S1] | type | str | None=..., name: Hashable = ..., copy: bool = ..., fastpath: bool = ..., ) -> Self: ...

See above reason why I don't want to combine them.

The mypy error is reported here: python/mypy#15322

About mypy errors: since they now have more frequent releases, I wouldn't mind making changes that break mypy-compatibility IF 1) there is a long-standing mypy issue, 2) after informing them that we will break compatibility, and 3) if it helps clean pandas-stubs up.

I'm more hesitant to do that. Most of our bug reports from the community are things people find by using mypy. It is the dominant type checker for now, so we should make sure we handle people's code with it.

edit: Should we relax S1 as we explicitly allow object (which is correct)?

No. I want to make it that if we know the type and can do something special with it (e.g., Series[Timestamp].__add__(Timedelta)) we differentiate that. object is too wide, so that will correspond to Series[Any] (or in pyright terms, Series[Unknown])

I didn't mean to use data: object but to simply widen S1 to S1 = TypeVar("S1"), which allows any "object". Wouldn't this still allow special overloads? In either way, this PR is targeted for pyright compatibility and shouldn't depend on that.

Dr-Irv · 2023-05-29T20:01:30Z

pandas-stubs/core/series.pyi

-    def __add__(self, other: TimedeltaSeries | np.timedelta64) -> TimestampSeries: ...  # type: ignore[override]
-    def __radd__(self, other: TimedeltaSeries | np.timedelta64) -> TimestampSeries: ...  # type: ignore[override]
+    def __add__(self, other: TimedeltaSeries | np.timedelta64 | timedelta) -> TimestampSeries: ...  # type: ignore[override]
+    def __radd__(self, other: TimedeltaSeries | np.timedelta64 | timedelta) -> TimestampSeries: ...  # type: ignore[override]


Apparently pyright 1.1.310 had bugs that revealed we were missing some types here

tests/test_frame.py

Dr-Irv · 2023-05-29T20:02:36Z

tests/test_series.py

@@ -1604,12 +1605,14 @@ def test_pandera_generic() -> None:
    T = TypeVar("T")

    class MySeries(pd.Series, Generic[T]):
-        ...
+        def __new__(cls, *args, **kwargs) -> Self:
+            return object.__new__(cls)


This makes pyright not complain.

Dr-Irv · 2023-05-29T20:03:00Z

tests/test_series.py


    def func() -> MySeries[float]:
        return MySeries[float]([1, 2, 3])

-    func()
+    result = func()
+    assert result.iloc[1] == 2


I modified the test to make sure that defining __new__() didn't change the result.

twoertwein · 2023-05-30T15:23:35Z

Thanks @Dr-Irv !

Dr-Irv added 2 commits May 25, 2023 19:42

Use Self with __new__

8f1a853

change S1 to use Union, add __new__ to test_pandera_generic

ac865c8

Dr-Irv requested a review from twoertwein May 29, 2023 19:57

Dr-Irv commented May 29, 2023

View reviewed changes

tests/test_frame.py Show resolved Hide resolved

Dr-Irv commented May 29, 2023

View reviewed changes

Dr-Irv mentioned this pull request May 29, 2023

CLEAN: Investigate whether any TypeVar that constraints to a list of types can change to being bound to a Union #719

Closed

twoertwein merged commit 630efce into pandas-dev:main May 30, 2023

Dr-Irv deleted the pyright310 branch December 2, 2024 20:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for pyright 1.1.310 #716

Support for pyright 1.1.310 #716

Dr-Irv commented May 29, 2023

Dr-Irv May 29, 2023

twoertwein May 29, 2023

twoertwein May 29, 2023

twoertwein May 29, 2023

Dr-Irv May 30, 2023

Dr-Irv May 29, 2023

twoertwein May 29, 2023 •

edited

Loading

Dr-Irv May 30, 2023

twoertwein May 30, 2023

Dr-Irv May 29, 2023

Dr-Irv May 29, 2023

Dr-Irv May 29, 2023

twoertwein commented May 30, 2023

Support for pyright 1.1.310 #716

Support for pyright 1.1.310 #716

Conversation

Dr-Irv commented May 29, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

twoertwein May 29, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

twoertwein commented May 30, 2023

twoertwein May 29, 2023 •

edited

Loading