bare-bones normalizations via type hints #70


Merged
ev-br merged 33 commits into main from normalizations on Mar 22, 2023

Conversation

ev-br (Collaborator) commented Feb 27, 2023

An experiment along the lines of the suggestion in #32: (ab)use type annotations to mark the logic for normalizing user-facing arguments into pytorch tensors and dtypes, reshuffling arguments etc. via static types.

A somewhat clumsy bit is normalizing *args; case in point: atleast_1d(*arys).
The issue is that func(*args: Annotation) gives a single annotation for a runtime-determined number of arguments. There is no way to annotate individual elements of *args, AFAICS.

Thus, register a special annotation to repack args into a tuple, and a corresponding normalizer to normalize this tuple.
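
For illustration, a minimal sketch of the pattern under discussion. The marker classes, helper names and the atleast_1d body here are illustrative stand-ins, not the PR's exact code: a decorator inspects each parameter's annotation and applies whatever normalizer is registered for it.

import functools
import inspect

import torch

# Hypothetical marker annotations; the PR's actual names and details differ.
class ArrayLike:
    pass  # "convert this argument to a torch.Tensor"

class UnpackedSeqArrayLike:
    pass  # "*args, gathered into a tuple; normalize each element"

def normalize_array_like(x):
    return x if isinstance(x, torch.Tensor) else torch.as_tensor(x)

def normalize_seq_array_like(xs):
    return tuple(normalize_array_like(x) for x in xs)

normalizers = {
    ArrayLike: normalize_array_like,
    UnpackedSeqArrayLike: normalize_seq_array_like,
}

def normalizer(func):
    sig = inspect.signature(func)

    @functools.wraps(func)
    def wrapped(*args, **kwds):
        bound = sig.bind(*args, **kwds)
        for name, value in bound.arguments.items():
            norm = normalizers.get(sig.parameters[name].annotation)
            if norm:
                bound.arguments[name] = norm(value)
        return func(*bound.args, **bound.kwargs)

    return wrapped

@normalizer
def atleast_1d(*arys: UnpackedSeqArrayLike):
    # by this point, every element of arys is a torch.Tensor
    return tuple(a if a.ndim >= 1 else a.reshape(1) for a in arys)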

Base automatically changed from free_funcs to main on February 27, 2023
ev-br mentioned this pull request on Feb 28, 2023
ev-br force-pushed the normalizations branch 4 times, most recently from c81bb9e to 3dfddd6, on March 4, 2023
ev-br changed the title from "WIP: bare-bones normalizations via type hints" to "bare-bones normalizations via type hints" on Mar 4, 2023
ev-br (Collaborator, Author) commented Mar 4, 2023

The initial scaffolding is there, so I am removing the WIP admonition. While this is not yet complete (ufuncs, reductions), it is ready for a look, if only to judge whether the general pattern is tolerable.

ev-br requested review from lezcano, honno and rgommers on March 4, 2023
ev-br force-pushed the normalizations branch 3 times, most recently from e94647a to edda25b, on March 10, 2023
ev-br added 10 commits on March 10, 2023
Gradual (!) typing WTF: annotating only the dtype lets us get rid of the dtype_to_torch decorator.
Annotating SeqArrayLike: typing TBD.
This is a bit clumsy: func(*args : Annotation) gives a single annotation for a
runtime-determined number of arguments. There is no way to annotate individual
elements of *args AFAICS.

Thus register a special annotation to repack args into a tuple and a normalizer
to normalize this tuple.
ev-br (Collaborator, Author) commented Mar 11, 2023

This is ready from my side.

Further possible cleanups include normalizing return values and implementing the out= kwarg handling (see the discussion below); both are somewhat large and need some experimentation, so they are best postponed to follow-up PRs.

@ev-br ev-br mentioned this pull request Mar 11, 2023
lezcano (Collaborator) commented Mar 11, 2023

Just passing by to say that the output one can be done without an annotation. Just do if isinstance(out, torch.Tensor): return as_array(out) in the normalization wrapper, and that'll get rid of all those _helper.array_from. You can do the same for tuples. This works because a function's returned value is already "normalised" (it'll always be a tensor, a tuple of tensors, or things that we don't want to touch).
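
A sketch of that suggestion (with as_array as a stand-in for the project's tensor-to-ndarray helper, and the tuple recursion being one reading of "do the same for tuples"):

import torch

def as_array(t):
    return t  # stand-in for the actual tensor -> ndarray conversion helper

def postprocess_return(result):
    # wrap returned tensors, recurse into tuples, leave everything else alone
    if isinstance(result, torch.Tensor):
        return as_array(result)
    if isinstance(result, tuple):
        return tuple(postprocess_return(r) for r in result)
    return result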

For the out= kwarg, we can do it like in PrimTorch:
https://github.com/pytorch/pytorch/blob/ab148da66cb9433effac90c7bd4930a961481d19/torch/_prims_common/wrappers.py#L187
Note that that code is quite tricky, because it handles namedtuples and it also puts in the correct annotation, but you can probably simplify it in our case.
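
A much-simplified sketch of that PrimTorch pattern (deliberately ignoring the namedtuple handling and annotation fix-ups that the linked code deals with):

import functools

def out_wrapper(func):
    @functools.wraps(func)
    def wrapped(*args, out=None, **kwds):
        result = func(*args, **kwds)
        if out is not None:
            out.copy_(result)  # write the result into the user-supplied tensor
            return out
        return result
    return wrapped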

ev-br (Collaborator, Author) commented Mar 11, 2023

Just do a if isinstance(out, torch.Tensor): return as_array(out)

Hmm, this breaks in the presence of an out= argument, unless as_array has the out= semantics?

lezcano (Collaborator) commented Mar 11, 2023

But if we implement the out= kwarg ourselves as a decorator, we should be able to make both things work with each other.

ev-br (Collaborator, Author) commented Mar 11, 2023

Certainly can. My point is simply that it's a bit more than just using if isinstance(result, torch.Tensor): return asarray(result), and I'd prefer to limit the scope of this PR and deal with returns in a follow-up.

lezcano (Collaborator) commented Mar 12, 2023

About annotating variadic args, does this SO answer solve the question? https://stackoverflow.com/a/37032111/5280578

Never mind, I see what you mean. Adding a different annotation for this sort of argument LGTM.

ev-br mentioned this pull request on Mar 16, 2023
lezcano (Collaborator) left a comment

torch_np/_wrapper starts looking good! Let's revisit the idea of moving the actual implementations of the torch functions to the wrapper after we have merged this PR and the return PR.

Comment on lines 104 to 116
# first, check for *args in positional parameters. Case in point:
#     atleast_1d(*arys: UnpackedSequenceArrayLike)
# if found, consume all args into a tuple to normalize as a whole
for j, param in enumerate(sig.parameters.values()):
    if param.annotation == UnpackedSeqArrayLike:
        if j == 0:
            args = (args,)
        else:
            # args = args[:j] + (args[j:],) would likely work;
            # not present in the numpy codebase, so do not bother just yet.
            # NB: branching on j == 0 avoids the empty tuple args[:j]
            raise NotImplementedError
        break
lezcano (Collaborator):

A better way to do this is to check below whether param.kind == VAR_POSITIONAL, and then, if so, treat it as a List[T], where T is the annotation.

ev-br (Collaborator, Author):

So your suggestion is to check for param.kind == VAR_POSITIONAL instead of adding a dedicated annotation?
Note that args = (args,) is still needed in some form; in typing language, it's not just List[T], it's Union[T, List[T]].
Probably I'm being dense here; would you mind elaborating?

lezcano (Collaborator):

It'd be a matter of adding an if in normalize_this checking whether it's a VAR_POSITIONAL arg and then processing the argument(s) accordingly.

ev-br (Collaborator, Author) commented Mar 21, 2023

That won't work, because tnp.atleast_1d(1, 2) gets two arguments but only a single annotation. I can of course check param.kind == VAR_POSITIONAL instead of a special annotation, and consume the rest of args instead of checking param.annotation == UnpackedSeqArrayLike, but that does not seem any simpler or clearer, does it? I mean, what's the endgame here.

ev-br (Collaborator, Author):

OK, how about 9d75cab?


import torch

# renames
lezcano (Collaborator):

Why the different imports here, rather than just doing a from torch import (blablalba)?

ev-br (Collaborator, Author):

This is isort, from the pre-commit run.

lezcano (Collaborator):

Terribly weird. @honno any idea what's going on here?

    return tuple(asarray(_) for _ in res)

@normalizer
def broadcast_arrays(*args: UnpackedSeqArrayLike, subok: SubokLike = False):
    args = args[0]  # undo the *args wrapping in normalizer
lezcano (Collaborator):

We should hopefully be able to fix this with a proper preprocessing of variadic inputs in the normalizer.

ev-br (Collaborator, Author):

Done in 9d75cab

lezcano (Collaborator) commented Mar 22, 2023

Let me make a counter-offer:

import inspect

def maybe_normalize(arg, parm):
    """Normalize arg if a normalizer is registered."""
    normalizer = normalizers.get(parm.annotation, None)
    return normalizer(arg) if normalizer else arg

def normalizer(func):
    def wrapped(*args, **kwds):
        params = inspect.signature(func).parameters
        first_param = next(iter(params.values()))
        # NumPy's API does not have positional args before variadic positional args
        if first_param.kind == inspect.Parameter.VAR_POSITIONAL:
            args = [maybe_normalize(arg, first_param) for arg in args]
        else:
            args = [maybe_normalize(arg, parm) for arg, parm in zip(args, params.values())]
        kwds = {
            name: maybe_normalize(arg, params[name]) if name in params else arg
            for name, arg in kwds.items()
        }
        return func(*args, **kwds)
    return wrapped

If you think that the comment about NumPy's API is not good enough and you want extra safety (I don't think we need it), you can assert that no parameter other than the first one is variadic. This should be one line as well.

Note: Do we need the lst += args[len(lst):]? What would be an example of a call where we need it?

ev-br (Collaborator, Author) commented Mar 22, 2023

This is slick! Taken over in 8dc2628, together with the args += extra_args_to_raise_later addition. What's the email to attribute the commit to?

Note: Do we need the lst += args[len(lst):]? What would be an example of a call where we need it?

Yes. Extra unknown positional arguments. The issue is that zip(short_sequence, longer_sequence) drops trailing elements from longer_sequence. Here's a test (np.nonzero only accepts a single argument):

    def test_unknown_args(self):
        # Check that unknown args to decorated functions fail
        a = w.arange(7) % 2 == 0
    
        # unknown positional args
        with assert_raises(TypeError):
>           w.nonzero(a, "kaboom")
E           Failed: DID NOT RAISE <class 'TypeError'>
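
For reference, a sketch of why the tail append matters (maybe_normalize and the lst name come from the counter-offer above; the function wrapper around them is illustrative):

def normalize_positional(args, params):
    # zip() stops at the shorter sequence, so unknown trailing positional
    # arguments would be silently dropped here...
    lst = [maybe_normalize(arg, parm) for arg, parm in zip(args, params)]
    # ...so append the unmatched tail; calling func(*lst) then lets Python
    # raise TypeError for functions that don't accept the extra arguments.
    lst += args[len(lst):]
    return lst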

lezcano (Collaborator):

Ah, right, I forgot we need to error out on those. In my head I always thought of the non-error case.

Also, I appreciate it, but no need to attribute the commit really :)


from . import _dtypes, _helpers, _decorators # isort: skip # XXX
from ._ndarray import array, asarray, maybe_set_base, ndarray
from ._normalizations import (
lezcano (Collaborator):

What's the difference between _funcs.py and _wrapper.py? Should we merge them?

ev-br (Collaborator, Author):

Absolutely. The split is temporary, and we'll cleanly merge them once the cleanups from other PRs further up the stack are done.
At this stage, imports from ._ndarray cause circular imports. So either we live with the split for a while, or I bloat this PR with what's up the stack. Reviewer's choice :-).

lezcano (Collaborator) left a comment

Let's avoid copying the args, but otherwise this LGTM. As discussed, let's merge all the PRs as-is (as-are?) and let's try to factor out the out= kwarg implementation and then have a simple implementation of the wrapping of the outputs.

ev-br force-pushed the normalizations branch 2 times, most recently from cc6cc7c to f5e5eaf, on March 22, 2023
ev-br merged commit 023d453 into main on Mar 22, 2023
ev-br deleted the normalizations branch on March 22, 2023