ENH: add einsum #127
Conversation
I can just imagine the pain that was implementing this one. I left a few comments, but overall looks good.
torch_np/_funcs_impl.py
Outdated
from ._normalizations import (
    maybe_copy_to,
    normalize_casting,
    normalize_dtype,
    normalize_not_implemented,
    normalize_outarray,
    wrap_tensors,
)
Just define it at the end of _funcs.py really, and leave a note as to why.
If removing imports is the goal, that won't help really. These normalizers are otherwise only used in _normalizations.py, so these need to be imported either way.
What I'd like to remove is local imports. They are a massive red flag
That would create an import cycle, since _funcs.py imports _funcs_impl. Also, from _ndarray import ndarray must be local (as it is in _normalizations).
We can, of course, import these at the _funcs_impl module level and special-case them so they don't get dumped into the global namespace, if that's really what you think is needed?
We would just need to import things from _normalizations (which we already do) and, well, we would still have the ndarray import dangling there, but there's not much that can be done about that really, as we have that one all throughout the codebase.
OK, done in the last commit
torch_np/_funcs_impl.py
Outdated
parm = lambda _: None  # a fake duck-typed inspect.Parameter stub
parm.name = "out"
out = normalize_outarray(out, parm=parm)

parm.default = "K"
parm.name = "order"
order = normalize_not_implemented(kwargs.pop("order", "K"), parm=parm)
if kwargs:
    raise TypeError("unknown arguments: ", kwargs)
These two normalizers don't do all that much, so let's just do the error checking directly here for conciseness.
Also assert that optimize is False, otherwise raise a NotImplementedError, similar to order and so on.
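For illustration, a minimal sketch of such inline checks; the signature and error messages here are assumptions, not the PR's final code:

def einsum(*operands, out=None, dtype=None, **kwargs):
    # Reject parameters that are not implemented, without going through
    # the generic normalizer machinery.
    order = kwargs.pop("order", "K")
    if order != "K":
        raise NotImplementedError(f"'order' is not supported (got {order!r})")
    optimize = kwargs.pop("optimize", False)
    if optimize is not False:
        raise NotImplementedError(f"'optimize' is not supported (got {optimize!r})")
    if kwargs:
        raise TypeError(f"unknown arguments: {tuple(kwargs)}")
    ...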
Oh. Now that I re-read https://pytorch.org/docs/stable/generated/torch.einsum.html, there is torch.backends.opt_einsum = "greedy", so we can support this :-). Question: is there a way of controlling torch.backends in a context manager, other than a try... finally block?
I don't think we have a context manager for that, no. try...finally seems reasonable to me.
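A minimal sketch of the try...finally approach, assuming the knob is torch.backends.opt_einsum.strategy (as in recent PyTorch releases) and using a hypothetical helper name:

import torch

def _einsum_with_strategy(strategy, subscripts, *operands):
    # Temporarily switch the contraction strategy, restoring the previous
    # value even if torch.einsum raises. The strategy only takes effect
    # when the opt_einsum package is available.
    prev = torch.backends.opt_einsum.strategy
    try:
        torch.backends.opt_einsum.strategy = strategy
        return torch.einsum(subscripts, *operands)
    finally:
        torch.backends.opt_einsum.strategy = prev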
Thanks, done in 211a461
torch_np/_funcs_impl.py
Outdated
else:
    # op, str, op, str ... format: normalize every other argument
    sublist_format = True
    array_operands = operands[:-1][::2]
Why the [:-1]? Isn't sublistout optional? Also, don't we want to preprocess that one as well, perhaps asserting that it's an ndarray?
Exactly!

- If sublistout is not given, the length of operands is even, and we pick the odd-numbered elements, which are arrays.
- If sublistout is given, the length of operands is odd; we peel off the last one and pick the odd-numbered elements, which are arrays. Without [:-1], we would have picked sublistout, too --- and it's a sublist, not an array.

And no, sublistout is not really an array; it can contain e.g. an Ellipsis:

assert_equal(np.einsum("...i->...", a, optimize=do_opt),
             np.sum(a, axis=-1).astype(dtype))
assert_equal(np.einsum(a, [Ellipsis, 0], [Ellipsis], optimize=do_opt),
             np.sum(a, axis=-1).astype(dtype))
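Spelled out, the splitting described above is roughly equivalent to the following sketch (names are illustrative, not the PR's code):

def split_sublist_operands(operands):
    # operands is (op0, sublist0, op1, sublist1, ..., [sublistout]).
    operands = list(operands)
    sublistout = operands.pop() if len(operands) % 2 == 1 else None
    arrays = operands[::2]      # positions 0, 2, 4, ... are the array operands
    sublists = operands[1::2]   # positions 1, 3, 5, ... are the index sublists
    return arrays, sublists, sublistout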
Right. Could you please leave a comment or a link to this explanation?
torch_np/_funcs_impl.py
Outdated
is_short_int = target_dtype in [torch.uint8, torch.int8, torch.int16, torch.int32]
if is_short_int:
    target_dtype, result_dtype = torch.int64, target_dtype
result_dtype is not being used.
Thanks, removed.

NB it is not used because the current implementation follows pytorch in that e.g.

In [48]: a = np.arange(8, dtype=np.int8)

In [49]: torch.einsum(torch.as_tensor(a), [0], []).dtype
Out[49]: torch.int64

unlike numpy, where the last line is int8 (sigh).
Yeah, this is because, as with sum and other reductions that accumulate, integers are upcast to int64. This function simply uses sum and bmm internally, so it has the same semantics as those two.
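The promotion is easy to see in isolation (a quick check, not code from the PR):

import torch

a = torch.arange(8, dtype=torch.int8)
print(a.sum().dtype)   # torch.int64 -- integer reductions accumulate in int64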
Great! What a pain of a function...
Thanks Mario for the review!
A couple of notes:

- This sort of signature only appears in einsum and gradient, so keep the generic machinery simple and do things manually here.
- NumPy's einsum takes **kwargs, so follow that. Not sure why it is what it is; one possible reason could be that keyword-only args after varargs is a syntax error (?), e.g. def func(a, *args, *, out=None): pass (illustrated below).
- torch.einsum does not have the optimize = {False, True, 'greedy', 'optimal'} argument, so silently ignore this argument.

Also, while at it, add a dedicated annotation to validate the casting=... argument. This is strictly speaking extraneous to this PR, so can take it out if desired.
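For what it's worth, the syntax point can be checked directly; a quick illustration (not code from the PR) showing that the bare-* spelling is rejected by the parser, while a keyword-only out after *args can still be written without it:

# def func(a, *args, *, out=None): pass   # SyntaxError: a second bare `*` is not allowed
def func(a, *args, out=None):             # parameters after *args are keyword-only anyway
    return args, out

print(func(1, 2, 3, out="x"))             # ((2, 3), 'x')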