Add specification for returning the least-squares solution to a linear matrix equation (linalg: lstsq) #119

kgryte · 2021-01-25T19:57:42Z

This PR

- specifies the interface for returning the least-squares solution to a linear matrix equation.
- is derived from comparing signatures across array libraries.

Notes

Only TF allows for providing a stack of matrices. Torch, MXNet, CuPy, NumPy, and JAX do not. This proposal follows TF and ensures consistency with other linalg interfaces which currently support stacks.
TF supports l2_regularizer and fast keyword arguments and is alone in doing so.
Neither Dask, Torch, nor TF support an rcond keyword argument. This proposal includes an rtol argument (note: rtol is renamed from rcond to unify tolerance keywords across pinv, lstsq, and matrix_rank), similar to the pinv proposal.
Similar to pinv, the rtol argument can either be a float or an array and has default values determined by type promotion rules.
NumPy, MXNet, CuPy, and JAX all support b being specified as either a vector or matrix. TF requires an (..., M,K) matrix. This PR follows NumPy.
Return results:
- TF only returns an array containing solutions.
- Torch returns a namedtuple of solutions and the QR factorization.
- NumPy et al return a tuple.
- Dask returns a tuple with a rank field which is an array.
- NumPy et al return a rank field which is an integer.
- NumPy returns a residuals field which is empty for low-rank or over-determined solutions. JAX always returns residuals for JIT purposes, unless one sets numpy_resid=True.
This proposal returns a namedtuple with a rank field which is an array, due to support for providing stacks of matrices, and also requires that the residuals field always be returned, following JAX.
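
To make the shape conventions in the notes above concrete, here is a minimal sketch written against NumPy. It is not the specification text and not any library's implementation: the names `LstsqResult` and `lstsq_stacked` are hypothetical, the four field names follow NumPy's return order, the default `rtol` of `max(M, N) * eps` is an assumption, and the SVD-based solve is just one convenient way to get a minimum-norm solution for a stack of matrices.

```python
from typing import NamedTuple

import numpy as np


class LstsqResult(NamedTuple):  # hypothetical name, for illustration only
    x: np.ndarray          # (..., N, K) least-squares solutions
    residuals: np.ndarray  # (..., K) sums of squared residuals, always returned
    rank: np.ndarray       # (...) one effective rank per matrix in the stack
    s: np.ndarray          # (..., min(M, N)) singular values


def lstsq_stacked(a, b, rtol=None):
    """Solve min ||a @ x - b|| for a stack a of shape (..., M, N), b of shape (..., M, K)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    m, n = a.shape[-2:]
    if rtol is None:
        # Assumed default; the real default is whatever the spec says.
        rtol = max(m, n) * np.finfo(a.dtype).eps
    # np.linalg.svd broadcasts over the leading (stack) dimensions.
    u, s, vh = np.linalg.svd(a, full_matrices=False)
    # Singular values at or below rtol * largest singular value count as zero.
    keep = s > rtol * s.max(axis=-1, keepdims=True)
    rank = keep.sum(axis=-1)
    s_inv = np.where(keep, 1.0 / np.where(keep, s, 1.0), 0.0)
    # x = V @ diag(s_inv) @ U^T @ b, applied to every matrix in the stack.
    utb = np.swapaxes(u, -1, -2) @ b
    x = np.swapaxes(vh, -1, -2) @ (s_inv[..., None] * utb)
    # Residuals are computed unconditionally, so the output shape is static.
    residuals = np.sum((b - a @ x) ** 2, axis=-2)
    return LstsqResult(x, residuals, rank, s)
```

Because `rank` has one entry per matrix in the stack, it has to be an array rather than a Python integer, and computing `residuals` in every case keeps the result shape independent of the data.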

rgommers · 2021-01-26T14:27:11Z

> NumPy returns a residuals field which is empty for low-rank or over-determined solutions. JAX always returns residuals for JIT purposes.

The JAX docs say that the behaviour matches NumPy, so an empty array can be returned: jax.readthedocs.io/en/latest/_autosummary/jax.numpy.linalg.lstsq.html. The in-progress PR for torch.linalg.lstsq will also mostly match NumPy, probably returning empty residuals (pytorch/pytorch#49093 (comment)).

kgryte · 2021-01-26T18:42:32Z

@rgommers Re: JAX. Sorry, I should have clarified. JAX's default behavior does not match NumPy's.

> LAX-backend implementation of lstsq(). It has two important differences:
>
> - In numpy.linalg.lstsq, the default rcond is -1, and warns that in the future the default will be None. Here, the default rcond is None.
> - In np.linalg.lstsq the returned residuals are empty for low-rank or over-determined solutions. Here, the residuals are returned in all cases, to make the function compatible with jit. The non-jit compatible numpy behavior can be recovered by passing numpy_resid=True.

I've updated the OP accordingly.
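
For reference, the NumPy side of this difference can be observed directly. The sketch below only calls `np.linalg.lstsq`; the example matrices are made up for illustration. NumPy's own documentation states the empty case as "rank of a is < N or M <= N".

```python
import numpy as np

b = np.array([1.0, 2.0, 4.0])

# Full column rank with more rows than columns: residuals has one entry.
a_full = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]])
_, res_full, rank_full, _ = np.linalg.lstsq(a_full, b, rcond=None)

# Rank-deficient (the second column is all zeros): residuals is empty.
a_deficient = np.array([[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]])
_, res_def, rank_def, _ = np.linalg.lstsq(a_deficient, b, rcond=None)

# Square system (M <= N): residuals is empty even though the rank is full.
_, res_sq, rank_sq, _ = np.linalg.lstsq(np.eye(2), np.array([1.0, 2.0]), rcond=None)

print(res_full.shape, res_def.shape, res_sq.shape)
```

The shape of `residuals` therefore depends on the values in `a`, which is the property that a JIT-compiled function with static output shapes cannot reproduce.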

kgryte · 2021-02-16T05:14:00Z

Renamed rcond to tol to unify keyword arguments across pinv, lstsq, and matrix_rank APIs.

kgryte · 2021-03-04T09:17:02Z

Renamed tol to rtol to more explicitly indicate relative tolerance and pave the way for future specification evolution (e.g., atol).
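
As an illustration of what "relative" means here, the following sketch compares a relative cutoff with an absolute one. The helper names `rank_rtol` and `rank_atol` are hypothetical, and `atol` is shown only as the possible future counterpart mentioned above, not as part of this proposal.

```python
import numpy as np


def rank_rtol(a, rtol):
    """Count singular values above rtol times the largest singular value."""
    s = np.linalg.svd(a, compute_uv=False)
    return int(np.sum(s > rtol * s.max()))


def rank_atol(a, atol):
    """Hypothetical absolute variant: count singular values above atol."""
    s = np.linalg.svd(a, compute_uv=False)
    return int(np.sum(s > atol))


a = np.diag([1.0, 1e-3, 1e-9])
scaled = 1e6 * a  # same matrix in different units

# The relative cutoff gives the same rank before and after rescaling.
print(rank_rtol(a, 1e-6), rank_rtol(scaled, 1e-6))
# The absolute cutoff does not.
print(rank_atol(a, 1e-6), rank_atol(scaled, 1e-6))
```

With a relative tolerance, rescaling the matrix leaves the effective rank unchanged, which is why the keyword name spells out the distinction.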

leofang · 2021-03-11T17:20:53Z

The PR looks fine to me, just a few high-level design questions:

> NumPy, MXNet, CuPy, and JAX all support b being specified as either a vector or matrix. TF requires an (..., M,K) matrix. This PR follows TF to reduce API surface area and ensure consistency across all invocations.

I feel it's not very convenient to always request a matrix and forbid vector inputs. I understand we can always broadcast (..., M) to (..., M, 1) to make it work, but can't we do this internally (likely done in several libraries) to give users a bit more flexibility? For example, we can do pre-processing like this:

if x1.ndim == x2.ndim + 1:
    x2 = x2[..., None]  # or use newaxis
assert x1.ndim == x2.ndim
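
A self-contained version of that pre-processing might look like the sketch below. It assumes NumPy arrays, and `promote_ordinate` is a hypothetical helper name rather than anything in the proposal.

```python
import numpy as np


def promote_ordinate(x1, x2):
    """Return x2 as a (..., M, K) matrix plus a flag saying whether it was a vector."""
    x1 = np.asarray(x1)
    x2 = np.asarray(x2)
    # A vector ordinate has exactly one dimension fewer than the coefficient matrix.
    was_vector = x1.ndim == x2.ndim + 1
    if was_vector:
        x2 = x2[..., np.newaxis]  # (..., M) -> (..., M, 1)
    if x1.ndim != x2.ndim:
        raise ValueError("x2 must have shape (..., M) or (..., M, K)")
    return x2, was_vector


a = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]])
b_vec = np.array([1.0, 2.0, 3.0])

b_mat, was_vector = promote_ordinate(a, b_vec)
x = np.linalg.lstsq(a, b_mat, rcond=None)[0]
if was_vector:
    x = x[..., 0]  # hand back a vector when a vector came in
print(b_mat.shape, x.shape)
```

Comparing the number of dimensions, rather than inspecting shapes, keeps the rule unambiguous when `x1` is a stack of matrices.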

> NumPy et al return a namedtuple.

No, NumPy and CuPy return a tuple. Given that we didn't return namedtuple in SVD, perhaps we shouldn't do it here either to be consistent?

kgryte · 2021-03-11T19:39:50Z

@leofang Re: namedtuple and NumPy. You are correct. I misread the NumPy docs. I updated the OP. However, for the SVD proposal, we do return a namedtuple (see here). So returning one here is consistent with that proposal.

leofang · 2021-03-11T19:51:39Z

> However, for the SVD proposal, we do return a namedtuple (see here).

Ah OK, thanks Athan! I missed that and thought Tuple[ ... referred to an (unnamed) tuple.

kgryte · 2021-03-24T21:46:40Z

@leofang Re: matrix/vector input. I've updated the proposal to include support for an ordinate vector.

leofang

LGTM! Thanks @kgryte!

kgryte · 2021-05-12T04:58:26Z

Thanks, @leofang, for the review! This PR is ready for merge...

kgryte added 3 commits January 25, 2021 11:07

Stub spec

5bc330b

Document keyword

fc28df7

Update spec

e3f336d

This was referenced Jan 26, 2021

API for variable number of returns in linalg #95

Closed

Add specification for computing the pseudo-inverse (linalg: pinv) #118

Merged

kgryte added 5 commits February 1, 2021 11:15

Add missing parenthesis

b698326

Remove duplicate words

b5c3ee7

Rename keyword argument and add support for setting tolerance to a float

dcbedd4

Update copy

1e565ed

Reorder sentences

37e5558

Rename keyword argument

0443dcf

Fix name

27172b0

rgommers added the "API extension" label (Adds new functions or objects to the API.) Mar 20, 2021

kgryte added 2 commits March 24, 2021 13:57

Update copy

827153c

Add support for providing an ordinate vector

066c58f

kgryte added 5 commits March 24, 2021 17:04

Merge branch 'main' of https://github.com/pydata-apis/array-api into …

81db997

…lstsq

Update dtype requirements

b31ae6c

Update dtype requirements

bd013d1

Update type annotation

b10efee

Update copy

d406aa4

leofang approved these changes Mar 28, 2021

rgommers force-pushed the main branch from 35e967e to 2f8f5e4 Compare April 19, 2021 20:09

rgommers force-pushed the main branch 2 times, most recently from 0607525 to 138e963 Compare April 19, 2021 20:25

Move API to submodule

d670e12

kgryte merged commit e32a6a8 into main May 12, 2021

kgryte deleted the lstsq branch May 12, 2021 04:58

lezcano mentioned this pull request Jul 14, 2021

Return just the solution in linalg.lstsq #227

Closed
