Mv cleanup #3008


Merged (8 commits, Jun 11, 2018)

Conversation

@gBokiau (Contributor) commented Jun 5, 2018

This picks up some of the most straightforward edits from #2847. Guiding principle:

Whatever is implemented by the Mv distributions is best delegated to the Mv distributions

Aka DRY.

For review:
In GP it appeared that covariance matrices were being stabilised multiple times over. Perhaps this was a side-effect of not following the above principle, unless there was some ground for doing it, e.g. if _build_marginals() has some way of rendering the covs non-positive-definite even after they have first been stabilised. Please review.

@gBokiau (Contributor, Author) left a comment:

Remaining doubts about stabilisation

pymc3/gp/gp.py (outdated)

    @@ -418,15 +417,15 @@ def marginal_likelihood(self, name, X, y, noise, is_observed=True, **kwargs):
         if not isinstance(noise, Covariance):
             noise = pm.gp.cov.WhiteNoise(noise)
         mu, cov = self._build_marginal_likelihood(X, noise)
    -    chol = cholesky(stabilize(cov))
    +    cov = stabilize(cov)
@gBokiau (Contributor, Author):

I'm not sure the stabilisation is needed here since noise is already being added to the diagonal in _build_marginal_likelihood.
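The redundancy argument can be illustrated outside pymc3. Below is a minimal numpy sketch (rbf_kernel and the noise value are made up for illustration; pymc3's stabilize is, as I understand it, just a small diagonal jitter): once the observation noise is added to the diagonal, the covariance is positive definite and the Cholesky succeeds without any extra jitter.

```python
# Illustration only, not pymc3 code: adding observation noise to the
# diagonal already stabilises the covariance, so an extra jitter term
# should be redundant in the noisy (marginal-likelihood) case.
import numpy as np

def rbf_kernel(X, ls=1.0):
    # Squared-exponential kernel; exactly singular for duplicated inputs.
    d2 = (X[:, None] - X[None, :]) ** 2
    return np.exp(-0.5 * d2 / ls ** 2)

X = np.array([0.0, 0.0, 1.0, 2.0])   # duplicated point -> singular K
K = rbf_kernel(X)                    # np.linalg.cholesky(K) would fail here
noise = 0.1

# What _build_marginal_likelihood effectively does with WhiteNoise:
K_noisy = K + noise * np.eye(len(X))
L = np.linalg.cholesky(K_noisy)      # succeeds without stabilize()
```

The duplicated input makes the bare kernel matrix exactly rank-deficient, which is the worst case stabilize() is meant to guard against; the noise term alone is enough here.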

pymc3/gp/gp.py (outdated)

    @@ -959,7 +956,7 @@ def _build_conditional(self, Xnew, pred_noise, diag):
         cov = Km - Asq
         if pred_noise:
             cov += sigma * np.eye(cov.shape)
    -    return mu, cov
    +    return mu, stabilize(cov)
@gBokiau (Contributor, Author) commented Jun 5, 2018:

Ditto; actually, I'm almost certain this is redundant (moved from line 999, BTW). Maybe not redundant when pred_noise == False, though.

    self.nu = nu = tt.as_tensor_variable(nu)
    self.init = init
Member:

Nice simplification!

@twiecki (Member) commented Jun 6, 2018

@gBokiau Looks great, I think iterative changes are the way to go. Definitely want @aseyboldt and @bwengals to take a close look.

@aseyboldt (Member) left a comment:

The error message would be nice, but looks good to me. Thanks!


    self.mu = mu = tt.as_tensor_variable(mu)
    self.init = init
    self.mean = tt.as_tensor_variable(0.)
    self.innovArgs = (self.mu, cov, tau, chol, lower)
    self.innov = multivariate.MvNormal.dist(*self.innovArgs)

    def logp(self, x):
Member:

This only works if x has rank 1, right? Maybe we should intercept the shape in __init__ (one of the kwargs) and throw an error if that doesn't have length 1, just to give people a better error message.

@gBokiau (Contributor, Author):

Er, wait, I would hope not; wouldn't rank 1 defeat the purpose of Mv?
I'll run the demo to get a better grasp. More tests for the timeseries would be nice w.r.t. shapes, see #2859 (comment).

@gBokiau (Contributor, Author):

I checked to be sure: this only accepts x and cov/chol with ndim 2, and correctly checks that the shapes are compatible.

@gBokiau (Contributor, Author):

Related observations:

  • To be more precise, it checks/catches any mismatch between shape and cov upon sampling, not at declaration time. That is also the behaviour of MvN/MvT. While that could be improved, I think it should then happen in those classes.
  • Another question entirely (for later, perhaps) is whether we want to intercept any such errors here, so as to make them more contextual for the timeseries classes.

However, this is all probably not what your concern about the rank was?

Member:

I should have said rank 2, not 1. I didn't actually think about the fact that mvnormal doesn't support higher-rank input either, but I still think we should check this in this class. If at some point in the future we extend mvnormal, people wouldn't get an error then, but I'm not sure the results would be what we want. (E.g. which dimension is the one with the correlations? In mvnormal it is the last one....)
Here is the code in mvnormal that I have in mind:
https://github.com/pymc-devs/pymc3/blob/master/pymc3/distributions/multivariate.py#L37
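The shape convention being discussed can be sketched in plain numpy (mv_quad is a hypothetical stand-in for illustration, not pymc3 code): x has rank 2 with shape (n_points, k), and the last axis is the correlated one, so chol must be (k, k).

```python
# Hypothetical illustration of the rank-2, last-axis-correlated convention
# that an MvNormal-style logp assumes.
import numpy as np

def mv_quad(x, chol):
    """Per-row quadratic form x_i^T K^{-1} x_i, where K = chol @ chol.T."""
    if x.ndim != 2:
        raise ValueError("x must have rank 2: (n_points, k)")
    if chol.shape != (x.shape[-1], x.shape[-1]):
        raise ValueError("last axis of x must match chol")
    z = np.linalg.solve(chol, x.T)   # (k, n): whiten each row of x
    return (z ** 2).sum(axis=0)      # one scalar per row of x
```

With chol = np.eye(2), each row's quadratic form is just its squared norm; a rank-1 x raises immediately, which is the kind of early check being proposed.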

@gBokiau (Contributor, Author):

Right, gotcha. If my understanding is correct, ndim=1 doesn't make sense for a random walk either, right? So shape can only be two-dimensional for these distributions.

So, as it stands, I would simply add this check in __init__:

    if len(self.shape) != 2:
        raise ValueError("Only 2 dimensions are allowed.")

Perhaps for a separate PR: when given 3 dims, we could (I think?) diff the obs separately along the time axis, concatenate/stack the resulting innovations down to two dims, and pass that on to Mv.
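A self-contained sketch of where that guard would live (MvRandomWalkShapeCheck is a hypothetical stand-in; the real __init__ would of course also set up the Mv innovation distribution):

```python
# Hypothetical stand-in showing the proposed __init__ shape guard for the
# Mv random-walk timeseries classes; not actual pymc3 code.
class MvRandomWalkShapeCheck:
    def __init__(self, shape):
        # Normalise a bare int to a 1-tuple before checking.
        shape = tuple(shape) if hasattr(shape, "__len__") else (shape,)
        if len(shape) != 2:
            raise ValueError(
                "Only 2 dimensions are allowed; got shape %s. "
                "Expected (n_timesteps, k)." % (shape,))
        self.shape = shape
```

This fails at declaration time with a message naming the expected layout, rather than letting the mismatch surface later from deep inside MvNormal during sampling.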

@pymc-devs pymc-devs deleted a comment from gBokiau Jun 6, 2018
@bwengals (Contributor) commented Jun 8, 2018

These look good to me!

@twiecki twiecki merged commit 1ef580e into pymc-devs:master Jun 11, 2018
@twiecki (Member) commented Jun 11, 2018

Thanks @gBokiau!
