Possible bug in LKJ ?? #873

Closed · agoldin opened this issue Nov 12, 2015 · 12 comments

agoldin commented Nov 12, 2015

I tried to use the LKJ distribution to sample a correlation (and then covariance) matrix for an MvNormal, and if I sample long enough I always run into a problem. It happens regularly with ~16 columns and does not happen with 4; I am not completely sure where the cutoff is.

import pymc3 as pm,numpy as np
from theano import tensor as T
import scipy as sp

def triIndex(N):
    n_elem = N * (N - 1) // 2  # number of strictly-upper-triangle elements
    tri_index = np.zeros([N, N], dtype=int)
    tri_index[np.triu_indices(N, k=1)] = np.arange(n_elem)
    tri_index[np.triu_indices(N, k=1)[::-1]] = np.arange(n_elem)
    return tri_index    


N = 100
Nf = 16

mu_actual = sp.stats.uniform.rvs(-5, 10, size=Nf)
cov_actual_sqrt = sp.stats.uniform.rvs(0, 1, size=(Nf, Nf))
cov_actual = np.dot(cov_actual_sqrt.T, cov_actual_sqrt)

x = sp.stats.multivariate_normal.rvs(mu_actual, cov_actual, size=N)

#cov_actual

np.random.seed(3264602) 
tri = triIndex(Nf)
with pm.Model() as model:
    sigma = pm.Lognormal('sigma', np.zeros(Nf), np.ones(Nf), shape=Nf)

    C_triu = pm.LKJCorr('C_triu', 1, Nf)

    # expand the flattened upper triangle into a full correlation matrix
    C = C_triu[tri]
    C = T.fill_diagonal(C, 1)

    sigma_diag = pm.Deterministic('sigma_mat', T.nlinalg.diag(sigma))
    cov = pm.Deterministic('cov', T.nlinalg.matrix_dot(sigma_diag, C, sigma_diag))
    tau = pm.Deterministic('tau', T.nlinalg.matrix_inverse(cov))
    mu = pm.MvNormal('mu', 0, tau, shape=Nf)

    x_ = pm.MvNormal('x', mu, tau, observed=x)

    s = pm.Metropolis()
    trace = pm.sample(1000, s)
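To make the index trick above concrete, here is a pure-numpy sketch (not pymc3 code) of what `triIndex` plus `fill_diagonal` does: the flattened upper triangle is expanded into a full symmetric matrix with a unit diagonal.

```python
import numpy as np

def triIndex(N):
    # same helper as above: a symmetric matrix of indices into the
    # flattened strictly-upper triangle
    n_elem = N * (N - 1) // 2
    tri_index = np.zeros([N, N], dtype=int)
    tri_index[np.triu_indices(N, k=1)] = np.arange(n_elem)
    tri_index[np.triu_indices(N, k=1)[::-1]] = np.arange(n_elem)
    return tri_index

flat = np.array([0.1, 0.2, 0.3])   # upper triangle of a 3x3 matrix
C = flat[triIndex(3)]              # fancy indexing makes a fresh array
np.fill_diagonal(C, 1.0)           # numpy analogue of T.fill_diagonal
# C is now symmetric with ones on the diagonal
```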

Symptoms of the problem:

from matplotlib import pyplot as plt
lp = [model.logp(trace[i]) for i in range(0,200)]
plt.plot(lp)

and you will see the logp trace turn nearly vertical.

tr16 = triIndex(Nf)
cr = trace[-1]['C_triu'][tr16]
np.fill_diagonal(cr,1)

np.linalg.eig(cr)[0]

array([-0.09574   , -0.01150316,  0.22189297,  1.98412337,  1.87778831,
    0.47466721,  0.57414119,  1.71681892,  1.58021031,  1.45534505,
    1.33115075,  1.20560476,  0.79869473,  1.0230756 ,  0.91071542,
    0.95301458])

Some eigenvalues of the reconstructed correlation matrix are negative, which, I suspect, is not correct.
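The eigenvalue inspection above can be packaged as a quick validity check. This is a minimal numpy sketch (not part of pymc3): a valid correlation matrix must be symmetric, have a unit diagonal, and be positive semidefinite.

```python
import numpy as np

def is_valid_corr(C, tol=1e-12):
    """Check symmetry, unit diagonal, and nonnegative eigenvalues."""
    C = np.asarray(C)
    return bool(
        np.allclose(C, C.T)
        and np.allclose(np.diag(C), 1.0)
        and np.all(np.linalg.eigvalsh(C) >= -tol)
    )

good = np.array([[1.0, 0.5], [0.5, 1.0]])
bad = np.array([[1.0, 1.2], [1.2, 1.0]])   # eigenvalues 2.2 and -0.2
print(is_valid_corr(good), is_valid_corr(bad))   # True False
```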

model.logp() for LKJ happily calculates logp for all steps in the trace, except that for the last values the logp is ~1e156, and certainly not -np.inf as an invalid matrix should give.

theano.__version__ = '0.7.0.dev-RELEASE'
Mac OS, CUDA 7.5

$nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2015 NVIDIA Corporation
Built on Tue_Aug_11_15:14:46_CDT_2015
Cuda compilation tools, release 7.5, V7.5.17

Am I doing something wrong?


agoldin commented Nov 12, 2015

Looks like adding

all(gt(eigh(X)[0], 0))

to the return bound(...) call (just like in Wishart) solves the problem.
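The effect of that guard can be illustrated with a plain-numpy analogue of pymc3's `bound()` helper (a sketch, not the actual Theano code): the log-probability is only returned when every condition holds, and is -inf otherwise, so samples with non-positive-definite matrices are rejected.

```python
import numpy as np

def bound(logp, *conditions):
    """Numpy analogue of pymc3's dist_math.bound: return logp when all
    conditions hold elementwise, -inf otherwise."""
    ok = all(np.all(c) for c in conditions)
    return logp if ok else -np.inf

X = np.array([[1.0, 1.2], [1.2, 1.0]])   # not positive definite
print(bound(0.0, np.linalg.eigvalsh(X) > 0))   # -inf
```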

agoldin changed the title from "Possible bug in LJK ??" to "Possible bug in LKJ ??" Nov 12, 2015

agoldin commented Nov 13, 2015

However, when I try to run pm.guess_scaling(), I get this error:

/Users/alexey/anaconda/lib/python2.7/site-packages/theano/gradient.pyc in access_term_cache(node)
1087 str(g_shape))
1088
-> 1089 input_grads = node.op.grad(inputs, new_output_grads)
1090
1091 if input_grads is None:

AttributeError: 'EighGrad' object has no attribute 'grad'

which may or may not be related to my fix.


twiecki commented Nov 13, 2015

CC @kiudee


kiudee commented Nov 13, 2015

I just wanted to drop in to say that I will only be able to look at the problem next week.
It is certainly possible that we need to ensure that the matrix is positive semidefinite.
It could be useful to look at the implementation in Stan.

They apply the following checks to the matrix:
https://github.com/stan-dev/math/blob/master/stan/math/prim/mat/err/check_corr_matrix.hpp


agoldin commented Nov 13, 2015

Looks like they are doing a Cholesky decomposition and checking the diagonal; that is probably more efficient than computing eigenvalues.
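In numpy terms, the Cholesky-based test looks like this (a sketch of the idea, not Stan's or pymc3's code): `np.linalg.cholesky` raises `LinAlgError` on any matrix that is not positive definite, and a single factorization is cheaper than a full eigendecomposition.

```python
import numpy as np

def is_pos_def(C):
    """Positive-definiteness test via Cholesky factorization."""
    try:
        np.linalg.cholesky(C)
        return True
    except np.linalg.LinAlgError:
        return False

print(is_pos_def(np.eye(3)))                           # True
print(is_pos_def(np.array([[1.0, 1.2], [1.2, 1.0]])))  # False
```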


twiecki commented Nov 15, 2015

I implemented a Cholesky decomposition for the Wishart in #701; that might help.


twiecki commented Dec 22, 2015

@agoldin Any interest in taking a crack at this?


agoldin commented Jan 22, 2016

I apologize, I did not see the email about your comment and I was away.

I do not understand the internals of pymc3 very well. As I wrote before, adding

all(gt(eigh(X)[0], 0))

to the return bound(...) call (just like in Wishart) solves the problem. However, it breaks the derivatives of the likelihood. I need a positive-definiteness check that does not break the gradients.

If you point me where to dig, I might try. No guarantees I will succeed though.

Thanks!


twiecki commented Jan 25, 2016

I'm not sure why the gradient for eigh doesn't work; I think it should exist. Perhaps asking the Theano devs might help?


agoldin commented Jan 26, 2016

I'll try.


agoldin commented Feb 17, 2016

I think I fixed it. I'll just figure out how to do a pull request (give me a couple of days, I am fairly busy otherwise right now).


twiecki commented Feb 17, 2016

👍

fonnesbeck pushed a commit that referenced this issue Feb 23, 2016:
"check MCMC strays into neverland with negative eigenvalues for covariance matrix. This is not good at all (see #873)."
twiecki closed this as completed Apr 12, 2016