Metropolis fails to converge on simple model (PyMC 3) #358


Closed
fonnesbeck opened this issue Oct 9, 2013 · 14 comments

@fonnesbeck (Member)

I have attempted to replicate a very simple partial pooling model from Gelman and Hill in PyMC 3, but I find that the Metropolis step method fails to converge even after a large number of iterations. NUTS seems to handle it fine, so there appears to be an issue with Metropolis, perhaps related to adaptation?

Here is a link to a zip file containing the model and data.

@fonnesbeck (Member, Author)

I have verified that the equivalent model in PyMC 2 converges to reasonable values under Metropolis (using Gelman and Hill's result as the gold standard), so something appears to be up with the PyMC 3 implementation.

Edited: figured out the NUTS issue

@jsalvatier (Member)

Any update?

@ghost assigned fonnesbeck Dec 4, 2013
@fonnesbeck (Member, Author)

No, still a mystery.

@bjedwards (Contributor)

Does the model still work? I get an AsTensorError on line 49, when trying to call y_hat = a[county]. I am running Theano off the GitHub dev branch.

@aflaxman (Contributor)

aflaxman commented Dec 8, 2013

@bjedwards it was giving me this error also, and adding county = county.astype(int) before setting up the model made the error go away. I can confirm that Metropolis is not converging.
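A minimal reproduction of that workaround (the float values below are toy stand-ins for the county column read from the CSV):

```python
import numpy as np

# Columns read from a CSV often come back as floats, and Theano will not
# accept a float array as an index, hence the AsTensorError on a[county].
county = np.array([0., 1., 1., 2., 2., 2.])

# Cast to an integer dtype before building the model.
county = county.astype(int)

print(county.dtype.kind)  # prints 'i' (signed integer)
```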

[trace plot: Metropolis still not converging]

@fonnesbeck do you have a PyMC 2 version for comparison?

@aflaxman (Contributor)

Given long enough, it seems to work for me. I think I know what the issue is, but I'll know for sure once I see what PyMC2 is doing, per @fonnesbeck's comparison:

[trace plot: traces eventually settling, given enough iterations]

@fonnesbeck (Member, Author)

Sorry. Will have a go at this tonight.

@aflaxman (Contributor)

No worries. I suspected that the PyMC2 defaults would actually correspond more closely to a CompoundStep of Metropolis steps in PyMC3. However, that doesn't converge any faster:

with partial_pooling:
    steps = []
    steps.append(pm.Metropolis(vars=[mu_a], tune_interval=1000))
    steps.append(pm.Metropolis(vars=[sigma_a], tune_interval=1000))
    steps.append(pm.Metropolis(vars=[a], tune_interval=1000))
    steps.append(pm.Metropolis(vars=[sigma_y], tune_interval=1000))

    partial_pooling_samples = pm.sample(10000, pm.CompoundStep(steps))

[trace plot: CompoundStep traces, still converging slowly]

@fonnesbeck (Member, Author)

The following implementation in PyMC2 seems to work, using AdaptiveMetropolis for sampling the random intercepts:

from pymc import Normal, Uniform, Lambda
import numpy as np

def partial_pooling():
    # county (integer index per measurement) and log_radon come from
    # the Gelman & Hill radon data set

    # Priors
    mu_a = Normal('mu_a', mu=0., tau=0.0001, value=0)
    sigma_a = Uniform('sigma_a', lower=0, upper=100, value=1.)
    tau_a = sigma_a**-2

    # Random intercepts
    a = Normal('a', mu=mu_a, tau=tau_a, value=np.zeros(len(set(county))))

    # Model error
    sigma_y = Uniform('sigma_y', lower=0, upper=100)
    tau_y = sigma_y**-2

    # Expected value
    y_hat = Lambda('y_hat', lambda a=a: a[county])

    # Data likelihood
    y_like = Normal('y_like', mu=y_hat, tau=tau_y, value=log_radon, observed=True)

    return locals()

Going to go back and try again in PyMC3.

@twiecki (Member)

twiecki commented Aug 17, 2014

This works fine for me on this branch: #587. Please reopen if the problem persists.

@twiecki twiecki closed this as completed Aug 17, 2014
@lwahedi

lwahedi commented Mar 13, 2017

I ran into some trouble with this and wanted to post a note for later readers who are also having trouble. I made toy data for a fairly simple partial pooling model based on the one discussed here. NUTS converges on the true values in not that many iterations (a few hundred), but each iteration is very slow (I'm also having serious stalling issues with NUTS on my actual data). Metropolis, by contrast: for the first ten thousand or so iterations it settles over 0; at around 40k it is finally centered over the true value; at 100k it is still a flat-topped curve whose peak isn't over the true value, with credible intervals that still cross 0, and this is with a pretty large effect size in the toy data. At 150k it starts to tighten, but still isn't really there yet for all the parameters. Even so, 150k Metropolis iterations run faster in wall-clock time than a couple hundred NUTS iterations.
TL;DR: Metropolis needs a few hundred thousand iterations to converge on this model.
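For intuition on why random-walk Metropolis needs so many iterations here, a NumPy-only toy (not PyMC code): a strongly correlated 2-D Gaussian stands in for the correlated posterior over intercepts and hyperparameters, and an isotropic proposal must take tiny steps to get accepted, so the chain crawls along the ridge.

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy target: zero-mean 2-D Gaussian with correlation 0.99, a stand-in
# for the strongly correlated hierarchical posterior in this model.
rho = 0.99
prec = np.linalg.inv(np.array([[1.0, rho], [rho, 1.0]]))

def logp(x):
    return -0.5 * x @ prec @ x

x = np.zeros(2)
n_iter = 20000
accepted = 0
samples = np.empty((n_iter, 2))
for i in range(n_iter):
    proposal = x + rng.normal(scale=0.5, size=2)  # isotropic random walk
    if np.log(rng.uniform()) < logp(proposal) - logp(x):
        x = proposal
        accepted += 1
    samples[i] = x

accept_rate = accepted / n_iter
# Low acceptance plus tiny effective steps -> the running mean of the
# chain approaches the true value (0, 0) only very slowly.
```

Shrinking the proposal scale raises acceptance but shortens each move, which is the trade-off Metropolis tuning is fighting; NUTS sidesteps it by using gradients to move along the ridge.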

@twiecki (Member)

twiecki commented Mar 13, 2017

Thanks @lwahedi. Have you used ADVI initialization? I would also set tune=100 for NUTS. You can also investigate the sampler stats to identify difficult regions: http://pymc-devs.github.io/pymc3/notebooks/Diagnosing_biased_Inference_with_Divergences.html

@fonnesbeck (Member, Author)

@lwahedi If you just run trace = sample(2000) (for example), PyMC will automatically select NUTS and initialize it with ADVI.

@lwahedi

lwahedi commented Mar 14, 2017

Thanks for the tips. ADVI does make this particular model run much faster. It still stalls when I add a probit transformation on the outcome, and I've now learned that Metropolis crashes the kernel at somewhere between 30k and 50k iterations on the extended model too, so that wasn't the solution I'd hoped it would be. Either way, this is off topic for this issue; I have a StackExchange question with more detail here. It may be a specification issue; I'm not sure what's going on.
