Return posterior predictive samples from all chains in `ModelBuilder` #140

mbjoseph · 2023-04-13T16:23:43Z

This fixes a bug where only values from one chain were returned. It also refactors the prediction logic to reduce duplication. I've used arviz.extract() to get the posterior predictive samples as hinted by @twiecki in #139, removing the _extract_samples() method from the model builder class.

Behavior change proposal

This PR also includes a proposed behavior change that makes the output of predict_posterior() consistent in type and shape with the output of pymc.sample_posterior_predictive().

Namely, predict_posterior() would here return an xarray Dataset from the posterior predictive distribution, rather than a dictionary of numpy arrays. This is an admittedly opinionated change in behavior, though it does also streamline the code. From my perspective, retaining the metadata related to chains, draws, and other attributes is useful for three key reasons:

downstream processing for posterior predictive checks, enabling easier alignment of chains and draws with other pushforward quantities, and
tracking the provenance of predictions (e.g., creation time and package versions attributes) in a production environment, and
consistency with the output from pymc.sample_posterior_predictive(), which could make behavior easier to predict for users, and allow greater interoperability with other functions that process and plot posterior predictive samples

That said, I'm very open to other perspectives on how to bundle up posterior predictive samples.

This fixes a bug where only values from one chain were returned. It also refactors the prediction logic to reduce duplication, and makes the output of predict_posterior() consistent in type and shape with the output of pymc.sample_posterior_predictive().

michaelraczycki

Looks good, for the future reference please try not to just switch places of functions if it's not needed, as it makes the code changes less readable for PR

pymc_experimental/model_builder.py

mbjoseph · 2023-04-14T16:43:33Z

Thanks for the review @michaelraczycki. I've added combined as a named argument, tested behavior of combined=True and combined=False, and restored the original method order in 02faac7.

michaelraczycki

Looks good to me

twiecki · 2023-04-20T08:16:19Z

Thanks @mbjoseph and for the review @michaelraczycki!

…pymc-devs#140) * Return posterior predictive samples from all chains This fixes a bug where only values from one chain were returned. It also refactors the prediction logic to reduce duplication, and makes the output of predict_posterior() consistent in type and shape with the output of pymc.sample_posterior_predictive(). * keep attributes even when computing posterior means * Add/test combined arg, revert method order * Fix import order. --------- Co-authored-by: Max Joseph <[email protected]> Co-authored-by: Thomas Wiecki <[email protected]>

* docstrings update in model_builder.py * bringing back accidentally removed line from example * Return posterior predictive samples from all chains in `ModelBuilder` (#140) * Return posterior predictive samples from all chains This fixes a bug where only values from one chain were returned. It also refactors the prediction logic to reduce duplication, and makes the output of predict_posterior() consistent in type and shape with the output of pymc.sample_posterior_predictive(). * keep attributes even when computing posterior means * Add/test combined arg, revert method order * Fix import order. --------- Co-authored-by: Max Joseph <[email protected]> Co-authored-by: Thomas Wiecki <[email protected]> * fixing merge conflicts --------- Co-authored-by: Max Joseph <[email protected]> Co-authored-by: Max Joseph <[email protected]> Co-authored-by: Thomas Wiecki <[email protected]>

Max Joseph added 2 commits April 13, 2023 10:06

keep attributes even when computing posterior means

3eb5b35

twiecki requested a review from michaelraczycki April 13, 2023 19:29

michaelraczycki requested changes Apr 14, 2023

View reviewed changes

pymc_experimental/model_builder.py Outdated Show resolved Hide resolved

Add/test combined arg, revert method order

02faac7

twiecki previously approved these changes Apr 15, 2023

View reviewed changes

ricardoV94 added the bug Something isn't working label Apr 19, 2023

ricardoV94 changed the title ~~Return posterior predictive samples from all chains~~ Return posterior predictive samples from all chains in ModelBuilder Apr 19, 2023

michaelraczycki previously approved these changes Apr 20, 2023

View reviewed changes

Merge branch 'main' into main

b9eb478

twiecki dismissed stale reviews from michaelraczycki and themself via b9eb478 April 20, 2023 06:52

Fix import order.

9d49e74

michaelraczycki approved these changes Apr 20, 2023

View reviewed changes

twiecki merged commit b3be15f into pymc-devs:main Apr 20, 2023

mbjoseph mentioned this pull request Apr 20, 2023

ModelBuilder's predict_posterior returns draws from just one chain? #139

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return posterior predictive samples from all chains in `ModelBuilder` #140

Return posterior predictive samples from all chains in `ModelBuilder` #140

mbjoseph commented Apr 13, 2023 •

edited

Loading

michaelraczycki left a comment

mbjoseph commented Apr 14, 2023

michaelraczycki left a comment

twiecki commented Apr 20, 2023

Return posterior predictive samples from all chains in ModelBuilder #140

Return posterior predictive samples from all chains in ModelBuilder #140

Conversation

mbjoseph commented Apr 13, 2023 • edited Loading

michaelraczycki left a comment

Choose a reason for hiding this comment

mbjoseph commented Apr 14, 2023

michaelraczycki left a comment

Choose a reason for hiding this comment

twiecki commented Apr 20, 2023

Return posterior predictive samples from all chains in `ModelBuilder` #140

Return posterior predictive samples from all chains in `ModelBuilder` #140

mbjoseph commented Apr 13, 2023 •

edited

Loading