
Initialize a prior from a fitted posterior #56


Merged 23 commits into main on Jul 6, 2022
Conversation

ferrine
Member

@ferrine ferrine commented Jun 29, 2022

If you want to do knowledge transfer in a smart way, this is how you do it:

from pymc.distributions import transforms

with model1:
    trace = pm.sample()
    # trace.posterior.keys() ~ ["a", "b", "c", "d", "f", "g"]
    # a - vector
    # b - matrix
    # c - positive
    # d, f, g - some other variable we do not care about


with pm.Model(coords=dict(test=range(3))) as model:
    priors = pmx.utils.prior.prior_from_idata(
        trace, 
        var_names=["a"],
        b=("test", "test"),
        c=transforms.log, 
        d="e", 
        f=dict(dims="test"),
        g=dict(name="h", dims="test", transform=transforms.log)
    )
    # 0. do nothing special to 'a' and other items in "var_names"
    # 1. 'b' has coords ("test", "test")
    # 2. transform 'c' to logspace
    # 3. rename 'd' to 'e'
    # 4. say 'f' has coords 'test'
    # 5. do everything mentioned with 'g'
    # priors will be a dictionary with all the priors, variables are available by final name keys

@ricardoV94
Member

I don't see the prior being set. Is this still a draft? I assume you will use interpolation / kde?

@ferrine
Member Author

ferrine commented Jun 29, 2022

Yes, this is a draft. In the snippet, I've added the API to discuss. The API is permissive in the types it accepts and flexible, since it is easy to figure out the intention. I plan to approximate the prior using an MvNormal in the transformed space.

The prior is set inside the function, so you only get the dictionary with the final result
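The plan above (fit an MvNormal in the transformed space) can be sketched with plain NumPy, assuming a positive variable whose transform is log; all names here are illustrative, not the actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# stand-in for flattened posterior samples of a positive variable "c"
c_samples = rng.lognormal(mean=1.0, sigma=0.5, size=4000)

# move to log space (the transform for positive variables) and fit a normal
z = np.log(c_samples)
mu = np.atleast_1d(z.mean(axis=0))
cov = np.atleast_2d(np.cov(z))

# draw from the fitted normal and map back through exp to get prior draws
z_new = rng.multivariate_normal(mu, cov, size=4000)
c_prior = np.exp(z_new[:, 0])
```

With more than one variable, the same idea applies after concatenating all transformed variables into one vector, so the MvNormal also captures posterior correlations between them.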

@twiecki twiecki marked this pull request as draft June 29, 2022 18:36
@ferrine ferrine force-pushed the prior-from-posterior branch from ceb4d06 to 8d047fc on June 30, 2022 12:07
for key, cfg in kwargs.items():
data = posterior[key].values
# omitting chain, draw
shape = data.shape[2:]
Member

There is no guarantee the chain and draw dimensions will always be in the beginning, there are perfectly valid xarray operations that modify the dimension order. In xarray only the dimension name is relevant.

A quick change to the code to take this into account would be:

sample_dims = ["chain", "draw"]
for ...
    batch_dims = [dim for dim in posterior[key].dims if dim not in sample_dims]
    data = posterior[key].stack(__sample__=sample_dims, __batch__=batch_dims)
    end = begin + len(data["__batch__"])

I suspect it might even be possible to simplify this further using https://docs.xarray.dev/en/latest/generated/xarray.Dataset.to_stacked_array.html#xarray.Dataset.to_stacked_array plus a where to check the start and end positions of each variable. I can take a look towards the end of July if it were to still be helpful by then.
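The dimension-name-based stacking suggested above can be demonstrated with a small self-contained xarray example (the variable name and sizes are made up):

```python
import numpy as np
import xarray as xr

# a toy posterior variable with chain/draw plus one batch dimension
b = xr.DataArray(np.zeros((4, 100, 3)), dims=("chain", "draw", "dim0"))

# an equally valid view of the same data with a different dimension order
b_transposed = b.transpose("dim0", "draw", "chain")

# stack by dimension name, so the original axis order is irrelevant
sample_dims = ["chain", "draw"]
batch_dims = [d for d in b_transposed.dims if d not in sample_dims]
stacked = b_transposed.stack(__sample__=sample_dims, __batch__=batch_dims)
# stacked has sizes (__sample__=400, __batch__=3) regardless of input order
```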

@ferrine ferrine marked this pull request as ready for review July 1, 2022 18:37
@ferrine
Member Author

ferrine commented Jul 1, 2022

ready for review

@ferrine ferrine requested a review from OriolAbril July 3, 2022 06:44
@twiecki
Member

twiecki commented Jul 4, 2022

About the API: the most common case will be to transfer just a single parameter from posterior to prior, so I wonder what that API looks like, to make sure that we're not handling the general case well while handling the common case poorly.

@ferrine
Member Author

ferrine commented Jul 4, 2022

About the API: the most common case will be to transfer just a single parameter from posterior to prior, so I wonder what that API looks like, to make sure that we're not handling the general case well while handling the common case poorly.

  • A scalar variable without transform

with pm.Model(coords=dict(test=range(3))) as model:
    a = pmx.utils.prior.prior_from_idata(trace, var_names=["a"])["a"]

and if you rename

with pm.Model(coords=dict(test=range(3))) as model:
    b = pmx.utils.prior.prior_from_idata(trace, a="b")["b"]

  • A vector variable without transform, adding dims

with pm.Model(coords=dict(test=range(3))) as model:
    a = pmx.utils.prior.prior_from_idata(trace, a=("test",))["a"]

or

with pm.Model(coords=dict(test=range(3))) as model:
    a = pmx.utils.prior.prior_from_idata(trace, a=dict(dims=("test",)))["a"]

  • A vector variable with transform ("what's left from Dirichlet")

with pm.Model(coords=dict(test=range(3))) as model:
    a = pmx.utils.prior.prior_from_idata(trace, a=dict(dims=("test",), transform=transforms.simplex))["a"]

If we do not need coords

with pm.Model(coords=dict(test=range(3))) as model:
    a = pmx.utils.prior.prior_from_idata(trace, a=transforms.simplex)["a"]

... # set a name, assign a coord and apply simplex transform
... f=dict(name="new_f", dims="options", transform=transforms.simplex)
... )
... trace1 = pm.sample_prior_predictive(100)
Member

@OriolAbril OriolAbril Jul 5, 2022

Might be worth adding a note, or even the code, to use plot_pair to compare the obtained posterior to the generated prior. Even with an MvNormal and transforms, there might be cases where the posterior is not recovered correctly, and it will generally fail if the wrong transform is used. I'm not sure how aware users are of default transforms; I'd think the vast majority have no idea a transform is happening when they use half distributions, for example.

Note: regarding auto-use of default transforms. I think that arviz-devs/arviz#2056 plus a key code to map the strings in the attributes to common transforms will generally fix this issue.
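To illustrate why the wrong transform fails, here is a NumPy-only sketch (a numeric stand-in for the plot_pair comparison suggested above): a normal fitted to a skewed positive posterior without the log transform puts noticeable mass on impossible negative values, while fitting in log space keeps all draws valid.

```python
import numpy as np

rng = np.random.default_rng(1)
posterior = rng.lognormal(0.0, 1.0, size=5000)  # skewed, strictly positive

# fit a normal with no transform (the "wrong transform" case)
naive = rng.normal(posterior.mean(), posterior.std(), size=5000)

# fit in log space (the appropriate transform for a positive variable)
z = np.log(posterior)
good = np.exp(rng.normal(z.mean(), z.std(), size=5000))

# the naive prior assigns real probability mass to negative values,
# which the original posterior never produces
frac_negative = (naive < 0).mean()
```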

Member Author

I've added a brief explanation about this.

@twiecki
Member

twiecki commented Jul 5, 2022

@ferrine I like the API.

@ferrine ferrine merged commit dea6bc9 into main Jul 6, 2022
@ferrine ferrine deleted the prior-from-posterior branch July 6, 2022 05:59
@ferrine
Member Author

ferrine commented Jul 6, 2022

Time to merge then. Thanks for the reviews!
