Misc RandomVariable improvements #79
Conversation
This allows PyTensor to infer more broadcastable patterns by placing the casting inside the MakeVector Op
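(For illustration, a rough sketch of the idea; the variables and graphs are hypothetical, not code from this commit:)

```python
import pytensor.tensor as pt

# Casting the scalars before MakeVector assembles them keeps the vector's
# static length and element dtype visible to shape rewrites:
x = pt.iscalar("x")
y = pt.iscalar("y")
cast_inside = pt.stack([pt.cast(x, "float64"), pt.cast(y, "float64")])

# ...whereas casting the assembled vector wraps MakeVector in an extra
# Elemwise Cast node, which previously hid the broadcastable pattern:
cast_outside = pt.cast(pt.stack([x, y]), "float64")
```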
Force-pushed from a18e46a to 128531f
@ricardoV94 I unsubscribed - ping me when CI is green
99.2% confident it will be green this run @michaelosthege
Codecov Report
Additional details and impacted files

@@            Coverage Diff             @@
##             main      #79      +/-   ##
==========================================
+ Coverage   74.22%   74.26%   +0.03%
==========================================
  Files         174      175       +1
  Lines       48734    48929     +195
  Branches    10367    10395      +28
==========================================
+ Hits        36175    36335     +160
- Misses      10272    10291      +19
- Partials     2287     2303      +16
pytensor/tensor/random/op.py (Outdated)

@@ -319,8 +319,23 @@ def make_node(self, rng, size, dtype, *dist_params):
                 "The type of rng should be an instance of either RandomGeneratorType or RandomStateType"
             )

+        # Fail early when size is incompatible with parameters
+        size_len = get_vector_length(size)
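(As a hedged illustration of the kind of mismatch this check rejects, assuming the public `pytensor.tensor.random` API; the example is hypothetical:)

```python
import pytensor.tensor as pt
from pytensor.tensor.random import normal

# A (3, 2) batch of means cannot fit into a length-1 size, so with this
# change make_node can raise immediately instead of failing later in perform:
loc = pt.zeros((3, 2))
x = normal(loc, 1.0, size=(2,))  # expected to raise a ValueError early
```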
Perhaps I should move this to `infer_shape`. If for some reason an RV needs different batching semantics it can override it?
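(A minimal sketch of what such an override could look like, assuming the current `RandomVariable` Op interface; the class and its attributes are hypothetical:)

```python
from pytensor.tensor.random.op import RandomVariable

class MyRV(RandomVariable):
    # Illustrative attributes for a scalar RV with one scalar parameter
    name = "my_rv"
    ndim_supp = 0
    ndims_params = [0]
    dtype = "floatX"

    @classmethod
    def rng_fn(cls, rng, loc, size):
        return rng.normal(loc, size=size)

    def infer_shape(self, fgraph, node, input_shapes):
        # An RV with unusual batching semantics could compute its own
        # output shape here instead of the default broadcasting logic
        return super().infer_shape(fgraph, node, input_shapes)
```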
Can you come up with an example?
One of the zero-sum RVs maybe?
If not, I'd say apply the YAGNI rule.
There was already a case with the ChoiceRV and PermutationRV Ops, which need special handling for the shape.
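(For context, a hedged sketch of why those Ops are special, using the public `pytensor.tensor.random` functions:)

```python
import numpy as np
from pytensor.tensor.random import choice, permutation

# The support shape of these RVs depends on the *value* of `a`, not just
# on broadcasting the distribution parameters against `size`:
perm = permutation(np.arange(5))         # one permutation of length 5
draws = choice(np.arange(5), size=(3,))  # three draws from `a`
```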
Can't say I understand everything, but I noticed nothing implausible either.
… parameters dimensionality

* The rewrite no longer bails out when the DimShuffle affects both unique param dimensions and repeated param dimensions from the size argument. This requires:
  1) Adding broadcastable dimensions to the parameters, which should be "cost-free" and would need to be done in the `perform` method anyway.
  2) Extending size to incorporate implicit batch dimensions coming from the parameters, which requires computing the shape resulting from broadcasting the parameters. It's unclear whether this is less performant, because the `perform` method can now simply broadcast each parameter to the size, instead of having to broadcast the parameters together.
* The rewrite now works with multivariate RVs.
* The rewrite bails out when dimensions are dropped by the DimShuffle. This case was not correctly handled by the previous rewrite.
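(A rough sketch of a graph the extended rewrite can now lift, with hypothetical shapes:)

```python
import pytensor.tensor as pt
from pytensor.tensor.random import normal

loc = pt.matrix("loc")       # say, shape (3, 2)
x = normal(loc)              # RV draw with batch shape (3, 2)
y = x.dimshuffle("x", 0, 1)  # add a leading broadcastable dimension

# After local_dimshuffle_rv_lift the graph looks roughly like
#   normal(loc.dimshuffle("x", 0, 1))
# i.e. the DimShuffle moves onto the parameters, keeping the
# RandomVariable at the top of the graph.
```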
Force-pushed from 128531f to e59ddb7
This PR fixes a couple of edge cases related to RandomVariable static shape inference, and simplifies and extends the `local_dimshuffle_rv_lift` rewrite. Allowing the rewrite to apply in more cases increases the range of graphs for which we can infer the logprob in PyMC: https://github.com/pymc-devs/pymc/blob/a0d6ba079eac2f044ed40cc5747f1079d99f9f16/pymc/logprob/tensor.py#L288-L290
Closes #60 and makes progress on #49.