Statespace: Don't automatically save statespace matrices as `Deterministic` variables #302

jessegrabowski · 2024-02-05T00:25:34Z

I had originally saved the statespace matrices as `Deterministic` variables to facilitate out-of-sample sampling tasks, so that I could do e.g. `pm.Flat('T', T)`. The result was that all matrices were saved to the idata, creating pretty horrible `pm.model_to_graphviz` outputs like this:

This was also extremely wasteful of memory. Many of the matrices are not random at all, yet each one was being saved once per draw, i.e. chain × draw times.
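To put a rough number on that, here is a back-of-the-envelope sketch in plain Python. The matrix shapes are an assumption (the usual time-invariant layout with `k_states`, `k_endog` and `k_posdef`), and the helper names are made up for illustration; none of this is code from the PR.

```python
from math import prod


def matrix_shapes(k_states, k_endog, k_posdef):
    """Assumed shapes of the nine statespace matrices (time-invariant case)."""
    return {
        "a0": (k_states,),
        "P0": (k_states, k_states),
        "c": (k_states,),
        "d": (k_endog,),
        "T": (k_states, k_states),
        "Z": (k_endog, k_states),
        "R": (k_states, k_posdef),
        "H": (k_endog, k_endog),
        "Q": (k_posdef, k_posdef),
    }


def bytes_per_copy(shapes, itemsize=8):
    """Bytes needed to hold one copy of every matrix (float64 by default)."""
    return sum(prod(shape) for shape in shapes.values()) * itemsize


def bytes_as_deterministics(shapes, chains, draws, itemsize=8):
    """Bytes needed when every matrix is stored once per (chain, draw)."""
    return bytes_per_copy(shapes, itemsize) * chains * draws


shapes = matrix_shapes(k_states=10, k_endog=1, k_posdef=10)
once = bytes_per_copy(shapes)
per_draw = bytes_as_deterministics(shapes, chains=4, draws=1000)

print(f"stored once:     {once} bytes")
print(f"stored per draw: {per_draw / 1e6:.1f} MB")
```

Even for this small hypothetical model the per-draw copy is several thousand times larger than a single copy, and the gap grows with the square of the number of states.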

After this refactor, the matrices are dynamically rebuilt as needed from the parameter samples. The new graphs look like this:

There are also some en-passant changes to how exogenous data are handled that break the Structural example notebook. I will open a new PR after this one to address that, because I think I finally have it set up to handle forecasting with exogenous data. Basically, I was previously treating exogenous data like a type of "parameter". It's been upgraded to a first-class object, and custom models that subclass `PyMCStateSpace` and use exogenous data will now need to implement the `data_names` and `data_info` properties.

pymc_experimental/statespace/core/statespace.py

ricardoV94 · 2024-02-05T09:27:23Z

For the memory question, matrices that don't change could be defined as `ConstantData` or `MutableData`.

jessegrabowski · 2024-02-05T09:44:34Z

They're stored as TensorVariables in the statespace representation. This new way of doing this will directly use those once instead of copying them over and over into the idata.

ricardoV94 · 2024-02-05T09:46:09Z

> They're stored as TensorVariables in the statespace representation. This new way of doing this will directly use those once instead of copying them over and over into the idata.

ConstantData and MutableData are also stored only once in the idata as opposed to Deterministics which are stored per draw. Seems like that's what you wanted?

jessegrabowski · 2024-02-05T09:53:41Z

Something like that, but I don't want to have a bunch of logic to decide if a matrix is static or contains parameters. If the user really wants to inspect matrices, he can ask for them manually and save them however he wants. In general, I think the important outputs for most users are going to be the parameters and the states. The rest can be more hidden away.
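As a library-free illustration of "rebuild the matrix from the parameter samples when someone asks for it", here is a minimal sketch. It assumes an AR(2) model in companion form; the names `ar2_transition`, `rebuild_transition`, `rho_1` and `rho_2` are hypothetical and are not part of `PyMCStateSpace`.

```python
def ar2_transition(rho_1, rho_2):
    """Companion-form transition matrix of an AR(2) process."""
    return [
        [rho_1, rho_2],
        [1.0, 0.0],
    ]


# Stand-in for posterior samples, indexed as [chain][draw].
# Only the parameters are stored; the matrix itself never is.
posterior = {
    "rho_1": [[0.5, 0.4], [0.6, 0.55]],
    "rho_2": [[-0.1, -0.2], [-0.3, -0.25]],
}


def rebuild_transition(posterior, chain, draw):
    """Rebuild T for a single (chain, draw) from the stored parameters."""
    return ar2_transition(
        posterior["rho_1"][chain][draw],
        posterior["rho_2"][chain][draw],
    )


T = rebuild_transition(posterior, chain=0, draw=1)
print(T)
```

The static entries of the matrix (the `1.0` and `0.0` here) live only in the builder function, so there is no need for logic that decides which matrices are constant and which depend on parameters.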

ricardoV94

Just a tiny type-hint thing

ricardoV94 · 2024-02-06T10:25:57Z

pymc_experimental/statespace/core/statespace.py

@@ -291,7 +314,7 @@ def _unpack_statespace_with_placeholders(self) -> Tuple:

        return a0, P0, c, d, T, Z, R, H, Q

-    def unpack_statespace(self) -> Tuple:
+    def unpack_statespace(self) -> list[pt.TensorVariable, ...]:


List doesn't need ellipsis

Suggested change

-    def unpack_statespace(self) -> list[pt.TensorVariable, ...]:
+    def unpack_statespace(self) -> list[pt.TensorVariable]:

jessegrabowski · 2024-02-11

PyCharm actually complains when I make this change. It wants me to literally list out `tuple[TensorVariable, TensorVariable, TensorVariable, TensorVariable, TensorVariable, TensorVariable, TensorVariable, TensorVariable, TensorVariable]`, which seems silly. `list[TensorVariable, ...]` was my hack-around.

ricardoV94 · 2024-02-11

Ignore PyCharm
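For reference, a small sketch of the two annotation forms under discussion. The ellipsis form for a variable-length sequence is specific to `tuple`; `list` takes a single type parameter. If the function body really returns a tuple, which is an assumption about why the IDE complained, `tuple[TensorVariable, ...]` avoids spelling out all nine entries. The `TensorVariable` class below is a stand-in so the snippet runs without PyTensor.

```python
from typing import get_args, get_type_hints


class TensorVariable:
    """Hypothetical stand-in for pt.TensorVariable."""


def unpack_as_list() -> list[TensorVariable]:
    # Variable-length list: one type parameter, no ellipsis.
    return [TensorVariable() for _ in range(9)]


def unpack_as_tuple() -> tuple[TensorVariable, ...]:
    # Variable-length homogeneous tuple: the ellipsis form is tuple-only.
    return tuple(TensorVariable() for _ in range(9))


list_args = get_args(get_type_hints(unpack_as_list)["return"])
tuple_args = get_args(get_type_hints(unpack_as_tuple)["return"])

print(list_args)
print(tuple_args)
```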

jessegrabowski · 2024-02-11T02:19:07Z

What's up with these jax test failures on the Ubuntu CI?

ricardoV94 · 2024-02-11T08:08:15Z

> What's up with these jax test failures on the Ubuntu CI?

#305 ?

jessegrabowski added the enhancements, major, and statespace labels Feb 5, 2024

jessegrabowski requested a review from ricardoV94 February 5, 2024 00:25

jessegrabowski force-pushed the clean-graph branch 2 times, most recently from 0c806d5 to 6ac1d0d Compare February 5, 2024 23:46

ricardoV94 approved these changes Feb 6, 2024

jessegrabowski force-pushed the clean-graph branch from 93265aa to 97c3ab8 Compare February 12, 2024 13:33

Don't add statespace matrices or outputs to PyMC graph by default.

eb44c83

ricardoV94 force-pushed the clean-graph branch from 97c3ab8 to eb44c83 Compare February 13, 2024 07:24

ricardoV94 merged commit 015ba1f into pymc-devs:main Feb 13, 2024

jessegrabowski deleted the clean-graph branch February 20, 2024 16:38

jessegrabowski mentioned this pull request Apr 9, 2024

Bug fixes for statespace #326

Merged
