Reparam vs reinforce

Pawel · September 10, 2025, 9:08am

Hi, I’ve just read the series of “SVI Pat I-IV” articles at pyro.ai/examples. Part III in particular talks about the reparam and reinforce options of computing the gradients wrt to the posterior’s params. I’d like to ask:

How does pyro (and numpyro) know in practice when to reparametrize?
I’ve noticed that the inbuilt distributions in numpyro have the `reparametrized_params` attribute (not so in pyro?) But still, it would seem to me that in order to reparametrize, it is not enough to just know that a given distribution is reparametrizable (wrt to a given parameter), but that one would also need an explicit transformation, the `g` function using the notation from the article along with the new posterior `q_\epsilon`?
If I implement a custom distribution (with logprob and sample methods) how do I make it reparametrizable in pyro/numpyro (assuming I have a concrete reparametrization in mind)?

Pawel · September 17, 2025, 7:33pm

Did I missunderstand things so terribly that my questions don’t make any sense? If that’s the case, please let me know as well

fehiepsi · November 26, 2025, 12:31pm

Hi @Pawel, reparametrized distributions are determined via has_rsample attribute. You can implement rsample method. You might also be interested in using transformed distribution.