I’m working on a small PR (https://github.com/pyro-ppl/pyro/pull/2723) for the SVI tutorial, and I was wondering about “dynamic optimizers” mentioned in the tutorial.
Can someone explain what kinds of models would require the optimizer to be dynamic? And why is this different for Pyro (as opposed to PyTorch)?
My guess is that this is because parameters in Pyro are tied to your latent variables, so the number of parameters can depend on the data: for example, a plate model may need more of them as the data grows. But it's just a guess.
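To make my guess concrete, here is a toy, dependency-free sketch of what I imagine "dynamic" could mean. None of these names (`param_store`, `param`, `LazyMomentumSGD`, `grads_for`) are real Pyro or PyTorch APIs; they are hypothetical stand-ins, and I don't know whether this matches what Pyro actually does internally.

```python
# Toy sketch only: every name here is hypothetical, not a Pyro or PyTorch API.

param_store = {}  # stand-in for a global store of named parameters


def param(name, init_value):
    """Return the named parameter, creating it the first time it is requested."""
    if name not in param_store:
        param_store[name] = init_value
    return param_store[name]


class LazyMomentumSGD:
    """SGD with momentum that creates per-parameter state on first sight."""

    def __init__(self, lr=0.1, momentum=0.9):
        self.lr = lr
        self.momentum = momentum
        self.velocity = {}  # per-parameter state, filled in lazily

    def step(self, grads):
        for name, grad in grads.items():
            # State for a newly discovered parameter is set up here,
            # not in __init__, because the parameter did not exist yet.
            v = self.momentum * self.velocity.get(name, 0.0) + grad
            self.velocity[name] = v
            param_store[name] -= self.lr * v


def grads_for(data):
    """Gradients of sum_i (loc_i - x_i)**2, with one local parameter per data point."""
    grads = {}
    for i, x in enumerate(data):
        loc = param(f"loc_{i}", 0.0)  # created on first use
        grads[f"loc_{i}"] = 2.0 * (loc - x)
    return grads


optimizer = LazyMomentumSGD()
optimizer.step(grads_for([1.0, 2.0]))       # two parameters exist after this step
optimizer.step(grads_for([1.0, 2.0, 3.0]))  # a third appears with the new data point
```

Here the optimizer is built before any parameter exists, which a plain PyTorch optimizer would not allow, since it expects its parameters at construction time. Is that the situation the tutorial has in mind?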
Thanks for your time.