About the Gradient Calculation of TraceELBO

ZQLIU · October 2, 2022, 3:18am

I’m reading the source codes for TraceELBO class. However, I do not fully understand its calculation for gradients for non-reparametrical cases.

The codes inside the blue frame are to compute the surrogate loss particle for guide trace. And the codes inside the red frame are the modifications in each iteration. However, I notice that in the REINFORCE estimator, there is another term in the surrogate loss, shown in the red frame in the second figure. (I mean, is there a need to to add an extra term score_function_term in the red frame in first figure?)

How does TraceELBO take account to this term? Thanks!

martinjankowiak · October 9, 2022, 10:04pm

well for example in the case of the elbo “f” contains terms like log q(z), which is taken care of in the entropy_term