Hi~
It seems that mask sets the log prob of the masked sample site to 0, while block hides the sample site entirely.
So I believe block should be used when we don’t want to infer a sample site, but what is the intended use of mask? If the log prob is fixed to 0, the inference algorithm can’t infer the posterior for that site either, so why distinguish between the two?
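To make the question concrete, here is a toy sketch of the two behaviours as I understand them. It is plain Python and not the library's actual implementation: record_trace, normal_logpdf, and the site names x and y are all hypothetical names made up for illustration.

```python
import math


def normal_logpdf(x, loc=0.0, scale=1.0):
    """Log density of a Normal(loc, scale) distribution at x."""
    z = (x - loc) / scale
    return -0.5 * z * z - math.log(scale) - 0.5 * math.log(2 * math.pi)


def record_trace(values, masked=(), blocked=()):
    """Toy stand-in for a tracing handler (assumption, not the real API).

    Returns a dict mapping each visible site name to its log prob.
    """
    trace = {}
    for name, value in values.items():
        if name in blocked:
            # Blocked: the site never shows up in the recorded trace.
            continue
        log_prob = normal_logpdf(value)
        if name in masked:
            # Masked: the site is still recorded, but its log prob is 0.
            log_prob = 0.0
        trace[name] = log_prob
    return trace


values = {"x": 0.5, "y": -1.0}
masked_trace = record_trace(values, masked={"x"})
blocked_trace = record_trace(values, blocked={"x"})

print("masked: ", masked_trace)
print("blocked:", blocked_trace)
```

In this toy version the total log prob is the same in both cases. The only difference is whether x still appears in the trace, which is exactly the distinction I am asking about.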
Thanks!