A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

That would probably be pretty straightforward. Note, though, that Pyro doesn't currently have any support for analytic KL divergences, so the simplest implementation of something like this in Pyro would do without them and estimate the KL term from samples instead.
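To make "doing without analytic KL" concrete, here is a minimal sketch in plain Python (deliberately not Pyro code, and not taken from the paper). It compares the closed-form KL divergence between two univariate Gaussians with a Monte Carlo estimate built only from sampled log-densities, which is roughly what a sampling-based ELBO estimator ends up doing. The function names (`log_normal_pdf`, `analytic_kl`, `sampled_kl`) and the example parameters are made up for illustration.

```python
import math
import random


def log_normal_pdf(x, mu, sigma):
    """Log-density of a univariate Normal(mu, sigma) evaluated at x."""
    return (-0.5 * math.log(2.0 * math.pi)
            - math.log(sigma)
            - 0.5 * ((x - mu) / sigma) ** 2)


def analytic_kl(mu_q, sigma_q, mu_p, sigma_p):
    """Closed-form KL(q || p) for two univariate Gaussians."""
    return (math.log(sigma_p / sigma_q)
            + (sigma_q ** 2 + (mu_q - mu_p) ** 2) / (2.0 * sigma_p ** 2)
            - 0.5)


def sampled_kl(mu_q, sigma_q, mu_p, sigma_p, num_samples=100_000, seed=0):
    """Monte Carlo estimate of KL(q || p): average log q(z) - log p(z) over z ~ q."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_samples):
        z = rng.gauss(mu_q, sigma_q)  # draw z from the approximate posterior q
        total += log_normal_pdf(z, mu_q, sigma_q) - log_normal_pdf(z, mu_p, sigma_p)
    return total / num_samples


if __name__ == "__main__":
    # Hypothetical posterior q = N(0.5, 0.8) and prior p = N(0, 1).
    print("analytic:", analytic_kl(0.5, 0.8, 0.0, 1.0))
    print("sampled: ", sampled_kl(0.5, 0.8, 0.0, 1.0))
```

The sampled estimate is unbiased but noisy, so the gradient estimates built on it have higher variance than they would with the analytic term. That is the trade-off you accept in the simplest implementation.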