Hello! I'm attempting to write an open-source Python implementation of https://arxiv.org/abs/1502.07257, and I'm trying to figure out whether Pyro is the right tool for it. The paper uses the stick-breaking process and Stochastic Variational Inference to learn word vectors that are disambiguated by word sense. Based on my searching, Pyro seems to offer a toolset that would make my implementation cleaner and easier to understand.
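To make the stick-breaking part concrete, here is a minimal plain-Python sketch of my understanding of it. It is not Pyro code and it is not taken from the AdaGram.jl source. The function name `stick_breaking_weights`, the parameter names, and the values `alpha=0.1` and `num_senses=5` are my own placeholders, and I am assuming the stick is truncated at a fixed number of senses, with the last sense absorbing whatever is left.

```python
import random


def stick_breaking_weights(alpha, num_senses, rng):
    """Draw truncated stick-breaking prior weights over the senses of one word.

    Each break fraction is drawn from Beta(1, alpha); sense k receives that
    fraction of whatever stick length is still unassigned.
    """
    remaining = 1.0  # length of the stick not yet assigned to any sense
    weights = []
    for _ in range(num_senses - 1):
        beta = rng.betavariate(1.0, alpha)
        weights.append(beta * remaining)
        remaining *= 1.0 - beta
    # Truncation (an assumption of this sketch): the last sense takes the rest,
    # so the weights sum to one.
    weights.append(remaining)
    return weights


rng = random.Random(0)  # fixed seed only so the example is repeatable
weights = stick_breaking_weights(alpha=0.1, num_senses=5, rng=rng)
print(weights)
```

With a small `alpha`, most of the mass tends to land on the first few senses, which is how I read the paper's preference for few senses per word.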
However, I know relatively little about probabilistic programming, and I'm having a hard time understanding how Pyro's model and guide concepts connect to the learning process in the Julia implementation from the paper: https://github.com/sbos/AdaGram.jl/blob/master/src/gradient.jl
Is Pyro a good tool for this task, or am I misunderstanding the goals of the project?
Thanks for any help and insights you may provide.