In the given code with the polyphonic music dataset, what exactly are they achieving by reducing the NLL?
a. Are they predicting any future notes of the piano? It doesn’t look like that to me.
b. What does the latent variable z signify for this dataset?
c. Can I tweak this code to make predictions for any other dataset?
Why the z_dim is chosen as 100? Generally, if my final result is a single value(for eg: 130 or 200) instead of a vector at any timestep, can I keep z_dim as 1? Or they are different things?
From the research paper that is mentioned in this link, even for the Health dataset(where the z is the patient’s health state at any time point), z dim is around 100? Why is it so?
Is z_0 defined in the init section of the DMM class, the prior? If I know my result is going to be within a range of values(for eg: 100 and 200), where can I give that information in this code?
Thanks in advance.