Hi,
I have a rather general question regarding missing values in Edward/Edward2. In biostatistical applications, data are often missing. A standard approach is (multiple) imputation of the missing values before moving on to the actual analysis. This sits awkwardly with using Edward, since the models used for imputation still need to fit in memory and are limited to a few standard implementations (e.g. MICE package in R). A more principled approach would be to specify a joint model of observed and missing data in which the missing values are treated as latent variables. As far as I understand, this is already possible but requires tedious splicing/combination of parameters and data (cf. Stan user manual on missing values). Even for low-dimensional models, this bookkeeping quickly becomes almost unmanageable in practice.

Is there currently any way of specifying distributions for ‘partially observed tensors’ (cf. discussion https://github.com/greta-dev/greta/issues/117), or are there plans for implementing something like it? I feel that this is extremely important in practice, as it would be a pity to still be forced to run standard (multiple) imputation before a final analysis with Edward. After all, whenever missingness becomes a real issue, the ability to model it properly is essential.
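To make the splicing I mean concrete, here is a minimal plain-Python sketch. It is not Edward code: the helper names `split_indices`, `splice` and `normal_log_density` are made up for illustration, and the independent Normal(1, 1) model is an arbitrary assumption. The missing entries are kept as separate latent values and have to be recombined with the observed data by index before the joint density can be evaluated.

```python
import math

def split_indices(y_raw):
    """Return the positions of observed and missing (NaN) entries."""
    obs_idx = [i for i, v in enumerate(y_raw) if not math.isnan(v)]
    mis_idx = [i for i, v in enumerate(y_raw) if math.isnan(v)]
    return obs_idx, mis_idx

def splice(y_obs, y_mis, obs_idx, mis_idx):
    """Recombine observed data and latent missing values into one full vector."""
    y = [0.0] * (len(obs_idx) + len(mis_idx))
    for i, v in zip(obs_idx, y_obs):
        y[i] = v
    for i, v in zip(mis_idx, y_mis):
        y[i] = v
    return y

def normal_log_density(y, mu, sigma):
    """Sum of independent Normal(mu, sigma) log densities over all entries of y."""
    return sum(
        -0.5 * math.log(2.0 * math.pi * sigma ** 2) - 0.5 * ((v - mu) / sigma) ** 2
        for v in y
    )

# Toy data: NaN marks a missing value.
y_raw = [1.2, float("nan"), 0.7, float("nan"), 2.1]
obs_idx, mis_idx = split_indices(y_raw)
y_obs = [y_raw[i] for i in obs_idx]

# Placeholder latent values; an inference algorithm would update these.
y_mis = [0.0] * len(mis_idx)

# The joint density is defined on the spliced vector, not on y_obs alone.
y_full = splice(y_obs, y_mis, obs_idx, mis_idx)
log_p = normal_log_density(y_full, mu=1.0, sigma=1.0)
```

With a single vector this is manageable, but every additional partially observed variable needs its own index sets and its own splice step, which is the part I would like the library to handle.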
It would also be interesting to see whether a fully conditional specification (https://www.tandfonline.com/doi/abs/10.1080/10629360600810434) would be possible with Edward, given that such a specification does not necessarily correspond to a valid joint model of the data.
Best,
Kevin