Renyi divergence variational inference

Thanks for the very detailed reply.

Yes, that makes sense. I’ll change that.

None. It was just to avoid hitting a special case by default while testing, in case I was not feeding the parameter alpha correctly. If I were to release the code, I would set a default of alpha=1. to get standard VI by default.

I need to do the math too, but yes, that is my intuition on this. alpha → \infty corresponds to zero-forcing behavior. In that case it would make sense to learn from the sample with the smallest loss (hand waving here; I’ll do the math).
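As a minimal sketch of that limit (not the code under discussion), the snippet below evaluates a Monte Carlo estimate of the Rényi bound, (1 / (1 - alpha)) * log mean(w_k^(1 - alpha)), for a handful of log importance weights. The function name `renyi_bound_estimate` and the values in `log_w` are made up for illustration, and it assumes each log weight is log p(x, z_k) - log q(z_k) with z_k drawn from q. Under those assumptions, alpha=1. reduces to the usual ELBO estimate, and a large alpha pushes the estimate toward the smallest log weight in the batch; how that maps onto "smallest loss" depends on the sign convention used for the loss.

```python
import math


def renyi_bound_estimate(log_weights, alpha):
    """Monte Carlo estimate of the Renyi bound from K log importance weights.

    Assumption: log_weights[k] = log p(x, z_k) - log q(z_k), with z_k ~ q.
    """
    k = len(log_weights)
    if alpha == 1.0:
        # alpha -> 1 limit: mean log weight, i.e. the standard ELBO estimate
        return sum(log_weights) / k
    scaled = [(1.0 - alpha) * lw for lw in log_weights]
    m = max(scaled)  # shift before exponentiating, for numerical stability
    log_mean_exp = m + math.log(sum(math.exp(s - m) for s in scaled)) - math.log(k)
    return log_mean_exp / (1.0 - alpha)


log_w = [-3.2, -1.0, -2.5, -0.7]  # hypothetical log weights, illustration only
for a in (0.0, 0.5, 1.0, 10.0, 1000.0):
    print(a, renyi_bound_estimate(log_w, a))
```

With these numbers the estimate decreases as alpha grows and sits close to min(log_w) at alpha=1000, which is the kind of behavior a VR-min variant would make exact by using only that one sample.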

True. I’ll remove that.

Again true.

Sure, why not. Let me clean up the code a bit and do the math for VR-min, and then I’ll open a request.

Informative pointer, thanks.
