Streamed batch training with manual update - scaling considerations

To some degree, yes. The scale argument is generally used to scale any computation with respect to the random variables; for example, you might use it for masking.
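
As a minimal sketch (hypothetical toy model and variable names, assuming Edward 1.x with TensorFlow 1.x), scale is passed as a dictionary at initialization; here it re-weights a minibatch likelihood, and a 0/1 tensor in the same place would act as a mask:

```python
import tensorflow as tf
import edward as ed
from edward.models import Normal

N = 10000  # assumed total number of data points
M = 100    # assumed minibatch size

# Toy model: a shared mean with Gaussian observations.
mu = Normal(loc=0.0, scale=1.0)
x = Normal(loc=tf.ones(M) * mu, scale=1.0)

# Variational approximation to mu.
qmu = Normal(loc=tf.Variable(0.0),
             scale=tf.nn.softplus(tf.Variable(0.0)))

x_ph = tf.placeholder(tf.float32, [M])
inference = ed.KLqp({mu: qmu}, data={x: x_ph})
# scale multiplies x's log-likelihood; N / M corrects for minibatching,
# while a 0/1 tensor of shape [M] would mask individual observations.
inference.initialize(scale={x: float(N) / M})
```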

n_samples is an algorithm hyperparameter in ed.KLqp: the number of samples drawn from the approximating distribution to estimate the gradient of the loss function. It is relevant whenever you run the algorithm.
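
Continuing the sketch above, n_samples is also passed to initialize(); larger values give lower-variance but more expensive gradient estimates (5 below is an arbitrary choice):

```python
# n_samples draws from q per iteration form the Monte Carlo estimate
# of the ELBO gradient.
inference = ed.KLqp({mu: qmu}, data={x: x_ph})
inference.initialize(n_samples=5, n_iter=1000)
```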

That would only work if, after each streaming batch, you reset the prior distribution to be the inferred posterior from that batch. Otherwise, imagine you had a billion streaming points with a batch size of 1: without scaling the likelihood by a billion, the prior overwhelms the likelihood, so the posterior will not differ much from the prior.
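
Here is a sketch of what that scaling looks like with manual updates on a stream (hypothetical model, synthetic stand-in for the data stream, Edward 1.x): the minibatch likelihood is scaled by N / M so the prior does not dominate.

```python
import numpy as np
import tensorflow as tf
import edward as ed
from edward.models import Normal

N = 1000000000  # assumed (approximate) number of streaming points
M = 1           # batch size

mu = Normal(loc=0.0, scale=1.0)
x = Normal(loc=tf.ones(M) * mu, scale=1.0)
qmu = Normal(loc=tf.Variable(0.0),
             scale=tf.nn.softplus(tf.Variable(0.0)))

def next_batch(size):
    # Stand-in for the real data stream: synthetic draws around 3.0.
    return np.random.normal(loc=3.0, scale=1.0, size=size).astype(np.float32)

x_ph = tf.placeholder(tf.float32, [M])
inference = ed.KLqp({mu: qmu}, data={x: x_ph})
# Without scale={x: N / M}, each tiny batch's likelihood would be
# swamped by the prior and the posterior would barely move.
inference.initialize(scale={x: float(N) / M}, n_iter=1000)

sess = ed.get_session()
tf.global_variables_initializer().run()
for _ in range(inference.n_iter):
    info_dict = inference.update(feed_dict={x_ph: next_batch(M)})
    inference.print_progress(info_dict)
```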

The discussion in Iterative estimators ("bayes filters") in Edward? - #2 by dustin provides the most detail.