Streamed batch training with manual update - scaling considerations

MushroomHunting · July 26, 2017, 11:24am

Hi there

N/M

Like the two related posts up the top i’m interested in running VI in a streaming context. Regarding the initialisation of ed.KLqp I have a few confusions

Is the scale parameter only relevant if one uses inference.run at first with N > M training data points.
Is n_samples only relevant if one uses an inference. run. call? I.e. if from the beginning if you have a large amount of data to train first on, and then the model is exposed to completely new data in a streaming situation. If not where does this fit in
In the case of having no data initially, and then acquiring data accumulating into buffers of size M, is it enough to simply have N=M giving a scale of 1 ?

cheers!

dustin · August 11, 2017, 4:03pm

To some degree, yes. It’s used generally to scale any computation with respect to the random variables. For example, you might use it for masking.

n_samples is an algorithm hyperparameter in ed.KLqp, representing the number of samples to estimate the gradient of the loss function. It is always relevant whenever running the algorithm.

That would only work if after each streaming batch, you re-set the prior distribution to be the inferred posterior from the batch. Otherwise, imagine you had a billion streaming points, with a batch size of 1; without scaling the likelihood by 1 million, the prior overwhelms the likelihood so the posterior will not differ much from the prior.

The discussion in Iterative estimators ("bayes filters") in Edward? - #2 by dustin provides most detail.

MushroomHunting · August 12, 2017, 12:31am

thanks Dustin, that clears things up

Topic		Replies	Views
Scaling batch inference when batch size varies	0	640	June 8, 2018
Updating non-trainable variables used in batch inference .update() call	0	629	March 11, 2018
Varying mini-batch-size, n_samples and learning rate during training	0	640	March 20, 2018
Re-using models/inferences for several independent fits	1	782	July 1, 2017
Biased variance estimates when using KLqp for Bayesian Linear Regression	7	1296	February 9, 2018

Streamed batch training with manual update - scaling considerations

Related Topics