L2 regularization of weights

nadheesh · August 7, 2018, 4:38am

I’m trying to understand how can we use the regularization with Edward models. I’m not very much familiar with tensorflow. Consider the model below,

# prior
w=Normal(loc=tf.zeros((d,c)),scale=tf.ones((d,c)))

# likelihood
y=Categorical(logits=tf.matmul(X,w))

# posterior
loc_qw = tf.get_variable("qw/loc", [d, c])
scale_qw = tf.nn.softplus(tf.get_variable("qw/scale", [d, c]))
qw = Normal(loc=loc_qw, scale=scale_qw)

# inference 
inference = ed.KLqp({w: qw, b: qb}, data={X:train_X, y:train_y})

I notice that Edward uses regularization losses in the loss function.
loss = -(p_log_lik - kl_penalty - reg_penalty)

However, I can’t figure out how to apply the regularization losses to the Edward model. How can we add L1 or L2 regularization to the above model?

Thanks!

davidaknowles · August 16, 2018, 8:32pm

Possibly I just answered your question on stackoverflow (I answered a very similar question!)

The normal prior on w is the Bayesian analogue to L2 regularization when optimizing parameters.

nadheesh · August 21, 2018, 5:31am

Thanks, I think I asked the wrong question. I knew that Normal prior is equivalent to the l2 regularization. Imaging if the prior is not normal and if we want to regularize the parameters that we are trying the estimate.

I found that this could be done using the regularizer param of the tf variables in the posterior.

loc_qw = tf.get_variable("qw/loc", [d, c], regularizer=tf.contrib.layers.l2_regularizer(reg_scale) )

Topic		Replies	Views
Understanding Edward KLqp algorithm	2	1502	July 23, 2018
Rookie problem (KLqp gets obviously wrong result)	7	1784	October 17, 2017
Use Edward within a tensorflow deep network using standard component	1	1286	March 13, 2017
Bayesian regression	4	1196	June 26, 2017
A toy normal model failed (klqp) and why?	2	1674	July 25, 2017

L2 regularization of weights

Related Topics