Issue applying SGHMC on LeNet CNN

rort1989 · March 18, 2018, 3:46pm

I’m also trying something similar. But I am using HMC with a smaller scale dataset (about 20000 training images and 2000 testing images). I haven’t quite figure out how to get reasonable results. I am monitoring the acceptance rate returned by the hmc inference function. It suppose to be close to 1. A couple of things I noticed could change the acceptance rate of samples are: 1) initial values of the q variables i.e. the variables with Empirical distributions. 2) step_size (SGHMC also has this one); 3) n_steps (For SGHMC this is ‘friction’). By tuning the values of 1),2),3) I manage to get a reasonable acceptance rate. I also found the acceptance rate is quite sensitive to the choice of 1),2),3). However, the issue is that the model still does not learn anything meaning that if I use the samples of q variables to perform inference, the results are pretty poor.

I am not really sure how to debug this. Any further comments on getting HMC/SGHMC working on CNN are appreciated.

Topic		Replies	Views
Issue about SGHMC using mini-batch	0	715	July 15, 2018
Resolve logistic regression parameters on simulated data	20	2112	April 28, 2018
Can I run SGHMC with float64?	1	983	September 13, 2017
Having trouble setting up basic HMC model	4	1943	May 29, 2017
Acceptance Rate 0 for HMC in IRT models	1	1752	January 18, 2019

Issue applying SGHMC on LeNet CNN

Related topics