Issue applying SGHMC on LeNet CNN

interactivetech · November 30, 2017, 5:27pm

Hey all,

I am having issues applying SGHMC inference on a convolutional neural network, such as LeNet. We believe our LeNet architecture is valid, but after 50000 iterations, the network does not learn anything.

We tested on applying MNIST dataset on a single layer NN with SGHMC, and we got 90% after 10000 iteration with batch size of 500.

Does Edward/SGHMC work on when using convolution operation, or are we doing something wrong with our model.

Here is our notebook of training LeNet with SGHMC with MNIST dataset.

Can anyone advise/help with why SGHMC is performing? My teammate and I would really appreciate it!

janislavjankov · December 5, 2017, 7:07am

I didn’t run your code so I could be wrong but I see in the model you define the placeholder X but then you define a new placeholder x which is the one used in the inference run. I hope this helps.

rort1989 · March 18, 2018, 3:46pm

I’m also trying something similar. But I am using HMC with a smaller scale dataset (about 20000 training images and 2000 testing images). I haven’t quite figure out how to get reasonable results. I am monitoring the acceptance rate returned by the hmc inference function. It suppose to be close to 1. A couple of things I noticed could change the acceptance rate of samples are: 1) initial values of the q variables i.e. the variables with Empirical distributions. 2) step_size (SGHMC also has this one); 3) n_steps (For SGHMC this is ‘friction’). By tuning the values of 1),2),3) I manage to get a reasonable acceptance rate. I also found the acceptance rate is quite sensitive to the choice of 1),2),3). However, the issue is that the model still does not learn anything meaning that if I use the samples of q variables to perform inference, the results are pretty poor.

I am not really sure how to debug this. Any further comments on getting HMC/SGHMC working on CNN are appreciated.

Topic		Replies	Views
Issue about SGHMC using mini-batch	0	719	July 15, 2018
Resolve logistic regression parameters on simulated data	20	2134	April 28, 2018
Can I run SGHMC with float64?	1	984	September 13, 2017
Having trouble setting up basic HMC model	4	1954	May 29, 2017
Acceptance Rate 0 for HMC in IRT models	1	1754	January 18, 2019

Issue applying SGHMC on LeNet CNN

Related topics