Unsuccessful Cholesky decomposition - Gaussian Process Regression

I’m trying to implement a Gaussian Process regressor using Edward.

I’ve used the following code to build the model:

import tensorflow as tf
from edward.models import MultivariateNormalDiag, MultivariateNormalFullCovariance
from edward.util import rbf

X = tf.placeholder(tf.float32, [N, 1])
f = MultivariateNormalFullCovariance(
        loc=tf.zeros([N]),
        covariance_matrix=rbf(X))
y = MultivariateNormalDiag(
        loc=f,
        scale_diag=tf.ones([N]) * 0.3)

and then the variational distribution and inference as follows:

import edward as ed

qf = MultivariateNormalFullCovariance(
        loc=tf.Variable(tf.random_normal([N])),
        covariance_matrix=tf.Variable(tf.random_normal([N, N])))
inference = ed.KLqp({f: qf}, data={X: x_train, y: y_train})
inference.run(n_iter=5000)

I feel like I’ve got something wrong: the inference throws an error saying that the Cholesky decomposition was unsuccessful.

Are there any examples that could help?

Your variational approximation has an N x N matrix of free parameters, which is not guaranteed to be positive semi-definite.
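
One common fix (a sketch, not the only option) is to parameterize the covariance through a lower-triangular Cholesky factor, so that L L^T is positive semi-definite by construction, e.g. with MultivariateNormalTriL from edward.models:

from edward.models import MultivariateNormalTriL

L_raw = tf.Variable(tf.random_normal([N, N]))
L = tf.matrix_set_diag(
        tf.matrix_band_part(L_raw, -1, 0),           # keep only the lower triangle
        tf.nn.softplus(tf.matrix_diag_part(L_raw)))  # force a positive diagonal
qf = MultivariateNormalTriL(
        loc=tf.Variable(tf.random_normal([N])),
        scale_tril=L)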

Have you looked at the GP classification tutorial (http://edwardlib.org/tutorials/supervised-classification)? Another helpful reference is examples/cox_process.py, which adds an epsilon to the diagonal of the covariance matrix to improve numerical stability.
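
For reference, the jitter trick from that example applied to the prior above would look roughly like this (1e-6 is an arbitrary small constant):

K = rbf(X) + tf.eye(N) * 1e-6  # small jitter on the diagonal so the Cholesky succeeds
f = MultivariateNormalFullCovariance(
        loc=tf.zeros([N]),
        covariance_matrix=K)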

The epsilon was really useful for the rbf, thanks!

My reasoning for the full N x N matrix was to capture the covariances between the data points in the GP, because I’m having issues making predictions.

I make predictions by sampling the posterior predictive and then averaging:

sess = ed.get_session()
post = ed.copy(y, {f: qf})
samples = [sess.run(post, feed_dict={X: x_test}) for _ in range(1000)]
y_pred = np.mean(samples, axis=0)  # average the samples (numpy imported as np)

The issue is that this produces the same mean as the posterior (conditioned on x_train) and doesn’t reflect the x_test data. I thought having the covariances would help.

Is there something wrong with what I’m doing or is there a better approach?
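
For reference: qf is a distribution over f at the N training inputs only, and its parameters do not depend on the placeholder X, so feeding x_test through the copied graph cannot change the predictions. Outside of Edward, the exact GP posterior predictive at new inputs has a closed form; here is a minimal numpy sketch (kernel is a hypothetical callable returning the kernel matrix between two sets of inputs, and noise_var matches the 0.3 noise scale above):

import numpy as np

def gp_predict(x_train, y_train, x_test, kernel, noise_var=0.3 ** 2):
    # Condition the GP prior on the training data in closed form.
    K = kernel(x_train, x_train) + noise_var * np.eye(len(x_train))
    K_s = kernel(x_train, x_test)          # cross-covariance: train vs. test
    alpha = np.linalg.solve(K, y_train)
    mean = K_s.T.dot(alpha)                # predictive mean at x_test
    cov = kernel(x_test, x_test) - K_s.T.dot(np.linalg.solve(K, K_s))
    return mean, cov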

Did you ever get your GP to extrapolate to test points that weren’t in the training set? And could you get the test set to be a different size than the training set? I think I can get GP results comparable to those from sklearn’s GaussianProcessRegressor, but so far I can’t get predictions for other points.

Any luck getting the GP to extrapolate to unobserved test points? I can’t seem to find how to do this.