How to handle missing values in Gaussian Matrix Factorization

dustin · April 22, 2017, 4:25pm

In the literature, this is known as implicit feedback. From my (limited) understanding, there are a few ways to handle the zeroes:

Treat the zeros as part of the data, as you mention. For Gaussian MF, you then have to downweight the zeros somehow during inference via large penalizations. Poisson MF naturally solves this by defining a sparse generative process.
Treat the zeroes as missing values (latent variables), and marginalize them out. This can be tough in most cases. To do this in Edward, include a tf.placeholder for the indicators. Here’s an example.

I = tf.placeholder(tf.int32)
mu = tf.matmul(U, V, transpose_b=True)
sigma = tf.ones([M, N])
Y_obs = Normal(mu=tf.gather(mu, I), sigma=tf.gather(sigma, I))
Y_mis = Normal(mu=tf.gather(mu, 1 - I), sigma=tf.gather(sigma, 1 - I))

qY_mis = Normal(
    mu=tf.Variable(tf.random_uniform(Y_mis.shape)),
    sigma=tf.Variable(tf.nn.softplus(tf.random_uniform(Y_mis.shape)))
)

inference = ed.KLqp({U: qU, V: qV, Y_mis: qY_mis}, data={Y_obs: y_train, I: I_train})

Your mileage will vary depending on how you structure the indicators and nonzero values in the matrix.

Topic		Replies	Views
Matrix factorization with Masking	1	793	November 3, 2017
Factor analysis Example clarification	0	682	May 24, 2018
How to use the result of Probabilistic PCA given in the tutorial?	1	711	June 20, 2017
Biased variance estimates when using KLqp for Bayesian Linear Regression	7	1296	February 9, 2018
Do I need Negative Sampling in Poisson Factorization?	0	694	December 18, 2017

How to handle missing values in Gaussian Matrix Factorization

Related Topics