Using tf.matmul vs ed.dot

Quick question

In the tutorials, for instance in the Bayesian linear regression example, the model is defined as

y = Normal(loc=ed.dot(X, w) + b, scale=1.0)

However, in the multiple-output case, w is a matrix, so one is no longer multiplying a matrix by a vector.

My question is: is there any reason not to use tf.matmul instead of ed.dot? Looking at the source, ed.dot seems to be a convenience function that reshapes a vector input into a matrix, wraps tf.matmul, and checks the inputs for NaNs or Infs via

verify_tensor_all_finite

x = control_flow_ops.with_dependencies(dependencies, x)

Is there any reason, assuming you already know your input data is valid, not to simply use tf.matmul?
For instance, if your data contains NaNs or Infs, the dot product won't produce anything useful anyway, so the check appears to be primarily for error reporting. I might be missing something subtle, however.

So instead is it perfectly fine to use something like

y = Normal(loc=tf.matmul(X, w) + b, scale=1.0)
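To illustrate the shape difference between the two cases, here is a minimal sketch using NumPy as a stand-in for the TensorFlow ops (np.dot plays the role of ed.dot, np.matmul of tf.matmul; the sizes N, D, K are hypothetical):

```python
import numpy as np

N, D, K = 5, 3, 2               # examples, features, outputs (hypothetical sizes)
X = np.random.randn(N, D)

w_vec = np.random.randn(D)      # single-output weights: a vector
w_mat = np.random.randn(D, K)   # multi-output weights: a matrix

y_single = X.dot(w_vec)         # matrix-vector product -> shape (N,)
y_multi = np.matmul(X, w_mat)   # matrix-matrix product -> shape (N, K)

print(y_single.shape)  # (5,)
print(y_multi.shape)   # (5, 2)
```

With a weight matrix, matmul is the natural operation, since the result y is itself a matrix of shape (N, K).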

cheers!

You are correct. ed.dot is a convenience function for matrix-vector products. tf.matmul works only on matrices and returns a matrix. If w and y are indeed both matrices as in your setting, then tf.matmul is more appropriate.
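To make the convenience-wrapper point concrete, here is a hedged NumPy sketch of what a dot helper like ed.dot does, based on the reading of the source above: reshape the vector into a one-column matrix, call matmul, and flatten the result back to a vector (the helper name is ours, not Edward's):

```python
import numpy as np

def dot_via_matmul(X, w):
    """Matrix-vector product expressed via matmul, as ed.dot does internally."""
    y = np.matmul(X, w.reshape(-1, 1))  # (N, D) @ (D, 1) -> (N, 1)
    return y.reshape(-1)                # flatten back to shape (N,)

X = np.arange(6.0).reshape(2, 3)
w = np.array([1.0, 0.0, 2.0])
print(dot_via_matmul(X, w))   # matches X.dot(w)
```

So for matrix-valued w there is nothing to wrap, and calling tf.matmul directly is the right choice.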


awesome, thanks dustin