Any difference between ed.dot and tf.reduce_sum(..., axis=-1)?

For the Bernoulli sample code I posted before,

C = Bernoulli(logits=z_logits)

I replaced ed.dot with tf.reduce_sum. The results are mostly the same, but parameters with very small values change, e.g. from -0.0091895 to 0.03451591. There is no other difference than the reshape needed to use ed.dot. Is there any reason a reshape, or swapping out the ed.dot call, would change these small parameter values?
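
For reference, here is a minimal sketch of the two formulations being compared, assuming a logistic-regression-style setup with a feature matrix X and weight vector w (those names and shapes are my guess; only the Bernoulli line appears above):

```python
import tensorflow as tf
import edward as ed
from edward.models import Bernoulli

D = 10                                     # assumed feature dimension
X = tf.placeholder(tf.float32, [None, D])  # assumed design matrix
w = tf.Variable(tf.zeros(D))               # assumed weight vector

# Variant 1: ed.dot, a 2-D x 1-D dot product built on tf.matmul
# (this is the call that needed the reshape mentioned above).
z_logits = ed.dot(X, w)

# Variant 2: elementwise multiply, then sum over the last axis.
z_logits = tf.reduce_sum(X * w, axis=-1)

C = Bernoulli(logits=z_logits)
```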

ed.dot calls into tf.matmul, which calls a specialized matrix multiplication routine.

Multiplying and then summing floating point numbers propagates rounding errors differently than a single matrix multiplication does, because the partial products are accumulated in a different order. Refer to http://floating-point-gui.de/errors/propagation/. Over many training steps these tiny discrepancies can compound and steer the optimization along a slightly different path, which is most noticeable in parameters whose values are close to zero.
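
A quick, self-contained illustration of the effect (NumPy here rather than TensorFlow, but the same principle applies to the ops above; the sizes and seed are arbitrary):

```python
import numpy as np

rng = np.random.RandomState(0)
X = rng.randn(1, 100000).astype(np.float32)
w = rng.randn(100000).astype(np.float32)

# Dedicated matrix multiplication routine (BLAS under the hood).
via_matmul = X.dot(w)[0]

# Elementwise multiply followed by a separate summation.
via_sum = np.sum(X * w, axis=-1)[0]

# The two accumulate rounding error in different orders, so in
# float32 the results typically agree only to a few decimal places.
print(via_matmul, via_sum, via_matmul - via_sum)
```

Both results are equally "correct" to within floating point tolerance; neither reduction order is privileged.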