Are two mixture models with shared categorical distribution dependent/independent?

Rassibassi · February 25, 2018, 1:39pm

Hej,
I’d like two random variables, one that samples from a mixture of Gaussians and along with that a dependent OneHotCategorical random variable that represents that individual Gaussian from the mixture of Gaussians.
Let’s give it a try and first build a mixture of Gaussians:

from edward.models import Normal, Categorical, Mixture, Dirichlet, OneHotCategorical

mu = np.array([0.,1.,2.,3.], dtype=np.float32)
k=4
n=1000
d = Dirichlet(np.ones(k, dtype=np.float32))
c = Categorical(probs = d, sample_shape=(n))
components = [Normal(loc=mu[kk], scale=1., sample_shape=(n)) for kk in range(k)]
x = Mixture(cat=c, components=components, sample_shape=(n))

and another mixture of “deterministic” OneHotCategorical random variables

oneHotProbs = np.eye(k, k, dtype=np.float32)
oneHotProbs[oneHotProbs==0]=-1000
oneHotProbs[oneHotProbs==1]=1000
oneHotComponents = [OneHotCategorical(logits=oneHotProbs[kk,:], sample_shape=(n)) for kk in range(k)]
y = Mixture(cat=c, components=oneHotComponents, sample_shape=(n))

Are x and y coherently using the identical samples from c?
Is there any way to check this? Or is there a simpler way to do this?

Cheers, Rasmus

dustin · February 25, 2018, 8:12pm

To test:

sess = ed.get_session()
x_cat_sample, y_cat_sample = sess.run([x.cat, y.cat])
assert x_cat_sample == y_cat_sample

For a more end-to-end test, define the Normal’s in x with well-separated modes. For example, set scale to be 1e-8 and set loc to 0, 1, 2, and 3.

I predict both will pass. c is a Categorical random variable, which specifically means it is a Categorical TensorFlow Distribution associated with a sample tensor c* ~ p(c) in the TensorFlow graph (c.value). This sample tensor is the same in both x and y because c is the same in both x and y.

dustin · February 25, 2018, 8:24pm

Actually, the first test will pass but the second test will fail. x and y have a sample tensor (x.value and y.value). These tensors are given by calling {x,y}.sample(): its implementations newly draws a value from the Categorical random variable:

github.com

tensorflow/tensorflow/blob/r1.6/tensorflow/contrib/distributions/python/ops/mixture.py#L311


    mixture_log_cdf = math_ops.reduce_logsumexp(concatted_log_cdfs, [0])
    return mixture_log_cdf


def _sample_n(self, n, seed=None):
  if self._use_static_graph:
    # This sampling approach is almost the same as the approach used by
    # `MixtureSameFamily`. The differences are due to having a list of
    # `Distribution` objects rather than a single object, and maintaining
    # random seed management that is consistent with the non-static code path.
    samples = []
    cat_samples = self.cat.sample(n, seed=seed)
    for c in range(self.num_components):
      seed = distribution_util.gen_new_seed(seed, "mixture")
      samples.append(self.components[c].sample(n, seed=seed))
    x = array_ops.stack(
        samples, -self._static_event_shape.ndims - 1)     # [n, B, k, E]
    npdt = x.dtype.as_numpy_dtype
    mask = array_ops.one_hot(
        indices=cat_samples,                              # [n, B]
        depth=self._num_components,                       # == k
        on_value=np.ones([], dtype=npdt),

Rassibassi · February 26, 2018, 3:45am

Thank you. I will give it a try.
That the second test fails implicates that a Neural Network cannot be trained on classifying x with y as training output. Since x and y are sampled independently. Right?
Any ideas or suggestions how to create a construction as the above, but with x and y being dependent on each other?
How could this be implemented, I’d love to give it a try.

Topic		Replies	Views
Mixture of Linear Regression: How?	3	1557	March 27, 2018
Mixture model where weights are dependent on covariates	0	774	November 25, 2017
Multidimensional MDN	3	1279	July 17, 2018
Bayesian Model Combination (Kim, Ghahramani 2012)	3	1401	August 13, 2017
Variational EM for Independent Factor Analysis	7	1807	March 28, 2017

Are two mixture models with shared categorical distribution dependent/independent?

Related topics