These probability distributions are learned with an algorithm based on stochastic gradient descent (SGD).
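As a rough illustration of how a probability distribution can be learned with SGD, the sketch below fits a small categorical distribution by minimizing the negative log-likelihood one sample at a time. The model here (a softmax over free logits), the function names `softmax` and `sgd_fit_categorical`, the learning rate, and the synthetic data are all assumptions made for the example; the text does not specify the actual model or update rule.

```python
import math
import random


def softmax(logits):
    """Convert unnormalized logits into a probability vector."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sgd_fit_categorical(samples, num_classes, lr=0.01, epochs=20, seed=0):
    """Fit a categorical distribution to integer samples with plain SGD.

    Hypothetical helper: the parameters are logits, and each step follows
    the gradient of -log p[k] for a single observed class k.
    """
    rng = random.Random(seed)
    logits = [0.0] * num_classes  # start from the uniform distribution
    data = list(samples)
    for _ in range(epochs):
        rng.shuffle(data)  # visit samples in random order each epoch
        for k in data:
            probs = softmax(logits)
            for j in range(num_classes):
                # d(-log p[k]) / d logit_j = p[j] - 1[j == k]
                grad = probs[j] - (1.0 if j == k else 0.0)
                logits[j] -= lr * grad
    return softmax(logits)


# Synthetic data drawn from an assumed "true" distribution.
data_rng = random.Random(1)
true_probs = [0.6, 0.3, 0.1]
samples = data_rng.choices(range(3), weights=true_probs, k=2000)

learned = sgd_fit_categorical(samples, num_classes=3)
print([round(p, 3) for p in learned])
```

With a constant learning rate the estimate keeps fluctuating slightly around the empirical frequencies; a decaying step size is a common way to reduce that noise, at the cost of one more hyperparameter.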