Gradient Estimator of Discrete Random Variables

In this post, I will review several popular gradient estimators for discrete random variables. In machine learning, especially latent variable models and reinforcement learning (RL), we are often facing the following situation. We have a discrete random variable \(Z\) which takes values from \(K\) categories where \(K\) could be finite or countably infinite. Assuming the distribution associated with \(Z\) is \(q_{\phi}(Z)\), we would like to optimize the expected function as follows,

Read More

Density Ratio Trick

There are many scenarios in machine learning and statistics where we hope to compute the ratio of two density functions, e.g., KL-divergence, mutual information, importance sampling, and hypothesis testing. Specifically, given two density \(p(X)\) and \(q(X)\) on the same sample space, we’d like to compute

Read More