Visualize: Figure1 & 3. Description: "Sketch of variational attention applied to machine translation. Two alignment distributions are shown, the blue prior p, and the red variational posterior q taking into account future observations. "