Diffusion models
Setup
- : true sampling distribution of observed data
- : learned sampling distribution
- : noisy latent variables at each step
Forward process
In the forward process, noise is continuously added to the data across multiple steps, converging toward white noise : The fixed Gaussian forward process means we can directly calculate/sample from without calculating all the intermediate steps (similar to Autoregressive models): Where , , and is white noise. Again, this is 100% analogous to the forecasting equation for an AR(1).
In practice, is chosen to be close to 1, as this yields the best results.
Denoising
In the denoising or reverse diffusion process, noise is progressively removed over multiple steps, modeling the inverse process: Where .
is parameterized using a denoising network, (of which many exist, including the popular U-Net): This model is trained using the objective function:
References
@kolloviehPredictRefineSynthesize2023