Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

In this joint work with Vikram Voleti and Christopher Pal, we show that a single diffusion model can solve many video tasks: 1) interpolation, 2) forward/reverse prediction, and 3) unconditional generation through a well-designed masking scheme 🧙‍♂️. See our website, which contains many videos: https://mask-cond-video-diffusion.github.io. The paper can be found here. The code is available … Continue reading Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

Advertisement