VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos

Qiucheng Wu; Handong Zhao; Zhixuan Chu; Jing Shi; Yang Zhang; Shiyu Chang

ICML 2026

Conference paper

06 Jul 2026

VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos

Abstract

Although recent text-to-video generative models are getting more capable of following external camera controls, imposed by either text descriptions or camera trajectories, they still struggle to generalize to unconventional camera motions, which is crucial in creating truly original and artistic videos. The challenge lies in the difficulty of finding sufficient training videos with the intended uncommon camera motions. To address this challenge, we propose VIVIDCAM, a training paradigm that enables diffusion models to learn complex camera motions from synthetic videos, releasing the reliance on collecting realistic training videos. VIVIDCAMin- corporates multiple disentanglement strategies that isolate camera motion learning from synthetic appearance artifacts, ensuring more robust motion representation and mitigating domain shift. We show that our design synthesizes a wide range of precisely controlled camera motions using surprisingly simple synthetic data. Notably, this synthetic data often consists of basic geometries within a low-poly 3D scene and can be efficiently rendered by engines like Unity. Our video results can be found inhttps://wuqiuche.github.io/VividCamDemoPage/.

Conference paper