3D-VLA: A 3D Vision-Language-Action Generative World ModelHaoyu ZhenXiaowen Qiuet al.2024ICML 2024Conference paper
ContPhy: Continuum Physical Concept Learning and Reasoning from VideosZhicheng ZhengXin Yanet al.2024ICML 2024Conference paper