Visual Chain-of-Thought Prompting for Knowledge-Based Visual ReasoningZhenfang ChenQinhong Zhouet al.2024AAAI 2024
ComPhy: Compositional Physical Reasoning of Objects and Events from VideosZhenfang ChenKexin Yiet al.2022ICLR 2022
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and LanguageMingyu DingZhenfang Chenet al.2021NeurIPS 2021
Grounding Physical Object and Event Concepts Through Dynamic Visual ReasoningZhenfang ChenJiayuan Maoet al.2021ICLR 2021
ContPhy: Continuum Physical Concept Learning and Reasoning from VideosZhicheng ZhengXin Yanet al.2024ICML 2024
GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing ModulesZhenfang ChenRui Sunet al.2024ICLR 2024
COVLM: COMPOSING VISUAL ENTITIES AND RELATIONSHIPS IN LARGE LANGUAGE MODELS VIA COMMUNICATIVE DECODINGJunyan LiDelin Chenet al.2024ICLR 2024