FlexAttention for Efficient High-Resolution Vision-Language ModelsJunyan LiDelin Chenet al.2024ECCV 2024
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical ReasoningYining HongLi Yiet al.2021NeurIPS 2021