Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionAkari AsaiZeqiu Wuet al.2024ICLR 2024
Unraveling the Key Components of OOD Generalization via DiversificationHarold BenoitLiangze Jianget al.2024ICLR 2024
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven PriorsIdo AmosJonathan Berantet al.2024ICLR 2024
A Probabilistic Framework for Modular Continual LearningLazar ValkovAkash Srivastavaet al.2024ICLR 2024
Get a Head Start: On-Demand Pedagogical Policy Selection in Intelligent TutoringGe GaoXi Yanget al.2024AAAI 2024
MANDREL: Modular Reinforcement Learning Pipelines for Material DiscoveryClyde FareGeorge K. Holtet al.2024AAAI 2024
Seed-Guided Fine-Grained Entity Typing in Science and Engineering DomainsYu ZhangYunyi Zhanget al.2024AAAI 2024
Partially Observable Hierarchical Reinforcement Learning with AI PlanningBrandon RozekJunkyu Leeet al.2024AAAI 2024