MemReasoner: A Memory-augmented LLM Architecture for Multi-hop Reasoning
Abstract
Recent benchmarks suggest that there remains significant room to improve large language models' ability to robustly reason over facts distributed across extremely long documents. In this work, we propose MemReasoner, a new memory-augmented LLM architecture that is trained to perform temporal reasoning, together with multiple computational steps, over the context stored in its memory. Experiments show that MemReasoner, trained on the core reasoning facts, generalizes better than off-the-shelf large language models and existing recurrent models on a test distribution where the required facts are scattered across long natural text of up to 128k tokens. Further, MemReasoner demonstrates robust reasoning performance relative to the baselines when the answer distribution in test samples differs from that of the training set.