Automating Thought of Search: A Journey Towards Soundness and Completeness. Daniel Cao, Michael Katz, et al. 2024. NeurIPS 2024.
Recurrent Transformers Trade-off Parallelism for Length Generalization on Regular Languages. Paul Soulos, Aleksandar Terzic, et al. 2024. NeurIPS 2024.
MemReasoner: A Memory-augmented LLM Architecture for Multi-hop Reasoning. Irene Ko, Sihui Dai, et al. 2024. NeurIPS 2024.
Towards Unbiased Evaluation of Time-series Anomaly Detector. Debarpan Bhattacharya, Sumanta Mukherjee, et al. 2024. NeurIPS 2024.
Predicting LLM Inference Latency: A Roofline-Driven ML Method. Saki Imai, Rina Nakazawa, et al. 2024. NeurIPS 2024.
Thought of Search: Planning with Language Models Through The Lens of Efficiency. Michael Katz, Harsha Kokel, et al. 2024. NeurIPS 2024.
Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications. Bo Wen, Xin Zhang. 2024. NeurIPS 2024.
Multi-Task Neural Network Mapping onto Analog-Digital Heterogeneous Accelerators. Hadjer Benmeziane, Corey Liam Lammie, et al. 2024. NeurIPS 2024.
Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs. Megh Thakkar, Yash More, et al. 2024. NeurIPS 2024.
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging. Ismail Erbas, Vikas Pandey, et al. 2024. NeurIPS 2024.