FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
Optimistic Exploration for Risk-Averse Constrained Reinforcement LearningRadu MarinescuElizabeth Dalyet al.2025ECAI 2025
Assessing Confidence in Large Language Models by Classifying Task Correctness using Similarity FeaturesDebarun BhattacharjyaBalaji Ganesanet al.2025ICLR 2025
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from WikipediaYufang HouAlessandra Pascaleet al.2024NeurIPS 2024
Consistency-based Black-box Uncertainty Quantification for Text-to-SQLDebarun BhattacharjyaBalaji Ganesanet al.2024NeurIPS 2024
A Framework for Agents Guiding Foundation Models through Knowledge and ReasoningDebarun BhattacharjyaJunkyu Leeet al.2024IJCAI 2024
Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling OptimizationPaulito PalmesAkihiro Kishimotoet al.2023JuliaCon 2023