Building Trustworthy AI Collaborators: Factuality and Source Attribution in Agentic WorkflowsAlessandra PascaleJames Barryet al.2025AGU 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from WikipediaYufang HouAlessandra Pascaleet al.2024NeurIPS 2024
Practical Perfusion Quantification in Multispectral Endoscopic Video: Using the Minutes after ICG Administration to Assess Tissue PathologyJonathan EpperleinMykhaylo Zayatset al.2021AMIA Annual Symposium 2021