WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from WikipediaYufang HouAlessandra Pascaleet al.2024NeurIPS 2024
Consistency-based Black-box Uncertainty Quantification for Text-to-SQLDebarun BhattacharjyaBalaji Ganesanet al.2024NeurIPS 2024
A Framework for Agents Guiding Foundation Models through Knowledge and ReasoningDebarun BhattacharjyaJunkyu Leeet al.2024IJCAI 2024
Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling OptimizationPaulito PalmesAkihiro Kishimotoet al.2023JuliaCon 2023
Automated AI For Decision Optimization with Reinforcement LearningShankar SubramaniamTakayuki Osogamiet al.2023AAAI 2023
An End-to-end Automated AI System for Reinforcement LearningLong VuTodd Mummertet al.2022INFORMS 2022
Distributed AutoML Pipeline Search in PC/RasPi K8s ClusterPaulito PalmesAkihiro Kishimotoet al.2022JuliaCon 2022
Boolean Decision Rules for Reinforcement Learning Policy SummarisationJames McCarthyRahul Nairet al.2022IJCAI 2022