Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text CorporaGeorge KourSamuel Ackermanet al.2022EMNLP 2022
Dynamic graph and polynomial chaos based models for contact tracing data analysis and optimal testing prescriptionShashanka UbaruLior Horeshet al.2021Journal of Biomedical Informatics
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial ScenariosSamuel AckermanElla Rabinovichet al.2024EMNLP 2024
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs)Leshem ChoshenAriel Geraet al.2024LREC-COLING 2024
Deploying automated ticket router across the enterpriseSamuel AckermanLincoln Alexanderet al.2023AI Magazine
Workflow Provenance in the Lifecycle of Scientific Machine LearningRenan Francisco Santos SouzaLeonardo Guerreiro Azevedoet al.2021CCPE