A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial ScenariosSamuel AckermanElla Rabinovichet al.2024EMNLP 2024
Predicting Question-Answering Performance of Large Language Models through Semantic ConsistencyElla RabinovichSamuel Ackermanet al.2023EMNLP 2023
Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text CorporaGeorge KourSamuel Ackermanet al.2022EMNLP 2022
Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights GenerationGeorge KourMarcel Zalmanoviciet al.2022AAAI 2022
Density-based interpretable hypercube region partitioning for mixed numeric and categorical dataSamuel AckermanEitan Farchiet al.2021JSM 2021
Machine Learning Model Drift Detection Via Weak Data SlicesSamuel AckermanParijat Dubeet al.2021ICSE 2021
FreaAI: Automated extraction of data slices to test machine learning modelsSamuel AckermanOrna Razet al.2020AAAI 2020