FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMsSwanand Ravindra KadheAnisa Halimiet al.2023NeurIPS 2023
Cost-Aware Counterfactuals for Black Box ExplanationsNatalia Martinez GilKanthi Sarpatwaret al.2023NeurIPS 2023
Weakly Supervised Detection of Hallucinations in LLM ActivationsMiriam RateikeCelia Cintaset al.2023NeurIPS 2023
Subtle Misogyny Detection and Mitigation: An Expert-Annotated DatasetAnna RichterBrooklyn Sheppardet al.2023NeurIPS 2023
Influence Based Approaches to Algorithmic Fairness: A Closer LookSoumya GhoshPrasanna Sattigeriet al.2023NeurIPS 2023
Adversarial Auditing of Machine Learning Models under Compound ShiftKaran BhanotDennis Weiet al.2023ESANN 2023
Balancing Social Impact, Opportunities, and Ethical Constraints of Using AI in the Documentation and Vitalization of Indigenous LanguagesClaudio S. PinhanezPaulo Cavalinet al.2023IJCAI 2023
Skin Tone Analysis for Representation in Educational Materials (STAR-ED) Using Machine LearningGirmaw Abebe TadesseCelia Cintaset al.2023npj Digital Medicine
An AI-assisted Workbench for Material DiscoveryEmilio Ashton Vital BrazilRenato Fontoura de Gusmao Cerqueiraet al.2023ACS Fall 2023