Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2024NeurIPS 2024
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAIAmbrish RawatStefan Schoepfet al.2024NeurIPS 2024
Advanced Physics-AI Models for Rain Enhancement in Arid RegionsLloyd TreinishMukul Tewariet al.2024AGU 2024
Modelling the Extreme July 2023 Hudson Valley Precipitation Event Using WRFAnthony PrainoLloyd Treinishet al.2024AGU 2024
Advancing Applications of Remote Sensing for Detection of and Long-Term Monitoring of Harmful Algal Blooms (HABs)Lloyd TreinishVincent Moriarty2024AGU 2024
Membership Inference Attacks Against Time-Series ModelsNoam KorenAbigail Goldsteenet al.2024ACML 2024
Fuse to Forget: Bias Reduction and Selective Memorization through Model FusionKerem ZamanLeshem Choshenet al.2024EMNLP 2024
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial ScenariosSamuel AckermanElla Rabinovichet al.2024EMNLP 2024