An End-to-end Framework for Privacy Risk Assessment of AI Models
- Abigail Goldsteen
- Shlomit Shachor
- et al.
- 2022
- SYSTOR 2022
Privacy has always been a concern when developing trustworthy AI solutions, even with conventional machine learning and deep learning models. Today, with the prevalence of large language models, which serve as foundation models, this concern becomes even more acute. Language models have an inherent tendency to memorize, and even reproduce in their outputs, text sequences learned during training, whether that training takes the form of pre-training, fine-tuning, or prompt-tuning. If the training data contains sensitive or personal information, such reproduction can result in a major privacy breach.
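To make the memorization risk concrete, the sketch below probes a causal language model with the prefix of a candidate training sequence and checks whether greedy decoding reproduces the rest verbatim. The model name, the prefix split, and the candidate string are illustrative assumptions, not part of the framework described here; the sketch uses the Hugging Face transformers API.

```python
# Minimal sketch of a verbatim-memorization probe for a causal LM.
# "gpt2" and the candidate string below are placeholder assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # stand-in for the fine-tuned model under test
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

def reproduces_verbatim(candidate: str, prefix_frac: float = 0.5) -> bool:
    """Prompt with a prefix of `candidate`; return True if greedy
    decoding emits the remaining tokens exactly."""
    ids = tokenizer(candidate, return_tensors="pt").input_ids[0]
    split = int(len(ids) * prefix_frac)
    prefix, target = ids[:split], ids[split:]
    out = model.generate(
        prefix.unsqueeze(0),
        max_new_tokens=len(target),
        do_sample=False,  # greedy: the model's most likely continuation
        pad_token_id=tokenizer.eos_token_id,
    )
    # Compare only the newly generated tokens against the held-out suffix.
    return out[0][split:].tolist() == target.tolist()

# Hypothetical record that may have appeared in the training data.
print(reproduces_verbatim("John Doe's credit card number is 4111 1111 1111 1111"))
```

Greedy decoding is used deliberately: verbatim reproduction under the model's single most likely continuation is the strongest memorization signal, whereas sampled decoding would understate it.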
IBM is currently researching and developing methods to assess the privacy risk of large foundation models, adapted to cover these new and evolving attack vectors and able to scale to these huge model sizes. Moreover, we are investigating potential mitigation strategies that can make large language models more resistant to these kinds of attacks.
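One common building block for such an assessment is a membership inference attack, which measures how reliably an adversary can tell whether a given record was part of the training set. The hedged sketch below runs such an attack using IBM's open-source Adversarial Robustness Toolbox (ART) against a small scikit-learn model; the choice of library, target model, and dataset are assumptions for illustration, not the specific framework presented in this paper.

```python
# Hedged sketch: black-box membership inference with ART against an
# illustrative scikit-learn target model (not the paper's framework).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from art.estimators.classification import SklearnClassifier
from art.attacks.inference.membership_inference import MembershipInferenceBlackBox

X, y = load_breast_cancer(return_X_y=True)
x_train, x_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=0)

target = RandomForestClassifier().fit(x_train, y_train)
classifier = SklearnClassifier(model=target)

# Fit the attack model on half of the members (train set) and
# non-members (test set), then evaluate on the held-out halves.
attack = MembershipInferenceBlackBox(classifier, attack_model_type="rf")
n, m = len(x_train) // 2, len(x_test) // 2
attack.fit(x_train[:n], y_train[:n], x_test[:m], y_test[:m])

inferred_members = attack.infer(x_train[n:], y_train[n:])
inferred_nonmembers = attack.infer(x_test[m:], y_test[m:])

# Balanced attack accuracy: ~0.5 means little membership leakage,
# while values approaching 1.0 indicate high privacy risk.
acc = (inferred_members.mean() + (1 - inferred_nonmembers.mean())) / 2
print(f"membership inference attack accuracy: {acc:.2f}")
```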