Trustworthy AI

Our trust in technology relies on understanding how it works. It’s important to understand why AI makes the decisions it does. We’re developing tools to make AI more explainable, fair, robust, private, and transparent.

Explore our topics

Overview

Artificial intelligence systems have become increasingly prevalent in everyday life and enterprise settings, and they’re now often being used to support human decision-making. These systems have grown increasingly complex and efficient, and AI holds the promise of uncovering valuable insights across a wide range of applications. But broad adoption of AI systems will require humans to trust their output.

When people understand how technology works, and we can assess that it’s safe and reliable, we’re far more inclined to trust it. Many AI systems to date have been black boxes, where data is fed in and results come out. To trust a decision made by an algorithm, we need to know that it is fair, that it’s reliable and can be accounted for, and that it will cause no harm. We need assurances that AI cannot be tampered with and that the system itself is secure. We need to be able to look inside AI systems, to understand the rationale behind the algorithmic outcome, and even ask it questions as to how it came to its decision.

At IBM Research, we’re working on a range of approaches to ensure that AI systems built in the future are fair, robust, explainable, account, and align with the values of the society they’re designed for. We’re ensuring that in the future, AI applications are as fair as they are efficient across their entire lifecycle.

Our work

An artist’s tribute to modern AI
Q & A
Kim Martineau
27 Oct 2025
Lightweight tools for ‘steering’ LLMs down the right path
Research
Kim Martineau
15 Oct 2025
In AI, alignment is the goal. Steerability is how you get there
Q & A
Kim Martineau
26 Sep 2025
IBM further strengthens Granite for enterprise deployment with HackerOne
News
Mike Murphy
27 Aug 2025
Debugging LLMs to improve their credibility
Research
Kim Martineau
30 Jul 2025
How IBM’s Kush Varshney became the face of the modern ‘camera man’
Q & A
Kim Martineau
21 Jul 2025
See more of our work on Trustworthy AI

Topics

AI Testing
We’re designing tools to help ensure that AI systems are trustworthy, reliable and can optimize business processes.
Adversarial Robustness and Privacy
We’re making tools to protect AI and certify its robustness, and helping AI systems adhere to privacy requirements.
Explainable AI
We’re creating tools to help AI systems explain why they made the decisions they did.
Fairness, Accountability, Transparency
We’re developing technologies to increase the end-to-end transparency and fairness of AI systems.
Trustworthy Generation
We’re developing theoretical and algorithmic frameworks for generative AI to accelerate future scientific discoveries.
Uncertainty Quantification
We’re developing ways for AI to communicate when it's unsure of a decision across the AI application development lifecycle.

Publications

Comparison of simulated and observed methane plumes at oil and gas sites in the Permian Basin using advanced dispersion, coupled mesoscale-LES atmospheric modeling, and scientific machine learning
- - Arash Fathi
  - Joao Lucas de Sousa Almeida
  - et al.
- 2025
- AGU 2025
APILOT: Improving the Security and Usability of LLM Code Suggestions via Outdated API Mitigation
- - Weiheng Bai
  - Keyang Xuan
  - et al.
- 2025
- ACSAC 2025
Cross-Process Defect Attribution using Potential Loss Analysis
- - Ide-San Ide
  - Kohei Miyaguchi
- 2025
- WSC 2025
The Shepherd Test: How Will Superintelligent Agents Balance Care and Control in Asymmetric Relationships?
- - Djallel Bouneffouf
  - Matthew Riemer
  - et al.
- 2025
- NeurIPS 2025
Toward a Coherent Virtual Cell Model: Probing Biological World-Model Coherence in Transcriptomic Foundation Models
- - Noa Moriel
  - Yishai Shimoni
  - et al.
- 2025
- NeurIPS 2025
Foundation Models Enabling Multi-Scale Battery Materials Discovery: From Molecules To Devices
- - Vidushi Sharma
  - Andy Tek
  - et al.
- 2025
- NeurIPS 2025

View all publications

Building trustworthy AI with Watson

Our research is regularly integrated into Watson solutions to make IBM’s AI for business more transparent, explainable, robust, private, and fair.

Learn more