View all topics

Data Management

The future of computing lies in the hybrid cloud. We're creating a hybrid data fabric that provides secure, governed data access from anywhere, enables self-service discovery of the right data at the right time, and takes a holistic view at minimizing total cost of ownership for AI and analytics.

Our work

Bringing the power of semantic AI to IBM Db2
Technical note
Prabhakar Kudva, Apoorva Nitsure, Petr Novotny, Hong Min, and Donna Dillenberger
08 Jun 2026
IBM demonstrates extreme scale for content-aware storage with a 100-billion vector database
News
Peter Hess
13 Apr 2026
Accelerating AI inference with IBM Storage Scale
Technical note
Yue Zhu, Radu Stoica, Animesh Trivedi, Jonathan Terner, Frank Schmuck, Jeremy Cohn, Christof Schmitt, Anthony Hsu, Guy Margalit, Vasily Tarasov, Swaminathan Sundararaman, Talia Gershon, and Vincent Hsu
18 Nov 2025
IBM’s text-to-SQL generator takes top place on a benchmark for handling complex database queries
News
Kim Martineau
02 Jul 2024
IBM’s CodeFlare significantly cuts the time to automate transfer learning tasks for foundation models
Research
Bishwaranjan Bhattacharjee, Raghu Ganti, Carlos Costa, Mudhakar Srivatsa, and Nick Fuller
16 Dec 2021
4 minute read

Publications

Text-to-SQL Evaluation Toolkit
- - Oktie Hassanzadeh
  - Yotam Perlitz
  - et al.
- 2026
- VLDB 2026
Demo paper
4th International Workshop on Tabular Data Analysis (TaDA)
- - Vasilis Efthymiou
  - Oktie Hassanzadeh
  - et al.
- 2026
- VLDB 2026
Workshop
Predicting Table Joinability in Data Lakes using a Metadata Knowledge Graph
- - Sola Shirai
  - Oktie Hassanzadeh
  - et al.
- 2026
- VLDB 2026
Workshop paper
Data valuation model for non-monetary exchanges
- - Julia Blyumen
  - Eitan Farchi
- 2026
- arXiv
Paper
Recall Is Not Enough: Token-Centric Metrics for Agentic Schema
- - Ioana Giurgiu
  - Michael Nidd
- 2026
- aiDM 2026
Workshop paper
Markdown Mayhem : Taming the Agentic Documentation Explosion
- - Harsha Kokel
- 2026
- ACM CAIS 2026
Workshop paper

View all publications

IBM Solution: Data Fabric

Our research is regularly developed into new features for Data Fabric in IBM Cloud Pak for Data.

Learn more

Projects

FlowPilot
An LLM-Powered System for Enterprise Data
Simplified and Performant Access to Data in the Cloud
Reducing friction for scientific and foundation model workflows in Kubernetes.
ProvLake
A lineage data management system for tracking data in hybrid cloud deployments.

View more projects