Data Management
The future of computing lies in the hybrid cloud. We're creating a hybrid data fabric that provides secure, governed data access from anywhere, enables self-service discovery of the right data at the right time, and takes a holistic approach to minimizing the total cost of ownership for AI and analytics.
Our work
Projects
Virtual experiments — a lab in the cloud
- Accelerated Discovery
- Data Management
- Hybrid Cloud HPC
- Materials Discovery
ProvLake
- Data Management
Publications
- 2023
- APS March Meeting 2023
- 2023
- AAAI 2023
- 2023
- CODS-COMAD 2023
- 2022
- Big Data 2022
- 2022
- Big Data 2022
- 2022
- Big Data 2022
IBM Solution: Data Fabric
Our research is regularly developed into new features for Data Fabric in IBM Cloud Pak for Data.
Tools + code
Fybrik
A cloud-native platform to unify data access, governance and orchestration, enabling business agility while securing enterprise data.
Datashim Framework
A Kubernetes-based framework for hassle-free handling of datasets.
Project CodeFlare
A framework to simplify the integration, scaling and acceleration of complex multi-step analytics and machine learning pipelines on the cloud.
Xskipper
A library for creating, managing and deploying data skipping indexes with Apache Spark.
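To illustrate the general idea behind a data skipping index, here is a minimal sketch in plain Python. It does not use Xskipper's API or Apache Spark; the `FileStats` class, the `build_index` and `files_to_scan` helpers, and the file names are all hypothetical. The sketch assumes the simplest kind of index, one that records the minimum and maximum of a single column per file, so that a range query can skip any file whose range cannot contain a match.

```python
from dataclasses import dataclass


@dataclass
class FileStats:
    """Hypothetical per-file metadata: min and max of one column."""
    path: str
    min_value: float
    max_value: float


def build_index(files):
    """Build a min/max index from a mapping of path -> column values."""
    return [FileStats(path, min(values), max(values))
            for path, values in files.items()]


def files_to_scan(index, lower, upper):
    """Return paths whose [min, max] range overlaps the query range [lower, upper]."""
    return [s.path for s in index
            if s.max_value >= lower and s.min_value <= upper]


# Illustrative data only: three files with values of a single numeric column.
files = {
    "part-0.parquet": [1.0, 4.5, 3.2],
    "part-1.parquet": [10.0, 12.5, 11.1],
    "part-2.parquet": [5.0, 9.9, 7.3],
}

index = build_index(files)

# Query: rows with a value between 9.0 and 11.0. part-0 is skipped because
# its maximum (4.5) is below the lower bound.
candidates = files_to_scan(index, 9.0, 11.0)
print(candidates)
```

The index may return files that turn out to hold no matching rows, but it never skips a file that does, so query results stay correct while fewer files are read.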