Data Management
The future of computing lies in the hybrid cloud. We're creating a hybrid data fabric that provides secure, governed data access from anywhere, enables self-service discovery of the right data at the right time, and takes a holistic approach to minimizing the total cost of ownership for AI and analytics.
Our work
Tools + code
Fybrik
A cloud-native platform to unify data access, governance and orchestration, enabling business agility while securing enterprise data.
View project →
Datashim Framework
A Kubernetes-based framework for hassle-free handling of datasets.
View project →
Project CodeFlare
A framework to simplify the integration, scaling and acceleration of complex multi-step analytics and machine learning pipelines on the cloud.
View project →
Xskipper
A library for creating, managing and deploying data skipping indexes with Apache Spark.
View project →
Publications
- SIGMOD/PODS 2022
- EDBT 2022
- BigData Congress 2021
- ASE 2021
- SBBD 2021
- SMDS 2021
IBM Solution: Data Fabric
Our research is regularly developed into new features for Data Fabric in IBM Cloud Pak for Data.