Data Management
The future of computing lies in the hybrid cloud. We're creating a hybrid data fabric that provides secure, governed data access from anywhere, enables self-service discovery of the right data at the right time, and takes a holistic view at minimizing total cost of ownership for AI and analytics.
Our work
IBM’s CodeFlare significantly cuts the time to automate transfer learning tasks for foundation models
ResearchBishwaranjan Bhattacharjee, Raghu Ganti, Carlos Costa, Mudhakar Srivatsa, and Nick Fuller4 minute read- Data Management
- Foundation Models
- Hybrid Cloud Platform
- Machine Learning
Publications
- Pranav Subramaniam
- Udayan Khurana
- et al.
- 2023
- CIKM 2023
- Alexander Zadorojniy
- 2023
- INFORMS 2023
- Syed Qasim
- Mert Toslali
- et al.
- 2023
- IC2E 2023
- Kanat Tangwongsan
- Martin Hirzel
- et al.
- 2023
- VLDB 2023
- Vasilis Efthymiou
- Sainyam Galhotra
- et al.
- 2023
- VLDB 2023
- Rajesh Bordawekar
- Tirthankar Lahiri
- 2023
- VLDB 2023
IBM Solution: Data Fabric
Our research is regularly developed into new features for Data Fabric in IBM Cloud Pak for Data.
Tools + code
Fybrik
A cloud native platform to unify data access, governance and orchestration, enabling business agility while securing enterprise data.
View project →Datashim Framework
A kubernetes-based framework for hassle free handling of datasets.
View project →Project CodeFlare
A framework to simplify the integration, scaling and acceleration of complex multi-step analytics and machine learning pipelines on the cloud.
View project →Xskipper
A library for creating, managing and deploying data skipping indexes with Apache Spark
View project →