Matías Mazzanti, Esteban Mocskos, et al.
ISCA 2025
The management of vast amounts of multi-source, multi-modal data that also ensures compliance with stringent data protection regulations represents a tremendous organizational and technical challenge, particularly in the era of big data, ubiquitous computing technology, data-driven decision making processes, and advanced ML/AI algorithms with voracious data appetite. In this context, this article explores the design, development, and implementation of the multi-modal knowledge integration platform KITE+, conceived as a cloud-native tool for the secure and efficient management of large and complex data sets accessed by a number of geographically distributed entities. KITE+ leverages bulk operations for document management, facilitates data multi-modality via document attachments, and incorporates strong security measures to ensure the adequate administration of sensitive personal data as required by modern data protection laws (such as the California Consumer Privacy Act - CCPA, and the General Data Protection Regulation - GDPR). We delve into the technical requirements that influenced the design of the proposed platform, and discuss the implemented technical measures that ensure alignment with data protection regulations. Finally, through a large-scale data analytics solution in the healthcare domain, we showcase how such a multi-modal database management platform cannot only safeguard personal information but also enable advanced data analytics.
Matías Mazzanti, Esteban Mocskos, et al.
ISCA 2025
Chen Xiong, Xiangyu Qi, et al.
ACL 2025
Zhiyuan He, Yijun Yang, et al.
ICML 2024
Teryl Taylor, Frederico Araujo, et al.
Big Data 2020