- document
-
Hai, R. (author), Koutras, C. (author), Ionescu, A. (author), Li, Z. (author), Sun, W. (author), van Schijndel, Jessie (author), Kang, Yan (author), Katsifodimos, A (author)Machine learning (ML) training data is often scattered across disparate collections of datasets, called data silos. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different sources demand a lot of manual work and computational resources. With data privacy and...conference paper 2023