Federated privacy-preserving collaborative filtering for on-device next app prediction

Journal Article (2024)
Author(s)

Albert Saiapin (TU Delft - Mechanical Engineering, Skolkovo Institute of Science and Technology)

Gleb Balitskiy (Skolkovo Institute of Science and Technology)

Daniel Bershatsky (Skolkovo Institute of Science and Technology)

Aleksandr Katrutsa (AIRI, Skolkovo Institute of Science and Technology)

Evgeny Frolov (AIRI, Skolkovo Institute of Science and Technology)

Alexey Frolov (Skolkovo Institute of Science and Technology)

Ivan Oseledets (Skolkovo Institute of Science and Technology, AIRI)

Vitaliy Kharin (Independent researcher)

Research Group
Team Kim Batselier
DOI related publication
https://doi.org/10.1007/s11257-024-09395-0 Final published version
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Team Kim Batselier
Issue number
4
Volume number
34
Pages (from-to)
1369-1398
Downloads counter
111
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this study, we propose a novel SeqMF model to solve the problem of predicting the next app launch during mobile device usage. Although this problem can be represented as a classical collaborative filtering problem, it requires proper modification since the data are sequential, the user feedback is distributed among devices, and the transmission of users’ data to aggregate common patterns must be protected against leakage. According to such requirements, we modify the structure of the classical matrix factorization model and update the training procedure to sequential learning. Since the data about user experience are distributed among devices, the federated learning setup is used to train the proposed sequential matrix factorization model. One more ingredient of our approach is a new privacy mechanism that guarantees the protection of the sent data from the users to the remote server. To demonstrate the efficiency of the proposed model, we use publicly available mobile user behavior data. We compare our model with sequential rules and models based on the frequency of app launches. The comparison is conducted in static and dynamic environments. The static environment evaluates how our model processes sequential data compared to competitors. The dynamic environment emulates the real-world scenario, where users generate new data by running apps on devices. Our experiments show that the proposed model provides comparable quality with other methods in the static environment. However, more importantly, our method achieves a better privacy-utility trade-off than competitors in the dynamic environment, which provides more accurate simulations of real-world usage.

Files

S11257-024-09395-0.pdf
(pdf | 0.803 Mb)
- Embargo expired in 28-09-2024
License info not available