Search results | TU Delft Repositories

Searched for: +

(1 - 2 of 2)

document: Hierarchize Pareto Dominance in Multi-Objective Stochastic Linear Bandits
Cheng, Ji (author), Xue, Bo (author), Jiaxiang, Y. (author), Zhang, Qingfu (author)
Multi-objective Stochastic Linear bandit (MOSLB) plays a critical role in the sequential decision-making paradigm, however, most existing methods focus on the Pareto dominance among different objectives without considering any priority. In this paper, we study bandit algorithms under mixed Pareto-lexicographic orders, which can reflect...
journal article 2024

document: Improved DQN-Based Computation Offloading Algorithm in MEC Environment
Zhao, Zheyu (author), Cheng, H. (author), Xu, Xiaohua (author)
Massive terminal users have brought explosive need of data residing at edge of overall network. Multiple Mobile Edge Computing (MEC) servers are built in/near base station to meet this need. However, optimal distribution of these servers to multiple users in real time is still a problem. Reinforcement Learning (RL) as a framework to solve...
conference paper 2023