Deep Reinforcement Learning for Orchestrating Cost-Aware Reconfigurations of vRANs

None, None; None, None; None, None; None, None

Deep Reinforcement Learning for Orchestrating Cost-Aware Reconfigurations of vRANs

Journal Article (2024)

Author(s)

Fahri Wisnu Murti (University of Oulu)

Samad Ali (University of Oulu)

George Iosifidis (TU Delft - Networked Systems)

Matti Latva-aho (University of Oulu)

Research Group

Networked Systems

DOI related publication

https://doi.org/10.1109/TNSM.2023.3292713

Routing Neural networks Computer architecture Costs Deep reinforcement learning O-RAN Orchestration Computational modeling Load modeling Data models Network virtualization Action branching D3QN Radio access networks (RANs)

To reference this document use:

https://resolver.tudelft.nl/uuid:10311077-2304-46cd-b9e2-0eb920b2295f

More Info

expand_more

Publication Year

2024

Language

English

Research Group

Networked Systems

Issue number

1

Volume number

21

Pages (from-to)

200-216

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Virtualized Radio Access Networks (vRANs) are fully configurable and can be implemented at a low cost over commodity platforms to enable network management flexibility. In this paper, a novel vRAN reconfiguration problem is formulated to jointly reconfigure the functional splits of the base stations (BSs), locations of the virtualized central units (vCUs) and distributed units (vDUs), their resources, and the routing for each BS data flow. The objective is to minimize the long-term total network operation cost while adapting to the varying traffic demands and resource availability. In the first step, testbed measurements are performed to study the relationship between the traffic demands and computing resources, which reveals high variance and depends on the platform and its load. Consequently, finding the perfect model of the underlying system is non-trivial. Therefore, to solve the proposed problem, a deep reinforcement learning (RL)-based framework is proposed and developed using model-free RL approaches. Moreover, the problem consists of multiple BSs sharing the same resources, which results in a multi-dimensional discrete action space and leads to a combinatorial number of possible actions. To overcome this curse of dimensionality, action branching architecture, which is an action decomposition method with a shared decision module followed by neural network is combined with Dueling Double Deep Q-network (D3QN) algorithm. Simulations are carried out using an O-RAN compliant model and real traces of the testbed. Our numerical results show that the proposed framework successfully learns the optimal policy that adaptively selects the vRAN configurations, where its learning convergence can be further expedited through transfer learning even in different vRAN systems. It also offers significant cost savings by up to 59% of a static benchmark, 35% of Deep Deterministic Policy Gradient with discretization, and 76% of non-branching D3QN.

Files

Deep_Reinforcement_Learning_fo... (pdf)

(pdf | 3.25 Mb)