Learning-Based Orchestration for Dynamic Functional Split and Resource Allocation in vRANs

Conference paper (2022)

Authors

Fahri Wisnu Murti University of Oulu

Samad Ali University of Oulu

G. Iosifidis Embedded Systems

Matti Latva-aho University of Oulu

Research Group

Computer Graphics and Visualisation (TU Delft)

DOI: https://doi.org/10.1109/EuCNC/6GSummit54941.2022.9815815

To reference this document use:

http://resolver.tudelft.nl/uuid:10f5244b-8785-4f82-ac98-7b494cc6459f

Published Date

2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Computer Graphics and Visualisation

Abstract

One of the key benefits of virtualized radio access networks (vRANs) is network management flexibility. However, this versatility raises previously unseen network management challenges. In this paper, a learning-based zero-touch vRAN orchestration framework (LOFV) is proposed to jointly select the functional splits and allocate the virtualized resources so as to minimize the long-term management cost. First, testbed measurements of the relationship between users’ demand and virtualized resource utilization are collected using a centralized RAN system. The collected data reveal non-linear and non-monotonic relationships between demand and resource utilization. Then, a comprehensive cost model is proposed that accounts for resource overprovisioning, declined demand, instantiation, and reconfiguration. The cost model also captures the different routing and computing costs of each split. Motivated by these measurement insights and the cost model, LOFV is developed using a model-free reinforcement learning paradigm. The proposed solution combines deep Q-learning with a regression-based neural network that maps the network state and users’ demand into split and resource control decisions. Our numerical evaluations show that LOFV can offer cost savings of up to 69% relative to the optimal static policy and up to 45% relative to the optimal fully dynamic policy.
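
The four cost terms named in the abstract (overprovisioning, declined demand, instantiation, reconfiguration), plus a per-split routing and computing cost, can be illustrated with a minimal per-period cost function. This is only a sketch under stated assumptions and is not the paper's formulation: the linear form, the weight values, and every name used here (`CostWeights`, `step_cost`, `split_cost`) are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class CostWeights:
    # Hypothetical placeholder weights; none of these values come from the paper.
    overprovision: float = 1.0    # per unit of allocated but unused resource
    declined: float = 5.0         # per unit of demand that was not served
    instantiation: float = 2.0    # charged when a split is deployed for the first time
    reconfiguration: float = 3.0  # charged when the split differs from the previous one


def step_cost(
    split: int,
    prev_split: Optional[int],
    allocated: float,
    utilized: float,
    demand: float,
    served: float,
    split_cost: Mapping[int, float],
    w: CostWeights = CostWeights(),
) -> float:
    """Illustrative cost of one decision period (assumed linear in each term)."""
    # Resource that was allocated but not used in this period.
    over = max(allocated - utilized, 0.0)
    # Demand that could not be served in this period.
    unserved = max(demand - served, 0.0)

    cost = w.overprovision * over + w.declined * unserved

    if prev_split is None:
        # No split was running before: pay the instantiation cost.
        cost += w.instantiation
    elif split != prev_split:
        # The split changed: pay the reconfiguration cost.
        cost += w.reconfiguration

    # Assumed per-split routing and computing cost, looked up by split index.
    cost += split_cost[split]
    return cost
```

For example, under these assumed weights, `step_cost(1, 0, 10.0, 8.0, 5.0, 5.0, {0: 0.2, 1: 0.5})` charges 2.0 for overprovisioning, 3.0 for reconfiguration and 0.5 for the split itself, giving 5.5. A learning agent of the kind the abstract describes would aim to minimize the long-term sum of such per-period costs.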

Files

Learning_Based_Orchestration_f... (pdf)

(pdf | 1.02 MB)

- Embargo expired on 08-01-2023

Unknown license