CBMoS

None, None; None, None; None, None; None, None; None, None

CBMoS

Combinatorial Bandit Learning for Mode Selection and Resource Allocation in D2D Systems

Journal Article (2019)

Author(s)

Andrea Ortiz (Technische Universität Darmstadt)

Arash Asadi (Technische Universität Darmstadt)

Max Engelhardt (Vector Informatik GmbH, Technische Universität Darmstadt)

Anja Klein (Technische Universität Darmstadt)

Matthias Hollick (Technische Universität Darmstadt)

Affiliation

External organisation

DOI related publication

https://doi.org/10.1109/JSAC.2019.2933764

Online learning Device-to-device communications Combinatorial multi-armed bandits Mode selection and resource allocation

To reference this document use:

https://resolver.tudelft.nl/uuid:4c3d5f76-ca56-42a3-9e4e-4ebb62c323ac

More Info

expand_more

Publication Year

2019

Language

English

Affiliation

External organisation

Issue number

10

Volume number

37

Pages (from-to)

2225-2238

Abstract

The complexity of the mode selection and resource allocation (MSRA) problem has hampered the commercialization progress of Device-to-Device (D2D) communication in 5G networks. Furthermore, the combinatorial nature of MSRA has forced the majority of existing proposals to focus on constrained scenarios or offline solutions to contain the size of the problem. Given the real-time constraints in actual deployments, a reduction in computational complexity is necessary. Adaptability is another key requirement for mobile networks that are exposed to constant changes such as channel quality fluctuations and mobility. In this article, we propose an online learning technique (i.e., CBMoS) which leverages combinatorial multi-armed bandits (CMAB) to tackle the combinatorial nature of MSRA. Furthermore, our two-stage CMAB design results in a tight model, which eliminates the theoretically feasible but practicality invalid options from the solution space. We prototype the first SDR-based D2D testbed to verify the performance of CBMoS under real-world conditions. The simulations confirm that the fast learning speed of CBMoS leads to outperforming the benchmark schemes by up to 132%. In experiments, CBMoS exhibits even higher performance (up to 142%) than in the simulations. This stems from the adaptability/fast learning speed of CBMoS in presence of high channel dynamics which cannot be captured via statistical channel models used in the simulators.

No files available

Metadata only record. There are no files for this record.