Network Traffic Matrix Imputation via Large Language Models

None, None; None, None; None, None; None, None; None, None; None, None

Network Traffic Matrix Imputation via Large Language Models

Conference Paper (2025)

Author(s)

Kaiwen Jiang (Hefei University of Technology)

Fenglin Yan (Hefei University of Technology)

Yan Qiao (Hefei University of Technology)

Meng Li (Hefei University of Technology)

Yuxuan Li (The Hong Kong Polytechnic University)

Mauro Conti (TU Delft - Electrical Engineering, Mathematics and Computer Science, Università degli Studi di Padova)

Research Group

Cyber Security

Large language models Adversarial learning Network monitoring Traffic matrix imputation

DOI related publication

https://doi.org/10.1109/ISCC65549.2025.11326227 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:5ee3710d-4900-4671-8e15-88e7c19f9677

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Cyber Security

Publisher

IEEE

ISBN (electronic)

9798331524203

Event

30th IEEE Symposium on Computers and Communications, ISCC 2025 (2025-07-02 - 2025-07-05), Bologna, Italy

Downloads counter

2

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Large Language Models (LLMs) have demonstrated remarkable zero-shot capabilities across various domains. This paper pioneers the application of LLMs' outstanding knowledge and reasoning abilities to the challenging task of Traffic Matrix (TM) imputation. However, the application poses significant challenges due to the skewed TM distribution and the deficient traffic feature under low sampling rate. To address these issues, we propose TM-LLM, the first LLM-based model specifically designed for TM imputation. Our approach includes two critical designs: Firstly, we develop an adversarial training strategy to pre-impute TM data, allowing the LLM to understand the distributional features even when faced with extensive missing data. Secondly, we devise a TM-specific embedding scheme along with a crafted prompt template, which enables our approach to harness LLMs' exceptional inferential ability. Experimental results show that TMLLM significantly outperforms state-of-the-art imputation methods, achieves a notable 16.5% -44.8 % improvement in accuracy over the current best baseline, while reduces measurement costs by 80 % - 96 %. It can accurately capture the traffic pattern even when the sampling rate is extremely low. The code for reproducing our experiments is publicly available1. These findings strongly indicate the breakthrough potential of LLMs in network TM analysis tasks.1The experimental codes with our methods and the datasets are available at https://github.com/FILingK/TM-LLM

Files

Network_Traffic_Matrix_Imputat... (pdf)

(pdf | 5.56 Mb)

Taverne

File under embargo until 13-07-2026