Network Traffic Matrix Imputation via Large Language Models

Conference Paper (2025)
Author(s)

Kaiwen Jiang (Hefei University of Technology)

Fenglin Yan (Hefei University of Technology)

Yan Qiao (Hefei University of Technology)

Meng Li (Hefei University of Technology)

Yuxuan Li (The Hong Kong Polytechnic University)

Mauro Conti (TU Delft - Electrical Engineering, Mathematics and Computer Science, UniversitĂ  degli Studi di Padova)

Research Group
Cyber Security
DOI related publication
https://doi.org/10.1109/ISCC65549.2025.11326227 Final published version
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Cyber Security
Publisher
IEEE
ISBN (electronic)
9798331524203
Event
30th IEEE Symposium on Computers and Communications, ISCC 2025 (2025-07-02 - 2025-07-05), Bologna, Italy
Downloads counter
2
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Large Language Models (LLMs) have demonstrated remarkable zero-shot capabilities across various domains. This paper pioneers the application of LLMs' outstanding knowledge and reasoning abilities to the challenging task of Traffic Matrix (TM) imputation. However, the application poses significant challenges due to the skewed TM distribution and the deficient traffic feature under low sampling rate. To address these issues, we propose TM-LLM, the first LLM-based model specifically designed for TM imputation. Our approach includes two critical designs: Firstly, we develop an adversarial training strategy to pre-impute TM data, allowing the LLM to understand the distributional features even when faced with extensive missing data. Secondly, we devise a TM-specific embedding scheme along with a crafted prompt template, which enables our approach to harness LLMs' exceptional inferential ability. Experimental results show that TMLLM significantly outperforms state-of-the-art imputation methods, achieves a notable 16.5% -44.8 % improvement in accuracy over the current best baseline, while reduces measurement costs by 80 % - 96 %. It can accurately capture the traffic pattern even when the sampling rate is extremely low. The code for reproducing our experiments is publicly available1. These findings strongly indicate the breakthrough potential of LLMs in network TM analysis tasks.1The experimental codes with our methods and the datasets are available at https://github.com/FILingK/TM-LLM

Files

Taverne
warning

File under embargo until 13-07-2026