Learning State Machines from data streams and an application in network-based threat detection

Master thesis (2018)

Authors

Hans Schouten (Electrical Engineering, Mathematics and Computer Science)

Contributors

Sicco Verwer (supervisor 1)

Matthijs Spaan (supervisor 2)

Nathalie Lokhorst (supervisor 2)

Faculty

Electrical Engineering, Mathematics and Computer Science

State machines, Blue-fringe, Network threat detection

To reference this document use:

http://resolver.tudelft.nl/uuid:4aef512b-5c86-4ae0-b956-e3e9fa6aa966

Published Date

06-12-2018

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Our increasingly interconnected society poses large risks in terms of cyber security. With network traffic volumes increasing and systems becoming more connected, maintaining visibility on IT networks is a challenging yet important task. In recent years the number of cyber threats has increased dramatically. Monitoring and threat detection are more essential than ever to stay in control in a growing threat landscape. The powerful properties of state machines and the similarities between network traffic and the traces used to learn state machines make state machine learning a promising approach to this problem. Current learning methods, however, maintain an intermediate data structure that is converted into a state machine only after all data has been processed. The continuous nature of network traffic makes this conventional approach inapplicable. This study provides a solution by developing a method for learning state machines on real-time data streams. The proposed algorithm, framework and implementation are generic and can be applied to any use case that benefits from learning state machines on data streams. This thesis explores one specific use case: the use of state machine fingerprints in network-based threat detection. A system is designed that is capable of learning state machines on real-time traffic channels. The proposed detection method is demonstrated to be highly effective in matching traffic from various malware types to pre-learned fingerprints. The work in this thesis forms a stepping stone to the development of a robust detection method, capable of detecting a variety of threats on network data with low false alarm rates.

Files

Master_thesis_hansschouten.pdf

(.pdf | 6.76 Mb)