Reinforcement Learning for Profiled Side-Channel Analysis

Rijsdijk, J.

Reinforcement Learning for Profiled Side-Channel Analysis

Applications of Q-Learning in the SCA Domain

Master thesis (2020)

Authors

J. Rijsdijk Electrical Engineering, Mathematics and Computer Science

Contributors

S. Picek Cyber Security (mentor)

Reginald L. Lagendijk Cyber Security (graduation committee member)

Frans A Oliehoek Interactive Intelligence (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Machine Learning Reinforcement Learning (RL) Deep Learning Convolutional Neural Networks (CNNs) Side-channel analysis Q-Learning Neural Architecture Search Countermeasures

To reference this document use:

http://resolver.tudelft.nl/uuid:33694620-a18d-411b-ac7f-4001b6e3a419

More Info

expand_more

Published Date

16-11-2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Side-channel attacks (SCA), which use unintended leakage to retrieve a secret cryptographic key, have become more sophisticated over time. With the recent successes of machine learning (ML) and especially deep learning (DL) techniques against cryptographic implementations even in the presence of dedicated countermeasures, various methods have been utilized to construct better and less complex neural network architectures. However, this process takes significant manual effort and expertise, where new architectures are constructed by adapting existing architectures or by following some methodology and filling the gaps with experimentation. While automated neural architecture search (NAS) exists and has been applied in the image classification domain, the side-channel analysis domain requires different metrics, as the machine learning metrics can be misleading in this context. In this work, we present a NAS method based on MetaQNN, which utilizes the Q-Learning reinforcement learning (RL) algorithm to generate Convolutional Neural Networks (CNNs). We define two reward functions based on the guessing entropy (GE) metric, where one of these also rewards less complex networks. We use this NAS method to generate CNNs that rival the current state-of-the-art CNNs while reducing the complexity in terms of trainable parameters significantly. We also consider a naive ensemble, which manages to keep the combined complexity below the state of the art while improving the SCA performance. Since the goal of SCA research is to improve security, there should be a balance in research on improving attacks as opposed to research on how to improve defense mechanisms. In line with this balance, we adapt our Q-Learning based reinforcement learning neural architecture search method to generate sets of countermeasures, apply them a posteriori on existing datasets, and evaluate them against existing state-of-the-art CNNs. Since implementing countermeasures is not without its costs, we also define synthetic cost functions to countermeasures based on their parameters, and both restrict the countermeasure budget and reward unused budget. We use this method to generate cost-effective countermeasure sets capable of defeating different state-of-the-art CNNs.

Files

ReinforcementLearningForProfil... (pdf)

(pdf | 11.1 Mb)

License info not available