Print Email Facebook Twitter Reinforcement Learning for Profiled Side-Channel Analysis Title Reinforcement Learning for Profiled Side-Channel Analysis: Applications of Q-Learning in the SCA Domain Author Rijsdijk, Jorai (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Picek, S. (mentor) Lagendijk, R.L. (graduation committee) Oliehoek, F.A. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science | Cyber Security Date 2020-11-16 Abstract Side-channel attacks (SCA), which use unintended leakage to retrieve a secret cryptographic key, have become more sophisticated over time. With the recent successes of machine learning (ML) and especially deep learning (DL) techniques against cryptographic implementations even in the presence of dedicated countermeasures, various methods have been utilized to construct better and less complex neural network architectures. However, this process takes significant manual effort and expertise, where new architectures are constructed by adapting existing architectures or by following some methodology and filling the gaps with experimentation. While automated neural architecture search (NAS) exists and has been applied in the image classification domain, the side-channel analysis domain requires different metrics, as the machine learning metrics can be misleading in this context. In this work, we present a NAS method based on MetaQNN, which utilizes the Q-Learning reinforcement learning (RL) algorithm to generate Convolutional Neural Networks (CNNs). We define two reward functions based on the guessing entropy (GE) metric, where one of these also rewards less complex networks. We use this NAS method to generate CNNs that rival the current state-of-the-art CNNs while reducing the complexity in terms of trainable parameters significantly. We also consider a naive ensemble, which manages to keep the combined complexity below the state of the art while improving the SCA performance. Since the goal of SCA research is to improve security, there should be a balance in research on improving attacks as opposed to research on how to improve defense mechanisms. In line with this balance, we adapt our Q-Learning based reinforcement learning neural architecture search method to generate sets of countermeasures, apply them a posteriori on existing datasets, and evaluate them against existing state-of-the-art CNNs. Since implementing countermeasures is not without its costs, we also define synthetic cost functions to countermeasures based on their parameters, and both restrict the countermeasure budget and reward unused budget. We use this method to generate cost-effective countermeasure sets capable of defeating different state-of-the-art CNNs. Subject Reinforcement Learning (RL)Side-channel analysisQ-LearningNeural Architecture SearchMachine LearningDeep LearningcountermeasuresConvolutional Neural Networks (CNNs) To reference this document use: http://resolver.tudelft.nl/uuid:33694620-a18d-411b-ac7f-4001b6e3a419 Part of collection Student theses Document type master thesis Rights © 2020 Jorai Rijsdijk Files PDF ReinforcementLearningForP ... ledSCA.pdf 11.14 MB Close viewer /islandora/object/uuid:33694620-a18d-411b-ac7f-4001b6e3a419/datastream/OBJ/view