No (good) loss no gain

None, None; None, None; None, None; None, None

No (good) loss no gain

Systematic evaluation of loss functions in deep learning-based side-channel analysis

Journal Article (2023)

Author(s)

Maikel Kerkhof (Student TU Delft)

Lichao Wu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Guilherme Perin (Radboud Universiteit Nijmegen)

Stjepan Picek (Radboud Universiteit Nijmegen)

Research Group

Cyber Security

Deep Learning Side-channel analysis Evaluation Loss function

DOI related publication

https://doi.org/10.1007/s13389-023-00320-6 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:7fb72f35-bcf8-4a90-974a-45fe2102dfa4

More Info

expand_more

Publication Year

2023

Language

English

Research Group

Cyber Security

Issue number

3

Volume number

13

Pages (from-to)

311-324

Downloads counter

403

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep learning is a powerful direction for profiling side-channel analysis as it can break targets protected with countermeasures even with a relatively small number of attack traces. Still, it is necessary to conduct hyperparameter tuning to reach strong attack performance, which can be far from trivial. Besides many options stemming from the machine learning domain, recent years also brought neural network elements specially designed for side-channel analysis. The loss function, which calculates the error or loss between the actual and desired output, is one of the most important neural network elements. The resulting loss values guide the weights update associated with the connections between the neurons or filters of the deep learning neural network. Unfortunately, despite being a highly relevant hyperparameter, there are no systematic comparisons among different loss functions regarding their effectiveness in side-channel attacks. This work provides a detailed study of the efficiency of different loss functions in the SCA context. We evaluate five loss functions commonly used in machine learning and three loss functions specifically designed for SCA. Our results show that an SCA-specific loss function (called CER) performs very well and outperforms other loss functions in most evaluated settings. Still, categorical cross-entropy represents a good option, especially considering the variety of neural network architectures.

Files

S13389_023_00320_6.pdf

(pdf | 2.59 Mb)