Curriculum Learning for Qubit Mapping Across Hardware Topologies

Bachelor Thesis (2026)

Author(s)

A. Govenko (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

S. Feld – Mentor (TU Delft - QCD/Feld Group)

A. Kundu – Mentor (TU Delft - QCD/Feld Group)

M.T.J. Spaan – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

A. Lukina – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use

https://resolver.tudelft.nl/uuid:629e8ab9-259d-4b1f-b84f-b586acfafe21

Publication Year

2026

Language

English

Graduation Date

26-06-2026

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Compiling quantum circuits for physical hardware requires an initial mapping step that assigns virtual qubits to physical qubits so that interacting pairs are placed on connected hardware locations. Current approaches train a separate agent per device topology, which requires significant compute for each new hardware generation and transfers no knowledge across devices. This work investigates whether curriculum learning (progressively training a reinforcement learning agent on hardware topologies of increasing size) can produce a single agent that generalises to unseen topologies. We evaluate three curriculum variants that differ in replay ratio and warmup length, alongside three non-curriculum baselines, in the QGym InitialMapping environment using MaskablePPO. Results show that curriculum agents outperform single-topology and single-size training on held-out topologies, reaching strong frontier performance with greater sample efficiency than direct training. Compared with unordered exposure to the same topology distribution, however, the advantage of curriculum ordering holds on the target topology size but does not extend to generalisation to unseen topologies. Absolute performance remains modest and variance across seeds is substantial. Even so, the findings establish curriculum learning as a viable approach to topology-general qubit mapping and provide a proof of concept for training a single model that transfers across hardware topologies, which would reduce the computational cost of re-training for each new device.
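
To make the two central ideas of the abstract concrete, the following is a minimal Python sketch, not the thesis implementation. It does not use QGym or MaskablePPO. The function names `satisfied_fraction` and `pick_topology_size` are hypothetical, and the replay scheme shown (revisiting an earlier, smaller topology size with probability `replay_ratio`) is only one plausible reading of "replay ratio", which the abstract does not define. Warmup length is not modelled here.

```python
import random
from typing import Dict, Iterable, List, Tuple

Edge = Tuple[int, int]


def satisfied_fraction(
    interactions: Iterable[Edge],
    coupling: Iterable[Edge],
    mapping: Dict[int, int],
) -> float:
    """Fraction of interacting virtual pairs mapped onto connected physical qubits."""
    connected = {frozenset(edge) for edge in coupling}  # undirected hardware links
    pairs = list(interactions)
    if not pairs:
        return 1.0  # no interactions, so nothing can be violated
    hits = sum(frozenset((mapping[a], mapping[b])) in connected for a, b in pairs)
    return hits / len(pairs)


def pick_topology_size(
    sizes: List[int], stage: int, replay_ratio: float, rng: random.Random
) -> int:
    """Pick a topology size for one episode at curriculum stage `stage`.

    Assumed scheme (hypothetical): with probability `replay_ratio` revisit an
    earlier, smaller size; otherwise train on the current stage's size.
    """
    if stage > 0 and rng.random() < replay_ratio:
        return rng.choice(sizes[:stage])
    return sizes[stage]


# Example: a 3-qubit line 0-1-2, and a circuit in which virtual qubit 0
# interacts with virtual qubits 1 and 2.
line = [(0, 1), (1, 2)]
circuit_pairs = [(0, 1), (0, 2)]
identity_score = satisfied_fraction(circuit_pairs, line, {0: 0, 1: 1, 2: 2})
centred_score = satisfied_fraction(circuit_pairs, line, {0: 1, 1: 0, 2: 2})
```

In this toy case the identity mapping satisfies only one of the two interactions, whereas placing virtual qubit 0 on the middle physical qubit satisfies both. A reward built on such a score is one way an initial-mapping agent could be evaluated, although the reward actually used in the thesis may differ.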

Files

Curriculum_Learning_for_Qubit_... (pdf)

(pdf | 1.11 Mb)

License info not available