Generalizable Direct Protein Sequencing With InstaNexus
Marco Reverenna (Technical University of Denmark (DTU))
Maike Wennekers Nielsen (Technical University of Denmark (DTU))
Darian Stephan Wolff (Technical University of Denmark (DTU), Novonesis)
Jemma Daniel (InstaDeep Ltd)
Elpida Lytra (Technical University of Denmark (DTU))
Suthimon Thumtecho (Technical University of Denmark (DTU))
Pasquale D. Colaianni (Technical University of Denmark (DTU))
Alberto Santos (Technical University of Denmark (DTU))
Konstantinos Kalogeropoulos (Technical University of Denmark (DTU), Kavli institute of nanoscience Delft, TU Delft - Applied Sciences)
More Authors (External organisation)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Accurate determination of protein sequences is central to biology. Protein-based therapeutics, such as antibodies and nanobodies, are not encoded in reference genomes, challenging their accurate characterization via standard proteomics. Current methods rely on indirect inference, fragmented outputs, and labor-intensive workflows, which hinder functional insights and routine application. Here, we present a generalizable, end-to-end workflow for direct protein sequencing, combining streamlined sample preparation, artificial intelligence (AI)-driven de novo peptide sequencing, and tailored assembly to reconstruct contiguous protein sequences. A novel composite scoring framework prioritizes longer assemblies and coverage, enhancing accuracy and reducing ambiguity. Validation across diverse protein modalities demonstrates its utility and ability to robustly sequence functionally critical regions of selected proteins. This workflow represents an advance in precision proteomics with promising applications in therapeutic discovery, immune profiling, and protein science.