Surface-Aware Distilled 3D Semantic Features

None, None; None, None; None, None

Surface-Aware Distilled 3D Semantic Features

Conference Paper (2025)

Author(s)

Lukas Uzolas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Elmar Eisemann (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Petr Kellnhofer (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Computer Graphics and Visualisation

Contrastive Learning Semantic Features Shape Correspondences Motion Transfer Reposing

DOI related publication

https://doi.org/10.1145/3757377.3763974 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:2484f464-21f6-4d08-82f6-e6e24ab7d1de

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Computer Graphics and Visualisation

Article number

3

Pages (from-to)

1-12

Publisher

ACM

ISBN (print)

979-8-4007-2137-3

ISBN (electronic)

9798400721373

Event

SIGGRAPH Asia 2025 Conference (2025-12-15 - 2025-12-18), Hong Kong, Hong Kong

Downloads counter

60

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Many 3D tasks such as pose alignment, animation, motion transfer, and 3D reconstruction rely on establishing correspondences between 3D shapes. This challenge has recently been approached by pairwise matching of semantic features from pre-trained vision models. However, despite their power, these features struggle to differentiate instances of the same semantic class such as "left hand"versus "right hand"which leads to substantial mapping errors. To solve this, we learn a surface-aware embedding space that is robust to these ambiguities while facilitating shared mapping for an entire family of 3D shapes. Importantly, our approach is self-supervised and requires only a small number of unpaired training meshes to infer features for new possibly imperfect 3D shapes at test time. We achieve this by introducing a contrastive loss that preserves the semantic content of the features distilled from foundational models while disambiguating features located far apart on the shape's surface. We observe superior performance in correspondence matching benchmarks and enable downstream applications including 2D-to-3D and 3D-to-3D texture transfer, in-part segmentation, pose alignment, and motion transfer in low-data regimes. Unlike previous pairwise approaches, our solution constructs a joint embedding space, where both seen and unseen 3D shapes are implicitly aligned without further optimization. The code is available at https://graphics.tudelft.nl/SurfaceAware3DFeatures.

Files

3757377.3763974.pdf

(pdf | 52.8 Mb)