SemMorph3D
Unsupervised Semantic-Aware 3D Morphing
via Mesh-Guided Gaussians

Mengtian Li1,4, Yunshu Bai1, Yimin Chu1, Xinru Guo1, Haolin Liu3, Zhifeng Xie1,4, Chaofeng Chen2,†
1Shanghai University· 2School of Artificial Intelligence, Wuhan University· 3Tencent· 4Shanghai Engineering Research Center of Motion Picture Special Effects
† Corresponding author
source morph target

Method overview and representative 3D morphing results.

Overview

What if 3D objects could morph without losing topology?

Photorealistic Gaussians fracture during deformation. Rigid meshes demand clean inputs. SemMorph3D bridges both worlds.

SemMorph3D teaser

Abstract

We introduce SemMorph3D, a novel framework for semantic-aware 3D shape and texture morphing directly from multi-view images. While 3D Gaussian Splatting (3DGS) enables photorealistic rendering, its unstructured nature often leads to catastrophic geometric fragmentation during morphing. Conversely, traditional mesh-based morphing enforces structural integrity but mandates pristine input topology and struggles with complex appearances. Our method resolves this dichotomy by employing a mesh-guided strategy where a coarse, extracted base mesh acts as a flexible geometric anchor, providing the topological scaffolding to guide unstructured Gaussians. We further propose a dual-domain optimization that establishes unsupervised semantic correspondence, synergizing geodesic regularization for shape preservation with texture-aware constraints for coherent color evolution. On the proposed TexMorph benchmark, SemMorph3D substantially outperforms prior 2D and 3D methods, reducing color consistency error (ΔE) by 22.2% and EI by 26.2%.

−22.2%
Color error (ΔE)
−26.2%
Edge incoherence (EI)
10+
Object categories
0
Labels required

Mesh-Guided Gaussians

A coarse extracted mesh acts as a deformable anchor, compensating for topological artifacts in 3DGS.

Dual-Domain Optimization

Geodesic shape regularization meets texture-aware color flow — jointly solved in a hybrid representation.

Fully Unsupervised

No labels, no category templates, no specialized 3D assets. Correspondence emerges from the data.

Method

Pipeline Overview

From multi-view images to textured 3D morphs through a unified mesh + Gaussian representation.

SemMorph3D pipeline

SemMorph3D performs high-quality 3D morphing between a source X and a target Y given only their multi-view image observations. We first extract surface meshes from 3D Gaussian Splatting to align geometry and texture. Using the learned correspondence matrix ΠXY, intermediate shapes are interpolated and refined through a joint shape-and-texture loss — producing morphs that are geometrically stable, chromatically faithful, and topologically consistent.

Results

Result Gallery

Morphs across synthetic assets, 3D scans, and in-the-wild mobile phone captures.

Experiments

Qualitative Comparison

Against six state-of-the-art 2D and 3D morphing baselines on the TexMorph benchmark.

Comparison overview
Detailed comparison

We compare against six methods: (1) DiffMorpher and (2) FreeMorph for image morphing; (3) NeuroMorph for untextured 3D shape morphing; (4) MorphFlow for textured multi-view output without true geometry; (5) Interp3D for 3D shape morphing without explicit point-wise correspondence; and (6) Ours, which produces textured 3D morphs with geometric detail directly from image inputs. Our approach uniquely preserves both geometric integrity and appearance coherence across the entire morph trajectory.

Benchmark

The TexMorph Benchmark

A texture-rich, morphing-focused evaluation suite spanning synthetic, scanned, and in-the-wild 3D objects.

TexMorph benchmark

To rigorously evaluate 3D morphing from multi-view images, we propose TexMorph (Texture-rich, Morphing-focused). The benchmark comprises challenging source–target pairs designed to stress both geometric and appearance transformations, including: (1) high-fidelity synthetic models with complex textures rendered from multiple viewpoints; (2) real-world objects captured via 3D scanning; and (3) objects captured in-the-wild using standard mobile phone cameras. The dataset features over ten object categories — animals, fruits, furniture and more — offering diverse topological and textural challenges.

Perceptual Evaluation

User Study

Human preference consistently favors our morphs across color consistency, structure, and edge continuity.

User study

We conducted a user study comparing SemMorph3D against the baselines along three perceptual axes: color consistency, structural similarity, and edge continuity. Across all three, participants overwhelmingly preferred our results, confirming that the gains reported by our quantitative metrics translate to tangible perceptual quality.

Citation

BibTeX

Citation will be available upon publication.