Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

CVPR 2025

Rui Chen^1,2, Jianfeng Zhang^2*, Yixun Liang^1,3, Guan Luo^2,4, Weiyu Li^1,3, Jiarui Liu^1,3, Xiu Li^2, Xiaoxiao Long^1,3, Jiashi Feng^2, Ping Tan^1,3*

^*Corresponding authors

¹The Hong Kong University of Science and Technology

²Bytedance Seed

³LightIllusions

⁴Tsinghua University


Abstract

Recent 3D content generation pipelines commonly employ Variational Autoencoders (VAEs) to encode shapes into compact latent representations for diffusion-based generation. However, the uniform point sampling strategy widely adopted in shape VAE training often loses a significant amount of geometric detail, limiting the quality of shape reconstruction and of downstream generation tasks. We present Dora-VAE, a novel approach that enhances VAE reconstruction through our proposed sharp edge sampling strategy and a dual cross-attention mechanism. By identifying and prioritizing regions with high geometric complexity during training, our method significantly improves the preservation of fine-grained shape features. Together, the sampling strategy and the dual cross-attention mechanism enable the VAE to focus on crucial geometric details that uniform sampling typically misses. To systematically evaluate VAE reconstruction quality, we additionally propose Dora-bench, a benchmark that quantifies shape complexity through the density of sharp edges and introduces a new metric focused on reconstruction accuracy at these salient geometric features. Extensive experiments on Dora-bench demonstrate that Dora-VAE achieves reconstruction quality comparable to the state-of-the-art dense XCube-VAE while requiring a latent space at least 8x smaller (1,280 vs. > 10,000 codes).
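The abstract does not spell out how sharp edges are identified or how their density is computed, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that an edge is "sharp" when the normals of its two adjacent triangles differ by more than a threshold angle, and that density is the fraction of shared edges that are sharp. The function names (`find_sharp_edges`, `sample_on_edges`), the 30-degree threshold, and the density definition are all hypothetical choices made for illustration.

```python
import math
import random
from collections import defaultdict


def face_normal(vertices, face):
    """Unit normal of a triangle given by three vertex indices."""
    a, b, c = (vertices[i] for i in face)
    u = [b[k] - a[k] for k in range(3)]
    v = [c[k] - a[k] for k in range(3)]
    n = [u[1] * v[2] - u[2] * v[1],
         u[2] * v[0] - u[0] * v[2],
         u[0] * v[1] - u[1] * v[0]]
    length = math.sqrt(sum(x * x for x in n))
    return [x / length for x in n]


def find_sharp_edges(vertices, faces, angle_threshold_deg=30.0):
    """Return (sharp_edges, number_of_shared_edges).

    Assumption: an edge is sharp when the normals of its two adjacent
    faces differ by more than angle_threshold_deg.
    """
    edge_to_faces = defaultdict(list)
    for f_idx, face in enumerate(faces):
        for k in range(3):
            edge = tuple(sorted((face[k], face[(k + 1) % 3])))
            edge_to_faces[edge].append(f_idx)

    normals = [face_normal(vertices, f) for f in faces]
    sharp, shared = [], 0
    for edge, adjacent in edge_to_faces.items():
        if len(adjacent) != 2:
            continue  # skip boundary and non-manifold edges
        shared += 1
        n0, n1 = normals[adjacent[0]], normals[adjacent[1]]
        cos_angle = max(-1.0, min(1.0, sum(p * q for p, q in zip(n0, n1))))
        if math.degrees(math.acos(cos_angle)) > angle_threshold_deg:
            sharp.append(edge)
    return sharp, shared


def sample_on_edges(vertices, edges, num_points, rng):
    """Sample points on the given edges, choosing edges in proportion to length."""
    lengths = [math.dist(vertices[i], vertices[j]) for i, j in edges]
    points = []
    for i, j in rng.choices(edges, weights=lengths, k=num_points):
        t = rng.random()
        points.append([vertices[i][k] + t * (vertices[j][k] - vertices[i][k])
                       for k in range(3)])
    return points


# Toy input: a unit cube, two outward-facing triangles per side.
cube_vertices = [
    (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0),
    (0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (1.0, 1.0, 1.0), (0.0, 1.0, 1.0),
]
cube_faces = [
    (0, 2, 1), (0, 3, 2),  # bottom
    (4, 5, 6), (4, 6, 7),  # top
    (0, 1, 5), (0, 5, 4),  # front
    (3, 7, 6), (3, 6, 2),  # back
    (0, 4, 7), (0, 7, 3),  # left
    (1, 2, 6), (1, 6, 5),  # right
]

sharp_edges, shared_edges = find_sharp_edges(cube_vertices, cube_faces)
sharp_edge_density = len(sharp_edges) / shared_edges  # assumed definition
edge_points = sample_on_edges(cube_vertices, sharp_edges, 64, random.Random(0))
```

On this toy cube the 12 geometric cube edges are flagged as sharp, while the 6 face diagonals introduced by triangulation are not, so every sampled point lies on a true crease. In a training setup along the lines the abstract describes, such edge samples would be combined with ordinary surface samples rather than replace them; how the two are mixed is not stated here.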

Video

Reconstruction results of Dora-VAE on Dora-bench (point clouds are interactive)

Image to 3D

Character control

Method

Dora-VAE

Dora-bench

BibTeX
