apple-ml-research

基于多视角采集的大规模高质量3D Gaussian头部重建

Large-Scale High-Quality 3D Gaussian Head Reconstruction from Multi-View Captures

二〇二六年五月九日 · 英文原文

摘要

HeadsUp 是一种 feed-forward 方法，用于在大规模 multi-camera 设置中重建 3D Gaussian 头部。方法采用 encoder-decoder 架构，将输入视图压缩为 latent representation，再解码为锚定到 neutral head template 的 UV-parameterized 3D Gaussians，使 Gaussian 数量与输入图像数量和分辨率解耦。

我们提出了 HeadsUp，一种可扩展的 feed-forward 方法，用于从大规模 multi-camera 设置中重建高质量 3D Gaussian 头部。我们的方法采用高效的 encoder-decoder 架构，将输入视图压缩为紧凑的 latent representation。随后，该 latent representation 被解码为一组 UV-parameterized 3D Gaussians，并锚定到 neutral head template 上。这种 UV representation 将 3D Gaussians 的数量与输入图像的数量和分辨率解耦，使其能够使用大量高分辨率输入视图进行训练。我们在一个…上训练并评估了模型。

译自 apple-ml-research · 录于二〇二六年五月九日