1. 📘 Topic and Domain: The paper presents GaMO, a
geometry-aware multi-view diffusion outpainting method for 3D scene reconstruction from sparse
camera views in computer vision.
2. 💡 Previous Research and New Ideas: Previous work
focused on novel view generation and regularization techniques for sparse-view reconstruction,
while this paper introduces a new outpainting approach that expands existing views rather than
generating new ones.
3. ❓ Problem: The paper addresses the challenge of
reconstructing complete 3D scenes from limited input views, which typically results in holes,
ghosting artifacts, and geometric inconsistencies.
4. 🛠️ Methods: The method uses a three-stage pipeline:
coarse 3D initialization to obtain geometry priors, geometry-aware multi-view outpainting using
a diffusion model with mask latent blending and iterative mask scheduling, and final 3D Gaussian
Splatting refinement.
5. 📊 Results and Evaluation: The approach achieves
state-of-the-art performance on Replica and ScanNet++ datasets across 3, 6, and 9 input views,
with significant improvements in PSNR, SSIM, and LPIPS metrics while being 25x faster than
previous methods.