1. 📘 Topic and Domain: Feed-forward 3D Gaussian Splatting for novel view synthesis using voxel-aligned prediction instead of traditional pixel-aligned approaches.
2. 💡 Previous Research and New Ideas: Based on previous pixel-aligned Gaussian Splatting methods, proposes a new voxel-aligned paradigm that predicts Gaussians from a 3D voxel grid rather than 2D pixels.
3. ❓ Problem: Addresses limitations of pixel-aligned methods including view-dependent density distributions, heavy reliance on input view numbers, and alignment errors in occluded or low-texture regions.
4. 🛠️ Methods: Uses a multi-view transformer for feature extraction, constructs 3D voxel features through unprojection, refines them with a sparse 3D U-Net, and predicts Gaussian parameters directly from the voxel grid.
5. 📊 Results and Evaluation: Achieves state-of-the-art performance on RealEstate10K and ScanNet datasets with higher PSNR/SSIM scores while using fewer Gaussians, demonstrating better geometric consistency and efficiency than pixel-aligned methods.