2025-07-08 Papers

Paper 1

4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture

Published: 2025-07-07

Link: http://arxiv.org/pdf/2507.05163

1. 📘 Topic and Domain: 4D reconstruction of high-speed dynamic scenes using asynchronous multi-camera capture and video diffusion models in computer vision.
2. 💡 Previous Research and New Ideas: Building on prior 4D Gaussian Splatting work whose capture is limited to about 30 FPS, the paper introduces a novel asynchronous capture scheme plus video-diffusion-based refinement.
3. ❓ Problem: Current 4D capture systems are limited to low frame rates (<30 FPS), making it difficult to reconstruct fast-moving scenes with high fidelity.
4. 🛠️ Methods: Combines asynchronous camera capture (staggering camera start times) with a video diffusion model for artifact removal, implemented through 4D Gaussian Splatting and LoRA-based fine-tuning.
5. 📊 Results and Evaluation: Achieves superior reconstruction quality compared to synchronous methods on both synthetic and real datasets, with significant improvements in PSNR, SSIM, and LPIPS metrics.
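
The staggered-start idea can be sketched in a few lines of Python. This is an illustrative reconstruction, not the authors' code; it assumes K camera groups sharing a per-camera frame period τ, with group i offset by i·τ/K:

```python
# Illustrative sketch (not the authors' code) of 4DSloMo-style asynchronous
# capture: K camera groups share a base frame period tau, and group i starts
# with an offset of i * tau / K, so the merged timestamps sample the scene
# at K times the per-camera frame rate: t_ij = i*(tau/K) + j*tau.

def capture_timestamps(num_groups: int, base_fps: float, num_frames: int):
    """Per-group capture times; merging all groups gives the effective rate."""
    tau = 1.0 / base_fps  # frame period of a single camera
    return [
        [i * (tau / num_groups) + j * tau for j in range(num_frames)]
        for i in range(num_groups)
    ]

# Four groups of 25 FPS cameras yield 100 FPS effective sampling:
groups = capture_timestamps(num_groups=4, base_fps=25.0, num_frames=3)
merged = sorted(t for group in groups for t in group)
# Consecutive merged timestamps are tau/K = 0.01 s apart.
```

With K = 8 groups of 25 FPS cameras the same formula gives a 200 FPS effective rate, consistent with the 100-200 FPS range reported above.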

4DSloMo: Method Workflow

1. Asynchronous capture: stagger camera start times across K camera groups, turning 25 FPS cameras into a 100-200 FPS effective capture rate; group i records frame j at ti = i·(τ/K) + j·τ.
2. Initial 4D Gaussian: GS4D reconstruction trained for 7k iterations; each Gaussian has density p(x|μ,Σ) = exp(-½(x-μ)ᵀΣ⁻¹(x-μ)); sparse views leave artifacts.
3. Video rendering: render high-frame-rate videos Vrender ∈ R^(C×T×H×W) from all camera viewpoints; these still contain floater artifacts.
4. Data curation: temporal sub-sampling of the DNA-Rendering dataset simulates asynchronous capture, yielding 750 noisy-clean training pairs.
5. Video diffusion training: LoRA fine-tuning of the DiT component of the Wan2.1 backbone with Ltune = E[||ε - εθ||²].
6. Artifact-fix model: video enhancement with temporal consistency removes floater artifacts, mapping Vrender → V̂.
7. Refined 4D Gaussian: an additional 7k iterations supervised by the diffusion output via Ldiff = ||Vrender - V̂||₁ + Lp, for enhanced visual quality.
8. High-quality 4D output: temporally consistent, artifact-reduced reconstruction of high-speed motion at a 100-200 FPS equivalent.

Key technical components: asynchronous scheme (K camera groups, effective rate × K); 4D Gaussian Splatting (spatiotemporal modeling, x = (x, y, z, t)); video diffusion (Wan2.1 + LoRA, temporal consistency); artifact removal (sparse-view handling, quality enhancement).

Performance improvements: PSNR 26.76 ↑, SSIM 0.845 ↑, LPIPS 0.293 ↓, frame rate 100-200 FPS.
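
As a toy illustration of the refinement objective Ldiff = ||Vrender - V̂||₁ + Lp, here is a minimal plain-Python sketch; the perceptual term Lp is stubbed out because its exact form is not given here:

```python
# Minimal sketch of the refinement loss L_diff = ||V_render - V_hat||_1 + L_p
# that supervises the second 4D Gaussian training round with the
# diffusion-enhanced video. Plain-Python stand-in using nested lists.

def l1_loss(v_render, v_hat):
    """Mean absolute difference between rendered and refined frames."""
    flat_r = [p for frame in v_render for p in frame]
    flat_h = [p for frame in v_hat for p in frame]
    return sum(abs(a - b) for a, b in zip(flat_r, flat_h)) / len(flat_r)

def perceptual_loss(v_render, v_hat):
    """Placeholder for L_p (e.g. an LPIPS-style feature distance); stubbed."""
    return 0.0  # assumption: the summary does not specify L_p

def refinement_loss(v_render, v_hat):
    return l1_loss(v_render, v_hat) + perceptual_loss(v_render, v_hat)
```
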
Q1
1. What is the main innovation in the camera capture system proposed by this paper?
Using specialized high-speed cameras
Staggering the start times of different cameras
Increasing the number of cameras in the setup
Q2
2. Why does the paper use a video diffusion model instead of an image diffusion model for artifact removal?
Video diffusion models are faster to train
Video diffusion models require less data
Video diffusion models maintain better temporal consistency
Q3
3. What effective frame rate can be achieved using the paper's asynchronous capture method with 8 cameras at 25 FPS?
50 FPS
100-200 FPS
25 FPS

Paper 2

Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration

Published: 2025-07-07

Link: http://arxiv.org/pdf/2507.05108

1. 📘 Topic and Domain: The paper focuses on historical document restoration using AI, specifically in the domain of computer vision and digital heritage preservation.
2. 💡 Previous Research and New Ideas: Based on previous work in single-modal restoration and limited-size patch restoration, this paper proposes a novel automated three-stage restoration approach that mimics historians' workflow and introduces a comprehensive full-page historical document dataset.
3. ❓ Problem: The paper addresses the limitations of existing historical document restoration methods, which handle only a single modality or limited-size patches and thus fail to provide a fully automated, comprehensive restoration solution.
4. 🛠️ Methods: The authors developed AutoHDR, a three-stage approach combining OCR-assisted damage localization, vision-language context text prediction, and patch autoregressive appearance restoration, along with creating the FPHDR dataset containing both real and synthetic damaged documents.
5. 📊 Results and Evaluation: The method improved OCR accuracy from 46.83% to 84.05% for severely damaged documents, with further enhancement to 94.25% through human-machine collaboration, demonstrating superior performance in both text restoration accuracy and historical appearance preservation.

AutoHDR: Historical Document Restoration Workflow

1. Stage 1, OCR-assisted damage localization: fuse character OCR with DINO damage detection to obtain character and damage positions.
2. Stage 2, damaged content prediction: a fine-tuned Qwen2 LLM uses vision-language context and composite scoring to predict the missing text content.
3. Stage 3, appearance restoration: a patch-autoregressive diffusion model performs page-level restoration to produce the restored document image.

FPHDR dataset: 1,633 real samples with manual annotation and 6,543 synthetic samples with automated generation; damage is graded as light, medium, or severe, with character-level bounding-box annotation; the modular design supports human intervention for quality enhancement.

Performance results: OCR accuracy 46.83% → 84.05% on severely damaged documents, rising to 94.25% with human-AI collaboration; damage detection F1 94.1% (DINO-based); top-5 text-prediction accuracy 97.75% (Qwen2-7B).

Key innovations: full-page processing with context preservation (vs. patch-level methods); a fully automated, end-to-end three-stage pipeline with no manual intervention; multimodal text-plus-appearance restoration via vision-language fusion and OCR + LLM synergy; a historian-inspired workflow following the "old as old/new" cultural principle; and a collaborative modular design for expert integration and quality assurance.
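
The modular, collaboration-friendly design can be sketched as a pipeline whose stage functions are injected, so any stage's output can be replaced by a human expert's correction. All interfaces below are hypothetical; the actual stages use OCR + DINO, a fine-tuned Qwen2 LLM, and a patch-autoregressive diffusion model:

```python
# Hypothetical skeleton of AutoHDR's three-stage pipeline (interfaces are
# mine, not the paper's). Stage functions are injected, mirroring the
# modular design that lets a human expert override any intermediate result.

def auto_hdr(page_image, localize, predict, restore):
    regions = localize(page_image)               # Stage 1: damage localization
    text = predict(page_image, regions)          # Stage 2: content prediction
    return restore(page_image, regions, text)    # Stage 3: appearance restoration

# Dummy stand-ins for the real models (OCR + DINO, Qwen2, diffusion):
def demo_localize(img):
    return [{"bbox": (0, 0, 16, 16), "grade": "severe"}]

def demo_predict(img, regions):
    return {i: "?" for i, _ in enumerate(regions)}  # one character per region

def demo_restore(img, regions, text):
    return ("restored", img, len(regions))

result = auto_hdr("page.png", demo_localize, demo_predict, demo_restore)
```

Swapping a stage like `demo_predict` for an expert's transcription is the human-machine collaboration mode that the paper reports lifting OCR accuracy from 84.05% to 94.25%.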
Q1
1. What is the main innovation of AutoHDR compared to previous historical document restoration methods?
It uses higher resolution cameras to capture documents
It provides a fully automated three-stage restoration process mimicking historians' workflow
It only focuses on text restoration without considering appearance
Q2
2. How does the FPHDR dataset categorize document damage levels?
High, Medium, Low damage
Complete, Partial, Minor damage
Severe, Medium, Light damage
Q3
3. What was the improvement in OCR accuracy for severely damaged documents when using AutoHDR with human collaboration?
From 46.83% to 84.05%
From 46.83% to 94.25%
From 84.05% to 94.25%

Paper 3

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Published: 2025-07-07

Link: http://arxiv.org/pdf/2507.04952

1. 📘 Topic and Domain: The paper introduces ArtifactsBench, a benchmark framework for evaluating Large Language Models' ability to generate interactive visual code artifacts in software development.
2. 💡 Previous Research and New Ideas: Based on existing code generation benchmarks that focus mainly on static code evaluation, this paper proposes a novel framework that evaluates both visual fidelity and interactive behavior of generated code.
3. ❓ Problem: The paper addresses the critical gap in evaluating LLMs' ability to generate dynamic, interactive visual artifacts, as current benchmarks cannot assess visual quality and interactive functionality comprehensively.
4. 🛠️ Methods: The authors developed a multi-stage evaluation pipeline using Multimodal LLMs as judges, programmatically rendering generated artifacts and capturing their dynamic behavior through temporal screenshots against fine-grained checklists.
5. 📊 Results and Evaluation: The automated evaluation achieved 94.4% ranking consistency with WebDev Arena (human preference benchmark) and over 90% agreement with human experts, while revealing that generalist models often outperform domain-specific ones in visual code generation tasks.
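
The first pipeline step, pulling a code artifact out of a raw model response, might look like the following; the exact regex is my assumption, since the paper only states that extraction is regex-based:

```python
import re

# Sketch of the "code extraction (regex)" step: grab the first fenced code
# block from a raw LLM response before handing it to the renderer. The
# pattern is an assumption, not the paper's actual implementation.

FENCE = "`" * 3  # a Markdown triple-backtick fence
FENCE_RE = re.compile(FENCE + r"[a-zA-Z]*\s*\n(.*?)" + FENCE, re.DOTALL)

def extract_artifact(response: str):
    """Return the first fenced block's body, or None if no fence is found."""
    match = FENCE_RE.search(response)
    return match.group(1).strip() if match else None

reply = "Here is the page:\n" + FENCE + "html\n<html><body>Hi</body></html>\n" + FENCE
artifact = extract_artifact(reply)  # -> "<html><body>Hi</body></html>"
```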

ArtifactsBench Methodology

Dataset construction pipeline: (1) extraction and filtering from 4 sources; (2) manual and LLM rewrite and polish; (3) classification and difficulty filtering; (4) checklist generation; (5) quality control and final consolidation.

Dataset composition: 1,825 diverse tasks spanning 9 primary categories and 3 difficulty levels, including web apps, games, SVG, data science, simulations, management systems, and multimedia editing.

Multimodal evaluation pipeline: (1) code extraction (regex); (2) dynamic rendering (Playwright); (3) temporal screenshot capture; (4) MLLM-as-judge assessment; (5) fine-grained checklist scoring.

Validation and analysis: human expert validation on 280 sampled instances, using a double-blind protocol with multiple annotators and median scoring, reached >90% pairwise agreement; correlation with WebDev Arena reached 94.4% ranking consistency (normalized Footrule metric), aligning the benchmark with gold-standard human preferences and real-world relevance.

Evaluation dimensions: visual fidelity (layout, aesthetics, color harmony); functionality (core features, robustness); interactivity (dynamic effects, user experience); code quality (engineering practices); innovation (creativity, features).

Key findings: 30+ LLMs evaluated; generalist models outperform specialist ones; performance scales with model size; proprietary models lead; complex tasks remain challenging; first automated visual-interactive evaluation framework.
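
As a rough illustration of the ranking-consistency measure, here is a normalized Spearman footrule between two model rankings; this is one plausible reading of the "Normalized Footrule metric," and the exact normalization used in the paper is an assumption:

```python
# Illustrative normalized Spearman-footrule consistency between two model
# rankings; one plausible reading of the paper's "Normalized Footrule metric"
# (the exact normalization it uses is an assumption on my part).

def footrule_consistency(rank_a, rank_b):
    """1.0 for identical rankings, 0.0 for exactly reversed rankings."""
    n = len(rank_a)
    pos_b = {model: i for i, model in enumerate(rank_b)}
    dist = sum(abs(i - pos_b[model]) for i, model in enumerate(rank_a))
    max_dist = (n * n) // 2  # footrule distance of a fully reversed ranking
    return 1.0 - dist / max_dist

leaderboard = ["model_a", "model_b", "model_c", "model_d"]  # placeholder names
same = footrule_consistency(leaderboard, leaderboard)           # 1.0
flipped = footrule_consistency(leaderboard, leaderboard[::-1])  # 0.0
```

A value like the reported 94.4% would then mean the benchmark's leaderboard is nearly identical, position by position, to the WebDev Arena human-preference ranking.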
Q1
1. What was the most surprising finding revealed by ArtifactsBench regarding model performance?
Domain-specific models performed best at visual code generation
Generalist models outperformed specialized models
All models performed equally well on visual tasks
Q2
2. How did ArtifactsBench evaluate the dynamic behavior of generated code?
By manually testing each interaction
Through static code analysis only
By capturing temporal screenshots during programmatic rendering
Q3
3. What level of agreement did ArtifactsBench achieve with human preference benchmarks?
74.4% ranking consistency
84.4% ranking consistency
94.4% ranking consistency