2025-10-03 Papers

1/2

Paper 1

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Published: 2025-10-02

Link: http://arxiv.org/pdf/2510.02283

1. 📘 Topic and Domain: Long-form video generation using diffusion models, specifically focused on extending video generation beyond traditional short-duration limits.

2. 💡 Previous Research and New Ideas: Based on prior work in diffusion models and autoregressive video generation; introduces a novel approach called Self-Forcing++ that extends beyond the traditional 5-second limit of teacher models.

3. ❓ Problem: The challenge of generating high-quality long videos, as current models suffer from quality degradation, over-exposure, and error accumulation when generating videos beyond 5-10 seconds.

4. 🛠️ Methods: Uses backward noise initialization, extended distribution matching distillation, and rolling KV cache to train a student model on self-generated long rollouts while leveraging guidance from a teacher model.

5. 📊 Results and Evaluation: Achieved generation of high-quality videos up to 4 minutes and 15 seconds long (50x improvement over baseline), while maintaining visual stability and outperforming baseline methods in both fidelity and consistency metrics.

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

1/2

Paper 2

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions

Published: 2025-10-02

Link: http://arxiv.org/pdf/2510.02314

1. 📘 Topic and Domain: A novel data poisoning attack method for 3D Gaussian Splatting (3DGS) in computer vision, specifically targeting neural rendering systems.

2. 💡 Previous Research and New Ideas: Based on prior poisoning attacks on Neural Radiance Fields (NeRF), proposes new density-guided poisoning specifically for 3DGS's explicit representation, which was previously unexplored.

3. ❓ Problem: Addresses the challenge of injecting visible illusory objects into specific target views of 3D Gaussian Splatting while keeping other viewpoints unaffected.

4. 🛠️ Methods: Uses Kernel Density Estimation (KDE) to identify low-density regions for placing poisoned Gaussian points, combined with adaptive noise scheduling to disrupt multi-view consistency during training.

5. 📊 Results and Evaluation: Achieves superior poisoning performance compared to baselines across multiple datasets, with PSNR >25 on poisoned views while maintaining PSNR drop ≤3 on innocent views, demonstrating successful illusion embedding while preserving scene fidelity.

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions

1/2

Paper 3

Interactive Training: Feedback-Driven Neural Network Optimization

Published: 2025-10-02

Link: http://arxiv.org/pdf/2510.02297

1. 📘 Topic and Domain: The paper introduces Interactive Training, a framework for real-time, feedback-driven neural network optimization in machine learning.

2. 💡 Previous Research and New Ideas: Based on traditional static neural network training approaches, it proposes a novel interactive paradigm where humans or AI agents can dynamically intervene during the training process.

3. ❓ Problem: The paper addresses the limitations of static training paradigms that lack flexibility to respond to training issues like instabilities or underperformance without restarting the entire process.

4. 🛠️ Methods: The authors implemented a control server architecture with a React-based frontend dashboard that enables real-time monitoring and intervention through commands to adjust hyperparameters, training data, and model checkpoints.

5. 📊 Results and Evaluation: Through three case studies, they demonstrated superior training stability with human intervention, successful automated LLM-based hyperparameter adjustment, and effective real-time model adaptation using user-generated data.