2025-07-03 Papers

Paper 1

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

Published: 2025-07-02

Link: http://arxiv.org/pdf/2507.01945

1. 📘 Topic and Domain: Long animation colorization using diffusion models in computer vision and animation generation.
2. 💡 Previous Research and New Ideas: Based on existing short-term animation colorization methods that use local paradigms for feature fusion, proposes a novel dynamic global-local paradigm to maintain long-term color consistency.
3. ❓ Problem: Maintaining color consistency in long animation sequences (300-1000 frames), which current methods fail to achieve because they focus on local features and short-term generation.
4. 🛠️ Methods: Introduces LongAnimation framework with three key components: SketchDiT for reference feature extraction, Dynamic Global-Local Memory for historical feature compression and fusion, and Color Consistency Reward for refining color consistency.
5. 📊 Results and Evaluation: Achieves significant improvements over previous methods, with a 35.1% improvement in short-term (14 frames) and a 49.1% improvement in long-term (500 frames) animation colorization, as measured by FVD.

[Workflow figure] Input data (reference image I, sketches {Sf}, text description) is encoded by SketchDiT via a 3D VAE encoder Ev(·) and a text encoder Et(·), then concatenated into hybrid features S(ct, ci, ck); the first segment goes directly to CogVideoX. The Dynamic Global-Local Memory (DGLM) maintains a global memory (historical features dynamically compressed by the Video-XL LVU model into a KV cache {kg, vg}, for long-term consistency) and a local memory (a KV cache {kl, vl} from recent segments, for smooth transitions and short-term features). Cross-attention with Q = Wq · S(ct, ci, ck) and K, V = [kg, kl], [vg, vl] adaptively fuses both memories into the CogVideoX DiT (with skip-layer control) for video generation. The Color Consistency Reward (CCR) aligns KV caches via a non-gradient reward during training (SketchDiT → DGLM → CCR); at inference, Color Consistency Fusion (CCF) blends latents in the late denoising stage, producing color-consistent animations of 500+ frames.
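The adaptive fusion step above can be sketched as cross-attention whose keys and values concatenate the global and local memory caches. A minimal NumPy sketch, assuming a single query projection matrix w_q and simple 2-D shapes (the paper's actual DiT integration is more involved):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def global_local_cross_attention(s, w_q, kg, vg, kl, vl):
    """Fuse global (long-term) and local (recent-segment) memory:
    Q comes from the hybrid features S; K and V concatenate the
    global cache {kg, vg} and the local cache {kl, vl}."""
    q = s @ w_q                              # (n, d): Q = Wq * S(ct, ci, ck)
    k = np.concatenate([kg, kl], axis=0)     # (m_g + m_l, d)
    v = np.concatenate([vg, vl], axis=0)
    scores = q @ k.T / np.sqrt(q.shape[-1])  # scaled dot-product attention
    return softmax(scores, axis=-1) @ v      # (n, d) fused features
```

Because the global cache is dynamically compressed, m_g stays bounded even as the animation grows, which is what makes 500+ frame generation tractable.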
Q1
1. What is the main innovation in LongAnimation's approach compared to previous methods?
Using multiple reference images instead of one
Dynamic extraction of global-local color features
Frame-by-frame colorization with AI
Q2
2. Why does LongAnimation perform color consistency fusion only in the late denoising stage?
To save computational resources
To maintain better brightness consistency
To speed up the generation process
Q3
3. What is the typical length of animation sequences that LongAnimation can handle compared to previous methods?
About 100 frames vs 14 frames
About 500 frames vs 100 frames
About 1000 frames vs 500 frames

Paper 2

Depth Anything at Any Condition

Published: 2025-07-02

Link: http://arxiv.org/pdf/2507.01634

1. 📘 Topic and Domain: A foundation monocular depth estimation model called DepthAnything-AC for handling diverse environmental conditions in computer vision and depth estimation.
2. 💡 Previous Research and New Ideas: Based on previous foundation MDE models like the Depth Anything series, which work well in general scenes but struggle with complex conditions; proposes new unsupervised consistency regularization and a spatial distance constraint.
3. ❓ Problem: Existing foundation MDE models perform poorly in complex real-world environments involving challenging lighting, weather conditions, and sensor distortions, while also struggling with boundary delineation and detail preservation.
4. 🛠️ Methods: Uses perturbation-based consistency framework to generate consistent predictions under different corruptions, and spatial distance constraint to enforce geometric relationships between patches; fine-tuned on 540K unlabeled images with various augmentations.
5. 📊 Results and Evaluation: Outperformed state-of-the-art approaches across multiple benchmarks including DA-2K, real-world adverse weather datasets, and synthetic corruption benchmarks, while maintaining performance on general scenes; showed particular improvements in boundary definition and detail preservation.

[Methodology figure] 540K unlabeled images are augmented with perturbations (dark, weather, blur, contrast). A frozen teacher (DepthAnything V2) processes the normal image x^w, while a trainable student (ViT-S + DPT decoder) also sees the perturbed image x^s. A spatial distance constraint SD = √(S²_p + S²_d) captures geometric relations between patches. Training minimizes L = λ₁L_c + λ₂L_kd + λ₃L_s, combining a consistency loss L_c, a knowledge distillation loss L_kd, and a spatial distance loss L_s. Key features: perturbation-based consistency, an unsupervised framework, spatial geometric relationships, semantic boundary enhancement, robustness to complex weather, fine-grained detail recovery, and zero-shot capability. Evaluation benchmarks: DA-2K (multi-condition), NuScenes and RobotCar (real complex), KITTI-C (synthetic corruption), and KITTI, NYU-D, and Sintel (general). Result: zero-shot state-of-the-art on complex conditions while maintaining general-scene performance, trained on only 540K unlabeled images (~1% of the DepthAnything training data).
Q1
1. What is the main innovation in how DepthAnything-AC handles training data compared to previous approaches?
It uses a massive labeled dataset of adverse weather conditions
It uses unsupervised learning with perturbation-based consistency on unlabeled data
It combines multiple existing datasets with manual annotations
Q2
2. Which of these is NOT one of the four typical scenarios considered for image perturbation in the paper?
Lighting conditions
Color temperature
Blurriness
Q3
3. What is the key advantage of using the Spatial Distance Constraint in the model?
It reduces the computational complexity of the model
It helps recover object boundaries and details from corrupted images
It improves the model's speed during inference

Paper 3

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Published: 2025-07-02

Link: http://arxiv.org/pdf/2507.01953

1. 📘 Topic and Domain: Image morphing using diffusion models in computer vision, specifically focusing on generating smooth transitions between two input images.
2. 💡 Previous Research and New Ideas: Based on previous work in image warping, GANs, and diffusion models; proposes a novel tuning-free approach that, unlike existing methods, requires no per-instance training.
3. ❓ Problem: Addressing the challenge of creating high-quality image morphing transitions between images with different semantics or layouts without requiring extensive fine-tuning or training.
4. 🛠️ Methods: Introduces FreeMorph with two key innovations: guidance-aware spherical interpolation for maintaining identity and directional transitions, and step-oriented variation trend for controlled transitions between inputs.
5. 📊 Results and Evaluation: Outperforms existing methods while being 10-50x faster (under 30 seconds per morphing sequence), achieves superior FID, PPL, and LPIPS scores, and receives 60.13% preference in user studies.

[Workflow figure] Input images I_left and I_right are captioned via LLaVA and VAE-encoded into latents z_0-left and z_0-right, which are combined by spherical interpolation. In the forward diffusion process, the first λ₁T steps use the original attention and the next λ₂T steps use prior-driven self-attention, i.e. modified self-attention modules ATT(Q, K, V) with blended features from the input images; the step-oriented variation trend injects high-frequency noise via FFT/IFFT. In the reverse denoising process, λ₃T steps apply the step-oriented variation and λ₄T steps apply spherical feature aggregation, followed by the original attention with text-conditioned features. Built on the DDIM framework (T = 50 steps, CFG = 7.5), FreeMorph generates a sequence of J = 5 intermediate images in under 30 seconds (10x-50x faster than existing methods), handles different semantics/layouts, and requires no fine-tuning.
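The spherical interpolation of the two VAE latents can be sketched as standard slerp. A minimal NumPy sketch of the base interpolation only (the paper's guidance-aware variant adds feature aggregation on top of this):

```python
import numpy as np

def slerp(z0, z1, t, eps=1e-7):
    """Spherical linear interpolation between two latent codes z0, z1
    at fraction t in [0, 1]."""
    z0f, z1f = z0.ravel(), z1.ravel()
    cos = np.dot(z0f, z1f) / (np.linalg.norm(z0f) * np.linalg.norm(z1f) + eps)
    theta = np.arccos(np.clip(cos, -1.0, 1.0))
    if theta < eps:                      # nearly parallel: fall back to lerp
        return (1 - t) * z0 + t * z1
    s = np.sin(theta)
    return (np.sin((1 - t) * theta) / s) * z0 + (np.sin(t * theta) / s) * z1
```

Sampling t = j/6 for j = 1..5 yields the J = 5 intermediate latents noted above; unlike linear interpolation, slerp keeps the interpolated latents at a plausible norm for the diffusion prior.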
Q1
1. What is the main advantage of FreeMorph over previous image morphing methods?
It produces higher quality images
It is tuning-free and requires no per-instance training
It can only work with similar images
Q2
2. Which of the following is NOT one of the two key innovations introduced in FreeMorph?
Guidance-aware spherical interpolation
Step-oriented variation trend
Neural cross-attention mapping
Q3
3. How much faster is FreeMorph compared to existing methods according to the paper?
2-5x faster
10-50x faster
100-200x faster