1. 📘 Topic and Domain: The paper investigates self-distillation in large language models (LLMs) for mathematical reasoning tasks, focusing on how post-training methods affect reasoning capability.
2. 💡 Previous Research and New Ideas: Building on prior work showing that self-distillation improves performance in domains such as agentic environments and scientific reasoning, the paper proposes a new hypothesis: the performance degradation seen in math reasoning stems from suppression of epistemic verbalization, i.e., the model's expression of uncertainty during reasoning.
3. ❓ Problem: The paper asks why self-distillation, although effective in other domains, can degrade reasoning performance on mathematical tasks even though the distilled traces guide the model toward correct answers.
4. 🛠️ Methods: The authors run controlled experiments that vary the information richness of the teacher's conditioning context and the task coverage of the training set, analyzing how different conditioning contexts (unguided vs. solution-guided generation) affect epistemic token usage and out-of-distribution (OOD) performance across three models (Qwen3-8B, DeepSeek-Distill-Qwen-7B, and Olmo3-7B-Instruct); a sketch of the two conditioning contexts appears after this list.
5. 📊 Results and Evaluation: Self-distillation with rich conditioning contexts reduces epistemic verbalization and response length, enabling rapid in-domain optimization even with limited task coverage, but causes up to 40% performance degradation on OOD benchmarks (AIME24, AMC23); GRPO (Group Relative Policy Optimization) maintains or improves performance where SDPO degrades it, and the performance drops correlate with reduced uncertainty expression (a simple marker-counting proxy is sketched below the list).
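
To make the conditioning-context manipulation in item 4 concrete, here is a minimal Python sketch of unguided vs. solution-guided teacher generation. All names (`build_teacher_prompt`, `self_distill_traces`, the `generate` callable) and the prompt wording are illustrative assumptions, not the paper's implementation.

```python
from typing import Callable, Optional

def build_teacher_prompt(problem: str, reference_solution: Optional[str] = None) -> str:
    """Build the teacher's conditioning context (hypothetical wording).

    Unguided: the teacher sees only the problem.
    Solution-guided ("rich" conditioning): the teacher also sees a
    reference solution, so its trace can skip exploration.
    """
    if reference_solution is None:
        return f"Solve the following problem step by step.\n\nProblem: {problem}\n"
    return (
        "Solve the following problem step by step. A reference solution is "
        "provided; write a reasoning trace that arrives at it.\n\n"
        f"Problem: {problem}\n\nReference solution: {reference_solution}\n"
    )

def self_distill_traces(problem: str, reference_solution: str,
                        generate: Callable[[str], str]) -> dict:
    """Produce one teacher trace per conditioning context for the same
    problem, mirroring the paper's controlled comparison."""
    return {
        "unguided": generate(build_teacher_prompt(problem)),
        "solution_guided": generate(build_teacher_prompt(problem, reference_solution)),
    }
```

The student would then be fine-tuned on the resulting traces; per the summary's findings, the solution-guided traces tend to be shorter and to drop hedging language.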
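
Item 5's correlation between performance drops and reduced uncertainty expression presupposes some way of quantifying epistemic verbalization. Below is a crude, self-contained proxy; the marker lexicon and the substring-counting heuristic are assumptions of this sketch, since the summary does not specify how epistemic tokens are identified.

```python
import re

# Hypothetical lexicon of epistemic markers; the paper's actual
# criterion for epistemic tokens is not given in this summary.
EPISTEMIC_MARKERS = (
    "wait", "maybe", "perhaps", "i think", "not sure",
    "let me check", "hmm", "alternatively",
)

def epistemic_rate(trace: str) -> float:
    """Rough rate of epistemic markers per word in a reasoning trace.

    Uses naive substring counting, so it can overcount (e.g. "await"
    contains "wait"); adequate only as a coarse proxy.
    """
    text = trace.lower()
    words = re.findall(r"[a-z']+", text)
    if not words:
        return 0.0
    hits = sum(text.count(marker) for marker in EPISTEMIC_MARKERS)
    return hits / len(words)

# A hedged trace scores higher than a terse, confident one.
hedged = "Hmm, maybe factor first. Wait, let me check the discriminant."
terse = "Factor the quadratic. The roots are 2 and 3."
assert epistemic_rate(hedged) > epistemic_rate(terse)
```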