2025-11-14 Papers


Paper 1

Black-Box On-Policy Distillation of Large Language Models

Published: 2025-11-13

Link: http://arxiv.org/pdf/2511.10643

1. 📘 Topic and Domain: The paper discusses black-box knowledge distillation of Large Language Models (LLMs), focusing on training smaller student models using only text outputs from teacher models without access to their internal parameters.
2. 💡 Previous Research and New Ideas: Building upon previous work in knowledge distillation and generative adversarial networks, the paper introduces a novel Generative Adversarial Distillation (GAD) framework that enables on-policy learning in black-box settings.
3. ❓ Problem: The paper addresses the challenge of effectively distilling knowledge from proprietary LLMs when only their text outputs are available, without access to internal logits or parameters.
4. 🛠️ Methods: The authors implement GAD by framing the student model as a generator and training a discriminator to distinguish between teacher and student responses in a minimax game, using reinforcement learning techniques.
5. 📊 Results and Evaluation: GAD consistently outperforms sequence-level knowledge distillation (SeqKD) across multiple datasets; a GAD-trained Qwen2.5-14B-Instruct student achieves performance comparable to its GPT-5-Chat teacher on the LMSYS-Chat evaluation, validated by both automatic and human evaluations.
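Per the paper's overview, the discriminator is fit with a Bradley-Terry pairwise preference loss over (teacher response, student response) pairs. A minimal plain-Python sketch of that objective (function names and scores are illustrative, not from the paper's code):

```python
import math

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

def bradley_terry_loss(d_teacher: float, d_student: float) -> float:
    """Discriminator objective: prefer teacher responses over student ones.

    d_teacher / d_student are the scalar scores the discriminator assigns
    to a (prompt, response) pair; minimising -log sigma(d_t - d_s) pushes
    the teacher's score above the student's.
    """
    return -math.log(sigmoid(d_teacher - d_student))

# A discriminator that already ranks the teacher higher pays less loss.
confident = bradley_terry_loss(2.0, -1.0)   # teacher clearly preferred
undecided = bradley_terry_loss(0.0, 0.0)    # equal scores: loss = log 2
```

An undecided discriminator pays log 2 ≈ 0.693; training pushes teacher scores above student scores, which is exactly the signal the student later exploits as a reward.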


GAD workflow (reconstructed from the paper's overview figure):
- Data preparation: prompts from LMSYS-Chat-1M paired with GPT-5 responses.
- Model initialization: student generator G and discriminator D.
- Warmup stage (1 epoch): the generator is trained with a cross-entropy loss; the discriminator is trained with a Bradley-Terry loss to predict pairwise preference scores.
- GAD training loop (2 epochs): a minimax game max_G min_D V(G, D); the generator is updated with policy gradients (GRPO), using the discriminator score σ(D(y_t) - D(G(x))) as an on-policy reward.
- Evaluation: GPT-4o scoring and human assessment.
- Key innovations: black-box access, on-policy learning, an adaptive discriminator acting as a reward model, mode-seeking behavior, reward stability.
- Headline result: Qwen2.5-14B-Instruct ≈ GPT-5-Chat performance.
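The paper's overview names GRPO as the policy-gradient algorithm that turns discriminator scores into generator updates. A sketch of GRPO's group-relative reward normalization, with discriminator scores standing in as rewards (the numbers are illustrative):

```python
import statistics

def grpo_advantages(rewards):
    """Group-relative advantages in the style of GRPO: several student
    responses to the same prompt are scored by the discriminator, and each
    score is normalised against its own group, so the discriminator's
    absolute scale cancels out."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std if std > 0 else 1.0) for r in rewards]

# Discriminator scores for four sampled responses to one prompt:
advantages = grpo_advantages([0.9, 0.1, 0.5, 0.5])
```

Responses the discriminator rates above their group's mean get positive advantage and are reinforced; the normalization keeps the reward scale stable even as the discriminator itself keeps adapting.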
Q1. What is the main advantage of GAD over traditional black-box distillation methods?
a) It requires less computational resources
b) It enables on-policy learning through adversarial feedback
c) It can work with any size of language models

Q2. In the GAD framework, what role does the discriminator play?
a) It generates new training data
b) It evaluates the grammar of responses
c) It acts as an adaptive reward model that provides feedback to the student

Q3. What surprising result did the experiments show about model performance?
a) Qwen2.5-3B with GAD matched Qwen2.5-7B with SeqKD
b) GAD performed worse than baseline methods
c) The student models completely failed to learn

Paper 2

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Published: 2025-11-09

Link: http://arxiv.org/pdf/2511.08633

1. 📘 Topic and Domain: Training-free motion-controlled video generation using dual-clock denoising in the domain of computer vision and AI-generated video.
2. 💡 Previous Research and New Ideas: Builds on SDEdit's use of coarse layout cues for image editing, extends the idea to video, and introduces a novel dual-clock denoising process that lets different regions denoise at different rates.
3. ❓ Problem: Existing video generation methods lack precise motion control and require expensive model-specific fine-tuning.
4. 🛠️ Methods: Uses crude reference animations as motion guides, employs image conditioning to preserve appearance, and introduces dual-clock denoising that applies different noise schedules to motion-specified regions versus background.
5. 📊 Results and Evaluation: Outperformed existing training-based baselines on object and camera motion benchmarks, achieving better motion control and visual quality while being training-free and compatible with multiple video diffusion models.
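The SDEdit adaptation in the methods (starting denoising from the noised warped video rather than from pure noise) can be sketched as follows, assuming a standard DDPM-style variance-preserving schedule; flattened pixel lists stand in for tensors:

```python
import math
import random

def sdedit_start(v_warp, alpha_bar, seed=0):
    """SDEdit-style initialisation: denoising begins from the crude warped
    video V^w noised to an intermediate timestep, so the sample stays close
    to the user's motion layout. alpha_bar is the cumulative noise-schedule
    coefficient at that timestep (an assumption of a standard DDPM-style
    variance-preserving schedule)."""
    rng = random.Random(seed)
    a = math.sqrt(alpha_bar)
    b = math.sqrt(1.0 - alpha_bar)
    return [a * v + b * rng.gauss(0.0, 1.0) for v in v_warp]
```

Choosing the injection timestep trades fidelity to the crude animation (high alpha_bar) against the model's freedom to fix its artifacts (low alpha_bar); the dual-clock scheme applies different choices per region.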


Time-to-Move pipeline (reconstructed from the paper's overview figure):
- Input: an image I ∈ R^(3×H×W) plus a user-supplied motion control (trajectory).
- Motion signal generation: cut-and-drag or depth warping produces a crude warped video V^w.
- SDEdit adaptation: noise injection x_t* ~ q(x_t* | V^w) starts denoising from the warped video rather than from pure noise.
- Dual-clock denoising: a strong-alignment timestep t_strong inside the motion mask and a weak-alignment timestep t_weak for the background, blended as x_{t-1} ← (1 - M) ⊙ x̂_{t-1} + M ⊙ x^w_{t-1}.
- I2V diffusion model: image conditioning p_θ(x_0 | x_t*, I) preserves appearance.
- Output: video with realistic motion and preserved identity.
- Properties: training-free (no model retraining, plug-and-play), region-dependent spatially varying control, dual timesteps, joint motion-and-appearance control via pixel-level conditioning, architecture-agnostic (compatible with SVD, CogVideoX, WAN2.2).
- Applications: object motion (local control), camera motion (global control), appearance editing (style control).
- Summary: crude animation → motion injection → region-dependent denoising → realistic video.
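The dual-clock blend x_{t-1} ← (1 - M) ⊙ x̂_{t-1} + M ⊙ x^w_{t-1} reduces to a per-pixel interpolation; a minimal sketch with flattened pixel lists standing in for tensors:

```python
def dual_clock_step(x_hat, x_warp, mask):
    """One blend of the dual-clock update: pixels inside the motion mask M
    track the warped-video branch (strong alignment), while the background
    keeps the model's own denoised prediction (weak alignment).
    x_hat, x_warp, and mask are flattened per-pixel lists of equal length;
    mask entries lie in [0, 1]."""
    return [(1.0 - m) * xh + m * xw for xh, xw, m in zip(x_hat, x_warp, mask)]

# Background pixel (m=0) keeps the model prediction; masked pixel (m=1)
# follows the warped video.
blended = dual_clock_step(x_hat=[0.2, 0.2], x_warp=[0.8, 0.8], mask=[0.0, 1.0])
```

Because the blend is purely pointwise, it slots into any diffusion sampler's denoising loop without retraining, which is what makes the method architecture-agnostic.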
Q1. What is the key innovation of the Time-to-Move framework compared to previous approaches?
a) It requires extensive model training
b) It uses dual-clock denoising with different noise schedules for different regions
c) It only works with a single specific video diffusion model

Q2. Why does the paper use crude reference animations as motion guides?
a) They are easier to produce and flexible for user intent
b) They provide perfect visual quality
c) They require specialized hardware to generate

Q3. What unique capability does Time-to-Move introduce that goes beyond existing methods?
a) Faster video generation speed
b) Smaller model size requirements
c) Joint control of both motion and appearance through pixel-level conditioning

Paper 3

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Published: 2025-11-10

Link: http://arxiv.org/pdf/2511.07003

1. 📘 Topic and Domain: Large-scale multilingual machine translation centered on both Chinese and English, covering 60 languages and 234 translation directions.
2. 💡 Previous Research and New Ideas: Based on previous LLM-based translation research but addresses English-centric bias by introducing Chinese as a second pivot language, while proposing Strategic Downsampling and Parallel Multilingual Prompting.
3. ❓ Problem: Addressing the challenges of broad language coverage, consistent translation quality, and English-centric bias in multilingual machine translation systems.
4. 🛠️ Methods: Used a two-stage adaptation framework combining Continued Pre-training (CPT) and Supervised Fine-tuning (SFT), with Strategic Downsampling to prevent directional degeneration and Parallel Multilingual Prompting to enhance cross-lingual transfer.
5. 📊 Results and Evaluation: The 4B model (LMT-60-4B) achieved state-of-the-art performance among comparable models, surpassing larger models like Aya-101-13B and NLLB-54B, with consistent performance across high, medium, and low-resource languages.
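Strategic Downsampling amounts to a filtered pass over the training set; a minimal sketch, assuming an illustrative record format with direction tags (the 5% keep ratio is the paper's):

```python
import random

def strategic_downsample(examples, keep_ratio=0.05, seed=0):
    """Strategic Downsampling: retain only a small fraction (5% in the
    paper) of X->En and X->Zh training pairs, so symmetric multi-way data
    does not degenerate into excessive many-to-one mappings toward the
    pivot languages. The record format with a "direction" tag is an
    illustrative assumption."""
    rng = random.Random(seed)
    kept = []
    for example in examples:
        if example["direction"] in ("x-en", "x-zh") and rng.random() > keep_ratio:
            continue
        kept.append(example)
    return kept

# All En->X pairs survive; X->En pairs are thinned to roughly 5%.
data = [{"direction": "en-x"}] * 100 + [{"direction": "x-en"}] * 100
kept = strategic_downsample(data)
```

The asymmetry is deliberate: En/Zh→X directions keep full coverage, while the many-to-one X→En/Zh directions are starved just enough to prevent the degeneration the ablations quantify.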


LMT framework (reconstructed from the paper's overview figure):
- Data curation pipeline:
  - Monolingual: SlimPajama (EN), Skywork (ZH), CulturaX (other languages).
  - Bilingual: OPUS corpus plus pseudo-synthesis (2.1B EN-X, 2.9B ZH-X).
  - Quality control: OpusFilter and CometKiwi, multi-dimensional filtering.
  - SFT dataset: Flores-200, NTREX, SMol (596K pairs).
  - Final corpus: 60 languages, 234 directions, 90B tokens.
- Two-stage training pipeline: Qwen3-Base (0.6B/1.7B/4B/8B) → Continued Pre-training (CPT) on 90B tokens (1:1:1 ratio, informative formatting) → Supervised Fine-tuning (SFT) with Strategic Downsampling and PMP integration → final LMT model.
- Strategic Downsampling: symmetric multi-way data causes excessive many-to-one mappings and degrades X→En/Zh performance; the fix is to retain only 5% of X→En/Zh data.
- Parallel Multilingual Prompting (PMP): auxiliary-language context enhances cross-lingual transfer; En↔X uses a typologically similar auxiliary language, Zh↔X uses English as pivot; training mixes 50% STP with 50% PMP.
- Performance: state of the art among models with similar coverage; LMT-60-4B surpasses Aya-101-13B and NLLB-54B, with strong parameter efficiency and robustness across all directions.
- Ablations: SD adds +11.45 (X→Zh) and +5.83 (X→En); CPT adds +3.80 to +8.23 across all directions; PMP gives a steady boost throughout. The three components are synergistic and all essential for robust MMT adaptation.
- Coverage and evaluation: 60 languages (13 high-, 18 medium-, 29 low-resource) in a Chinese-English-centric design; FLORES-200 devtest with the COMET-22 metric; includes regional Chinese languages (Uyghur, Tibetan, Mongolian, Cantonese).
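Parallel Multilingual Prompting prepends a parallel sentence in an auxiliary language as semantic guidance; a minimal sketch of such a prompt builder (the template wording is an assumption, not the paper's exact prompt):

```python
def pmp_prompt(src_lang, src_text, tgt_lang, aux_lang, aux_text):
    """Parallel Multilingual Prompting: a parallel sentence in an auxiliary
    language is prepended as semantic guidance before the translation
    request. Per the paper, En<->X uses a typologically similar auxiliary
    language, while Zh<->X uses English as pivot. The template wording here
    is illustrative."""
    return (
        f"{aux_lang}: {aux_text}\n"
        f"{src_lang}: {src_text}\n"
        f"Translate the {src_lang} sentence into {tgt_lang}.\n"
        f"{tgt_lang}:"
    )

# English -> German, with Dutch as a typologically similar auxiliary:
prompt = pmp_prompt("English", "Good morning", "German", "Dutch", "Goedemorgen")
```

Mixing these prompts with standard translation prompts at training time (the 50/50 ratio above) lets the model exploit the auxiliary context when present without depending on it at inference.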
Q1. What key phenomenon did the researchers identify and address in multilingual fine-tuning?
a) Language drift in low-resource pairs
b) Directional degeneration in X→En/Zh translations
c) Vocabulary interference between languages

Q2. What is unique about the LMT model's approach compared to previous multilingual translation models?
a) It only focuses on English-centric translation
b) It uses Chinese as the sole pivot language
c) It centers on both Chinese and English as pivot languages

Q3. How does Parallel Multilingual Prompting (PMP) enhance translation quality?
a) By using larger training datasets
b) By adding auxiliary language context as semantic guidance
c) By increasing model parameters