2025-10-23 Papers


Paper 1

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Published: 2025-10-22

Link: http://arxiv.org/pdf/2510.19338

1. 📘 Topic and Domain: The paper presents Ring-linear models (Ring-mini-linear-2.0 and Ring-flash-linear-2.0), which are hybrid architecture language models combining linear and softmax attention for efficient long-context reasoning.
2. 💡 Previous Research and New Ideas: Based on previous research in Linear Attention (Mamba, Gated Linear Attention) and hybrid architectures, the paper proposes a new hybrid architecture that effectively balances between linear and softmax attention with systematic training-inference alignment.
3. ❓ Problem: The paper addresses the challenge of efficiently processing long sequences in language models while maintaining performance: softmax attention has quadratic computational complexity, and its KV cache (and the associated I/O) grows linearly with sequence length.
4. 🛠️ Methods: The authors implement a hybrid architecture combining linear and softmax attention, optimize FP8 training with fused kernels (LingHe), and develop systematic training-inference alignment for stable reinforcement learning training.
5. 📊 Results and Evaluation: The models achieve comparable or better performance than larger counterparts across various reasoning benchmarks while reducing inference costs by 90% compared to dense models and 50% compared to the original Ring series, with Ring-flash-linear-2.0 scoring particularly well on mathematical reasoning tasks.
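The paper reports hybrid ratios of 1:4 (Ring-mini-linear-2.0) and 1:7 (Ring-flash-linear-2.0) between softmax- and linear-attention layers. A minimal sketch of how such a layer schedule might be laid out, assuming one full softmax-attention layer follows each run of linear-attention layers (the function name and layout are my own illustration, not the paper's implementation):

```python
# Illustrative only: interleave linear-attention layers with occasional
# softmax-attention layers, as in a 1:4 or 1:7 hybrid ratio.

def hybrid_schedule(num_layers: int, linear_per_softmax: int) -> list:
    """Return a layer-type list: `linear_per_softmax` linear-attention
    layers followed by one softmax-attention layer, repeated."""
    block = ["linear"] * linear_per_softmax + ["softmax"]
    schedule = (block * (num_layers // len(block) + 1))[:num_layers]
    return schedule

# A 20-layer stack at the 1:4 ratio reported for Ring-mini-linear-2.0:
sched = hybrid_schedule(20, 4)
print(sched.count("softmax"), sched.count("linear"))  # 4 softmax, 16 linear
```

Keeping a few softmax layers preserves exact all-pairs attention where it matters, while the linear layers keep the KV-cache footprint nearly constant.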

Overview (key points from the paper's architecture diagram):

- Architecture: hybrid Lightning (linear) + softmax attention, MoE with a 1/32 activation ratio, grouped RMSNorm, RoPE + QK Norm.
- Computation optimization: GPU kernel fusion, FP8 training via the self-developed LingHe library, state-aware recompute, speculative decoding.
- Training pipeline: continued pre-training initialized from Ling-base to restore capabilities, context extension 4K → 32K → 128K with a WSM scheduler, then SFT (128K context: comprehensive reasoning, function calling, de-noising and safety) and RL across multiple domains (64K context, PPO with rollout probabilities) for long-term stable training.
- Training-inference alignment for stable RL: matched KV-cache precision, FP32 LM head, aligned RMSNorm and RoPE, deterministic MoE execution order.
- Models: Ring-mini-linear-2.0 (16B params, 957M active, 1:4 hybrid ratio, 128K context) and Ring-flash-linear-2.0 (104B params, 6.1B active, 1:7 hybrid ratio, 128K context).
- Efficiency: linear O(nd²) vs. quadratic O(n²d) attention complexity and a near-constant KV cache vs. linear growth, yielding a 50% training-efficiency improvement, roughly 10× lower inference cost than a 32B dense model, and 50% lower cost than the original Ring series.
- Evaluation: mathematical reasoning, coding and agent tasks, general reasoning.
Q1. What is the main innovation that helps Ring-linear models reduce inference costs compared to dense models?
- Using only linear attention mechanisms
- Combining linear attention with softmax attention in a hybrid architecture
- Implementing larger model parameters

Q2. What was identified as the root cause of training collapse in reinforcement learning for these models?
- Insufficient training data
- Model size limitations
- Training-inference disparity

Q3. How much improvement in training efficiency was achieved through the self-developed FP8 operator library (LingHe)?
- 25% improvement
- 50% improvement
- 75% improvement

Paper 2

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Published: 2025-10-22

Link: http://arxiv.org/pdf/2510.19363

1. 📘 Topic and Domain: Long-context reasoning in large language models through reinforcement learning, focusing on enhancing models' ability to reason over extensive text contexts.
2. 💡 Previous Research and New Ideas: Building on previous research in short-context reasoning and chain-of-thought prompting, the paper proposes a new KeyChain data synthesis method to transform short multi-hop QA into challenging long-context tasks.
3. ❓ Problem: Addresses the challenge of improving LLMs' ability to reason over long contexts (up to 128K tokens) while maintaining short-context capabilities and avoiding prohibitive training costs.
4. 🛠️ Methods: Implements LoongRL with KeyChain data construction, which inserts UUID chains to hide questions in long contexts, uses Group Relative Policy Optimization for training, and employs a multi-stage curriculum approach.
5. 📊 Results and Evaluation: Achieved significant improvements in long-context reasoning (+23.5% for 7B and +21.1% for 14B models), with LoongRL-14B reaching 74.2 score rivaling larger models, while maintaining short-context capabilities and generalizing effectively to 128K contexts.
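The paper trains with Group Relative Policy Optimization, whose advantage normalizes each rollout's reward against its group: A_i,t = (r_i − mean(rewards)) / std(rewards). A minimal sketch of that computation (function and variable names are mine; the formula is from the paper):

```python
# Group-relative advantage as used by GRPO: each rollout's reward is
# standardized within its own sampling group.
import numpy as np

def grpo_advantages(rewards):
    """A_i = (r_i - mean(r)) / std(r), computed within one rollout group."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)  # epsilon guards an all-equal group

# A group of 8 rollouts (the paper's group size) with binary rule-based rewards:
adv = grpo_advantages([1, 0, 0, 1, 1, 0, 0, 0])
```

Because the baseline is the group mean rather than a learned value function, no critic network is needed; correct rollouts get positive advantage, incorrect ones negative.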

Overview (key points from the paper's diagram):

- KeyChain data construction: curate seed multi-hop QA (HotpotQA, MuSiQue, 2WikiMultiHopQA), fill long contexts with distracting documents (~16K tokens), and insert UUID chains that hide the true question, forcing step-by-step chain tracing and reasoning over retrieval.
- Training data mix: KeyChain data (hard), multi-hop QA (medium), needle retrieval (easy), and math data (to preserve short-context skills).
- Multi-stage RL: Stage 1 warm-up → Stage 2 KeyChain augmentation → Stage 3 difficulty-focused; GRPO with a rule-based, two-way substring exact-match reward; advantage A_i,t = (r_i − mean(rewards)) / std(rewards).
- Emergent reasoning pattern: plan → retrieve → reason → recheck, trained at ~16K tokens yet generalizing to 128K contexts.
- Technical setup: Qwen2.5-7B/14B-Instruct; group size 8, LR 1e-6, temperature 0.6, top-p 0.95, max output 4096 tokens.
- Key results: +23.5% (7B) and +21.1% (14B) on long-context reasoning; LoongRL-14B scores 74.2, rivaling o3-mini (74.5) and DeepSeek-R1 (74.9); perfect Needle-in-a-Haystack performance; short-context abilities preserved.
- Evaluation: long-context (LongBench v1/v2), short-context (MMLU, MATH, IFEval), retrieval (RULER, Needle-in-a-Haystack), multi-hop QA (HotpotQA, MuSiQue, NarrativeQA, QASPER); baselines include R1-Distill variants, QwenLong-L1-32B, GPT-4o, and QwQ-32B, showing frontier-level reasoning at smaller scale.
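The KeyChain idea described above can be sketched in a few lines: hide the true question behind a chain of UUID pointers scattered among distractor documents, so the model must trace the chain link by link before it can even read the question. The exact prompt format is not given in this summary, so everything below (function name, link wording) is an illustrative assumption:

```python
# Hypothetical sketch of KeyChain-style data construction, not the
# paper's actual pipeline.
import random
import uuid

def build_keychain(question: str, distractors: list, chain_len: int = 3):
    ids = [str(uuid.uuid4()) for _ in range(chain_len)]
    # Each link points to the next UUID; only the last link reveals the question.
    links = [f"Key {ids[i]}: see key {ids[i + 1]}." for i in range(chain_len - 1)]
    links.append(f"Key {ids[-1]}: the real question is: {question}")
    docs = distractors + links
    random.shuffle(docs)  # bury the chain among distracting documents
    context = "\n".join(docs)
    prompt = f"{context}\n\nStart from key {ids[0]} and answer the question."
    return prompt, ids

prompt, ids = build_keychain(
    "Who directed the film that won Best Picture in 1998?",
    [f"Distractor document {i}." for i in range(5)],
)
```

Because simple keyword retrieval cannot find the question, the model is pushed toward the plan-retrieve-reason-recheck behavior the paper reports.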
Q1. What is the key innovation of the KeyChain data synthesis method?
- It simply adds random text to make contexts longer
- It inserts UUID chains that hide the true question among distracting documents
- It combines multiple short questions into one long question

Q2. What unique reasoning pattern emerged from models trained with LoongRL?
- Quick answer generation without explanation
- Random trial-and-error approach
- Plan-retrieve-reason-recheck systematic thinking

Q3. How did LoongRL achieve efficient training for 128K-context tasks?
- By using massive computing resources to train directly on 128K contexts
- By training on 16K contexts and leveraging natural generalization of the learned reasoning patterns
- By gradually increasing context length during training

Paper 3

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Published: 2025-10-22

Link: http://arxiv.org/pdf/2510.19430

1. 📘 Topic and Domain: A Vision-Language-Action (VLA) model called GigaBrain-0 for robotic manipulation tasks, operating in the domain of robotics and artificial intelligence.
2. 💡 Previous Research and New Ideas: Based on previous VLA models and world model research, introduces a novel approach using world model-generated data (video generation, real2real transfer, human transfer, view transfer, sim2real transfer) instead of relying heavily on real robot data.
3. ❓ Problem: Addresses the challenge of collecting large-scale real-world robot data, which is expensive, time-consuming, and limited in diversity, hindering the development of robust, general-purpose robotic systems.
4. 🛠️ Methods: Employs a mixture-of-transformers architecture combining Vision-Language Model (VLM) and action Diffusion Transformer (DiT), enhanced with RGB-D input modeling, embodied Chain-of-Thought supervision, and Knowledge Insulation for better spatial reasoning and action generation.
5. 📊 Results and Evaluation: Achieved superior performance across various tasks (dexterous manipulation, long-horizon tasks, mobile manipulation), with significantly improved generalization in appearance, object placement, and camera viewpoint variations, while also offering a lightweight variant (GigaBrain-0-Small) for efficient on-device deployment.
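The action Diffusion Transformer is trained with flow matching over action chunks. As a minimal illustration of what a flow-matching regression target looks like (my own sketch under the standard linear-interpolation formulation, not the paper's implementation; shapes are assumptions):

```python
# Flow-matching training target for an action chunk: interpolate between a
# noise sample and the ground-truth chunk, and regress the constant
# velocity between them.
import numpy as np

rng = np.random.default_rng(0)
chunk_len, action_dim = 16, 7                    # e.g. 16 future steps, 7-DoF arm
a1 = rng.normal(size=(chunk_len, action_dim))    # ground-truth action chunk
a0 = rng.normal(size=(chunk_len, action_dim))    # noise sample
t = rng.uniform()                                # random interpolation time in [0, 1)

x_t = (1 - t) * a0 + t * a1        # point on the straight-line path at time t
target_velocity = a1 - a0          # what the network is trained to predict at (x_t, t)
```

At inference time, integrating the predicted velocity field from noise recovers a full action chunk, which is what makes chunked real-time control feasible.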

Overview (key points from the paper's diagram):

- Data sources: real-world robot data plus world-model-generated data (video generation, real2real transfer, view transfer, sim2real transfer, human transfer) and egocentric human data.
- Input processing: RGB-D observations and language instructions, handled by a PaliGemma2-based vision-language expert trained with Knowledge Insulation.
- Embodied Chain-of-Thought: intermediate supervision over manipulation trajectories, subgoal language, and discrete action tokens.
- Action expert: a Diffusion Transformer trained with flow matching for action-chunk prediction, with a GRU decoder producing trajectories for real-time control.
- Outputs and capabilities: dexterous manipulation, long-horizon tasks, mobile manipulation, and generalization; evaluated on the AgiBot G1 and PiPER robot platforms, alongside the lightweight GigaBrain-0-Small variant.
Q1. What is the primary innovation of GigaBrain-0 that addresses the challenge of collecting real-world robot data?
- Using multiple cameras to capture more training data
- Leveraging world model-generated data for training
- Implementing faster data collection robots

Q2. What unique architectural feature does GigaBrain-0 use to enhance spatial reasoning in robotic tasks?
- Embodied Chain-of-Thought supervision with intermediate reasoning tokens
- Standard transformer architecture with attention layers
- Simple convolutional neural networks

Q3. How does GigaBrain-0-Small achieve efficient on-device performance?
- By reducing the robot's physical components
- By using cloud computing for all processing
- By optimizing memory transfers and using mixed-precision inference