2025-12-09 Papers

Paper 1

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Published: 2025-12-08

Link: http://arxiv.org/pdf/2512.07525

1. 📘 Topic and Domain: Improving Rotary Position Embeddings (RoPE) for long-context Large Language Models by utilizing complex-valued attention calculations.
2. 💡 Previous Research and New Ideas: Builds on standard RoPE, which uses only the real component of the complex-valued dot product; proposes incorporating the previously discarded imaginary component to enrich the position encoding.
3. ❓ Problem: Standard RoPE implementations discard imaginary components of complex attention calculations, potentially losing valuable positional information needed for modeling long-range dependencies.
4. 🛠️ Methods: Introduces RoPE++ with two configurations: RoPE++EH (equal heads with halved cache) and RoPE++EC (equal cache with doubled heads), which reincorporates imaginary components into attention calculations.
5. 📊 Results and Evaluation: Both RoPE++ configurations outperformed standard RoPE across short and long-context tasks in 376M and 776M models, with RoPE++EH achieving comparable results using half the cache and RoPE++EC showing significant improvements with the same cache size.
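The core idea can be illustrated with a short NumPy sketch: view each channel pair as a complex number, apply RoPE as a complex rotation, and keep both the real and (negated) imaginary parts of the query-key product. This is an illustration of the idea as summarized above, not the paper's implementation; the function names and the sign convention of the quarter-turn rotation are assumptions.

```python
import numpy as np

def rope_complex(x, pos, theta_base=10000.0):
    """View each channel pair as a complex number and rotate it by a
    position-dependent angle -- standard RoPE as complex multiplication."""
    d = x.shape[-1]
    freqs = theta_base ** (-np.arange(d // 2) / (d // 2))
    z = x[..., 0::2] + 1j * x[..., 1::2]
    return z * np.exp(1j * pos[:, None] * freqs[None, :])

T, d = 8, 16
rng = np.random.default_rng(0)
q, k = rng.normal(size=(T, d)), rng.normal(size=(T, d))
pos = np.arange(T, dtype=float)
qz, kz = rope_complex(q, pos), rope_complex(k, pos)

# Full complex attention logits q_t * conj(k_s): standard RoPE keeps only
# the real part; RoPE++ also scores with the (negated) imaginary part.
scores = qz @ np.conj(kz).T
A_re = scores.real
A_im = -scores.imag

# The imaginary component reuses the same attention kernel: a quarter-turn
# rotation of q (multiplying by 1j under this conjugation convention)
# turns the real part of the product into exactly A_im.
A_im_via_rotation = ((1j * qz) @ np.conj(kz).T).real
```

Because the rotation is just another RoPE phase shift, the imaginary heads run through the same (FlashAttention-compatible) kernel as the real heads, which is why no extra machinery is needed at inference time.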

[Methodology flow chart] RoPE++ workflow: Problem → Analysis → Design → Implementation → Evaluation → Results

- Problem identification: standard RoPE discards the imaginary component of complex attention.
- Mathematical analysis: recover the imaginary part A_Im = -Im(q·k̄); its positional behavior follows a sine-integral function.
- RoPE++ design: real + imaginary attention heads; the imaginary score is obtained by rotating q_t by -π/2.
  - RoPE++EH: equal heads, half KV cache, half parameters.
  - RoPE++EC: equal cache, double heads, better performance.
- Long dependency: imaginary attention captures longer-range context dependencies via the sine function.
- Cache efficiency: no extra KV cache, shared parameters, FlashAttention-compatible.
- Length extrapolation: a wider positional information range (full cos/sin values) yields better extrapolation.
- Pre-training setup: 376M, 776M, and 1.5B models; DCLM-Baseline corpus; 50B tokens; 4k → 32k context.
- Evaluation benchmarks: short-context (WikiText, LAMBADA) and long-context (RULER, BABILong), up to 64k context length.
- Key results: RoPE++EH matches standard RoPE with half the cache; RoPE++EC outperforms it at the same cache size; better long-context modeling; compatible with other long-context techniques.
- Attention analysis: imaginary heads focus on global information, real heads on local context.
Q1. What is the main innovation of RoPE++ compared to standard RoPE?
- It completely replaces real components with imaginary components
- It uses both real and imaginary components of complex attention calculations
- It adds more rotary matrices to the position embeddings

Q2. What unique advantage does the RoPE++EH configuration offer?
- It doubles the cache size while maintaining performance
- It uses half the cache size while achieving comparable results
- It triples the attention heads with no additional cost

Q3. How does the imaginary attention component in RoPE++ differ from real attention in terms of context handling?
- Imaginary attention focuses only on local context
- Imaginary attention completely ignores distant positions
- Imaginary attention attends more to distant positions

Paper 2

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Published: 2025-12-08

Link: http://arxiv.org/pdf/2512.07461

1. 📘 Topic and Domain: The paper introduces Native Parallel Reasoner (NPR), a framework for enabling Large Language Models to perform parallel reasoning, falling within the domain of artificial intelligence and language model optimization.
2. 💡 Previous Research and New Ideas: Builds on previous parallel-reasoning work such as Multiverse and MapReduce-style paradigms; proposes a novel teacher-free approach in which models self-evolve parallel reasoning capabilities without external supervision.
3. ❓ Problem: The paper addresses the challenge of enabling language models to perform genuine parallel reasoning rather than sequential emulation, while avoiding reliance on external teacher models or supervised distillation.
4. 🛠️ Methods: The paper implements a three-stage progressive training paradigm: (1) Format-follow RL to discover parallel structures, (2) Parallel warmup through self-distilled data, and (3) Native-parallel RL using a novel Parallel-Aware Policy Optimization algorithm and NPR Engine.
5. 📊 Results and Evaluation: Testing on eight reasoning benchmarks showed performance gains up to 24.5%, inference speedups up to 4.6×, and achieved 100% genuine parallel execution, with consistent improvements over baseline models like Multiverse-32B and Multiverse-4B.
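The tagged reasoning format described in the paper makes the parallelism explicit: each `<step>` depends only on its own `<plan>`, so the branches can be decoded independently. The sketch below parses such a trace and runs the steps concurrently; the regex parsing, the example trace, and the worker function are illustrative assumptions, not the NPR Engine itself.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# A hypothetical trace in the paper's <plan>/<step>/<takeaway> format.
TRACE = """
<guideline>
<plan>1: factor the number</plan>
<plan>2: check divisibility directly</plan>
</guideline>
<step>1: 91 = 7 * 13, so 91 is composite</step>
<step>2: 91 / 7 = 13 with no remainder</step>
<takeaway>Both plans agree: 91 is composite.</takeaway>
"""

def parse(trace):
    # Pull out the parallel plans, their independent steps, and the
    # synthesis block that compares the steps.
    plans = re.findall(r"<plan>(.*?)</plan>", trace, re.S)
    steps = re.findall(r"<step>(.*?)</step>", trace, re.S)
    takeaway = re.search(r"<takeaway>(.*?)</takeaway>", trace, re.S).group(1)
    return plans, steps, takeaway.strip()

def run_step(step):
    # Stand-in for decoding one reasoning branch; since each <step> is
    # independent of its siblings, branches can execute in parallel.
    return step.strip()

plans, steps, takeaway = parse(TRACE)
with ThreadPoolExecutor(max_workers=len(steps)) as pool:
    results = list(pool.map(run_step, steps))
```

In the real system the branches are parallel decoding streams sharing a prefix KV cache rather than Python threads, but the structural contract, one independent `<step>` per `<plan>` followed by a joint `<takeaway>`, is the same.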

[Workflow chart] Native Parallel Reasoner (NPR) training pipeline:

- Stage 1, format-follow RL: task data and an instruct model trained with DAPO on format + accuracy rewards → NPR-ZERO.
- Stage 2, parallel SFT: rejection sampling produces self-distilled data; parallel attention mask and position encoding; parallel SFT → NPR-BETA.
- Stage 3, native parallel RL: NPR Engine parallel rollout with PAPO and an accuracy reward → NPR.
- NPR Engine enhancements: KV-cache fix (memory-corruption prevention); token-budget management via a global token ledger; schema validation (structural invariant enforcement); selective repetition penalty inside <step> blocks.
- PAPO algorithm features: schema-level structural filtering during rollout; batch-level advantage normalization; preserved gradients on special tokens; on-policy training that eliminates importance sampling for stability.
- Parallel reasoning format: a <guideline> block with numbered <plan> entries; independent <step> blocks processed in parallel, one per plan; a <takeaway> that compares the steps and synthesizes findings; a user-facing summary ending in the final \boxed{answer}.
- Key results: up to 24.5% performance gain (AIME25: 50.4%); up to 4.6× speedup over autoregressive decoding; 100% genuine parallel execution; self-distillation beats teacher-generated data by 10.1 points.
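The batch-level advantage normalization used by PAPO can be sketched in a few lines, assuming the common zero-mean, unit-variance construction over scalar rollout rewards; the paper's exact estimator, clipping, and grouping may differ.

```python
import numpy as np

def batch_normalized_advantages(rewards, eps=1e-8):
    # Normalize scalar rollout rewards across the whole batch, rather than
    # within per-prompt groups, yielding zero-mean, unit-variance advantages.
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + eps)

# Hypothetical batch: 1.0 = correct rollout, 0.0 = incorrect.
adv = batch_normalized_advantages([1.0, 0.0, 1.0, 0.0, 1.0])
```

Normalizing over the batch rather than per group keeps the gradient scale stable even when some prompts produce all-correct or all-incorrect rollouts.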
Q1. What is the main innovation in NPR's approach compared to previous parallel reasoning methods?
- It uses multiple teacher models to guide parallel reasoning
- It enables self-evolution of parallel capabilities without external teachers
- It focuses solely on sequential reasoning with faster processing

Q2. In the NPR framework's three-stage training process, what happens during Stage 2?
- Direct reinforcement learning to optimize parallel reasoning
- Initial format discovery using the DAPO algorithm
- Supervised fine-tuning on self-distilled parallel trajectories

Q3. What was the most significant performance improvement achieved by NPR compared to baseline models?
- 100% accuracy improvement across all benchmarks
- Up to 24.5% performance gain and 4.6× inference speedup
- 2× reduction in computational resources

Paper 3

Voxify3D: Pixel Art Meets Volumetric Rendering

Published: 2025-12-08

Link: http://arxiv.org/pdf/2512.07834

1. 📘 Topic and Domain: The paper presents Voxify3D, a framework for converting 3D meshes into stylized voxel art with controllable abstraction, operating in the domain of 3D graphics and neural rendering.
2. 💡 Previous Research and New Ideas: Based on neural radiance fields and pixel art generation research, it introduces new techniques for combining 2D pixel art supervision with 3D voxel optimization using orthographic projection and palette-constrained color quantization.
3. ❓ Problem: The paper addresses the challenge of automatically generating high-quality voxel art from 3D meshes while maintaining semantic features, geometric consistency, and discrete color palettes.
4. 🛠️ Methods: Uses a two-stage pipeline: first initializes coarse voxel geometry using neural volume rendering, then refines it using orthographic pixel art supervision with CLIP-based semantic loss and Gumbel-Softmax for palette quantization.
5. 📊 Results and Evaluation: Achieves superior performance with CLIP-IQA score of 37.12 and 77.90% user preference, demonstrating better semantic preservation and visual quality compared to existing methods across diverse character models and controllable abstraction levels.
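The Gumbel-Softmax palette quantization from the methods above can be sketched as follows: perturb per-voxel color logits with Gumbel noise and soften the argmax with a temperature, so palette selection stays differentiable while annealing toward hard assignments. A minimal NumPy illustration; the grid size, palette, and annealing schedule are assumptions.

```python
import numpy as np

def gumbel_softmax(logits, tau, rng):
    # Differentiable discrete sampling: perturb logits with Gumbel noise,
    # then soften the argmax with a temperature-controlled softmax.
    y = (logits + rng.gumbel(size=logits.shape)) / tau
    y -= y.max(axis=-1, keepdims=True)        # for numerical stability
    e = np.exp(y)
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
# Hypothetical 4-color palette (black, red, yellow, white) in RGB.
palette = np.array([[0., 0., 0.], [1., 0., 0.], [1., 1., 0.], [1., 1., 1.]])
logits = rng.normal(size=(20, 20, 20, 4))     # per-voxel color logits

# Annealing tau from 1.0 toward 0.1 drives soft palette mixtures toward
# hard one-hot color assignments as optimization proceeds.
soft = gumbel_softmax(logits, tau=1.0, rng=rng)
hard = gumbel_softmax(logits, tau=0.1, rng=rng)
colors = hard @ palette                       # (20, 20, 20, 3) voxel colors
```

At high temperature the rendered colors are blends that gradients can flow through; by the end of annealing each voxel is effectively committed to one palette entry, giving the discrete look of pixel art.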

[Pipeline chart] Voxify3D workflow:

- Input: a 3D mesh, rendered from six canonical views; pixel art generation via MYOS stylization.
- Stage 1: coarse voxel grid training with DVGO, 8000 iterations, MSE + density + background losses.
- Stage 2: orthographic pixel-art fine-tuning, 6500 iterations, multi-loss optimization (pixel, depth, alpha, and CLIP losses); orthographic projection provides pixel-voxel alignment.
- CLIP semantic loss: patch-based alignment on 80×80 patches with cosine similarity, for semantic preservation.
- Palette extraction: K-means, Max-Min, Median Cut, or simulated annealing; 2-8 colors.
- Gumbel-Softmax: differentiable quantization with temperature annealing (τ: 1.0 → 0.1) over a voxel logit grid (color logits λᵢⱼₖ) for discrete palette assignment.
- Output: stylized voxel art with controllable abstraction, a discrete color palette, and semantic preservation.
- Controls: resolution (20, 30, 40, or 50 voxel grids, trading detail against abstraction); color (2, 3, 4, or 8 colors with different palette strategies for style variation); semantic fidelity via CLIP guidance (feature preservation, identity maintenance).
- Key innovations: orthographic alignment, differentiable discrete optimization, end-to-end pipeline.
- Performance: best CLIP-IQA (37.12); 77.90% user preference; ~2 hours training time; superior to all baselines.
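The orthographic projection in Stage 2 can be sketched as an axis-aligned compositing pass: because orthographic rays are parallel to a grid axis, each output pixel maps to exactly one voxel column, which is what makes pixel-level supervision line up with individual voxels. A minimal sketch assuming a standard front-to-back volume-rendering formulation; the function names and opacity model are assumptions, not the paper's code.

```python
import numpy as np

def orthographic_composite(density, color):
    """Front-to-back alpha compositing of a voxel grid along one canonical
    axis. density: (D, H, W) non-negative; color: (D, H, W, 3)."""
    alpha = 1.0 - np.exp(-np.asarray(density))        # per-voxel opacity
    trans = np.cumprod(1.0 - alpha, axis=0)           # transmittance so far
    trans = np.concatenate([np.ones_like(trans[:1]), trans[:-1]], axis=0)
    weights = alpha * trans                           # contribution per voxel
    return (weights[..., None] * color).sum(axis=0)   # (H, W, 3) image

# Sanity check: a single fully opaque slab should dominate the pixel color.
D, H, W = 4, 8, 8
density = np.zeros((D, H, W))
density[1] = 50.0                                     # opaque slab at depth 1
color = np.zeros((D, H, W, 3))
color[1] = [1.0, 0.0, 0.0]                            # red slab
img = orthographic_composite(density, color)
```

With six such canonical-view renders, every pixel-art supervision target constrains a known voxel column, which is the alignment the chart above refers to.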
Q1. What is the main technical innovation that allows Voxify3D to achieve precise pixel-voxel alignment?
- Using perspective projection with multiple cameras
- Employing six-view orthographic rendering
- Applying random view sampling during training

Q2. In the two-stage pipeline of Voxify3D, what is the primary purpose of Stage 1?
- To apply CLIP-based semantic loss
- To establish coarse voxel geometry and color foundations using DVGO
- To perform palette-based color quantization

Q3. What range of colors does Voxify3D typically work with in its palette-constrained optimization?
- 15-20 colors
- 10-15 colors
- 2-8 colors