2025-11-25 Papers

Paper 1

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Published: 2025-11-24

Link: http://arxiv.org/pdf/2511.19304

1. 📘 Topic and Domain: The paper focuses on automated environment generation and cross-environment agent learning evaluation in artificial intelligence, specifically developing a framework called AutoEnv for creating and measuring how well AI agents learn across different environments.
2. 💡 Previous Research and New Ideas: Building on prior work in single-environment agent learning and human-designed environments, it introduces two new ideas: AutoEnv, an automated environment-generation framework, and a formal component-centric learning process with Selection, Optimization, and Evaluation stages.
3. ❓ Problem: The paper addresses the lack of diverse, controllable environments for testing AI agents' cross-environment learning abilities and the absence of a unified way to represent how agents learn across different environments.
4. 🛠️ Methods: The authors developed AutoEnv to automatically generate environments by treating them as factorizable distributions over transitions, observations, and rewards, and created AutoEnv-36 (a dataset of 36 environments with 358 validated levels) to test eight different learning methods.
5. 📊 Results and Evaluation: Seven language models achieved only 12-49% normalized reward on AutoEnv-36; the benefit of any single fixed learning method diminished as environment diversity increased, while environment-adaptive method selection improved performance but showed diminishing returns as the method space expanded.
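The eight learning methods come from crossing two selection strategies, two optimization targets, and two learnable agent components. A minimal sketch of that 2 × 2 × 2 grid, assuming the axis labels from the paper's overview (the dict layout is an illustrative assumption, not the paper's actual API):

```python
from itertools import product

# Axis labels as listed in the paper's overview figure; the dict layout
# below is an illustrative assumption, not the paper's actual interface.
SELECTION = ("best", "pareto")              # how candidate components are kept
OPTIMIZATION = ("dynamics", "instruction")  # what the optimizer targets
COMPONENT = ("prompt", "agent_code")        # which agent component is learned

def enumerate_methods():
    """Instantiate all 2 x 2 x 2 = 8 learning-method combinations."""
    return [
        {"selection": s, "optimization": o, "component": c}
        for s, o, c in product(SELECTION, OPTIMIZATION, COMPONENT)
    ]

for m in enumerate_methods():
    print(m["selection"], m["optimization"], m["component"])
```

Enumerating the grid this way makes clear why "environment-adaptive selection" is a search over a small, discrete method space, and why its returns diminish as that space grows.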

Overview figure: AutoEnv framework (details recovered from the diagram)

- Environment generation pipeline: environment theme → DSL design (YAML) → code synthesis (BaseEnv/ObsEnv/SkinEnv) → self-repair loop → three-stage verification (execution, level generation, reliability) → AutoEnv-36.
- Environment formulation: E = (S, A, T, R, Ω, τ), with a three-layer abstraction BaseEnv → ObsEnv → SkinEnv.
- AutoEnv-36 statistics: 36 environments, 358 levels; binary/accumulative rewards 50%/50%; full/partial observability 42%/58%; generation cost $4.12 per environment; 90% execution success rate.
- Component-centric learning: 2 selection strategies (best/Pareto) × 2 optimization targets (dynamics/instruction) × 2 learned components (prompt/agent code) = 8 instantiated methods; reward-based evaluation; a learning upper bound is defined.
- Evaluation: 7 language models reach only 12-49% normalized reward; performance is analyzed by reward type and observation semantics.
- Performance gap: best single method 42.40% vs. a learning upper bound of 47.75%, leaving 5.35 points of headroom.
- Key insights: fixed learning methods break down as environment diversity increases; environment-adaptive selection substantially improves performance but shows diminishing returns; heterogeneous environments provide diverse learning-signal characteristics; future direction: automatic learning-strategy design for cross-environment generalization.
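The three-layer abstraction (BaseEnv → ObsEnv → SkinEnv) separates dynamics, observation, and surface theme. A minimal sketch of that layering, assuming a toy corridor environment; the interface and the "frozen lake" skin are hypothetical, not the paper's generated code:

```python
class BaseEnv:
    """Core dynamics layer: states S, actions A, transition T, reward R,
    horizon tau. (Hypothetical minimal interface for illustration.)"""
    def __init__(self, size=5, horizon=10):
        self.size, self.horizon = size, horizon
        self.state, self.t = 0, 0

    def step(self, action):
        # T(s' | s, a): move along a corridor, clipped to [0, size-1]
        self.state = max(0, min(self.size - 1, self.state + action))
        self.t += 1
        # R(s, a): binary reward for reaching the last tile
        reward = 1.0 if self.state == self.size - 1 else 0.0
        done = self.t >= self.horizon or reward > 0
        return self.state, reward, done

class ObsEnv(BaseEnv):
    """Observation layer: emits Omega(o | s); could hide state for
    partial observability."""
    def observe(self):
        return {"position": self.state}  # fully observable here

class SkinEnv(ObsEnv):
    """Skin layer: re-themes observations without changing dynamics."""
    def observe(self):
        obs = super().observe()
        return f"You stand at tile {obs['position']} of a frozen lake."
```

Because the skin only rewrites observations, two SkinEnvs over the same BaseEnv share dynamics but look entirely different to an agent, which is what lets AutoEnv vary observation semantics independently of transitions and rewards.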
Q1
1. What surprising finding did the researchers discover about environments with inverse semantics in AutoEnv-36?
They were too difficult for agents to handle
They yielded higher scores than aligned semantic environments
They required more computational resources to generate
Q2
2. What was the average cost per environment generation using AutoEnv?
$4.12
$14.20
$40.12
Q3
3. How does a fixed learning method's performance change as environment diversity increases?
It stays constant regardless of environment diversity
It improves with more diverse environments
Its benefit quickly diminishes with more environments

Paper 2

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

Published: 2025-11-24

Link: http://arxiv.org/pdf/2511.19365

1. 📘 Topic and Domain: The paper presents DeCo, a novel frequency-decoupled pixel diffusion framework for end-to-end image generation in computer vision and deep learning.
2. 💡 Previous Research and New Ideas: Building on previous pixel-diffusion and latent-diffusion models, it proposes an architecture that separates high- and low-frequency components during image generation, unlike traditional methods that model both jointly.
3. ❓ Problem: The paper addresses the inefficiency of existing pixel diffusion models that struggle to jointly model complex high-frequency signals and low-frequency semantics within a single diffusion transformer.
4. 🛠️ Methods: The method pairs a Diffusion Transformer (DiT) for low-frequency semantics with a lightweight pixel decoder for high-frequency details, and trains with a frequency-aware flow-matching loss inspired by JPEG compression.
5. 📊 Results and Evaluation: DeCo achieves FID scores of 1.62 (256×256) and 2.22 (512×512) on ImageNet and a leading GenEval score of 0.86, outperforming existing pixel-diffusion methods while matching two-stage latent-diffusion approaches.
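The two-branch split can be sketched shape-wise in NumPy: the DiT branch sees only a heavily downsampled view, while the decoder works at full resolution conditioned on the semantic code. The pooling factor, shapes, and the stand-in "networks" below are illustrative assumptions, not the paper's DiT or decoder:

```python
import numpy as np

rng = np.random.default_rng(0)

def avg_pool(x, p):
    """Patchify by averaging p x p blocks (stand-in for patch embedding)."""
    h, w = x.shape
    return x.reshape(h // p, p, w // p, p).mean(axis=(1, 3))

def dit_branch(x_low, t):
    """Placeholder for the DiT: maps the low-res view to a semantic code c."""
    return np.tanh(x_low + t)            # real model: transformer over patches

def pixel_decoder(x_t, t, c, p=16):
    """Placeholder for the lightweight decoder: predicts a per-pixel
    velocity, conditioned on the upsampled semantic code (cf. AdaLN)."""
    c_up = np.kron(c, np.ones((p, p)))   # broadcast c back to full resolution
    return x_t * 0.1 + c_up              # real model: attention-free MLP stack

x_t = rng.standard_normal((64, 64))      # noisy image at timestep t
t = 0.3
x_low = avg_pool(x_t, 16)                # 4 x 4 low-res view for the DiT
c = dit_branch(x_low, t)
v_pred = pixel_decoder(x_t, t, c)
print(x_low.shape, v_pred.shape)         # (4, 4) (64, 64)
```

The point of the split is visible in the shapes: the expensive transformer only processes the 4×4 semantic grid, while the cheap decoder handles the full 64×64 pixel field.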

Overview figure: DeCo framework (details recovered from the diagram)

- Pipeline: original image x₀ (512×512) → add noise x_t = (1−t)x₀ + t·x₁ → multi-scale split into a high-res view x_t (original) and a low-res view x̄_t (patch size 16).
- Diffusion Transformer (DiT) models low-frequency semantics: c = θ_DiT(x̄_t, t, y).
- Lightweight pixel decoder recovers high-frequency details: v_θ = θ_Dec(x_t, t, c); decoder components: dense query (patch size 1), AdaLN modulation, 3 linear layers, attention-free, 8.5M parameters.
- Losses: standard flow matching L_FM = E[‖v_θ − v_t‖²] (pixel-level supervision) and frequency-aware flow matching L_FreqFM = E[w‖V_θ − V_t‖²], where V is obtained via RGB → YCbCr → DCT and the weights w come from JPEG quality tables.
- Key advantages: frequency decoupling (DiT focuses on semantics, decoder handles details); ~10× faster convergence with reduced computational overhead; FID 1.62 (256×256) and 2.22 (512×512) on ImageNet.
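The frequency-aware flow-matching loss compares predicted and target velocities in DCT space with per-frequency weights. A minimal NumPy sketch, assuming an orthonormal DCT-II and a simple low-frequency-favoring weight map as a stand-in for the paper's JPEG-derived weights:

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix (the transform behind JPEG)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    C[0] /= np.sqrt(2.0)
    return C

def freq_aware_fm_loss(v_pred, v_target, weights):
    """L_FreqFM = E[ w * ||V_pred - V_target||^2 ] with V = DCT(v).
    `weights` is illustrative; the paper derives it from JPEG quality
    tables after an RGB -> YCbCr conversion."""
    n = v_pred.shape[-1]
    C = dct_matrix(n)
    V_pred = C @ v_pred @ C.T        # 2-D DCT: transform rows, then columns
    V_target = C @ v_target @ C.T
    return float(np.mean(weights * (V_pred - V_target) ** 2))

# Flow-matching setup: x_t = (1 - t) x0 + t x1, target velocity v = x1 - x0
rng = np.random.default_rng(0)
x0, x1 = rng.standard_normal((8, 8)), rng.standard_normal((8, 8))
t = 0.3
x_t = (1 - t) * x0 + t * x1
v_target = x1 - x0

# Emphasize low frequencies (top-left of the DCT grid), as JPEG does
fu, fv = np.meshgrid(np.arange(8), np.arange(8), indexing="ij")
weights = 1.0 / (1.0 + fu + fv)

loss = freq_aware_fm_loss(v_target + 0.1, v_target, weights)
```

Because the DCT is orthonormal, a perfect prediction still gives zero loss; the weights only re-balance *where* in the spectrum errors are penalized, which is how the loss steers the decoder toward perceptually relevant bands.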
Q1
1. What is the main innovation of DeCo compared to traditional pixel diffusion models?
It uses a larger transformer architecture
It separates high and low frequency components during generation
It completely eliminates the need for neural networks
Q2
2. The frequency-aware flow-matching loss in DeCo is inspired by which technology?
MP3 audio compression
PNG image format
JPEG compression
Q3
3. What practical advantage does DeCo demonstrate in terms of training efficiency?
It achieves the same FID score 10x faster than baseline
It requires 50% less memory during training
It can only be trained on small datasets

Paper 3

General Agentic Memory Via Deep Research

Published: 2025-11-23

Link: http://arxiv.org/pdf/2511.18423

1. 📘 Topic and Domain: The paper presents a novel memory framework called General Agentic Memory (GAM) in the domain of artificial intelligence, specifically focusing on memory systems for large language models.
2. 💡 Previous Research and New Ideas: The paper frames previous static memory systems as Ahead-of-Time (AOT) compilation and proposes a Just-in-Time (JIT) alternative that constructs optimized contexts at runtime while keeping the offline memory simple.
3. ❓ Problem: The paper aims to solve the limitations of static memory systems which suffer from information loss and lack of flexibility in adapting to unforeseen requests.
4. 🛠️ Methods: The paper implements a dual-agent framework consisting of a Memorizer that creates lightweight memory while storing complete history in a page-store, and a Researcher that performs deep research to retrieve and integrate relevant information for requests.
5. 📊 Results and Evaluation: The system achieved substantial improvements over existing memory methods across multiple benchmarks including LoCoMo, HotpotQA, RULER, and NarrativeQA, with particularly strong performance on multi-step retrieval and reasoning tasks.

Overview figure: GAM workflow (details recovered from the diagram)

- Offline stage: input history (sess₁, sess₂, …, sessₙ) → Memorizer (memorizing, then paging into headers and pages) → memory page store.
- Online stage: request → Researcher runs a deep-research loop of planning, searching, reflection, and integration, using vector search, BM25 retrieval, and page-ID access, to produce an optimized context.
- JIT vs. AOT paradigm: traditional AOT memory does heavy offline computation and loses information; GAM's JIT approach optimizes at runtime with lossless retrieval.
- Key advantages: high fidelity, adaptability, and domain generalizability.
- End-to-end optimization via reinforcement learning: a Memorizer policy (θₘ) learns memory construction, a Researcher policy (θᵣ) learns search strategy, and a reward function (Γ) scores task-completion quality.
- Test-time scalability: increasing reflection depth and the number of retrieved pages improves performance.
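The Memorizer/Researcher split above can be sketched with a toy page store: the Memorizer keeps the full text losslessly and only a light header per page, and the Researcher searches headers at request time. The keyword-overlap scoring below is a stand-in for the paper's vector search and BM25 tools, and all class names mirror the figure rather than any released code:

```python
from dataclasses import dataclass, field

@dataclass
class PageStore:
    """Lossless store of the full history, split into pages with headers."""
    pages: dict = field(default_factory=dict)
    headers: dict = field(default_factory=dict)

class Memorizer:
    """Offline agent: writes each session to a page, keeps a light header.
    (Sketch only; the paper's Memorizer is a learned LLM policy.)"""
    def __init__(self, store):
        self.store = store

    def memorize(self, page_id, text):
        self.store.pages[page_id] = text
        # A real header would be an LLM summary; a keyword set stands in.
        self.store.headers[page_id] = set(text.lower().split())

class Researcher:
    """Online agent: searches the store and integrates a request context."""
    def __init__(self, store):
        self.store = store

    def search(self, query, top_k=2):
        # Stand-in for vector search / BM25: rank pages by keyword overlap.
        terms = set(query.lower().split())
        ranked = sorted(
            self.store.headers,
            key=lambda pid: len(terms & self.store.headers[pid]),
            reverse=True,
        )
        return ranked[:top_k]

    def build_context(self, query):
        # "Integration": concatenate retrieved pages into the JIT context.
        return "\n".join(self.store.pages[pid] for pid in self.search(query))
```

The JIT property lives in `build_context`: nothing is summarized away offline, so any detail in a stored page can still surface at request time if the search reaches it.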
Q1
1. What is the key innovation in GAM's approach compared to traditional memory systems?
It uses static memory compilation only
It employs Just-in-Time compilation with runtime optimization
It completely eliminates the need for memory storage
Q2
2. Which component of GAM is most sensitive to the size/capacity of the underlying LLM?
The Memorizer module
The Researcher module
The Page-store component
Q3
3. On which type of tasks did GAM show particularly strong performance?
Simple single-hop retrieval tasks
Basic text classification tasks
Multi-step retrieval and reasoning tasks