2025-11-17 Papers


Paper 1

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Published: 2025-11-13

Link: http://arxiv.org/pdf/2511.10629

1. 📘 Topic and Domain: The paper presents a latent upscaling method (LUA) for diffusion models in the domain of high-resolution image generation.
2. 💡 Previous Research and New Ideas: Building on previous work in latent diffusion models and super-resolution techniques, it proposes a novel lightweight adapter that performs upscaling in latent space before decoding, rather than relying on traditional pixel-space super-resolution or multi-stage diffusion.
3. ❓ Problem: The paper addresses the challenge of scaling diffusion models beyond their training resolutions without introducing artifacts, high computational costs, or requiring additional diffusion stages.
4. 🛠️ Methods: The authors implement a Swin Transformer-based adapter with scale-specific heads for 2x/4x upscaling, trained using a three-stage curriculum combining latent and pixel-space objectives, and designed to work across different VAE architectures.
5. 📊 Results and Evaluation: LUA achieves state-of-the-art single-decode fidelity (FID 180.80/176.90) at 2048² and 4096² resolutions while being significantly faster than baselines (3.52s vs 7.23s for 2048²), demonstrating successful cross-model generalization across SDXL, SD3, and FLUX.
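The scale-specific heads described above each end in a pixel shuffle, which trades channel depth for spatial resolution. Below is a minimal NumPy sketch of that depth-to-space step on a latent tensor; the shapes and channels-first layout are illustrative assumptions, and the convolutional backbone that would produce the input is omitted.

```python
import numpy as np

def pixel_shuffle(latent: np.ndarray, scale: int) -> np.ndarray:
    """Rearrange (C*scale^2, H, W) -> (C, H*scale, W*scale).

    This is the depth-to-space step a x2/x4 upscaling head would end
    with: each group of scale^2 channels is interleaved into a
    scale x scale spatial block.
    """
    c2, h, w = latent.shape
    c = c2 // (scale * scale)
    x = latent.reshape(c, scale, scale, h, w)
    # interleave the scale factors with the spatial axes:
    # (C, s, s, H, W) -> (C, H, s, W, s)
    x = x.transpose(0, 3, 1, 4, 2)
    return x.reshape(c, h * scale, w * scale)

# a 16-channel 8x8 "latent" upscaled x2 -> 4 channels at 16x16
z = np.arange(16 * 8 * 8, dtype=np.float32).reshape(16, 8, 8)
z_up = pixel_shuffle(z, scale=2)
print(z_up.shape)  # (4, 16, 16)
```

Because the rearrangement is purely an indexing operation, it adds no parameters; the learned capacity sits in the convolutions that precede it.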


[Figure: LUA (Latent Upscaler Adapter) workflow]
- Pipeline: text prompt + noise ε → generator G (FLUX/SD3/SDXL) → latent z (h×w×C) → LUA → upscaled latent (αh×αw×C) → frozen VAE decoder D → high-resolution image in a single decode pass.
- Architecture: input conv → channel adapt → shared SwinIR-style backbone φ(·) → scale-specific ×2 and ×4 heads, each ending in a pixel shuffle.
- Multi-stage training curriculum (125k steps per stage):
  - Stage I, latent structural alignment: L₁ loss in latent space, FFT loss for spectral matching, microstructure preservation.
  - Stage II, joint latent-pixel consistency: Stage I losses plus downsampling consistency and high-frequency matching.
  - Stage III, edge-aware refinement: pixel-space L₁ and FFT losses, EAGLE edge-aware loss, artifact suppression.
- Key features and benefits: drop-in adapter (no generator retraining); cross-VAE generalization (FLUX, SD3, SDXL); one model for both ×2 and ×4 upscaling; ~3× faster than pixel-space SR; single decode with no additional diffusion stages; quality comparable to multi-stage pipelines.
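Stage I of the curriculum combines an L₁ term in latent space with an FFT term for spectral matching. A minimal NumPy sketch of such a combined objective is below; the `fft_weight` hyperparameter and the exact form of the spectral term are assumptions for illustration, not values from the paper.

```python
import numpy as np

def stage1_loss(pred: np.ndarray, target: np.ndarray,
                fft_weight: float = 0.1) -> float:
    """Sketch of a Stage I-style objective: L1 in latent space plus an
    FFT magnitude term that penalizes spectral mismatch. `fft_weight`
    is an assumed hyperparameter."""
    l1 = np.abs(pred - target).mean()
    # compare magnitude spectra over the spatial axes
    spec_diff = (np.abs(np.fft.fft2(pred, axes=(-2, -1)))
                 - np.abs(np.fft.fft2(target, axes=(-2, -1))))
    return float(l1 + fft_weight * np.abs(spec_diff).mean())

latent = np.ones((4, 8, 8))
print(stage1_loss(latent, latent))  # 0.0 for a perfect match
```

The FFT term is what pushes the adapter to reproduce high-frequency structure that a plain L₁ loss tends to blur.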
Q1
1. What is the main innovation of LUA compared to traditional upscaling methods?
It performs upscaling in pixel space after decoding
It performs upscaling in latent space before final decoding
It uses multiple diffusion stages for upscaling
Q2
2. How many stages are there in LUA's training curriculum?
Two stages - latent alignment and pixel refinement
Four stages - each focusing on different resolution scales
Three stages - latent alignment, joint latent-pixel consistency, and edge-aware refinement
Q3
3. What is the runtime advantage of LUA at 2048² resolution compared to direct SDXL generation?
3.52s vs 7.23s (about 2x faster)
7.23s vs 3.52s (about 2x slower)
3.52s vs 28.99s (about 8x faster)

Paper 2

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Published: 2025-11-14

Link: http://arxiv.org/pdf/2511.11257

1. 📘 Topic and Domain: Development of an LLM-based agent (AIonopedia) for ionic liquid discovery in chemistry, combining artificial intelligence with materials science.
2. 💡 Previous Research and New Ideas: Building on previous work in LLMs, multimodal learning, and chemical property prediction, it introduces a novel approach that combines LLM capabilities with specialized tools for automated ionic liquid research.
3. ❓ Problem: Addresses challenges in ionic liquid property prediction including limited data availability, poor model accuracy, and fragmented research workflows that hinder efficient discovery of new ionic liquids.
4. 🛠️ Methods: Implements a two-stage training approach with multimodal contrastive learning, combining molecular graphs, SMILES sequences, and physicochemical descriptors, along with a GPT-5-powered agent that orchestrates multiple specialized tools.
5. 📊 Results and Evaluation: Achieved superior performance across multiple property prediction tasks, demonstrated strong out-of-distribution generalization, and successfully validated through wet-lab experiments, including discovery of a novel phosphorus-centered ionic liquid for NH3 absorption.
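The first stage of the training approach aligns modalities with contrastive learning. A generic InfoNCE-style sketch in NumPy is below, pairing graph and text embeddings of the same molecules; the encoders, embedding sizes, and temperature are assumptions, not details from the paper.

```python
import numpy as np

def info_nce(graph_emb: np.ndarray, text_emb: np.ndarray,
             temperature: float = 0.07) -> float:
    """Symmetric contrastive loss aligning graph and text embeddings.

    Row i of each matrix embeds the same molecule; matched pairs sit on
    the diagonal of the similarity matrix and are pulled together while
    mismatched pairs are pushed apart.
    """
    g = graph_emb / np.linalg.norm(graph_emb, axis=1, keepdims=True)
    t = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    logits = g @ t.T / temperature                      # (N, N)
    # cross-entropy toward the diagonal, in both directions
    lp_g2t = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    lp_t2g = logits.T - np.log(np.exp(logits.T).sum(axis=1, keepdims=True))
    return float((-np.diag(lp_g2t).mean() - np.diag(lp_t2g).mean()) / 2)
```

Aligning the modalities this way before supervised fine-tuning is a common remedy when labeled property data is scarce relative to unlabeled structures.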


[Figure: AIonopedia multimodal learning workflow for ionic liquid discovery]
- Pipeline: data collection (literature mining, expert curation, ~100k samples) → modality alignment (graph encoder, LLM encoder, contrastive learning) → fine-tuning (cross-attention, property regression, multi-task learning) → agent integration (GPT-5 planner, ReAct framework, tool orchestration).
- Modalities fused: molecular graphs, text (SMILES), and physicochemical descriptors.
- Property prediction categories: solute-solvent properties (solvation ΔG, transfer ΔG, hydration ΔG) and bulk properties (melting point, viscosity, mass density, surface tension).
- Agent tools: web searcher (literature retrieval), PubChem search (structure retrieval), data processor (Python interpreter, SMILES canonicalization, RDKit), property predictor (ML model), and molecular searcher (beam search), orchestrated via the ReAct loop (Thought → Action → Observation).
- Validation and applications: IL modification (anion/cation engineering), literature validation (hierarchical search), molecular screening (Tanimoto similarity), wet-lab validation of NH₃ absorption with [P₄₄₄₂]⁺[DEP]⁻, and OOD generalization (zero-shot discovery of novel IL systems).
- Key achievements: best RMSE across multiple datasets; ~100k samples covering 1500+ IL species; first P-centered IL for NH₃ absorption; end-to-end automated workflow from data to discovery.
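The agent's ReAct loop alternates Thought → Action → Observation, with each action dispatched to one of the registered tools. A minimal pure-Python sketch is below; the tool name and its stub implementation are hypothetical stand-ins for the real property-prediction model, shown only to illustrate the dispatch pattern.

```python
# Hypothetical tool registry; the real agent would wrap an ML property
# predictor, PubChem search, RDKit, etc. behind the same interface.
def property_predictor(smiles: str) -> str:
    return f"predicted properties for {smiles}: <model output>"

TOOLS = {"property_predictor": property_predictor}

def react_step(thought: str, action: str, action_input: str) -> str:
    """One Thought -> Action -> Observation cycle: the planner emits a
    thought plus an action; executing the named tool on the input
    yields the observation fed back to the planner."""
    if action not in TOOLS:
        return f"Observation: unknown tool '{action}'"
    return "Observation: " + TOOLS[action](action_input)

obs = react_step(
    thought="Need predicted properties before screening this candidate.",
    action="property_predictor",
    action_input="CCO",
)
print(obs)
```

In the full agent, the planner (GPT-5 in the paper) would generate the thought and action strings itself and iterate until it has enough observations to answer.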
Q1
1. What unique discovery did AIonopedia make in NH3 absorption that demonstrated its ability to explore new chemical spaces?
The first nitrogen-centered ionic liquid for NH3 absorption
The first phosphorus-centered ionic liquid for NH3 absorption
The first carbon-centered ionic liquid for NH3 absorption
Q2
2. What is the key architectural innovation in AIonopedia's property predictor that helps it handle limited labeled data?
Single-stage supervised learning with graph neural networks
Two-stage training with multimodal contrastive learning
Direct fine-tuning of pretrained language models
Q3
3. During the wet-lab validation of [P4442]+[DEP]- for NH3 absorption, what was the equilibrium uptake achieved at 95% NH3 partial pressure?
0.80 mol/mol
1.30 mol/mol
1.80 mol/mol

Paper 3

DoPE: Denoising Rotary Position Embedding

Published: 2025-11-12

Link: http://arxiv.org/pdf/2511.09146

1. 📘 Topic and Domain: Improving Rotary Position Embedding (RoPE) in transformer models to enhance long-context performance through denoising techniques.
2. 💡 Previous Research and New Ideas: Building on RoPE and attention mechanisms in transformers, it proposes a novel denoising approach that uses truncated matrix entropy to identify and suppress noisy attention heads.
3. ❓ Problem: Addresses the inherent limitations of RoPE that weaken length extrapolation and cause attention-sink phenomena in transformer models.
4. 🛠️ Methods: Uses truncated matrix entropy to detect outlier frequency bands in attention maps and applies three denoising strategies: DoPE-by-parts (selective band masking), DoPE-by-all (full head masking), and DoPE-by-Gaussian (noise replacement).
5. 📊 Results and Evaluation: Significantly improved retrieval accuracy and reasoning stability across extended contexts up to 64K tokens, with up to 10-point improvement without training, particularly effective in needle-in-a-haystack and many-shot in-context learning tasks.
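The core diagnostic in the method is a matrix-entropy effective rank computed from the Gram matrix of a head's rotated vectors: trace-normalize the eigenvalues into a distribution and exponentiate its entropy. A NumPy sketch is below; it captures only the effective-rank computation, and the frequency-band truncation step is omitted, so shapes and thresholds are illustrative assumptions.

```python
import numpy as np

def effective_rank(vectors: np.ndarray, eps: float = 1e-12) -> float:
    """Matrix-entropy effective rank of a set of (rotated) key/query
    vectors (rows). Builds the Gram matrix, normalizes its eigenvalues
    into a distribution, and returns exp(entropy). Low values flag
    heads whose attention mass collapses onto a few directions."""
    gram = vectors.T @ vectors                 # sum_j b_j b_j^T
    lam = np.clip(np.linalg.eigvalsh(gram), 0.0, None)
    lam = lam / (lam.sum() + eps)              # eigenvalue distribution
    entropy = -(lam * np.log(lam + eps)).sum()
    return float(np.exp(entropy))

# isotropic vectors spread over all 16 dims -> high effective rank;
# a rank-1 set collapses to an effective rank near 1
rng = np.random.default_rng(0)
iso = rng.standard_normal((256, 16))
collapsed = np.outer(rng.standard_normal(256), rng.standard_normal(16))
print(effective_rank(iso), effective_rank(collapsed))
```

Thresholding this score (mₕ = 1[ρʳₕ ≥ τ] in the paper's notation) then selects which heads to denoise, with no learned parameters involved.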


[Figure: DoPE (Denoising Rotary Position Embedding) workflow]
- Input: RoPE-rotated Q and K matrices.
- Spectral analysis via the Gram matrix Σₖ = Σⱼ bₖⱼ bₖⱼᵀ; matrix entropy Hₕ,ₖ = −tr(Σ̃ₕ,ₖ log Σ̃ₕ,ₖ); truncated effective rank ρʳₕ = exp(−Σᵢ λᵢ log λᵢ).
- Head selection based on entropy: mₕ = 1[ρʳₕ ≥ τ]; low entropy flags noisy heads.
- Denoising strategies:
  - DoPE-by-parts: frequency-band masking, mₕ,ₖ = 1[θₖ ≤ θ] with θ = 2π/L.
  - DoPE-by-all: head-level masking, Kᴿᴰ = mₕ Kᴿ and Qᴿᴰ = mₕ Qᴿ.
  - DoPE-by-Gaussian: replace masked heads with noise, Kᴿᴰ = mₕ Kᴿ + (1 − mₕ)ε, ε ~ N(0, σ²I).
- Denoised attention Attn = softmax(QᴿᴰKᴿᴰᵀ/√d) mitigates attention sinks and bright-band artifacts, improving length extrapolation.
- Key insights: low-frequency bands cause coherent alignment → attention sinks; truncated matrix entropy identifies noisy heads; denoising is parameter-free.
- Experimental results: NIH task at 24K improves 75.4 → 84.4; extrapolation up to 64K tokens; training-free; effective across models.
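Of the three strategies, DoPE-by-Gaussian keeps the keys of retained heads and swaps flagged heads' keys for Gaussian noise, per K' = mₕK + (1 − mₕ)ε. A NumPy sketch is below; the tensor shapes, σ value, and the toy head mask are illustrative assumptions, not values from the paper.

```python
import numpy as np

def dope_by_gaussian(keys: np.ndarray, head_mask: np.ndarray,
                     sigma: float = 0.02, seed: int = 0) -> np.ndarray:
    """Sketch of the DoPE-by-Gaussian variant on a (heads, seq, dim)
    key tensor: heads with mask 1 pass through unchanged, heads with
    mask 0 are replaced by eps ~ N(0, sigma^2 I)."""
    rng = np.random.default_rng(seed)
    eps = rng.normal(0.0, sigma, size=keys.shape)
    m = head_mask[:, None, None]               # broadcast over (seq, dim)
    return m * keys + (1.0 - m) * eps

keys = np.ones((4, 8, 16))                     # (heads, seq, head_dim)
mask = np.array([1.0, 1.0, 0.0, 1.0])          # head 2 flagged as noisy
denoised = dope_by_gaussian(keys, mask)
```

Replacing a sink-prone head with small isotropic noise (rather than zeroing it, as in DoPE-by-all) keeps its softmax scores finite and roughly uniform instead of concentrating on a few positions.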
Q1
1. What is the main mechanism used by DoPE to identify problematic attention heads?
Cosine similarity analysis
Truncated matrix entropy
Gradient-based head pruning
Q2
2. In the paper's experiments, what was the most effective variant of DoPE for handling noisy setups at 24k tokens?
DoPE-by-parts with selective band masking
DoPE-by-Gaussian with noise replacement
DoPE-by-all with full head masking
Q3
3. What interesting phenomenon did the authors observe when testing Many-Shot In-Context Learning with inserted exemplars?
Performance improved dramatically at all context lengths
The model completely ignored the inserted examples
Overall performance actually decreased despite having correct answers in context