2026-03-03 Papers


Paper 1

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Published: 2026-03-02

Link: http://arxiv.org/pdf/2603.02138

1. 📘 Topic and Domain: The paper focuses on generating vector animations in Lottie format from multi-modal instructions (text, image, video) using deep learning approaches in computer vision and graphics.
2. 💡 Previous Research and New Ideas: The paper builds on prior work in vector graphics generation, video generation models, and visual autoregressive models, proposing a novel Lottie tokenizer that converts JSON files into structured command sequences and an end-to-end framework for multi-modal vector animation generation.
3. ❓ Problem: The paper addresses the challenge of generating editable, resolution-independent vector animations from multi-modal inputs, as existing methods either generate raster videos lacking editability or struggle with the complex JSON structure of Lottie files.
4. 🛠️ Methods: The authors develop OmniLottie using a specialized Lottie tokenizer for efficient representation, train on a curated MMLottie-2M dataset with 2 million animations, and employ a pretrained vision-language model (Qwen2.5-VL) for autoregressive generation.
5. 📊 Results and Evaluation: OmniLottie achieves 88.3%, 93.3%, and 88.1% success rates for text-to-Lottie, text-image-to-Lottie, and video-to-Lottie tasks respectively, significantly outperforming baselines in visual quality (lowest FVD scores) and semantic alignment metrics.
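The tokenizer's core idea, converting nested Lottie JSON into a flat command-parameter sequence, can be sketched in a few lines. This is a toy illustration under stated assumptions: the command names (CMD_ANIMATION, CMD_SHAPE) and field abbreviations follow the paper's workflow figure, but the function, the JSON schema subset, and the flattening logic are simplified stand-ins, not OmniLottie's actual tokenizer.

```python
import json

# Toy sketch: flatten a Lottie-like JSON animation into a command-parameter
# token sequence. Command names come from the paper's figure; everything
# else (schema subset, helper name) is an illustrative assumption.

def tokenize_animation(lottie: dict) -> list:
    tokens = ["CMD_ANIMATION",
              lottie["fr"], lottie["ip"], lottie["op"],   # frame rate, in/out point
              lottie["w"], lottie["h"]]                   # canvas size
    for layer in lottie.get("layers", []):
        for shape in layer.get("shapes", []):
            tokens.append("CMD_SHAPE")
            # Emit vertex coordinates as flat spatial parameters.
            for x, y in shape["vertices"]:
                tokens.extend([x, y])
    return tokens

doc = {"fr": 60, "ip": 0, "op": 120, "w": 512, "h": 512,
       "layers": [{"shapes": [{"vertices": [(10, 20), (30, 40)]}]}]}

tokens = tokenize_animation(doc)
print(tokens)
# Far fewer symbols than the raw JSON string, mirroring (in spirit, not in
# exact numbers) the ~81% token reduction the paper reports.
print(len(tokens), "tokens vs", len(json.dumps(doc)), "JSON characters")
```

The point of the command-parameter view is that a language model predicts short, structured sequences instead of verbose JSON syntax (braces, quotes, key names), which is where the reported compression comes from.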


OmniLottie: Workflow Overview

- Phase 1: MMLottie-2M Dataset Construction. Web crawling (1.2M Lottie files), SVG conversion (800K animations), data cleaning (removing non-visual content), normalization (512×512, 0-60 fps), and multi-modal annotation.
- Phase 2: Lottie Tokenization. The Lottie JSON structure (layers of 5 types, transform properties, animation keyframes, effects and masks, metadata) is parameterized via command extraction, parameter mapping, offset-based encoding, and text tokenization, achieving 81% compression. The resulting Lottie vocabulary contains command, spatial, temporal, index, and text (Qwen) tokens, producing sequences such as CMD_ANIMATION FR IP OP W H and CMD_SHAPE X Y IX IY OX OY.
- Phase 3: OmniLottie Model. A pretrained Qwen2.5-VL backbone with an extended vocabulary (+Lottie tokens), trained autoregressively with cross-entropy loss on multi-modal input (text, image, video).
- Phase 4: Generation Tasks. Text-to-Lottie, text-image-to-Lottie, and video-to-Lottie.
- Phase 5: Output and Rendering. Detokenization followed by Lottie JSON generation.
Q1. What is the key innovation of the Lottie tokenizer in OmniLottie that makes it more efficient than directly generating raw JSON?
- It compresses Lottie files using lossy compression to reduce file size by 90%
- It converts JSON into command-parameter sequences, achieving 81% token reduction while preserving vector fidelity
- It replaces vector graphics with rasterized images that are easier to generate

Q2. Why does OmniLottie exclude certain layer types (Image, Audio, Camera) from its parameterization scheme?
- These layers are too computationally expensive to process during training
- These layers contain non-vector content or 3D complexities that cannot be fully parameterized in the model's design space
- These layers were accidentally omitted and will be added in future versions

Q3. What strategy does OmniLottie use to improve motion understanding by combining different data sources?
- It only uses professionally designed Lottie animations from web platforms for authentic motion patterns
- It trains exclusively on synthetic data generated from text-to-video models
- It mixes web-crawled Lottie files with SVG-derived animations augmented with procedural motions, finding a 30% SVG mixture optimal

Paper 2

OpenAutoNLU: Open Source AutoML Library for NLU

Published: 2026-03-02

Link: http://arxiv.org/pdf/2603.01824

1. 📘 Topic and Domain: The paper presents OpenAutoNLU, an open-source AutoML library specifically designed for natural language understanding tasks including text classification and named entity recognition.
2. 💡 Previous Research and New Ideas: The paper builds on existing AutoML frameworks (AutoIntent, AutoGluon, LightAutoML, H2O) but introduces automatic data-aware training regime selection that requires no manual configuration, choosing between AncSetFit, SetFit, or full fine-tuning based on dataset characteristics.
3. ❓ Problem: The paper addresses the challenge that existing AutoML frameworks lack ease of use and NLP-centric design, requiring complex configuration and failing to automatically select appropriate training methods based on data size and label distribution.
4. 🛠️ Methods: The authors use a deterministic method selection based on minimum per-class sample count (AncSetFit for 2-5 examples, SetFit for 5-80 examples, full transformer fine-tuning for >80 examples) with integrated data quality diagnostics, configurable OOD detection, and LLM-powered data augmentation.
5. 📊 Results and Evaluation: OpenAutoNLU achieved best or tied performance on 3 out of 4 intent classification benchmarks (HWU64, MASSIVE, SNIPS) with superior OOD detection capabilities, maintaining strong in-domain classification quality while effectively detecting out-of-distribution samples without explicit OOD supervision.
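The deterministic regime selection is simple enough to sketch directly. The thresholds below are taken from the summary (2-5 examples per class selects AncSetFit, 5-80 selects SetFit, more than 80 selects full fine-tuning); the function name and its API are illustrative, not OpenAutoNLU's actual interface.

```python
from collections import Counter

# Sketch of data-aware training-regime selection: pick a method from the
# minimum per-class sample count. Thresholds follow the paper's summary;
# the function signature is a hypothetical stand-in for the library's API.

def select_training_regime(labels: list) -> str:
    n_min = min(Counter(labels).values())
    if n_min < 2:
        raise ValueError("need at least 2 examples per class")
    if n_min <= 5:
        return "AncSetFit"      # very-few-shot: anchored SetFit
    if n_min <= 80:
        return "SetFit"         # few-shot contrastive fine-tuning
    return "fine-tuning"        # enough data for full transformer training

print(select_training_regime(["a"] * 3 + ["b"] * 100))    # AncSetFit (min = 3)
print(select_training_regime(["a"] * 50 + ["b"] * 90))    # SetFit (min = 50)
print(select_training_regime(["a"] * 100 + ["b"] * 200))  # fine-tuning
```

Note that the rarest class alone drives the decision, which is why (per quiz Q3 below) underrepresented classes matter even when most classes are data-rich.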


OpenAutoNLU Workflow

- Training data passes through optional data-quality diagnostics (Retag, Uncertainty, V-Info, Cartography).
- Data-aware method selection: 2 ≤ nₘᵢₙ ≤ 5 → AncSetFit; 5 < nₘᵢₙ ≤ 80 → SetFit; nₘᵢₙ > 80 → fine-tuning.
- Data optimization: upsampling, downsampling, augmentation.
- OOD detection: Mahalanobis, max softmax, logit-based.
- HPO engine (Optuna); model export to ONNX / Torch for inference.
- LLM integration: test generation, augmentation, domain analysis.
- NER support via BIO tagging.
- Key features: automatic regime selection, integrated data-quality tools, configurable OOD detection, and a unified API for classification and NER.
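Of the OOD detectors listed in the workflow, the Mahalanobis approach is the one that needs no OOD supervision: fit a Gaussian to in-domain embeddings and flag points far from the fitted distribution. A minimal sketch on synthetic data (all numbers and shapes here are assumptions for illustration, not OpenAutoNLU's implementation):

```python
import numpy as np

# Minimal Mahalanobis-distance OOD scoring: estimate mean and covariance
# from in-domain embeddings, then score new points by their distance to
# that distribution. Larger distance = more likely out-of-distribution.

rng = np.random.default_rng(0)
in_domain = rng.normal(0.0, 1.0, size=(500, 8))   # synthetic 8-dim embeddings
mean = in_domain.mean(axis=0)
cov_inv = np.linalg.inv(np.cov(in_domain, rowvar=False))

def mahalanobis(x: np.ndarray) -> float:
    d = x - mean
    return float(np.sqrt(d @ cov_inv @ d))

near = mahalanobis(rng.normal(0.0, 1.0, size=8))  # drawn from the same distribution
far = mahalanobis(np.full(8, 10.0))               # clearly out-of-distribution
print(near < far)  # the OOD point scores a larger distance
```

In practice the class-conditional variant (distance to the nearest class mean, shared covariance) is common; thresholding the score then yields an accept/reject decision without any labeled OOD examples.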
Q1. What is the key innovation of OpenAutoNLU's data-aware method selection?
- It uses a meta-learning model trained on thousands of datasets to predict the best algorithm
- It deterministically selects training methods based on minimum per-class sample count thresholds
- It runs all methods in parallel and chooses the best performer

Q2. How does OpenAutoNLU handle out-of-distribution (OOD) detection differently from AutoIntent?
- OpenAutoNLU can detect OOD samples without explicit OOD supervision during training
- OpenAutoNLU requires more OOD training examples than AutoIntent
- OpenAutoNLU only works with supervised OOD detection

Q3. What happens when OpenAutoNLU encounters a dataset where 40% of classes have fewer than 80 training examples?
- It discards the underrepresented classes and only trains on well-represented ones
- It automatically upsamples underrepresented classes using data augmentation to reach 81 examples
- It switches to a completely different algorithm designed for imbalanced datasets

Paper 3

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Published: 2026-03-02

Link: http://arxiv.org/pdf/2603.01562

1. 📘 Topic and Domain: The paper addresses rubric-based evaluation for reward models in large language model alignment, focusing on benchmarking how well models can generate and apply evaluation criteria.
2. 💡 Previous Research and New Ideas: The paper builds on existing reward model benchmarks (RewardBench, RM-Bench) and rubric-guided evaluation paradigms, proposing RubricBench as the first benchmark with human-annotated rubrics for assessing model-generated evaluation criteria.
3. ❓ Problem: The paper aims to solve the lack of reliable benchmarks for rubric-guided evaluation and the gap between model-generated and human-quality evaluation rubrics in reward modeling.
4. 🛠️ Methods: The authors curated 1,147 challenging preference pairs through multi-dimensional filtering, annotated them with expert-derived atomic rubrics, and evaluated models under three conditions: vanilla, self-generated rubrics, and human-annotated rubrics.
5. 📊 Results and Evaluation: Results show a 27% accuracy gap between model-generated and human rubrics, with rubric-aware models reaching ~58% accuracy versus 40-47% for traditional approaches, demonstrating that rubric quality is the primary bottleneck in evaluation performance.
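The rubric-guided evaluation paradigm the paper benchmarks can be sketched as scoring each candidate response against a set of atomic rubric checks and preferring the response that satisfies more of them. The checker functions and rubric items below are toy stand-ins, not RubricBench's actual annotations or judge model (which uses an LLM, not string matching):

```python
# Toy sketch of rubric-guided preference evaluation: each response is
# checked against atomic rubric items; the one satisfying more items wins.
# Real rubric application uses an LLM judge, not substring checks.

def rubric_verdict(response_a: str, response_b: str, rubrics: list) -> str:
    score_a = sum(check(response_a) for check in rubrics)
    score_b = sum(check(response_b) for check in rubrics)
    return "A" if score_a >= score_b else "B"

# The paper's annotation stage yields 2-10 atomic rubrics per item;
# here are three illustrative checks for a hypothetical coding prompt.
rubrics = [
    lambda r: "sort" in r,          # mentions the required operation
    lambda r: "O(n log n)" in r,    # states the complexity constraint
    lambda r: len(r) < 200,         # stays concise
]

a = "Use merge sort, which runs in O(n log n) time."
b = "Just loop over the list twice."
print(rubric_verdict(a, b, rubrics))  # A
```

The benchmark's three conditions differ only in where `rubrics` comes from: nowhere (vanilla, direct verdict), generated by the model itself (~58% accuracy), or authored by human experts (~85% accuracy), which is what exposes the 27% rubric gap.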


RubricBench: Methodology Flow

- Stage I: Data Curation. Source pools across 5 domains, filtered for input complexity, output surface bias, and process failures.
- Stage II: Rubric Annotation. Instruction-only analysis extracting explicit and implicit rules, yielding 2-10 atomic rubrics per item.
- Stage III: Quality Control. Expert reconciliation, structural validation, and stress testing, producing RubricBench: 1,147 samples with human rubrics.
- Evaluation framework: Vanilla (direct preference verdict, no intermediate reasoning, ~40-47% accuracy); Self-Generated Rubrics (model derives rubrics, then verifies responses, ~58% accuracy); Human Rubrics (expert-authored ground-truth constraints, ~85% accuracy).
- Key finding, the Rubric Gap: a 27% performance gap between self-generated and human-annotated rubrics.
Q1. What is the 'Rubric Gap' identified in the paper, and what does it reveal about current LLMs?
- A 27% performance deficit showing that models struggle to autonomously generate valid evaluation criteria compared to using human-annotated rubrics
- A 58% accuracy ceiling that represents the maximum performance achievable by any rubric-based evaluation system
- A 40-47% baseline accuracy that shows traditional reward models perform better than rubric-aware approaches

Q2. According to the paper, what happens when you scale test-time compute (more rubrics or refinement steps) for model-generated rubrics?
- Performance improves linearly, eventually matching human rubric quality at 32 generated rubrics
- Diminishing or even negative returns occur, while scaling human rubrics shows a robust positive correlation
- The rubric gap closes completely after 3 refinement iterations, proving compute can solve quality issues

Q3. What type of failure mode does the paper identify when models generate rubrics for impossible or underspecified tasks?
- Models correctly identify task infeasibility and generate rubrics that enforce honest refusal
- Models suffer from 'Attention Displacement', focusing on surface implementation details while missing feasibility constraints
- Models simply refuse to generate any rubrics when encountering ambiguous instructions