ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas Paper • 2601.21558 • Published 6 days ago • 53
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published 5 days ago • 68
Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models Paper • 2601.20354 • Published 7 days ago • 109
Beyond Imitation: Reinforcement Learning for Active Latent Planning Paper • 2601.21598 • Published 6 days ago • 9
We Got Claude to Build CUDA Kernels and teach open models! Article • 7 days ago • 111
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published 8 days ago • 76
Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation Paper • 2601.20614 • Published 7 days ago • 115
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper • 2601.17124 • Published 11 days ago • 31
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents Paper • 2601.16973 • Published 11 days ago • 40
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 13 days ago • 181
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published 12 days ago • 15
Endless Terminals: Scaling RL Environments for Terminal Agents Paper • 2601.16443 • Published 12 days ago • 16
LLM-in-Sandbox Elicits General Agentic Intelligence Paper • 2601.16206 • Published 12 days ago • 83
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 13 days ago • 53
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation Paper • 2601.15369 • Published 13 days ago • 20