tencent/CL-bench
Viewer
•
Updated
•
1.9k
•
1
•
27
None defined yet.
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs