Article • The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ • about 8 hours ago • 7
Article • Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek • 7 days ago • 37
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 87
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper • 2405.04434 • Published May 7, 2024 • 25
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 140
Article • From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels • Aug 18, 2025 • 90
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 437
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published May 14, 2025 • 75
Article • AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan • Jun 2, 2025 • 14
Article • 5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub • Jul 15, 2025 • 24
The Gradient of Generative AI Release: Methods and Considerations Paper • 2302.04844 • Published Feb 5, 2023 • 8