How do you keep reinforcement learning for large reasoning models from stalling on a few very long, …
Tag:
Context
-
-
TECH
Moonshot AI Releases Kimi K2: A Trillion-Parameter MoE Model Focused on Long Context, Code, Reasoning, and Agentic Behavior
by Techaiappby Techaiapp 4 minutes readKimi K2, launched by Moonshot AI in July 2025, is a purpose-built, open-source Mixture-of-Experts (MoE) model—1 trillion total …