
09/04/2025 · Groq
Introducing Kimi K2‑0905 on GroqCloud
Moonshot AI’s cutting‑edge model, moonshotai/Kimi-K2-Instruct-0905
, is now live on GroqCloud. This integration brings day zero support for the latest frontier open model alongside production‑grade speed, low latency, and predictable cost empowering developers to take agentic coding to the next level.
Key Features of Kimi K2‑0905 on GroqCloud
- Full 256k Context Window: The largest context window of any model on GroqCloud to date.
- Prompt Caching: Up to 50% cost savings on cached tokens and dramatically faster response times. When paired with the 256k context window, this is a massive unlock for agentic coding applications, where a large amount of context is shared between queries.
- Leading Price‑to‑Performance: 200+ T/s at a blended price of $1.50 / M tokens ($1.00 / M input tokens; $3.00 / M output tokens), helping to provide top‑tier performance without surprise bills.
- Note: With this release, requests to the original Kimi K2 model will be routed to this new version.
Enhanced Capabilities
Kimi K2‑0905 delivers a suite of upgrades for developers over the previous Kimi K2 release including:
- Improved Agentic Coding: More reliable code generation, rivaling that of frontier closed models, especially for front-end development and tool calling.
- 256k Context Window: Support larger complex, multi‑turn interactions without chopping prompts.
Get Started with Kimi K2-0905 on GroqCloud
Try moonshotai/Kimi-K2-Instruct-0905
via the GroqCloud Developer Console, available in the Playground and the API.
Get immediate access to Moonshot.AI’s new open model or scale without rate limits by upgrading to a GroqCloud paid tier.