
08/05/2025 · Groq
Day Zero Support for OpenAI Open Models
Fast and Affordable AI Inference For the World’s Most Popular Models
We're excited to announce that GroqCloud now supports the much anticipated OpenAI open models, gpt-oss-120B
and gpt-oss-20B
! This launch brings day-zero support for the latest open models, empowering developers worldwide to build innovative AI applications with unprecedented speed, scale, and production reliability.
Full Model Capabilities
To get the most out of OpenAI's open models, extended context and tools like code execution and browser search are essential. Groq's platform delivers these capabilities from day zero, with full support for 128K token context length and built-in tools such as code execution and browser search. This enables developers to build complex workflows, provide accurate and relevant information, and leverage real-time reasoning.
In conjunction with the release of these models, we have also added a new Responses API on GroqCloud that is fully compatible with OpenAI's Responses API, making it easy to integrate advanced conversational AI capabilities into your applications.
Unmatched Price-Performance
Groq's purpose-built stack is designed to deliver low cost per token for OpenAI's new models while maintaining speed and accuracy. With gpt-oss-120B
and gpt-oss-20B
available on GroqCloud, developers can run cutting-edge AI workloads while keeping costs low and latency predictable.
gpt-oss-120B
is currently running at 500+ t/s and gpt-oss-20B
is currently running at 1000+ t/s on GroqCloud.
Groq is offering OpenAI’s latest open models at the following pricing:
gpt-oss-120B
: $0.15 / M input tokens and $0.75 / M output tokensgpt-oss-20B
: $0.10 / M input tokens and $0.50 / M output tokens
Note: For a limited time, tool calls used with OpenAI's open models will not be charged. Learn more here.
Global Reach from Day Zero
Groq's global footprint across North America, Europe, and the Middle East ensures reliable, high-performance AI inference wherever developers operate. Through GroqCloud, teams worldwide can access OpenAI's open models with minimal latency with the models deployed at datacenters in the U.S., Canada, Europe, and Middle East.
About OpenAI's New Open Models
OpenAI’s open models give anyone—from individual developers to large enterprises to governments—the freedom to run and customize AI on their own infrastructure, democratizing access to AI across industries, communities, and countries globally.
Get Started with GPT-OSS on GroqCloud
Try gpt-oss via the GroqCloud Developer Console as well as API calls.
Get immediate access to OpenAI’s new open models or scale without rate limits by upgrading to a GroqCloud paid tier.