On-demand Pricing for
Tokens-as-a-Service

Groq powers leading openly-available AI models.

Other models are available for specific customer requests including fine tuned models. Send us your inquiries here.

Large Language Models (LLMs)

AI ModelCurrent Speed(Tokens per Second)Input Token Price(Per Million Tokens)Output Token Price(Per Million Tokens)
Llama 4 Scout (17Bx16E)460$0.11
(9.09M / $1)*
$0.34
(2.94M / $1)*
Try NowModel Card
Llama 4 Maverick (17Bx128E)240$0.20
(5M / $1)*
$0.60
(1.6M / $1)*
Try NowModel Card
DeepSeek R1 Distill Llama 70B275$0.75
(1.33M / $1)*
$0.99
(1.01M / $1)*
Try NowModel Card
Qwen QwQ 32B (Preview) 128k400$0.29
(3.44M / $1)*
$0.39
(2.56M / $1)*
Try NowModel Card
Mistral Saba 24B330$0.79
(1.27M / $1)*
$0.79
(1.27M / $1)*
Try Now
Llama 3.3 70B Versatile 128k275$0.59
(1.69M / $1)*
$0.79
(1.27M / $1)*
Try NowModel Card
Llama 3.1 8B Instant 128k750$0.05
(20M / $1)*
$0.08
(12.5M / $1)*
Try NowModel Card
Llama 3 70B 8k330$0.59
(1.69M / $1)*
$0.79
(1.27M / $1)*
Try NowModel Card
Llama 3 8B 8k1250$0.05
(20M / $1)*
$0.08
(12.5M / $1)*
Try NowModel Card
Gemma 2 9B 8k500$0.20
(5M / $1)*
$0.20
(5M / $1)*
Try NowModel Card
Llama Guard 3 8B 8k765$0.20
(5M / $1)*
$0.20
(5M / $1)*
Try NowModel Card

*Approximate number of tokens per $

Text-to-Speech (TTS) Models

AI ModelCharacters /sPrice(Per M Characters)
PlayAI Dialog v1.0140$50.00Try NowModel Card

Automatic Speech Recognition (ASR) Models

AI ModelSpeed FactorPrice(Per Hour Transcribed)
Whisper V3 Large189x$0.111*Try NowModel Card
Whisper Large v3 Turbo216x$0.04*Try NowModel Card
Distil-Whisper250x$0.02*Try NowModel Card

*For ASR models above, Groq charges a minimum of 10 seconds per request.

Batch API

The Batch API is now available for Dev Tier customers and currently offered at a 25% discount rate. Batch processing lets you run thousands of API requests at scale by submitting your workload as a batch to Groq and letting us process it with a 24-hour turnaround. 

Now through the end of April 2025, we’re doubling our discount on Batch Processing to 50% off for all paid GroqCloud customers!

Learn more about Batch pricing and how to get started here

For enterprise API solutions or on-prem deployments, please fill out the form on our Enterprise Access Page.

Never miss a Groq update! Sign up below for our latest news.