Resources

For general press inquiries, reach out to our PR team.

08/05/2025
Day Zero Support for OpenAI Open Models
08/01/2025
Inside the LPU: Deconstructing Groq’s Speed
07/31/2025
OpenBench: Open, Reproducible Evals
06/16/2025
Build Faster with Groq + Hugging Face
06/10/2025
GroqCloud™ Now Supports Qwen3 32B
06/03/2025
Introducing GroqCloud™ LoRA Fine-Tune Support: Unlock Efficient Model Adaptation for Enterprises
05/27/2025
From Speed to Scale: How Groq Is Optimized for MoE & Other Large Models
05/16/2025
How to Build Your Own AI Research Agent with One Groq API Call
04/29/2025
The Official Llama API, Accelerated by Groq
04/15/2025
Now in Preview: Groq’s First Compound AI System
04/05/2025
Llama 4 Live Today on Groq — Build Fast at the Lowest Cost, Without Compromise
03/26/2025
Build Fast with Text-to-Speech
03/18/2025
Groq & Vercel Partner To Make Building Fast and Simple
03/13/2025
Batch Processing with GroqCloud™ for AI Inference Workloads
03/13/2025
Build Fast with Word-Level Timestamping
03/08/2025
A Guide to Reasoning with Qwen QwQ 32B
03/07/2025
What is a Language Processing Unit?
03/05/2025
Qwen QwQ 32B Running Same Day As Release
03/01/2025
Thank You! 1 Million Developers Now On GroqCloud™
02/28/2025
How to Win Hackathons with Groq
02/26/2025
Mistral Saba Added to GroqCloud™ Model Suite
02/11/2025
GroqCloud™ Now Offers qwen-2.5-32b and deepseek-r1-distill-qwen-32b
02/10/2025
GroqCloud™ Developer Tier Self-serve Access Now Available
02/10/2025
Saudi Arabia Announces $1.5 Billion Expansion to Fuel AI-powered Economy with Groq
01/28/2025
GroqCloud™ Makes DeepSeek R1 Distill Llama 70B Available
01/24/2025
Largest, Most Capable ASR Model Now Faster on GroqCloud™
12/12/2024
Understanding AI 101: What is Inference in Machine Learning and AI Applications?
12/06/2024
A New Scaling Paradigm: Meta's Llama 3.3 70B Challenges "Death of Scaling Law"
12/06/2024
New AI Inference Speed Benchmark for Llama 3.3 70B, Powered by Groq
11/15/2024
Groq First Generation 14nm Chip Just Got a 6x Speed Boost: Introducing Llama 3.3 70B Speculative Decoding on GroqCloud™
11/06/2024
The Five Future Stages of Generative AI
10/21/2024
The Crucial Role of Context Length in Large Language Models for Business Applications
10/09/2024
Whisper Large v3 Turbo Now Available on Groq, Combining Speed & Quality for Speech Recognition
09/25/2024
Meta and Groq Continue To Build Open-source Developer Ecosystem as Llama 3.2 Launches
09/12/2024
Unleashing the Power of Fast AI Inference: Groq and Aramco Digital Partner to Establish World-leading Data Center
09/03/2024
Introducing LLaVA V1.5 7B on GroqCloud
08/20/2024
Distil-Whisper is Now Available to the Developer Community on GroqCloud™ for Faster and More Efficient Speech Recognition
08/09/2024
Insights from VB Transform 2024 with Jonathan Ross, Groq CEO & Founder
07/23/2024
Llama 3.1 by Meta Now Available on Groq
07/16/2024
Introducing Llama-3-Groq-Tool-Use Models
06/24/2024
Groq Runs Whisper Large V3 at a 164x Speed Factor According to New Artificial Analysis Benchmark
04/20/2024
12 Hours Later, Groq Deploys Llama 3 Instruct (8 & 70B) by Meta AI on Its LPU™ Inference Engine
03/25/2024
What NVIDIA Didn’t Say
03/11/2024
Groundbreaking Gemma 7B Performance Running on the Groq LPU™ Inference Engine
02/08/2024
ArtificialAnalysis.ai LLM Benchmark Doubles Axis To Fit New Groq LPU™ Inference Engine Performance Results
01/11/2024
Groq LPU™ Inference Engine Crushes First Public LLM Benchmark
01/03/2024
Retrieval Augmented Generation with Groq API
11/09/2023
HumanPlus: The Co-evolution of AI & Human Potential
07/28/2023
Every. Word. Matters.
03/09/2023
Automated Discovery Meets Automated Compilation
12/16/2021
Groq Accelerates COVID Drug Discovery by 333x for Argonne National Lab
04/14/2021
Brave Thinking
10/22/2019
Why AI Requires a New Chip Architecture
10/21/2019
World, Meet Groq