Groq Smashes LLM Performance Record Again Using an LPU™ System With No Response From GPU Companies
Groq, an artificial intelligence (AI) solutions provider, today announced it has more than doubled its inference performance of the Large Language Model (LLM), Llama-2 70B, in just three weeks and is now running at more than 240 tokens per second (T/s) per user on its LPU™ system. As mentioned in its previous press release, Groq was the […]
Groq Selects Samsung Foundry to Bring Next-gen LPU™ to the AI Acceleration Market
Groq, an artificial intelligence (AI) inference systems innovator, today announced it has contracted with Samsung’s growing Foundry business to be its next-gen silicon partner, solidifying Groq’s product roadmap with a US-based foundry services provider.
Groq’s Record-breaking LPU™ Hits 100 Tokens Per Second Per User On A Massive AI Model
Groq’s newly announced language processor, the Groq LPU, has demonstrated that it can run 70-billion-parameter enterprise-scale language models at a record speed of more than 100 tokens per second.
Groq™ First to Achieve 100 Tokens Per Second Per User on Meta AI’s Llama-2 70B, Leading All Artificial Intelligence Solutions Providers in Inference Performance
Groq, an artificial intelligence (AI) solutions provider, today announced it now runs the Large Language Model (LLM), Llama-2 70B, at more than 100 tokens per second (T/s) per user on a Groq LPU™, the newly defined category for Groq silicon architecture.
AI Accelerator Groq™ Adapts and Runs LLaMA, the Meta™ Chatbot Model and Competitor to ChatGPT, for Its Systems
Facebook® parent, Meta, released LLaMA, which can be used by chatbots to generate human-like text, on February 24th. Three days later the Groq team downloaded the model and within a few days had it running on a production GroqNode™ server, including eight GroqChip™ inference processors. This is a rapid time-to-functionality; a development task that can often […]
Groq adapts Meta’s chatbot for its own chips in race against Nvidia
Groq modified LLaMA, a large language model released last month by Facebook parent Meta Platforms Inc that can be used to power bots to generate human-like text.
Groq™ Partners With New Customer, OneNano™, Providing Ultra-low Latency for Next Generation Cryptocurrency Exchange (CEX)
Today, Groq announced a new partnership with customer OneNano, a next generation cryptocurrency exchange (CEX) platform founded by leaders with decades of experience in financial and high-speed trading.
Researchers accelerate fusion research with Argonne’s Groq AI platform
Argonne scientists leverage ALCF AI Testbed system to address real-time inference demands for fusion energy research project.
Groq First to Announce Performance Advantage Results With STAC-ML™ Markets (Inference) Benchmark, Meeting Needs of Financial Services Industry
Today, the Securities Technology Analysis Center (STAC®) published audited benchmarking results from Groq for the financial industry, showcasing ultra-low latency, especially at low batch sizes such as batch 1. Over the last few years, the financial services industry has been asking vendors to show performance numbers on market-specific workloads. Amongst the compute incumbents in the […]
Cybersecurity Is Entering The High-Tech Era
There’s a sea change underway in how the federal government—specifically the Defense Department—is going to approach cybersecurity. It’s one that’s going to create a more fluid and more complex landscape in which cybersecurity firms and technologies need to be ready to operate—a landscape in which speed can’t be sacrificed for the sake of precision, or […]
US Army Analytics Group Confirms 1000x Performant Cybersecurity Technology by Entanglement AI™, Run on Groq™ Hardware, Advancing National Security Systems
Using Groq hardware, Entanglement AI solves cybersecurity anomaly detection three orders of magnitude faster than traditional methods, a US Army Validation Report confirms. The report demonstrates a dramatically faster cybersecurity anomaly detection capability, with far better accuracy and fewer false positives, than any known technology. Groq now has multiple customers across verticals who […]
Meet us at SC22 | Booth 3047
