What is generative AI?

What is large language model (LLM) inference?

Why should I care about fast inference?