What is Generative AI
What is large language model (LLM) inference
Why should I care about fast inference