Inference
Inference is the stage where a trained AI model is actually used - the moment it takes new input and produces an answer or result.
In this guide
What Inference means
AI has two main phases. Training is when the model learns from data, which happens once and is very intensive. Inference is everything after that: each time the finished model is asked to do something, it performs inference.
Every time you type a question into a chatbot and get a reply, that reply is the result of inference. The model is not learning from your message - it is applying what it already learned to produce your answer.
Why Inference matters
Inference is the part of AI you interact with and pay for when building tools. Understanding it helps you think clearly about speed and cost.
Frequently asked questions
More AI terms
Ready to build the AI skills your future depends on?
Take the free 5-minute quiz and get a personalized learning plan built around your goals, schedule, and experience.