Inference
Definition
The process of running an AI model to get a response. When you send a prompt to ChatGPT and get an answer back, that's inference. It's the 'using' phase, as opposed to the 'training' phase.
Example
Every time you press Enter in ChatGPT, the model runs inference - processing your prompt through billions of parameters to generate a response, one token at a time.