You cannot reliably steer something you do not understand. This lesson demystifies LLMs just enough to make your prompting deliberate instead of superstitious.
An LLM does exactly one thing: given a sequence of tokens, it predicts a probability distribution over the next token.
"The capital of France is" → Paris (0.92) the (0.01) a (0.004) ...
It samples a token, appends it, and repeats. Everything else — reasoning, summarisation, coding — is an emergent behaviour of doing this extremely well over trillions of tokens of training text.
| Stage | What happens | Effect on prompting |
|---|---|---|
| Pre-training | Predict next token on the internet | Broad knowledge, mimics patterns |
| Instruction tuning | Trained on (instruction → response) pairs | Follows commands, not just continues text |
| RLHF / alignment | Optimised toward human preferences | Helpful, cautious, "assistant" persona |
This is why a modern chat model responds to "Summarise this" instead of just continuing your sentence — instruction tuning taught it that instructions expect compliance, not completion.
Good prompting works with these properties (e.g. "show your steps" for math, "say 'I don't know' if unsure" for facts) rather than fighting them.
Mental model: An LLM is an extremely well-read improviser, not a database and not a calculator.