Glossary

Glossary of AI Terms

Plain-English definitions of the AI terms you'll actually run into — no jargon, no fluff.

Agent
An AI system that can plan, take actions, and use tools to complete a multi-step task on your behalf.
API (Application Programming Interface)
A way for software to talk to other software — how apps connect to AI models behind the scenes.
Chain of Thought
A prompting technique that asks the model to reason step by step before giving a final answer.
Context Window
How much text (prompt + conversation + documents) the model can consider at once. Bigger windows let you feed in more material.
Embedding
A numeric representation of text, images, or audio that lets AI compare meaning — the backbone of semantic search and RAG.
Fine-Tuning
Training an existing model further on your own data so it specializes in your style, domain, or task.
GPT (Generative Pre-trained Transformer)
A family of large language models from OpenAI. Often used loosely to mean any chat-style AI.
Generative AI
AI that creates new content — text, images, audio, video, code — rather than just classifying or predicting.
Hallucination
When an AI confidently produces information that is incorrect or made up. Always verify high-stakes outputs.
Inference
The act of running a trained model to get an output. Every time you send a prompt, you're paying for inference.
LLM (Large Language Model)
A model trained on huge amounts of text to understand and generate human-like language. ChatGPT, Claude, and Gemini are LLMs.
Multimodal
A model that can work across multiple input types — text, images, audio, and video — in a single conversation.
Prompt
The instruction you give an AI model. The clearer and more specific, the better the result.
Prompt Engineering
The craft of writing prompts that consistently get useful results from AI models.
RAG (Retrieval-Augmented Generation)
A pattern where the model looks up relevant documents first, then uses them to answer — great for grounding answers in your own data.
Reasoning Model
A model designed to think longer before answering, trading speed for more accurate multi-step problem solving.
Reinforcement Learning from Human Feedback (RLHF)
A training technique where humans rate AI responses to teach the model what 'good' looks like.
System Prompt
Hidden instructions that set the AI's role, tone, and rules before you start chatting.
Temperature
A setting that controls randomness. Low = focused and repeatable. High = more creative and varied.
Token
The unit AI models read and write in — roughly ¾ of a word in English. Pricing and context limits are measured in tokens.
Training Data
The text, images, or other content a model learned from. Quality and recency of training data shape what the model knows.
Transformer
The neural network architecture behind nearly every modern LLM. The 'T' in GPT.
Vector Database
A database optimized for storing and searching embeddings — the storage layer powering most RAG systems.
Zero-Shot / Few-Shot
Zero-shot: asking the model to do a task with no examples. Few-shot: giving it a handful of examples in the prompt first.