AI Agent Glossary
90+ terms and concepts defined clearly
A2A (Agent-to-Agent Protocol)
Protocols & StandardsA communication protocol enabling direct agent-to-agent interaction, message passing, and coordination without human mediation.
Agent
Core ConceptsAn autonomous software entity capable of perceiving its environment, reasoning about information, making decisions, and taking actions to achieve specific goals.
Agent Evaluation
ProductionSystematic testing and measurement of agent performance against benchmarks, success criteria, and quality metrics.
Agentic AI
Core ConceptsAI systems designed to act autonomously with agency—the ability to make decisions, take action, and adapt behavior based on goals and environmental feedback.
AutoGen (Microsoft)
Frameworks & ToolsA framework enabling conversations between multiple LLM agents and human users for collaborative problem-solving.
Autonomous Agent
Core ConceptsAn agent capable of independent operation with minimal human intervention, making decisions and executing actions autonomously.
AWS Bedrock Agents
Frameworks & ToolsAWS managed service for running agents that automatically handle orchestration, memory, and integration with AWS services.
Caching
ProductionStoring frequently accessed results temporarily to avoid redundant computation and improve response times.
Chain of Thought
Core ConceptsA prompting technique where an AI is instructed to break down complex problems into intermediate reasoning steps, improving solution quality.
Claude Agent SDK
Frameworks & ToolsAnthropic\
Constitutional AI
Safety & ControlAn approach training AI systems to follow explicit principles and guidelines, improving safety and alignment with human values.
Context Management
Memory & StateStrategies for maintaining, updating, and optimizing the information available to agents within context limits.
Context Window
Memory & StateThe maximum amount of text (in tokens) that a language model can process at once, limiting how much history and context it can access.
CrewAI
Frameworks & ToolsA framework specifically designed for building multi-agent systems with role-based agents, tasks, and hierarchical management.
Distributed Tracing
ProductionTracking requests and operations across multiple services and components, enabling visibility into complex distributed workflows.
Embeddings
Memory & StateNumerical vector representations of text that capture semantic meaning, enabling comparison and retrieval based on semantic similarity.
Episodic Memory
Memory & StateMemory of specific events, experiences, and interactions, typically time-stamped and contextual, useful for learning from experience.
Error Handling
ProductionMechanisms for detecting, managing, and recovering from failures gracefully without system collapse.
Few-shot Learning
LLM FundamentalsProviding a small number of examples in a prompt to teach LLMs new patterns or behaviors without fine-tuning.
Fine-tuning
LLM FundamentalsTraining an LLM on task-specific data to specialize its behavior, improving performance on particular domains or tasks.
Function Calling
Core ConceptsA mechanism where an LLM generates structured outputs (typically JSON) that describe which function to call with what parameters, enabling programmatic tool invocation.
Goal Decomposition
Core ConceptsBreaking down high-level goals into smaller, manageable subgoals that are easier to achieve and reason about.
Google Agent Discovery Kit
Frameworks & ToolsGoogle\
Graph-based Architecture
Architecture PatternsAn agent architecture where nodes represent agents/tasks and edges represent dependencies, enabling complex workflow management.
Guardrails
Safety & ControlSafety mechanisms and constraints that limit agent behavior, preventing harmful outputs and ensuring compliance with guidelines.
Hallucination
LLM FundamentalsWhen LLMs generate false, incorrect, or fabricated information presented as fact, a critical safety concern.
Handoff Pattern
Architecture PatternsA multi-agent pattern where agents explicitly pass work to each other based on specialization, creating a workflow of transfers.
Hierarchical Architecture
Architecture PatternsA multi-level agent structure where higher-level agents coordinate lower-level agents, creating nested decision-making hierarchies.
Human-in-the-Loop
Safety & ControlA control pattern where significant agent decisions require human review and approval before execution.
JSON Schema
Protocols & StandardsA standard for describing and validating JSON data structures, specifying properties, types, constraints, and requirements.
Knowledge Graph
Memory & StateA structured representation of entities, relationships, and facts organized as a graph, enabling complex reasoning and relationship queries.
LangChain
Frameworks & ToolsA popular framework for building applications with large language models, providing abstractions for chains, memory, and external integrations.
LangGraph
Frameworks & ToolsAn extension of LangChain enabling graph-based workflow definition with cycles, state management, and complex agent patterns.
Latency
ProductionThe time delay between when an action is initiated and when results are returned, critical for user experience.
Levels of Autonomy
Safety & ControlClassification framework describing agent autonomy from fully manual to fully autonomous decision-making.
LlamaIndex
Frameworks & ToolsA data indexing framework for building RAG systems, providing tools for data ingestion, indexing, and retrieval optimization.
Long-term Memory
Memory & StatePersistent storage for important information, patterns, and knowledge accumulated over time that persists beyond individual interactions.
MCP (Model Context Protocol)
Protocols & StandardsAn open protocol for standardizing how AI agents connect to data sources, tools, and other services, enabling interoperability.
Multi-agent Systems
Architecture PatternsArchitectures where multiple independent agents work together, each with specialized capabilities, to solve problems collaboratively.
n8n
Frameworks & ToolsA workflow automation platform enabling visual agent and workflow creation with native LLM integration and 400+ pre-built nodes.
Observability
ProductionThe capability to understand and debug complex systems through comprehensive logging, metrics, and visibility into system behavior.
OpenAPI Specification
Protocols & StandardsAn open standard for describing REST APIs in a machine-readable format, enabling automated documentation, client generation, and tool integration.
Orchestration
Core ConceptsThe coordination and management of multiple agents or components working together to accomplish complex workflows and achieve shared objectives.
Parallel Execution
Architecture PatternsAn execution pattern where multiple agents or tasks run concurrently, improving performance by utilizing multiple resources simultaneously.
Planning
Core ConceptsThe process by which an agent breaks down complex goals into smaller subgoals and sequences of actions needed to achieve them.
Prompt Engineering
LLM FundamentalsThe practice of crafting inputs to LLMs to achieve desired outputs, including instruction clarity, examples, and formatting.
Prompt Injection
Safety & ControlAn attack where malicious input is crafted to manipulate LLM behavior, potentially bypassing safety measures and instructions.
RAG (Retrieval-Augmented Generation)
Memory & StateA technique combining information retrieval from external sources with generation, allowing agents to access relevant knowledge without fine-tuning.
Rate Limiting
ProductionControlling the frequency of requests to prevent overload, manage costs, and ensure fair resource allocation.
ReAct
Core ConceptsReasoning + Acting framework that combines explicit reasoning steps with action steps, enabling agents to interleave thinking with tool usage.
Reasoning
Core ConceptsThe cognitive process by which an agent analyzes information, draws logical conclusions, and generates justifications for decisions and actions.
Red Teaming
Safety & ControlAdversarial testing approach where teams actively try to break agent security and identify vulnerabilities before deployment.
Reflection
Core ConceptsAn agent capability to evaluate its own outputs, identify errors or inconsistencies, and self-correct before returning final results.
Response Formatting
LLM FundamentalsTechniques for ensuring LLM outputs conform to desired formats (JSON, XML, markdown, structured data).
RLHF (Reinforcement Learning from Human Feedback)
LLM FundamentalsA training technique using human feedback to fine-tune model behavior, improving alignment with human preferences.
Sandboxing
Safety & ControlIsolating agent execution environments to limit the impact of potential errors or malicious code, containing damage.
Semantic Kernel (Microsoft)
Frameworks & ToolsA lightweight orchestration framework integrating LLMs with plugins, enabling flexible skill composition and function integration.
Semantic Search
Memory & StateSearch technique that finds results based on semantic meaning rather than keyword matching, using embeddings.
Sequential Execution
Architecture PatternsAn execution pattern where agents or tasks execute one after another in a defined order, with output from one feeding into the next.
Short-term Memory
Memory & StateTemporary working memory that holds recent context, current task information, and intermediate results during active processing.
smolagents
Frameworks & ToolsA lightweight, Hugging Face-based framework for building agents with minimal dependencies, focusing on simplicity and transparency.
State Machine Pattern
Architecture PatternsAn architecture where agents operate as state machines with defined states, transitions, and actions triggered by events or conditions.
Streaming
ProductionSending results incrementally as they\
Supervisor Pattern
Architecture PatternsAn agent architecture where a central supervisor agent delegates work to specialized worker agents and coordinates their activities.
Swarm Intelligence
Architecture PatternsAn agent architecture inspired by natural swarms where simple agents following local rules create emergent global behaviors.
System Prompt
LLM FundamentalsInitial instructions provided to an LLM that define its role, behavior, guidelines, and constraints for the entire conversation.
Task Execution
Core ConceptsThe process by which an agent breaks down goals into concrete actions and executes them systematically to completion.
Temperature Parameter
LLM FundamentalsA setting controlling randomness in LLM outputs; lower values (0-0.3) produce deterministic outputs, higher (0.7-1.0) increase creativity.
Token Budget
ProductionThe allocated number of tokens an agent can consume per time period, controlling costs and managing resource usage.
Tokens
LLM FundamentalsThe fundamental units that LLMs process, typically representing subword units that are smaller than words but larger than characters.
Tool Integration
Frameworks & ToolsThe process of connecting agents to external tools, APIs, and services enabling extended capabilities.
Tool Schemas
Protocols & StandardsFormal definitions of available tools specifying parameters, return types, descriptions, and usage patterns in structured formats.
Tool Use
Core ConceptsThe capability of an AI agent to identify, select, and utilize external tools (APIs, functions, code executors) to accomplish tasks beyond its native knowledge.
Top-p (Nucleus Sampling)
LLM FundamentalsA sampling parameter controlling diversity by including only tokens with cumulative probability up to p, balancing quality and variety.
Vector Store
Memory & StateA database designed to store and retrieve high-dimensional vector embeddings efficiently, enabling semantic search and similarity matching.
Working Memory
Memory & StateThe active memory used during task execution to hold intermediate values, processing state, and temporary information needed for immediate operations.
Zero-shot Learning
LLM FundamentalsUsing an LLM to perform tasks without providing specific examples, relying on the model\