AI Agent Glossary

A2A (Agent-to-Agent Protocol)

Protocols & Standards

A communication protocol enabling direct agent-to-agent interaction, message passing, and coordination without human mediation.

Agent

Core Concepts

An autonomous software entity capable of perceiving its environment, reasoning about information, making decisions, and taking actions to achieve specific goals.

Agent Evaluation

Production

Systematic testing and measurement of agent performance against benchmarks, success criteria, and quality metrics.

Agentic AI

Core Concepts

AI systems designed to act autonomously with agency—the ability to make decisions, take action, and adapt behavior based on goals and environmental feedback.

AutoGen (Microsoft)

Frameworks & Tools

A framework enabling conversations between multiple LLM agents and human users for collaborative problem-solving.

Autonomous Agent

Core Concepts

An agent capable of independent operation with minimal human intervention, making decisions and executing actions autonomously.

AWS Bedrock Agents

Frameworks & Tools

AWS managed service for running agents that automatically handle orchestration, memory, and integration with AWS services.

Caching

Production

Storing frequently accessed results temporarily to avoid redundant computation and improve response times.

Chain of Thought

Core Concepts

A prompting technique where an AI is instructed to break down complex problems into intermediate reasoning steps, improving solution quality.

Claude Agent SDK

Frameworks & Tools

Anthropic\

Constitutional AI

Safety & Control

An approach training AI systems to follow explicit principles and guidelines, improving safety and alignment with human values.

Context Management

Memory & State

Strategies for maintaining, updating, and optimizing the information available to agents within context limits.

Context Window

Memory & State

The maximum amount of text (in tokens) that a language model can process at once, limiting how much history and context it can access.

CrewAI

Frameworks & Tools

A framework specifically designed for building multi-agent systems with role-based agents, tasks, and hierarchical management.

Distributed Tracing

Production

Tracking requests and operations across multiple services and components, enabling visibility into complex distributed workflows.

Embeddings

Memory & State

Numerical vector representations of text that capture semantic meaning, enabling comparison and retrieval based on semantic similarity.

Episodic Memory

Memory & State

Memory of specific events, experiences, and interactions, typically time-stamped and contextual, useful for learning from experience.

Error Handling

Production

Mechanisms for detecting, managing, and recovering from failures gracefully without system collapse.

Few-shot Learning

LLM Fundamentals

Providing a small number of examples in a prompt to teach LLMs new patterns or behaviors without fine-tuning.

Fine-tuning

LLM Fundamentals

Training an LLM on task-specific data to specialize its behavior, improving performance on particular domains or tasks.

Function Calling

Core Concepts

A mechanism where an LLM generates structured outputs (typically JSON) that describe which function to call with what parameters, enabling programmatic tool invocation.

Goal Decomposition

Core Concepts

Breaking down high-level goals into smaller, manageable subgoals that are easier to achieve and reason about.

Google Agent Discovery Kit

Frameworks & Tools

Google\

Graph-based Architecture

Architecture Patterns

An agent architecture where nodes represent agents/tasks and edges represent dependencies, enabling complex workflow management.

Guardrails

Safety & Control

Safety mechanisms and constraints that limit agent behavior, preventing harmful outputs and ensuring compliance with guidelines.

Hallucination

LLM Fundamentals

When LLMs generate false, incorrect, or fabricated information presented as fact, a critical safety concern.

Handoff Pattern

Architecture Patterns

A multi-agent pattern where agents explicitly pass work to each other based on specialization, creating a workflow of transfers.

Hierarchical Architecture

Architecture Patterns

A multi-level agent structure where higher-level agents coordinate lower-level agents, creating nested decision-making hierarchies.

Human-in-the-Loop

Safety & Control

A control pattern where significant agent decisions require human review and approval before execution.

JSON Schema

Protocols & Standards

A standard for describing and validating JSON data structures, specifying properties, types, constraints, and requirements.

Knowledge Graph

Memory & State

A structured representation of entities, relationships, and facts organized as a graph, enabling complex reasoning and relationship queries.

LangChain

Frameworks & Tools

A popular framework for building applications with large language models, providing abstractions for chains, memory, and external integrations.

LangGraph

Frameworks & Tools

An extension of LangChain enabling graph-based workflow definition with cycles, state management, and complex agent patterns.

Latency

Production

The time delay between when an action is initiated and when results are returned, critical for user experience.

Levels of Autonomy

Safety & Control

Classification framework describing agent autonomy from fully manual to fully autonomous decision-making.

LlamaIndex

Frameworks & Tools

A data indexing framework for building RAG systems, providing tools for data ingestion, indexing, and retrieval optimization.

Long-term Memory

Memory & State

Persistent storage for important information, patterns, and knowledge accumulated over time that persists beyond individual interactions.

MCP (Model Context Protocol)

Protocols & Standards

An open protocol for standardizing how AI agents connect to data sources, tools, and other services, enabling interoperability.

Multi-agent Systems

Architecture Patterns

Architectures where multiple independent agents work together, each with specialized capabilities, to solve problems collaboratively.

n8n

Frameworks & Tools

A workflow automation platform enabling visual agent and workflow creation with native LLM integration and 400+ pre-built nodes.

Observability

Production

The capability to understand and debug complex systems through comprehensive logging, metrics, and visibility into system behavior.

OpenAPI Specification

Protocols & Standards

An open standard for describing REST APIs in a machine-readable format, enabling automated documentation, client generation, and tool integration.

Orchestration

Core Concepts

The coordination and management of multiple agents or components working together to accomplish complex workflows and achieve shared objectives.

Parallel Execution

Architecture Patterns

An execution pattern where multiple agents or tasks run concurrently, improving performance by utilizing multiple resources simultaneously.

Planning

Core Concepts

The process by which an agent breaks down complex goals into smaller subgoals and sequences of actions needed to achieve them.

Prompt Engineering

LLM Fundamentals

The practice of crafting inputs to LLMs to achieve desired outputs, including instruction clarity, examples, and formatting.

Prompt Injection

Safety & Control

An attack where malicious input is crafted to manipulate LLM behavior, potentially bypassing safety measures and instructions.

RAG (Retrieval-Augmented Generation)

Memory & State

A technique combining information retrieval from external sources with generation, allowing agents to access relevant knowledge without fine-tuning.

Rate Limiting

Production

Controlling the frequency of requests to prevent overload, manage costs, and ensure fair resource allocation.

ReAct

Core Concepts

Reasoning + Acting framework that combines explicit reasoning steps with action steps, enabling agents to interleave thinking with tool usage.

Reasoning

Core Concepts

The cognitive process by which an agent analyzes information, draws logical conclusions, and generates justifications for decisions and actions.

Red Teaming

Safety & Control

Adversarial testing approach where teams actively try to break agent security and identify vulnerabilities before deployment.

Reflection

Core Concepts

An agent capability to evaluate its own outputs, identify errors or inconsistencies, and self-correct before returning final results.

Response Formatting

LLM Fundamentals

Techniques for ensuring LLM outputs conform to desired formats (JSON, XML, markdown, structured data).

RLHF (Reinforcement Learning from Human Feedback)

LLM Fundamentals

A training technique using human feedback to fine-tune model behavior, improving alignment with human preferences.

Sandboxing

Safety & Control

Isolating agent execution environments to limit the impact of potential errors or malicious code, containing damage.

Semantic Kernel (Microsoft)

Frameworks & Tools

A lightweight orchestration framework integrating LLMs with plugins, enabling flexible skill composition and function integration.

Semantic Search

Memory & State

Search technique that finds results based on semantic meaning rather than keyword matching, using embeddings.

Sequential Execution

Architecture Patterns

An execution pattern where agents or tasks execute one after another in a defined order, with output from one feeding into the next.

Short-term Memory

Memory & State

Temporary working memory that holds recent context, current task information, and intermediate results during active processing.

smolagents

Frameworks & Tools

A lightweight, Hugging Face-based framework for building agents with minimal dependencies, focusing on simplicity and transparency.

State Machine Pattern

Architecture Patterns

An architecture where agents operate as state machines with defined states, transitions, and actions triggered by events or conditions.

Streaming

Production

Sending results incrementally as they\

Supervisor Pattern

Architecture Patterns

An agent architecture where a central supervisor agent delegates work to specialized worker agents and coordinates their activities.

Swarm Intelligence

Architecture Patterns

An agent architecture inspired by natural swarms where simple agents following local rules create emergent global behaviors.

System Prompt

LLM Fundamentals

Initial instructions provided to an LLM that define its role, behavior, guidelines, and constraints for the entire conversation.

Task Execution

Core Concepts

The process by which an agent breaks down goals into concrete actions and executes them systematically to completion.

Temperature Parameter

LLM Fundamentals

A setting controlling randomness in LLM outputs; lower values (0-0.3) produce deterministic outputs, higher (0.7-1.0) increase creativity.

Token Budget

Production

The allocated number of tokens an agent can consume per time period, controlling costs and managing resource usage.

Tokens

LLM Fundamentals

The fundamental units that LLMs process, typically representing subword units that are smaller than words but larger than characters.

Tool Integration

Frameworks & Tools

The process of connecting agents to external tools, APIs, and services enabling extended capabilities.

Tool Schemas

Protocols & Standards

Formal definitions of available tools specifying parameters, return types, descriptions, and usage patterns in structured formats.

Tool Use

Core Concepts

The capability of an AI agent to identify, select, and utilize external tools (APIs, functions, code executors) to accomplish tasks beyond its native knowledge.

Top-p (Nucleus Sampling)

LLM Fundamentals

A sampling parameter controlling diversity by including only tokens with cumulative probability up to p, balancing quality and variety.

Vector Store

Memory & State

A database designed to store and retrieve high-dimensional vector embeddings efficiently, enabling semantic search and similarity matching.

Working Memory

Memory & State

The active memory used during task execution to hold intermediate values, processing state, and temporary information needed for immediate operations.

Zero-shot Learning

LLM Fundamentals

Using an LLM to perform tasks without providing specific examples, relying on the model\

Showing 77 terms

A2A (Agent-to-Agent Protocol)

Agent

Agent Evaluation

Agentic AI

AutoGen (Microsoft)

Autonomous Agent

AWS Bedrock Agents

Caching

Chain of Thought

Claude Agent SDK

Constitutional AI

Context Management

Context Window

CrewAI

Distributed Tracing

Embeddings

Episodic Memory

Error Handling

Few-shot Learning

Fine-tuning

Function Calling

Goal Decomposition

Google Agent Discovery Kit

Graph-based Architecture

Guardrails

Hallucination

Handoff Pattern

Hierarchical Architecture

Human-in-the-Loop

JSON Schema

Knowledge Graph

LangChain

LangGraph

Latency

Levels of Autonomy

LlamaIndex

Long-term Memory

MCP (Model Context Protocol)

Multi-agent Systems

n8n

Observability

OpenAPI Specification

Orchestration

Parallel Execution

Planning

Prompt Engineering

Prompt Injection

RAG (Retrieval-Augmented Generation)

Rate Limiting

ReAct

Reasoning

Red Teaming

Reflection

Response Formatting

RLHF (Reinforcement Learning from Human Feedback)

Sandboxing

Semantic Kernel (Microsoft)

Semantic Search

Sequential Execution

Short-term Memory

smolagents

State Machine Pattern

Streaming

Supervisor Pattern

Swarm Intelligence

System Prompt

Task Execution

Temperature Parameter

Token Budget

Tokens

Tool Integration

Tool Schemas

Tool Use

Top-p (Nucleus Sampling)

Vector Store

Working Memory

Zero-shot Learning

Keep Exploring

📚 AI Agent Learning Hub