Building AI Agents

How to architect, build, and evaluate AI agent systems using orchestration frameworks, protocols like MCP and A2A, and RAG pipelines.

What Does It Take to Build an AI Agent?

An AI agent is more than a chat prompt — it is an application that combines an LLM with orchestration logic, external tools, and domain data to pursue goals through multi-step reasoning. Building one means choosing an orchestration framework, connecting data sources, defining tool interfaces, and establishing feedback loops that keep the agent grounded.
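That loop of reasoning, tool use, and feedback can be sketched in a few lines. This is a framework-free illustration of the orchestration pattern, not any particular library's API; `fake_llm`, `lookup_weather`, and `run_agent` are hypothetical names standing in for a real model call and real tools:

```python
# Minimal agent loop: the model proposes either a tool call or a final
# answer; the orchestrator executes tools and feeds results back until
# the model is done (or a step budget runs out).

def lookup_weather(city: str) -> str:
    # Toy tool: a real agent would call an external API or plugin here.
    return f"Sunny in {city}"

TOOLS = {"lookup_weather": lookup_weather}

def fake_llm(history: list[dict]) -> dict:
    # Stand-in for a chat-completion call; decides one step at a time.
    if not any(m["role"] == "tool" for m in history):
        return {"tool": "lookup_weather", "args": {"city": "Oslo"}}
    return {"answer": "It is sunny in Oslo."}

def run_agent(goal: str, max_steps: int = 5) -> str:
    history = [{"role": "user", "content": goal}]
    for _ in range(max_steps):
        step = fake_llm(history)
        if "answer" in step:  # model has finished reasoning
            return step["answer"]
        result = TOOLS[step["tool"]](**step["args"])  # execute the tool
        history.append({"role": "tool", "content": result})
    return "Stopped after reaching max_steps."

print(run_agent("What's the weather in Oslo?"))
```

Everything else an orchestration framework adds, such as prompt templates, retries, and streaming, layers onto this same loop.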

Building a Digital Dungeon Master with Semantic Kernel shows the full pattern in practice: plugins expose game-state data, prompt strategies guide the LLM, and the orchestration layer ties everything together. MCP and A2A: Two bright modular futures for AI explains the open protocols that standardize how agents access tools (MCP) and collaborate with other agents (A2A), making these systems composable rather than monolithic.
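The core idea MCP standardizes is that tools self-describe with a name, a description, and a JSON Schema for their inputs, so any client can discover and invoke them uniformly (the protocol's `tools/list` and `tools/call` operations). The sketch below illustrates that shape without the actual MCP SDK; the registry, decorator, and `get_hit_points` tool are all illustrative inventions:

```python
import json

# Framework-free sketch of MCP-style tool exposure: tools publish
# metadata plus an input schema, and clients go through a uniform
# list/call interface instead of bespoke integrations.

TOOL_REGISTRY = {}

def tool(name, description, input_schema):
    """Register a function as a discoverable tool."""
    def wrap(fn):
        TOOL_REGISTRY[name] = {
            "description": description,
            "inputSchema": input_schema,
            "fn": fn,
        }
        return fn
    return wrap

@tool("get_hit_points", "Look up a character's hit points",
      {"type": "object", "properties": {"name": {"type": "string"}}})
def get_hit_points(name: str) -> int:
    return {"Aragorn": 42}.get(name, 0)  # toy game-state lookup

def list_tools():
    # What a server answers to tools/list: metadata only, no code.
    return [{"name": n, "description": t["description"],
             "inputSchema": t["inputSchema"]}
            for n, t in TOOL_REGISTRY.items()]

def call_tool(name, arguments):
    # What tools/call does: dispatch by name (a real server would
    # validate the arguments against the schema first).
    return TOOL_REGISTRY[name]["fn"](**arguments)

print(json.dumps(list_tools(), indent=2))
print(call_tool("get_hit_points", {"name": "Aragorn"}))
```

Because the agent only ever sees the metadata from `list_tools`, tools can be added, versioned, or hosted elsewhere without changing the agent itself, which is what makes these systems composable.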

For data-grounded agents, Document Search in .NET with Kernel Memory covers the RAG pipeline — ingestion, vector indexing, and search — that lets agents answer questions from real documents rather than relying on training data alone.
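The three pipeline stages can be shown end to end with a deliberately tiny index. This sketch uses bag-of-words vectors and cosine similarity purely for illustration; a production pipeline such as Kernel Memory swaps in learned embeddings, chunking, and a vector database, but the shape is the same. `TinyIndex` and the sample documents are invented for the example:

```python
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    # Bag-of-words "embedding": stand-in for a real embedding model.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

class TinyIndex:
    def __init__(self):
        self.docs = []  # (text, vector) pairs

    def ingest(self, text: str):
        # Ingestion + indexing: store the document with its vector.
        self.docs.append((text, embed(text)))

    def search(self, query: str, k: int = 1):
        # Retrieval: rank stored documents by similarity to the query.
        qv = embed(query)
        ranked = sorted(self.docs, key=lambda d: cosine(qv, d[1]),
                        reverse=True)
        return [text for text, _ in ranked[:k]]

index = TinyIndex()
index.ingest("The refund policy allows returns within 30 days.")
index.ingest("Shipping takes five business days.")
print(index.search("refund policy returns"))
```

The retrieved passages are then placed into the agent's prompt, which is what keeps its answers grounded in real documents instead of training data alone.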

Why It Matters

Shipping a reliable agent requires more than a working prototype. Our reference architectures lay out production patterns for website chat agents, AI-augmented developer tooling, and team-wide AI portals — covering security boundaries, model selection, and integration points that prototypes typically skip.

Evaluation closes the loop. An LLM Evaluation Framework for AI Systems Performance introduces repeatable metrics for groundedness, retrieval quality, and coherence; AI Evaluation Reports turns those metrics into dashboards the whole team can use; and the promptfoo series (getting started, structured testing) adds prompt-level regression testing. For agents that need to improve continuously, Self-Improving AI Application Architectures demonstrates how an agent can measure its own performance and generate better prompts over time.
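To make a metric like groundedness concrete, here is a crude lexical baseline: what fraction of the answer's content words actually appear in the retrieved context? Evaluation frameworks typically use an LLM judge or an entailment model instead, but a simple overlap score like this catches obvious hallucinations cheaply. The function names, stopword list, and sample strings are all invented for the example:

```python
import re

STOPWORDS = {"the", "a", "an", "is", "are", "of", "in", "to", "and"}

def content_words(text: str) -> set[str]:
    # Lowercased alphanumeric tokens, minus trivial stopwords.
    return {w for w in re.findall(r"[a-z0-9]+", text.lower())
            if w not in STOPWORDS}

def groundedness(answer: str, context: str) -> float:
    # Fraction of the answer's content words supported by the context.
    answer_terms = content_words(answer)
    if not answer_terms:
        return 1.0  # an empty answer claims nothing
    return len(answer_terms & content_words(context)) / len(answer_terms)

context = "Orders ship within two business days from the Oslo warehouse."
good = "Orders ship within two business days."
bad = "Orders ship overnight by drone."

print(groundedness(good, context))  # fully supported -> 1.0
print(groundedness(bad, context))   # mostly unsupported -> low score
```

Running a score like this over a fixed test set on every change is what turns evaluation from a one-off review into the regression suite the dashboards and promptfoo tests build on.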

Writing AI-ready code and understanding how coding agents operate on your codebase are the practical foundations that make all of this work in a real engineering environment.

Frequently Asked Questions

What goes into building an AI agent?
What are MCP and A2A, and why do they matter for agents?
How do you add knowledge retrieval to an AI agent?
How do you test and evaluate AI agent systems?
Can AI agents improve themselves over time?