Self-Improving AI Application Architectures
Mar 10, 2026 by Matt Eland · 18 min read · Part of the Building AI Agents topic
What if your AI agents could learn from their mistakes and get better over time? Check out a sample architecture that does just that.
Tracking AI system performance using AI Evaluation Reports
Sep 9, 2025 by Matt Eland · 12 min read · Part of the Building AI Agents topic
Learn how to build interactive AI evaluation reports in .NET to communicate model quality, boost transparency, and engage your team.
Add Structured Testing to Your AI Vibe - with promptfoo
Sep 6, 2025 by Dennis Whalen · 11 min read · Part of the Building AI Agents topic
When building AI applications with LLMs, many teams are still "vibe testing" their way to production. This guide shows how file-based prompts and comprehensive test scenarios build reliable review analysis systems that handle real-world complexity, allowing you to build a solid set of repeatable automated regression tests.
Automate the testing of your LLM Prompts - with promptfoo
Aug 24, 2025 by Dennis Whalen · 9 min read · Part of the Building AI Agents topic
LLMs produce non-deterministic outputs, making traditional exact-match testing ineffective. How can you verify an application response is contextually accurate when the response can vary with every request? Let's take a look at promptfoo!
Chaos Engineering: Breaking Things On Purpose
Jun 3, 2025 by Ed LeGault · 7 min read · Part of the Cloud-Native Development topic
This article explores what chaos engineering is, how to get started, and how to not get fired.
An LLM Evaluation Framework for AI Systems Performance
May 28, 2025 by Matt Eland · 11 min read · Part of the AI-Assisted Development topic
AI systems are powerful, but hard to constrain, measure, and control. Let's see how Microsoft.Extensions.AI.Evaluation helps with AI systems development.
Open-source Load Testing
May 19, 2025 by Dennis Whalen · 8 min read · Part of the Cloud-Native Development topic
In this guide, we'll set up a complete load testing environment using k6 for testing, Prometheus for metrics collection, and Grafana for visualization, all orchestrated with Docker.
Optimizing Your Test Strategy with Playwright
Apr 29, 2025 by Terry Welsh · 8 min read
This article guides you through UI, API, and visual testing with Playwright and clarifies places where Playwright should be avoided.
Gherkin and Robot Framework
Feb 12, 2023 by Dennis Whalen · 4 min read
Running Your First Robot Framework Test
Dec 23, 2022 by Dennis Whalen · 4 min read