Testing

Self-Improving AI Application Architectures
Mar 10, 2026 by Matt Eland · 18 min read
ai architecture agents testing
Part of the Building AI Agents topic
What if your AI agents could learn from their mistakes and get better over time? Check out a sample architecture that does just that.

Tracking AI system performance using AI Evaluation Reports
Sep 9, 2025 by Matt Eland · 12 min read
ai csharp testing llm dotnet
Part of the Building AI Agents topic
Learn how to build interactive AI evaluation reports in .NET to communicate model quality, boost transparency, and engage your team.

Add Structured Testing to Your AI Vibe - with promptfoo
Sep 6, 2025 by Dennis Whalen · 11 min read
prompt engineering ai testing promptfoo
Part of the Building AI Agents topic
When building AI applications with LLMs, many teams are still "vibe testing" their way to production. This guide shows how file-based prompts and comprehensive test scenarios help you build reliable review-analysis systems that handle real-world complexity, and give you a solid set of repeatable automated regression tests.

Automate the testing of your LLM Prompts - with promptfoo
Aug 24, 2025 by Dennis Whalen · 9 min read
prompt engineering ai testing promptfoo
Part of the Building AI Agents topic
LLMs produce non-deterministic outputs, making traditional exact-match testing ineffective. How can you verify an application response is contextually accurate when the response can vary with every request? Let's take a look at promptfoo!

Chaos Engineering: Breaking Things On Purpose
Jun 3, 2025 by Ed LeGault · 7 min read
automation testing devops security
Part of the Cloud-Native Development topic
This article explores what chaos engineering is, how to get started, and how to not get fired.

An LLM Evaluation Framework for AI Systems Performance
May 28, 2025 by Matt Eland · 11 min read
ai csharp testing llm dotnet
Part of the AI-Assisted Development topic
AI systems are powerful, but hard to constrain, measure, and control. Let's see how Microsoft.Extensions.AI.Evaluation helps with AI systems development.

Open-source Load Testing
May 19, 2025 by Dennis Whalen · 8 min read
load testing k6 grafana testing
Part of the Cloud-Native Development topic
In this guide, we'll set up a complete load testing environment using k6 for testing, Prometheus for metrics collection, and Grafana for visualization, all orchestrated with Docker.

Optimizing Your Test Strategy with Playwright
Apr 29, 2025 by Terry Welsh · 8 min read
testautomation playwright qa testing
This article guides you through UI, API, and visual testing with Playwright and clarifies places where Playwright should be avoided.

Gherkin and Robot Framework
Feb 12, 2023 by Dennis Whalen · 4 min read
robotframework testing gherkin python
Let's see how Gherkin can clean up and simplify Robot Framework tests.

Running Your First Robot Framework Test
Dec 23, 2022 by Dennis Whalen · 4 min read
robotframework testing automation python
Let's talk about how to run your first Robot Framework test.
