Running a chatbot is not the same as knowing whether it performs well. Without proper measurement, you only find out something is wrong after complaints arrive. This article explains which metrics matter and how to track them systematically.
Once an AI chatbot is live, the real work begins. How do you know whether the chatbot achieves its goal? Satisfied users are a signal, but not enough. You need concrete measurement points to know what works, what does not, and where to adjust.
The containment rate is the percentage of conversations handled entirely by the chatbot, without human involvement. A high containment rate is good if the answers are also correct. If the bot handles conversations with wrong or vague answers, a high containment rate is actually a problem.
Always measure containment rate in combination with quality metrics. A bot that handles 80% of conversations but answers half of them incorrectly is less valuable than a bot that handles 50% with high accuracy.
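As a minimal sketch of what pairing the two looks like in practice (the `escalated` and `answer_correct` fields are illustrative, not from any specific chatbot platform; adapt them to what your platform actually logs):

```python
# Minimal sketch: containment rate paired with answer quality.
# Field names (escalated, answer_correct) are assumptions.
conversations = [
    {"escalated": False, "answer_correct": True},
    {"escalated": False, "answer_correct": False},
    {"escalated": True,  "answer_correct": None},  # handled by a human
]

contained = [c for c in conversations if not c["escalated"]]
containment_rate = len(contained) / len(conversations)

# Quality only makes sense for the conversations the bot kept.
correct = [c for c in contained if c["answer_correct"]]
accuracy = len(correct) / len(contained) if contained else 0.0

print(f"Containment: {containment_rate:.0%}, accuracy within contained: {accuracy:.0%}")
```

Reporting the two numbers side by side makes the trade-off from the paragraph above visible at a glance.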
The resolution rate goes a step further than containment. It is not just about whether the chatbot handled the conversation, but whether the question was actually answered. This is harder to measure and requires user feedback.
A simple way to measure this: ask at the end of each conversation "Did you find what you were looking for?" with yes and no buttons. That data gives you a direct indicator of effectiveness.
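A sketch of how that feedback might be aggregated, assuming each conversation stores the button response in a hypothetical `resolved_feedback` value (True for yes, False for no, None when the user skipped the question):

```python
# Sketch: resolution rate from end-of-conversation yes/no feedback.
# True = yes, False = no, None = the user skipped the question.
feedback = [True, True, False, None, True, None, False]

answered = [f for f in feedback if f is not None]
resolution_rate = sum(answered) / len(answered) if answered else 0.0
response_rate = len(answered) / len(feedback)

print(f"Resolution rate: {resolution_rate:.0%} "
      f"(based on {response_rate:.0%} of conversations)")
```

Reporting the response rate alongside the score matters: if only a small share of users answer the question, the resolution rate rests on thin evidence.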
The escalation rate is the percentage of conversations transferred to a human. A rate that is too low may mean the bot is closing conversations it cannot handle. A rate that is too high means the bot is resolving too little on its own.
The ideal percentage depends on the use case. For a complex internal helpdesk, 30% escalation is fine. For an FAQ bot about opening hours, 30% escalation would signal something is seriously wrong.
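One way to make that use-case dependence explicit is to check the measured rate against a target band per bot, as in this sketch (the bands echo the examples above and are not universal benchmarks):

```python
# Sketch: compare escalation rate against a per-use-case target band.
# The bands are illustrative; tune them to your own context.
TARGET_BANDS = {
    "internal_helpdesk": (0.15, 0.35),  # complex questions: more escalation is fine
    "faq_bot": (0.00, 0.10),            # opening hours etc.: escalation should be rare
}

def check_escalation(use_case: str, escalated: int, total: int) -> str:
    rate = escalated / total
    low, high = TARGET_BANDS[use_case]
    if rate < low:
        return f"{rate:.0%} is suspiciously low: is the bot closing conversations it cannot handle?"
    if rate > high:
        return f"{rate:.0%} is too high: the bot resolves too little on its own."
    return f"{rate:.0%} is within the target band."

print(check_escalation("faq_bot", escalated=30, total=100))
```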
A conversation that takes ten turns to answer a simple question is too long. Analyse average conversation length and the number of messages per conversation. Long conversations for simple questions point to unclear answers, poor conversation flow or a bot that keeps asking for clarification.
Compare this per conversation type as well. Complex questions may take longer; simple questions should be resolved quickly.
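A sketch of that per-type comparison, assuming each conversation is logged with a type label and a message count (both field names are assumptions):

```python
# Sketch: average messages per conversation, broken down by type.
from collections import defaultdict

conversations = [
    {"type": "opening_hours", "messages": 3},
    {"type": "opening_hours", "messages": 9},   # long for a simple question: investigate
    {"type": "billing_dispute", "messages": 12},
]

totals = defaultdict(lambda: [0, 0])  # type -> [message sum, conversation count]
for c in conversations:
    totals[c["type"]][0] += c["messages"]
    totals[c["type"]][1] += 1

for conv_type, (msg_sum, count) in totals.items():
    print(f"{conv_type}: {msg_sum / count:.1f} messages on average")
```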
Ask users for a rating after the conversation. Even a simple thumbs up / thumbs down provides valuable data. Analyse conversations with low ratings: what went wrong? Was it a content problem, a tone problem or a technical problem?
A low CSAT is rarely the complete diagnosis; it is a signal to look deeper.
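In code, that two-step flow (compute the score, then pull the low-rated conversations for manual review) could look like this sketch, with `rating` as a hypothetical thumbs field:

```python
# Sketch: CSAT from thumbs up/down, plus a review queue of low-rated chats.
conversations = [
    {"id": "c1", "rating": "up"},
    {"id": "c2", "rating": "down"},
    {"id": "c3", "rating": "up"},
]

rated = [c for c in conversations if c["rating"] in ("up", "down")]
csat = sum(c["rating"] == "up" for c in rated) / len(rated)

# The score tells you *that* something is off; the transcripts tell you *what*.
review_queue = [c["id"] for c in rated if c["rating"] == "down"]

print(f"CSAT: {csat:.0%}, conversations to review: {review_queue}")
```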
The fallback rate is the percentage of messages where the chatbot cannot provide an answer and falls back on a default response such as "I don't understand your question." A high fallback rate points to gaps in the knowledge base, poor intent recognition or unclear system instructions.
Review the messages that lead to fallbacks. They tell you exactly which topics or phrasings the bot is missing and steer your maintenance agenda.
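A sketch of that review step: compute the fallback rate and surface the user messages that most often trigger one (the `triggered_fallback` flag is illustrative):

```python
# Sketch: fallback rate plus the messages that most often trigger fallbacks.
from collections import Counter

messages = [
    {"text": "what are your opening hours", "triggered_fallback": False},
    {"text": "can i swap my subscription",  "triggered_fallback": True},
    {"text": "swap subscription",           "triggered_fallback": True},
]

fallbacks = [m for m in messages if m["triggered_fallback"]]
fallback_rate = len(fallbacks) / len(messages)

# The most common fallback triggers show where the knowledge base has gaps.
top_triggers = Counter(m["text"] for m in fallbacks).most_common(10)

print(f"Fallback rate: {fallback_rate:.0%}")
for text, count in top_triggers:
    print(f"{count}x  {text}")
```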
Metrics are only valuable if you act on them. Establish a fixed rhythm for analysis: weekly for operational metrics, monthly for trends. Connect each measurement point to an action: if the fallback rate exceeds 15%, you review the knowledge base. If CSAT drops, you analyse conversations from that week.
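Those metric-to-action rules are simple enough to automate as part of the weekly rhythm; a sketch (the thresholds mirror the examples above and are not recommendations):

```python
# Sketch: connect each metric to an action in a weekly check.
# Thresholds are illustrative; set your own.
def weekly_check(fallback_rate: float, csat: float, csat_last_week: float) -> list[str]:
    actions = []
    if fallback_rate > 0.15:
        actions.append("Fallback rate above 15%: review the knowledge base.")
    if csat < csat_last_week:
        actions.append("CSAT dropped: analyse this week's conversations.")
    return actions

for action in weekly_check(fallback_rate=0.18, csat=0.82, csat_last_week=0.88):
    print(action)
```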
Measuring the performance of an AI chatbot requires a combination of quantitative metrics and qualitative analysis. Mach8 helps organisations not only build chatbots, but also set up the right monitoring so you always know how your chatbot is performing.
Want to better understand your chatbot's performance? Get in touch with Mach8.
We help you go from strategy to implementation. Schedule a no-obligation call.