It’s Not You, It’s Me: Designing Failure States for AI Agents
Poor failure handling undermines AI products. Here's how to transform failures from frustrating to meaningful.
What if our most powerful brand-building moment isn’t when the product works perfectly—but when it admits it can’t?
That’s not an argument for embracing failure—no business survives on apology UX—but a recognition that every AI-powered product will fail sometimes. Especially when it has agency. Especially when it takes action. And especially when it lives at the high-stakes edge of money, health, or other essential services.
Designers have long known the value of handling errors gracefully. But AI introduces a new kind of failure: one that isn’t binary, but interpretive. There are no clean error codes that map 1:1 to pre-scripted error messages. And without guardrails, AI systems might not even know they’ve failed the user. If we’re not careful, we leave the user holding the emotional weight of that uncertainty.
As designers, we don’t just craft the polished flows and celebrated moments. We’re also responsible for what happens when things break, blur, or simply don’t work as expected. In AI-native systems—where ambiguity is part of the architecture—those moments aren’t edge cases. They’re part of the core experience.
AI agents are already all around us. They help us book appointments, summarise meetings, make plans, write emails, and—perhaps longer than any other example—handle customer support. Of all these applications, support chatbots have been the earliest and most visible success. They’re available 24/7, they speak in friendly tones, and when things go well, they can resolve issues in seconds.
But of course, things don’t always go well.
A few weeks ago, I asked my bank’s chatbot a simple question: “Did my last payment of £1,000 go through?” It replied, “I couldn’t find that payment. I’ll escalate this to a human support agent.” For a moment, I froze. Did the payment fail? Is there no record of it at all? Or did the bot just not know where to look?
As designers, we’re used to planning beyond the happy path. We account for edge cases, recovery flows, and failure states. We know that graceful failure can be just as important as smooth success. But AI changes the nature of failure: sometimes it guesses wrong, sometimes it can’t access the right data, and sometimes it just doesn’t know.
The more agency we give the machine, the more accountability we must design into the experience.
That’s why we need to rethink how we design failure in AI-native products. It’s not just about preventing harm or avoiding blame. It’s about how we help people make sense of what’s happening when the system reaches the edge of its capabilities. Because if we don’t design those moments with care, we leave people alone with their uncertainty. And when the system’s responses are vague or ambiguous, users tend to fill in the gaps—often with the worst-case scenario.
The Hidden Complexity of Failure States
AI systems are designed to feel seamless. And when they succeed, they do: the user asks, the system responds, and the experience flows forward. But when something goes wrong, it’s easy to fall back on one of two default approaches: assuming the model will “figure it out,” or wrapping the moment in a generic fallback message like “Sorry, I couldn’t complete your request.”
Both are shortcuts. And both flatten what should be a nuanced moment into a vague one.
Over-trusting the model
Sounds fluent, but may be wrong
Can mislead users with false confidence
Undermines trust when actions don’t match words
Over-restricting the model
Sounds safe, but lacks usefulness
Frustrates users with dead ends
Undermines usefulness when answers aren’t given
This is the control paradox: Give the system too much freedom, and users feel misled. Restrict it too tightly, and users feel blocked.
The challenge is to strike the right balance between clarity, pragmatism, and empathy.
Clarity helps users understand what happened—and what they can do next.
Pragmatism acknowledges real constraints, without overselling capability or offering false hope.
Empathy reminds us that every failed request is still an interaction with a person—often one who’s already feeling uncertain, anxious, or frustrated.
Finding the Middle Ground
The truth is, failure has layers—and treating it as a single catch-all event hides critical meaning from users. It leaves them to interpret what just happened, often without the information or context to do so.
Here’s the real breakdown:
1. Misunderstood Input
The user’s request is unclear, incomplete, or ambiguous.
“Send the last report to her.” Who’s “her”? Which report?
What’s needed: Gentle clarification. Don’t apologise for not knowing—ask for what you need to proceed.
2. Agent Misinterpretation
The user asked a valid question, but the model misunderstood or misrouted it.
“Cancel my subscription” gets interpreted as “Show me my subscription details.”
What’s needed: Acknowledge the misunderstanding and offer simple ways to rephrase or correct.
3. System or Capability Limitations
The agent understood, but the system simply can’t do what’s being asked.
“Book me a train from London to Paris.”—but the system only supports UK rail.
What’s needed: Clear, calm explanation of limits—with a suggestion for what is possible nearby.
4. Execution or Data Failures
The system tried to comply, but something broke: an API timed out, a database query failed, the server returned an error.
What’s needed: Transparency. Reassure the user it’s a technical issue, not a user mistake—and offer a retry or a path to human support.
5. True Absence of Data
The thing the user is asking about genuinely doesn’t exist.
“Did my payment go through?” No record, because it never happened.
What’s needed: Soft but unambiguous language—paired with relevant checks or follow-ups (e.g. “There’s no record of that payment on this account. Would you like to confirm the amount or timeframe?”)
When we don’t differentiate these types of failure, users are left in the dark—anxious, confused, and assuming the worst. A misunderstanding feels like a broken system. A system limitation feels like user error. A missing record feels like a financial crisis.
The burden of interpretation shifts unfairly to the user—and the moment stops feeling like a product experience and starts feeling like a personal problem.
Good design reclaims that burden. It anticipates ambiguity, owns responsibility, and guides users toward understanding—not just completion.
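To make that concrete, here is one way a team might encode the five failure types above so they can never collapse back into a single catch-all. It’s a minimal sketch in TypeScript; the names and fields are illustrative assumptions, not any particular framework’s API.

```typescript
// A minimal sketch: modelling the five failure kinds as an explicit type,
// so the response layer can never flatten them into one generic apology.
// Names and fields are illustrative, not a real framework.
type Failure =
  | { kind: "misunderstood_input"; missing: string[] }            // 1. unclear or ambiguous request
  | { kind: "agent_misinterpretation"; attemptedAction: string }  // 2. valid request, wrong routing
  | { kind: "capability_limit"; supportedInstead: string[] }      // 3. understood, but out of scope
  | { kind: "execution_error"; retryable: boolean }               // 4. something broke while trying
  | { kind: "no_matching_data"; query: string };                  // 5. the thing genuinely doesn't exist
```

Once every failure is forced through a shape like this, copy, tone, and next steps can be designed per kind rather than per apology.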
Designing Failure States with Empathy
Failure isn’t a dead end. Well-crafted failure messages are a chance to build trust through transparency, agency, and care.
1. Distinguish Between Agent and System Failure
Not all failures are the same—and users shouldn’t have to guess. Make it clear when the agent misunderstood vs. when the system is limited.
Good systems make it clear whether:
The request wasn’t understood
The request was misinterpreted
The system can’t do what’s being asked
Something went wrong while trying to help
The thing simply doesn’t exist
Users care deeply about why something went wrong. Vague failure messages cause confusion or misdirected blame. Was the system limited? Did the agent misunderstand? Was it their input?
Be specific about where the breakdown occurred: misunderstanding, system limits, unavailable data.
Assign clear responsibility with calm and generosity, so users aren’t left wondering if it was them.
Use plain, non-technical language to name the issue without overwhelming with detail.
Example:
“I wasn’t able to access your payment history right now. A support agent can check it in more detail.”
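As a sketch of what this specificity might look like in code, the hypothetical function below builds on the Failure shape from the earlier sketch and assigns responsibility in plain language, one message per kind. The copy strings are placeholders, not recommended production copy.

```typescript
// Assumes the Failure union from the earlier sketch; all copy is placeholder.
function failureMessage(f: Failure): string {
  switch (f.kind) {
    case "misunderstood_input":
      // The breakdown is in the input: ask, don't apologise.
      return `To make sure I get this right, could you tell me ${f.missing.join(" and ")}?`;
    case "agent_misinterpretation":
      // The breakdown is in the agent: own it and offer a correction.
      return `I think I took that the wrong way and tried to ${f.attemptedAction}. Want me to try again?`;
    case "capability_limit":
      // The breakdown is a system boundary: name it calmly, point to what is possible.
      return `That's outside what I can do right now, but I can help with ${f.supportedInstead.join(", ")}.`;
    case "execution_error":
      // The breakdown is technical: reassure the user it isn't their mistake.
      return f.retryable
        ? "Something went wrong on our side, not yours. I can try again now."
        : "Something went wrong on our side, not yours. A support agent can pick this up.";
    case "no_matching_data":
      // The breakdown is an absence: be soft but unambiguous.
      return `There's no record of ${f.query} on this account. Would you like to confirm the amount or timeframe?`;
  }
}
```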
2. Align Before You Act
Users shouldn’t have to debug the system’s reasoning. If the system doesn’t fully understand what’s being asked—or isn’t confident in what it found—it should pause, clarify, or self-correct.
This prevents misfires and creates a more thoughtful and intelligent experience, especially when the stakes are high.
A. Check for Clarity and Completeness
Before taking action, ask:
Is the request specific enough to proceed confidently?
Are key values (names, amounts, dates, entities) clearly defined?
If not, pause to ask for clarification—respectfully, with just enough context to move forward.
Acknowledge the input to show the system is listening.
Be precise in what’s unclear, so users understand why clarification is needed.
Avoid asking for repetition when possible—try narrowing or offering suggestions instead.
Example:
“You mentioned a £1,000 payment—do you remember roughly when it was made?”
“I saw a few large payments recently. Can you confirm the date or recipient so I can find the right one?”
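Here is a hedged sketch of what such a clarity check might look like for the payment example, assuming a simple request shape. The fields and the questions it asks are illustrative assumptions.

```typescript
// A hypothetical payment-lookup request; fields are illustrative.
interface PaymentQuery {
  amount?: number;      // e.g. 1000
  dateHint?: string;    // e.g. "last week"
  recipient?: string;   // e.g. "Sarah"
}

type ClarityCheck =
  | { ok: true }
  | { ok: false; ask: string }; // one precise clarifying question

function checkClarity(q: PaymentQuery): ClarityCheck {
  // Nothing to anchor on: narrow the search with one concrete question
  // instead of asking the user to repeat themselves.
  if (q.amount === undefined && q.recipient === undefined) {
    return { ok: false, ask: "Which payment do you mean? An amount or a recipient would help me find it." };
  }
  // Acknowledge what was heard, and be precise about what's still missing.
  if (q.amount !== undefined && q.dateHint === undefined) {
    return { ok: false, ask: `You mentioned a £${q.amount} payment. Do you remember roughly when it was made?` };
  }
  return { ok: true };
}
```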
B. Use Anchors to Inform the Response
If the request includes meaningful values (e.g. “£1,000,” “last invoice,” “from Sarah”), treat those as anchors for validating results.
If results are close but not exact, follow up with a gentle clarification.
If no match is found, surface nearby options without making the user feel wrong.
Only treat it as a true failure when none of the meaningful anchor values can be matched.
Example:
“I didn’t find a £10,000 payment, but there was a £9,950 transaction yesterday. Want to check that one?”
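A possible sketch of anchor-based validation, under the same assumptions: treat the stated amount as the anchor, accept near misses within a small tolerance, and surface nearby candidates instead of a flat refusal.

```typescript
// Illustrative transaction shape and tolerance; not a real banking API.
interface Transaction { amount: number; date: string; recipient: string; }

function matchByAmount(anchor: number, transactions: Transaction[], tolerance = 0.01) {
  const exact = transactions.filter((t) => t.amount === anchor);
  // "Close but not exact": within 1% of the anchor, e.g. £9,950 against £10,000.
  const nearby = transactions.filter(
    (t) => t.amount !== anchor && Math.abs(t.amount - anchor) <= anchor * tolerance,
  );
  // Only when both lists are empty should this surface as a true "no match".
  return { exact, nearby };
}
```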
C. Self-correct When Misinterpretation Happens
Even when a request is clear, the system might route it to the wrong action or tool.
If the wrong tool was triggered or the result clearly doesn’t fit the context:
Don’t ask the user to evaluate your logic.
When the system misinterprets the query, retry quietly before surfacing any uncertainty.
Only involve the user if repeated attempts fail or if multiple possible interpretations exist.
Avoid exposing internal confusion—users don’t want a play-by-play of system logic. They want clarity.
If needed, surface something like:
“I wasn’t able to find an exact match. Would you like to try a different date or amount?”
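One way to implement this quiet self-correction is to try the most likely interpretations in order and only involve the user when none produce a plausible result. The sketch below assumes two hypothetical hooks: a list of candidate tool calls and a plausibility check against the user’s anchors. Neither is an existing API.

```typescript
// Candidate interpretations are ordered most-likely first; looksPlausible
// sanity-checks a result against the user's anchors (amount, date, recipient).
async function resolveWithSelfCorrection<T>(
  candidates: Array<() => Promise<T>>,
  looksPlausible: (result: T) => boolean,
  maxAttempts = 2,
): Promise<T | null> {
  for (const run of candidates.slice(0, maxAttempts)) {
    try {
      const result = await run();
      if (looksPlausible(result)) return result; // accept quietly, no play-by-play
    } catch {
      // Execution errors are retried silently; the user only hears about
      // them if every attempt fails.
    }
  }
  return null; // only now surface a single clarifying question to the user
}
```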
3. Offer Specific, Actionable Next Steps
The worst failure messages leave users at a dead end. Even if the system can’t help directly, it should still orient the user toward progress—retries, alternate inputs, human help, or related tasks.
Even when you can’t act, help the user maintain momentum.
Every failure message should include at least one next step: retry, rephrase, escalate, or explore an alternative.
Example:
“That’s not something I can do directly, but I can prepare a draft email for support with what you just shared.”
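The “at least one next step” rule can even be made structural rather than left to copywriting discipline. A small illustrative sketch:

```typescript
// Illustrative shapes: a failure response cannot be constructed without
// at least one next step.
type NextStep =
  | { type: "retry" }
  | { type: "rephrase"; hint: string }
  | { type: "escalate"; to: "human_support" }
  | { type: "alternative"; description: string };

interface FailureResponse {
  message: string;
  nextSteps: [NextStep, ...NextStep[]]; // non-empty tuple: a dead end won't type-check
}
```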
4. Match Tone to Emotional Context
When something breaks—especially in sensitive domains like finance or healthcare—users often feel exposed, confused, or anxious. The language used in failure moments doesn’t just convey information. It shapes emotion.
Use tone that is calm, respectful, and emotionally aware.
Avoid robotic deflection or over-chattiness in high-stakes contexts.
Use compassionate framing: acknowledge importance without dramatising.
Avoid blame-shifting or emotionally detached language.
Avoid:
“Oops! That’s on me!” (too casual)
“Function not supported.” (too clinical)
Instead, reach for:
“That’s outside what I can do right now. Let me point you to the next best option.”
5. Reinforce Trust Through Transparency
Users are generally forgiving—as long as they understand what’s going on. When the system fails silently or vaguely, it feels like a black box. When it over-explains internal mechanics, it can overwhelm or confuse.
When possible, explain the limitation in user-friendly language, describing what happened in human terms, and what it means.
You don’t have to reveal how the system works; frame its boundaries instead.
When you can’t disclose reasons (due to policy or security):
Be honest and calm.
Name the boundary clearly.
Provide a human resolution path without defensiveness.
Example:
“Some account-related requests need to be handled by our support team directly. Want me to direct you to them?”
6. Keep It On-Brand, Even When It’s Broken
Failure states are still part of your product. Don’t let them feel like someone else wrote them in a rush.
Make sure failure messages reflect your product’s tone and personality.
Avoid sudden shifts from helpful guide to cold system alert.
Don’t default to legalese, placeholders, or copy written under pressure.
Even in failure, the product should still feel like your product.
Designing for failure isn’t about pessimism. It’s about designing completely—ensuring that even when things go wrong, users feel respected, supported, and understood.
Handled well, these moments don’t erode trust. They build it.
Beyond Chatbots: The Expanding Surface Area of Failure
Designing failure well isn’t just a chatbot problem.
It’s easy to focus there—support bots are some of the most visible and widely deployed AI agents today. But the challenges of failure framing are already spreading far beyond. They show up in multi-agent experiences, action-taking systems, and adaptive interfaces that assemble and evolve in real time.
And as AI takes on more capability, the design stakes change. These aren’t just tools that give answers. They’re systems that make decisions, initiate actions, and collaborate with humans to shape outcomes.
That’s a different level of risk—and a different level of responsibility.
What’s Changing
In traditional software, the user is clearly in charge. They click the button, trigger the flow, own the result. But in AI-native systems, agency is distributed.
The system can:
Interpret intent
Choose tools
Generate outcomes
Take action without explicit direction
This brings us to the heart of the issue: When things go wrong, who owns the failure?
Key Design Shifts for an AI-Native World
As AI systems evolve, we need to rethink how failure is handled—not just technically, but relationally. That means designing for a world where:
Accountability must be visible
Users need to understand what part of the system acted—and what part didn’t. If an agent returns the wrong result or triggers the wrong tool, the user shouldn’t be left guessing who made the decision or why.
Boundaries must be clear
Where does one agent’s role end and another’s begin? What happens when a system knows it’s out of its depth? When do we stop suggesting and start escalating?
Language must be designed, not defaulted
In adaptive, generative systems, language is a consistent thread that ties together fluid experiences—and yet failure responses are often the most under-designed part of the experience. These moments deserve intentionality, not boilerplate.
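One speculative way to make accountability and boundaries visible is to attach an attribution record to every agent action, so the interface can always show which part of the system acted, who approved it, and where a handoff happened. The shape below is an assumption, not an existing standard.

```typescript
// A speculative attribution record; field names are illustrative.
interface ActionAttribution {
  agent: string;                               // which agent decided to act, e.g. "billing-agent"
  tool: string;                                // which tool it invoked, e.g. "payments.search"
  initiatedBy: "user" | "agent" | "schedule";  // who or what set the action in motion
  approvedBy?: "user";                         // present only when the user explicitly confirmed
  outcome: "succeeded" | "failed" | "handed_off";
  handedOffTo?: string;                        // the next agent or human team, if any
}
```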
Potential Failure Surfaces Are Growing
What happens when multiple agents collaborate and hand off between one another? How do we make those transitions feel legible and accountable—not like things are happening “somewhere in the system”?
Who takes responsibility when an agent acts on your behalf and fails? The system? The agent persona? The human who approved it? These roles—and their boundaries—must be communicated clearly.
How do we build trust in distributed workflows? When agents string together decisions across steps, tools, and time, users need both a sense of control and a sense of continuity.
These aren’t just content problems. They’re experience problems, accountability problems, and trust problems. And solving them starts with treating failure—not as a temporary glitch to paper over—but as a moment to affirm the relationship between the user and the system.
Wrapping up
We often celebrate what AI can do—its fluency, its flexibility, its speed. But what defines the quality of an AI-powered product isn’t just how it behaves when things go right. It’s how it responds when things don’t go to plan.
Because users don’t judge systems by how often they succeed.
They judge them by how they behave when they fail.
When failure is handled vaguely, users feel uncertain. When it’s handled defensively, they feel blamed. But when it’s handled with care—when the system is transparent about its boundaries, clear about next steps, and emotionally grounded—users feel supported, respected, and more likely to try again.
We’ve talked about this in the context of customer support, but the patterns apply everywhere:
In productivity tools that offer recommendations
In agents that schedule, summarise, or automate
In systems that hand off between roles, tools, and flows
As these experiences become more distributed and dynamic, failure becomes more fluid, too. It doesn’t look like a single “error state” anymore. It shows up as subtle mismatches, missed connections, incomplete outcomes. And it’s our job to design for those moments—not to hide them, but to guide through them.
Because in AI-native products, language is the interface.
And at the edges of what the system can do, language is the product.
So here’s the opportunity: Treat failure not as a glitch to smooth over, but as a design surface to craft.