What is AI fluency and why does it matter for AI products?

AI fluency is a user's level of skill in working with AI, ranging from novice to expert. High-fluency users operate in an augmentative mode — iterating, refining goals, and critically assessing outputs. Low-fluency users operate in a delegative mode — passively accepting responses as final. The same AI model produces dramatically different outcomes depending on which mode the user is in, making fluency a critical factor in product success.

Why do AI product metrics look good but users aren't retaining?

Most AI failures are invisible to standard monitoring. Bigspin's analysis of 27,000 conversations found that 86% of novice user failures leave no trace in logs, feedback, or analytics. These users accept flawed outputs without complaint and quietly disengage. Clean conversation logs and positive CSAT scores can mask widespread quality problems that drive silent churn.

Why do expert AI users fail more often than beginners?

Expert users fail 64% of the time compared to 24% for novices, but not because they are worse at using AI. Experts attempt harder tasks (average complexity 3.1 vs 1.5 on a five-point scale) and actively probe for errors. 59% of expert failures are visible — the user catches the problem and works through it. Novices fail less often but miss 86% of their failures entirely.

How does user skill level affect AI product outcomes?

User skill level is the deciding variable in AI conversation quality. In Bigspin's research, 93% of high-fluency interactions were augmentative — users iterated, refined, and challenged the AI. Fewer than 1% of low-fluency interactions were. Teams building AI products need to instrument for invisible failures and design experiences that encourage critical engagement rather than passive acceptance.

What is the difference between augmentative and delegative AI use?

Augmentative users iterate with the AI, refine goals mid-conversation, and critically assess outputs. Delegative users passively accept the AI's plans and responses, treating the output as final. Augmentative use is strongly correlated with high fluency and visible failure recovery. Delegative use is correlated with low fluency and invisible failures that erode product quality silently.

How can AI product teams detect invisible conversation failures?

Standard monitoring tools like thumbs-up/thumbs-down feedback, session length, and error rates systematically miss most failures. Quality monitoring needs to analyze the actual content of conversations, not just count them. Bigspin's multi-pass analysis reads 100% of transcripts to surface failure patterns that leave no trace in conventional analytics — the silent mismatches, walkways, and confidence traps that drive users away without a signal.

Bigspin partners with Microsoft for their Agent Control Specification launch

The paradox of AI fluency: Novices vs. experts

Read the study

The paradox of AI fluency

Read the study

About us

Resources

Research

About us

Resources

Research

Invisible Failures

Bigspin partners with Microsoft for their Agent Control Specification launch

June 3, 2026

Reading time:

min

We at Bigspin were delighted to be a launch partner for Microsoft’s new Agent Control Specification (ACS), which was announced at Microsoft Build yesterday. ACS is a fully open (MIT-licensed) framework for making agentic workflows more controllable and secure. Bigspin + ACS is a powerful stack for monitoring and shaping agentic systems; Bigspin surfaces the patterns that need enforcement, and ACS provides a firm technical foundation for the enforcement step.

The challenge that ACS is addressing feels incredibly urgent to us. As is common with new technologies, agentic systems were initially designed to be powerful and expressive, with security and predictability taking a backseat. This felt exciting for a very brief moment, and then everyone started scrambling to get control of their agents and avoid security nightmares.

Bigspin’s primary users are often the people closest to these problems: they are product managers and technical leads who are tasked with monitoring their organization’s deployed agentic systems. This is a daunting task in the current moment because pretty much anything can go wrong (and what can go wrong eventually will), and true fixes are often elusive.

We try to ensure that the Bigspin agent always surfaces some positive insights for these folks, but there is often no way to sugarcoat it: the agents are misbehaving. Some of the problems are tractable and well-known – for example, the agent says it used a tool when it did not, or it uses a tool but then ignores the tool’s output and confidently proceeds with its own fabrications. At the other end of the spectrum, our embattled PM might discover that their agents are ignoring critical prompt instructions, enabling problematic user behaviors, and gaslighting their poor users.

We think ACS can help substantially in reducing the frequency and severity of these problems. The essence of it is that ACS provides specific interception points that allow for tight, modular control over how the system behaves. These controls are outside of the agentic system being governed, and most of them can be expressed as deterministic rules. In other words, we are approaching something like a set of guarantees. Disciplined use of ACS should turn an amorphous, untamed agent system into one that we can confidently reason about and control.

Problems will still arise, of course; every agentic system requires constant shaping in response to new usage patterns and emerging risks. The real question is how quickly they can be resolved. Here again, we think ACS will be a significant asset, because we expect to be able to trace emerging issues directly to missing ACS interventions or shortcomings of existing ACS interventions. In the context of a five-alarm fire, this is highly impactful for the entire organization and its customers.

ACS is part of Microsoft’s Agent Governance Toolkit, which is also open-source (MIT-licensed). We are eager to see how these tools evolve in response to both innovations and challenges for agentic systems, and we are looking forward to contributing to this effort ourselves. Many thanks to Mohammad Abuomar, Mohamed Elmergawi, Ilvens Jean, Mehrnoosh Sameki, and Mike Shi at Microsoft. It was rewarding to swap stories of agents misbehaving with this group and think creatively about how ACS might help get them in line!

Bigspin Team

Written by the whole team