May 6, 2026

One AI, Three Failure Modes: Why AI Fluency Looks Different in Every Role

The most dangerous hire isn't the one who can't use AI. It's the one who can't stop.

Demand for AI fluency jumped nearly sevenfold in two years - from occupations employing roughly one million workers in 2023 to seven million by mid-2025. Three-quarters of that demand isn't in engineering. It's in management and business operations (McKinsey Global Institute, November 2025).

Companies are responding. AI fluency is now a hiring priority across nearly every function. But most are treating it as a single checkbox - the same interview question, the same rubric, every role.

That's a mistake. Because AI fluency doesn't just vary across roles. It fails differently across roles.

And if you don't understand how it fails, you'll hire people who look fluent on paper and underperform on the job.

Consider: Section's AI Proficiency Report found that only 3% of the workforce qualifies as true AI practitioners - people who embed AI into their workflow with measurable productivity gains. 70% are "experimenters" - they use AI for basic tasks like summarizing notes or rewriting emails. That's familiarity, not fluency. And the gap between the two isn't prompting skill. It's role-specific judgment.

The designer: homogenization

When designers over-rely on AI, everything starts looking the same.

Researchers at Dalarna University ran 700 autonomous AI image generation loops - feeding outputs back as inputs across 100 rounds. Regardless of how diverse the initial prompts were, every run converged to one of just 12 generic visual motifs: stormy lighthouses, Gothic cathedrals, pastoral landscapes. They called it "visual elevator music" - aesthetically safe, commercially generic, creatively dead.

That convergence isn't just a model behavior. It shows up in human-AI collaboration too. A 2024 ACM study found that when people used ChatGPT for creative ideation, they produced more ideas - but less distinctive ones. Different users converged on the same creative territory. Worse, they reported feeling less responsible for the output.

That's the designer failure mode. AI tools collapse toward the median of their training data. A designer who leans on that output without strong creative direction doesn't just produce mediocre work. They produce work that looks like everyone else's.

The fluency signal: Does the designer use AI to explore faster while maintaining a distinct point of view? Or do they let AI dictate the aesthetic? High-fluency designers treat AI as a sketch tool. Low-fluency designers treat it as the art director.

The PM: abdication

When product managers over-rely on AI, they stop owning decisions.

AI is excellent at synthesizing information, generating options, and drafting strategy docs. But product management is fundamentally about making bets under uncertainty - choosing what not to build, prioritizing with incomplete data, defending a direction when the data is ambiguous. A PM who delegates that judgment to AI isn't being fluent. They're abdicating.

The evidence is striking. In a study of nearly 300 executives published in Harvard Business Review, those who consulted ChatGPT to make a financial prediction became significantly more optimistic, more confident - and produced worse forecasts than those who discussed with peers. The AI's authoritative tone inflated confidence without improving accuracy.

This isn't an edge case. Deloitte's 2026 Global Human Capital Trends report, surveying 9,000+ business leaders, found that 60% of executives now regularly use AI to support decisions - but only 5% say they manage the human-AI decision relationship well. AI is "increasingly blurring authorship and eroding confidence" in who actually made the call.

For PMs, this is the abdication trap: the more AI helps you structure your thinking, the easier it becomes to stop thinking for yourself.

The fluency signal: Does the PM use AI to gather and structure information faster while owning the decision? Or do they treat AI recommendations as decisions? High-fluency PMs use AI as a research assistant. Low-fluency PMs use it as a strategy consultant they never push back on.

The vibe coder: fragility

When coders over-rely on AI, they ship code they don't understand.

AI code generation is remarkably capable. It produces working code fast. But "working" and "correct" aren't the same thing - and the gap between them is where production incidents live.

CodeRabbit's 2025 analysis of 470 GitHub pull requests found that AI-authored code contains 1.7x more defects than human-written code. Security vulnerabilities are worse: 2.74x more cross-site scripting flaws, nearly 2x more insecure object references. Performance inefficiencies appear nearly 8x more often. The code compiles. It runs. It breaks in production.

The pattern shows up at scale too. GitClear's analysis of 211 million lines of code found that duplicated code blocks rose eightfold in 2024, and code revised within two weeks of commit nearly doubled - from 3.1% to 5.7%. That's the signature of developers shipping code they accepted but didn't understand: quick to produce, quick to break.

The fluency signal: Does the coder use AI to accelerate implementation while maintaining deep understanding of what they ship? Or do they produce output they can't debug? High-fluency coders treat AI as a pair programmer they review. Low-fluency coders treat it as an oracle.

One root cause, three different symptoms

Every failure mode above shares the same underlying weakness: treating AI as an authority rather than a tool. The designer trusts AI's aesthetic. The PM trusts AI's recommendation. The coder trusts AI's implementation. In each case, the human judgment that makes the role valuable gets outsourced to a system that doesn't have it.

The research bears this out. Harvard Business School and BCG's landmark "Jagged Frontier" study - a field experiment with 758 BCG consultants - found that for tasks within AI's capability boundary, users completed 12.2% more tasks, 25.1% faster, with over 40% quality improvement. But for tasks that required human judgment, AI users were 19% less likely to produce correct solutions. Same people. Same AI. Radically different outcomes depending on whether the task demanded execution or judgment.

The frontier is jagged - and it's shaped differently for every role. A designer's judgment boundary is different from a PM's, which is different from a coder's. That's why a single AI fluency assessment tells you almost nothing.

What this means for hiring

If AI fluency fails differently in every role, you can't assess it the same way in every role.

A designer's fluency shows up in creative distinctiveness under AI augmentation. A PM's shows up in decision ownership when AI is offering easy answers. A coder's shows up in verification depth - whether they understand the code they ship, not just whether they shipped it fast.

One interview question - "what AI tools do you use?" - catches none of this. The only way to see role-specific judgment is to watch someone work in a role-specific context.

That's why hiAIre runs role-specific work simulations. Realistic scenarios where you observe how candidates actually use AI in the context of their role - then score what you see across behavioral dimensions. Not self-reporting. Observable behavior.

AI fluency isn't one skill. It's one word for many different kinds of judgment. Hire accordingly.

Sources

McKinsey Global Institute, "Agents, Robots, and Us" (November 2025) - AI fluency demand jumped ~7x in two years; 3/4 in non-technical occupations
Section, "The AI Proficiency Report" (2024-2026) - only 3% of workforce are AI practitioners; 70% are experimenters
Hintze et al., "Autonomous language-image generation loops converge to generic visual motifs," Patterns (December 2025) - AI image outputs converge to 12 generic motifs ("visual elevator music")
Anderson et al., "Homogenization Effects of Large Language Models on Human Creative Ideation," ACM C&C (June 2024) - LLM users produce less distinctive ideas; feel less ownership
HBR, "Research: Executives Who Used Gen AI Made Worse Predictions" (July 2025) - AI inflated confidence while producing worse forecasts
Deloitte, 2026 Global Human Capital Trends - 60% of executives use AI for decisions; only 5% manage it well
CodeRabbit, "State of AI vs. Human Code Generation" (December 2025) - AI code has 1.7x more defects, 2.74x more XSS vulnerabilities
GitClear, "AI Copilot Code Quality: 2025 Research" (February 2025) - duplicated code 8x higher; code churn nearly doubled
Dell'Acqua et al., "Navigating the Jagged Technological Frontier," HBS/BCG (2024-2025) - AI users 19% less likely correct on judgment tasks