The Confidence Problem
Why the most dangerous AI systems are the ones that can't say "I don't know" — and what it would take to build ones that can.
Ask an AI assistant for Italy’s flat tax rate for new residents and you may receive EUR 100,000 — a figure that was correct until January 2026, when the Italian Budget Law tripled it to EUR 300,000. Ask about Spain’s Golden Visa and you may receive programme requirements for an investment pathway that closed to new applicants in April 2025. Ask about Portugal’s Golden Visa minimum investment and you may receive EUR 500,000 — a threshold that hasn’t applied since 2023. These answers arrive with identical confidence. The system has no mechanism to distinguish between current data, stale data, and fabrication — because the architecture doesn’t track the difference.
Research confirms this is not anecdotal. AI models are systematically more confident when they are wrong. The gap widens on precisely the kind of questions that define investment migration advisory: cross-jurisdictional, recently changed, dependent on specific applicant circumstances, and without a single-document answer.
The core thesis: The most dangerous AI advisory systems are not the ones that make errors. They are the ones structurally incapable of signalling when an error might be occurring. Structured uncertainty — a system’s ability to express what it doesn’t know, verify what it claims, and surface the provenance and freshness of its knowledge — is the architectural capability that separates advisory intelligence from confident guessing at scale.
What this article delivers:
Evidence that AI confidence is inversely correlated with reliability on the questions that matter most for cross-border advisory
A structural explanation of why better models, more data, and smarter retrieval cannot solve what architecture must solve
The Confidence Audit: a five-question diagnostic for evaluating whether any AI advisory system can express honest uncertainty
A cascade analysis showing what confident-but-wrong AI advisory costs when decisions propagate across jurisdictions
An assessment of how EU AI Act obligations (August 2026) make structured uncertainty a regulatory requirement
The trust-advantage argument: why firms that embrace honest uncertainty outperform those that perform confidence
This is a structural analysis of what machine confidence means in high-stakes advisory — a framework for evaluating AI systems not by what they claim to know, but by whether they can admit what they don’t.
How the argument unfolds:
The Confidence Inversion — the counterintuitive research: AI systems are most confident precisely when they are least reliable, and the gap is widest on cross-domain advisory questions
Why Better Models Won’t Fix This — three reinforcing mechanisms make this structural, not developmental. The “just wait for the next model” escape route is closed.
The Architecture of Honest Uncertainty — the essay’s primary contribution: five architectural requirements for systems that model their own uncertainty, presented as the Confidence Audit
What Changes When the System Can Say “I Don’t Know” — a cascade scenario reworked through the uncertainty architecture, showing the concrete cost difference between performed and structured confidence
The Trust Advantage — why uncertainty, properly expressed, is a competitive and regulatory advantage
The Confidence Inversion
The instinct is reasonable. A system that expresses uncertainty sounds weaker than one that delivers answers. “I’m not sure” reads as less capable than “here’s what you should do.” The entire trajectory of AI development — from awkward autocomplete to fluent conversation — has rewarded confidence. Models are trained on human preference signals, and humans prefer confident answers.
The research reveals something structural.
A 2025 study from Technion — CHOKE, or Certain Hallucinations Overriding Known Evidence — tested what happens when large language models encounter questions they demonstrably know the answer to. The finding was not that models sometimes guess wrong. It was that in 16-43% of hallucination cases, the model produced a confident wrong answer while having the correct answer available in its training data. The model did not lack the information. It confidently overrode it.
This is not an edge case in investment migration advisory. It is the central operating condition. When a model confidently states that Portugal’s Golden Visa minimum investment is EUR 500,000 — it hasn’t been since 2023 — or that Italy’s flat tax for new residents is EUR 100,000 — it tripled in January 2026 — the error pattern is not “the model doesn’t know.” The model may well have encountered the correct figure. The architecture produces confident output regardless.
KalshiBench, a December 2025 benchmark testing five frontier models on questions with verifiable real-world outcomes, quantified the scale. At their highest stated confidence — 90% or above — models were wrong 15-32% of the time. The calibration gap is severe: systems claiming near-certainty are wrong roughly one in six to one in three times. Reasoning-enhanced modes — the models marketed as thinking more carefully — showed worse calibration than their standard counterparts. More compute spent on reasoning did not produce more honest self-assessment. It produced more elaborately justified overconfidence.
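The calibration gap this benchmark measures reduces to simple arithmetic: compare stated confidence with observed accuracy on the same answers. A minimal sketch, using illustrative data rather than any benchmark's figures:

```python
# Minimal sketch: measuring a calibration gap from (stated_confidence, was_correct)
# pairs. The records below are illustrative, not drawn from any benchmark.

def calibration_gap(records, threshold=0.9):
    """Among answers stated at or above `threshold` confidence,
    return (mean stated confidence, observed accuracy, gap)."""
    high = [(c, ok) for c, ok in records if c >= threshold]
    if not high:
        return None
    stated = sum(c for c, _ in high) / len(high)
    observed = sum(1 for _, ok in high if ok) / len(high)
    return stated, observed, stated - observed

# Illustrative: ten answers claimed at >= 90% confidence, seven actually correct.
records = [(0.95, True), (0.92, False), (0.90, True), (0.91, True),
           (0.94, False), (0.90, True), (0.93, True), (0.90, False),
           (0.96, True), (0.92, True)]

stated, observed, gap = calibration_gap(records)
# A well-calibrated system shows gap near zero; here stated is about 0.92
# against an observed accuracy of 0.70.
```

The point of the sketch is that calibration is cheap to audit once confidence is recorded as a number rather than implied by tone.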
The MIT-IBM Watson AI Lab confirmed the pattern from a different angle in March 2026. Self-consistency checks — asking a model the same question multiple times to see if it agrees with itself — miss a class of failures where the model is consistently, confidently wrong. Cross-model disagreement, where a different model evaluates the output, catches overconfidence that self-consistency misses entirely.
In low-stakes domains, this is a nuisance. In investment migration, it cascades. A confidently stated eligibility threshold that changed last year leads to an application built on false premises. A tax regime described in its pre-reform version leads to corporate structuring optimised for rates that no longer exist. A programme pathway presented as active when it closed months ago leads to committed capital with nowhere to go. The confident delivery is what makes the error dangerous — the system’s tone carries no signal about which outputs are grounded in current, verified data and which are not.
The instinct to call this a temporary problem is understandable. The evidence says otherwise.
Why Better Models Won’t Fix This
Three reinforcing mechanisms make the confidence problem structural. Each individually would be significant. Together, they close the escape route of “just wait for the next model.”
Training incentivises confident guessing. OpenAI’s own researchers published the structural explanation in September 2025: “training and evaluation procedures reward guessing over acknowledging uncertainty.” Nine of the ten most widely used AI benchmarks incentivise producing an answer over abstaining when uncertain. The model learns, through millions of optimisation steps, that confident answers score higher than honest limitation. The result is measurable: OpenAI’s GPT-5-thinking-mini, specifically designed to express uncertainty, has a 52% abstention rate — it declines to answer when unsure. Its predecessor, o4-mini, abstains on 1% of queries. The architecture can express uncertainty. Current training overwhelmingly rewards not doing so.
This is not a criticism of any particular provider. It is a description of the incentive structure that shapes every major model in production. When the leading AI lab publishes research confirming that its own evaluation procedures reward confident guessing, the structural nature of the problem is established by the most authoritative possible source.
Retrieval changes what the model is confident about, not whether it can calibrate that confidence. Issue 8 of this publication established that retrieval-augmented generation — the architecture underlying most current advisory AI tools — is a document search technology applied to a reasoning problem. The confidence dimension compounds the limitation. A system that retrieves a 2019 jurisdiction guide and a 2026 jurisdiction guide scores them identically in semantic similarity. Vector embeddings carry no temporal signal. The AI presents information from both sources with equal authority, because the retrieval architecture provides no mechanism to weight current over stale. RAG solves the knowledge access problem. It does not touch the confidence calibration problem.
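The temporal blindness described here, and one candidate repair at rerank time, can be sketched as follows. The embeddings, the exponential decay, and the one-year half-life are illustrative assumptions, not a description of any production system:

```python
# Sketch: vanilla vector retrieval is temporally blind; a freshness-weighted
# rerank is one way to restore the signal. Values here are illustrative.
from datetime import date
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Two "jurisdiction guide" chunks with near-identical embeddings: semantic
# similarity alone cannot tell the 2019 guide from the 2026 one.
docs = [
    {"id": "guide-2019", "embedding": [0.71, 0.70], "verified": date(2019, 6, 1)},
    {"id": "guide-2026", "embedding": [0.70, 0.71], "verified": date(2026, 2, 1)},
]
query = [0.71, 0.71]

def rerank(docs, query, today, half_life_days=365):
    # Exponential decay: a chunk last verified one half-life ago gets half weight.
    def score(d):
        age_days = (today - d["verified"]).days
        return cosine(query, d["embedding"]) * 0.5 ** (age_days / half_life_days)
    return sorted(docs, key=score, reverse=True)

ranked = rerank(docs, query, today=date(2026, 3, 1))
# On similarity alone the two chunks score virtually identically; only the
# freshness term separates them, putting "guide-2026" first.
```

The design point is that freshness must be a stored field the ranker can read; no amount of similarity tuning recovers a date that was never indexed.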
The domain is maximally hostile to static confidence. Investment migration data changes with a frequency and granularity that stress-tests any system relying on point-in-time knowledge. Programme-level changes are the visible part: Portugal’s Golden Visa minimum investment changed in 2023, Italy’s flat tax tripled in January 2026, Spain’s Golden Visa closed in April 2025. Below the surface, eligibility varies by programme track, applicant nationality, investment type, and family composition. Treaty provisions between specific jurisdiction pairs override domestic rates through mechanisms that depend on the corporate structures involved. The highest-value advisory questions — the ones clients are paying for — sit at the intersection of multiple jurisdictions, multiple domains, and multiple recently-changed rule sets. These are the questions where model confidence is least justified and calibration data is thinnest.
Mathematics formalises what the evidence suggests. Three independent research teams have proven, through different formal frameworks, that hallucinations cannot be fully eliminated from large language models. Xu et al. demonstrated this through learning theory and computability. Banerjee et al. reached the same conclusion via Gödel’s First Incompleteness Theorem. Karpowicz proved an impossibility result: no LLM can simultaneously achieve truthfulness, information conservation, and knowledge-constrained optimality. OpenAI’s researchers confirm the conclusion — hallucinations are “mathematical constraints” of the technology, not engineering flaws awaiting a fix.
The professional services evidence is already substantial. $145,000 in judicial sanctions for AI-fabricated legal filings in Q1 2026 alone. A Nebraska attorney whose brief contained 57 fabricated citations out of 63. An Alabama attorney sanctioned $55,597 after the court recommended a finding of incompetence to practise. Deloitte refunding AUD $440,000 to the Australian government for a report containing fabricated court citations. Stanford’s finding that purpose-built legal AI — Lexis+ AI and Westlaw AI, products designed specifically for professional research — hallucinate 17-34% of the time, with the highest rates on questions about recent legal changes and cross-jurisdictional matters. The pattern is clear, and it is extending from legal and financial services into every domain where AI provides advisory output.
If the confidence problem is structural — rooted in training incentives, amplified by retrieval limitations, confirmed by mathematical proof — the response must also be structural. Not better prompts. Not more disclaimers appended to output. Architecture.
The Architecture of Honest Uncertainty
What would it mean for an AI advisory system to model its own uncertainty rather than perform confidence?
The question maps to five architectural capabilities — each representing an engineering commitment that cannot be retrofitted onto a system designed around confident delivery. Together, they form what this essay calls the Confidence Audit: a diagnostic for evaluating whether any AI advisory system expresses honest uncertainty or masks structural ignorance behind fluent prose.
1. Can it tell you where its knowledge came from?
Provenance — the ability to trace every data point to its source, authority level, and applicable conditions — is the foundation. “Article 81 of the IRS Code, verified March 2026, applicable to new NHR registrants for a ten-year period” is advisory intelligence. “Portugal has a 20% flat tax” is a search result. The distinction determines whether the output can be verified, challenged, or updated when the underlying data changes.
Structured knowledge with per-field provenance exists in production today — in financial data platforms, in legal research infrastructure, in the purpose-built AI systems the largest professional services firms are deploying. The capability is proven. It is not, however, standard in the AI advisory tools currently marketed to investment migration firms. Most operate on text chunks in vector databases, where the provenance trail ends at a filename.
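Per-field provenance can be sketched as a small data model. The field names below are illustrative assumptions; only the example values come from the article's own NHR illustration:

```python
# Sketch of per-field provenance in a structured knowledge layer. The schema
# (source, authority, verified, conditions) is illustrative, not a standard.
from dataclasses import dataclass
from datetime import date

@dataclass(frozen=True)
class Provenance:
    source: str           # the legal instrument or official publication
    authority: str        # who asserted it: statute, regulator guidance, firm memo
    verified: date        # when the value was last confirmed as current
    conditions: tuple = ()  # applicability constraints attached to the value

@dataclass(frozen=True)
class Fact:
    claim: str
    value: str
    provenance: Provenance

nhr_rate = Fact(
    claim="Portugal NHR flat rate on eligible income",
    value="20%",
    provenance=Provenance(
        source="Article 81, IRS Code",
        authority="statute",
        verified=date(2026, 3, 15),
        conditions=("new NHR registrants", "ten-year period"),
    ),
)
# Every rendered figure carries its own trail; a value with no Provenance
# attached has nothing to cite and cannot be emitted as verified.
```

The contrast with chunked text is structural: here the trail ends at a statute and a verification date, not at a filename.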
2. Can it tell you when its knowledge was last verified?
Temporal confidence is invisible in standard retrieval architectures. A vector embedding of a 2019 jurisdiction guide is indistinguishable from a 2026 embedding. Both score on semantic similarity. Neither carries a signal about when the information was confirmed as current. A system that cannot distinguish “verified this quarter” from “ingested three years ago” will present the pre-reform rate with the same authority as the post-reform rate. The architecture has no mechanism to do otherwise.
This is the failure mode behind the most consequential errors in cross-border advisory — where a client acts on a confident assessment grounded in rules that changed since the data was last verified.
3. Can it tell you what it doesn’t know?
“I don’t know” and “this depends on information I don’t have” are required capabilities in any system that claims advisory authority. A model that cannot express uncertainty will fabricate certainty — because the output structure demands a complete answer, and the training signal rewards producing one. Knowledge gaps must be structured fields in the system’s output: explicit, enumerated, actionable. Not hidden behind confident prose.
Multi-agent architectures are beginning to implement structured uncertainty signals — confidence levels, explicit assumption lists, knowledge gap declarations — as first-class fields in output schemas. But the approach is not yet standard. Most advisory AI treats completeness as a feature and incompleteness as a failure, inverting the relationship between honesty and utility in high-stakes contexts.
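An output schema that treats uncertainty as first-class fields might look like the following sketch. The field names and confidence levels are assumptions for illustration, not an established standard:

```python
# Sketch: knowledge gaps and assumptions as structured fields in the output
# schema, rather than caveats buried in prose. Names are illustrative.
from dataclasses import dataclass, field
from typing import List

@dataclass
class AdvisoryResponse:
    answer: str
    confidence: str                                  # "high" | "medium" | "low"
    assumptions: List[str] = field(default_factory=list)
    knowledge_gaps: List[str] = field(default_factory=list)

    def is_actionable(self) -> bool:
        # A response with open gaps is routed to verification, not delivered
        # as settled advice: incompleteness is surfaced, never absorbed.
        return self.confidence == "high" and not self.knowledge_gaps

resp = AdvisoryResponse(
    answer="Dividend withholding depends on the Ireland-Portugal treaty position.",
    confidence="medium",
    assumptions=["Client does not hold a second EU citizenship"],
    knowledge_gaps=["Current Ireland-Portugal treaty rates not verified"],
)
# resp.is_actionable() is False: the gap travels with the answer instead of
# disappearing into fluent prose.
```

Because the gaps are enumerable fields, downstream code can gate on them; a disclaimer appended to prose offers nothing a program can check.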
4. Can it check its own work using a different process?
Self-consistency — asking the same model the same question and checking agreement — measures reliability, not accuracy. A model can be consistently, confidently wrong. MIT’s research demonstrated that cross-model disagreement — where a different model evaluates the output — identifies overconfident responses that self-consistency misses. The LLM-as-judge paradigm, where an independent model evaluates whether claims are supported by retrieved evidence and whether gaps are acknowledged, is the dominant approach in current AI evaluation research. Production implementations exist.
The architectural commitment is real: every advisory response evaluated by an independent process, checking not whether the answer sounds right but whether it is grounded in the evidence actually retrieved. This is more expensive per query. Multiple model calls, more compute, slower responses. In advisory contexts where a wrong answer costs orders of magnitude more than a slow answer, this is the correct tradeoff.
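The cross-model check itself can be sketched with the two models abstracted as callables. In production these would be independent LLM endpoints; here they are stubbed deterministically for illustration:

```python
# Sketch of cross-model verification: one model drafts, an independent model
# judges whether the draft is grounded in the retrieved evidence. The stub
# judge below uses exact matching purely for illustration.

def cross_model_check(question, answerer, judge, evidence):
    """Generate with one model, verify grounding with an independent one."""
    draft = answerer(question)
    verdict = judge(question, draft, evidence)   # "supported" | "unsupported"
    if verdict != "supported":
        return {"answer": draft, "status": "flagged",
                "note": "Independent check could not ground this claim."}
    return {"answer": draft, "status": "verified"}

# Stubs standing in for two independent models:
answerer = lambda q: "The minimum investment is EUR 500,000."
judge = lambda q, a, ev: "supported" if a in ev else "unsupported"

result = cross_model_check(
    "Portugal Golden Visa minimum investment?",
    answerer, judge,
    evidence=["The minimum investment threshold was amended in 2023."],
)
# The confident draft is flagged rather than delivered, because the judging
# process is independent of the process that produced the answer.
```

The extra model call is the cost referred to above; the flag it produces is what self-consistency, by construction, cannot supply.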
5. Can it tell you what it assumed?
Every advisory analysis rests on assumptions about the client’s circumstances. A tax analysis assumes residency status. An eligibility assessment assumes nationality and family composition. A corporate structuring recommendation assumes beneficial ownership and substance provisions. When these assumptions are structured fields — “I am assuming you do not hold a second EU citizenship, which would change this analysis” — the client or advisor can verify or correct them. When assumptions are implicit, errors compound silently through every downstream recommendation that inherits the unexamined premise.
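Once assumptions are structured fields, verifying them reduces to a confirmation loop. A minimal sketch, with illustrative assumption strings:

```python
# Sketch: surfacing assumptions for explicit confirmation before any
# downstream analysis inherits them. Assumption wording is illustrative.

def contradicted_assumptions(assumptions, client_answers):
    """Return the assumptions the client has explicitly contradicted.
    Any analysis built on a contradicted assumption must be re-run,
    not patched, so the correction cannot compound silently."""
    return [a for a in assumptions if client_answers.get(a) is False]

assumptions = [
    "Client does not hold a second EU citizenship",
    "Client is currently UK tax resident",
]

# The client corrects the first assumption and confirms the second:
flagged = contradicted_assumptions(assumptions, {
    "Client does not hold a second EU citizenship": False,
    "Client is currently UK tax resident": True,
})
# flagged contains exactly the citizenship assumption, which invalidates the
# freedom-of-movement analysis downstream of it.
```

Assumptions the client has not yet answered are left unconfirmed rather than treated as contradicted, which keeps the loop conservative.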
The “I don’t know” test. Ask any AI advisory tool: “What are the current minimum investment requirements for Portugal’s Golden Visa?” A system performing confidence gives you a number, stated with authority. A system modelling uncertainty responds: “The minimum investment was EUR X as of [date], sourced from [authority]. This programme has undergone significant modifications since 2023. My data was verified on [date]. I recommend confirming current requirements before proceeding.”
The second response is more useful because of its expressed uncertainty. It tells the user what it knows, when it was verified, and what it hasn’t confirmed. The first tells the user nothing about the quality of its own knowledge — and transfers the entire verification burden to someone with no way to assess whether the figure is current.
Five architectural requirements. None achievable through better prompting or larger context windows. Each requires engineering commitment at the infrastructure level — structured knowledge, temporal metadata, output schemas that treat uncertainty as a first-class field, cross-model evaluation, and assumption extraction built into the reasoning process. Together, they define the architectural difference between a system designed to seem right and a system designed to know when it might be wrong.
Where we stand: AI advisory systems are structurally more confident when wrong — a pattern rooted in training incentives, amplified by retrieval limitations, and confirmed by mathematical proof. The Confidence Audit defines five capabilities that separate advisory intelligence from performed confidence. The question is what changes when these capabilities are present.
What Changes When the System Can Say “I Don’t Know”
Consider the cascade analysis from Issue 8 of this publication: a UK technology entrepreneur evaluating Portuguese residency, operating through a UAE free zone company with an Irish intermediate holding company. The same scenario, run through two different architectures.
With performed confidence, the system synthesises responses across tax, immigration, corporate, and banking domains. Each section reads with equal authority. The tax assessment presents the NHR regime. The immigration analysis covers visa requirements. Corporate structuring addresses the UAE entity. Banking notes compliance considerations. The output is fluent, professionally structured, internally consistent. The client reads integrated advisory analysis. She acts.
One domain’s analysis is grounded in pre-reform data. The NHR regime underwent modification, and the system’s knowledge predates the change. The error is invisible because the architecture carries no temporal signal. The tax optimisation cascades through corporate restructuring — arrangements designed around rates that no longer apply. Banking projections are predicated on net-of-tax income levels that will not materialise. An immigration timeline locks in a sequence that assumes the tax conditions the system described confidently but incorrectly. The cascade is discovered after commitment — capital moved, structures established, applications filed.
With structured uncertainty, the same query produces differently structured output. The tax analysis carries a freshness flag: “NHR regime provisions last verified [date]; this regime has undergone reform — current rates should be confirmed with the Autoridade Tributária before commitment.” The immigration assessment surfaces an assumption: “This analysis assumes you do not hold Irish citizenship, which would materially affect freedom-of-movement analysis and may eliminate the visa requirement entirely.” The corporate structuring response identifies a gap: “I do not have current treaty data for the Ireland-Portugal jurisdiction pair needed to confirm dividend withholding rates; professional verification is recommended before proceeding.” The synthesis preserves these signals rather than absorbing them into confident prose.
The cost difference is not marginal. In the first scenario, the client discovers the misalignment after commitment. Unwinding cross-border structures is expensive, slow, and sometimes impossible — capital committed to a programme cannot always be redirected, and residency changes triggered by the move may have tax consequences of their own. In the second, uncertainty signals prompt verification before commitment. The advisor confirms current rates, corrects the assumption, fills the gap. Same advisory question. Same facts. Fundamentally different outcome. The difference is architectural.
The regulatory environment is converging on requiring this capability. The EU AI Act — fully applicable to high-risk AI systems from August 2, 2026 — imposes transparency, explainability, and accuracy monitoring obligations on AI systems influencing decisions in domains including migration processing and access to essential services. Article 13 requires that deployers understand a system’s “capabilities and limitations.” Article 15 requires “appropriate levels of accuracy.” A system that cannot express its own uncertainty — that lacks any mechanism to communicate what it doesn’t know, when its data was last verified, or what it assumed — cannot satisfy these requirements.
Advisory AI influencing decisions about residency, investment pathways, and access to financial services operates in precisely the categories the regulation was designed to govern. Provenance tracking, temporal confidence, structured uncertainty in output, independent verification — these are becoming the regulatory minimum for AI in advisory contexts. Penalties for non-compliance with high-risk obligations reach EUR 15 million or 3% of global annual turnover.
The regulatory argument matters. The strategic argument is stronger.
The Trust Advantage
Issue 8 of this publication asked: does your AI have the six capabilities cross-border advisory requires? This article adds a capability that underpins all the others — one that determines whether those six can be trusted when they produce results.
Can your system tell you when it doesn’t know something?
The Confidence Audit provides five diagnostic questions applicable to any AI advisory tool. But there is a single question that cuts through all five:
“Can your system tell me when the data underlying its recommendation was last verified — and what it assumed about my circumstances?”
The answer is diagnostic. A system that can respond has been designed around honest uncertainty — provenance tracked at the data level, freshness metadata maintained, assumptions extracted and surfaced. A system that cannot has been designed around performed confidence — fluent output without epistemic infrastructure. There is no way to approximate this through disclaimers or carefully worded caveats. Either the architecture tracks it, or it doesn’t.
The firms that adopt AI as “faster answers with confident delivery” will discover that confidence without calibration is a liability — with clients who act on stale data, with regulators who require transparency, with professional standards that increasingly demand traceability. The firms that adopt AI as “structured reasoning with honest uncertainty” will discover what the research consistently demonstrates: clients don’t want a system that pretends to know everything. They want one that knows what it knows, says what it assumed, and flags what it couldn’t verify.
An advisor whose system reports “the tax analysis is current as of March 2026, sourced from Revenue’s published guidance, but the corporate substance assessment relies on 2024 data and should be verified before commitment” is more credible than one whose system presents both assessments with identical authority. The uncertainty is not a limitation on display. It is evidence that the system — and the firm deploying it — takes the advisory obligation seriously enough to say what it hasn’t confirmed.
Uncertainty, properly expressed, compounds as a trust advantage. The client who receives structured uncertainty in the first engagement learns to trust the outputs that arrive without caveats — because the system has demonstrated it flags concerns when they exist. The advisor who works with honest uncertainty spends less time on retrospective verification and more time on the advisory judgement that clients actually value. The firm that builds on uncertainty architecture meets regulatory requirements not as a compliance burden but as a structural consequence of how the technology was designed.
The confidence problem is structural. The solution is architectural. The firms that recognise uncertainty as a capability — not a deficiency to be masked — will define what advisory intelligence means in the decade ahead.
The Framework, Condensed
The Confidence Audit
AI advisory systems in cross-border contexts must model their own uncertainty — not as an afterthought, but as a core architectural capability. The Confidence Audit evaluates this through five questions:
Provenance — Can the system tell you where its knowledge came from? Source, authority level, applicable conditions. Without provenance, output is suggestion, not intelligence.
Freshness — Can the system tell you when its knowledge was last verified? The difference between “verified this quarter” and “ingested three years ago” determines whether output reflects current conditions or historical snapshots.
Gap reporting — Can the system tell you what it doesn’t know? Knowledge gaps must be explicit, structured, actionable. Silence on limitations is not completeness. It is performed confidence.
Independent verification — Can the system check its own work using a different process? Self-consistency is not accuracy. Cross-model evaluation — where an independent process assesses whether claims are supported by evidence — is the emerging standard for trustworthy advisory AI.
Explicit assumptions — Can the system tell you what it assumed about your circumstances? Every advisory analysis rests on assumptions. When they are surfaced, they can be verified. When they are implicit, errors compound silently.
Application. A managing partner evaluating AI for the firm applies these five questions to any tool under consideration. The gaps reveal where advisory risk accumulates — not in the dramatic form of fabricated citations, but in the quieter form of confident output grounded in data the system cannot distinguish from current.
A client evaluating an advisor’s AI capability asks a single question: “Can your system tell me when the data underlying its recommendation was last verified, and what it assumed about my circumstances?” The answer distinguishes advisory intelligence from automated confidence.
A technology leader building advisory AI infrastructure maps each question to an architectural requirement: per-field provenance in the knowledge layer, temporal metadata on every data point, structured uncertainty fields in output schemas, cross-model evaluation pipelines, and assumption extraction built into the reasoning process.
The gap between performed confidence and structured uncertainty is architectural. It will not close with better prompts, larger models, or more data. It closes when the system is designed — from the ground up — to know what it doesn’t know, and to say so.