Skip to main content
Back to blog
7 min read

How to think about AI risk in your organization

Most organizations either overestimate AI risk (paralysis) or underestimate it (blind deployment). A calibrated approach to AI risk is not about building compliance frameworks. It is about understanding which failures actually matter and designing proportionate mitigations.

By Ramiro Enriquez

AI risk conversations in organizations tend to cluster at two extremes. One is paralysis: a risk assessment process that produces so many categories of potential harm that deployment becomes politically impossible. The other is blindness: treating AI systems like traditional software and discovering that AI failures have different properties than the failures the organization has procedures for.

Neither approach serves the organization well. Paralysis means forgoing real value while competitors who are less cautious move ahead. Blindness means deploying systems that fail in ways that damage user trust, operational quality, or in serious cases the people the organization serves.

The calibrated middle path is not a compliance exercise. It is a systematic way of asking which failures matter most, how likely they are, how detectable they are, and what mitigations are proportionate to the actual risk profile.

Why AI risk is different from traditional software risk

Traditional software fails in ways that are generally detectable and discrete. A function throws an exception. A query returns an error. A page fails to load. These failures are visible, reproducible, and fixable. The risk management question is: how do we prevent defects, and how do we catch and fix them when they occur?

AI systems fail differently. The most common AI failure mode is not an exception but a wrong answer delivered confidently. The output looks correct. It is formatted correctly. It is responsive to the input. It is wrong. The user may not know it is wrong. The system does not know it is wrong. The failure is invisible until something downstream goes wrong, and by then it may be hard to attribute to the AI system.

A second difference is that AI failures are probabilistic rather than deterministic. A traditional system either handles an input correctly or it does not, reliably. An AI system handles most inputs of a given type correctly and handles some percentage of them incorrectly, in ways that are difficult to predict in advance. This means that testing cannot provide the same assurance it provides for traditional software. You can verify that a system works on a sample; you cannot verify that it works on all inputs.

A third difference is distributional shift: AI systems trained and evaluated on one distribution of inputs may perform differently as the distribution changes. A system that works well at launch may gradually work less well as the user population changes, without any change to the system itself. This creates a risk category that does not have a direct analog in traditional software: quiet degradation over time.

A practical risk framework

Instead of a taxonomy of possible harms (which tends to produce paralysis), a useful framework asks three questions about each AI application.

How bad is a failure? Consequence severity varies enormously across AI applications. An AI that recommends a slightly suboptimal marketing subject line has low consequence severity. An AI that informs a clinical decision has high consequence severity. An AI that automates a customer communication has moderate consequence severity depending on what the communication is about. Consequence severity determines how much investment in quality and monitoring is justified.

How detectable is a failure? Some AI failures are immediately detectable. A user asks a question, gets a response, checks it against their knowledge, and notices it is wrong. The failure is visible in real time. Other failures are not detectable until something downstream fails, which may be days, weeks, or months after the failure occurred. An AI that produces subtly incorrect data that feeds into a report is not detectable until someone reads the report carefully. Detectability determines how much of the risk management burden falls on preventing failures versus catching them.

What is the failure rate for this input distribution? A failure rate of one percent is acceptable for some applications and unacceptable for others, depending on volume and consequence. An AI that processes a thousand requests per day with a one percent failure rate produces ten failures per day. Whether that is acceptable depends entirely on what those failures are and who experiences them.

Mapping AI applications across these three dimensions produces a risk profile that is actually useful for prioritization. High consequence, low detectability, high failure rate: this application needs significant investment in quality before deployment and ongoing monitoring after. Low consequence, high detectability, low failure rate: deploy and monitor, but do not over-engineer the safety infrastructure.

The asymmetry of AI failure types

Not all AI failures carry the same organizational risk. Two failure types are worth distinguishing.

Quiet failures accumulate slowly and become visible only when they have already done significant damage. The AI system that gradually becomes less accurate as the input distribution shifts is a quiet failure. The AI feature that most users have stopped trusting but are too polite to report is a quiet failure. Quiet failures require proactive monitoring to detect. Organizations that do not monitor AI quality in production do not know about their quiet failures until they surface as user complaints, operational problems, or trust collapse.

Visible failures surface immediately and produce an organizational response. The AI output that is obviously wrong generates a support ticket or a social media post. The AI system that produces an offensive response gets escalated. These failures are unpleasant but they are self-correcting: the organization becomes aware of them quickly and responds. The risk management challenge for visible failures is having a clear response process.

Most AI risk management focuses on visible failures because they are the ones that generate news coverage and internal escalations. Quiet failures are more dangerous in practice because they degrade quality without triggering response, and by the time they become visible the damage has accumulated.

The organizational capacity risk

There is a risk category that does not appear in most AI risk frameworks but is often the most relevant one for organizations in the early phases of AI adoption: the risk of building more AI than you can monitor.

Each AI system deployed in production requires ongoing quality monitoring to catch quiet failures, a process for routing failure reports back to people who can act on them, and periodic evaluation to verify that quality has not degraded. These are operational commitments, not one-time investments.

Organizations that deploy AI faster than they build monitoring and evaluation capacity end up with a portfolio of AI systems in unknown states. Some are performing well. Some have quietly degraded. Some are being used in ways they were not designed for. Without monitoring, the organization cannot distinguish between them.

The risk management implication is: before deploying a new AI application, verify that you have the monitoring capacity to maintain it. This is not an argument against deployment. It is an argument against deploying into a monitoring vacuum.

Proportionate mitigation

Risk assessment is only useful if it informs mitigation design. For AI applications, the proportionate mitigations depend on the risk profile.

For high-consequence applications, the primary mitigation is human oversight: designing the system so that AI outputs inform human decisions rather than replace them, with clear protocols for when humans should override the AI and how overrides are tracked and reviewed. Human oversight adds cost and latency but provides a catch mechanism for the failures that matter most.

For low-detectability failures, the primary mitigation is proactive monitoring: sampling production outputs regularly for quality review, establishing baseline quality metrics that alert when they degrade, and building feedback mechanisms that route production failures back to the development team.

For high-failure-rate inputs, the primary mitigation is scope control: routing inputs that are outside the system’s reliable zone to humans rather than forcing the AI to handle them. This requires having enough visibility into the input distribution to know which inputs are in the reliable zone and which are not, which in turn requires evaluation.

The mitigations are not mysterious. The challenge is applying them proportionately: investing heavily in mitigation for high-risk applications and avoiding over-engineering for low-risk ones. Getting that calibration right requires actually doing the risk assessment rather than treating all AI applications as equally risky or equally safe.

Starting the conversation

Most organizations do not have a systematic process for assessing AI risk before deployment. Starting one does not require a large investment. It requires asking three questions about each AI application before it ships: what are the most likely failure modes, what happens when those failures occur, and how will we know if they are occurring at a rate or severity that requires response?

Teams that answer these questions before deployment make better design decisions during development and respond faster when problems surface in production. Teams that do not answer them tend to discover the answers in the worst possible way.

Zylver ships AI products: Forge, Signal, Agents, Flows, and Meter. View all products.

Get insights like this delivered monthly.

No spam. Unsubscribe anytime.