The AI product manager: a new role taking shape
Building products with AI components requires product managers to develop new skills, own new responsibilities, and apply different judgment than traditional software PM work demands. The role is evolving faster than most PM playbooks have caught up.
By Ramiro Enriquez
Product management as a discipline developed around a set of assumptions about how software works: it behaves deterministically, its capabilities are bounded by what engineers explicitly implement, and quality is assessed by whether the software does what it was specified to do. A form field either accepts valid input or rejects invalid input. A button either works or it does not. The product manager’s job, in this model, is to define what the software should do and verify that it does it.
AI components break most of these assumptions. An AI system does not behave deterministically: the same input can produce different outputs at different times. Its capabilities are not fully bounded by explicit implementation: the model’s behavior emerges from training in ways that can surprise both the engineers who integrated it and the product managers who specified the requirements. Quality is not a binary: the AI’s output exists on a spectrum from excellent to harmful, and where on that spectrum any given output falls is often a matter of judgment rather than specification.
Product managers working on AI-enabled products are discovering that their existing skills transfer partially but not completely. The parts that transfer are the fundamentals: understanding users, defining success criteria, prioritizing ruthlessly, communicating across technical and business stakeholders. The parts that do not transfer are the parts that depend on the deterministic model of software, and those parts need to be rebuilt on different foundations.
Owning quality, not just capability
In traditional software, quality assurance is typically someone else’s job. Engineers write tests. QA teams verify behavior. The product manager defines requirements and accepts or rejects delivered software against those requirements.
AI components require product managers to take a more active ownership role in quality definition. This is not because QA and engineering judgment is less important, but because quality for an AI component cannot be fully specified in advance. The range of inputs the AI will encounter in production is too wide to enumerate. The judgment calls required to assess whether an output is “good” are often deeply intertwined with the product’s value proposition and user context. Only the product manager has the combination of user understanding and product context required to specify what good looks like across this range.
This means product managers working on AI need to be able to define evaluation criteria, not just acceptance criteria. What makes a response helpful rather than unhelpful? What kinds of errors are tolerable and which are not? What is the quality floor below which the AI’s output damages the user experience? These are product questions with quality implications, and answering them is part of the product manager’s job in a way it was not before.
Managing probabilistic quality
One of the most disorienting shifts for product managers new to AI is moving from binary quality to probabilistic quality. Traditional software is either working correctly or it is not. AI components are working correctly some percentage of the time, with the percentage varying by input category, model version, and context.
This changes how product managers talk about readiness, communicate with stakeholders, and make deployment decisions. Saying “the feature is 94% accurate” requires context that “the feature works” does not: accurate at what, for which inputs, measured how, and what happens in the other 6%?
Product managers who have not developed fluency with probabilistic quality tend to communicate AI feature readiness in ways that create unrealistic expectations. They describe the good case as typical. They understate the frequency and impact of failure cases. They present aggregate accuracy numbers without the distribution details that would change stakeholders’ risk assessment.
The product managers who are most effective working on AI develop a habit of leading with the failure mode, not the capability. Before communicating what the AI can do, they communicate what it cannot do, when it tends to fail, and what happens to users when it does. This creates more calibrated stakeholder expectations and makes deployment decisions more honest.
Prioritizing differently when AI accelerates implementation
One of the structural changes AI tools are creating in product development is a compression of implementation time for many types of features. Features that would have taken weeks to implement can now be prototyped in hours. This changes the economics of product prioritization.
When implementation is slow and expensive, the product manager’s most important prioritization job is saying no: preventing the team from working on lower-value things when higher-value things need to get done. The bottleneck is implementation capacity.
When implementation is fast, the bottleneck shifts. Implementation moves faster than evaluation, validation, and the organizational work of deploying, communicating, and supporting a new feature. The product manager’s most important prioritization job becomes ensuring that the team does not ship faster than their ability to evaluate what they are shipping.
This requires product managers to develop and maintain evaluation infrastructure as a first-class concern, not a nice-to-have. The question “how will we know if this is working” is not downstream of “can we build this” anymore. It has to be answered before or in parallel with building, because the evaluation infrastructure needs to exist before the feature ships.
The feedback loop problem
Traditional product metrics are designed to measure user behavior: did they click, did they convert, did they retain? These metrics work well when the product’s value delivery is discrete and observable.
AI features often have value delivery that is harder to observe directly. When an AI assistant helps someone write a better email, the improvement in the email quality may not manifest in any observable downstream event. When an AI feature helps someone make a better decision, the decision’s outcome may not be attributable back to the AI’s contribution in any measurable way.
Product managers working on AI features need to design feedback loops that capture signal the AI’s contribution actually generates: not just downstream outcomes, but signals closer to the AI’s output. Was the AI’s suggestion accepted without modification? Was it edited before use? Was it ignored? Was it actively rejected? Did the user ask for a different output? These signals, aggregated at scale, give a picture of the AI’s quality that downstream metrics alone do not provide.
Designing these feedback loops is product work. It requires understanding what signals are detectable in the product context, what infrastructure is required to capture them, and how they map to quality dimensions that matter. Product managers who do not own this work tend to find themselves making decisions about AI feature quality based on limited signal, which produces poor calibration about what is and is not working.
Working with model uncertainty
Traditional software has a version. You can specify exactly which version of a dependency is in production, what it does, and how it behaves. AI models are less like dependencies and more like vendors: they have versions, but the behavior differences between versions are often emergent and not fully documented, and the model provider may update or retire versions on their own timeline.
This creates a new class of product management work: managing model lifecycle. When a model provider announces that a model version is being deprecated, the product manager needs to understand what the migration to the replacement means for the product’s behavior, evaluate the replacement against the existing evaluation criteria, and plan the migration in a way that preserves quality guarantees.
When a model update changes behavior in ways that affect the product, the product manager needs to detect that change (which requires the evaluation infrastructure described above), assess its impact, and decide how to respond. This is not entirely unlike managing a third-party API or library, but the nature of AI model behavior means changes can be subtle, widespread, and hard to detect without systematic evaluation.
What this adds up to for the role
The product managers who are most effective working on AI-enabled products are not a different type of person from the ones who excel at traditional software products. They have the same core skills: user empathy, clear communication, rigorous prioritization, and the ability to make decisions with incomplete information.
What they have added is a set of AI-specific capabilities: fluency with probabilistic quality, the ability to define and own evaluation criteria, skill at designing AI-specific feedback loops, and enough technical understanding of AI systems to have productive conversations about failure modes and model behavior without needing to go deep on the underlying machine learning.
These capabilities are learnable. They are also not yet widely understood as a distinct set of skills that product managers working on AI need to develop. Organizations that treat AI product management as ordinary product management with AI features added are likely to discover that the ordinary PM playbook produces predictable outcomes for the ordinary parts and unpredictable outcomes for the AI parts.
The role is taking shape in practice faster than it is being taught in PM communities. The product managers who develop these skills now are building an advantage that will compound as AI becomes a larger fraction of what software products do.
Zylver ships AI products: Forge, Signal, Agents, Flows, and Meter. View all products.
More from Zylver
What your board needs to know about AI
Boards are being asked to provide oversight on AI at a moment when most board members lack the background to evaluate what they are hearing. The gap between what boards need to know and what they typically get in management presentations is real and consequential.
How AI is changing customer service
Customer service is one of the business functions most visibly transformed by AI. The changes are happening faster than most organizations planned for, and the outcomes depend heavily on implementation decisions that are easy to get wrong.
How to scale AI adoption from one team to the whole organization
Getting AI to work in one team is a different challenge from scaling it across an organization. What worked for the first team often fails when applied elsewhere, and the failure mode is usually invisible until the expansion is already stalled.
Get insights like this delivered monthly.
No spam. Unsubscribe anytime.