Chapter 04 of 17
AI Competitive Positioning
When your competitor calls the same API you do, raw model capability is not a moat. Here's what actually is.
Overview
The Commodity Trap
Here's the competitive reality most product teams face right now: your company wants to add an AI feature, your primary competitor wants to add the same AI feature, and you're both going to call the same foundation model APIs to build it. Same OpenAI endpoint. Same Anthropic or Google Gemini API. Same underlying model capabilities.
So what's your moat?
This question, asked honestly before you commit to a build-and-release strategy, is one of the most important conversations a PM can lead. Companies that answer it poorly ship AI features that are roughly equivalent to competitors and compete on price, marketing, and distribution. Companies that answer it well build AI features that are genuinely hard to replicate — not because they have better models, but because they've built on an asset the model API itself cannot provide.
That asset is almost always proprietary data, proprietary workflow integration, or a network effect that improves with usage.
Build vs. Buy vs. Integrate: The PM's Decision Framework
Before any competitive positioning discussion, you need to make a fundamental architectural decision: are you building the AI capability, buying a packaged AI product, or integrating an AI API into your own product? These aren't technical decisions delegated to engineering. They have major product, commercial, and strategic implications that the PM needs to own.
Buying a Packaged AI Product
You purchase an AI-powered product from a vendor and use it as-is or with limited configuration. The vendor maintains the models, the infrastructure, the evaluation, and the quality.
Best when: The capability is adjacent to but not central to your core value proposition. The vendor is clearly best-in-class and unlikely to be displaced. Data privacy requirements allow data to flow to the vendor. Your competitive differentiation doesn't depend on this capability.
Watch out for: Vendor lock-in that makes switching painful if quality drops or pricing changes. The vendor expanding their product scope and becoming a competitor. Loss of product surface control that degrades your user experience.
Integrating an AI API (Prompting)
You connect to a foundation model API and craft the prompts, instructions, and workflow logic that shape its outputs. The model provider maintains the underlying intelligence; you maintain the product layer.
Best when: You need the flexibility of a general-purpose model with specific domain focus. Your differentiation is in how you present and integrate the AI output, not in the model's raw capability. You need to move fast and the prompting approach can achieve sufficient quality.
Watch out for: API pricing and availability you don't control. Model provider policy changes that affect what you can do. The risk that a competitor builds the same prompting pattern and you have no differentiation below the product surface.
Building Custom AI
You train, fine-tune, or develop AI models specifically for your use case. The model is yours, the training data is yours, the specialization provides capability that generic models can't match.
Best when: Your use case requires specialized capability that foundation models don't handle well. You have proprietary labeled data that represents a genuine advantage. The model's performance is central to your competitive differentiation. You have the ML engineering resources to do this properly.
Watch out for: The cost and time — often 3–5x the investment of prompting approaches. The ongoing maintenance burden of model versioning, retraining, and evaluation. The risk of falling behind foundation model improvements that happen faster than your custom development cycle.
The Decision Matrix
| Factor | Buy | Integrate API | Build Custom |
|---|---|---|---|
| Time to first value | Weeks | Weeks–months | Months–years |
| Ongoing cost | License fee (predictable) | Usage-based (scales with volume) | Infrastructure + ML team (fixed + variable) |
| Data control | Shared with vendor | Shared with provider | Fully owned |
| Quality control | Vendor-dependent | Prompt-layer control | Full control |
| Competitive differentiation | Low (competitors can buy same product) | Medium (depends on workflow integration) | High (if data asset is proprietary) |
| Switching cost | Medium–high | Low–medium | Very high |
| Best for | Adjacent capabilities | Core features with speed priority | Core differentiating features with data advantage |
A practical rule of thumb: start with API integration for most features. Move to custom builds only when you have (1) a meaningful proprietary data advantage and (2) clear evidence the API approach can't reach your quality threshold. Buy for capabilities that are clearly commodity and where owning them doesn't serve your competitive strategy.
Evaluating AI Vendor Claims
When buying a packaged AI product or evaluating an AI API, you'll encounter vendor claims ranging from accurate to optimistic to technically misleading. Here's how to evaluate them without deep ML expertise.
The Benchmark Question
Vendors love to cite benchmark scores. Before accepting them as meaningful:
Is the benchmark representative of your use case? Industry benchmarks measure general capability, not the specific task your feature performs. An LLM that scores well on general reasoning benchmarks may perform poorly on your domain documents.
Is the evaluation methodology sound? Some benchmarks are contaminated — the model was trained on data overlapping with the test set. Ask how the benchmark was constructed and whether the vendor can demonstrate results on a fresh evaluation set.
What is it actually measuring? Accuracy on a balanced test set is different from accuracy on the long-tail edge cases your users will actually submit. Ask to see performance on adversarial or edge case examples.
The Demo Question
Vendor demos are their best-case scenario. Ask:
What data is the demo using? Carefully curated example data or production-representative data? Ask to run the demo on your own examples.
What fails? Ask explicitly: "Show me a case where this doesn't work well." A vendor who can't show you failure modes hasn't tested their own product. A vendor who shows you failure modes honestly and explains the mitigations is significantly more credible.
What's the latency in production? Demo environments aren't production-representative on latency. Ask for production latency percentiles (P50, P95, P99), not averages.
The Reference Question
References are your best source of unfiltered signal. Ask for references specifically in your industry or use case. Ask them:
- What were the first 90 days like? Did quality meet expectations?
- What failure modes have you encountered in production that weren't obvious in the demo?
- How has the vendor responded to quality issues?
- How has pricing evolved since you started?
- Knowing what you know now, would you make the same buy decision?
The answers to the last two reveal vendor trustworthiness in ways that demos and benchmarks never will.
Moat Analysis: What's Defensible?
In traditional software, moats come from network effects, switching costs, brand, and distribution. Those still apply in AI — plus one additional dimension: quality advantage that accumulates from proprietary data and feedback loops.
The Four AI Moat Sources
Proprietary data. Your AI is trained on, retrieves from, or is fine-tuned with data competitors don't have access to — historical transaction data, user behavior data, proprietary document libraries, or labeled training data you've invested significantly in creating. The test: if a competitor deployed the same model architecture with different data, would they reach the same quality? If no, that's a real moat.
Feedback loops. Your AI improves as users interact with it, and that improvement compounds. Every correction a user makes, every rating, every implicit quality signal — if you're capturing and using that to improve the model, you're building an advantage that grows with usage. Competitors who start later start with less signal. This is the data flywheel, and it's genuinely defensible when it works.
Workflow integration depth. Your AI is embedded in the workflow at a depth that would require significant switching costs to replicate. It knows the user's history, preferences, and context because it's integrated with your CRM, your document repository, your communication tools. A competitor's AI starts from scratch. The integration is the moat, not the model.
Domain specialization. Your AI has been fine-tuned, prompted, or evaluated specifically for a domain where generic models underperform. You've invested in domain expert labeling, domain-specific evaluation criteria, and iterative quality improvement in a narrow area that's hard to replicate quickly. Foundation models increasingly have broad domain knowledge — the moat is the evaluation infrastructure and quality standard you've built for the specific task.
The Moat Analysis Table
| Moat Source | Do You Have It? | How Strong? | How Long to Replicate? |
|---|---|---|---|
| Proprietary data | Yes / Partial / No | Strong / Moderate / Weak | [Timeframe estimate] |
| Feedback loops | Yes / Planned / No | Strong / Moderate / Weak | [Timeframe estimate] |
| Workflow integration | Yes / Partial / No | Strong / Moderate / Weak | [Timeframe estimate] |
| Domain specialization | Yes / Partial / No | Strong / Moderate / Weak | [Timeframe estimate] |
A feature with no moats — one any competitor could build to equivalent quality in 60 days using the same API — is a feature parity play. Necessary, maybe. Worth disproportionate investment, no. A feature with strong moats in two or more categories is genuinely defensible and worth building around.
First-Mover vs. Fast-Follower in AI
The conventional wisdom in software: first-mover advantage is real but often overstated. Distribution, execution, and resources matter more than who ships first. In AI features, this dynamic is more nuanced.
When First-Mover Matters More
Feedback loop moats. If your feature benefits from a data flywheel — quality improves as more users interact with it — then being first gives you a compounding advantage. Every month of production usage is a month of quality improvement a later entrant doesn't have.
Trust establishment. Users' trust in AI features is partly a function of category experience. If your product is the first AI tool a user trusts for a specific category of task, you benefit from anchoring effects. Users develop mental models around your feature's behavior and compare competitors to you, not to their pre-AI baseline. This is real in high-trust domains — medical, legal, financial — where user skepticism is elevated.
Dataset creation windows. In some domains, the training and evaluation data you can create early is difficult for later entrants to replicate because the raw signal is time-bounded. Historical conversations, historical market data, historical events — being first means you have data that was created under conditions that no longer exist.
When Fast-Follower Wins
Foundation model rapid improvement. The models underlying most AI features are improving fast. A feature built on a less capable model in 2023 can be replicated and exceeded by a fast-follower building on a more capable 2025 model with less effort. First-mover advantage evaporates when the technology improvement rate is faster than the moat accumulation rate.
Learning from first-mover mistakes. Early AI feature launches are often cautious, under-featured, or calibrated for the wrong user segment. Fast-followers observe what works, what fails, and what users actually need — and build a better version of the category without the exploratory investment.
Avoiding adoption friction. First-movers in AI categories often face the hardest user education challenges. Users don't yet know how to use the category, what to expect, or how to integrate it into their workflow. Fast-followers enter when the category is established and users have been educated, partly by the first-mover's investment.
The Practical Implication
The relevant question isn't "should we be first?" It's "will the moats we build as first-mover compound faster than our competitors can replicate them?" If the primary moat is a feedback loop and your feature will have meaningful scale within 12 months, being first matters. If the primary moat is workflow integration and your competitor has better distribution to the integration points, being second with better integration may be the stronger position.
Positioning When Everyone Uses the Same Model
If your feature and your competitor's both call the same foundation model with similar prompts and produce similar outputs, differentiation has to come from elsewhere.
Position on context depth. Your AI knows more about the user, the workflow, and the specific domain because you've invested in integration infrastructure that provides that context. Your competitor's AI only knows what the user typed today.
Position on reliability and safety. Many AI features are roughly equivalent in average-case quality. Differentiation comes from how they handle edge cases, adversarial inputs, and failure modes. If you've invested in robust evaluation, conservative outputs in high-stakes situations, and transparent uncertainty communication, you can position on trustworthiness rather than raw capability.
Position on workflow integration. The AI is a component of a workflow product, not a standalone tool. Your differentiation is that the AI output feeds directly into the next step without a copy-paste journey, and the workflow provides the feedback mechanism that makes the AI better over time.
Position on the human-plus-AI combination. The feature isn't a replacement for human judgment — it's a force multiplier for the expertise your users already have. Position on the quality of the human-AI collaboration your product enables, not on the AI's standalone capability.
The most durable positioning builds context depth, workflow integration, and reliability standards that make your AI feature genuinely harder to replicate — regardless of which model is underneath it.