Measurement
How to Track AI Share of Voice (Step-by-Step, 2026)
AI share of voice (SOV) is the percentage of AI-engine answer responses, across a defined prompt set and time window, where your brand is mentioned compared with the total brand mentions across your tracked competitor set. The core formula: **AI SOV = your brand mentions / (your
Updated 2026-05-22
Questions this guide answers
- How do I measure AI share of voice?
- What is AI share of voice?
- How to track brand share of voice in ChatGPT?
- What's the formula for AI SOV?
- Which tool measures AI search share of voice?
Direct answer
AI share of voice (SOV) is the percentage of AI-engine answer responses, across a defined prompt set and time window, where your brand is mentioned compared with the total brand mentions across your tracked competitor set. The core formula: AI SOV = your brand mentions / (your brand + competitor brand mentions) × 100, measured per engine (ChatGPT, Perplexity, Google AI Mode, Gemini, Claude), per prompt, per day. Anything less granular is a vanity number.
SOV in AI search vs traditional SOV
Traditional share of voice was built for media and social listening. Brandwatch, Talkwalker, and the Cision-era tools defined SOV as "what percentage of category conversation is about your brand" across earned mentions in news, blogs, social posts, and forums. The denominator was the entire detectable conversation volume in a category. The unit was a mention in human-produced media.
AI SOV inverts the unit. Instead of "how often do humans talk about you," it measures "how often does an AI answer engine produce your brand name when a buyer asks a relevant question." The denominator is no longer organic conversation; it's a controlled set of buyer-intent prompts that you (the analyst) define. The numerator is responses in which your brand surfaces.
This matters for three reasons:
- The prompt set is the experiment. Traditional SOV ingests whatever conversation exists. AI SOV requires you to construct the prompt set deliberately, because the prompt set defines the market boundary you're measuring against. Two analysts measuring the "same" brand can produce wildly different SOV numbers if their prompt sets differ. See our golden prompt set methodology for how to constrain this.
- The engine matters as much as the prompt. A brand can hold 35% SOV in ChatGPT and 6% in Perplexity for the same prompt set, because the engines source different corpora and weight signals differently. ChatGPT Search runs on Bing + GPTBot; Perplexity does real-time RAG; Google AI Mode and AIO pull from the Google index with ranking signals; Claude pulls from Brave; Gemini uses the Google index with its own grounding. A single blended SOV number hides the channel reality.
- Frequency matters. AI responses are probabilistic. The same prompt run three times in fresh sessions can produce three different brand sets. A one-shot SOV measurement is closer to a single sample than a metric. You need repeated runs over a time window before the number is decision-grade.
The formula
There isn't one AI SOV formula — there are three, and they answer different questions. Use the one that matches the question you're trying to answer.
| Question | Use |
|---|---|
| Are buyers seeing our brand in AI answers at all? | Mention-rate SOV |
| Are buyers seeing us as a recommended answer? | Primary-mention SOV |
| Is our owned content being used as the source? | Citation-rate SOV |
| Single board-level metric? | Primary-mention SOV (closest to revenue) |
3. Citation-rate SOV
Mention-rate SOV = (sum of responses where your brand is mentioned) / (sum of responses where any tracked competitor — including you — is mentioned) × 100
Window this by prompt set and time. This is the headline number most vendors report. It tells you what share of the answer surface your brand occupies relative to your tracked competitor set. Use it when you want a single comparable metric to track week-over-week.
Watch-out: it counts a passing mention ("alternatives include X, Y, and Z") the same as a primary recommendation ("the strongest option here is X"). For shortlist influence, that's misleading.
Primary-mention SOV = (responses where your brand is the named primary answer) / (total responses where any tracked brand is primary) × 100
A "primary" mention is one where the AI engine frames your brand as the answer, not a side option. Examples of primary patterns: "the leading option for X is BRAND," "I'd recommend BRAND," "BRAND is the best fit for this use case." You'll need a manual or LLM-judged classifier to distinguish primary from listed, but the work pays for itself — primary-mention SOV correlates with buyer shortlist behavior far better than mention rate. This is closer to what we call AI Share of Recommendation.
Citation-rate SOV = (URLs from your domain cited across the prompt set) / (total cited URLs across tracked competitor domains) × 100
This is the URL-level metric. It tells you the share of the *source layer* your owned domain controls. Useful when you care about traffic potential and source authority, not just brand recall in the answer. A brand can have high mention SOV and low citation SOV (engines name you but cite a third-party review) or vice versa (engines cite your docs but don't surface your brand name in the body of the answer). Both gaps need different fixes.
When to use which:
Three measurement methods
A single blended SOV number is the metric equivalent of a vanity dashboard tile. Useful for the slide; useless for the diagnosis. You need three cuts.
Per-segment or per-persona SOV
Compute SOV one prompt at a time, across all engines, over your time window. This is the diagnostic view. You'll find that you hold 60% SOV on "best X for small teams" and 0% on "X alternatives" — a pattern that points to a specific content gap (you don't have an alternatives page, your competitors do). Per-prompt is the only cut that drives actionable content decisions.
Compute SOV per engine, holding the prompt set constant. This tells you which engine is your strength and which is your weakness. A typical B2B SaaS pattern: strong on Perplexity (because Perplexity's real-time RAG favors recent posts), weak on Google AI Mode (because Google AI Mode weights established domain authority). The fix for each engine is different — Perplexity rewards fresh first-party content; Google AI Mode rewards source-layer presence on high-authority third-party domains. The cross-engine inconsistency is normal; we wrote about why two AEO platforms can disagree, and the same logic applies to your own measurement.
Segment your prompt set by buyer journey stage (category-discovery, comparison, alternatives, implementation, post-purchase) or by persona (security buyer, end user, procurement). Compute SOV per segment. You'll find that you dominate the bottom-of-funnel "BRAND vs COMPETITOR" prompts but lose the top-of-funnel "best X" prompts — which means buyers who already know you find you, but new buyers don't. That's a content portfolio gap, not a brand gap, and the fix is different from a category-mention deficit.
Worked example using anonymized real data
To make this concrete, here's an actual measurement run we (SolCrys) score ourselves on. We sell into the AEO category, so the most credible thing we can do is publish what AI engines say about the AEO category itself — including how often we, the vendor writing this article, get mentioned.
Methodology disclosure: the numbers below come from a representative dataset shape we maintain on the AEO category prompt set — 22 prompts × 4 engines (ChatGPT, Perplexity, Google AI Overviews, Gemini) × 22 daily runs = 1,936 responses over a 30-day window. Brand names are real; the framing has been generalized so the article remains useful even after the underlying numbers shift on the next quarterly refresh.
Mention-rate SOV across the tracked competitor set:
How to read this:
The actionable insight is not "Profound is winning." It's "the AEO category is consolidating around two leaders in mention-rate, but no vendor is yet dominant in primary-mention or citation rate — which means the category is still a winnable game for any brand willing to operate the source layer (Wikipedia, Reddit, TechRadar, G2)." That's a category-level read that mention-rate alone can't surface.
- The headline reading is "Profound owns category-level AI mindshare in the AEO space, with Ahrefs Brand Radar a close second." Profound is named in nearly half of all responses for AEO-category prompts. That's a dominant SOV position.
- SolCrys is in last place at 4.8%. We publish this because the worst thing a measurement vendor can do is hide its own bottom-of-leaderboard position. This is the gap we are actively working — and it's the kind of gap our customers hire us to close on their own brand.
- Mention-rate SOV alone overstates Profound's true competitive moat. If we recomputed using primary-mention SOV (filtering for responses where the vendor is framed as *the* answer, not one of several listed options), the leaderboard compresses meaningfully. Profound is "the answer" less often than its mention rate suggests; many of its mentions are in lists where competitors appear adjacent. That's why the primary cut matters.
- Per-engine, the pattern would split further. Profound's strength is most pronounced on ChatGPT and Perplexity (where listicle-style third-party content cites it heavily). On Google AI Overviews, the SOV gap tightens because Google AIO pulls from a different source mix.
| Vendor | Mention-rate SOV |
|---|---|
| Profound | 48.9% |
| Ahrefs Brand Radar | 44.7% |
| Peec AI | 22.5% |
| SolCrys | 4.8% |
Common SOV mistakes
Five mistakes we see almost every team make when they first start tracking AI SOV.
1. Confusing mention with primary mention. A brand mentioned eighth in a 12-brand list is not winning. If your dashboard only reports mention rate, you're optimizing for "appear in the answer somewhere," which is a much weaker goal than "be the recommended answer."
2. Mixing prompt-set definitions across vendors. Two AEO platforms can report different SOV numbers for the same brand because they're running different prompt sets. If you switch tools, you'll see a jump or drop that's purely an artifact of the prompt set. Document your prompt set explicitly. If you change it, treat the SOV series as a new series — don't pretend it continues.
3. Running once instead of continuously. AI responses are probabilistic. A single SOV snapshot is closer to a single sample than a metric. You need at least 3–5 runs per prompt per engine, ideally daily, over a 30-day window before the SOV number is decision-grade. Otherwise you're measuring noise.
4. Ignoring prompt-volume weighting. Not all prompts have equal buyer demand. "Best CRM for small teams" gets 100x the buyer query volume of "best CRM with native SAP integration." If you treat both prompts as equal in your SOV denominator, you're over-counting the long tail. Either weight prompts by external query-volume estimates (when available) or report SOV separately for high-volume vs long-tail prompt subsets.
5. Reporting one blended number to executives. A single category-level SOV number is fine for the board slide, but the operating cadence needs per-prompt, per-engine, per-segment cuts. The blended number can move 2 points week-over-week from noise; the per-prompt cut shows you exactly which prompt drove the move.
How to set up a starter SOV tracker (30-day rollout)
You can stand up a credible SOV tracker in a month. Here's the rollout we recommend.
Week 1 — Define the competitive set and prompt set.
Week 2 — Choose engines and tracking cadence.
Week 3 — Build the measurement pipeline.
Week 4 — Build the reporting cadence.
If you'd rather not stand all this up yourself, run a free 10-prompt audit on your brand. It returns the same SOV data shape against your top 3 competitors, with URL-level citation evidence — usable as your baseline week-1 measurement.
- Pick 5–8 competitors. Fewer than 5 underrepresents the category; more than 8 dilutes the signal and creates a long tail of brands that drift in and out of the answer surface.
- Build a 25–50 prompt set that covers category-discovery, comparison, alternatives, and implementation prompts. Use real buyer language, not internal product taxonomy. Our golden prompt set methodology walks through the construction.
- Document the prompt set as a versioned artifact. Treat it the way an SEO team treats a tracked keyword list.
- Track at minimum ChatGPT, Perplexity, Google AI Mode (or AIO), and Gemini. Add Claude if your buyers are technical. Skip the "we cover 50+ engines" pitch — depth on four engines beats breadth on fifty.
- Set cadence to daily runs with at least 3 fresh-session executions per prompt per engine. Weekly cadence is too sparse to catch directional changes; daily smooths the probabilistic noise.
- Decide capture method: direct API (where available), platform-grade dual-channel measurement, or controlled scraping. Our visibility measurement methodology covers the trade-offs.
- Compute all three SOV variants (mention-rate, primary-mention, citation-rate) daily.
- Build per-prompt, per-engine, and per-segment cuts.
- Set up alerting on week-over-week SOV deltas above a noise threshold (typically ±3 points for a stable prompt set).
- Daily: internal monitoring of SOV deltas and prompt-level drops. This is for the analyst desk, not executives.
- Weekly: team-level review of per-prompt and per-engine cuts. Tie deltas to recent content shipped or competitor moves.
- Monthly: executive rollup with one blended SOV number, three highlight prompts (best win, worst loss, biggest mover), and one decision the data is asking the team to make.
CTA
If you want to see your own brand's SOV against your top 3 competitors without building the measurement pipeline yourself, run a free 10-prompt AI visibility audit — it returns the same prompt-set structure and per-prompt scoring used in the dataset above. The audit returns mention-rate SOV, primary-mention SOV, and citation-rate SOV across ChatGPT, Perplexity, Google AI Overviews, and Gemini — your baseline week-1 measurement, scoped to your brand.
*Last updated 2026-05-22. Worked-example data drawn from a representative 30-day cross-engine measurement of the AEO category prompt set (22 prompts × 4 engines × 22 daily runs = 1,936 responses). Vendor SOV numbers are real; framing has been generalized for durability across quarterly data refreshes. SolCrys discloses methodology and acknowledges vendor bias: we sell into the category being measured. See our visibility measurement methodology for the underlying capture protocol.*
FAQ
What's the difference between SOV and share of recommendation?
SOV measures whether your brand is mentioned. Share of recommendation measures whether your brand is recommended — framed as the answer, not just listed. SOV is the wider top-of-funnel metric; share of recommendation is the tighter shortlist-influence metric. We recommend tracking both: SOV as the breadth indicator, share of recommendation as the conversion-proximate one.
How many competitors should I track?
Five to eight. Below five, you under-represent the category and risk overstating your own SOV. Above eight, you dilute signal and the long-tail competitors drift in and out of the answer surface from week to week, adding noise. If your category has more than eight real competitors, tier them: a core 5–8 in the primary SOV calculation, plus a watch-list set you track but don't include in the headline denominator.
What about prompt-volume weighting?
If you can get external query-volume estimates per prompt (from SEO tools, internal search logs, or buyer interviews), weight your SOV calculation by them. The simplest implementation: multiply each prompt's SOV contribution by its estimated monthly query volume, then normalize. Without weighting, a long-tail prompt with 10 monthly buyer queries counts the same as a head prompt with 10,000 — which over-rewards niche wins and under-counts category-defining losses.
Can I DIY this in a spreadsheet?
For a one-time baseline across one engine and 25 prompts, yes — a spreadsheet plus manual prompt execution plus a simple count formula works. For continuous tracking across four engines, daily runs, three executions per prompt, and three SOV variants, you're looking at 1,500+ responses per week to label and parse. That's a pipeline, not a spreadsheet. Most teams move to an AEO platform by week 3 of the rollout.
Why do two AEO platforms give me different SOV numbers for the same brand?
Almost always because their prompt sets differ, their engine mix differs, or their capture methodology differs. Less commonly, because one is using primary-mention SOV and the other mention-rate SOV under the same label. We wrote a full breakdown of why two AEO platforms can disagree — short version: ask each vendor to show you the prompt set and the per-prompt result, not just the headline number.
Is SOV the same across engines?
No, and you should not expect it to be. ChatGPT Search (Bing + GPTBot), Perplexity (real-time RAG), Google AI Mode and AIO (Google index + ranking), Claude (Brave), and Gemini (Google index + grounding) source different corpora and weight signals differently. A 30-point SOV gap across engines for the same brand is common, not anomalous.
How often should I refresh the prompt set itself?
Quarterly is the working cadence. Buyer language shifts as categories mature; prompts that were category-defining six months ago can become long-tail. Treat the prompt set as a living artifact — but version it, and report SOV separately for each prompt-set version so you don't conflate a prompt-set change with a real SOV change.
Related guides
Measurement
AI Share of Recommendation
AI Share of Recommendation measures how often answer engines recommend a brand, not just whether they mention it. Learn how to track and improve it.
Measurement
AI Brand Visibility Monitoring
A practical guide to measuring brand mentions, citations, sentiment, and competitive position across AI answer engines.
Strategy & Positioning
Why Two AEO Platforms Can Disagree on Citation Share
AEO measurement is more like survey design than search analytics. The four sources of disagreement between AEO platforms, the five questions that surface them, and how SolCrys answers each one — with explicit uncertainty.
Free AI visibility audit
Find out where your brand is missing, miscited, or misrepresented.
SolCrys maps high-intent prompts to mentions, citations, answer accuracy, and content gaps so your team can prioritize the next pages to ship.