AI Search Tools

How to track brand visibility in ChatGPT — a 6-step operator's guide

Q: What metrics should I track for AI search visibility?

Three: **mention rate** (out of N prompts, how often does ChatGPT name your brand?), **share of voice** (your mention count divided by total brand mentions across the same prompt set), and **citation sources** (which URLs ChatGPT cited, tagged into Owned/Editorial/UGC/Competitor buckets). Mention rate is the AI-search equivalent of impressions; share of voice travels across time even when output style shifts; citation sources tell you where to invest content effort.

Q: What's the difference between mention rate and share of voice?

**Mention rate** is absolute — out of N prompts, the percentage where your brand was named at all. **Share of voice** is relative — of all brand mentions across the same prompt set (yours + competitors), the percentage that are yours. Mention rate moves with prompt selection and engine drift; share of voice is more comparable across time because it normalizes for absolute output volume.

Tracking your brand's visibility in ChatGPT is a measurement discipline, not a one-time scan. The job has six concrete steps: (1) define a buyer-intent prompt set, (2) run it against ChatGPT, (3) extract mention rate and share of voice, (4) log citation sources, (5) set a tracking cadence, (6) close the gaps with content. You can do every step manually with a spreadsheet and 90 minutes a month — or automate it once 30+ prompts and multi-engine coverage make the math break. This guide walks through both paths, then names where a tool earns its price. Run a 5-minute free audit for a baseline →

Updated 2026-05-26

Questions this guide answers

How do I track my brand's visibility in ChatGPT?
What's the best way to track brand mentions in ChatGPT?
Can I track ChatGPT visibility manually or do I need a tool?
How often should I track ChatGPT visibility?
What metrics should I track for AI search visibility?
What tools should I use to track brand visibility in ChatGPT?
How do I measure share of voice in ChatGPT?
How many prompts should be in my ChatGPT tracking set?

Direct answer

Tracking brand visibility in ChatGPT means running a fixed set of buyer-intent prompts through ChatGPT on a regular cadence and measuring three signals every time: how often ChatGPT names your brand (mention rate), what percentage of all brand mentions on the same prompt set are yours (share of voice), and which URLs ChatGPT cited when it formed the answer (citation sources). The six steps below are the same whether you do it manually in a spreadsheet or automate it with a tool.

If you want a baseline before you commit to a methodology, the SolCrys free audit runs 10 buyer prompts through ChatGPT and returns mention rate, share of voice, and citation sources in five minutes. The rest of this guide is the operator's view: how to define the prompt set, score the answers, and decide when manual tracking stops earning its hours back.

Step 1 — Define a buyer-intent prompt set

The first measurement decision is which prompts you run. Get this wrong and every other step is wasted effort. The two most common mistakes: (a) tracking ego queries ("What is [Brand]?") instead of how buyers actually research, and (b) tracking too few prompts (under 10) so the noise floor swallows the signal.

Aim for 30 prompts as the working minimum, split across three buyer intents:

Discovery (10 prompts) — top-of-funnel research the buyer runs before they have a brand in mind. Examples: "best [category] for [persona]", "how to solve [problem]", "who makes [product type]". These prompts test whether ChatGPT recommends you at all when the buyer doesn't know to ask for you.
Comparison (10 prompts) — mid-funnel evaluation, where buyers compare two or three named candidates. Examples: "[Your brand] vs [Competitor 1]", "alternatives to [Competitor 2]", "[Competitor] pros and cons". These prompts test whether ChatGPT positions you correctly against named alternatives.
Decision (10 prompts) — bottom-funnel purchase intent. Examples: "pricing for [category]", "best [category] for [specific use case]", "is [Your brand] worth it for [use case]". These prompts test whether ChatGPT closes the buyer on you when intent is high.

Step 2 — Run the prompts (manually the first time)

Run the full prompt set through ChatGPT manually the first time, even if you intend to automate later. Why: you'll see exactly what the consumer-facing engine does — which prompts trigger web search, which produce stale training-data answers, which the model refuses to answer at all. That qualitative read is the calibration you need before you trust any tool's automated score.

Concrete procedure:

Use the consumer surface, not the API. Run from chat.openai.com (logged in is fine; logged out works too — note which one you used so you can stay consistent). The API uses a different model and has web search disabled by default, so API runs will not match what real buyers see.
One new conversation per prompt. Open a fresh chat for each prompt so prior conversation context never leaks across answers. Conversation memory contaminates measurement.
Capture the full response text, the cited sources, and the timestamp. Copy each answer into a spreadsheet row, including any inline citations the model provided. Timestamp matters because engine outputs drift week to week.
Plan ~90 minutes the first time. Thirty prompts × ~90 seconds each (read + copy + paste + tag) = roughly 90 minutes. Subsequent runs go faster because your spreadsheet template is built.

Step 3 — Score each answer on three signals

Once the responses are captured, score each one on the three signals that matter for AI search. These are the AI-search equivalent of impressions, clicks, and referring sources for SEO — different mechanics, same purpose: a small, stable set of comparable numbers.

Mention rate — out of N prompts, how many mention your brand by name at all? Ignore qualifiers ("could be a fit", "some people use") at this step — count any non-zero mention as a mention. Mention rate is the AI-search impression metric: if ChatGPT never names you, no downstream metric matters.
Share of voice — out of all brand mentions across the same N prompts, what percentage belong to you vs each named competitor? Share of voice is a relative metric, so it travels across time even when ChatGPT changes its absolute output style. Aim to track 3-5 named competitors plus an "other" bucket.
Citation sources — when ChatGPT names your brand or a competitor, which URLs is it citing? Tag each cited URL into one of four buckets: Owned (your or competitor's domain), Editorial (third-party journalism — Forbes, TechCrunch, niche trade press), UGC (Reddit, Quora, review sites, forums), Competitor (competitor's owned content driving your mention indirectly). Citation source mix is what tells you *where* to invest content effort to actually shift the mention rate.

Step 4 — Set a tracking cadence

How often you re-run the prompt set is a trade-off between measurement noise and operator effort. Three reference cadences:

Monthly — the minimum useful cadence for manual tracking. Engine output drifts slowly enough that month-over-month deltas are visible; less than monthly and you'll miss the rebrand cycle (90 days) and lose the chance to react.
Weekly — the practical sweet spot if you care about content iteration. After publishing a new piece, weekly cadence lets you see citation pickup within 2-4 weeks rather than 8-12. Most operating teams running manual tracking land here.
Daily — the cadence you want if you've automated, you have an outcome-bonus contract on visibility lift, or you're in the first 60 days after a category-defining campaign launch. Daily cadence detects engine model rotations and competitor moves the same day they happen. Below that, daily is overkill — you'll burn hours on noise.

When daily becomes mandatory

Two scenarios force daily cadence regardless of operator effort: (1) you've signed a contract — internal OKR, external service-level agreement, or vendor outcome-bonus clause — that pays out on a specific mention-rate or share-of-voice target, in which case the audit trail needs to be daily to be defensible; (2) you've just launched a category-defining campaign or product, and the early 60-day cycle is when most of the ChatGPT citation graph will form for that launch, so weekly cadence will miss the inflection.

Step 5 — Pick a tool (or stay manual, honestly)

Manual tracking earns its hours back when your prompt set is small (≤20 prompts), ChatGPT is the only engine you care about, and your cadence is monthly. Past those thresholds the math breaks — 30 prompts × 4 engines × weekly cadence = 480 individual runs a month, which is no longer a defensible use of an operator's calendar. That's where a tool starts to earn the price.

Six things to look for when you evaluate a ChatGPT visibility tracker (regardless of vendor):

Daily cadence option — not just weekly. Engine behavior drifts faster than weekly snapshots resolve.
Same model the consumer sees — the tool should measure the version of ChatGPT a real buyer experiences (chatgpt.com default model with web search), not a developer-grade API call.
Citation source breakdown — Owned/Editorial/UGC/Competitor tagging is the action layer. A tracker that only reports mention rate without telling you which URLs are doing the work has limited operator value.
Custom prompt sets — a fixed generic prompt set per vendor will not match your category's buyer journey. You need to bring your own prompts.
Raw response export — at minimum the tool should let you download the actual ChatGPT response text. If the tool hides raw responses behind an opaque score, you can't audit, you can't reproduce, you can't trust.
Multi-engine coverage — even if you only care about ChatGPT today, your buyers will use Gemini and Perplexity within the next 12 months. A tracker that locks you into ChatGPT-only will become a re-purchase decision.

What's actually in the category in 2026

Public AEO platforms in the visibility-tracking category as of mid-2026 include Profound, Otterly.ai, AthenaHQ, Peec.ai, and SolCrys. Pricing bands run from free (SolCrys free audit, Goodie free tier) up through enterprise (Profound enterprise, multi-thousand monthly). The category-defining differences are cadence (daily vs weekly), engine coverage (ChatGPT-only vs four-engine), methodology disclosure (transparent — see our visibility measurement methodology — vs opaque), and whether the tool ships an action layer that turns gap data into content briefs.

Disclosure: this guide is published by SolCrys. The advice in steps 1-4 is tool-agnostic — you can run all of it in a Google Sheet. The free audit at app.solcrys.com/audit is one way to get a baseline score in 5 minutes; the best tools writeup covers the rest of the category neutrally.

Step 6 — Close the gaps (the point of the whole exercise)

Tracking that doesn't produce action is theater. Every weekly or monthly tracking cycle should end with a small, ranked list of actions targeted at the specific prompts where you're losing. Three action paths, ordered by typical leverage:

Owned content — publish a new article, FAQ, or landing page that directly answers a prompt where ChatGPT currently mentions competitors but not you. Target the prompt verbatim; structure the answer in passages an LLM can extract (clear H2/H3 + direct answers in the first 1-2 sentences of each section).
Third-party citation seeding — when the citation source breakdown shows ChatGPT is citing Reddit threads, niche editorial, or Quora answers for your category, the move is to seed those surfaces with brand-accurate content (community posts, expert AMAs, founder bylines in niche trade publications). Owning the citation source surface is a slower lever but a harder moat to dislodge.
Hallucination correction — when ChatGPT confidently misstates a fact about you (wrong pricing, wrong founder, wrong category claim), the action is to ship corrections to the sources ChatGPT cited (review-site updates, Wikipedia edits, vendor-listing corrections) so the next index refresh fixes the answer.
Rank each action by (prompt search volume × current loss × content cost to fix). Ship the top 1-3 per cycle. Re-run the tracking set next cycle. Iterate.

Three pitfalls to avoid

Three common mistakes that quietly waste tracking effort:

Too few prompts. Under 10 prompts, the noise floor (one prompt going your way changes mention rate by 10+ percentage points) swallows the signal. Thirty is the working minimum; sixty is comfortable.
Confusing impressions with mentions. "My brand appeared in 12 of 30 ChatGPT answers" is mention rate. "My brand has 12 mentions across 30 answers" can be 12 in one answer and zero in the other 29 — totally different signal. Score per-answer, not aggregate.
Ignoring citation sources because they're not on your domain. The instinct is to focus on owned content. The data routinely shows that 30-60% of citation lift in a category comes from Reddit, niche editorial, and review sites — not the brand's own properties. If you only optimize owned content, you'll move the needle slower than competitors who also seed the third-party surfaces.

Manual vs tool — a 60-second decision matrix

Use this to decide whether to keep tracking manually or to subscribe to a tool:

Situation	Manual is fine	A tool earns its price
Prompt set size	≤20 prompts	30+ prompts
Engines tracked	ChatGPT only	ChatGPT + 1-3 others
Cadence	Monthly	Weekly or daily
Operator hours/month available	≥4 hours predictable	Hours are not the constraint, focus is
Need to alert on drops	No — periodic review fine	Yes — you want a Slack ping the day a drop happens
Outcome-bonus contract	No	Yes — the audit trail must be defensible

Get started — a 5-minute baseline before you commit to a methodology

If you've never run a structured ChatGPT visibility check on your brand, the cheapest first move is a one-time baseline. The SolCrys free audit runs 10 buyer-intent prompts through ChatGPT and returns mention rate, competitor share of voice, and the top citation sources in 5 minutes — no credit card, no setup. Use it as your zero-week reference; if the numbers are worth tracking forward, build the manual 30-prompt set in step 1 or upgrade to the Brand plan for daily multi-engine tracking. Run the free audit →

FAQ

How often should I track ChatGPT visibility?

Monthly is the minimum useful cadence for manual tracking — engine output drifts slowly enough that month-over-month deltas are visible without burning operator hours on noise. Weekly is the sweet spot once you start iterating on content. Daily becomes mandatory if you have an outcome-bonus contract on mention rate or if you're in the first 60 days after a category-defining campaign launch.

Can I track ChatGPT visibility manually?

Yes, and you should at least once even if you plan to automate. Run your prompt set through chatgpt.com (consumer surface, not the API) in fresh conversations, log the full response text and any cited sources into a spreadsheet, and score each one for mention, share of voice, and citation source. Plan ~90 minutes for the first 30-prompt run; subsequent runs go faster once your template is built.

How many prompts should be in my ChatGPT tracking set?

Thirty is the working minimum — split 10/10/10 across discovery, comparison, and decision intents. Under 10 prompts, the noise floor swallows the signal. Sixty prompts gives you the cleanest signal but takes more time to maintain; most operators land between 30 and 50.

What metrics should I track for AI search visibility?

Three: mention rate (out of N prompts, how often does ChatGPT name your brand?), share of voice (your mention count divided by total brand mentions across the same prompt set), and citation sources (which URLs ChatGPT cited, tagged into Owned/Editorial/UGC/Competitor buckets). Mention rate is the AI-search equivalent of impressions; share of voice travels across time even when output style shifts; citation sources tell you where to invest content effort.

What's the difference between mention rate and share of voice?

Mention rate is absolute — out of N prompts, the percentage where your brand was named at all. Share of voice is relative — of all brand mentions across the same prompt set (yours + competitors), the percentage that are yours. Mention rate moves with prompt selection and engine drift; share of voice is more comparable across time because it normalizes for absolute output volume.

Why are citation sources important?

Citation sources tell you *where* ChatGPT formed its answer. If 40% of your category's citations come from Reddit threads and 20% come from Forbes, then publishing more content on your own blog has structurally lower leverage than seeding Reddit and pitching Forbes. Citation source breakdown is what turns tracking from a vanity dashboard into an action plan.

When should I move from manual tracking to a tool?

Three signals each on their own justify a tool: (1) your prompt set has grown past 30 prompts, (2) you want to track Gemini, Perplexity, or AI Overviews alongside ChatGPT, (3) you've moved from monthly to weekly or daily cadence. Once any two of those apply, the math on operator hours stops working — a $29-$399/month tool will save more time than it costs.

How is ChatGPT visibility tracking different from SEO?

Traditional SEO measures your URLs' rank on Google's keyword-results page. The unit of analysis is *your URL* and its SERP position. ChatGPT visibility tracking measures whether AI engines mention your *brand name* by name in an answer — a different signal that doesn't depend on URL rank. You can rank #1 on Google and never get mentioned by ChatGPT for the same query. As buyers shift to AI answer engines, mention rate becomes the new share-of-voice metric.

Related guides

AI Search Tools

Turn AI answer gaps into governed marketing execution.

Start free with a ChatGPT visibility read, then add multi-engine tracking, Corporate Context governance, and the action-to-result loop when you are ready.

Start Free