Citation share — the percentage of a tracked query set where a brand earns a citation in an AI engine's answer — is the cleanest single number for AI search visibility we've found. It compresses a complex behavior into a metric that can be tracked over time, compared to competitors, and acted on. It is also a metric that's only as trustworthy as the methodology behind it.
Most "AI visibility" dashboards we evaluate fail on the methodology. They sample the engines inconsistently, they use query sets that drift, they parse citations in ways that miss substantial mentions, and they aggregate across engines in ways that obscure the engine-specific behavior that actually matters. The number they produce may look stable; the stability is a measurement artifact.
Here's the methodology Workspace runs, the engineering choices behind it, and where off-the-shelf alternatives fit for teams that don't have a Workspace at their disposal.
What citation share has to handle to be trustworthy
Six things break naive citation share methodology:
- Query set drift. If the query set changes between measurements, the metric isn't comparing apples to apples.
- Engine non-determinism. Asking the same engine the same question twice often returns different answers. A single sample is noisy; many samples per query are needed.
- Citation parsing inconsistency. Different engines surface citations in different structural forms — clickable footnotes, inline links, "Sources" panels, plain-text mentions. The parser has to recognize all of them and not double-count.
- Brand entity ambiguity. "Apple" is a fruit, a company, and a film studio. The parser has to distinguish "this is OUR brand" from "this is a different entity with the same name."
- Engine version drift. ChatGPT-4o, ChatGPT-o1, ChatGPT-o3, Perplexity Sonar, Perplexity Sonar Pro — versions matter and the measurement should record which one produced which answer.
- Time-of-day and geography sampling. Engines route queries through different backends depending on time and geography; consistent sampling windows matter.
The methodology below addresses each.
Query set construction
Every client engagement starts with a tracked query set. The set is built from:
- Real customer questions drawn from sales, support, and onboarding transcripts where available.
- Search data from Google Search Console and any first-party search tooling, filtered to the question-form queries that match AEO intent.
- Competitive queries the named competitors are likely to be cited for.
- Entity queries that probe the engines' understanding of the brand directly ("what does [brand] do," "is [brand] reliable," "what makes [brand] different").
The set sizes range from roughly 50 queries for a focused single-category brand to several hundred for a multi-category or multi-region brand. Once set, the query list is versioned. Adding queries is allowed (with explicit version notation); removing queries breaks comparability.
Engine sampling
For each engine in scope — typically some subset of ChatGPT, Perplexity, Claude, Gemini, and Google AI Overviews — Workspace runs the query set on a defined cadence:
- Daily for high-volatility tracking on a smaller priority subset.
- Weekly for the full set on standard engagements.
- Multiple samples per query per engine to handle non-determinism. We typically run three samples and take the citation behavior across the set rather than relying on a single sample.
Each sample records the engine version, the timestamp, the geography of the sampling node, and the raw response text along with any structured citation payload the engine provides.
Citation parsing
Parsing is where most off-the-shelf tooling fails. The parser has to:
- Recognize structured citations when the engine provides them (Perplexity's source panel, Claude's footnotes, ChatGPT's link cards).
- Recognize inline mentions when the engine cites a brand in the answer body without a clickable source.
- Resolve URLs to brand entities —
example.com,m.example.com,example.com.au,subdomain.example.comall resolve to the same brand for citation accounting. - Distinguish citations to the brand's own pages from citations to third-party pages about the brand. Both are relevant, but they're different metrics — own-source citation share vs. total mention rate.
- De-duplicate when the same brand is cited multiple times in one answer (count once per query, not once per citation).
For citation share specifically, the count is "of the queries in the set, how many returned an answer that cited the brand (at least once) as a source." That's a clean per-query binary that can be averaged across the set into the citation share percentage.
Brand entity disambiguation
This is the part most teams underestimate. Brand entity disambiguation requires:
- A canonical list of the brand's owned domains and subdomains.
- A canonical list of the brand's social profiles, knowledge graph identifiers (Wikidata QID, Google Knowledge Graph ID), and any "sameAs" properties from the brand's own
Organizationschema. - A disambiguation rule for ambiguous names — explicit rules about how to treat "Apple," "Patagonia," "Shopify," or any brand name that could refer to multiple entities.
Without disambiguation, the metric pollutes — false positives from same-name entities and false negatives from missing the brand's own subdomains.
Competitor handling
For every client, Workspace tracks a named competitor set on the same query set on the same engines on the same cadence. The competitor share data is what makes the client's share interpretable: 35% citation share is meaningless without knowing whether the top competitor is at 20% or 70%.
Competitor disambiguation uses the same machinery as brand disambiguation — canonical domains, sameAs identifiers, explicit ambiguity rules.
Reporting cadence
Workspace produces:
- Weekly snapshots of citation share per engine, with delta vs. prior week.
- Monthly reports with competitor comparison, first-mention position trends, and entity accuracy spot-checks.
- Quarterly strategic reviews with the content-roadmap implications.
The reports are not separate from the platform — they're views over the same continuously-updated data. The client sees what the strategist sees.
We document the broader Workspace context — what we built and why — on Why We Built Workspace. The methodology page is citation share monitoring.
Where off-the-shelf alternatives fit
Several tools in the market offer some version of AI citation tracking — Profound, Otterly, Peec AI, AthenaHQ, BrandRank, AI Visibility, and a growing list of others. They differ in engine coverage, sampling cadence, parsing fidelity, and pricing.
For teams that don't have an agency partner running citation share in a managed way, the off-the-shelf options are a reasonable starting point. The two questions to ask of any of them:
- What's the engine coverage and sampling cadence? Does the tool actually sample the engines you care about, frequently enough to catch the volatility?
- Can you bring your own query set, and is the query set versioned? Tools that auto-generate queries from your domain are convenient but lose comparability when their generation changes.
The trap to avoid: tools that show a single aggregated "AI visibility score" with no engine breakout, no competitor comparison, and no query-set transparency. Those numbers are not actionable. You can't move a metric you can't decompose.
How the data feeds action
Citation share is not an end. It's the highest-level signal in a system whose purpose is to surface action.
The action pipeline:
- Citation share by engine identifies which engines are gaining or losing the brand.
- First-mention position identifies the queries where the brand is cited but not prominently.
- Mention rate vs. citation rate identifies the queries where the brand is mentioned but not cited (the source-document fix opportunity).
- Competitor diff identifies the content gaps the editorial roadmap needs to address.
- Entity accuracy spot-checks identify the hallucinations and outdated information the source-correction work needs to address.
That pipeline is what makes citation share useful. Without it, the metric is interesting trivia. With it, the metric is the dashboard of an active operating program.
The broader landscape view — what citation share looks like across the brands and categories we've tracked — is in /reports/state-of-ai-shopping-citations-2026. The AEO services view is on AEO services; the broader AI SEO frame is on AI SEO services; the audit-side view is on AEO audit.
If you want to talk through citation share monitoring for your brand specifically — whether running through Workspace on an engagement basis, or working out what tooling you can set up internally — we'd like to hear from you.
