SolCrys Logo

Strategy & Positioning

The Pages You Were About to Kill Are Your New Content Template

When AI answer boxes absorbed your informational traffic, they did not penalize your pages uniformly; they replaced the ones that existed to be the destination and left the ones a model has to cite, so a single binary test on each page tells you which to kill or merge and which to study as the template for everything you build next.

Updated

Questions this guide answers

  • which pages survive AI Overviews
  • how to know if an AI answer box replaced my page
  • what content gets cited by AI
  • should I delete how-to pages for AI search
  • AI-era content audit

The inversion in one line: your audit's worst pages are not the wound, they are the map

Open the content audit. The pages bleeding traffic are not a random sample of what you publish — they are a specific, legible class, and that legibility is the whole point. The mechanics of why are settled: zero-click answer boxes absorbed the informational query, so the page that used to receive the click no longer does (the full recovery playbook for that covers the mechanics). Treat that as covered and move on, because re-litigating it is the trap.

The non-obvious move is to read the audit as an inversion rather than a verdict. The pages that collapsed were not penalized for quality; they were absorbed because they did a job the box now does itself. The pages that held were not lucky; they did a job the box cannot do. That is a reallocation of attention, not a punishment — which means the grim half of your audit is not a list of failures to mourn. It is a brief. The losers tell you what to stop making. The survivors tell you, line by line, what to make more of.

So the rest of this page is not about saving the absorbed pages. It is about reading the survivors as a template, and turning the audit itself into the first step of a loop you run on a schedule rather than a one-time cleanup.

The one-question page test: could an AI answer box replace this page outright?

Run every informational page through a single binary question: could an AI answer box replace this page outright? Not summarize it, not link to it — replace it, so that a reader who got the box would have no reason to arrive.

A yes means the page exists to be the destination. It defines a term, lists the steps, states the formula, recaps the well-known. All of that is exactly what the box now generates on demand from a hundred other pages that said the same thing. The page was the destination, and the destination is what the box already became. A yes is not a quality judgment; some of these pages are well written. It is a redundancy judgment.

A no means the page contains something the box cannot stand in for: a position it would have to attribute, a comparison it would have to reproduce, a number only you measured, a judgment call a model will not make on its own behalf. The box can quote this page, but it cannot be this page. That asymmetry — citable but not replaceable — is the entire survivor signal, and the test is binary on purpose so it generates a build list, not just a kill list.

The test is posed to produce, not only to prune. Each no is a worked example of a page worth more of. Each yes is either a candidate for the merge pile or a prompt: what would it take to turn this destination page into a page the box has to cite?

Kill, merge, build: a page-portfolio triage

Point the keep / rewrite / archive / add muscle at pages, not prompts. The output of the one-question test sorts cleanly into four moves, and the discipline is to act on the whole portfolio rather than chase one URL.

The yeses split between archive and merge. A thin, fully-absorbed definition page that nothing links to is an archive: removing it concentrates authority on the pages that survive. Three overlapping how-to pages that each got absorbed are a merge: fold them into one page that adds the thing the box cannot generate — your own result from doing it, the trade-off you hit, the case where the standard advice fails.

The noes are the template, not the to-do list. Before you write a single new brief, study them. What do your surviving pages share — a stated take, a side-by-side, a decision recommendation, a dataset? That shared shape is the brief for everything in the add column. You are not inventing a new content strategy; you are reverse-engineering the one your own survivors already prove.

Page-test resultMoveWhy
Yes — thin, absorbed, unlinkedArchiveThe box replaced it; removing it concentrates authority on pages that survive.
Yes — several overlapping absorbed pagesMergeFold into one page and add what the box cannot generate (your result, the trade-off, the failure case).
No — a take, comparison, decision call, or first-hand dataKeep and deepenThis is a survivor; sharpen the citable element and keep it current.
Gap — a no-shaped page you have not builtBuildUse the survivors as the template; brief the page the box has to cite, not the one it replaced.

Why the survivors survive: they exist to be cited, not to be the destination

The pages that held a line through the answer-box transition share one property: they exist to be cited, not to be the destination. A model assembles its answer from sources, and it reaches for a source when the source supplies something the model cannot generate from consensus alone.

Four things qualify. A real take — a stated position with a point of view, not a hedged survey of all views. A comparison — a side-by-side the model would otherwise have to reconstruct, which is expensive enough that citing yours is easier. A decision call — an actual recommendation, the named choice rather than 'it depends.' And first-hand data — a number you measured that exists nowhere else, so the model cannot regenerate it and must attribute it. Our breakdown of how to build each of these into a page on purpose walks through all four.

First-hand data is the strongest of the four because it is the one a model structurally cannot fake. A take can be paraphrased; a comparison can be re-derived; but a measurement only you ran has no substitute in the training distribution, so attribution is the only honest path to it. This is why production is cheap and trust is scarce holds in the answer era: when generation is free, the scarce input is the thing that had to be observed rather than written, and that is what survives.

Why it is hard, not automatic: a worked example in three failure modes

Survivor pages get cited; they are not handed citations. The cleanest way to see how hard it is — and how unevenly citations land — is to walk one illustrative example through the three ways a page can fail to earn the citation it deserves. Picture a mid-market cloud data-warehouse vendor: a real product with real docs, sitting in a category whose answer boxes are owned by the well-known leaders. The pattern below is a teaching example, not a measured readout, but it mirrors what the cited layer actually looks like.

Mode one is absent. Ask an engine an unbranded buyer prompt — 'best data warehouse for real-time analytics,' 'cheapest cloud data warehouse for a startup,' 'best warehouse for a small data team' — and the answer reaches for the names everyone reaches for: Snowflake, Databricks, BigQuery, Redshift. The mid-market vendor is simply never named on the large majority of those unbranded prompts. Not ranked low — absent. Its informational pages exist, but they exist to be the destination, so the box generated the answer from the consensus pages and never had to cite them.

Mode two is mis-described. On a narrower comparison prompt — 'Snowflake vs [the vendor] for streaming' — the vendor does get named, because a head-to-head is a shape the model has to attribute. But the description is unstable: one engine renders it flat and generic the same week another renders it accurately and positive. That instability is the tell. The page that anchors the comparison is thin enough that each engine reconstructs the vendor differently, instead of one citable side-by-side pinning the description down.

Mode three is out-competed. On the prompts where the vendor does appear, a leader is the primary recommendation and the vendor sits near the bottom of the share of voice. The fix is not to shout louder; it is to read which sources the engine cited when it recommended the leader. Those cited sources — the community threads, the editorial comparisons, the decision-call posts — are the survivor shapes the box reached for. They are the template. You out-compete by building the page the box has to cite, in the shape it already rewards, not by adding another page it can absorb.

What the cited layer is actually made of

Step back from any single brand and look at the composition of the cited layer itself, because that is what the failure modes are reacting to. In a category like cloud data warehousing, AI answers do not draw their sources from one dominant site. Citations spread across a long tail of domains, and no single source owns a large share of the pool — the most-cited source holds a sliver and the field falls away fast from there.

The shapes that dominate that pool are exactly the survivor shapes: community forums where practitioners argue trade-offs, reference and editorial pages, and head-to-head comparison content. Competitor and category blogs sit a tier down, and thin owned how-to pages — across every brand — are the thinnest slice of all. That is the structural reason volume does not buy citations: the box is not assembling answers from definition pages, it is assembling them from the take-comparison-decision-data layer.

For the dated, measured breakdowns of this pattern, the canonical references are the owned-vs-earned baseline and the measurement methodology; the GEO study of what gets cited covers the experimental side of the same question.

The lesson is blunt: volume of informational pages does not buy citations. A thousand absorbed how-to pages and one cited comparison page are not on the same curve. The cited layer rewards the survivor shape, and it does so unevenly, prompt by prompt — which is why citation is consensus, not authority frames it as something earned against a contested field rather than granted by output.

Turn the audit into the loop: measure, execute, verify

The one-question test is not a one-time cleanup; it is the measure step of a loop you run on a schedule. The audit measures which pages the box replaced and which it has to cite. The survivor template is what you execute against — new pages and merged pages built in the shape your survivors already prove. And the verify step is the part most teams skip: re-check citation on your actual prompt set after the work ships.

Verify matters because a single reading lies. Run the same prompt set on consecutive days and a brand's overall mention rate can swing several points up and back down with no real trend — one run is one draw, not a result. A page that looks newly cited on Monday may simply have caught a favorable run. The honest signal is whether it holds citation across runs, on the prompts you care about, which is why the prompt set is a maintained asset rather than a fixed list — see the AEO prompt-set lifecycle.

Closed, the loop is the discipline: measure the portfolio with the page test, execute the survivor template through governed, human-approved changes, verify re-citation on the prompt set, then read the next audit as the next brief. Run it once and you get a snapshot; run it on a schedule and the audit stops being a post-mortem and becomes a content pipeline.

The takeaway: brief the pages the box has to cite

Stop briefing the pages the box replaced. Every additional definition page, step list, or term explainer is a page you are building to be absorbed — work the answer box will happily do for free, sourced from everyone but you.

Brief the pages it has to cite instead. Take a position. Run the comparison. Make the call. Publish the number only you measured. Those are the survivors, and your own audit already named them for you — the worst pages are the map to the best ones.

SolCrys runs that loop as governed AEO execution: measure which pages got absorbed, diagnose the survivor shapes worth scaling, execute the template through human-approved changes, and verify re-citation on your prompt set rather than trusting a single run. You can start with the measure step — point it at your prompts and see which of your pages the box has to cite — free, no credit card, at app.solcrys.com/audit. Start Free.

Sources

FAQ

How do I know if an AI answer box replaced my page?

Run one binary test: could an answer box replace the page outright — not summarize or link to it, but stand in for it so a reader who saw the box has no reason to arrive? If yes, the page existed to be the destination, and the destination is what the box became. Pages that define a term, list known steps, or recap consensus almost always fail this test. The confirming signal is in your analytics: informational pages that lost clicks without losing rankings were absorbed, not penalized.

Should I delete my how-to pages?

Not reflexively. Run each through the page test. A thin how-to page that was fully absorbed and nothing links to is an archive candidate — removing it concentrates authority on pages that survive. Several overlapping how-to pages are a merge: fold them into one and add the thing the box cannot generate, such as your own result from doing it, the trade-off you hit, or the case where the standard advice fails. A how-to page that already carries first-hand data or a real decision call is a survivor — keep and deepen it.

What does a page that gets cited by AI look like?

It exists to be cited, not to be the destination. Four shapes qualify: a real take (a stated position, not a hedged survey), a comparison (a side-by-side the model would otherwise have to rebuild), a decision call (a named recommendation, not 'it depends'), and first-hand data (a number you measured that exists nowhere else). First-hand data is the strongest because a model structurally cannot regenerate it and must attribute it. The common thread: the box can quote the page but cannot be the page.

Does publishing more content help AI visibility?

Volume of informational pages does not buy citations. Picture a crowded category like cloud data warehousing: AI answers draw on a long tail of domains with no single source owning a large share, and the cited layer is dominated by takes, comparisons, and community and editorial content — not how-to pages. A mid-market vendor with a large library of informational pages is often never named on the unbranded buyer prompts at all, while the named leaders get the primary recommendation. A large library of absorbed pages and one cited comparison page are not on the same curve; the cited layer rewards the survivor shape, prompt by prompt.

How is this different from a normal content audit?

A normal audit reads low-traffic pages as failures to fix or cut. This reads them as a map: the pages the answer box absorbed tell you what to stop making, and the pages that survived tell you what to make more of. It also closes a loop instead of running once — measure the portfolio with the page test, execute the survivor template through human-approved changes, then verify re-citation on your actual prompt set, since a single run is one draw and not a result. The audit stops being a post-mortem and becomes a content pipeline.

Related guides

Strategy & Positioning

Production Is Cheap. Trust Is Scarce.

AI made producing brand content nearly free, but made being cited, recommended, and trusted by AI engines radically more scarce and concentrated. A field note from SolCrys Co-Founder & CPO Eason Wang on what changes when production stops being the marketing bottleneck — and what the new substrate is.

Citation & Source Influence

Owned, Earned, and Community Sources in AI Answers: A 3-Layer Strategy

AI engines cite three source layers — owned (your site), earned (PR/editorial), and community (Reddit/G2/forums). In our own data, owned is only ~1.6% of citations yet still a top-10 source. Why third parties get you into the answer, why your own site still matters, and how to balance the three.

How SolCrys Works

Don't delete 0-visibility prompts: a four-state lifecycle for AEO prompt sets

Most teams delete prompts that show 0 visibility. Eason Wang, SolCrys CPO, on why that's the wrong heuristic — a 0-visibility prompt with real buyer intent is a gap to fix, not a prompt to remove — and the four-state lifecycle (Keep + Act, Rewrite, Archive, Add) we ship to enforce it.

How SolCrys Works

AI Visibility Measurement Methodology

How we capture your AI visibility data across supported engines, with each response traceable to a prompt, engine, capture method, available model or surface signal, and timestamp. Consumer-surface and retail-assistant validation are scoped where technically reliable.

Citation & Source Influence

Schema Markup for AI Search: What Structured Data Actually Does (and Where It Stops)

Schema markup helps AI engines parse and classify your brand, and makes you eligible for richer treatment. It does not, on its own, get you cited or trusted. The honest, complete guide to structured data for AEO: what it does, what it doesn't, which types matter, and how to ship it so it moves the answer.

Free AI visibility audit

Find out where your brand is missing, miscited, or misrepresented.

SolCrys maps high-intent prompts to mentions, citations, answer accuracy, and content gaps so your team can prioritize the next pages to ship.

Get a free audit