I'm Gen Furukawa, founder of SuperMarketers. I've built AI visibility systems for B2B SaaS companies and run AI search audits across dozens of clients. The pattern is consistent: the companies ChatGPT cites are rarely the ones with the most content. They're the ones whose pages are built for extraction and whose names show up on sources beyond their own domain. This is the five-step procedure I use to close that gap, in the order that matters.
Each step builds on the one before it. Crawlability comes first because if ChatGPT can't reach the page, nothing else counts. Below the procedure, I cover how long it takes and the questions founders ask most. If you want to know what AEO is and why your buyers' research moved to AI, start there first. This page is the execution.
The five steps to get cited by ChatGPT
- Step 1 - Make your site crawlableChatGPT can only cite pages its crawler can reach. Open your robots.txt and confirm you allow GPTBot (and the other AI crawlers, like PerplexityBot and ClaudeBot) instead of blocking them by default. Then publish an llms.txt file at your domain root, a plain-text map that points AI engines to the pages you most want cited. Most companies discover they've been quietly blocking the exact bots they're trying to win. Fix this first. It's the cheapest, highest-impact change on the list, and if the crawler can't see the page, every later step is wasted effort.
- Step 2 - Structure pages for extractionChatGPT lifts answers out of pages. It does not read them the way a person does. So give it passages it can lift cleanly. Lead each page with a direct two-to-three-sentence definition in the first 100 words. Follow it with numbered steps. End with an FAQ block where each answer stands on its own without needing the rest of the page. A 500-word page with a clear definition, numbered steps, and a self-contained FAQ gets cited over a 3,000-word essay with the same facts buried in prose. The format is the signal. Your competitor isn't smarter. Their content is structured for extraction and yours isn't.
- Step 3 - Add schemaSchema is how you confirm, in machine-readable form, what your prose already says. Add the JSON-LD types that fit the page: Organization (who you are, consistently) and FAQPage (so your FAQ answers map to questions an engine recognizes) belong on almost every page, HowTo on step-by-step guides like this one, and DefinedTerm on any page that defines a term. This is not optional polish. It removes the guesswork an engine would otherwise do to figure out what your page is about. Validate it with Google's Rich Results Test before you ship. Schema described in a doc does nothing; it has to be embedded in the page.
- Step 4 - Earn third-party validationThis is the step founders skip, and it's the one that matters most. The majority of the citation signal is off-domain, not on your own site. AI engines weight independent corroboration: analyst coverage, Reddit and Quora threads, guest articles, partner pages, podcast mentions. A company named by three credible independent sources outranks one with a flawless website and nothing else. One practitioner estimate puts most citation drivers off your own domain, and that lines up with what we see in audits. Your pages set up the answer. Third-party mentions are what make the engine trust it. Get named where your buyers already look.
- Step 5 - Measure whether it workedRe-run your target queries once a month and score your citation rate: the percentage of queries where ChatGPT names your company. This is the number that replaces "posts published" as your AEO metric. If you've never baselined it, run a 30-minute AI visibility audit first, then re-measure monthly so you know which of the four fixes above actually moved the needle. What you don't measure, you can't improve, and you'll waste the next quarter guessing.
That's the whole procedure. Five steps, executed in order, on your top 5 to 10 pages. The underlying methodology is the 9-step visibility system we run for clients, but the founder-doable version is exactly this list.
You'll see a claim circulating that "80% of citation drivers are off-domain." Treat that as a practitioner estimate, not a measured law. The directionally safe version is the one that holds up across our audits: the majority of the citation signal lives off your own site, not on it. Your pages get you eligible. Third-party validation gets you cited.
Why this order, specifically
Founders try to do all five at once, get overwhelmed, and do none of them well. The order isn't arbitrary. Steps 1 through 3 make a page citable: reachable, extractable, and machine-parseable. Step 4 makes it trusted: corroborated by sources the engine already weights. Step 5 makes the whole thing a system instead of a one-time project, because AI search visibility compounds only if you keep measuring and closing gaps.
The research supports the emphasis on structure and validation. The GEO study (Aggarwal et al., KDD 2024) measured how structural content changes affect AI engine visibility. Adding citations and authoritative source references improved visibility scores by up to 40% in the benchmark. Structure and attributed evidence outperform volume. That is the entire bet behind this procedure.
How long does it take to get cited?
Most companies that implement these structural changes to their top 5 pages see measurable citation rate improvement within 60-90 days. The crawlability and schema fixes (Steps 1 and 3) can register sooner, as soon as the page is re-crawled. The extraction rebuild (Step 2) shows up as the page gets re-indexed. The slowest signal is off-domain validation (Step 4), because earning credible third-party mentions takes time and isn't fully in your control.
More significant movement on competitive queries takes 3-6 months. The compounding effect, where consistent citations build trust signals that generate more citations, typically becomes visible around the six-month mark. The companies that start now compound that visibility. The ones that wait pay more to catch up later, against competitors who already own the answer.
| Fix | Effort | Time to show up |
|---|---|---|
| Crawlability | Under an hour | Next crawl (days to weeks) |
| Extraction structure | A few hours per page | Re-index (2-6 weeks) |
| Schema | Under an hour per page | Next crawl (days to weeks) |
| Third-party validation | Ongoing | 3-6 months |
| Measurement | 30 minutes a month | Immediate baseline |