Chapter 02 / 5How AI Search Actually Works: The Real Mechanics Behind ChatGPT and Perplexity Recommendations
AI doesn't know your brand from training — it searches Google. Here's exactly how the query fan-out works, how to see the actual searches ChatGPT runs, what the research says moves citations, and what wastes your time.
On this chapter
- The fundamental mechanic: AI searches the web in real time
- Query fan-out: why your prompt isn't what the AI searches
- How to see exactly what ChatGPT searches (the network tab technique)
- What actually moves AI citations (the research)
- The conversion advantage: why AI traffic is worth pursuing
- The different AI platforms and what they search
- Finding AI-style searches in your own Google Search Console
- Measuring your AI visibility
- Summary: the mechanics in five points
A marketer spent three months optimizing her site specifically for AI visibility. She added structured data. She created an llms.txt file. She chunked her content into "AI-readable" sections based on what she'd read in GEO guides. She paid a consultant $3,000 to audit her "AI readiness."
Her AI citation numbers didn't move.
Meanwhile, a competitor published a comparison page targeting "best [category] for [audience]" with a clean answer-first structure and a keyword in the title. Within six weeks, that page was showing up in ChatGPT recommendations. No schema. No llms.txt. No special AI optimization. Just a well-optimized page that ranked on Google.
The difference between these two outcomes is understanding how AI search actually works — which turns out to be very different from what most GEO guides describe.
The fundamental mechanic: AI searches the web in real time
When you ask a modern AI assistant — ChatGPT, Perplexity, Gemini, Claude — about a brand, a product, or anything current, it does not recall information from its training data. It runs a live web search, reads the results, and quotes what it finds.
This is not speculation. Multiple lines of evidence confirm it:
The nonsense word experiment: A researcher created a page containing a made-up word, submitted it only to Google Search Console, and asked ChatGPT to define it. ChatGPT quoted the page verbatim. Bing, DuckDuckGo, and Yandex — which hadn't indexed the page — found nothing. The conclusion is unambiguous: ChatGPT searches Google's index.
The 70–80% overlap finding: A separate analysis found that ChatGPT's results have a 70–80% overlap with Google's search results. The snippets ChatGPT produces are closely similar to those in Google's index. ChatGPT is not running an independent search; it's reading from Google.
Reddit's trap: Reddit created a test post that could only be crawled by Google — it was inaccessible via any other means. Within hours, Perplexity surfaced the content. Perplexity's business model is, in part, to take Reddit content from Google's search results and feed it to an AI model. Reddit is suing Perplexity over this. The test proved the point: AI search = Google index.
Google's own confirmation: Google published an article called "Optimizing your website for generative AI features" in which it states explicitly that SEO best practices remain relevant because "generative AI features are rooted in core search ranking and quality systems." The AI is reading their index.
The practical implication is enormous: you do not need to understand the internals of any AI model to get cited. You need to rank on Google. Every tactic in this playbook flows from that.
Query fan-out: why your prompt isn't what the AI searches
Here's the subtlety that trips up even sophisticated marketers: AI doesn't search for your prompt. It rewrites your question into a series of different, more specific web searches — a process called query fan-out.
Ask ChatGPT "has the Empire State Building made any changes to its observation deck?" and it might search: "Empire State Building observation deck changes 2025 ticketing." The words are different. The intent is the same, but the specific language the AI uses to find information is its own, not yours.
This matters because optimizing for your prompt's wording and optimizing for the AI's actual search language are different things. If you're trying to appear in ChatGPT answers about your product category, you need to know what language ChatGPT actually searches when it generates those answers — not what language you'd use to describe your category.
How to see exactly what ChatGPT searches (the network tab technique)

This is the single most actionable technique in all of GEO, and almost nobody uses it.
Step-by-step:
- Go to ChatGPT and run a prompt about your category — something like "what's the best [your product type] for [your audience]?"
- After ChatGPT responds, look at the URL bar. Copy the string of characters that appears after
/c/in the URL. - Right-click anywhere on the page and select Inspect (or press F12 to open Developer Tools).
- Click the Network tab at the top of the developer panel.
- In the filter box within the Network tab, paste the string you copied from the URL.
- Refresh the page (press F5 or Ctrl+R).
- You'll see a list of network requests appear. Find the one matching your string — it often appears with orange or highlighted brackets.
- Click on that request, then click the Response tab on the right panel.
- In the Response content, search (Ctrl+F) for the word "queries".
- You'll see the literal web searches ChatGPT executed. These are often completely different from your original prompt.
For Perplexity: Much simpler. Perplexity shows its search steps openly by default. Click "steps completed" under any response to see exactly what it searched for.
What to do with the searches you find:
Use that exact language in:
- Your page title (beginning)
- Your URL slug
- Your H1 (main page headline)
- The first sentence of your content
- H2 subheadings within the page
Run 20–30 prompts about your brand and category. Build a complete map of the language AI uses when searching for information about you. Publish good content in that language. That's the whole GEO strategy — and it's just SEO.
What actually moves AI citations (the research)
An analysis of 1.4 million AI prompts identified the actual ranking factors for AI citation. The results are clear — and significantly different from what most GEO guides emphasize.
What matters most:
1. Semantic similarity — the #1 factor by a significant margin.
How closely does your content match the AI's actual search query? This is the dominant citation factor. Not the number of pages you have, not your schema markup, not your domain authority — how well your content's language matches the language the AI searched with.
The implication: the network-tab technique above is the most valuable tool you have. Know what the AI searches, use that language prominently, and you'll be cited. Ignore that language and add schema instead, and you won't.
2. Conversion pages, not blog posts.
Research by Prompt Watch found that landing pages and product pages get cited in approximately 29% of AI recommendations. Blog posts get cited far less. This is because AI recommendations for commercial queries — "what's the best X for Y" — pull from pages that are built to answer that specific intent, not from pages that discuss the topic generally.
Build conversion pages. Put the answer at the top. Make them the kind of page a ready buyer needs, not the kind of page a curious browser enjoys.
3. Answer-first structure.
AI tools prefer content where the key claim or recommendation appears near the top of the page. This isn't just an AI preference — it's what users want too. A 2–3 sentence summary at the top of a page that immediately gives the main point, followed by supporting detail, is both more likely to be cited and more likely to convert.
The Reddit example: a widely-shared post showed that adding a short TL;DR to the top of articles boosted conversions by 33%. The same structural principle applies to both human readers and AI citation rates.
4. Citing sources and linking out.
A controlled test using a made-up word ("Volandasysik") set up 10 identical pages — identical in every way except five linked out to sources and five didn't. The five that linked out ranked above the five that didn't, consistently. Citing your sources helps both AI citation rates and search rankings. AI tools are more likely to quote a page that demonstrates it has done research.
What does NOT matter (despite what most guides claim):
Google's own documentation on AI features explicitly debunks these myths:
- Schema / structured data: Ahrefs studied 1,885 pages that added schema markup. AI citations "barely moved." Google is dropping support for more schema types each year, including FAQ schema — which ChatGPT still recommends when asked about AI SEO, demonstrating exactly why you shouldn't take AI's SEO advice.
- llms.txt files: None of the major SEO authority sites (Ahrefs, Moz, HubSpot, Semrush) use llms.txt. It has no documented impact on citations.
- Content "chunking" for AI: Google's documentation states there is "no requirement" for this.
- Content freshness: The average page cited by AI is 500 days old. Freshness barely registers as a citation signal.
- Rewriting content specifically for AI: Unnecessary. If your content is well-written and semantically relevant, AI will cite it.
The GEO startup that shut down because its founder realized "there's no such thing as a GEO strategy" had it exactly right. What earns AI citations — quality content, mentions in authoritative publications, genuine expertise, semantic relevance — is the same stuff that's always worked for SEO and PR.
The conversion advantage: why AI traffic is worth pursuing
AI-referred traffic is a small percentage of total search volume — under 1% by most measures. A reasonable question is whether it's worth optimizing for at all.
The data makes the case clearly: data from LendingTree showed that AI-referred traffic converts at 4–5× the rate of standard organic search. The volume is smaller. The quality is dramatically higher.
When an AI recommends your brand, the visitor arrives pre-sold. The AI said "use this" — so they arrive with trust already established. The visitor who clicks your organic result is still evaluating. The visitor the AI sent has already made a decision in your favor.
Additionally, 95.8% of ChatGPT users also use Google. Optimizing for AI visibility and optimizing for Google are not separate tracks — they're the same track, because AI reads Google's index.
The different AI platforms and what they search

ChatGPT: Searches via Bing. This means ranking on Bing is a direct lever for ChatGPT visibility. Connect your site to Bing Webmaster Tools (Bing's equivalent of Google Search Console) and submit your sitemaps there. The same five on-page keyword placements that work for Google work for Bing — and Bing is dramatically less competitive. Many keywords that take months to rank for in Google can be won on Bing in weeks.
Perplexity: Searches through a combination of sources, heavily influenced by Google's index. Shows its searches transparently (click "steps completed"). Reddit content that ranks in Google appears frequently in Perplexity results — the Reddit/Perplexity lawsuit confirmed that Perplexity specifically pulls Reddit content from Google's results.
Gemini (Google): Pulls directly from Google's index. Standard Google SEO applies entirely. Gemini is also integrated with Google Business Profiles for local businesses, which creates a separate opportunity for local SEO (discussed in the local playbook).
Claude: Suspected to use Brave Search's index in addition to other sources. This is the basis of the "own subreddit → Brave → Claude" tactic discussed in Part 4 — but the connection is not confirmed and the tactic is fragile.
Finding AI-style searches in your own Google Search Console
You don't need to run AI prompts to find the language AI uses when searching about your category. It's already in your Search Console data — because AI tools search through Google's index.
7+ word searches (AI-assisted queries):
Go to Search Console → Performance → Search Results → Add Filter → Query → Custom (Regex) and paste:
^(?:\S+\s+){6,}\S+$
This shows queries that are 7 or more words long — the length of natural-language prompts that suggest someone used an AI tool to formulate their search, or is describing a complex situation. These are exactly the searches AI models generate in their query fan-out.
For each keyword this surfaces: find the ranking page, add a dedicated section using that exact language, request indexing.
"I am" searches:
Filter queries containing "I am" — these are personal prompts where users describe themselves to get a recommendation ("I am a freelance designer looking for a client management tool"). These reveal exactly how your audience presents their needs to AI tools. Create content that mirrors this language — "if you're a freelance designer looking for…" — and you'll surface when AI runs similar searches.
Measuring your AI visibility
Three practical measurement tools:
Google Analytics 4 — ai-assistant medium: GA4 now automatically tags visits from ChatGPT, Gemini, and Claude under a medium called ai-assistant. Filter your traffic by this medium to see total AI-referred volume and which pages are earning citations.
GA4 — regex filter for AI sources: Build an Explore report filtered by:
.*chatgpt.*|.*perplexity.*|.*gemini.*|.*copilot.*|.*claude.*
This catches AI traffic that may not be auto-tagged yet.
GA4's "Ask Analytics Advisor": Google Analytics has an AI assistant in beta that understands plain English. Ask: "How much traffic did I get from ChatGPT this month?" and it will pull the answer if your tracking is connected.
Google Search Console's Generative AI performance report: A newer report showing your impressions inside Google's AI features (AI Overviews) over time, broken down by page. Rolling out slowly — check if it's visible in your account under Performance.
Summary: the mechanics in five points
- AI searches Google's index in real time — it doesn't recall your brand from training
- AI doesn't search your prompt — it rewrites it into different queries (query fan-out)
- The network tab technique shows you exactly what ChatGPT searched for any given prompt
- Semantic similarity is the #1 citation factor — match the AI's search language in your content
- Schema, llms.txt, and content chunking are documented non-factors — stop spending time on them
Want the link-building handled for you?
BacklinkBot hand-submits your product to 100+ high-DR directories — the backlink play, done for you.
See plans — from $99