The ChatGPT Playbook

ChatGPT is the slowest of the eight engines to surface you — and the most strategically important. It indexes through a different door than Perplexity. Here's how to walk in.

10 min read · Updated June 2026 · For B2B SaaS, marketplace, and DTC brands

What's in this guide

The two doors into ChatGPT
Pattern 1 — Become a Wikipedia / Crunchbase entity
Pattern 2 — Open robots.txt to OAI-SearchBot + GPTBot
Pattern 3 — Earn co-mentions on high-trust repositories
What doesn't work
The 90-day checklist

The two doors into ChatGPT

Most AEO advice treats ChatGPT as a single engine. It's actually two retrieval modes layered together, and they reward different work.

Door 1: training data. The base model knows what it knows because someone trained it on a snapshot of the public internet. If your brand isn't in the snapshot, ChatGPT can't name you in offline-mode answers — no amount of new content fixes a base model that's already trained. Inclusion happens at training time, and training cycles run roughly every 12-18 months.

Door 2: live retrieval. When ChatGPT browses (the default for paid users in 2026), it makes a real search call via OAI-SearchBot and uses fresh web results. This is the door you can work on today.

Most AEO Owl users see ChatGPT as their lowest-scoring engine precisely because they're stuck behind door 1 — the base model has no record of them — and they haven't optimized for door 2. The good news: door 2 is unblocking-grade work, and improvements compound into door 1 over time.

Pattern 1 — Become a Wikipedia / Crunchbase entity

ChatGPT's training pipeline weights certain sources extraordinarily heavily for entity grounding: Wikipedia, Crunchbase, IMDB, official corporate sites, and major news archives. If your company has a Wikipedia article, a Crunchbase profile with funding history, and a press archive of named coverage, ChatGPT knows who you are. If it has none of those, you don't exist to the base model.

The work

Get on Crunchbase first (lowest bar). Complete your profile: founding date, employee count, named founders, funding rounds (even bootstrapped — say so), product description, named customers. Update quarterly.
Push for Wikipedia notability. You don't write your own article (that gets reverted). You earn it by accumulating press coverage in independent outlets that crosses Wikipedia's notability threshold for businesses. Typical recipe: 3-5 substantive mentions in tier-1 trade press over 6 months, then propose a draft via Wikipedia's Articles for Creation process.
Get on niche reference databases for your category. For B2B SaaS: G2, Capterra, TrustRadius, BuiltWith. For DTC: ProductHunt (if you launched), GoodOnYou (for apparel), etc. ChatGPT's training cycle scrapes these.

Why this matters more than blog posts

When ChatGPT generates an answer offline, it samples from its training corpus weighted by source authority. A blog post on your own domain might be in the training data, but it carries less weight than a Wikipedia paragraph or a Crunchbase fact-line. The structured, third-party sources are where ChatGPT actually pulls "facts" from at generation time.

Pattern 2 — Open robots.txt to OAI-SearchBot + GPTBot

OpenAI publishes two separate crawler user-agents and most sites block one or both by accident. Both must be allowed for ChatGPT to find you through door 2 (live retrieval).

GPTBot — the crawler that fetches pages for training the next model version.
OAI-SearchBot — the crawler that ChatGPT calls in real-time when browsing.
ChatGPT-User — the user-agent used when a user explicitly asks ChatGPT to fetch a URL.

Your robots.txt should explicitly allow all three:

User-agent: GPTBot
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: ChatGPT-User
Allow: /

Verify with curl:

curl -A "GPTBot/1.0" https://yoursite.com/
curl -A "OAI-SearchBot/1.0" https://yoursite.com/

Both must return HTTP 200 with the actual content. If you're behind Cloudflare, check Security → Bots → AI Crawlers and ensure the "Block AI Bots" super-toggle is OFF.

Pattern 3 — Earn co-mentions on high-trust repositories

ChatGPT's training corpus over-indexes on a handful of platforms because they have structured, high-signal user contributions. Reddit, Stack Overflow, Hacker News, Quora, and LinkedIn are the big five for B2B. The pattern that wins is being discussed on these platforms — not posting from your brand account, but earning organic mentions from third parties.

Tactical plays:

Reddit: Have the founder participate in your category subreddit. Disclose role. Answer questions thoughtfully — don't pitch. Three substantive comments per week over a quarter is enough to seed multiple "best X" Reddit threads where your brand gets named by other commenters.
Stack Overflow / GitHub: If your product has any developer surface, get your engineers participating with code answers. Cross-link to your docs only when relevant.
LinkedIn: Founder publishes weekly. Aim for posts that generate 20+ thoughtful comments — the discussion is what gets scraped, not the post itself.
HARO / Qwoted / Featured: Founder responds to 3-5 journalist queries per week. Quotes that make it into Forbes / Inc / TechCrunch articles compound across both doors.

What doesn't work

SEO keyword stuffing. ChatGPT's training reads semantically; repeating a phrase doesn't improve recall.
Press releases on PR wires. ChatGPT down-weights syndicated PR. The placement has to be in actual editorial.
Schema markup alone. JSON-LD is critical for Perplexity and AI Overviews, but ChatGPT in browse mode doesn't lean on it as heavily as it leans on third-party entity confirmation.
Burst content. Publishing 30 blog posts in a month and stopping signals to training scrapers that the domain is low-effort. Steady cadence over years is what compounds.

The 90-day checklist

ChatGPT is the longest-horizon engine in the audit. You won't see citation rate move in 30 days; expect 60-90.

Week 1: Open robots.txt to GPTBot + OAI-SearchBot + ChatGPT-User. Verify with curl.
Week 1: Complete Crunchbase profile end to end. Add all funding history if applicable.
Week 1-2: Founder begins participating on Reddit (your category subreddit) + LinkedIn (one substantive post/week).
Week 2-12: Begin HARO/Qwoted response cadence. 3-5 queries per week. Track which placements land.
Week 4-12: Identify and pitch the 5 tier-1 trade press writers covering your category. Aim for 1-2 placements per month.
Week 8: If trade press placements are landing, begin drafting Wikipedia article via Articles for Creation. Cite the placements as notability evidence.
Week 12: Re-run the audit. ChatGPT citation rate should begin moving from "always 0" to "1-2 of 10 questions". Full pickup compounds across the next 2-3 audit cycles as the training-data signal accrues.

ChatGPT rewards presence over time, not bursts. Pick the cadence you can hold for a year and run it.

Back to Engine Performance · All Engine Playbooks