New Data: Top Factors Influencing ChatGPT Citations via @sejournal, @MattGSouthern

SE Ranking analyzed 129,000 unique domains across 216,524 pages in 20 niches to identify which factors correlate with ChatGPT citations.

The number of referring domains ranked as the single strongest predictor of citation likelihood.

What The Data Says

Backlinks And Trust Signals

Link diversity showed the clearest correlation with citations. Sites with up to 2,500 referring domains averaged 1.6 to 1.8 citations. Those with over 350,000 referring domains averaged 8.4 citations.

The researchers identified a threshold effect at 32,000 referring domains. At that point, citations nearly doubled from 2.9 to 5.6.

Domain Trust scores followed a similar pattern. Sites with Domain Trust below 43 averaged 1.6 citations. The benefits accelerated significantly at the top end: sites scoring 91–96 averaged 6 citations, while those scoring 97–100 averaged 8.4.

Page Trust mattered less than domain-level signals. Any page with a Page Trust score of 28 or above received roughly the same citation rate (8.3 average), suggesting ChatGPT weighs overall domain authority more heavily than individual page metrics .

One notable finding: .gov and .edu domains didn’t automatically outperform commercial sites. Government and educational domains averaged 3.2 citations, compared to 4.0 for sites without trusted zone designations.

The authors wrote:

“What ultimately matters is not the domain name itself, but the quality of the content and the value it provides.”

Traffic & Google Rankings

Domain traffic ranked as the second most important factor, though the correlation only appeared at high traffic levels.

Sites under 190,000 monthly visitors averaged 2 to 2.9 citations regardless of exact traffic volume. A site receiving 20 organic visitors performed similarly to one receiving 20,000.

Only after crossing 190,000 monthly visitors did traffic correlate with increased citations. Domains with over 10 million visitors averaged 8.5 citations.

Homepage traffic specifically mattered. Sites with at least 7,900 organic visitors to their main page showed the highest citation rates.

Average Google ranking position also tracked with ChatGPT citations. Pages ranking between positions 1 and 45 averaged 5 citations. Those ranking 64 to 75 averaged 3.1.

The authors noted:

“While this doesn’t prove that ChatGPT relies on Google’s index, it suggests both systems evaluate authority and content quality similarly.”

Content Depth & Structure

Content length showed consistent correlation. Articles under 800 words averaged 3.2 citations. Those over 2,900 words averaged 5.1.

Structure mattered beyond raw word count. Pages with section lengths of 120 to 180 words between headings performed best, averaging 4.6 citations. Extremely short sections under 50 words averaged 2.7 citations.

Pages with expert quotes averaged 4.1 citations versus 2.4 for those without. Content with 19 or more statistical data points averaged 5.4 citations, compared to 2.8 for pages with minimal data.

Content freshness produced one of the clearer findings. Pages updated within three months averaged 6 citations. Outdated content averaged 3.6.

Surprisingly, the raw data showed that pages with FAQ sections actually received fewer citations (3.8) than those without (4.1). However, the researchers noted that their predictive model viewed the absence of an FAQ section as a negative signal. They suggest this discrepancy exists because FAQs often appear on simpler support pages that naturally earn fewer citations.

The report also found that using question-style headings (e.g., as H1s or H2s) underperformed straightforward headings, earning 3.4 citations versus 4.3. This contradicts standard voice search optimization advice, suggesting AI models may prefer direct topical labeling over question formats.

Social Signals & Review Platforms

Brand mentions on discussion platforms showed strong correlation with citations.

Domains with minimal Quora presence (up to 33 mentions) averaged 1.7 citations. Heavy Quora presence (6.6 million mentions) corresponded to 7.0 citations.

Reddit showed similar patterns. Domains with over 10 million mentions averaged 7 citations, compared to 1.8 for those with minimal activity.

The authors positioned this as particularly relevant for smaller sites:

“For smaller, less-established websites, engaging on Quora and Reddit offers a way to build authority and earn trust from ChatGPT, similar to what larger domains achieve through backlinks and high traffic.”

Presence on review platforms like Trustpilot, G2, Capterra, Sitejabber, and Yelp also correlated with increased citations. Domains listed on multiple review platforms earned 4.6 to 6.3 citations on average. Those absent from such platforms averaged 1.8.

Technical Performance

Page speed metrics correlated with citation likelihood.

Pages with First Contentful Paint under 0.4 seconds averaged 6.7 citations. Slower pages (over 1.13 seconds) averaged 2.1.

Speed Index showed similar patterns. Sites with indices below 1.14 seconds performed reliably well. Those above 2.2 seconds experienced steep decline.

One counterintuitive finding: pages with the fastest Interaction to Next Paint scores (under 0.4 seconds) actually received fewer citations (1.6 average) than those with moderate INP scores (0.8 to 1.0 seconds, averaging 4.5 citations). The researchers suggested extremely simple or static pages may not signal the depth ChatGPT looks for in authoritative sources.

URL & Title Optimization

The report found that broad, topic-describing URLs outperformed keyword-optimized ones.

Pages with low semantic relevance between URL and target keyword (0.00 to 0.57 range) averaged 6.4 citations. Those with highest semantic relevance (0.84 to 1.00) averaged only 2.7 citations.

Titles followed the same pattern. Titles with low keyword matching averaged 5.9 citations. Highly keyword-optimized titles averaged 2.8.

The researchers concluded: “ChatGPT prefers URLs that clearly describe the overall topic rather than those strictly optimized for a single keyword.”

Factors That Underperformed

Several commonly recommended AI optimization tactics showed minimal or negative correlation with citations.

FAQ schema markup underperformed. Pages with FAQ schema averaged 3.6 citations. Pages without averaged 4.2.

LLMs.txt files showed negligible impact. Outbound links to high-authority sites also showed minimal effect on citation likelihood.

Why This Matters

The findings suggest your existing SEO strategy may already serve AI visibility goals. If you’re building referring domains, earning traffic, maintaining fast pages, and keeping content updated, you’re addressing the factors this report identified as most predictive.

For smaller sites without extensive backlink profiles, the research points to community engagement on Reddit and Quora as a viable path to building authority signals The data also suggests focusing on content depth over keyword density.

The researchers note that factors are interdependent. Optimizing one signal while ignoring others reduces overall effectiveness.

Looking Ahead

SE Ranking analyzed ChatGPT specifically. Other AI systems may weight factors differently.

SE Ranking doesn’t specify which ChatGPT version or timeframe the data represents, so these patterns should be treated as directional correlations rather than proof of how ChatGPT’s ranking algorithm works.


Featured Image: BongkarnGraphic/Shuttersrtock

The AI Consistency Paradox via @sejournal, @DuaneForrester

Doc Brown’s DeLorean didn’t just travel through time; it created different timelines. Same car, different realities. In “Back to the Future,” when Marty’s actions in the past threatened his existence, his photograph began to flicker between realities depending on choices made across timelines.

This exact phenomenon is happening to your brand right now in AI systems.

ChatGPT on Monday isn’t the same as ChatGPT on Wednesday. Each conversation creates a new timeline with different context, different memory states, different probability distributions. Your brand’s presence in AI answers can fade or strengthen like Marty’s photograph, depending on context ripples you can’t see or control. This fragmentation happens thousands of times daily as users interact with AI assistants that reset, forget, or remember selectively.

The challenge: How do you maintain brand consistency when the channel itself has temporal discontinuities?

The AI Consistency Paradox

The Three Sources Of Inconsistency

The variance isn’t random. It stems from three technical factors:

Probabilistic Generation

Large language models don’t retrieve information; they predict it token by token using probability distributions. Think of it like autocomplete on your phone, but vastly more sophisticated. AI systems use a “temperature” setting that controls how adventurous they are when picking the next word. At temperature 0, the AI always picks the most probable choice, producing consistent but sometimes rigid answers. At higher temperatures (most consumer AI uses 0.7 to 1.0 as defaults), the AI samples from a broader range of possibilities, introducing natural variation in responses.

The same question asked twice can yield measurably different answers. Research shows that even with supposedly deterministic settings, LLMs display output variance across identical inputs, and studies reveal distinct effects of temperature on model performance, with outputs becoming increasingly varied at moderate-to-high settings. This isn’t a bug; it’s fundamental to how these systems work.

Context Dependence

Traditional search isn’t conversational. You perform sequential queries, but each one is evaluated independently. Even with personalization, you’re not having a dialogue with an algorithm.

AI conversations are fundamentally different. The entire conversation thread becomes direct input to each response. Ask about “family hotels in Italy” after discussing “budget travel” versus “luxury experiences,” and the AI generates completely different answers because previous messages literally shape what gets generated. But this creates a compounding problem: the deeper the conversation, the more context accumulates, and the more prone responses become to drift. Research on the “lost in the middle” problem shows LLMs struggle to reliably use information from long contexts, meaning key details from earlier in a conversation may be overlooked or mis-weighted as the thread grows.

For brands, this means your visibility can degrade not just across separate conversations, but within a single long research session as user context accumulates and the AI’s ability to maintain consistent citation patterns weakens.

Temporal Discontinuity

Each new conversation instance starts from a different baseline. Memory systems help, but remain imperfect. AI memory works through two mechanisms: explicit saved memories (facts the AI stores) and chat history reference (searching past conversations). Neither provides complete continuity. Even when both are enabled, chat history reference retrieves what seems relevant, not everything that is relevant. And if you’ve ever tried to rely on any system’s memory based on uploaded documents, you know how flaky this can be – whether you give the platform a grounding document or tell it explicitly to remember something, it often overlooks the fact when needed most.

Result: Your brand visibility resets partially or completely with each new conversation timeline.

The Context Carrier Problem

Meet Sarah. She’s planning her family’s summer vacation using ChatGPT Plus with memory enabled.

Monday morning, she asks, “What are the best family destinations in Europe?” ChatGPT recommends Italy, France, Greece, Spain. By evening, she’s deep into Italy specifics. ChatGPT remembers the comparison context, emphasizing Italy’s advantages over the alternatives.

Wednesday: Fresh conversation, and she asks, “Tell me about Italy for families.” ChatGPT’s saved memories include “has children” and “interested in European travel.” Chat history reference might retrieve fragments from Monday: country comparisons, limited vacation days. But this retrieval is selective. Wednesday’s response is informed by Monday but isn’t a continuation. It’s a new timeline with lossy memory – like a JPEG copy of a photograph, details are lost in the compression.

Friday: She switches to Perplexity. “Which is better for families, Italy or Spain?” Zero memory of her previous research. From Perplexity’s perspective, this is her first question about European travel.

Sarah is the “context carrier,” but she’s carrying context across platforms and instances that can’t fully sync. Even within ChatGPT, she’s navigating multiple conversation timelines: Monday’s thread with full context, Wednesday’s with partial memory, and of course Friday’s Perplexity query with no context for ChatGPT at all.

For your hotel brand: You appeared in Monday’s ChatGPT answer with full context. Wednesday’s ChatGPT has lossy memory; maybe you’re mentioned, maybe not. Friday on Perplexity, you never existed. Your brand flickered across three separate realities, each with different context depths, different probability distributions.

Your brand presence is probabilistic across infinite conversation timelines, each one a separate reality where you can strengthen, fade, or disappear entirely.

Why Traditional SEO Thinking Fails

The old model was somewhat predictable. Google’s algorithm was stable enough to optimize once and largely maintain rankings. You could A/B test changes, build toward predictable positions, defend them over time.

That model breaks completely in AI systems:

No Persistent Ranking

Your visibility resets with each conversation. Unlike Google, where position 3 carries across millions of users, in AI, each conversation is a new probability calculation. You’re fighting for consistent citation across discontinuous timelines.

Context Advantage

Visibility depends on what questions came before. Your competitor mentioned in the previous question has context advantage in the current one. The AI might frame comparisons favoring established context, even if your offering is objectively superior.

Probabilistic Outcomes

Traditional SEO aimed for “position 1 for keyword X.” AI optimization aims for “high probability of citation across infinite conversation paths.” You’re not targeting a ranking, you’re targeting a probability distribution.

The business impact becomes very real. Sales training becomes outdated when AI gives different product information depending on question order. Customer service knowledge bases must work across disconnected conversations where agents can’t reference previous context. Partnership co-marketing collapses when AI cites one partner consistently but the other sporadically. Brand guidelines optimized for static channels often fail when messaging appears verbatim in one conversation and never surfaces in another.

The measurement challenge is equally profound. You can’t just ask, “Did we get cited?” You must ask, “How consistently do we get cited across different conversation timelines?” This is why consistent, ongoing testing is critical. Even if you have to manually ask queries and record answers.

The Three Pillars Of Cross-Temporal Consistency

1. Authoritative Grounding: Content That Anchors Across Timelines

Authoritative grounding acts like Marty’s photograph. It’s an anchor point that exists across timelines. The photograph didn’t create his existence, but it proved it. Similarly, authoritative content doesn’t guarantee AI citation, but it grounds your brand’s existence across conversation instances.

This means content that AI systems can reliably retrieve regardless of context timing. Structured data that machines can parse unambiguously: Schema.org markup for products, services, locations. First-party authoritative sources that exist independent of third-party interpretation. Semantic clarity that survives context shifts: Write descriptions that work whether the user asked about you first or fifth, whether they mentioned competitors or ignored them. Semantic density helps: keep the facts, cut the fluff.

A hotel with detailed, structured accessibility features gets cited consistently, whether the user asked about accessibility at conversation start or after exploring ten other properties. The content’s authority transcends context timing.

2. Multi-Instance Optimization: Content For Query Sequences

Stop optimizing for just single queries. Start optimizing for query sequences: chains of questions across multiple conversation instances.

You’re not targeting keywords; you’re targeting context resilience. Content that works whether it’s the first answer or the fifteenth, whether competitors were mentioned or ignored, whether the user is starting fresh or deep in research.

Test systematically: Cold start queries (generic questions, no prior context). Competitor context established (user discussed competitors, then asks about your category). Temporal gap queries (days later in fresh conversation with lossy memory). The goal is minimizing your “fade rate” across temporal instances.

If you’re cited 70% of the time in cold starts but only 25% after competitor context is established, you have a context resilience problem, not a content quality problem.

3. Answer Stability Measurement: Tracking Citation Consistency

Stop measuring just citation frequency. Start measuring citation consistency: how reliably you appear across conversation variations.

Traditional analytics told you how many people found you. AI analytics must tell you how reliably people find you across infinite possible conversation paths. It’s the difference between measuring traffic and measuring probability fields.

Key metrics: Search Visibility Ratio (percentage of test queries where you’re cited). Context Stability Score (variance in citation rate across different question sequences). Temporal Consistency Rate (citation rate when the same query is asked days apart). Repeat Citation Count (how often you appear in follow-up questions once established).

Test the same core question across different conversation contexts. Measure citation variance. Accept the variance as fundamental and optimize for consistency within that variance.

What This Means For Your Business

For CMOs: Brand consistency is now probabilistic, not absolute. You can only work to increase the probability of consistent appearance across conversation timelines. This requires ongoing optimization budgets, not one-time fixes. Your KPIs need to evolve from “share of voice” to “consistency of citation.”

For content teams: The mandate shifts from comprehensive content to context-resilient content. Documentation must stand alone AND connect to broader context. You’re not building keyword coverage, you’re building semantic depth that survives context permutation.

For product teams: Documentation must work across conversation timelines where users can’t reference previous discussions. Rich structured data becomes critical. Every product description must function independently while connecting to your broader brand narrative.

Navigating The Timelines

The brands that succeed in AI systems won’t be those with the “best” content in traditional terms. They’ll be those whose content achieves high-probability citation across infinite conversation instances. Content that works whether the user starts with your brand or discovers you after competitor context is established. Content that survives memory gaps and temporal discontinuities.

The question isn’t whether your brand appears in AI answers. It’s whether it appears consistently across the timelines that matter: the Monday morning conversation and the Wednesday evening one. The user who mentions competitors first and the one who doesn’t. The research journey that starts with price and the one that starts with quality.

In “Back to the Future,” Marty had to ensure his parents fell in love to prevent himself from fading from existence. In AI search, businesses must ensure their content maintains authoritative presence across context variations to prevent their brands from fading from answers.

The photograph is starting to flicker. Your brand visibility is resetting across thousands of conversation timelines daily, hourly. The technical factors causing this (probabilistic generation, context dependence, temporal discontinuity) are fundamental to how AI systems work.

The question is whether you can see that flicker happening and whether you’re prepared to optimize for consistency across discontinuous realities.

More Resources:


This post was originally published on Duane Forrester Decodes.


Featured Image: Inkoly/Shutterstock

Google Isn’t Going Anywhere: Ahrefs Ambassador On LLM Inclusion & Why Relationships Still Win via @sejournal, @theshelleywalsh

There’s a divided line in the industry between those who think optimizing for AI is separate from SEO and those who think LLM discovery is just SEO. But, this is an unproductive argument, because whatever you think, LLM inclusion is now part of SEO discovery.

So, let’s just focus on how the search journey works now and where you can find real business value.

To discuss inclusion in LLMs, I invited Patrick Stox to the latest edition of IMHO to find out what he thinks. As product advisor, technical SEO, and brand ambassador at Ahrefs, Patrick has plenty of data to work with and insights into what’s actually working for LLM inclusion right now.

In the face of the AI takeover, Patrick’s take is that Google isn’t going anywhere, and he still thinks human relationships are critical.

You can watch the full interview with Patrick on IMHO below.

Google Isn’t Going Anywhere

With the industry obsessing over ChatGPT, AI Overviews, and AI Mode, it’s easy to assume that traditional search really is dead. However, Patrick was quick to say, “I’m not betting against Google.”

“Google is still everything for most people … Most of the people that are using [LLMs] are tech forward, but the majority of folks are still just Googling things”

Recent Ahrefs data estimated that Google owns an estimated 40% of all traffic to websites, with LLM referrals still a fraction by comparison. Although Google’s share of traffic may be down a couple of percent this year, it still dominates.

After experimenting with ChatGPT and Claude when they first launched, Patrick found himself returning to Google’s AI Mode and Gemini, and thinks others will do the same. “Even I just went back to Google,” he admitted. “I think we’re going to see more of that as they improve their systems.”

Google continues releasing competitive AI innovations, and Patrick predicts these will pull many users back into Google’s ecosystem.

“I’m not betting against Google,” he says. “They’ve got more data than anyone, and they’re still on the bleeding edge.”

The Attribution Problem: LLMs Might Drive Conversions, But We Can’t Prove It

Even though sites are seeing growing referrals from LLMs, establishing attribution to any real value from LLM traffic is a challenge right now. We can talk about brand awareness, but C-Suite is only interested in business value.

Patrick agreed that while you can count mentions and citations in AI answers, that doesn’t easily translate into board-level reporting.

“You can measure how often you’re mentioned versus competitors … but going back to a business, I can’t report on that stuff. It’s all secondary, tertiary metrics.”

For Patrick, revenue and revenue-adjacent metrics still matter. That said, Ahrefs has had some signals from AI search traffic.

“We did track the signups. When I first looked at this data back in July, all the traffic from AI search was half a percent of our traffic total. But at the time, it was 12.1% of our total conversions.” He explained.

This has now dropped below 10%, while the traffic share has grown slightly.

Two Strategies That Are Working For LLM Inclusion

I asked if Ahrefs is actively investing in LLM inclusion, and Patrick said they are trying a number of different things, and the two fundamental approaches that determine LLM visibility are repetition and differentiation.

“Whatever the internet says, that’s kind of what’s being returned in these systems,”

Repetition means ensuring consistent messaging across multiple websites. LLMs synthesize what “the internet says,” so if you want to be recognized for something, that narrative needs to exist broadly. For Ahrefs, this has meant actively spreading the message that they have evolved beyond just SEO tools into a comprehensive digital marketing platform.

Differentiation through original data works alongside the repetition to stand out. Ahrefs has invested heavily in unique data studies throughout the year, including non-English language research. “This data is being heavily cited, heavily returned in these systems because there’s nothing else out there like it,” Patrick explained.

The more surprising tactic that is also currently working is listicles.

“I hate to say it, but listicles … they work right now. I don’t think it’s future-proof at all, but at the same time, I don’t want to just not be there.”

Agentic AI And The Threat Of Closed Systems

I then asked about agentic AI and systems, and does Patrick have concerns about systems becoming closed.

As LLM agents begin booking travel, making purchases, or accessing APIs directly, most likely they would rely on a small set of partners from big brands.

“ChatGPT isn’t going to make deals with unknown companies,” Stox says. “If they book flights, they’ll use major providers. If they use a dictionary, they’ll pick one dictionary.”

This would be the real threat to smaller businesses. “If an agent decides ‘we only check out through Amazon,’ a lot of stores lose sales overnight,” Patrick warns. There is no guaranteed defense. The only strategy we can follow right now is to grow your brand and footprint.

“What was the thing they used to say for Google? Make them embarrassed to not have you included.”

Beyond LLM Optimization: Channels That Still Matter

Patrick emphasized a point that’s possibly been forgotten in the AI hype: “It’s not ChatGPT that’s the second largest search engine, it’s still YouTube by far.”

YouTube has been a hugely successful referral platform for Ahrefs, and the company invested heavily in video. Patrick recommends both long and short-form, for brand discovery.

Community participation on platforms such as Reddit, Slack, and Discord also offers substantial value, but only when companies genuinely participate rather than spam.

While many brands have tried to brute-force Reddit with spam, Patrick says there can be huge value in genuine participation, especially when employees are allowed to represent the company authentically.

“You have literally a paid workforce of advocates who work for your company. Let them go out and talk to people … answer questions, basically advertise for you. They want to do it already. So let them.”

If You Started A Product Today, Where Would You Bet?

As a final question, I asked Patrick where he’d invest if launching a startup today; he did not hesitate to say relationships.

“If I launched a startup, the first thing I’d invest in is relationships. That’s still the most powerful channel … I think if I did do something like that, I’d probably grow it pretty fast. More from my connections than anything else,” he said.

After relationships, he’d focus on YouTube, website content creation, and telling friends about the product. In other words, “just normal marketing.”

“We’ve gone through this tech revolution, and now we’re realizing everything still comes back to direct connections with people.”

And that may be the most important insight of all. In an era of AI-driven discovery, the brands that win are the ones that remain unmistakably human.

Watch the full video interview with Patrick Stox here:

Thank you to Patrick Stox for offering his insights and being my guest on IMHO.

More Resources:


Featured Image: Shelley Walsh/Search Engine Journal

ChatGPT Adds Shopping Research For Product Discovery via @sejournal, @MattGSouthern

OpenAI launched shopping research in ChatGPT, a feature that creates personalized buyer’s guides by researching products across the web. The tool is rolling out today on mobile and web for logged-in users on Free, Go, Plus, and Pro plans.

The company is offering nearly unlimited usage through the holidays.

What’s New

Shopping research works differently from standard ChatGPT responses. Users describe what they need, answer clarifying questions about budget and preferences, and receive a buyer’s guide after a few minutes.

The feature pulls information including price, availability, reviews, specs, and images from across the web. You can guide the research by marking products as “Not interested” or “More like this” as options appear.

OpenAI’s announcement states:

“Shopping research is built for that deeper kind of decision-making. It turns product discovery into a conversation: asking smart questions to understand what you care about, pulling accurate, up-to-date details from high-quality sources, and bringing options back to you to refine the results.”

The company says the tool performs best in categories like electronics, beauty, home and garden, kitchen and appliances, and sports and outdoor.

Technical Details

Shopping research is powered by a shopping-specialized GPT-5 mini variant post-trained on GPT-5-Thinking-mini.

OpenAI’s internal evaluation shows shopping research reached 52% product accuracy on multi-constraint queries, compared with 37% for ChatGPT Search.

Product accuracy measures how well responses meet user requirements for attributes like price, color, material, and specs. The company designed the system to update and refine results in real time based on user feedback.

Privacy & Data Sharing

OpenAI states that user chats are never shared with retailers. Results are organic and based on publicly available retail sites.

Merchants who want to appear in shopping research results can follow an allowlisting process through OpenAI.

Limitations

OpenAI acknowledges the feature isn’t perfect. The model may make mistakes about product details like price and availability. The company encourages users to visit merchant sites for the most accurate information.

Why This Matters

This feature pulls more of the product comparison journey into one place.

As shopping research handles more of the “which one should I buy?” work inside ChatGPT, some of that early-stage discovery could happen without a traditional search click.

For retailers and affiliate publishers, that raises the stakes for inclusion in these results. Visibility may depend on how well your products and pages are represented in OpenAI’s shopping system and allowlisting process.

Looking Ahead

Shopping research in ChatGPT is now available to logged-in users starting today. OpenAI plans to add direct purchasing through ChatGPT for merchants participating in Instant Checkout, though no timeline was provided.


Featured Image: Koshiro K/Shutterstock

Google’s Mueller Questions Need For LLM-Only Markdown Pages via @sejournal, @MattGSouthern

Google Search Advocate John Mueller has pushed back on the idea of building separate Markdown or JSON pages just for large language models (LLMs), saying he doesn’t see why LLMs would need pages that no one else sees.

The discussion started when Lily Ray asked on Bluesky about “creating separate markdown / JSON pages for LLMs and serving those URLs to bots,” and whether Google could share its perspective.

Ray asked:

Not sure if you can answer, but starting to hear a lot about creating separate markdown / JSON pages for LLMs and serving those URLs to bots. Can you share Googleʼs perspective on this?

The question draws attention to a developing trend where publishers create “shadow” copies of important in formats that are easier for AI systems to understand.

There’s a more active discussion on this topic happening on X.

What Mueller Said About LLM-Only Pages

Mueller replied that he isn’t aware of anything on Google’s side that would call for this kind of setup.

He notes that LLMs have worked with regular web pages from the beginning:

I’m not aware of anything in that regard. In my POV, LLMs have trained on – read & parsed – normal web pages since the beginning, it seems a given that they have no problems dealing with HTML. Why would they want to see a page that no user sees? And, if they check for equivalence, why not use HTML?

When Ray followed up about whether a separate format might help “expedite getting key points across to LLMs quickly,” Mueller argued that if file formats made a meaningful difference, you would likely hear that directly from the companies running those systems.

Mueller added:

If those creating and running these systems knew they could create better responses from sites with specific file formats, I expect they would be very vocal about that. AI companies aren’t really known for being shy.

He said some pages may still work better for AI systems than others, but he doesn’t think that comes down to HTML versus Markdown:

That said I can imagine some pages working better for users and some better for AI systems, but I doubt that’s due to the file format, and it’s definitely not generalizable to everything. (Excluding JS which still seems hard for many of these systems).”

Taken together, Mueller’s comments suggest that, from Google’s point of view, you don’t need to create bot-only Markdown or JSON clones of existing pages just to be understood by LLMs.

How Structured Data Fits In

Other individuals in the thread drew a line between speculative “shadow” formats and cases where AI platforms have clearly defined feed requirements.

A reply from Matt Wright pointed to OpenAI’s eCommerce product feeds as an example where JSON schemas matter.

In that context, a defined spec governs how ChatGPT ingests and displays product data. Wright explains:

Interestingly, the OpenAI eCommerce product feeds are live: JSON schemas appear to have a key role in AI search already.

That example supports the idea that structured feeds and schemas are most important when a platform publishes a spec and asks you to use it.

Additionally, Wright points to a thread on LinkedIn where Chris Long observed that “editorial sites using product schemas, tend to get included in ChatGPT citations.”

Why This Matters

If you’re questioning whether to build “LLM-optimized” Markdown or JSON versions of your content, this exchange can help steer you back to the basics.

Mueller’s comments reinforce that LLMs have long been able to read and parse standard HTML.

For most sites, it’s more productive to keep improving speed, readability, and content structure on the pages you already have, and to implement schema where there’s clear platform guidance.

At the same time, the Bluesky thread shows that AI-specific formats are starting to emerge in narrow areas such as product feeds. Those are worth tracking, but they’re tied to explicit integrations, not a blanket rule that markdown is better for LLMs.

Looking Ahead

The conversation highlights how fast AI-driven search changes are turning into technical requests for SEO and dev teams, often before there is documentation to support them.

Until LLM providers publish more concrete guidelines, this thread points you back to work you can justify today: keep your HTML clean, reduce unnecessary JavaScript where it makes content hard to parse, and use structured data where platforms have clearly documented schemas.


Featured Image: Roman Samborskyi/Shutterstock

LLMs.txt Shows No Clear Effect On AI Citations, Based On 300k Domains via @sejournal, @MattGSouthern

A new analysis from SE Ranking suggests the llms.txt file isn’t delivering measurable benefits yet.

After examining roughly 300,000 domains, the company found no relationship between having llms.txt and how often a domain is cited in major LLM answers.

What The Data Says

Adoption Is Thin

SE Ranking’s crawl found llms.txt on 10.13% of domains. In other words, nearly nine out of ten sites they measured haven’t implemented it.

That low usage matters because the format is sometimes described as an emerging baseline for AI visibility. The data instead shows scattered experimentation. SE Ranking says adoption is fairly even across traffic tiers and not concentrated among the biggest brands.

High-traffic sites were slightly less likely to use the file than mid-tier websites in their dataset.

No Measurable Link To LLM Citations

To assess whether the llms.txt file affects AI visibility, SE Ranking analyzed domain-level citation frequency across responses from prominent LLMs. They employed statistical correlation tests and an XGBoost model to determine the extent to which each factor contributed to citations.

The main finding was that removing the llms.txt feature actually improved the model’s accuracy. SE Ranking concludes that llms.txt “doesn’t seem to directly impact AI citation frequency. At least not yet.”

Additionally, they found no significant correlation between citations and the file using simpler statistical methods.

How This Squares With Platform Guidance

SE Ranking notes that its results align with public platform guidance. But it’s important to be precise about what is confirmed.

Google hasn’t indicated that llms.txt is used as a signal in AI Overviews or AI Mode. In its AI search guidance, Google frames it as an evolution of Search that continues to rely on its existing Search systems and signals, without mentioning llms.txt as an input.

OpenAI’s crawler documentation similarly focuses on robots.txt controls. OpenAI recommends allowing OAI-SearchBot in robots.txt to support discovery for its search features, but does not say llms.txt affects ranking or citations.

SE Ranking also notes that some SEO logs show GPTBot occasionally fetching llms.txt files, though they say it doesn’t happen often and does not appear tied to citation outcomes.

Taken together, the dataset suggests that even if some models retrieve the file, it’s not influencing citation behavior at scale right now.

What This Means For You

If you want a clean, low-risk way to prepare for possible future adoption, adding llms.txt is easy and unlikely to cause technical harm.

But if the goal is a near-term visibility bump in AI answers, the data says you shouldn’t expect one.

That puts llms.txt in the same category as other early AI-visibility tactics. Reasonable to test if it fits your workflow, but not something to sell internally as a proven lever.


Featured Image: Mameraman/Shutterstock

New Data Finds Gap Between Google Rankings And LLM Citations via @sejournal, @MattGSouthern

Large language models cite sources differently than Google ranks them.

Search Atlas, an SEO software company, compared citations from OpenAI’s GPT, Google’s Gemini, and Perplexity against Google search results.

The analysis of 18,377 matched queries finds a gap between traditional search visibility and AI platform citations.

Here’s an overview of the key differences Search Atlas found.

Perplexity Is Closest To Search

Perplexity performs live web retrieval, so you would expect its citations to look more like search results. The study supports that.

Across the dataset, Perplexity showed a median domain overlap of around 25–30% with Google results. Median URL overlap was close to 20%. In total, Perplexity shared 18,549 domains with Google, representing about 43% of the domains it cited.

ChatGPT And Gemini Are More Selective

ChatGPT showed much lower overlap with Google. Its median domain overlap stayed around 10–15%. The model shared 1,503 domains with Google, accounting for about 21% of its cited domains. URL matches typically remained below 10%.

Gemini behaved less consistently. Some responses had almost no overlap with search results. Others lined up more closely. Overall, Gemini shared just 160 domains with Google, representing about 4% of the domains that appeared in Google’s results, even though those domains made up 28% of Gemini’s citations.

What The Numbers Mean For Visibility

Ranking in Google doesn’t guarantee LLM citations. This report suggests the systems draw from the web in different ways.

Perplexity’s architecture actively searches the web and its citation patterns more closely track traditional search rankings. If your site already ranks well in Google, you are more likely to see similar visibility in Perplexity answers.

ChatGPT and Gemini rely more on pre-trained knowledge and selective retrieval. They cite a narrower set of sources and are less tied to current rankings. URL-level matches with Google are low for both.

Study Limitations

The dataset heavily favored Perplexity. It accounted for 89% of matched queries, with OpenAI at 8% and Gemini at 3%.

Researchers matched queries using semantic similarity scoring. Paired queries expressed similar information needs but were not identical user searches. The threshold was 82% similarity using OpenAI’s embedding model.

The two-month window provides a recent snapshot only. Longer timeframes would be needed to see whether the same overlap patterns hold over time.

Looking Ahead

For retrieval-based systems like Perplexity, traditional SEO signals and overall domain strength are likely to matter more for visibility.

For reasoning-focused models like ChatGPT and Gemini, those signals may have less direct influence on which sources appear in answers.


Featured Image: Ascannio/Shutterstock

Should Advertisers Be Worried About AI In PPC?

One scroll through LinkedIn and you’d struggle not to see a post, video, or ad about AI, whatever the industry you work in.

For digital marketing, it’s completely taken over, and it has woven itself into nearly every aspect of day-to-day life, especially within PPC advertising.

From automated bidding to AI-generated ad creative, platforms like Google Ads and Microsoft Advertising have been doubling down on this for years.

Naturally, this shift raises questions and concerns among advertisers, with one side claiming it’s out of control and taking over, the other side boasting about time saved and game-changing results, and then you’ve got the middle ground trying to figure out exactly what the impact is and where it is going.

It’s a difficult topic to answer with a simple yes or no, with so many opinions and platforms for sharing them; it’s everywhere, and although certainly not a topic that is in its infancy, it does feel that way in 2025.

In this article, we’ll explore how AI is used in PPC today, the benefits it offers, the concerns it brings, and how advertisers can best adapt.

What Role Does AI Play In PPC Today?

The majority of advertisers are already using some form of AI-driven tool in their workflow, with 74% of marketers reported using AI tools last year, up from just 21% in 2022.

Then, within the platforms, PPC campaigns are heavily invested in artificial intelligence, both above and below the hood. Key areas being:

Bid Automation

Gone are the days of manual bidding on hundreds of keywords or product groups (in most cases).

Google’s and Microsoft’s Automated Bidding use machine learning to set optimal bids for each auction based on the likelihood to convert.

These algorithms analyze countless signals (device, location, time of day, user behavior patterns, etc.) in real-time to adjust bids far more precisely than a human could.

In this scenario, the role of the advertiser is to feed these bidding strategies with the best possible data to then take forward in making decisions.

Then at a strategic level, advertisers will need to determine the structure, targeting, goals, etc, and this is where Google has further pushed AI into the hands of PPC teams.

From Google’s side, it’s an indication of trust that the AI will find relevant matches and handle bids for them, and I have seen this work incredibly well, but I’ve also seen this work terribly, and it’s all context-dependent.

Dynamic Creative & Assets

Responsive Search Ads (RSAs) allow advertisers to input multiple headlines and descriptions, which Google’s AI then mixes and matches to serve the best-performing combinations for each query.

Over time, the algorithm learns which messages resonate most.

Google has even introduced generative AI tools to create ad assets (headlines, images, etc.) automatically based on your website content and campaign goals.

Similarly, Microsoft’s platform now offers a Copilot feature that can generate ad copy variations, images, and suggest keywords using AI.

Of all the AI-related changes in Google Ads, in my experience, this was one that advertisers welcomed the most, as it is a time saver and created a nice way to test different messaging, call to actions, etc.

Keyword Match Types

The recipe for Google Ads in 2025 that advertisers are given from Google is to blend broad match and automated bidding.

Why is this? According to Google, machine learning attempts to understand user intent and match ads to queries that aren’t exact matches but are deemed relevant.

Think about it this way: You’ve done your research for your new search campaign, built out your ad groups, and are confident that you have covered all bases.

How will this change over time, and how can you guarantee you’re not missing relevant auctions? This is rhetoric Google runs with for broad match as it leans into the stats with billions of searches per day, with ~15% being brand new queries, pushing advertisers to loosen targeting to allow machine learning to operate constraint-free.

There is certainly value in this, and it’s reported that 62% of advertisers using Google’s Smart Bidding have made broad match their primary keyword match type, a strategy that was very much a no-go for years; however, handing all control over to AI doesn’t fully align with what matters most (profitability, LTV, margins, etc) and there has to be a middle ground.

Audience Targeting And Optimization

Both Google and Microsoft leverage AI to build and target audiences.

Campaign types like Performance Max are almost entirely AI-driven; they automatically allocate your budget across search, display, YouTube, Gmail, etc., to find conversions wherever they occur.

Advertisers simply provide creative assets, search themes, conversion goals, etc, and the AI does the rest.

The better quality the data inputted, the better the performance to a large degree.

Of all the AI topics for Google Ads, PMax is very much debated within the industry, but it’s telling that 63% of PPC experts plan to increase spend on Google’s feed-based Performance Max campaigns this year.

Recommendations, Auto Applies, And Budget Optimization

If you work within/around PPC, you’ll have seen, closed, shouted at, and maybe on a rare occasion, taken action off the back of these.

The platforms continuously analyze account performance and suggest optimizations.

Some are basic, but others (like budget reallocation or shifting to different bid strategies) are powered by machine learning insights across thousands of accounts.

As good as these may sound, they are only as good as the data being fed into the account and lack context, which, in some cases, if applied, can be detrimental to account performance.

In summary, advertisers have had to embrace AI to a large extent in their day-to-day campaign management.

But with this embrace comes a natural question: Is all this AI making things better or worse for advertisers, or is it just a way for ad platforms to grow their market share?

What Are The Benefits Of AI In PPC?

AI offers some clear advantages for paid search marketers.

When used properly, AI can make campaigns more efficient, effective, and can save a great deal of time once spent on monotonous tasks.

Here are some key benefits:

Efficiency And Time Savings

One of the biggest wins is automation of labor-intensive tasks.

AI can analyze massive data sets and adjust bids or ads 24/7, far faster than any human.

This frees up marketers to focus on strategy instead of repetitive tasks.

Mundane tasks such as bid adjustments, budget pacing, creative rotation, etc, can be picked up by AI to allow PPC teams to focus on high-level strategy and analysis, looking at the bigger picture.

It’s certainly not a case of set-and-forget, but the balance has shifted.

AI can now take care of the executional heavy lifting, while humans guide the strategy, interpret the nuance, and make the judgment calls that machines can’t.

Structural Management

A clear benefit of AI in many facets of paid search is the consolidation of account structures.

Large advertisers might have millions of keywords or hundreds of ads, which at one time were manually mapped out and managed group by group.

With automated bidding strategies adjusting bids in real time, serving the best possible creative and doubling down on the keywords, product groups, and SKUs that work, PPC teams are able to whittle down overly complex account structures into consolidated themes where they can feed their data.

Campaigns like Performance Max scale across channels automatically, finding additional inventory (like YouTube or Display) without the advertiser manually creating separate campaigns, further making life easier for advertisers who choose to use them.

Optimization Of Ad Creative And Testing

Rather than running a handful of ad variations, responsive ads powered by AI can test dozens of combinations of headlines and descriptions instantly.

The algorithm learns which messages work best for each search term or audience segment.

Additionally, new generative AI features can create ad copy or image variations you hadn’t considered, expanding creative possibilities, but please check these before launch, and if set to auto apply, maybe remove and review first, as these outputs can be interesting.

The overarching goal from the ad platforms is to work towards solving the problem many teams face regarding getting creatives produced and fast, which they do to an extent, but there’s still a way to go.

Audience Targeting And Personalization

AI can identify user patterns to target more precisely than manual bidding.

Google’s algorithms might learn that certain search queries or user demographics are more likely to convert and automatically adjust bids or show specific ad assets to those segments, and as these change over time, so do the bidding strategies.

This kind of micro-optimization of who sees which ad was very hard to do manually, and has great limitations.

In essence, the machine finds your potential customers using complex signals that adjust bids in real time based on the user vs. setting a bid for a term/product group to serve in every ad set, essentially treating each auction the same.

What Are The Concerns Of AI In PPC?

Despite all the promise, it’s natural for advertisers to have some worries about the march of AI in paid search.

Handing over control to algorithms and black box systems comes with its challenges.

In practice, there have been hiccups and valid concerns that explain why some in the industry are cautious.

Loss Of Control And Transparency

A common gripe is that as AI takes over, advertisers lose visibility into the “why” behind performance changes.

Take PMax, for example. These fully automated campaigns provide only limited data when compared to a segmented structure, making it hard to understand what’s driving conversions and putting advertisers in a difficult position when feeding back performance to stakeholders who once had a wealth of data to dig through.

Nearly half of PPC specialists said that managing campaigns has become harder in the last two years because of the loss of insights and data due to automated campaign types like PMax, with one industry survey finding that trust in major ad platforms has declined over the past year, with Google experiencing a 54% net decline in trust sentiment.

Respondents cited the platforms’ prioritization of black box automation over giving users control as a key issue, with many feeling like they are flying partially blind, a huge worry considering budgets and importance of Google Ads as an advertising channel for millions of brands worldwide.

Performance And Efficiency Trade-Offs

I’ve mentioned this a couple of times so far, but as with most AI in the context of Google Ads, the data being fed into the platform determines how well the AI performs, and adopting AI in PPC does not result in immediate performance improvements for every account, however hard Google pushes this narrative.

Algorithms optimize for the goal you set (e.g., achieve this ROAS), sometimes at the expense of other metrics like cost per conversion or return on investment (ROI).

Take broad match keywords combined with Smart Bidding; this might bring in more traffic, but some of that traffic could be low quality or not truly incremental, impacting the bottom line and how you manage your budgets.

To be taken with a pinch of salt due to context, however, an analysis of over 2,600 Google Ads accounts found that 72% of advertisers saw better return on ad spend (ROAS) with traditional exact match keyword targeting, whereas only ~26% of accounts achieved better ROAS using broad match automation.

Advertisers are rightly concerned that blindly following AI recommendations could lead to wasted spend on irrelevant clicks or diminishing returns.

Then, you have the learning period for automated strategies, which can also be costly (but necessary) where the algorithm might spend a lot figuring out what works, something not every business can afford.

Mistakes, Quality, And Brand Safety

AI isn’t infallible.

There have been instances of AI-generated ad copy that miss the mark or even violate brand guidelines.

For example, if you let generative AI create search ads, it might produce statements that are factually incorrect or not in the desired tone.

Having worked extensively in paid search for luxury fashion brands, the risk of AI producing off-brand creative and messaging is often a roadblock to getting on board with new campaign types.

In a Salesforce survey, 31% of marketing professionals cited accuracy and quality concerns with AI outputs as a barrier.

To add further complexity to this, many of the features, such as auto applies in Google Ads, are not the easiest to spot within the accounts and are dependent on the level of expertise within the team managing PPC; certain AI-generated assets or enhancements could be live without teams knowing, which can lead to friction within businesses with strict brand guidelines.

Over-Reliance And Skills Erosion

Another subtle worry is that marketers relying heavily on AI could see their own skills become redundant.

PPC professionals used to pride themselves on granular account optimization, but if the machine is doing everything, how will their jobs change?

A study by HubSpot found that over 57% of U.S. marketers feel pressure to learn AI tools or risk becoming irrelevant in their careers.

With PPC, all this means is that less and less time is spent within the accounts undertaking repetitive tasks, something that I’ve championed for years.

Every paid search team is different and is built from different levels of expertise; however, the true value that PPC teams bring shouldn’t be the intricacies of campaign management, it’s the understanding of the value their channel is driving and everything around this that influences performance.

So, Should Advertisers Be Worried About AI In PPC?

As with most topics in PPC (and most articles I write), there isn’t a simple yes or no answer, and it’s very much context dependent.

PPC advertisers shouldn’t panic; they should be aware, informed, and prepared, and this doesn’t mean knowing the exact ins and outs of AI models, far from it.

Rather than asking if you trust it or not, or if you really should give up the reins of manual campaign management, ask yourself how you can use AI to make your job easier and to drive better results for your business/clients.

Over my last decade and a half in performance marketing, working in-house, within independents, networks, and from running my own paid media agency, I’ve seen many trends come and go, each one shifting the role of the PPC team ever so slightly.

AI is certainly not a trend, and it’s fundamentally changing the world we live in, and within the PPC world, it’s changing the way we work, pushing advertisers to spend less time in the accounts than they once did, freeing up time to allocate to what really moves the needle when managing paid media.

In my opinion, this is a good thing, but there is definitely a balance that needs to be struck, and what this balance looks like is up to you and your teams.

More Resources:


Featured Image: Roman Samborskyi/Shutterstock

Google Brings Gemini 3 To Search’s AI Mode via @sejournal, @MattGSouthern

Google has integrated Gemini 3 in Search’s AI Mode. This marks the first time Google has shipped a Gemini model to Search on its release date.

Google AI Pro and Ultra subscribers in the U.S. can access Gemini 3 Pro by selecting “Thinking” from the model dropdown in AI Mode.

Robby Stein, VP and GM of Google Search, wrote on X:

“Gemini 3, our most intelligent model, is landing in Google Search today – starting with AI Mode. Excited that this is the first time we’re shipping a new Gemini model in Search on day one.”

Google plans to expand Gemini 3 in AI Mode to all U.S. users soon, with higher usage limits for Pro and Ultra subscribers.

What’s New

Search Updates

Google describes Gemini 3 as a model with state-of-the-art reasoning and deep multimodal understanding.

In the context of Search, it’s designed to explain advanced concepts, work through complex questions, and support interactive visuals that run directly inside AI Mode responses.

With Gemini 3 in place, Google says AI Mode has effectively re-architected what a “helpful response” looks like.

Stein explains:

“Gemini 3 is also making Search smarter by re-architecting what a helpful response looks like. With new generative UI capabilities, Gemini 3 in AI Mode can now dynamically create the overall response layout when it responds to your query – completely on the fly.”

Instead of only returning a block of text, AI Mode can design a response layout tailored to your query. That includes deciding when to surface images, tables, or other structured elements so the answer is clearer and easier to work with.

In the coming weeks, Google will add automatic model selection, Stein continues:

“Search will intelligently route tough questions in AI Mode and AI Overviews to our frontier model, while continuing to use faster models for simpler tasks.”

Enhanced Query Fan-Out

Gemini 3 upgrades Google’s query fan-out technique.

According to Stein, Search can now issue more related searches in parallel and better interpret what you’re trying to do.

A potential benefit, Stein adds, is that Google may find content it previously missed:

“It now performs more and much smarter searches because Gemini 3 better understands you. That means Search can now surface even more relevant web content for your specific question.”

Generative UI

Gemini 3 in AI Mode introduces generative UI features that build dynamic visual layouts around your query.

The model analyzes your question and constructs a custom response using visual elements such as images, tables, and grids. When an interactive tool would help, Gemini 3 can generate a small app in real time and embed it directly in the answer.

Examples from Google’s announcement include:

  • An interactive physics simulation for exploring the three-body problem
  • A custom mortgage loan calculator that lets you compare different options and estimate long-term savings

All of these responses include prominent links to high-quality content across the web so you can click through to source material.

See a demonstration in Google’s launch video below:

Why This Matters

Gemini 3 changes how your content is discovered and used in AI Mode. With deeper query fan-out, Google can access more pages per question, which might influence which sites are cited or linked during long, complex searches.

The updated layouts and interactive features change how links appear on your screen. On-page tools, explainers, and visualizations could now compete directly with Google’s own interface.

As Gemini 3 becomes available to more people, it will be important to watch how your content is shown or referenced in AI responses, in addition to traditional search rankings.

Looking Ahead

Google says it will continue refining these updates based on feedback as more people try the new tools. Automatic model selection is set to arrive in the coming weeks for Google AI Pro and Ultra subscribers in the U.S., with broader U.S. access to Gemini 3 in AI Mode planned but not yet scheduled.

Selling AI Search Strategies To Leadership Is About Risk via @sejournal, @Kevin_Indig

Boost your skills with Growth Memo’s weekly expert insights. Subscribe for free!

AI search visibility isn’t “too risky” to invest in for executives to buy-in. Selling AI search strategies to leadership is about risk.

Image Credit: Kevin Indig

A Deloitte survey of +2,700 leaders reveals that getting buy-in for an AI search strategy isn’t about innovation, but risk.

SEO teams keep failing to sell AI search strategies for one reason: They’re pitching deterministic ROI in a probabilistic environment.

The old way: Rankings → traffic → revenue. But that event chain doesn’t exist in AI systems.

LLMs don’t rank. They synthesize. And Google’s AI Overviews and AI Mode don’t “send traffic.” They answer.

Yet most teams still walk into a leadership meeting with a deck built on a decaying model. Then, executives say no – not because AI search “doesn’t work,” but because the pitch asks them to fund an outcome nobody can guarantee.

In AI search, you cannot sell certainty. You can only sell controlled learning.

1. You Can’t Sell AI Search With A Deterministic ROI Model

Everyone keeps asking the wrong question: “How do I prove my AI search strategy will work so leadership will fund it?” You can’t; there’s no traffic chain you can model. Randomness is baked directly into the outputs.

You’re forcing leadership to evaluate your AI search strategy with a framework that’s already decaying. Confusion about AI search vs. traditional SEO metrics and forecasting is blocking you from buy-in. When SEO teams try to sell an AI search strategy to leadership, they often encounter several structural problems:

  1. Lack of clear attribution and ROI: Where you see opportunity, leadership sees vague outcomes and deprioritizes investment. Traffic and conversions from AI Overviews, ChatGPT, or Perplexity are hard to track.
  2. Misalignment with core business metrics: It’s harder to tie results to revenue, CAC, or pipeline – especially in B2B.
  3. AI search feels too experimental: Early investments feel like bets, not strategy. Leadership may see this as a distraction from “real” SEO or growth work.
  4. No owned surfaces to leverage: Many brands aren’t mentioned in AI answers at all. SEO teams are selling a strategy that has no current baseline.
  5. Confusion between SEO and AI search strategy: Leadership doesn’t understand the distinction between optimizing for classic Google Search vs. LLMs vs. AI Overviews. Clear differentiation is needed to secure a new budget and attention.
  6. Lack of content or technical readiness: The site lacks the structured content, brand authority, or documentation to appear in AI-generated results.

2. Pitch AI Search Strategy As Risk Mitigation, Not Opportunity

Executives don’t buy performance in ambiguous environments. They buy decision quality. And the decision they need you to make is simple: Should your brand invest in AI-driven discovery before competitors lock in the advantage – or not?

Image Credit: Kevin Indig

AI search is still an ambiguous environment. That’s why your winning strategy pitch should be structured for fast, disciplined learning with pre-set kill criteria instead of forecasting traffic → revenue. Traditionally, SEO teams pitch outcomes (traffic, conversions), but leadership needs to buy learning infrastructure (testing systems, measurement frameworks, kill criteria) for AI search.

Leadership thinks you’re asking for “more SEO budget” when you’re actually asking them to buy an option on a new distribution channel.

Everyone treats the pitch as “convince them it will work” when it should be “convince them the cost of not knowing is higher than the cost of finding out.” Executives don’t need certainty about impact – they need certainty that you’ll produce a decision with their money.

Making stakes crystal clear:

Your Point of View + Consequences = Stakes. Leaders need to know what happens if they don’t act.

Image Credit: Kevin Indig

The cost of passing on an AI search strategy can be simple and brutal:

  1. Competitors who invest early in AI search visibility will build entity authority and brand presence.
  2. Organic traffic stagnates and will drop over time while cost-per-click rises.
  3. AI Overviews and AI Mode outputs will replace queries your brand used to win in Google.
  4. Your influence on the next discovery channel will be decided without you.

AI search strategy builds brand authority, third-party mentions, entity relationships, content depth, pattern recognition, and trust signals in LLMs. These signals compound. They also freeze into the training data of future models.

If you aren’t shaping that footprint now, the model will rely on whatever scraps already exist based on whatever your competitors are feeding it.

3. Sell Controlled Experiments – Small, Reversible, And Time-Boxed

You’re asking for resources to discover the truth before the market makes the decision for you. This approach collapses resistance because it removes the fear of sunk cost and turns ambiguity into manageable, reversible steps.

A winning AI search strategy proposal sounds like:

  • “We’ll run x tests over 12 months.”
  • “Budget: ≤0.3% of marketing spend.”
  • “Three-stage gates with Go/No-Go decisions.”
  • “Scenario ranges instead of false-precision forecasts.”
  • “We stop if leading indicators don’t move by Q3.”

45% of executives rely more on instinct than facts. Balance your data with a compelling narrative – focus on outcomes and stakes, not technical details.

I covered how to build a pitch deck and strategic narrative in how to explain the value of SEO to executives, but focus on selling learning as a deliverable under the current AI search landscape.

When presenting to leaders, they focus on three things only: money (revenue, profit, cost), market (market share, time-to-market), and exposure (retention, risk). Structure every pitch around these.

The SCQA framework (Minto Pyramid) guides you:

  • Situation: Set the context.
  • Complication: Explain the problem.
  • Question: What should we do?
  • Answer: Your recommendation.

This is the McKinsey approach – and executives expect it.


Featured Image: Paulo Bobita/Search Engine Journal