How To Design URL Structures For AI Retrieval, Not Just Rankings

For years, URL structure was a technical SEO checkbox. Keep it short, use hyphens, include the keyword, done.

While that playbook still works, it’s increasingly incomplete. A growing share of the target audience now discovers content through AI assistants and large language models like ChatGPT, Perplexity, Claude, Google’s AI Overviews, and more.

These systems retrieve and synthesize information differently from traditional search crawlers, and if your URL architecture isn’t built with that in mind, you are increasing your chances of not being cited by LLMs.

In the new age of search, we need to extend those SEO fundamentals to also align with AI bots and how they crawl URLs.

Why AI Systems Read URLs Differently

Search engines have spent decades developing sophisticated crawling and indexing infrastructure. They follow redirects, resolve canonicals, parse JavaScript (sometimes…), and can infer context from a page when the URL is a string of random characters.

AI retrieval systems, particularly retrieval-augmented generation (RAG) pipelines and web-connected LLMs, often work differently.

There are three core parts to how RAG works:

  1. The input prompt is converted into a vector embedding
  2. Relevant passages are then retrieved from indexed URLs, documents and knowledge graphs in traditional search results like Google and Bing.
  3.  An LLM like ChatGPT or similar will then process this information and generate a refined response.

A developer-built RAG system will essentially use data sources from URLs to extract content – they will crawl the URL, convert the web content into searchable “chunks” and store them as numerical vectors for later retrieval.

This is now also evolving into a realm of URL context grounding, which is specific to Gemini. The aim for URL context grounding is to help Gemini (and presumably AI Overviews / AI Mode) to better understand and answer questions about content and data in individual URLs without performing traditional RAG processing.

The aim here is for the LLM to specifically pull direct information from multiple URLs, analyze multiple reports and combine information from several sources to generate more accurate summaries. This should, in theory, help to improve AI factual accuracy and reduce hallucinations.

Then there’s zero shot classification – a technique that enables models to categorize the purpose of a webpage without any task-specific training data.

Rather than relying on labeled examples, the model analyzes semantic cues such as URL structures (treated as plain text strings) and maps them to predefined categories using methods like cosine similarity or prompt-based reasoning.

This works by drawing on the model’s pre-trained language knowledge to infer a page’s likely function, while also detecting distinct patterns in the words and phrasing that signal what type of content the page contains.

This has been particularly useful in identifying phishing links and other malicious links based solely on their URL patterns but also indicates how LLMs could begin to leverage zero-shot classification to rely solely on URLs to infer semantic relevance.

A URL that communicates nothing forces LLM models to work harder and introduces ambiguity in how the content gets categorized.

More practically, when an AI system cites a source in a response, it often surfaces the URL alongside the excerpt. That URL becomes visible to real users, in the same way it does in a search result, and they’re going to make real decisions about whether or not to click.

A clean, descriptive path builds trust in a way that something like /p?id-4821 never will.

The Core Principle Of URLs As Semantic Signals

Think of your URL structure as a secondary content layer – one that communicates hierarchy, topic, and specificity independently to the page title or H1, or other metadata.

A URL like /resources/seo/url-structure-ai-retrieval/ tells a retrieval system several things at once: This lives under a resources hub, it’s within an SEO category, and it covers a specific subtopic at a granular level.

That’s a useful signal. It maps to how AI systems try to understand content provenance and relevance before surfacing it in a response.

This matters especially for:

  • Long-tail and question-based queries, where AI systems are looking for precise matches to specific information needs.
  • Topical authority, where your URL hierarchy can reinforce that your domain owns a subject area.
  • Citation quality, where a descriptive URL increases the likelihood an AI agent references your content over a competitor’s near-identical page.

Practical Architecture Principles

There are a number of practical architecture principles that you should consider for both traditional search as well as AI search.

Use A Logical, Shallow Hierarchy

Deep nesting (i.e., /blog/category/subcategory/year/month/post-title/) creates noise, and your content is multiple steps away from the homepage. A structure three levels deep is almost always sufficient, i.e., domain > category > specific page. There are some CMS setups, like Shopify, where you are forced into four or five, depending on your theme (i.e., domain/blog/name-of-blog/blog-post-title/), but as long as you’re adding meaningful context and not administrative clutter, your structure will be aligned with the principle.

Make Every Segment Human-Readable And Descriptive

Avoid abbreviations, internal jargon, or ID numbers in public-facing URLs. A URL like /ai-search-optimization communicates the topic directly, whereas a URL like /aso-v2 communicates nothing without prior knowledge.

Align URL Slugs With The Actual Search Intent, Not Just The Keyword

There’s a big difference between /email-marketing and /email-marketing-best-practices-b2b. The second one signals specificity. It’s more likely to surface when an AI system is generating a response to a precise question, because the URL itself narrows the relevance scope before the content is even parsed.

Be Consistent With Category Naming Across Your Site

If your content strategy uses /guides/ for long-form education content and /blog/ for shorter commentary, maintain that consistently. It’s likely that AI retrieval systems build a model of your site structure over time. Inconsistency blurs the signal about what type of content lives where.

Avoid Keyword Stuffing In URLs

This is old SEO advice, but it also applies here. A URL crammed with keywords looks spammy to human users who see it cited in an AI response, which undermines the trust benefit you’re trying to build. One primary keyword or phrase per segment is the right call.

What Does This Look Like In Practice

If two different marketers are writing about the same topic, the URL structure could be key for RAG systems to better understand the context of the page as part of content retrieval.

An example:

Marketer A publishes /blog/2024/03/email-tips-part-4.

Marketer B publishes /resources/email-marketing/b2b-deliverability-guide.

Marketer B’s URL structure properly communicates hierarchy (resources hub), category (email marketing), and a specific focus (B2B deliverability) before a single word of body copy is processed.

Users are also more likely to benefit from this URL being cited because they can make sense of it immediately.

It can be argued that this type of clarity and specificity could compound as your URL structure and site’s information architecture can dictate the entire topical structure of your site, also helping to communicate both expertise and relevance.

The Redirect & Consolidation Problem

This is more relevant to enterprise sites that have accumulated URL debt like redirects, duplicate paths, and inconsistent slugs due to historical content management system migrations.

This could create a specific problem for AI retrieval if there are redirect chains and duplicate paths, as crawlers may not consistently land on the canonical version of a page, and different retrieval systems handle redirect resolution differently.

A practical fix will be to prioritize your website’s URLs. Audit your highest traffic and highest value pages, and confirm that their canonical URLs are clean, accessible, and structured in line with your current taxonomy.

Then work backward.

You don’t need to restructure the entire site for the chance of being cited in AI responses, but especially for your highest value pages, you should ensure that you’re offering the best possible URL signals.

What You Should Avoid Changing

It’s important not to always chase the big and shiny, so don’t completely restructure your entire site’s URL architecture just for marginal AI retrieval gains.

URL restructuring carries real SEO risk and time to recover link equity if 301 redirects are put in place – and there have been many web migration horror stories that can attest to what can happen when they’re not implemented correctly.

The goal is to apply these principles to new content and flag structural problems in existing high-value pages where the case to remediate these issues is clear and lower risk.

If your current URL structure already follows clean, descriptive, hierarchical conventions (which is all a standard part of SEO best practice), then congratulations! You’ve been optimizing for AI retrieval without even knowing.

In Summary

URL structure has always been a relatively small signal, but as AI assistants become more of a meaningful discovery channel, URL structures have the potential to be cited in more places than just Google and Bing.

They can help you to appear in AI-generated answers, they can shape citation quality, and they can contribute to how retrieval systems will categorize your content before anything else.

Simply build URLs that tell the story of your content clearly, before the user clicks on it.

More Resources:


Featured Image: Vitya_M/Shutterstock

Google Sued Over False AI Overview About Musician via @sejournal, @MattGSouthern

Canadian fiddler Ashley MacIsaac has filed a civil lawsuit against Google, alleging an AI Overview falsely identified him as a convicted sex offender. The lawsuit could test how courts treat liability for false AI-generated search summaries.

The statement of claim, filed in February with the Ontario Superior Court of Justice, seeks at least $1.5 million in damages from Google LLC. None of the claims have been tested in court.

What The Lawsuit Alleges

MacIsaac, a Juno Award-winning musician, says he learned of the false summary in December 2025 after the Sipekne’katik First Nation confronted him with it and cancelled one of his concerts. The First Nation later issued a public apology.

According to the filing, the AI Overview falsely stated MacIsaac had been convicted of sexual assault, internet luring involving a child, and assault causing bodily harm, and wrongly claimed he’d been listed on the national sex offender registry.

The lawsuit argues Google is liable for the output its AI system generated, stating that Google “knew, or ought to have known, that the AI overview was imperfect and could return information that was untrue.”

It also alleges Google didn’t admit responsibility, didn’t reach out to MacIsaac, and didn’t offer an apology or retraction.

The filing makes a direct argument about AI liability:

“If a human spokesperson made these false allegations on Google’s behalf, a significant award of punitive damages would be warranted. Google should not have lesser liability because the defamatory statements were published by software that Google created and controls.”

MacIsaac said Google must take responsibility for what AI Overviews display. “This was not a search engine just scanning through things and giving somebody else’s story,” he said.

Google’s Response

Google hasn’t commented on the lawsuit. In December, spokesperson Wendy Manton said AI Overviews are “dynamic and frequently changing” and that when the feature misinterprets web content, Google uses those cases to improve its systems. The false summary tying MacIsaac to criminal offences no longer appears.

Why This Matters

AI Overviews can appear in Google search results as AI-generated snapshots with links to more information. Google’s Search Help documentation says AI responses may include mistakes.

When those summaries display false claims about real people, the consequences can extend beyond a bad search result. In MacIsaac’s case, the lawsuit alleges the AI Overview led to a cancelled concert and reputational harm.

MacIsaac’s case isn’t the first time AI-generated content has led to defamation allegations. In 2023, an Australian mayor threatened legal action after ChatGPT falsely claimed he’d been imprisoned for bribery. The lawsuit targets Google’s AI Overviews directly and argues the product had a defective design.

The case adds to a growing legal question around AI-generated content: whether platforms are responsible when automated summaries present false claims as search results.

Looking Ahead

The case is at the statement-of-claim stage, and Google hasn’t filed a response. Until then, the core questions are unresolved: whether Google will contest liability, how it will characterize AI Overview output, and how the court will treat automated summaries in a defamation claim.

Google Says Search Is Fine, AI Insiders Say the Median Person Has No Future via @sejournal, @gregjarboe

On April 29, 2026, Sundar Pichai stood before Alphabet’s investors and delivered a masterclass in optimism. Google Cloud revenue crossed $20 billion for the first time. AI Overviews are driving Search queries to all-time highs. Gemini Enterprise paid users grew 40% quarter over quarter. “A terrific start to the year.”

One day later, Jasmine Sun published a guest essay in The New York Times called “Silicon Valley Is Bracing for a Permanent Underclass.” Her opening line: “Most people I know in the A.I. industry think the median person is screwed, and they have no idea what to do about it.”

Same industry, same week, two completely different stories. Both true.

That’s the uncomfortable reality SEO professionals, content creators, digital marketers, and entrepreneurs need to sit with. The gap between the investor deck and the off-the-record conversation has never been wider, and navigating it requires more than following the headlines.

What Pichai Is Telling Investors

Pichai’s Q1 2026 remarks were a triumph of the quantifiable. Search revenue grew 19%. AI Overviews are bringing people back to Search, not away from it. The company’s first-party AI models now process 16 billion tokens per minute, up from 10 billion last quarter. Personal Intelligence is now live in the Gemini app, AI Mode, and Gemini in Chrome. Search latency is down more than 35% over five years, and the cost of AI-powered responses dropped more than 30% since Google upgraded to Gemini 3. For anyone who has spent two years worrying that AI would hollow out organic search, the message was: calm down, Search is fine, and we’re winning everywhere.

What Sun Is Telling Everyone Else

Sun’s essay draws on conversations with engineers, venture capitalists, and economists who tend to be more candid off the record than on it. OpenAI’s Tejal Patwardhan, who leads frontier evaluations, told the Times that GDPVal now shows ‘over an 80 percent win rate compared to human professionals,’ a figure that exceeds OpenAI’s highest published benchmark result of 70.9%. The AI Productivity Index evaluates frontier models against investment banking associates, Big Law attorneys, and management consultants, not arbitrarily, but because those benchmarks signal where development energy is being aimed.

Sun also surfaces something that should concern anyone in knowledge work. She reported that “Anthropic researchers found that junior engineers who relied on A.I. coding agents not only didn’t complete tasks much faster; they also understood their work less when quizzed about it afterward”. If that dynamic extends to content creation, marketing strategy, and SEO analysis, it has practical implications for anyone whose career depends on accumulating expertise through practice. 

Why This Is Specifically an SEO Problem

The gap between what AI company executives say publicly and what their researchers say privately is a version of a problem SEO professionals already know well: the distance between what platform owners announce and what practitioners observe in the field.

Google has spent years telling advertisers that its systems reward quality and intent. SEO practitioners have spent years measuring what actually moves rankings. Sometimes those accounts align. Often, they don’t completely, and the discrepancy only resolves through direct testing.

The AI era is creating a similar dynamic at a much larger scale. Pichai tells investors that AI Overviews are driving more queries. Sun reports that recent college graduates are applying to hundreds of jobs without a single interview. Both can be simultaneously accurate. Neither tells you what to do Monday morning. 

Ground Truthing in the AI Era

The phrase “ground truthing” comes from cartography. Before you trust what a satellite image appears to show, you send someone to the actual location to verify. You gather objective, empirical data through direct observation.

That discipline is what the AI era demands from marketing professionals. Not faith in the bullish investor narrative, not paralysis in the face of the bearish cultural one, but a methodical commitment to measuring what is actually happening in your specific market with your specific tools.

What is your organic click-through rate doing as AI Overviews expand? Are conversion rates from AI-assisted search traffic different from traditional organic? If you have started using AI for content production, what is happening to time-on-page, return visits, and brand sentiment? Are junior team members building expertise or outsourcing the thinking?

These are answerable questions, and the answers will tell you far more than either a Q1 earnings call or a New York Times opinion essay.

Confident claims about what AI means for your business will keep coming. Some from people with financial incentives to sound optimistic. Some from people whose job is to surface uncomfortable truths. Your job is to test both against observable reality and update accordingly. That’s not pessimism. It’s just good measurement practice, which has always been the foundation of effective SEO.

More Resources:


Featured Image: Kateryna Onyshchuk/Shutterstock

Why AI Search Skips Your Content (And How to Diagnose Where It’s Failing) via @sejournal, @jeffrey_coyle

This post was sponsored by Siteimprove. The opinions expressed in this article are the sponsor’s own. 

Why does my content get crawled but never cited in ChatGPT or Perplexity?

How do I tell if my AI visibility problem is technical or content-quality related?

What actually decides whether AI picks my page over a competitor’s?

The gap between appearing in an AI answer and being retrieved by an AI system is where the actual AI search strategy lives.

This article breaks down that AI search strategy process:

  1. How AI search systems retrieve and select content.
  2. Why eligibility alone doesn’t win.
  3. How to diagnose whether your content is failing at the retrieval layer or the quality layer.

The fix is different for each, and most teams are solving the wrong problem.

How AI Search Crawls Your Site & What Just Changed

AI search systems still rely on crawlers. If your pages block crawl access, depend on unexecuted JavaScript rendering, or bury content behind authentication walls, nothing downstream matters.

Semantic HTML, proper heading hierarchy, and descriptive markup remain the cost of entry. But the stakes are higher now: these aren’t just accessibility compliance items anymore. They’re the structural signals AI systems use to parse and chunk your content for retrieval.

Platforms like Siteimprove.ai that audit accessibility and content quality natively can surface these issues before they become retrieval problems. If you’re already running accessibility audits, you’re closer to AI search readiness than you might think.

What has changed is what happens after the system accesses your content.

Why You’re Now Competing Paragraph-by-Paragraph, Not Page-by-Page

AI systems don’t ingest a page as a single unit. They break it into passages: discrete chunks of text that get indexed independently.

This is where most traditional SEO thinking falls short. You’re no longer competing at the page level. You’re competing at the passage level.

A 3,000-word guide might contain 15 to 20 individually indexed passages. Some of those will be clear, self-contained, and directly responsive to a query. Others will be vague transitions or filler paragraphs that contribute nothing to retrieval.

Every passage is either a retrieval candidate or a wasted one. A page can rank well in traditional search while performing poorly in AI search, because its best passages are buried inside paragraphs the system can’t cleanly extract.

How to audit passages manually:

  1. Copy one important page into a plain document. Break it into individual paragraphs or short sections, then read each passage on its own without the surrounding page context.  
  2. Ask one question per passage. For each paragraph, write the query it actually answers. If you cannot name a clear query, that passage probably is not strong retrieval material.  
  3. Rewrite weak passages to stand alone. Lead with the answer, add specific context, and remove vague transitions that only make sense when someone reads the full page from top to bottom. 

      How AI Picks Which Passages Make It Into an Answer

      When a user asks an AI system a question, the system doesn’t read the web in real time. It queries a pre-built index, retrieves the most relevant passages from potentially millions of candidates, and scores them for relevance and quality.

      But the system rarely stops at the literal query. It expands the question into a network of related sub-questions (follow-ups, edge cases, adjacent concerns) and retrieves passages for each. This is query fan-out, and it fundamentally changes what “ranking” means.

      Your content isn’t just competing against pages that target your exact keyword. It’s competing against everything the system retrieves across that entire network of related queries.

      A page that answers one narrow question well might get retrieved for that specific sub-query. But a page that anticipates the follow-ups, the “what about” variations, and the context a user would need next gets retrieved across multiple nodes in the fan-out. That’s a fundamentally different kind of competitive advantage.

      Citation happens after all of this. The system attributes its synthesized answer to the sources that contributed the most useful material. Chasing citations without understanding retrieval is working backwards.

      How to map a simulated query fan-out manually:

      1. Start with one target question. Write down the main query your audience would ask, then list the follow-up questions they would naturally ask next.  
      2. Group those questions by intent. Separate beginner questions, implementation questions, comparison questions, edge cases, and decision-making questions.  
      3. Match each question to existing content. If a question does not map to a clear passage on your site, that is a retrieval gap. If it maps to a vague or buried passage, that is a passage-quality gap. 

      Why Being Indexed Doesn’t Mean You’ll Get Cited

      Here’s where most AI visibility strategies stall.

      Teams invest heavily in technical optimization (fixing crawl issues, improving page speed, adding structured data) and assume the rest will follow. They treat retrieval readiness as the destination instead of the starting line.

      Being indexed by an AI system means your content can be retrieved. It doesn’t mean it will be.

      Consider a practical example. Two sites publish guides on international SEO for e-commerce. Site A has strong domain authority, clean technical SEO, and a 4,000-word guide that covers the topic broadly but generically. Site B is a smaller consultancy with a 1,500-word page focused specifically on hreflang implementation for Shopify stores with three or more language variants.

      When an AI system receives a query about multilingual e-commerce SEO, it fans out into sub-questions. For the specific sub-query about hreflang configuration on Shopify, Site B’s focused passage gets retrieved and cited. Site A’s guide technically covers hreflang, but its relevant passage is buried in paragraph 37 of a general overview, sandwiched between topics that dilute its signal.

      Site A is retrieval-ready. Site B is answer-worthy. That distinction is the core tension of AI search optimization, and it requires a completely different audit than most teams are running.

      How to test this manually:

      1. Run the same query across multiple AI search experiences. Use a small set of high-value questions and record which sources are cited or referenced.  
      2. Compare the cited source to your page. Do not compare the full articles. Compare the exact section or passage that appears to answer the query.  
      3. Look for the selection difference. Ask whether the cited passage is more specific, more direct, more current, or more practical than yours. That usually reveals why it won. 

      The Two Signals That Decide AI Search Passage Selection

      The hreflang example illustrates a broader pattern. Once your content clears the technical gates, competition shifts entirely to quality. And “quality” in AI retrieval means something more specific than most content strategies account for.

      Information Gain Is A Very Important Signal

      An important factor in passage selection is whether your content contributes something the system can’t assemble from other sources.

      This is information gain: original data, proprietary research, first-person case studies, or novel frameworks that don’t exist elsewhere in the index. When every other passage in the candidate pool says roughly the same thing, the passage that introduces a new data point or a genuinely different perspective has a structural advantage.

      Generic coverage that restates widely available information is the easiest content for an AI system to replace with any other source. Original expertise is the hardest. If your content strategy doesn’t have a plan for producing material that is uniquely yours, you’re filling the index with passages any competitor could displace.

      How to identify information gain manually: 

      1. Review the top competing pages for the same topic. Look for repeated claims, definitions, examples, and recommendations that appear across nearly every source.  
      2. Mark anything your page says that competitors do not. This could include proprietary data, internal benchmarks, customer examples, expert commentary, original frameworks, or lessons from implementation.  
      3. Strengthen the unique material. Move original insights higher on the page, give them clearer headings, and support them with concrete examples instead of burying them in generic explanation. 

      How Topic Depth Gets More of Your Pages Into the Candidate Pool

      Information increases the likelihood that gain gets your best passages selected. Depth and coverage determine how many passages you have in the candidate pool to begin with.

      AI systems exploring a subject pull from multiple passages across multiple pages. If your site covers a topic comprehensively, with dedicated pages for subtopics, related concepts, and adjacent questions, you create more opportunities to be retrieved across the full query fan-out.

      This works at two levels. Across your site, topic clusters with focused pages for each subtopic outperform a single pillar page surrounded by thin supporting content. Within a single page, going three layers deep on a subject (the basics, the edge cases, and the practitioner-level tradeoffs) gives the system more high-quality passages to select from.

      A domain with strong general authority but shallow coverage of a specific subject will lose passage-level retrieval to a smaller site that covers that subject exhaustively. AI systems evaluate authority at the topic level, not just the domain level.

      How to assess topic depth manually:

      1. Create a simple topic map. Put your main topic in the center, then list the subtopics, adjacent questions, use cases, objections, comparisons, and technical details a buyer or practitioner would need.  
      2. Assign each subtopic to a URL. If several important subtopics are crammed into one broad guide, they may need dedicated pages or stronger sections.  
      3. Look for thin or missing coverage. Prioritize gaps where competitors have specific, useful content and your site only has a passing mention. 

      How to Diagnose Why Your Content Isn’t Getting Cited In AI Answers

      When AI visibility underperforms, the instinct is to produce more content. That’s often the wrong move.

      The first diagnostic question is simpler: is this a retrieval problem or a quality problem? Each has different symptoms, different causes, and different fixes.

      Signs Your Content Never Reaches the AI’s Candidate Pool

      If your content isn’t appearing in AI responses at all, even for queries where you have relevant, published material, the issue is upstream. The content isn’t reaching the candidate pool.

      Audit for these signals:

      • Crawl access restrictions or rendering failures preventing indexing.
      • Missing or broken semantic structure: heading hierarchy, section markers, descriptive markup.
      • Passages that are too long, too short, or too loosely structured to be extracted cleanly.
      • Content buried inside tabs, accordions, or interactive elements that don’t render for crawlers.

      In practice, this looks like a page that performs reasonably in traditional search but generates zero AI citations. The content might be strong. The system just can’t access or parse it at the passage level.

      Retrieval failures are technical. They’re also the fastest to fix, because the content itself may already be competitive. It just needs to reach the candidate pool.

      Signs You’re in the AI Search Citation Pool but Losing to Competitors

      If your content is being retrieved but not selected, or selected less often than competitors for the same queries, the issue is downstream. The system can see your content. It’s choosing something else.

      Audit for these signals:

      • Passages that are vague, indirect, or take too long to reach the point.
      • Coverage gaps where competitors address sub-questions your content ignores.
      • Lack of original data, examples, or practitioner-level specificity.
      • Generic treatment of a topic that other sources cover with equal or greater depth.

      The telltale sign is finding competitor citations for queries your content should own. When you compare the retrieved passages side by side, the competitor’s passage answers the question more directly, with more specificity, in fewer words.

      Quality failures require content investment. They can’t be solved with technical fixes alone.

      Fix This First, Then Move to Quality

      Start with retrieval. Technical fixes are lower effort and unlock everything downstream. A page that isn’t being crawled or chunked properly can’t benefit from content improvements at any level.

      Once retrieval is confirmed, shift to passage-level quality. Identify the specific queries where competitors are winning selection, compare the actual passages head-to-head, and close the gap at the individual passage level rather than rewriting entire pages.

      The highest-ROI work sits at the intersection: passages that are already being retrieved but aren’t winning selection. They’re close. They just need to be more direct, more specific, or more useful than the alternatives.

      How to prioritize fixes manually:

      1. Create a simple two-column audit. Label each issue as either “retrieval” or “quality.” Retrieval issues include crawl blocks, broken structure, hidden content, and poor extractability. Quality issues include vague answers, missing examples, shallow coverage, and weak differentiation.  
      2. Fix retrieval blockers first. There is no point improving a passage that systems cannot access, parse, or associate with the right topic.  
      3. Then improve near-miss passages. Focus on pages that already rank, receive impressions, or cover the right topic but lose citations to more specific competitor content. 

      What to Track Instead of Citation Screenshots

      If the old metrics (mention counts, citation screenshots, brand-name tracking) don’t tell the full story, what does?

      Track retrieval presence separately from citation selection. Retrieval presence asks whether your content appears anywhere in the system’s candidate set for a given query cluster. Citation selection asks whether it was chosen for the final synthesized answer.

      A page with high retrieval presence but low citation selection has a quality problem. A page with low retrieval presence for queries it should match has a technical problem. That distinction tells you exactly where to invest.

      The challenge is that most teams piece this together across disconnected tools: one for accessibility auditing, another for content analytics, a third for search performance. By the time you’ve correlated the data, you’ve lost the thread between cause and effect.

      This is where Siteimprove’s approach matters. Because accessibility auditing, content quality scoring, and search analytics live in one platform with native analytics, you can trace a retrieval failure back to its structural cause without jumping between tools or reconciling data sets. A broken heading hierarchy flagged in an accessibility audit connects directly to the search performance data showing that page’s declining AI visibility. A content quality score on a specific page maps to its passage-level competitiveness for the queries you’re targeting.

      That closed loop between accessibility, content, and search performance is what turns the retrieval-vs-quality framework from a diagnostic concept into an operational workflow.

      How to track AI visibility manually:

      1. Build a query-tracking spreadsheet. Include the query, topic cluster, your best-matching URL, whether your brand appeared, whether you were cited, which competitors appeared, and what type of issue you suspect.  
      2. Track patterns, not one-off screenshots. AI answers can vary, so look for repeated behavior across multiple prompts, systems, and dates.  
      3. Separate visibility from selection. A page that appears in related answers but rarely gets cited likely has a quality problem. A page that never appears for relevant prompts likely has a retrieval or coverage problem. 

      What It Takes to Get AI to Pick You

      The question brands should be asking isn’t “Can AI find us?” It’s “Does AI find us useful?”

      That shift reframes content strategy entirely — from visibility tracking to retrieval mechanics, from page-level optimization to passage-level precision, and from generic authority-building to topic-specific depth.

      Three principles hold across every AI search system operating today.

      First, treat technical accessibility as non-negotiable infrastructure. It doesn’t differentiate you, but its absence disqualifies you.

      Second, build content for the query network, not the individual keyword. AI systems resolve clusters of related questions simultaneously. Your content architecture should map to that same structure.

      Third, prioritize information gain. Original research, proprietary data, and first-person expertise are the hardest assets for an AI system to source elsewhere — and a strong signal that your content deserves selection.

      The brands that win in AI search won’t be the ones that figured out how to get mentioned. They’ll be the ones whose content was too useful to leave out.


      Image Credits

      Featured Image: Image by Siteimprove. Used with permission.

      Google Engineer Explains ‘Black Box’ AI Models In Search via @sejournal, @MattGSouthern

      Nikola Todorovic, Director of Software Engineering at Google Search, appeared on an episode of Search Off the Record to discuss how AI evolved inside Google Search.

      Todorovic leads Google’s SafeSearch engineering team and has worked in the search organization for 15 years. He said machine learning was difficult to deploy broadly across Search because complex models are harder to understand and fix than simpler systems.

      He was explaining why Google could not simply apply ML systems across Search at once. Todorovic said these models can “function like a kind of a black box” because engineers don’t always understand what happens underneath.

      That makes debugging harder when search systems change over time or when a model needs to be replaced, he said.

      SafeSearch As Proving Ground

      Todorovic said SafeSearch was one of the first places where Google could deploy AI models in Search because the team could isolate those systems from the main ranking flow.

      SafeSearch could run standalone image and video classifiers that produced a signal, such as how explicit a result might be. If problems came up, engineers could iterate on the model without disrupting the rest of Search.

      Convolutional neural networks began improving image understanding about 12 years ago, he said, making SafeSearch a natural early use case for machine learning inside Search.

      AI Overviews Built On Existing Search

      Todorovic described AI Overviews as a feature that “stamps on top” of Google’s existing retrieval and ranking systems. He said the retrieval and ranking underneath AI Overviews is still what he called “the old style, the old school.”

      The process can involve fan-out queries, he said. Google may identify additional queries related to the original input, run them in parallel, and bring the retrieved results back into one response.

      AI Overviews then combine and summarize information from selected results, including source text, snippets, titles, and other page context, he said.

      AI Mode follows a similar pattern but operates with more independence, Todorovic said. He described it as still running on Search, while having a “bigger platform for its own.”

      Why This Matters

      The “black box” quote is getting attention, but the full context matters. Todorovic was explaining why machine learning was difficult to deploy broadly across Search, not saying Google lacks oversight of AI Overviews or AI Mode.

      His comments add useful context to Google’s existing AI Search documentation. Google has already said AI Overviews and AI Mode may use query fan-out, issuing multiple related searches across subtopics and data sources to develop responses.

      The useful point is not that AI is a “black box.” His comments reinforce that traditional Search systems still matter for AI Overviews, even as Google layers summarization and fan-out on top.

      That keeps traditional Search fundamentals relevant to AI features, even as Google changes how results are summarized and presented.

      Looking Ahead

      The difference between AI Overviews and AI Mode is worth watching as Google expands AI Mode. Todorovic described AI Overviews as more isolated from the rest of Search, while AI Mode has more of its own infrastructure.

      That difference may matter for how Google explains visibility, measurement, and optimization guidance as AI Mode expands.

      Your Website Is A Source, Not A Megaphone via @sejournal, @slobodanmanic

      There’s a lesson from the early days of social media that most brands eventually learned the hard way: Social media is not a megaphone.

      You couldn’t just broadcast your press releases into the feed and expect people to care. The channel had rules. It rewarded conversation, not announcements. The companies that figured this out early thrived. The rest spent years shouting into a void, wondering why nobody was engaging.

      We’re watching the same mistake happen again, just one layer deeper. This time it’s not about which platform you’re on. It’s about assuming your website is where the message lives.

      Why Most Websites Break When AI Agents Read Them

      Most websites are still built on a core assumption: Someone will arrive at your front door, navigate your carefully designed pages, and consume your message in the exact sequence and format you intended.

      That assumption is breaking.

      In 2026, your website is no longer the only interface to your content. An AI agent might summarize your service page for someone mid-conversation. A voice assistant might read your pricing aloud, stripped of all visual hierarchy. A research tool might pull three paragraphs from your blog, recontextualize them alongside a competitor’s, and present them in a comparison the user never asked you for. Someone might never visit your site and still make a decision based entirely on what your website says.

      If your message only works when it’s wrapped in your layout, your fonts, your carefully choreographed scroll, you don’t have a message. You have a brochure. And brochures don’t travel well.

      The shift that’s happening is subtle but fundamental: You need to design the message independently of the medium.

      This doesn’t mean your website stops mattering. It means your website is now one of many surfaces where your message might land. And the message has to hold up in all of them. It has to make sense when it’s read in full, when it’s summarized in three sentences, when it’s pulled apart and reassembled by something you didn’t build and don’t control.

      That changes how you write. It changes how you structure information. It changes what you think of as “the product” of your content work.

      Here’s a simple test: If there’s a single “Lorem ipsum” anywhere in your website while it’s being built, the message came second. The design came first. That order no longer works.

      A few things this means in practice:

      Your core message needs to be extractable. If an agent grabs one paragraph from your website, does that paragraph carry weight on its own, or does it collapse without the paragraphs around it?

      Your value proposition can’t hide behind design. Bold typography and hero animations don’t travel through an API. The words have to do the work.

      Structure becomes a form of portability. Clear headings, logical hierarchy, well-defined claims. These aren’t just good for traditional SEO anymore. They’re how machines parse your intent and relay it accurately.

      You need to think about your content the way a news agency thinks about a wire story. The story has to work no matter which publication picks it up, no matter how they crop it, no matter what headline they slap on it. The facts and the narrative have to be embedded in the text itself, not in the presentation layer.

      Brand Control When AI Recontextualizes At Scale

      There’s a natural resistance to this idea. “If I don’t control the experience, how do I control the brand?” But that’s the megaphone instinct talking. The desire to control exactly how every word lands, in exactly the right font, with exactly the right whitespace. That was always a bit of an illusion anyway. People skim. People read on phones in bad lighting. People copy-paste your pricing into a Slack thread with zero context.

      The difference now is that the recontextualization is happening at scale, automatically, and often before a human even sees it.

      So, the question isn’t how to prevent that. It’s how to make sure your message is strong enough to survive it.

      Websites As Canonical Sources, Not Just Destinations

      Your website still matters. But its job description has changed.

      Your website is no longer just a destination. It’s a source. It’s the canonical, structured, well-maintained origin point from which your message gets picked up, interpreted, summarized, and carried elsewhere. The better that source material is, the better it travels.

      Think of it this way: Your website used to be the store. Now, it’s also the warehouse. And the warehouse needs to be organized well enough that anyone (human or machine) can find what they need, understand what it means, and carry it somewhere else without losing the plot.

      The companies that get this right will be the ones whose message shows up clearly, no matter where the conversation is happening. The ones that don’t will keep designing beautiful megaphones, and keep wondering why the room isn’t listening.

      More Resources:


      This post was originally published on No Hacks.


      Featured Image: Pixel-Shot/Shutterstock

      500M AI Searches Later: How To Actually Improve AI Search Visibility & Citations via @sejournal, @hethr_campbell

      What signals actually drive AI search visibility?

      Are competitors getting cited in AI Overviews while you’re watching from the sidelines?

      How do you go from AI visibility gap alerts to a system that closes them?

      Most SEO teams already have dashboards showing where they’re invisible in AI search. Few have a process to fix it.

      Learn To Turn AI Search Visibility Data Into A High-Visibility System

      Reconnect with Sam Garg, Founder and CEO of Writesonic, as he shares his practical framework for diagnosing citation gaps, prioritizing the right actions, and automating execution with AI agents and free open-source SEO & GEO tools.

      You’ll Learn:

      • What drives AI citations: Visibility signal analysis from 500M+ AI conversations. You’ll learn which content types, sources, and placements actually get cited in ChatGPT, Perplexity, and Gemini.
      • GEO tasks that move the needle: Citation outreach, content refresh, and third-party placements, plus how to use AI agents and open-source tools to automate them.
      • Where AI search is headed next: Early signals on AI ecommerce and the shift from recommendations to transactions for your channel strategy.

      This SEO webinar session covers what 500M+ AI conversations reveal about how citations are earned, which actions actually move the needle (citation outreach, content refresh, third-party placements), and how to use autonomous AI agents to execute at scale.

      Watch on-demand now to get the most data-backed, actionable guidance available on improving your brand’s AI search visibility.

      ChatGPT vs. Perplexity vs. Gemini: Which LLMs Are Driving Real Conversions? [Expert Panel] via @sejournal, @hethr_campbell

      AI search is sending high-intent traffic, but not equally across platforms.

      Which LLM is actually driving conversions in your clients’ verticals?

      Should GEO efforts be concentrated on ChatGPT versus Perplexity or Gemini?

      How do you build an AI search reporting framework clients will actually trust?

      Watch the on-demand webinar now to get conversion data by LLM.

      How To Identify & Focus On The LLM That Works For You

      Not every LLM deserves equal optimization effort.

      Misallocating that effort is costing your clients rankings, leads, and revenue.

      In this on-demand GEO webinar, Natalie Ann and our expert panel for a breakdown of which platforms are driving measurable results, and how to build an AI search strategy backed by conversion data.

      You’ll Be Able To:

      • Identify which LLMs drive the highest conversion rates in your clients’ industries
      • Prioritize GEO spend and content optimization based on platform-level performance data
      • Package LLM optimization as a billable service with reporting that proves impact to clients

      Watch now, follow along below, and be ready to rethink how you’re allocating AI search effort.

      How Brands Are Increasing AI Visibility By Up To 2,000% [Webinar] via @sejournal, @hethr_campbell

      The answer is Reddit, and yes, this 90-day strategy is worth your time.

      Most brands treat Reddit as an afterthought.

      However, Reddit is where buyers finalize their purchase decisions.

      Reddit is where human trust gets built.

      Therefore, Reddit serves as a trust signal for how AI search tools determine which brands are worth recommending.

      AI Mentions & Cites Brands Based On Trust Signals, Across Channels

      When ChatGPT, Perplexity, or Google AIO recommends a brand, it’s drawing on a web of signals that indicate the brand is credible, relevant, and mentioned by real people in real contexts.

      Reddit is one of the most authentic of those signals.

      Your opportunity: not Reddit instead of other channels, but Reddit as a meaningful addition to the multi-channel trust footprint AI reasons from.

      One brand OGS Media worked with saw 2,000% AI visibility growth in 90 days after building a genuine Reddit presence. That’s the strategy Bartosz and Brent are unpacking on May 5.

      What You’ll Learn In This AI Search Webinar

      • How Reddit community content contributes to the multi-channel trust signals AI uses to evaluate and surface brands
      • The 5-stage framework behind OGS Media’s 2,000% AI visibility result
      • The 7 most common Reddit mistakes brands make
      • What authentic subreddit engagement looks like when it’s actually working
      • How to find and engage in Reddit conversations that influence both buyers and AI

      About the Speakers

      Bartosz Goralewicz is the CEO of OGS Media and one of the most experienced Reddit marketing practitioners in SEO. Brent Csutoras is a Reddit Official Advisor and the Owner of Search Engine Journal, with nearly two decades of hands-on Reddit strategy for brands across every major vertical.

      AI Search Clicks Often Go To Local Domains: Report via @sejournal, @MattGSouthern

      Aleyda Solis, founder of Orainti, analyzed 87 million AI search visits across 10 markets, finding most clicks go to local domains rather than global defaults.

      Using Similarweb data, she examined more than 57,000 domain-market entries in the ‘click-producing layer.’ This layer includes visits to a domain after users click citations or links in AI-generated answers.

      The analysis complicates the assumption that the biggest global brands automatically dominate AI search results.

      The Main Pattern

      In non-US markets, local domains with stronger signals drive the click layer. For example, Bol.com leads in Dutch ecommerce, MercadoLivre in Brazil, Bahn.de in Germany, and Lefrecce.it in Italy, ahead of global competitors like Amazon or Booking.com.

      Solis suggests this reflects who has the usable answer locally, not brand size. For instance, Lefrecce has train route data for Milan to Rome, while Booking.com does not. Thus, AI search visibility often depends on local infrastructure.

      Different Verticals, Different Rules

      In ecommerce, five domains account for 50% of clicks, with platforms like Amazon dominating. Finance is less concentrated, accounting for 17 domains, while travel is highly fragmented with 47. Finance appears concentrated, with Stripe ranking first in 7 of 10 markets, driven by demand from B2B, developers, merchants, and infrastructure, rather than consumers.

      PayPal leads in Germany and Italy. The investing sub-category accounts for 22.4% of finance AI clicks, with TradingView ranking in the top 20 across all markets. Travel discovery and booking are more dispersed. Italy’s ecommerce is concentrated, with Amazon.it capturing 46.2% of clicks; combined with Temu, over half. UK travel requires 129 domains for 50% of clicks.

      Growth Is Uneven

      The report reveals churn behind overall growth. The median monthly growth for the top 50 domains was +20% in ecommerce, +25% in finance, and +29.1% in travel. Many markets and verticals saw about 30% to 40% of top domains decline, e.g., Spain ecommerce with 21 of 49 domains and France finance with 22 of 50.

      Solis notes that weighted averages can be distorted by small-base spikes, citing domains like azulviagens.com.br and innovasport.com with large one-month jumps, suggesting investigations rather than trends. Momentum offers more insight than a static snapshot, as a losing top domain may require more focus than a steady top-50 position.

      Why This Matters

      For brands working across multiple markets, the data suggests that AI search competitors may not be the same competitors they track in traditional SEO.

      In Italian travel, the key domain for rail intent may be Lefrecce.it. In Dutch ecommerce, it may be Bol.com. In German travel, it may be Bahn.de.

      Solis recommends a straightforward audit question: who holds the operational data, structured inventory, or institutional trust that AI needs for category tasks in each market?

      Looking Ahead

      The report highlights three gaps for international brands: presence in AI-driven answers, click acquisition, and domain ownership of customer relationships.

      Solis plans to update the analysis monthly. The next pull will show whether the local-domain pattern holds.


      Featured Image: RobinRmD/Shutterstock