Google’s UCP Update: Carts, Catalogs, And Loyalty In AI Shopping via @sejournal, @slobodanmanic

Google’s Universal Commerce Protocol can now handle shopping carts, live catalog queries, and loyalty program benefits for AI agent transactions. On March 19, Google announced three new UCP capabilities and a simplified onboarding path through Merchant Center, two months after Google and Shopify unveiled UCP at the National Retail Federation conference in January 2026.

The January launch had a big coalition (Mastercard, Visa, Walmart, Target, Best Buy) but limited functionality. UCP could handle single-item checkout sessions and not much else. The March update closes the gap between UCP’s ambition and UCP’s practical capability.

I covered UCP in depth in Selling to AI: The Complete Guide to Agentic Commerce, where I compared UCP to OpenAI and Stripe’s Agentic Commerce Protocol (ACP). This article covers what changed in March and what the changes mean for retailers.

What Google Added

Cart. UCP’s new Cart capability lets AI agents add multiple items to a shopping cart from a single retailer in one operation. Until March 2026, UCP only supported single-item checkout sessions, meaning an agent buying three products from one store needed three separate transactions. The Cart capability also supports pre-purchase exploration: agents can build baskets before a shopper commits, then convert the basket to a checkout session when the shopper is ready. UCP Cart is currently published as a draft specification.

Catalog. UCP’s new Catalog capability lets agents query real-time product details directly from a retailer’s inventory, including variants, pricing, and stock levels. The difference between Catalog and existing Google Shopping product feeds: product feeds are static snapshots updated periodically, while Catalog provides live data at the moment of the query. An agent using Catalog can check whether a specific size is in stock before presenting the product to a shopper. UCP Catalog is also a draft specification.

Identity Linking. UCP’s Identity Linking capability lets shoppers connect retailer accounts to UCP-integrated platforms using OAuth 2.0. When a shopper with a Nike membership buys through Google AI Mode, Identity Linking carries over that shopper’s member pricing, discounts, and free shipping. Without Identity Linking, shopping through an AI agent means losing the loyalty benefits a shopper would get when logged into the retailer’s website directly. Identity Linking is the only capability in this update already in UCP’s stable release rather than draft.

Simplified Onboarding

Google is building a simplified UCP onboarding process directly in Merchant Center, targeting retailers who don’t have engineering teams to implement a protocol from scratch. Google says the Merchant Center UCP rollout will happen “over the coming months.”

One concrete detail: products using the native_commerce product attribute will display a checkout button in Google AI Mode and the Gemini app. For retailers already managing product feeds through Google Merchant Center, UCP onboarding should be a settings change rather than an integration project.

Platform Partners

Commerce Inc, Salesforce, and Stripe will implement UCP on their platforms, with Google describing the timeline as “in the near future.” Retailers on Commerce Inc, Salesforce, or Stripe won’t need to implement UCP directly. The platform handles the protocol layer, similar to how Shopify’s Agentic Storefronts already abstract away multi-protocol complexity for Shopify merchants.

Salesforce’s dual-protocol position is notable. Salesforce announced ACP support in October 2025. With UCP support coming too, Salesforce Commerce Cloud merchants will be able to serve both protocols from a single platform, reaching AI agents on ChatGPT (via ACP) and Google AI Mode (via UCP) without separate integrations.

Stripe occupies an even more central position. Stripe co-created ACP with OpenAI and is now implementing UCP as well. Stripe is becoming the shared payment layer across both competing agentic commerce protocols.

What This Means

UCP’s January announcement was a statement of intent. UCP’s March update is a statement of readiness. Three things stand out:

UCP is reaching feature parity with ACP. OpenAI and Stripe’s Agentic Commerce Protocol launched in September 2025 with cart management and catalog access built in from day one. UCP launched in January 2026 without either capability. Cart, Catalog, and Identity Linking close that gap, giving UCP the core primitives AI shopping agents need to handle multi-item, loyalty-aware transactions.

Google’s onboarding play targets mass adoption, not enterprise showcases. Google wants millions of Merchant Center retailers on UCP, not just the enterprise brands (Walmart, Target, Best Buy) that endorsed UCP at NRF. Merchant Center integration is how Google reaches that scale. A retailer managing Google Shopping feeds today could become UCP-enabled without writing a line of code.

Identity Linking is UCP’s clearest differentiator over ACP. Neither ACP nor any other agentic commerce protocol offers an equivalent to Identity Linking. Identity Linking solves a specific adoption barrier: shoppers lose loyalty pricing, member discounts, and free shipping when buying through an AI agent instead of logging into a retailer’s website directly. Removing that friction makes agentic commerce more attractive to both retailers protecting their loyalty programs and shoppers unwilling to give up membership benefits.

For businesses already thinking about agentic commerce, the action items remain the same: clean product data, structured markup, and being on a platform that handles protocol complexity. What changed in March is that UCP is no longer a specification to watch. Google is building UCP into the infrastructure retailers already use.

More Resources:


This post was originally published on No Hacks.


Featured Image: Inkoly/Shutterstock

The Agent Runtime Wars Have Begun. Is Your Website Ready? via @sejournal, @slobodanmanic

The agent runtime is the new browser layer, and your website is going to be evaluated against the runtime, not against any individual model.

That’s a shift web professionals have not yet made. The conversation is still framed around models. Which model writes better? Which one cites more accurately? Which one’s API is cheaper this month? The model conversation is loud because new models ship every few weeks, and every release is theatrical.

The interesting story is the one underneath it. The foundation is being rebuilt. This week made it impossible to ignore.

The Runtime Stack Shipped In April

On April 15, Cloudflare shipped Project Think, a new Agents SDK built around durable execution with crash recovery and checkpointing, sub-agents that run as isolated children, persistent sessions with tree-structured messages, and sandboxed code execution running on Dynamic Workers. Within hours of the same day, OpenAI shipped the next evolution of its Agents SDK with native sandbox execution and a model-native harness. Two of the largest infrastructure operators on the web shipped competing answers to the same question, and the question was: how does a long-running AI agent actually run in production?

Then, on April 16, Cloudflare added five more pieces. AI Platform: a vendor-agnostic inference layer that routes models for agents. AI Search: a vector index plus chunking pipeline shipped as a managed product specifically for agent retrieval, competing with Pinecone and Algolia in the agent-side RAG layer rather than with Google AI Mode. Email Service in public beta, designed so agents can use the most universal interface in the world as a channel. PlanetScale Postgres and MySQL inside Workers. And the engineering foundation for hosting very large open-source LLMs like Kimi K2.5 directly on Cloudflare’s network.

Sundar Pichai described the same shift a week earlier. On the April 7 Cheeky Pint podcast with Stripe co-founder John Collison, he called Search itself an “agent manager”: “A lot of what are just information-seeking queries will be agentic in Search. You’ll be completing tasks. You’ll have many threads running.” Many threads per query is a runtime description of Search. Google’s CEO is pointing at the same substrate Cloudflare and OpenAI shipped this week.

If OpenClaw was the agentic web for consumers (a playable demo, an interesting prototype, something to gesture at), this is the agentic web for adults. Durable. Sandboxed. Auditable. The kind of infrastructure you would actually run a business on.

The pattern across all of it is one thing: the runtime. Not the model. Not the consumer chat app. Not the keynote slide. The runtime is the layer where agents are spun up, persisted across hours and days, given filesystem access, given network access, given memory. The runtime is the layer that decides whether an agent’s session survives a crash, whether its sub-agents can be reasoned about, whether its code execution is contained.

The Wrong Question And The New One

Web professionals have spent the last 18 months asking the wrong question. The question was: Which AI model should we optimize for? ChatGPT or Claude or Gemini or Perplexity. Whose citations matter more? Whose crawler should we let through? That conversation made sense when the models read your website directly.

They don’t anymore. The model reads what the runtime hands it. The runtime fetched your page. The runtime parsed it. The runtime executed (or did not execute) your JavaScript. The runtime resolved your structured data. The runtime negotiated authentication. By the time the model sees anything from your website, it is seeing the runtime’s interpretation of it.

The new question, if you take this week seriously, is which agent runtime your website is legible to. Three things to test before next week:

  1. Do your most important endpoints return machine-readable structured responses, or do they only render correctly inside a full browser session?
  2. Is your authentication scoped so an agent acting on a user’s behalf can hold a session across multiple calls, or does it only support one-shot human logins?
  3. Does your structured data still mean the same thing if a runtime that did not execute your JavaScript tried to read it?

These are runtime-readability questions. The model has nothing to do with them. The runtime decides whether your answer is even in the model’s context window, and the model picks from whatever the runtime hands over.

The web’s plumbing is being rebuilt. Every model in the next two years will see your website through one of these runtimes, not directly. Your website’s job, starting now, is to be legible to the runtime.

The model conversation will keep happening on conference stages and in keynote slides. The runtime conversation is happening in product changelogs from infrastructure companies. The companies that ship the runtime will decide which websites get reached by AI search and AI commerce. Stop asking which model. Start asking which runtime.

More Resources:


This post was originally published on No Hacks.


Featured Image: Viktoriia_M/Shutterstock

AI Companies Are Selling Heartwarming Ads – They’re Racing To Automate Your Job via @sejournal, @gregjarboe

OpenAI wants you to know that its technology helps you figure out what to cook for dinner. Google wants you to feel the warmth of a family settling into a new home with Gemini by their side. Anthropic would like you to see its Claude as the clean, trustworthy alternative to the ad-cluttered mess everyone else is building.

These are real campaigns, and they represent a deliberate strategic choice: Make AI feel human, domestic, and useful before anyone starts asking harder questions.

The harder question, for digital marketing professionals, SEO specialists, content creators, and entrepreneurs, is this: What are these companies actually building while they’re running heartwarming commercials?

What The Ads Are Saying

OpenAI’s consumer messaging has settled into a register of casual everyday utility. The “Dish” and “Pull Up” ads show ordinary people getting help with dinner or fitness routines, not productivity gains or enterprise automation. Google’s Gemini advertising has leaned into family milestones and emotional resonance, positioning the model as a companion for life’s significant moments. Anthropic, meanwhile, has run campaigns that explicitly mock sponsored responses in competitor products, casting Claude as the principled choice for users who don’t want their AI assistant quietly selling them something.

Each narrative is coherent, well-produced, and aimed squarely at building consumer trust. That trust, of course, is the infrastructure on which the enterprise business gets built.

What The Products Are Actually Doing

Behind the domestic warmth, all three companies are racing to deploy agentic systems capable of automating complex, multi-step professional workflows. However, this means marketing professionals will no longer be defined by their ability to perform individual tasks but by their capacity to design and manage autonomous systems that handle those tasks with minimal human supervision.

That’s a significant reframe. GPT-5.5 is being positioned as a project manager that can build entire lead funnels, including strategy, copy, and email deployment, without reprompting. Gemini 3.1 Pro’s one-million-token context window is designed for deep research at a scale that, as the roadmap puts it, “humans cannot replicate.” Claude Opus 4.7 is being marketed to enterprise clients for legal redlining, production-grade code review, and high-fidelity visual verification – work that currently employs specialists.

OpenAI has published a benchmark called GDPval that measures model performance across 44 occupations, from real estate broker to news analyst. Its latest model, GPT-5.5, scores 84.9%, a win-or-tie rate against human professionals on tested tasks. That’s not a consumer product metric. That’s a displacement metric dressed up in benchmark language.

Why This Is An SEO-Specific Problem

The traditional SEO model – research keywords, produce content, earn rankings, and drive clicks – is being restructured by the same companies that are running those warm family ads. Google’s AI Overviews, which Sundar Pichai confirmed are driving Search revenue growth of 19% in Q1 2026, are changing the click economy in ways the advertising doesn’t acknowledge. Users are getting answers without visiting pages. Brands are competing not for rankings but for citations within AI-generated summaries, a discipline some practitioners are now calling generative engine optimization (GEO).

The implication for content marketers is that volume strategies built on human-speed production are losing their edge precisely as AI tools make high-volume production cheaper and faster for everyone. The competitive advantage is shifting toward authority, entity recognition, and the kind of structural content quality that AI systems can parse and attribute. The people who figured out technical SEO before their competitors did will recognize this dynamic.

The Tension Worth Watching

There is a genuine contradiction at the center of all three companies’ public positioning. They are simultaneously telling consumers that AI is a helpful companion and telling investors that AI is automating professional-grade cognitive work at scale. Both things are true, and the gap between those narratives is where marketing professionals need to be paying attention.

Anthropic’s own researchers published findings showing that junior engineers who relied heavily on AI coding agents not only failed to complete tasks significantly faster – they also demonstrated weaker understanding of their work when tested afterward. If that extends to content strategy and SEO analysis, the profession faces a skills erosion problem that no “AI as partner” messaging addresses.

The companies building these tools have financial incentives to keep the consumer narrative warm and the enterprise narrative bullish. Your incentive is different: Measure what is actually happening to your traffic, your conversion rates, your citation share of voice, and your team’s capability development, and make decisions based on that data rather than the ads.

The dream they’re selling is appealing. Ground truth it anyway.

More Resources:


Featured Image: Frame Stock Footage/Shutterstock

“SEO Expert” Became “AI Search Expert” (Gulp.): How To Control AI Answer Accuracy via @sejournal, @lorenbaker

About a year ago, your job description changed without your permission. This on-demand session helps you catch up and take the lead.

Suddenly need to track both SERP rankings and AI accuracy? How do you make sure AI is saying the right things about your brand? What’s the best way to transition from SEO to AI Search Expert?

👆 Follow our lead. Register above, get 3 strategies & become the AI search expert your org needs.

You aren’t just fighting for clicks anymore; you’re fighting to ensure that when an AI model speaks for your brand, it actually mentions you, and gets the information right.

Accurate AI Answers = Your SEO Expertise + 3 Strategies

Good news: your SEO expertise isn’t a relic, it’s the exact foundation that works in AI search. In this session, seoClarity’s Chris Sachs, VP of Client Success, and Tania German, VP of Marketing, shared a roadmap that positions your brand as the definitive answer in AI search results.

You’ll Walk Away With:

  • The Orchestrator’s Playbook — How to lead the cross-functional teams (PR, product, content) that determine what AI models learn to trust about your brand.
  • Answer Certainty Metrics — How to move beyond “visibility” reporting and prove your brand is the definitive solution AI delivers to users.
  • Narrative Reclamation — How to stop third parties from defining your brand in AI outputs and position your organization as the primary Source of Truth.

Register above to watch the full session and get proven strategies to help you control AI outputs, lead the teams that make it happen, and prove it’s working to leadership and beyond.

Google’s Mueller: Vibe Coding Won’t Handle Your SEO For You via @sejournal, @MattGSouthern

Google Search Relations team members John Mueller and Martin Splitt discussed vibe coding websites on a recent episode of Search Off The Record.

Both found that AI coding tools could produce functional sites quickly. But getting SEO right still required specific technical direction, the same kind you’d give a human developer.

Telling AI To ‘Add Some SEO’

Mueller compared the experience of vibe coding to working with a developer who doesn’t specialize in search.

Mueller said on the podcast:

“You can always tell the AI system, now add some SEO to it. But how that works out is if you go to a developer and add some SEO and it’s like, what do you mean. Sprinkle some meta tags and add some structured data.”

Vague instructions produce vague results, whether the builder is human or AI. Mueller said he got better outcomes by telling the system what he wanted from the start. That included the domain name, canonical setup, sitemap files, and a robots.txt.

He checked whether the pages used reasonable HTML and linked properly. He also set up pre-publish checks to verify that URLs returned content and that JavaScript files weren’t blocked by robots.txt.

What They Built

Mueller has been building test websites to see how Googlebot handles requests. He deployed them to Firebase hosting using Hugo as a static site generator, with GitHub for version control.

He recently switched from VS Code with Copilot to command line tools. He named Claude Code and Gemini CLI as what he currently uses.

Splitt tried Google AI Studio to build a client-side tool with JavaScript. He described the output as readable, looking like a standard Next.js application. But he hit a loop where the AI kept using a library he didn’t want.

Splitt said:

“I asked it for a half an hour. I tried to make it not do what it wanted to do, and want to do what I wanted to do. And that was weird.”

The Technical Knowledge Question

Both acknowledged the tension in vibe coding’s promise that you don’t need to know how to code.

Mueller noted that technical understanding helps at every stage. Knowing what kind of site generator you want and how to structure pre-publish checks produced better results. Without that background, the AI will make assumptions. It might choose a static site generator, a JavaScript-heavy setup, or a full CMS with a database backend.

Mueller said:

“All of these are reasonable assumptions where if you talk to a developer they will also make these assumptions. But if you just tell the AI system like I want a website, then it will pick one.”

For personal projects and low-risk static sites, the stakes are low enough to experiment. But for anything involving user data or a production service, Mueller added that you’d want someone who understands what they’re doing.

Vibe-Coded Sites & Search Visibility

The sites Mueller built produced reasonable HTML that wouldn’t stand out as vibe-coded.

“Practically speaking, nobody can really recognize as being like, this is a vibe coded website,” he said, adding that common vibe coding frameworks can leave recognizable patterns.

He also pointed to a related risk with content. Once a site looks polished, it’s tempting to have the AI write the content too. Mueller acknowledged the tool can do that but said it’s not where he sees the most value.

Splitt agreed. AI-written content raises the question of why someone would visit a site instead of talking to the AI directly.

Mueller has flagged similar gaps in vibe-coded sites before. He reviewed a vibe-coded Bento Grid Generator on Reddit. He identified issues with crawlability, obsolete meta tags, and content stored in JavaScript files that search engines couldn’t access.

Looking Ahead

The podcast didn’t include formal guidance or policy positions on vibe-coded sites. Mueller and Splitt were sharing what they’ve tried and what they’ve run into.

For people testing these tools, the message is that AI can handle parts of code generation well, especially for lower-risk projects. It doesn’t make SEO decisions on its own. Those still require someone who knows what to ask for.


Featured Image: YouTube.com/GoogleSearchCentral, May 2026. 

7 Common AI Website Mistakes That Are Easy To Avoid via @sejournal, @martinibuster

Y Combinator general partner Aaron Epstein was joined by Raphael Schaad, founder of Cron that was sold to Notion, to discuss common mistakes made with AI designed websites. They identified seven common mistakes vibe coders made with their websites that should be avoided.

Positive And Negatives

The podcast started out by acknowledging that being able to vibe code a website is a positive thing that doesn’t have to turn out poorly just because they’re not a designer. Then they started visiting vibe coded websites and encountering multiple issues that fit into the following seven categories.

1. Generic Design Trends

The first problem they highlighted is the trend of letting the AI decide the look and feel. A recent discussion on Twitter called attention to people who turn to AI and ask, “give me something modern” and what they end up getting is something generic. And that shouldn’t be surprising because if you leave the choices to an LLM you will 100% get the most common design choices.

The design may look modern in isolation, but it loses brand value because it lacks uniqueness, it feels familiar, generic, and unoriginal. One of the examples is a layout grid that resembles a bento-box (a neatly packed Japanese lunch), which they said looked fine but is also non-original.

Another example was the generic software dashboard, with the point being the generic aspect of it. This kind of error can slip in at any point, where something looks professional but is generic.

Aaron Epstein commented:

“Go back to the engineers tab here.

Now, if this is their product, one of the other things that stands out to me is this kind of dashboard that’s got, you know, it’s got like the red, green, blue, purple kind of callouts up here.

That’s one of the hallmark classic things that a lot of AI design tools will actually create.”

Screenshot Of Generic Software Dashboard

Raphael responded:

“Yeah, every fake dashboard looks basically like something like that.”

Aaron Epstein:

They’ve got the icons that are a darker version of the lighter background color. It’s usually like the Google colors, you know, it’s like red, yellow, green, blue.

Raphael Schaad:

“The Fisher-Price primary…”

Aaron Epstein:

“So that… we tend to see a lot.”

They cited five examples of generic design trends:

  1. Overusing purple gradients
  2. Repeating generic AI design patterns
  3. Using bento-box layouts without originality
  4. Generic visual elements like the example of the software dashboard
  5. Relying on standard icons or emoji-like elements

All of those design trends that LLMs lean on end up creating a visual experience that looks other AI-built sites.

Raphael explained:

“This all kind of started when I had like a late night thought and tweeted that I see a lot of dumb hover effects on landing pages of startups these days, presumably vibe coded. And so I was kind of curious to peel the layer back there.

It’s like, how did these, like what I thought were dumb effects, how did they make it into LLMs and why are they everywhere now?

A couple other trends that we then was kind of like purple gradients. All of a sudden, all startup websites had purple gradients everywhere. Or these sections that kind of like fade as you go in, as you scroll, and they fade in and fade out.”

Aaron noted that all of those design trends are not inherently bad. What makes them bad now is that LLMs are making them common, thereby draining them of any originality they used to have.

Raphael agreed with Aaron’s assessment, explaining how this happens:

“And one of the key things was when there was a good website kind of establishing a trend, it took a while in the old world for others to kind of like copy these trends.

But now with LLMs, if there’s a good website with a purple gradient, it makes it into the LLM because the LLM gets trained on like the good examples that get linked to a lot. And then all of a sudden, like the next week, all the startup websites look the same.”

2. Unexpected User Interaction Feedback

User interaction feedback is important because it eliminates uncertainty by acknowledging that a click did something. User interaction feedback signals that something is clickable. All of those things are a part of a design language that site visitors expect to see.

Unexpected interaction feedback is a poor user experience because it breaks the pattern that a user expects when they visit a website. It’s like walking through a lobby and bumping into a glass wall in the middle of the room. It’s not supposed to be there and is distracting and disorienting, a poor user experience.

The podcast recommended avoiding these seven interactions:

  1. Lines following the user down the page
  2. Cursor-following lights
  3. Superfluous background animation effects
  4. Automatic fade-ins
  5. Moving buttons or shifting UI elements
  6. Hover effects that move elements without a clear purpose
  7. Animations that draw more attention than the product

3. Broken or Confusing UX Patterns

These are mistakes where the page becomes harder to use because the interaction model is unclear.

  • Scroll jacking
  • Non-standard navigation
  • Menus that jump or behave inconsistently
  • Clickable-looking elements that do not behave clearly
  • Buttons that move or auto-advance
  • Hover-only interactions
  • Hidden functionality behind hover
  • Duplicate or awkward sticky header behavior

Scroll hijacking was one the most common issues they encountered, stopping four times to comment on yet another site that was hijacking the browser scrolling.

At one point, Raphael commented:

“But it still feels like going through molasses… Like hijacking the …actual native browser scrolling to do some fancy thing with JavaScript to actually have the hooks to do all these animations.”

Another instance of scroll hijacking was the by-product of an animation that was loading and preventing the user from progressing.

Aaron Epstein commented as he scrolled down a page:

“What happens when we go further down?

…Valued and trusted by, okay, we’ve got a bunch of lines going everywhere. All right, so we’ve got that line following you down the page pattern again.”

Screenshot: “And now we’re scroll-jacked”

At this point the page stopped responding because of all the animations going on and Raphael said:

And now we’re scroll-jacked.

We’re locked into this position here of the website in order to build up this animation.

And I wonder what it wants to tell me, like, why is it important to capture me here on this point to build out this animation, where is it just like showing it already in the build-out state?”

Aaron noted that the animation and the scroll jacking is distracting him from what the page is trying to communicate.

He observed:

“And I find the animation is getting all of my attention, rather than what it says all the way over here on the left side. So I’m not even noticing any of this.

And this is not, on the right side, it’s not giving me enough visual information to communicate something valuable about what they do or how the product works.

So I just kind of miss everything over here on the side.

The animation is too distracting.”

The core problem here is that the site stops behaving like users expect. That creates friction, confusion, and sometimes mistrust, but certainly confusion, which is the opposite of what a website should be doing: offering clarity and communicating.

4. Weak Messaging and Product Explanation

These are mistakes where the design looks impressive, but the visitor still does not understand the product.

  • Unclear value proposition
  • Missing or vague explanation of what the product does
  • Not making clear who the product is for
  • Not explaining why the audience should care
  • Too little useful information above the fold
  • Product demos or examples without enough context

I see this kind of thing a lot with B2B type sites where you read the content and nothing on the page connects with explaining what the product or service is, much less communicating why I should care. In the past it was human slop written by someone who is more concerned with sounding techie and advanced. But nowadays it’s AI slop where content lacks purpose and is prone to using ambiguous words that have more than one meaning or words that are basically just lazy because they don’t do any work, don’t accomplish anything, fail to move the ball down the field.

A landing page is a customer acquisition channel. If visitors cannot quickly understand the product and its value, the design has failed.

5. Poor Information Hierarchy And Structure

These are mistakes where the page has too many competing visual or textual elements. The key thing here is visual or text elements that are competing for the site visitor’s attention.

  • Too many text styles
  • Extra labels that do not add meaning
  • Weak hierarchy between logo, headline, subhead, and supporting text
  • Sections that feel visually overbuilt
  • Decorative elements that add vertical space without improving clarity

AI can add structure that looks designed, but the structure may not help the reader process the page. Always be aware that AI tends to crank out content elements that look like their busy doing work but aren’t doing any work at all. And when I say work, I mean doing something purposeful, for a reason. Every word and visual element should do some work, accomplish something. This is something to be aware of when designing with AI.

6. Inconsistent Brand and Visual System

These are mistakes where the site lacks a unified identity. The site may contain attractive image assets, but they do not feel like they are a part of one coherent brand or visual style. These are hallmarks of an AI being prompted to do something modern or trendy or stylish but without having an established visual language or system in place.

  • Inconsistent visual language across sections
  • Colors that do not feel coordinated with the logo or brand
  • Product visuals that do not match the landing page style
  • Sections that look like they were generated separately
  • Brand choices that feel inherited from AI defaults rather than intentional

7. Lack Of Experienced-Based Judgment and Over-Reliance on AI

This is the underlying issue behind each category of issue with poorly vibe-coded websites. AI lets anyone design a site and create image and text assets. But it needs firm direction by someone with experience and expertise. The quality of the output is entirely dependent on the quality of the prompt, what was input.

The problem isn’t that AI makes AI slop. A lack of experience, expertise is what leads to the slop.

Here are the issues that lead to poorly designed vibe-coded websites:

  • Accepting all AI changes
  • Outsourcing taste to the LLM
  • Letting AI decide the brand direction
  • Starting from AI output instead of brand strategy
  • Spending saved time on more effects instead of clearer thinking
  • Forgetting that the human is now the editor

The insight and takeaway from reviewing poorly vibe-coded websites is that AI removes technical friction but it doesn’t replace judgment that comes from experience and expertise. The person vibe coding a website still has to decide what best serves the site visitor and the business goal.

Watch the podcast: Common Mistakes With Vibe Coded Websites

Featured Image by Shutterstock/Cast Of Thousands

The ROI Problem With AI Traffic Nobody Is Measuring Correctly via @sejournal, @DuaneForrester

Search engines were designed to do several things at once: Rank a field of options, route the user to one of them, and keep the human inside the decision so the engine never owned the choice. That last part was not an accident. It was the liability architecture. Large language models were built without any of it. They were built to answer the question directly, which is a different job entirely, and the design choices that follow from it change what visibility looks like, what risk looks like, and what the word ROI can honestly mean when the thing sending you traffic was never built to send traffic in the first place.

Two Systems, Two Jobs

A search engine’s job description is long. It crawls the web, indexes it, ranks a pool of candidate results against a query, presents them as a ranked list, and then waits for the human to make a click decision. The SERP itself has been drifting toward retention for years now, with galleries, rich snippets, answer boxes, local maps, video carousels, and AI Overviews all layering in features that keep the user on the page longer and route fewer of them to third-party sites. But the underlying contract was always the same. The engine offers options. The user selects one. The user owns the choice.

An LLM does not offer options. It produces an answer. The citation, when it appears, is not functioning as a routing instrument. It is closer to a grounding artifact produced by a retrieval pipeline, or in some framings, a confidence hedge, or both at the same time. Whichever read you prefer, none of them describe a system designed to send traffic somewhere else. The system was designed to resolve the question in place.

That distinction sits beneath every metric conversation in this space. When practitioners ask what the LLM referral rate is, what the attributed traffic number looks like, what the click-through from an AI answer is, they are asking questions that assume a routing mechanism that is not actually part of the architecture. Whatever traffic does come through is a byproduct, not a design goal, and confusing the two is the first mistake in almost every conversation about AI visibility ROI.

The Liability Surface Moved

The human in the click decision was the SERP’s shield. If the link the user selected led somewhere harmful, misleading, or defamatory, the engine could point to the list of options and the user’s own agency in choosing one. The engine had not published the claim. It had surfaced 10 candidate sources, the user had chosen one, and whatever happened next was not the engine’s editorial output. That is not a small feature. That is the reason Section 230 protections were structured the way they were, and why algorithmic ranking has traditionally been treated as something other than direct speech.

LLMs have no equivalent shield to stand behind. The system is producing the answer directly, in its own voice, without a field of options or a user-selected source. The liability surface that the SERP was designed to offload sits with the model producing the output, and the cases that have already moved through courts are starting to sketch the edges of that surface.

Walters v. OpenAI was dismissed on summary judgment in May 2025, and the decision leaned heavily on OpenAI’s disclaimers and a sophisticated reader who reasonably knew the chatbot could hallucinate. That reading protects general-purpose consumer chatbots in a very specific kind of case. It does not protect every product that uses a language model. In a separate matter, Air Canada was held liable for its customer service chatbot’s false statements about its own bereavement fare policy, because a customer could reasonably rely on an airline’s branded support agent for accurate information about that airline’s policies. Reasonable reliance is the key legal term, and the more specialized and authoritative the chatbot appears, the harder the disclaimer defense becomes to run.

The active litigation is still mapping the frontier. OpenAI is currently facing multiple lawsuits tied to allegations that ChatGPT drove users toward suicide or harmful delusions, several involving minors. The New York Times copyright case against OpenAI was allowed to proceed by a federal judge in March 2025, and Anthropic settled with book authors in August 2025 for a reported sum well into the billions. European GDPR complaints continue to move through Noyb. Battle v. Microsoft is still live. None of these outcomes are settled, and some will be dismissed on the same disclaimer grounds that resolved Walters. The point is not that LLM operators will lose every case. The point is that the liability surface now sits with the system producing the output, whether the individual plaintiff wins or loses, and every brand building against an LLM inherits some version of that surface when it uses the system’s output in its own customer-facing work.

The Denominator Problem

The most common argument against investing in AI visibility work sounds decisive until you look closely at what it is measuring. The argument runs roughly: ChatGPT and the others send a tiny sliver of referral traffic, somewhere in the low single digits of total inbound, so why reallocate budget toward a channel that barely moves the needle? Conductor’s research pegs the combined AI referral share at about 1% of publisher traffic. That number is real. At first read, it seems to close the ROI question cleanly.

It closes nothing. The problem is the denominator.

While the AI share of traffic holds roughly steady, the absolute volume of search-driven traffic has collapsed across most publisher categories. Similarweb data shows organic traffic to news publishers fell from about 2.3 billion monthly visits in mid-2024 to under 1.7 billion by May 2025, a loss of more than 600 million visits in under a year. Business Insider’s search traffic dropped 55% between April 2022 and April 2025, HuffPost lost roughly half of its search referrals, and The New York Times saw search’s share of its desktop and mobile traffic slide from 44% to 37%. Zero-click searches climbed from 56% to 69% between May 2024 and May 2025 as AI Overviews expanded across the SERP. A Reuters Institute survey of 280 media leaders in late 2025 found they expect another 43% decline on average over the next three years.

Read against that backdrop, a stable percentage share of a shrinking pie is not stable. It is a loss. The skeptics who point at the 1% number are measuring relative share of a traffic base that is contracting underneath them, and they are treating a falling absolute as if it were a steady state. The real question is not whether LLMs are sending meaningful traffic yet. The real question is whether the channel that used to send meaningful traffic is still doing what it used to do, and the answer is visibly no. The denominator is moving, and any ROI calculation anchored to the old denominator is a calculation of the previous environment, not the current one.

What The Billions Say

If the design-intent and liability and denominator arguments still leave room for doubt, the last place to look is revealed preference. What are the companies with the most complete internal data on user behavior actually doing with their capital?

The answer is unambiguous. The five largest U.S. cloud and AI infrastructure providers have committed between 660 and 690 billion dollars in 2026 capital expenditure, nearly doubling 2025 levels. Alphabet alone is guiding to between 175 and 185 billion for 2026, more than doubling its 2025 spend of 91 billion. Microsoft, Amazon, Meta, and Oracle are all running similarly aggressive curves. The number that matters most, and that defuses the usual counter-argument, comes from Bank of America credit strategists who estimate AI capex will reach 94% of operating cash flows in 2025 and 2026, up from 76% in 2024.

That is not the shape of a defensive hedge. A hedge is a fraction of the cash flow, deployed to avoid being caught flat-footed if a competitor’s bet pays off. Companies do not put 94% of operating cash flow into a category for two consecutive years unless the leadership genuinely believes the category is the business. And those leadership teams have access to data that the rest of us do not. They can see inside their own products, their own user behavior shifts, their own cohort analyses, their own enterprise pipeline conversations. They are legally bound to deploy shareholder capital in a way that reflects what they actually see, and what they are deploying it toward is the architecture that produces direct answers rather than ranked lists of options. To believe search-as-we-knew-it remains the gold standard, you have to believe that dozens of CEOs, boards, and senior leadership teams with decades of internal-only data are reading their own numbers wrong, while an external industry with none of that data is reading the market correctly. That does not pencil.

The human-behavior side of the equation makes the same point in a different register. Every labor-saving technology that has ever been introduced has reshaped the status quo faster than its skeptics predicted, because cognitive efficiency is not a preference. It is a survival behavior, wired in through long periods when calories were scarce, and shortcuts mattered. When a new tool appears that makes some task meaningfully easier, adoption is not a matter of whether. It is a matter of how fast and along what curve. ChatGPT is now at roughly 900 million weekly active users, up from 200 million 18 months earlier, and the full category is past a billion active users across platforms. The behavior has already shifted. The money has already shifted. The only thing that has not fully shifted is the measurement frame most practitioners are still using to evaluate the channel.

Which brings the question back to the one that is actually worth asking. What do you do if there is no ROI by the old definition, and you still cannot ignore the channel? The honest answer is that brands will need to invest in visibility work whose return is not expressed in clicks or referral traffic, because clicks and referral traffic are artifacts of the previous design. Being the cited source, the grounded source, the trusted source inside the answer is a different kind of visibility, and it will need a different kind of measurement. The teams that figure that out first will not be doing it because they found an ROI case that convinced their CFO. They will be doing it because they looked at the capex curves, the behavioral curves, and the liability curves, and concluded that the channel is the future, regardless of whether the spreadsheet knows how to score it yet.

If this lands somewhere real in your work, or if it reads wrong from where you are sitting, I would like to hear about it. The shift happening right now is too large for any one practitioner’s vantage point, and the best signal I get comes from the conversations that start after the article ends.

More Resources:


This post was originally published on Duane Forrester Decodes.


Featured Image: Krot_Studio/Shutterstock; Paulo Bobita/Search Engine Journal

Bing Team Describes How Grounding Differs From Search Indexing via @sejournal, @MattGSouthern

Microsoft’s Bing team published a framework describing how indexing requirements change when the goal is to ground AI answers rather than to rank search results.

The post identifies five measurement areas where the company says the two systems diverge. It also names “abstention” as a design choice for AI-powered retrieval.

What Microsoft Described

The post argues that traditional search indexing and grounding indexing share the same foundation but serve different goals.

Traditional search, the team writes, asks “which pages should a user visit?” The grounding layer asks “what information can an AI system responsibly use to construct a response?”

Microsoft identifies five categories where the measurement requirements differ.

On factual fidelity, the team notes that some ranking mismatch is tolerable in traditional search because a user can click through and evaluate. In grounding, the post describes breaking content into retrievable chunks as a process that “can distort page substance in ways that never appear in any ranking signal.”

For source attribution quality, the Bing team calls attribution helpful in traditional search but “a core signal” in grounding. Not all indexed content matters equally as evidence for an AI answer, the team adds.

On freshness, Microsoft notes a clear difference in cost. Stale content in search is a ranking problem. In grounding, the post says, “a stale fact produces a misleading response.”

For coverage of high-value facts, the post explains that a missed document in search is recoverable because alternative results exist. In grounding, the index must ensure “the specific facts and sources that people are likely to ask about are actually available and groundable.”

On contradictions, traditional search can surface one source above another and let the user decide. A grounding system can’t do that. “An AI system that silently arbitrates between contradictory sources is one that may confidently assert the wrong thing,” the team says.

Abstention And Iterative Retrieval

The post also covers two design differences between the systems.

Microsoft calls declining to answer “abstention.” For a grounding system, that’s a valid outcome when support is missing, stale, or conflicting. Traditional search doesn’t need to make this judgment because it presents options for a human to evaluate.

Iterative retrieval is the other difference. Traditional search is typically a single interaction where a query goes in and ranked results come out. Grounding systems may need to ask follow-up questions, refine retrieval based on intermediate results, and combine evidence from multiple sources.

Errors in early retrieval steps “compound through subsequent reasoning steps in ways that no human reviewer would catch in real time,” the post adds.

Context

This blog post comes after a series of moves by Microsoft to build out its grounding tooling and give publishers visibility into it.

In February, Microsoft launched the AI Performance dashboard in Bing Webmaster Tools, giving sites their first page-level citation data for AI-generated answers. The company rewrote the Bing Webmaster Guidelines in March to include GEO as a named optimization category and added grounding query-to-page mapping to the dashboard the same month. At SEO Week in April, Madhavan previewed four additional features for the dashboard, including Citation Share and grounding query intent labels.

This post is more conceptual than those prior announcements. It doesn’t introduce new tools or features. Instead, it lays out the engineering principles the company describes as guiding its index evolution.

Why This Matters

This framework clarifies what Microsoft says its systems need from the index for AI answers.

Microsoft states grounding relies on the same crawling, quality, and web understanding as search, but grounded answers require accurate, fresh, attributable, and consistent evidence. Stale facts, weak sources, and contradictions pose risks when content is used for answers.

Looking Ahead

The post offers insight into why some content is easier for AI to cite. If the Citation Share and intent-label features previewed at SEO Week ship, they could help test whether the measurement priorities described here show up in actual publisher data.


Featured Image: TY Lim/Shutterstock

The Whole Point Was The Mess via @sejournal, @pedrodias

Semrush put out an infographic last week. The kind built to be screenshotted into LinkedIn carousels and pasted into webinar decks. Four pillars. The fourth one is called “Technical GEO”: schema, structured data, clean architecture. The line that justifies it: “Ensures AI engines can parse and connect your content.”

Ensures.

See it live on X/Twitter. Image Credit: Pedro Dias

That is the entire piece in one word. The architecture of large language models is, by design, the opposite of ensured. And schema has nothing to do with whether an LLM can parse text. LLMs parse text by reading text.

Semrush is far from alone. Every SaaS vendor with skin in this game is running variations of the same play. SEO-era controllability, repackaged under a new acronym. The same percentages, pillars, and pyramids. All dressed for a system that was built specifically not to work this way.

I have made the strategic version of this case before, in “Your AI Strategy Isn’t a Strategy.” This piece is the technical floor underneath it.

Built To Read Whatever’s There

Language models exist because the web is a mess. Forums, Wikipedia stubs, blog posts written at 2 A.M., scraped product copy, machine-translated junk, code comments, half-formed sentences, typos, contradictions, every register from journal article to subreddit shitpost. Pre-training data is the public web, and the public web has never been structured.

The transformer architecture handles this by treating language as sequences of tokens. There is no parser inside the model looking for tags. There is no preference for FAQ markup. The model reads the words. That is the mechanism.

At inference time, the model generates more tokens conditioned on the input. None of that pipeline is reading microdata.

Schema.org has real jobs. It feeds rich results in classical search. It supports entity disambiguation in the knowledge graph. It helps voice assistants pull structured fields. These are well-defined functions inside specific systems. They are not the mechanism by which an LLM understands a sentence.

So when a vendor claims structured data “ensures AI engines can parse and connect your content,” there is nothing to ensure. The parsing layer they are imagining is not there. The model already parsed your sentence. It did so by reading the sentence.

One Trick, Three Brand Colors

Look at the biggest GEO and AEO explainers in the market right now, and you find the same SEO-era playbook with the acronym swapped.

Semrush is already covered. The fourth pillar of its “Technical GEO” presents schema and structured data as ensuring something that the architecture cannot ensure.

AirOps published a graphic titled “15 Ways to Get Cited by ChatGPT, Perplexity, & Google.” It is the most numbers-heavy specimen of the genre I have seen this year. Schema markup increases citation likelihood by 13%. Sequential H2 to H4 tags double your chances. Short paragraphs make content 49% more likely to appear in AI answers. Perplexity cites UGC in 91% of answers, versus Gemini’s 7. Read the source notes and the methodology trail comes home. The numbers in the graphic trace back to AirOps’s own “2026 State of AI Search Report.” AirOps is citing AirOps on the question of whether AirOps’s prescriptions work.

Peec AI does a more honest job in places. Its complete guide to GEO acknowledges the probabilistic nature of the system and concedes that foundation models are already trained, so optimization focuses on the retrieval layer. Then it lands the same prescriptions: heading hierarchy, bullet lists, FAQ markup, multiple schema types layered on each page, summaries at the top of sections – all built on the chunking claim that long paragraphs lose out because the engine extracts fragments rather than full articles.

Profound, citing Aleyda Solis’s checklist, is the most explicit in its piece: “Optimize for Chunk-Level Retrieval.” Each section, a standalone snippet. Each page, a buffet from which the engine takes what it wants. The engine, in this telling, is a polite guest who only takes what’s been laid out.

Three vendors. Same operating assumption: a controllable, prescriptive technical discipline sits between a publisher and a citation, and it occupies roughly the same shape as classical SEO. Schema, headings, structure, freshness, machine-readable formats. Familiar. Billable. Reportable up to a chief marketing officer.

What Schema Actually Does

Schema is not the target here. Schema has real, well-defined uses. Classical Google search uses it for rich results: prices, ratings, event times, the structured fields that drive search engine results page features. The knowledge graph uses it for entity disambiguation. Voice assistants pull structured fields out of it.

None of that goes away. If you’re responsible for technical SEO, keep implementing schema where it earns its keep.

Schema cannot reach into a transformer and improve its comprehension of your prose. The model isn’t architected to read schema as schema. It receives whatever text the engine fetched and chose to include, and processes that text as language tokens. The entire GEO/AEO marketing layer rests on conflating two distinct claims: that schema is useful in classical search, and that schema feeds the LLM. The first is true. The second is a category error.

Chunking Is Not Yours To Optimize

Image Credit: Pedro Dias

The chunking advice keeps reappearing because it sounds technical, sits neatly inside a flowchart, and gives a content team something concrete to do on Monday morning. It is also incoherent.

Chunking happens at retrieval time. Perplexity, ChatGPT, and Gemini each run a retriever over candidate documents, split them according to their own configurations (length, overlap, embedding model, sometimes semantic boundaries), and feed the top-k chunks into the model’s context. Those configurations belong to the engine. They get tuned differently across systems and retuned on schedules no publisher is privy to. The publisher’s view of the chunker is the publisher’s view of the model: black box, results only.

So when a vendor says “optimize for chunk-level retrieval,” what is actually being recommended is good writing. Short, self-contained paragraphs. Clear definitions near the top of sections. Internal logical structure. These are recognizable disciplines: information architecture, technical writing, readability. They have been recognizable disciplines since long before the transformer was invented. They are not a new technical layer.

A more honest version of the pitch would be: Hire someone competent at writing for the web. That sentence does not fit on a pricing page.

The Paper They Don’t Read

There is an actual academic paper called “GEO.” Aggarwal and co-authors, KDD 2024. It is the closest thing to a citable source the SaaS layer has when it sells generative engine optimization as a discipline. It is also, as papers go, easy to skim. Nine “optimization methods” are tested on a 10,000-query benchmark, with results.

What did the paper find worked?

Adding citations from credible sources. Adding quotations from relevant sources. Adding statistics. Improving fluency. Making prose easier to understand. The methods that produced the largest visibility lifts were essentially: write content with more evidence in cleaner prose.

What did the paper test and find did not work?

Keyword stuffing, the closest analogue in the paper to the SEO-era playbook the current GEO and AEO vendors have repackaged. Result: below baseline. The paper’s authors note in plain terms that techniques effective in search engines “may not translate to success in this new paradigm.”

Notice what is not in the list of nine methods. Schema. Structured data. FAQ markup. Heading hierarchy. Machine-readable formats. None of these are tested in the paper, because none of them are the optimization surface the paper studies. The paper is studying content-level interventions: what you put in the words, not metadata layered around the words.

The SaaS layer borrowed the acronym. The findings stayed in the paper. “Technical GEO” is the SEO playbook with different stickers on the same boxes, sold against research that points the other way.

The Assumption Smuggled In

The SaaS pitch only makes sense if you smuggle in one assumption: that the system you’re optimizing for has the same shape as the one that’s been billing SEO clients for a quarter-century. Inputs you control. Outputs that respond. A retrievable causal chain between the two.

That model was always a simplification of how search worked. It was close enough to keep the industry running, and close enough to keep the invoices going out.

None of that simplification survives contact with generative systems. The same prompt produces different answers across sessions, users, temperatures, model versions, and days. Observed behavior across the major engines, not a clean property of any single one. The retrieval layer in front of the model also moves: candidate sources shift, ranking shifts, freshness windows shift. No causal chain runs between “I added FAQ schema” and “the model cited my page.” What runs between them is a probability distribution, and the things you control affect that distribution in ways nobody can cleanly attribute. Not even the people who created these systems.

This is the established line on AI visibility tools, repeated here because it applies to the whole prescriptive layer. Statistically unverifiable data drawn from non-deterministic systems. A 13% citation lift, measured how, against what counterfactual, with what reproducibility? The methodological questions aren’t what those numbers are designed to answer. The numbers are the answer. They land in a graphic, get rendered as ROI in a board deck, and the conversation moves on.

Something To Say In The Meeting

Here is the part that the architecture argument and the methodology argument do not, on their own, explain. Why does the entire SaaS layer keep successfully selling this stuff to people who are not stupid?

The honest version of the answer goes something like: We are operating with reduced visibility into a system that does not expose its mechanics, that returns different outputs to different people for the same query, that is changing month by month, and that has folded a substantial chunk of the funnel into a black box. We can keep doing the work that has always been the work: writing well, being useful, building authority, maintaining the site. We can monitor what shows up where. The deterministic dashboard we used to have is not coming back.

That sentence is unsayable in a marketing meeting. It admits the lever is not connected. It tells leadership that the budget line they approved does not have a corresponding action. It gives the team nothing to put in next quarter’s plan.

So the SaaS layer fills the gap. It manufactures levers. Pillars, frameworks, percentage lifts, schema audits, chunking optimization, machine-readable formats. Reportable activity. Defensible expenditure. Something to say in the meeting. None of this gets you visibility. The engine decides that. What is on offer is the appearance of control, sold to people who would rather pay than concede that control left the room.

Once the lever is bought, it has to be operated. Schema audits get scheduled. Chunking checklists get reviewed. Citation likelihoods get tracked, refreshed, and compared. The dashboard the team paid for becomes the dashboard the team optimizes against, and the dashboard quietly replaces the actual problem with the part of the problem it can see. By the time anyone notices, the SaaS layer is writing the brief.

None of this is a moral failure on the buyer’s side. What you are watching is what happens when an industry has been organized for a quarter-century around the premise that you can pull a lever and watch the meter move, and the meter quietly disconnects from the lever. The vendors aren’t running a con. They are filling demand for the only thing the buyer can no longer afford to do without: an answer that fits in a slide.

Rank And Tank, All Over Again

I keep coming back to a phrase that fits this whole moment: dancing to the rank-and-tank tunes (I borrowed it from David McSweeney). The cycle goes: Vendor sells the controllable-discipline frame, agencies adopt it, content teams scale production around the prescriptions, AI-generated articles get pumped out at volume because the prescriptions are easy to template. Some of it ranks for a while. Most of it eventually tanks because the prescriptions were never the mechanism, and the engine adjusts, or the freshness window closes, or the system simply moves on.

The SEO industry has done this before. Spinning. Mass programmatic pages. Doorway content. Each cycle followed the same shape: a controllable input dressed as a discipline, sold at scale, briefly effective, eventually punished by the engine, replaced by the next controllable input dressed as a discipline.

GEO and AEO are the current cycle. The pillars and percentages and pyramids are this cycle’s templates. Underneath them, the strategies bifurcate.

One path is brand presence exploitation. Plant your name where the engines look. Reddit threads, top-X listicles, the same citation surfaces over and over. The cycle feeds itself: engines cite the surfaces, brands work the surfaces, surfaces feed the engines. I have written about this loop before; I called it the Ouroboros pattern. The short version is that the loop is less stable than the strategy assumes.

The other path is content at scale. Produce variations, pump out volume, treat the templated output as content that could earn a citation. I have written about this approach before, in the “Scaling Disappointment” piece. The short version is that uniqueness is not value, and at the pace these prescriptions enable, qualitative review stops being possible. The volume of AI-generated copy produced under this path is this cycle’s externality.

The next cycle will sell the cleanup.

Forget for a second whether your “Technical GEO” is set up correctly. Ask whether the thing you are putting on the page is worth reading. Large language models were designed to read whatever is there. If what is there is good, it will be read. If what is there is templated, low-utility content optimized against a chunking heuristic that does not exist, it will eventually be filtered out: by the engine, by the user, or by the next academic paper showing that retrieval quality is degraded by exactly this kind of slop.

The advantage, when it accrues, will accrue to the people who do not get distracted. Who do not subscribe to the dashboard. Who keep working on product-driven SEO and the foundations that have always connected content to people. There are early signs of this on the timelines I read. Practitioners openly questioning whether optimizing against a non-deterministic surface makes sense at all, and asking whether their attention belongs back on classical search; which, at the end of the chain, is what feeds these systems anyway.

The mess was always the point. The architecture handles it. The industry just needs to stop pretending the mess is the problem.

More Resources:


This post was originally published on The Inference.


Featured Image: Roman Samborskyi/Shutterstock

Google Adds More Links & Link Context To AI Search via @sejournal, @MattGSouthern

Google is rolling out five updates to how links appear in its generative AI Search experiences, including AI Mode and AI Overviews. The changes add subscription labels and inline links within responses, among other features.

Here’s an example of how the changes will appear:

Image Credit: Google

Hema Budaraju, VP of Product Management, wrote about the updates in a blog post.

What’s New

The updates cover five areas of link display across Google’s generative AI Search features.

Subscription Highlighting In AI Mode & AI Overviews

Google is now labeling links from users’ news subscriptions in AI Mode and AI Overviews.

Google announced subscription highlighting in December for the Gemini app but didn’t provide a timeline for AI Mode or AI Overviews. Today’s announcement confirms the expansion to both surfaces.

Google said that in early testing, people were “significantly more likely” to click links labeled as their subscriptions. The company didn’t share specific numbers.

Publishers who want to help subscribers connect their subscriptions with Google can find details on Google’s developer website.

Topic Suggestions After AI Responses

Serchers will start to see suggestions for related content at the end of many AI responses. These link to articles or analyses on different aspects of the topic.

Discussion and Social Media Previews

Google’s AI responses will include previews of perspectives from public online discussions, social media, and other firsthand sources.

The company is also adding context to these links, such as creator names and community names.

See a provided example:

Image Credit: Google

More Inline Links Within Responses

Users will start to see more links directly within AI response text, positioned next to the relevant passage. Google didn’t quantify how many more inline links users will see or where the change will appear.

See a provided example:

Image Credit: Google

Link Hover Previews on Desktop

On desktop, hovering over an inline link in Google’s AI experiences will show a preview of the linked website. The preview includes the site name and page title. Google noted that people hesitate to click links when they don’t know where they lead.

See a provided example:

Why This Matters

Image Credit: Google

These updates show Google trying to make links more visible in AI Search at a time when publishers are closely monitoring referral traffic.

More inline links, hover previews, discussion cards, and subscription labels all point in the same direction. Google wants AI responses to feel less like dead ends and more like starting points for deeper exploration of the web.

That matters because the debate around AI Search has centered on whether AI answers reduce the need to click. Google is now adding more ways to click, but it isn’t providing the data publishers need to judge the impact.

For websites, that leaves the update in a familiar place. The link treatment may improve visibility, but the traffic impact will still need to be measured in analytics after the rollout reaches its audience.

Looking Ahead

The next question is how consistently these link treatments appear across AI Search surfaces.

Google didn’t provide rollout details for most of the updates, including geography, language, eligibility, or timing. That makes early testing difficult to interpret until we can see where the features appear and which types of queries trigger them.


Featured Image: Danuta Hyniewska/Shutterstock