Creating psychological safety in the AI era

Rolling out enterprise-grade AI means climbing two steep cliffs at once: first, understanding and implementing the tech itself; and second, creating the cultural conditions where employees can maximize its value. While the technical hurdles are significant, the human element can be even more consequential; fear and ambiguity can stall the momentum of even the most promising initiatives.

Psychological safety—feeling free to express opinions and take calculated risks without worrying about career repercussions—is essential for successful AI adoption. In psychologically safe workspaces, employees are empowered to challenge assumptions and raise concerns about new tools without fear of reprisal. This is nothing short of a necessity when introducing a nascent and profoundly powerful technology that still lacks established best practices.

“Psychological safety is mandatory in this new era of AI,” says Rafee Tarafdar, executive vice president and chief technology officer at Infosys. “The tech itself is evolving so fast—companies have to experiment, and some things will fail. There needs to be a safety net.”

To gauge how psychological safety influences success with enterprise-level AI, MIT Technology Review Insights conducted a survey of 500 business leaders. The findings reveal high self-reported levels of psychological safety, but also suggest that fear still has a foothold. Anecdotally, industry experts highlight a reason for the disconnect between rhetoric and reality: while organizations may publicly promote a “safe to experiment” message, deeper cultural undercurrents can counteract that intent.

Building psychological safety requires a coordinated, systems-level approach, and human resources (HR) alone cannot deliver such transformation. Instead, enterprises must deeply embed psychological safety into their collaboration processes.

Key findings for this report include:

  • Companies with experiment-friendly cultures have greater success with AI projects. The majority of executives surveyed (83%) believe a company culture that prioritizes psychological safety measurably improves the success of AI initiatives. Four in five leaders agree that organizations fostering such safety are more successful at adopting AI, and 84% have observed connections between psychological safety and tangible AI outcomes.
  • Psychological barriers are proving to be greater obstacles to enterprise AI adoption than technological challenges. Encouragingly, nearly three-quarters (73%) of respondents indicated they feel safe to provide honest feedback and express opinions freely in their workplace. Still, a significant share (22%) admit they’ve hesitated to lead an AI project because they might be blamed if it misfires.
  • Achieving psychological safety is a moving target for many organizations. Fewer than half of leaders (39%) rate their organization’s current level of psychological safety as “very high.” Another 48% report a “moderate” degree of it. This may mean that some enterprises are pursuing AI adoption on cultural foundations that are not yet fully stable.

Download the report.

This content was produced by Insights, the custom content arm of MIT Technology Review. It was not written by MIT Technology Review’s editorial staff. It was researched, designed, and written by human writers, editors, analysts, and illustrators. This includes the writing of surveys and collection of data for surveys. AI tools that may have been used were limited to secondary production processes that passed thorough human review.

The Top Conversion Barrier in the E.U.

European consumers shop amid strict regulatory transparency and protection. Since it took effect in 2018, the E.U.’s General Data Protection Regulation (GDPR) has produced fines totaling €5.6 billion ($6.6 billion), raising public awareness of data rights and privacy. Consumers expect clear policies, compliant data handling, straightforward returns, and transparent pricing, especially from an unfamiliar seller.

Lack of trust is the top conversion barrier for non-European merchants.

Expectations

The GDPR’s requirements on sellers are explicit: consumers must be able to grant and withdraw consent, and sellers must clearly explain how personally identifiable data is used. These requirements shape shoppers’ expectations.

For example, the E.U.’s Consumer Rights Directive grants shoppers a 14-day right of withdrawal for most online purchases — a de facto baseline for returns across member states.

Under the E.U.’s consumer protection laws, the final price shown to shoppers must include all taxes and fees, including VAT. The Omnibus Directive adds further requirements, such as the rule that any advertised discount must be measured against the lowest price charged in the previous 30 days.

The United States has no comparable federal framework. There’s no nationwide right to withdraw from online purchases, and fewer mandatory disclosures about business identity or tax-inclusive pricing. U.S. shoppers often evaluate trust based on brand familiarity, convenience, and store-specific policies rather than legal guarantees.

Screenshot of Zalando's cookie disclosure, which reads: “We’ll tailor your experience. Zalando, Lounge by Zalando, and Outlets (referred to as “we”) use cookies and other technologies to keep our websites reliable and secure, to measure their performance, and to deliver a personalised shopping experience and personalised advertising. To do this, we collect information about users, their behaviour, and their devices. If you select “Accept all”, you accept this and agree that we share this information with third parties, such as our marketing partners. This may mean that your data is also processed in the USA and China. If you select “Only essential” we will use only the essential cookies and you will not receive any personalised ads. Select “Set preferences” for further details and to manage your options. You can adjust your preferences at any time. For more information, please read our privacy notice and legal notice.” The buttons offered are “Only essential,” “Set preferences,” and “Accept all.”

Berlin-based Zalando, a fashion retailer and marketplace, lets visitors control their cookie settings.

Reviews: Local and Verified

Customer reviews play a significant role in how European shoppers assess an unfamiliar merchant. Cross-border ecommerce is common, and many consumers buy from retailers they don’t know, increasing the reliance on third-party validation. The Omnibus Directive reinforces this behavior by requiring merchants to disclose whether customer reviews are verified and by prohibiting misleading practices related to authenticity.

For example, shoppers in Germany and other parts of Central Europe rely heavily on Trustpilot and Trusted Shops as indicators of merchant reliability.

All shoppers, notably those in France, prefer reviews in their native language.

The volume of reviews matters, too, especially for unfamiliar merchants — the more reviews, the better, particularly when they are clearly verified and in the buyer’s native language.

Conversely, U.S. consumers will purchase even with limited review volume when the seller is recognizable or the experience is convenient.

Policy Pages and Disclosures

For European shoppers, credibility often starts in the footer. Before they buy from an unfamiliar merchant, many will scroll to the bottom of the page to check the company behind the site and what rights they have if something goes wrong. That behavior is reinforced by law.

Under the E-Commerce Directive, online sellers and other service providers must make specific business information “easily, directly, and permanently accessible.” At a minimum, that includes:

  • Legal name,
  • Physical address,
  • Contact details,
  • Applicable trade or VAT registration numbers.

Several countries go further. Germany, Austria, and Switzerland, for example, require an Impressum — a legal statement — that consolidates this information on a single page.

In the U.S., shoppers typically accept limited company details.

Reliable, speedy contact is also a trust signal. Per the E-Commerce Directive, a website can’t rely solely on an email address; sellers must offer a channel for “rapid and effective” communication. Sites that offer no quick way to reach a person raise questions about their reliability.

Payment Security

European shoppers rely on local payment methods. Pay-by-invoice and Klarna’s buy-now-pay-later are common in Germany. iDEAL dominates in the Netherlands, Bancontact is standard in Belgium, and Nordic consumers expect Klarna, MobilePay, or Vipps (pay by phone number).

In parts of Central and Eastern Europe, bank transfers, cash-on-delivery, and marketplace-specific payments remain popular.

For many buyers, familiar payment logos and clear fee transparency are essential to completing a purchase.

U.S. retailers often underestimate the importance of local E.U. payment methods, launching with only credit cards and PayPal.

In short, ecommerce traction in Europe starts with understanding the trust factors that shape the customer journey. Clear disclosures, verified reviews, familiar payment methods, and compliance with regional standards all contribute to credibility and success.

Apple Safari Update Enables Tracking Two Core Web Vitals Metrics

Safari 26.2 adds support for measuring Largest Contentful Paint (LCP) and for the Event Timing API, which is used to calculate Interaction to Next Paint (INP). This enables site owners to collect LCP and INP data from Safari users through the browser Performance API, using their own analytics and real user monitoring tools.

LCP And INP In Apple Safari Browser

LCP is a Core Web Vital and a ranking signal. Interaction To Next Paint (INP), also a Core Web Vitals metric, measures how quickly your website responds to user interactions. Native Safari browser support enables accurate measurement, which closes a long-standing blind spot for performance diagnostics of site visitors using Apple devices.

INP is a particularly critical measurement because it reports the total time between a user’s action (click, tap, or key press) and the next visual update on the screen. It tracks the slowest interaction observed during a user’s visit. INP matters because it tells site owners whether a page feels “frozen” or laggy to visitors. Fast INP scores translate to a positive experience for visitors interacting with the website.

This change will have no effect on public tools like PageSpeed Insights and CrUX data because they are Chrome-based.

However, Safari site visitors can now be included in field performance data where site owners have configured measurement, such as in Google Analytics or other performance monitoring platforms.
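For teams that roll their own field measurement, a minimal sketch of what this collection can look like with the standard PerformanceObserver API is below. The "/rum" endpoint, the 40 ms duration threshold, and the report-on-hidden pattern are illustrative assumptions rather than anything specified in Safari's release notes, and real INP calculation involves more nuance (interaction grouping and high-percentile selection) than this worst-interaction approximation.

```ts
// Minimal sketch: collect LCP and slow-interaction timings in browsers that
// expose these entry types (now including Safari 26.2) and beacon them to a
// hypothetical /rum endpoint for aggregation.
const send = (metric: string, value: number): void => {
  navigator.sendBeacon("/rum", JSON.stringify({ metric, value, page: location.pathname }));
};

// Largest Contentful Paint: the last entry emitted before user input is the
// final LCP candidate for the page load.
new PerformanceObserver((list) => {
  const entries = list.getEntries();
  const last = entries[entries.length - 1];
  if (last) send("LCP", last.startTime);
}).observe({ type: "largest-contentful-paint", buffered: true });

// Event Timing: entries for interactions longer than the duration threshold.
// The slowest one observed is a rough stand-in for Interaction to Next Paint.
let worstInteraction = 0;
new PerformanceObserver((list) => {
  for (const entry of list.getEntries()) {
    worstInteraction = Math.max(worstInteraction, entry.duration);
  }
}).observe({ type: "event", durationThreshold: 40, buffered: true });

// Report the slowest interaction when the page is hidden (tab close or navigation away).
document.addEventListener("visibilitychange", () => {
  if (document.visibilityState === "hidden" && worstInteraction > 0) {
    send("INP-approx", worstInteraction);
  }
});
```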

The following analytics packages can now be configured to surface these metrics from Safari browser site visitors:

  • Google Analytics (GA4, via Web Vitals or custom event collection)
  • Adobe Analytics
  • Matomo
  • Amplitude (with performance instrumentation)
  • Mixpanel (with custom event pipelines)
  • Custom / In-House Monitoring
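As a concrete illustration of the GA4 option above, here is a hedged sketch using Google's open-source web-vitals library alongside an existing gtag.js/GA4 tag. It assumes the library is installed (npm install web-vitals) and that a GA4 property is already configured on the page; the event parameter names are arbitrary choices, not required by GA4.

```ts
// Sketch: forward LCP and INP (now measurable in Safari 26.2) to GA4.
// Assumes `web-vitals` is installed and gtag.js is already loaded on the page.
import { onLCP, onINP, type Metric } from "web-vitals";

// gtag is provided globally by the GA4 snippet; declared here for TypeScript.
declare const gtag: (
  command: "event",
  eventName: string,
  params: Record<string, unknown>
) => void;

function reportToGA4(metric: Metric): void {
  gtag("event", metric.name, {
    value: Math.round(metric.value),  // milliseconds, rounded for GA4
    metric_id: metric.id,             // unique per page load; useful for deduplication
    metric_value: metric.value,       // raw value
    metric_rating: metric.rating,     // "good" | "needs-improvement" | "poor"
  });
}

onLCP(reportToGA4);
onINP(reportToGA4);
```

With Safari 26.2 exposing the underlying performance entries, handlers like these should simply start receiving data from Safari visitors without any browser-specific code.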

Apple Safari’s update also enables Real User Monitoring (RUM) platforms to surface this data for site owners:

  • Akamai mPulse
  • Cloudflare Web Analytics
  • Datadog RUM
  • Dynatrace
  • Elastic Observability (RUM)
  • New Relic Browser
  • Raygun
  • Sentry Performance
  • SpeedCurve
  • Splunk RUM

Apple’s official documentation explains:

“Safari 26.2 adds support for two tools that measure the performance of web applications, Event Timing API and Largest Contentful Paint.

The Event Timing API lets you measure how long it takes for your site to respond to user interactions. When someone clicks a button, types in a field, or taps on a link, the API tracks the full timeline — from the initial input through your event handlers and any DOM updates, all the way to when the browser paints the result on screen. This gives you insight into whether your site feels responsive or sluggish to users. The API reports performance entries for interactions that take longer than a certain threshold, so you can identify which specific events are causing delays. It makes measuring “Interaction to Next Paint” (INP) possible.

Largest Contentful Paint (LCP) measures how long it takes for the largest visible element to appear in the viewport during page load. This is typically your main image, a hero section, or a large block of text — whatever dominates the initial view. LCP gives you a clear signal about when your page feels loaded to users, even if other resources are still downloading in the background.”

Safari 26.2 provides new data that is critical for SEO and for monitoring the user experience, information that site owners rely on. Safari traffic represents a significant share of site visits. These improvements make it possible for site owners to have a more complete view of the real user experience across more devices and browsers.

Why Google’s Spam Problem Is Getting Worse

Spam is back in search. And in a big way.

Honestly, I don’t think Google can handle this at all. The scale is unprecedented. They went after publishers manually with the site reputation abuse update. More expired domain abuse is reaching the top of the SERPs than at any time I can remember in recent history. They’re fighting a losing battle, and they’ve taken their eye off the ball.

In a microcosm, this is what’s happening (Image Credit: Harry Clarkson-Bennett)

A few years ago, search was getting on top of the various spam issues “creative” SEOs were trialling. The prospect of being nerfed by a spam update, and Google’s willingness to invest in and care about the quality of search, seemed to be winning the war. Trying to recover from these penalties is nothing short of disastrous. Just ask anybody hit by the Helpful Content update.

But things have shifted. AI is haphazardly rewriting the rules, and big tech has bigger, more poisonous fish to fry. This is not a great time to be a white hat SEO.

TL;DR

  1. Google is currently losing the war against spam, with unprecedented scale driven by AI-generated slop, and expired domain and PBN abuse.
  2. Google’s spam detection monitors four key groups of signals – content, links, reputational, and behavioral.
  3. Data from the Google Leak suggests its most capable detection focuses on link velocity and anchor text.
  4. AI “search” is dozens of times more expensive than traditional search. This enormous cost and focus on new AI products is leading to underinvestment in core spam-fighting.

How Does Google’s Spam Detection System Work?

Via SpamBrain. Previously, the search giant rolled out Penguin, Panda, and RankBrain to make better decisions based on links and keywords.

And right now, badly.

SpamBrain is designed to identify content and websites engaging in spammy activities with apparently “shocking” accuracy. I don’t know whether shocking in this sense is meant in a positive or negative way right now, but I can only parrot what is said.

Over time, the algorithm learns what is and isn’t spam. Once it has clearly established the signals associated with spammy sites, it can build a neural network around them.

Much like the concept of seed sites, if you have the spammiest websites mapped out, you can accurately score everyone else against them. Then you can analyse signals at scale – content, links, behavioral, and reputational signals – to group sites together.

  • Inputs (content, linking, reputational, and behavioral signals).
  • Hidden layer (clustering and comparing each site to known spam ones).
  • Outputs (spam or not spam).

If your site is bucketed in the same group as obviously spammy sites when it comes to any of the above, that is not a good sign. The algorithm works on thresholds. I imagine you need to sail pretty close to the wind for long enough to get hit by a spam update.

But if your content is relatively thin and low value add, you’re probably halfway there. Add some dangerous links into the mix, some poor business decisions (parasite SEO being the most obvious example), and scaled content abuse, and you’re doomed.
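To make the thresholding idea concrete, here is a deliberately crude toy sketch: a weighted score over the four signal groups with a hypothetical cut-off. The signal definitions, weights, and threshold are invented for illustration and bear no resemblance to the scale or sophistication of the real system.

```ts
// Toy illustration of threshold-based spam bucketing. Not Google's implementation;
// the signal definitions, weights, and cut-off below are invented for illustration.
interface SiteSignals {
  content: number;      // e.g. share of thin or scaled pages (0-1)
  links: number;        // e.g. share of exact-match commercial anchors (0-1)
  behavioral: number;   // e.g. engagement-manipulation score (0-1)
  reputational: number; // e.g. similarity to known spam clusters (0-1)
}

const WEIGHTS: SiteSignals = { content: 0.3, links: 0.4, behavioral: 0.15, reputational: 0.15 };
const SPAM_THRESHOLD = 0.7; // hypothetical cut-off

function spamScore(site: SiteSignals): number {
  return (Object.keys(WEIGHTS) as (keyof SiteSignals)[])
    .reduce((sum, key) => sum + site[key] * WEIGHTS[key], 0);
}

function isBucketedAsSpam(site: SiteSignals): boolean {
  // A site is only hit once its combined signals exceed the threshold, which is
  // why you can sail close to the wind for a while before a spam update bites.
  return spamScore(site) >= SPAM_THRESHOLD;
}

// Thin content plus aggressive links plus parasite-SEO behaviour: doomed.
console.log(isBucketedAsSpam({ content: 0.8, links: 0.9, behavioral: 0.6, reputational: 0.7 })); // true
```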

What Type Of Spam Are We Talking About Here?

Google notes the most egregious activities here. We’re talking:

  • Cloaking.
  • Doorway abuse.
  • Expired domain abuse.
  • Hacked content.
  • Hidden text and content.
  • Keyword stuffing.
  • Link spam.
  • Scaled content abuse.
  • Site reputation abuse.
  • Thin affiliate content.
  • UGC spam.

Lots of these are grossly intertwined. Expired domain abuse and PBNs. Keyword stuffing is a little old hat, but link spam is still very much alive and well. Scaled content abuse is at an all-time high across the internet.

The more content you have spread across multiple, semantically similar websites, the more effective you can be. And the more you use exact and partial match anchors to push authority towards “money” pages, the richer you will become.

Let’s dive into the big ones below.

Fake News

Google Discover – Google’s engagement-baiting, social-network-lite platform – has been hit hard by unscrupulous spammers in recent times. There have been several instances of fake, AI-driven content reaching the masses. It’s become so prevalent, it has even reached legacy media sites (woohoo).

Millions of page views have been sent to expired and drop domain abusers (Image Credit: Harry Clarkson-Bennett)

From changing the state pension age to free bus passes and TV licenses, the spammers know the market. They know how to incite emotions. Hell hath no fury like a pensioner scorned, and while you can forgive the odd slip-up, nobody can be this generous.

The people who have been working by the book are being sidelined. But the opportunities in the black hat world are booming. Which is, in fairness, quite fun.

Scaled Content Abuse

At the time of writing, over 50% of the content on the internet is AI slop. Some say more. Of nearly a million pages analyzed this year, Ahrefs says 74% contain AI-generated content. What we see is just what slips through the mammoth-sized cracks.

Not hard to see what the problem is… (Image Credit: Harry Clarkson-Bennett)

Award-winning journalist Jean-Marc Manach has found over 8,300 AI-generated news websites in French and over 300 in English (the tip of the iceberg, trust me).

He estimates two of these site owners have become millionaires.

By leveraging authoritative expired domains and PBNs (more on that next), SEOs – the people still ruining the internet – know how to game the system: faking clicks, manipulating engagement signals, and exploiting past link equity.

Expired Domain Abuse

The big daddy. Black hat ground zero.

If you engage even a little bit with a black hat community, you’ll know how easy it is right now to leverage expired domains. In the example below, someone had bought the London Road Safety website (a once highly authoritative domain) and turned it into a single-page “best betting sites not on GamStop” site.

This is just one example of many (Image Credit: Harry Clarkson-Bennett)

Betting and crypto are ground zero for all things black hat, just because there’s so much money involved.

I’m not an expert here, but I believe the process is as follows:

  1. Purchase an expired, valuable domain with a strong, clean backlink history (no manual penalties). Ideally, a few of them.
  2. Then you can begin to create your own PBN with unique hosting providers, nameservers, and IP addresses, with a variety of authoritative, aged, and newer domains.
  3. This domain(s) then becomes your equity/authority stronghold.
  4. Spin up multiple TLD variations of the domain, i.e., instead of .com it becomes .org.uk.
  5. Add a mix of exact and partial match anchors from a PBN to the money site to signal its new focus.
  6. Either add a 301 redirect for a short period of time to the money variation of the domain or canonicalize to the variation.

These scams are always short-term plays. But they can be worth tens or hundreds of thousands of pounds when done well. And they are back, and I believe more valuable than ever.

Right now, I think it’s as simple as buying an old charity domain, adding a quick reskin, and voila. A 301 or equity-passing tactic and your single-page site about “best casinos not on GamStop” is printing money. Even in the English-speaking market.

According to notorious black hat fella Charles Floate, some of these companies are laundering hundreds of thousands of pounds a month.

PBNs

A PBN (or Private Blog Network) is a network of websites that someone controls, all linking back to the money site: the variation of the site designed to generate revenue, typically from advertising or affiliates.

The sites in a private blog network have to be completely separate from one another. They cannot share breadcrumbs that Google can trace. Each site needs a standalone:

  • Hosting provider.
  • IP address.
  • Nameserver.

The reason PBNs are so valuable is that you can build up an enormous amount of link equity and falsified topical authority while mitigating risk. Expired domains are risky because they’re expensive, and once they get a penalty, they’re doomed. PBNs spread the risk. Like the heads of a Hydra: one dies, another rises up.

Protecting the tier 1 asset (the purchased aged or expired domain) is paramount. Instead of pointing links directly to the money site, you can link to the sites that link to the money site.

This indirectly boosts the value of the money site, protecting it from Google’s prying eyes.

What Does The Google Leak Show About Spam?

As always, this is an inexact science. Barely even pseudo-science really. I’ve got the tinfoil hat on and a lot of string connecting wild snippets of information around the room to make this work. You should follow Shaun Anderson here.

If I take every mention of the word “spam” in the module names and descriptions, there are around 115, once I’ve removed any nonsense. Then we can categorize those into content, links, reputational, and behavioral signals.

Taking it one step further, these modules can be classified as relating to things like link building, anchor text, content quality, and so on. This gives us a rough sense of what matters in terms of scale.

Anchor text makes up the lion’s share of spammy modules, based on data from the Google Leak (and my own flawed categorization) (Image Credit: Harry Clarkson-Bennett)

A few examples:

  • spambrainTotalDocSpamScore calculates a document’s overall spam score.
  • IndexingDocjoinerAnchorPhraseSpamInfo and IndexingDocjoinerAnchorSpamInfo modules identify spammy anchor phrases by looking at the number, velocity, the days the links were discovered, and the time the spike ended.
  • GeostoreSourceTrustProto helps evaluate the trustworthiness of a source.
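The mechanics behind that categorization are simple enough to reproduce: filter the module names and descriptions for spam-related terms, then bucket each one by keyword. A rough sketch (with the module strings and category patterns as illustrative stand-ins, not a dump of the leak itself) could look like this:

```ts
// Rough sketch of the categorization exercise described above. The module list and
// the regex buckets are illustrative stand-ins, not the actual leak data or taxonomy.
const modules: string[] = [
  "spambrainTotalDocSpamScore: calculates a document's overall spam score",
  "IndexingDocjoinerAnchorSpamInfo: identifies spammy anchor phrases",
  "GeostoreSourceTrustProto: helps evaluate the trustworthiness of a source",
];

const categories: Record<string, RegExp> = {
  links: /anchor|link|outlink/i,
  content: /doc|content|quality/i,
  reputational: /trust|reputation|authority/i,
  behavioral: /click|nav|impression|behav/i,
};

const counts: Record<string, number> = {};
for (const moduleName of modules.filter((m) => /spam|trust/i.test(m))) {
  for (const [category, pattern] of Object.entries(categories)) {
    if (pattern.test(moduleName)) {
      counts[category] = (counts[category] ?? 0) + 1;
      break; // first matching bucket wins, so each module is counted once
    }
  }
}
console.log(counts); // -> { content: 1, links: 1, reputational: 1 }
```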

Really, the takeaway is how important links are from a spam perspective. Particularly anchor text. The velocity at which you gain links matters, as does the text and surrounding content. Linking seems to be where Google’s algorithm is most capable of identifying red and amber flags.

If your link velocity graph spiked with exact match anchors to highly commercial pages, that’s a flag. Once a site is pinged for this type of content or link-related abuse, the behavioral and reputational signals are analysed as part of SpamBrain.

If these corroborate and your site exceeds certain thresholds, you’re doomed. It’s why this has (until recently) been a relatively fine art.

Ultimately, They’re Just Investing Less In Traditional Search

As Martin McGarry pointed out, they just care a bit less … They have bigger, more hallucinogenic fish to fry.

Image Credit: Harry Clarkson-Bennett

In 2025, we have had four updates, with a duration of c. 70 days. In 2024, we had seven that lasted almost 130 days. Productivity levels we can all aspire to.

It’s Not Hard To Guess Why…

The bleeding-edge search experience is changing. Google is rolling out preferred publisher sources globally and inline linking more effectively in its AI products. Much-needed changes.

I think we’re seeing the real-time moulding of the new search experience in the form of The Google Web Guide. A personalized mix of trusted sources, AI Mode, a more classic search interface, and something inspirational. I suspect this might be a little like a Discover-lite feed. A place in the traditional search interface where content you will almost certainly like is fed to you to keep you engaged.

Unconfirmed, but apparently, Google has added persona-driven recommendation signals and a private publisher entity layer, among other things. Grouping users into cohorts is, I believe, a fundamental part of Discover. It’s what allows content to go viral.

Once you understand enough about a user to bucket them into specific groups, you can saturate a market over the course of a few days on Discover. Less, even. But the problem is the economics of it all. Ten blue links are cheap. AI is not. At any level.

According to Google, when someone chooses a preferred source, they click through to that site twice as often on average. So I suspect it’s worth taking seriously.

Why Are AI Searches So Much More Expensive?

Google is going to spend $10 billion more this year than expected due to the growing demand for cloud services. YoY, Google’s CAPEX spend is nearly double 2024’s $52.5 billion.

It’s not just Google. It’s a Silicon Valley race to the bottom.

2025 has been extrapolated, but on course for $92 billion this year (Image Credit: Harry Clarkson-Bennett)

While Google hasn’t released public information on this, it’s no secret that AI searches are significantly more expensive than the classic 10 blue links. Traditional search is largely static and retrieval-based. It relies on pre-indexed pages to serve a list of links and is very cheap to run.

An AI Overview is generative. Google has to run a large language model to summarize and generate a natural language answer. AI Mode is significantly worse. The multi-turn, conversational interface processes the entire dialogue in addition to the new query.

Given the query fan-out technique – where dozens of searches are run in parallel – this process demands significantly more computational power.

Custom chips, efficiencies, and caching can reduce the cost of this. But this is one of Google’s biggest challenges, and I suspect it is exactly why Barry believes AI Mode won’t be the default search experience. I’d be surprised if it isn’t just applied at a search/personalization level, too. There are plenty of branded and navigational searches where this would be an enormous waste of money.

And these guys really love money.

According to The IET, if the population of London (>9.7 million) asked ChatGPT to write a 100-word email, it would require 4,874,000 litres of water to cool the servers – equivalent to filling over seven 25m swimming pools.

LLMs Already Have A Spam Problem

This is pretty well documented. LLMs seem to be driven at least in part by the sheer volume of mentions in the training data. Everything is ingested and taken as read.

Image Credit: Harry Clarkson-Bennett

When you add a line in your footer describing something you or your business did, it’s taken as read. Spammy, low-quality tactics work more effectively than heavy lifting.

Ideally, we wouldn’t live in a world where low-lift shit outperforms proper marketing efforts. But here we are.

Like in 2012, “best” lists are on the tip of everyone’s tongue. Basic SEO is making a comeback because that’s what is currently working in LLMs. Paid placements, reciprocal link exchanges. You name it.

Image Credit: Harry Clarkson-Bennett

If it’s half-arsed, it’s making a comeback.

As these models rely on Google’s index for searches that the model cannot confidently answer (RAG), Google’s spam engine matters more than ever. In the same way that I think publishers need to take a stand against big tech and AI, Google needs to step up and take this seriously.

I’m Not Sure Anyone Is Going To…

I’m not even sure they want to right now. OpenAI has signed some pretty extraordinary contracts, and its revenue is light-years away from where it needs to be. And Google’s CAPEX is through the roof.

So, things like quality and accuracy are not at the top of the list. Consumer and investor confidence is not that high. They need to make some money. And private companies can be a bit laissez-faire when it comes to reporting on revenue and profits.

According to HSBC, OpenAI needs to raise at least $207 billion by 2030 so it can continue to lose money. Being described as ‘a money pit with a website on top’ isn’t a great look.

New funding has to be thrown at data centres (Image Credit: Harry Clarkson-Bennett)

Let’s see them post-hoc rationalize their way out of this one. That’s it. Thank you for reading and subscribing to my last update of the year. Certainly been a year.



This post was originally published on Leadership in SEO.


Featured Image: Khaohom Mali/Shutterstock

A new SEO Task list to guide you inside Yoast SEO

Doing SEO well often means knowing what to focus on and when to do so. That is not always easy, especially when you are juggling content, updates, and day-to-day site management. That is why we are introducing a new SEO task list in the Yoast plugin. 
 
The Task List helps you improve your SEO step by step, directly inside your dashboard. It turns best practices into clear, actionable tasks, so you can make progress with confidence and without second-guessing your work. 
 

Why the SEO checklist matters: 

Turn SEO advice into clear actions 
 

Instead of vague recommendations or long documentation, the Task List shows you exactly what to do next. Each item focuses on a crucial SEO fundamental, helping you take meaningful action rather than getting lost in details that don’t move the needle. 
 
This makes SEO more approachable, especially if you are not an expert. You do not need to keep up with every update or technique. The Task List guides you through what matters most. 

Build better SEO habits over time

The Task List is not just about finishing tasks. By following it regularly, you start to recognize patterns and best practices that lead to stronger content and a healthier site. Over time, this helps you build better SEO habits that carry over into everything you publish. 
 
For teams, the Task List also brings consistency. It helps everyone follow the same SEO standards, regardless of skill level or experience. 

SEO guidance where you already work 

Because the Task List lives inside Yoast SEO, you can improve your SEO without switching tools or breaking your workflow. It supports you where the work happens, making SEO a natural part of creating and maintaining your content. 
 
The foundational version of the SEO Task List is available in Yoast SEO, and a more comprehensive list is available for Yoast SEO Premium users.

Eight Overlooked Reasons Why Sites Lose Rankings In Core Updates

There are multiple reasons why a site can drop in rankings due to a core algorithm update. The reasons may reflect specific changes to the way Google interprets content, a search query, or both. The change could also be subtle, like an infrastructure update that enables finer relevance and quality judgments. Here are eight commonly overlooked reasons why a site may have lost rankings after a Google core update.

Ranking Where It’s Supposed To Rank?

If the site was previously ranking well and now it doesn’t, it could be what I call “it’s ranking where it’s supposed to rank.” That means that some part of Google’s algorithm has caught up to a loophole that the page was intentionally or accidentally taking advantage of and is currently ranking it where it should have been ranking in the first place.

This is difficult to diagnose because a publisher might believe that the web pages or links were perfect the way they previously were, but in fact there was an issue.

Topic Theming Defines Relevance

A part of the ranking process is determining what the topic of a web page is. Google admitted a year ago that a core topicality system is a part of the ranking process. The concept of topicality as part of the ranking algorithm is real.

The so-called Medic Update of 2018 brought this part of Google’s algorithm into sharp focus. Suddenly, sites that were previously relevant for medical keywords were nowhere to be found because they dealt in folk remedies, not medical ones. What happened was that Google’s understanding of what keyword phrases were about became more topically focused.

Bill Slawski wrote about a Google patent (Website representation vector) that describes a way to classify websites by knowledge domains and expertise levels that sounds like a direct match to what the Medic Update was about.

The patent describes part of what it’s doing:

“The search system can use information for a search query to determine a particular website classification that is most responsive to the search query and select only search results with that particular website classification for a search results page. For example, in response to receipt of a query about a medical condition, the search system may select only websites in the first category, e.g., authored by experts, for a search results page.”

Google’s interpretation of what it means to be relevant became increasingly about topicality in 2018 and continued to be refined in successive updates over the years. Instead of relying on links and keyword similarity, Google introduced a way to identify and classify sites by knowledge domain (the topic) in order to better understand how search queries and content are relevant to each other.

Returning to the medical queries, the reason many sites lost rankings during the Medic Update was that their topics were outside the knowledge domain of medical remedies and science. Sites about folk and alternative healing were permanently locked out of ranking for medical phrases, and no amount of links could ever restore their rankings. The same thing happened across many other topics and continues to affect rankings as Google’s ability to understand the nuances of topical relevance is updated.

Example Of Topical Theming

A way to think of topical theming is to consider that keyword phrases can be themed by topic. For example, the keyword phrase “bomber jacket” is related to military clothing, flight clothing, and men’s jackets. At the time of writing, Alpha Industries, a manufacturer of military clothing, is ranked number one in Google. Alpha Industries is closely related to military clothing because the company not only focuses on selling military-style clothing but also started out as a military contractor producing clothing for America’s military, so consumers closely identify it with military clothing.

Screenshot Showing Topical Theming

Screenshot of SERPs showing how Google interprets a keyword phrase and web pages

So it’s not surprising that Alpha Industries ranks #1 for bomber jacket because it ticks both boxes for the topicality of the phrase Bomber Jacket:

  • Shopping > Military clothing
  • Shopping > Men’s clothing

If your page was previously ranking and now it isn’t, then it’s possible that the topical theme was redefined more sharply. The only way to check this is to review the top ranked sites, focusing, for example, on the differences between ranges such as position one and two, or sometimes positions one through three or positions one through five. The range depends on how the topic is themed. In the example of the Bomber Jacket rankings, positions one through three are themed by “military clothing” and “Men’s clothing.” Position three in my example is held by the Thursday Boot Company, which is themed more closely with “men’s clothing” than it is with military clothing. Perhaps not coincidentally, the Thursday Boot Company is closely identified with men’s fashion.

This is a way to analyze the SERPs to understand why sites are ranking and why others are not.

Topic Personalization

Sometimes the topical themes are not locked into place because user intents can change. In that case, opening a new browser or searching a second time in a different tab might cause Google to change the topical theme to a different topical intent.

In the case of the “bomber jacket” search results, the hierarchy of topical themes can change to:

  • Informational > Article About Bomber Jackets
  • Shopping > Military clothing
  • Shopping > Men’s clothing

The reason for that is directly related to the user’s information need, which informs the intent and the correct topic. In the above case, it looks like the military clothing theme may be the dominant user intent for this topic, but the informational/discovery intent may be a close tie that’s triggered by personalization. This can vary by previous searches but also by geographic location, a user’s device, and even by the time of day.

The takeaway is that there may not be anything wrong with a site. It’s just ranking for a more specific topical intent. So if the topic is getting personalized so that your page no longer ranks, a solution may be to create another page to focus on the additional topic theme that Google is ranking.

Authoritativeness

In one sense, authoritativeness can be seen as an external validation of expertise of a website as a go-to source for a product, service, or content topic. While the expertise of the author contributes to authoritativeness and authoritativeness in a topic can be inherent to a website, ultimately it’s third-party recognition from readers, customers, and other websites (in the form of citations and links) that communicate a website’s authoritativeness back to Google as a validating signal.

The above can be reduced to these four points:

  1. Expertise and topical focus originate within the website.
  2. Authoritativeness is the recognition of that expertise.
  3. Google does not assess that recognition directly.
  4. Third-party signals can validate a site’s authoritativeness.

To that we can add the previously discussed Website Representation Vector patent that shows how Google can identify expertise and authoritativeness.

What’s going on then is that Google selects relevant content and then winnows that down by prioritizing expert content.

Here’s how Google explains how it uses E-E-A-T:

“Google’s automated systems are designed to use many different factors to rank great content. After identifying relevant content, our systems aim to prioritize those that seem most helpful. To do this, they identify a mix of factors that can help determine which content demonstrates aspects of experience, expertise, authoritativeness, and trustworthiness, or what we call E-E-A-T.”

Authoritativeness is not about how often a site publishes about a topic; any spammer can do that. It has to be about more than that. E-E-A-T is a standard to hold your site up to.

Stuck On Page Two Of Search Results? Try Some E-E-A-T

Speaking of E-E-A-T, many SEOs have the mistaken idea that it’s something they can add to websites. That’s not how it works. At the 2025 New York City Search Central Live event, Google’s John Mueller confirmed that E-E-A-T is not something you add to web pages.

He said:

“Sometimes SEOs come to us or like mention that they’ve added EEAT to their web pages. That’s not how it works. Sorry, you can’t sprinkle some experiences on your web pages. It’s like, that doesn’t make any sense.”

Clearly, content reflects qualities of authoritativeness, trustworthiness, expertise, and experience, but it’s not something that you add to content. So what is it?

E-E-A-T is just a standard to hold your site up to. It’s also a subjective judgment made by site visitors. A subjective judgment is like how a sandwich can taste great, with the “great” part being the subjective judgment. It is a matter of opinion.

One thing that is difficult for SEOs to diagnose is when their content is missing that extra something to push their site onto the first page of the SERPs. It can feel unfair to see competitors ranking on the first page of the SERPs even though your content is just as good as theirs.

Often, the difference is that the top-ranked web pages are optimized for people. Another reason is that more people know about them because they have a multimodal approach to content, whereas the site on page two of the SERPs mainly communicates via textual content.

In SERPs where Google prefers to rank government and educational sites for a particular keyword phrase, with only one commercial site among them, I almost always find evidence that the commercial site’s content and outreach resonate with site visitors in ways that its competitors’ do not. Websites that focus on multimodal, people-optimized content and experiences are usually what I find in those weird outlier rankings.

So if your site is stuck on page two, revisit the top-ranked web pages and identify ways that those sites are optimized for people and multimodal content. You may be surprised to see what makes those sites resonate with users.

Temporary Rankings

Some rankings are not made to last. This is the case with a new site or new page ranking boost. Google has a thing where it tastes a new site to see how it fits with the rest of the Internet. A lot of SEOs crow about their client’s new website conquering the SERPs right out of the gate. What you almost never hear about is when those same sites drop out of the SERPs.

This isn’t a bad thing. It’s normal. It simply means that Google has tried the site and now it’s time for the site to earn its place in the SERPs.

There’s Nothing Wrong With The Site?

Many site publishers find it frustrating to be told that there’s nothing wrong with their site even though it lost rankings. What’s going on may be that the site and web page are fine, but that the competitors’ pages are finer. These kinds of issues are typically where the content is fine and the competitors’ content is about the same but is better in small ways.

This is the one form of ranking drop that many SEOs and publishers easily overlook because SEOs generally try to identify what’s “wrong” with a site, and when nothing obvious jumps out at them, they try to find something wrong with the backlinks or something else.

This inability to find something wrong leads to recommendations like filing link disavows to get rid of spam links or removing content to fix perceived but not actual problems (like duplicate content). They’re basically grasping at straws to find something to fix.

But sometimes it’s not that something is wrong with the site. Sometimes it’s just that there’s something right with the competitors.

What can be right with competitors?

  • Links
  • User experience
  • Image content (for example, site visitors are reflected in image content).
  • Multimodal approach
  • Strong outreach to potential customers
  • In-person marketing
  • Word-of-mouth promotion
  • Better advertising
  • Optimized for people

SEO Secret Sauce: Optimized For People

Optimizing for people is a common blind spot. Optimizing for people is a subset of conversion optimization. Conversion optimization is about subtle signals that indicate a web page contains what the site visitor needs.

Sometimes that need is to be recognized and acknowledged. It can be reassurance that you’re available right now or that the business is trustworthy.

For example, a client’s site featured a badge at the top of the page that said something like “Trusted by over 200 of the Fortune 500.” That badge whispered, “We’re legitimate and trustworthy.”

Another example is how a business identified that most of their site visitors were mothers of boys, so their optimization was to prioritize images of mothers with boys. This subtly recognized the site visitor and confirmed that what’s being offered is for them.

Nobody loves a site because it’s heavily SEO’d, but people do love sites that acknowledge the site visitor in some way. This is the secret sauce that’s invisible to SEO tools but helps sites outrank their competitors.

It may be helpful to avoid mimicking what competitors are doing and instead differentiate the site and its outreach in ways that make people like your site more. When I say outreach, I mean actively seeking out places where your typical customer might be hanging out and figuring out how you can make your pitch there. Third-party signals have long been strong ranking factors at Google, and now, with AI Search, what people and other sites say about your site plays an increasingly important role in rankings.

Takeaways

  1. Core updates sometimes correct over-ranking, not punish sites
    Ranking drops sometimes reflect Google closing loopholes and placing pages where they should have ranked all along rather than identifying new problems.
  2. Topical theming has become more precise
    Core updates sometimes make existing algorithms more precise. Google increasingly ranks content based on topical categories and intent, not just keywords or links.
  3. Topical themes can change dynamically
    Search results may shift between informational and commercial themes depending on context such as prior searches, location, device, or time of day.
  4. Authoritativeness is externally validated
    Recognition from users, citations, links, and broader awareness can be the difference between a site that ranks and one that does not.
  5. SEO does not control E-E-A-T and can’t be reduced to an on-page checklist
    While qualities of expertise and authoritativeness are inherent in content, they’re still subjective judgments inferred from external signals, not something that SEOs can directly add to content.
  6. Temporary ranking boosts are normal
    New pages and sites are tested briefly, then must earn long-term placement through sustained performance and reception.
  7. Competitors may simply be better for users
    Ranking losses often occur because competitors outperform in subtle but meaningful ways, not because the losing site is broken.
  8. People-first optimization is a competitive advantage
    Sites that resonate emotionally, visually, and practically with visitors often outperform purely SEO-optimized pages.

Ranking changes after a core update sometimes reflect clearer judgments about relevance, authority, and usefulness rather than newly discovered web page flaws. As Google sharpens how it understands topics, pages increasingly compete on how well they align with what users are actually trying to accomplish and which sources people already recognize and trust. The lasting advantage comes from building a site that resonates with actual visitors, earns attention beyond search, and gives Google consistent evidence that users prefer it over alternatives. Marketing, the old-fashioned tell-people-about-a-business approach to promoting it, should not be overlooked.

Featured Image by Shutterstock/Silapavet Konthikamee

AI coding is now everywhere. But not everyone is convinced.

Depending on who you ask, AI-powered coding is either giving software developers an unprecedented productivity boost or churning out masses of poorly designed code that saps their attention and sets software projects up for serious long-term maintenance problems.

The problem is that, right now, it’s not easy to know which is true.

As tech giants pour billions into large language models (LLMs), coding has been touted as the technology’s killer app. Both Microsoft CEO Satya Nadella and Google CEO Sundar Pichai have claimed that around a quarter of their companies’ code is now AI-generated. And in March, Anthropic’s CEO, Dario Amodei, predicted that within six months 90% of all code would be written by AI. It’s an appealing and obvious use case. Code is a form of language, we need lots of it, and it’s expensive to produce manually. It’s also easy to tell if it works—run a program and it’s immediately evident whether it’s functional.


This story is part of MIT Technology Review’s Hype Correction package, a series that resets expectations about what AI is, what it makes possible, and where we go next.


Executives enamored with the potential to break through human bottlenecks are pushing engineers to lean into an AI-powered future. But after speaking to more than 30 developers, technology executives, analysts, and researchers, MIT Technology Review found that the picture is not as straightforward as it might seem.  

For some developers on the front lines, initial enthusiasm is waning as they bump up against the technology’s limitations. And as a growing body of research suggests that the claimed productivity gains may be illusory, some are questioning whether the emperor is wearing any clothes.

The pace of progress is complicating the picture, though. A steady drumbeat of new model releases means these tools’ capabilities and quirks are constantly evolving. And their utility often depends on the tasks they are applied to and the organizational structures built around them. All of this leaves developers navigating confusing gaps between expectation and reality.

Is it the best of times or the worst of times (to channel Dickens) for AI coding? Maybe both.

A fast-moving field

It’s hard to avoid AI coding tools these days. There is a dizzying array of products available, both from model developers like Anthropic, OpenAI, and Google and from companies like Cursor and Windsurf, which wrap these models in polished code-editing software. And according to Stack Overflow’s 2025 Developer Survey, they’re being adopted rapidly, with 65% of developers now using them at least weekly.

AI coding tools first emerged around 2016 but were supercharged with the arrival of LLMs. Early versions functioned as little more than autocomplete for programmers, suggesting what to type next. Today they can analyze entire code bases, edit across files, fix bugs, and even generate documentation explaining how the code works. All this is guided through natural-language prompts via a chat interface.

“Agents”—autonomous LLM-powered coding tools that can take a high-level plan and build entire programs independently—represent the latest frontier in AI coding. This leap was enabled by the latest reasoning models, which can tackle complex problems step by step and, crucially, access external tools to complete tasks. “This is how the model is able to code, as opposed to just talk about coding,” says Boris Cherny, head of Claude Code, Anthropic’s coding agent.

These agents have made impressive progress on software engineering benchmarks—standardized tests that measure model performance. When OpenAI introduced the SWE-bench Verified benchmark in August 2024, offering a way to evaluate agents’ success at fixing real bugs in open-source repositories, the top model solved just 33% of issues. A year later, leading models consistently score above 70%.

In February, Andrej Karpathy, a founding member of OpenAI and former director of AI at Tesla, coined the term “vibe coding”—meaning an approach where people describe software in natural language and let AI write, refine, and debug the code. Social media abounds with developers who have bought into this vision, claiming massive productivity boosts.

But while some developers and companies report such productivity gains, the hard evidence is more mixed. Early studies from GitHub, Google, and Microsoft—all vendors of AI tools—found developers completing tasks 20% to 55% faster. But a September report from the consultancy Bain & Company described real-world savings as “unremarkable.”

Data from the developer analytics firm GitClear shows that most engineers are producing roughly 10% more durable code—code that isn’t deleted or rewritten within weeks—since 2022, likely thanks to AI. But that gain has come with sharp declines in several measures of code quality. Stack Overflow’s survey also found trust and positive sentiment toward AI tools falling significantly for the first time. And most provocatively, a July study by the nonprofit research organization Model Evaluation & Threat Research (METR) showed that while experienced developers believed AI made them 20% faster, objective tests showed they were actually 19% slower.

Growing disillusionment

For Mike Judge, principal developer at the software consultancy Substantial, the METR study struck a nerve. He was an enthusiastic early adopter of AI tools, but over time he grew frustrated with their limitations and the modest boost they brought to his productivity. “I was complaining to people because I was like, ‘It’s helping me but I can’t figure out how to make it really help me a lot,’” he says. “I kept feeling like the AI was really dumb, but maybe I could trick it into being smart if I found the right magic incantation.”

When asked by a friend, Judge had estimated the tools were providing a roughly 25% speedup. So when he saw similar estimates attributed to developers in the METR study, he decided to test his own. For six weeks, he guessed how long a task would take, flipped a coin to decide whether to use AI or code manually, and timed himself. To his surprise, AI slowed him down by a median of 21%—mirroring the METR results.

This got Judge crunching the numbers. If these tools were really speeding developers up, he reasoned, you should see a massive boom in new apps, website registrations, video games, and projects on GitHub. He spent hours and several hundred dollars analyzing all the publicly available data and found flat lines everywhere.

“Shouldn’t this be going up and to the right?” says Judge. “Where’s the hockey stick on any of these graphs? I thought everybody was so extraordinarily productive.” The obvious conclusion, he says, is that AI tools provide little productivity boost for most developers. 

Developers interviewed by MIT Technology Review generally agree on where AI tools excel: producing “boilerplate code” (reusable chunks of code repeated in multiple places with little modification), writing tests, fixing bugs, and explaining unfamiliar code to new developers. Several noted that AI helps overcome the “blank page problem” by offering an imperfect first stab to get a developer’s creative juices flowing. It can also let nontechnical colleagues quickly prototype software features, easing the load on already overworked engineers.

These tasks can be tedious, and developers are typically  glad to hand them off. But they represent only a small part of an experienced engineer’s workload. For the more complex problems where engineers really earn their bread, many developers told MIT Technology Review, the tools face significant hurdles.

Perhaps the biggest problem is that LLMs can hold only a limited amount of information in their “context window”—essentially their working memory. This means they struggle to parse large code bases and are prone to forgetting what they’re doing on longer tasks. “It gets really nearsighted—it’ll only look at the thing that’s right in front of it,” says Judge. “And if you tell it to do a dozen things, it’ll do 11 of them and just forget that last one.”

Image Credit: Derek Brahney

LLMs’ myopia can lead to headaches for human coders. While an LLM-generated response to a problem may work in isolation, software is made up of hundreds of interconnected modules. If these aren’t built with consideration for other parts of the software, it can quickly lead to a tangled, inconsistent code base that’s hard for humans to parse and, more important, to maintain.

Developers have traditionally addressed this by following conventions—loosely defined coding guidelines that differ widely between projects and teams. “AI has this overwhelming tendency to not understand what the existing conventions are within a repository,” says Bill Harding, the CEO of GitClear. “And so it is very likely to come up with its own slightly different version of how to solve a problem.”

The models also just get things wrong. Like all LLMs, coding models are prone to “hallucinating”—it’s an issue built into how they work. But because the code they output looks so polished, errors can be difficult to detect, says James Liu, director of software engineering at the advertising technology company Mediaocean. Put all these flaws together, and using these tools can feel a lot like pulling a lever on a one-armed bandit. “Some projects you get a 20x improvement in terms of speed or efficiency,” says Liu. “On other things, it just falls flat on its face, and you spend all this time trying to coax it into granting you the wish that you wanted and it’s just not going to.”

Judge suspects this is why engineers often overestimate productivity gains. “You remember the jackpots. You don’t remember sitting there plugging tokens into the slot machine for two hours,” he says.

And it can be particularly pernicious if the developer is unfamiliar with the task. Judge remembers getting AI to help set up a Microsoft cloud service called Azure Functions, which he’d never used before. He thought it would take about two hours, but nine hours later he threw in the towel. “It kept leading me down these rabbit holes and I didn’t know enough about the topic to be able to tell it ‘Hey, this is nonsensical,’” he says.

The debt begins to mount up

Developers constantly make trade-offs between speed of development and the maintainability of their code—creating what’s known as “technical debt,” says Geoffrey G. Parker, professor of engineering innovation at Dartmouth College. Each shortcut adds complexity and makes the code base harder to manage, accruing “interest” that must eventually be repaid by restructuring the code. As this debt piles up, adding new features and maintaining the software becomes slower and more difficult.

Accumulating technical debt is inevitable in most projects, but AI tools make it much easier for time-pressured engineers to cut corners, says GitClear’s Harding. And GitClear’s data suggests this is happening at scale. Since 2020, the company has seen a significant rise in the amount of copy-pasted code—an indicator that developers are reusing more code snippets, most likely based on AI suggestions—and an even bigger decline in the amount of code moved from one place to another, which happens when developers clean up their code base.

And as models improve, the code they produce is becoming increasingly verbose and complex, says Tariq Shaukat, CEO of Sonar, which makes tools for checking code quality. This is driving down the number of obvious bugs and security vulnerabilities, he says, but at the cost of increasing the number of “code smells”—harder-to-pinpoint flaws that lead to maintenance problems and technical debt. 

Recent research by Sonar found that these make up more than 90% of the issues found in code generated by leading AI models. “Issues that are easy to spot are disappearing, and what’s left are much more complex issues that take a while to find,” says Shaukat. “That’s what worries us about this space at the moment. You’re almost being lulled into a false sense of security.”

If AI tools make it increasingly difficult to maintain code, that could have significant security implications, says Jessica Ji, a security researcher at Georgetown University. “The harder it is to update things and fix things, the more likely a code base or any given chunk of code is to become insecure over time,” says Ji.

There are also more specific security concerns, she says. Researchers have discovered a worrying class of hallucinations where models reference nonexistent software packages in their code. Attackers can exploit this by creating packages with those names that harbor vulnerabilities, which the model or developer may then unwittingly incorporate into software. 
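One partial defense, offered here purely as an illustrative sketch rather than a recommendation from the researchers quoted above, is to treat any dependency that an AI-generated change introduces as unreviewed until a human approves it. The file names in the sketch below are assumptions, and an allowlist is only a partial guard; it narrows the problem rather than solving it.

```python
# Illustrative sketch (not from the researchers quoted above): flag any
# dependency an AI-generated change introduces that is not already on a
# human-approved allowlist, as a partial guard against "hallucinated" or
# attacker-registered package names. File names are assumptions.
import sys


def read_names(path: str) -> set[str]:
    """Read bare package names, ignoring comments and version pins."""
    names = set()
    with open(path) as f:
        for line in f:
            name = line.split("#")[0].split("==")[0].split(">=")[0].strip()
            if name:
                names.add(name.lower())
    return names


def main() -> int:
    approved = read_names("approved-packages.txt")  # maintained by humans
    requested = read_names("requirements.txt")      # possibly AI-edited
    unreviewed = requested - approved
    if unreviewed:
        print("Dependencies needing human review:", sorted(unreviewed))
        return 1
    print("All dependencies are on the approved list.")
    return 0


if __name__ == "__main__":
    sys.exit(main())
```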

LLMs are also vulnerable to “data-poisoning attacks,” where hackers seed the publicly available data sets models train on with data that alters the model’s behavior in undesirable ways, such as generating insecure code when triggered by specific phrases. In October, research by Anthropic found that as few as 250 malicious documents can introduce this kind of back door into an LLM regardless of its size.

The converted

Despite these issues, though, there’s probably no turning back. “Odds are that writing every line of code on a keyboard by hand—those days are quickly slipping behind us,” says Kyle Daigle, chief operating officer at the Microsoft-owned code-hosting platform GitHub, which produces a popular AI-powered tool called Copilot (not to be confused with the Microsoft product of the same name).

The Stack Overflow report found that despite growing distrust in the technology, usage has increased rapidly and consistently over the past three years. Erin Yepis, a senior analyst at Stack Overflow, says this suggests that engineers are taking advantage of the tools with a clear-eyed view of the risks. The report also found that frequent users tend to be more enthusiastic, and that more than half of developers are not yet using the latest coding agents, which may explain why many remain underwhelmed by the technology.

Those latest tools can be a revelation. Trevor Dilley, CTO at the software development agency Twenty20 Ideas, says he had found some value in AI editors’ autocomplete functions, but when he tried anything more complex it would “fail catastrophically.” Then in March, while on vacation with his family, he set the newly released Claude Code to work on one of his hobby projects. It completed a four-hour task in two minutes, and the code was better than what he would have written.

“I was like, Whoa,” he says. “That, for me, was the moment, really. There’s no going back from here.” Dilley has since cofounded a startup called DevSwarm, which is creating software that can marshal multiple agents to work in parallel on a piece of software.

The challenge, says Armin Ronacher, a prominent open-source developer, is that the learning curve for these tools is shallow but long. Until March he’d remained unimpressed by AI tools, but after leaving his job at the software company Sentry in April to launch a startup, he started experimenting with agents. “I basically spent a lot of months doing nothing but this,” he says. “Now, 90% of the code that I write is AI-generated.”

Getting to that point involved extensive trial and error, to figure out which problems tend to trip the tools up and which they can handle efficiently. Today’s models can tackle most coding tasks with the right guardrails, says Ronacher, but these can be very task and project specific.

To get the most out of these tools, developers must surrender control over individual lines of code and focus on the overall software architecture, says Nico Westerdale, chief technology officer at the veterinary staffing company IndeVets. He recently built a data science platform of some 100,000 lines of code almost exclusively by prompting models rather than writing the code himself.

Westerdale’s process starts with an extended conversation with the agent to develop a detailed plan for what to build and how. He then guides it through each step. It rarely gets things right on the first try and needs constant wrangling, but if you force it to stick to well-defined design patterns, it can produce high-quality, easily maintainable code, says Westerdale. He reviews every line, and the code is as good as anything he’s ever produced, he says: “I’ve just found it absolutely revolutionary. It’s also frustrating, difficult, a different way of thinking, and we’re only just getting used to it.”

But while individual developers are learning how to use these tools effectively, getting consistent results across a large engineering team is significantly harder. AI tools amplify both the good and bad aspects of your engineering culture, says Ryan J. Salva, senior director of product management at Google. With strong processes, clear coding patterns, and well-defined best practices, these tools can shine. 

But if your development process is disorganized, they’ll only magnify the problems. It’s also essential to codify that institutional knowledge so the models can draw on it effectively. “A lot of work needs to be done to help build up context and get the tribal knowledge out of our heads,” he says.

The cryptocurrency exchange Coinbase has been vocal about its adoption of AI tools. CEO Brian Armstrong made headlines in August when he revealed that the company had fired staff unwilling to adopt AI tools. But Coinbase’s head of platform, Rob Witoff, tells MIT Technology Review that while they’ve seen massive productivity gains in some areas, the impact has been patchy. For simpler tasks like restructuring the code base and writing tests, AI-powered workflows have achieved speedups of up to 90%. But gains are more modest for other tasks, and the disruption caused by overhauling existing processes often counteracts the increased coding speed, says Witoff.

One factor is that AI tools let junior developers produce far more code. As in almost all engineering teams, this code has to be reviewed by others, normally more senior developers, to catch bugs and ensure it meets quality standards. But the sheer volume of code now being churned out is quickly saturating the ability of midlevel staff to review changes. “This is the cycle we’re going through almost every month, where we automate a new thing lower down in the stack, which brings more pressure higher up in the stack,” he says. “Then we’re looking at applying automation to that higher-up piece.”

Developers also spend only 20% to 40% of their time coding, says Jue Wang, a partner at Bain, so even a significant speedup there often translates to more modest overall gains. Developers spend the rest of their time analyzing software problems and dealing with customer feedback, product strategy, and administrative tasks. To get significant efficiency boosts, companies may need to apply generative AI to all these other processes too, says Wang, and that is still in the works.
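The arithmetic behind that point is straightforward. As a rough, hypothetical example (the figures below are illustrative, not Bain’s): if coding takes up 30% of a developer’s time and an AI tool makes that slice twice as fast, the overall gain is only about 18%.

```python
# Rough, hypothetical arithmetic; the figures are illustrative, not Bain's.
# If coding is 30% of a developer's time and AI doubles coding speed,
# the overall speedup is far smaller than 2x (Amdahl's-law-style reasoning).
coding_share = 0.30    # fraction of working time spent writing code
coding_speedup = 2.0   # assumed speedup on that coding portion

remaining_time = (1 - coding_share) + coding_share / coding_speedup
overall_speedup = 1 / remaining_time
print(f"Overall speedup: {overall_speedup:.2f}x")  # prints ~1.18x
```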

Rapid evolution

Programming with agents is a dramatic departure from previous working practices, though, so it’s not surprising companies are facing some teething issues. These are also very new products that are changing by the day. “Every couple months the model improves, and there’s a big step change in the model’s coding capabilities and you have to get recalibrated,” says Anthropic’s Cherny.

For example, in June Anthropic introduced a built-in planning mode to Claude; it has since been replicated by other providers. In October, the company also enabled Claude to ask users questions when it needs more context or faces multiple possible solutions, which Cherny says helps it avoid the tendency to simply assume which path is the best way forward.

Most significant, Anthropic has added features that make Claude better at managing its own context. When it nears the limits of its working memory, it summarizes key details and uses them to start a new context window, effectively giving it an “infinite” one, says Cherny. Claude can also invoke sub-agents to work on smaller tasks, so it no longer has to hold all aspects of the project in its own head. The company claims that its latest model, Claude Sonnet 4.5, can now code autonomously for more than 30 hours without major performance degradation.
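To make the idea concrete, here is a heavily simplified sketch of what a summarize-and-restart loop can look like. It is not Anthropic’s implementation; the llm() helper, the token estimate, and the thresholds are all invented for illustration.

```python
# Heavily simplified sketch of context compaction. NOT Anthropic's code:
# llm(), the token estimate, and the thresholds are hypothetical stand-ins.

CONTEXT_LIMIT_TOKENS = 200_000
COMPACT_AT = int(CONTEXT_LIMIT_TOKENS * 0.8)


def llm(prompt: str) -> str:
    """Hypothetical call to a language model; returns its text response."""
    raise NotImplementedError("stand-in for a real model API")


def estimate_tokens(text: str) -> int:
    """Crude estimate for the sketch: roughly four characters per token."""
    return len(text) // 4


def run_agent(task: str, max_steps: int) -> str:
    history = f"Task: {task}\n"
    for _ in range(max_steps):
        if estimate_tokens(history) > COMPACT_AT:
            # Compress the transcript into key decisions and open items,
            # then continue with that summary as the new working memory.
            summary = llm("Summarize this work, keeping key decisions "
                          f"and unfinished tasks:\n{history}")
            history = f"Summary of earlier work:\n{summary}\n"
        history += llm(f"{history}\nDo the next step and report the result.") + "\n"
    return history
```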

Novel approaches to software development could also sidestep coding agents’ other flaws. MIT professor Max Tegmark has introduced something he calls “vericoding,” which could allow agents to produce entirely bug-free code from a natural-language description. It builds on an approach known as “formal verification,” where developers create a mathematical model of their software that can prove incontrovertibly that it functions correctly. This approach is used in high-stakes areas like flight-control systems and cryptographic libraries, but it remains costly and time-consuming, limiting its broader use.

Rapid improvements in LLMs’ mathematical capabilities have opened up the tantalizing possibility of models that produce not only software but the mathematical proof that it’s bug free, says Tegmark. “You just give the specification, and the AI comes back with provably correct code,” he says. “You don’t have to touch the code. You don’t even have to ever look at the code.”

When tested on about 2,000 vericoding problems in Dafny—a language designed for formal verification—the best LLMs solved over 60%, according to non-peer-reviewed research by Tegmark’s group. This was achieved with off-the-shelf LLMs, and Tegmark expects that training specifically for vericoding could improve scores rapidly.
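To see what “provably correct code” means in practice, here is a toy example written in Lean 4 (Tegmark’s benchmark uses Dafny, so this illustrates the general idea rather than the benchmark itself). The program and a machine-checked proof that it meets its specification travel together; if the proof breaks, the code does not compile.

```lean
-- Toy illustration of a program shipped with a machine-checked proof.
-- (Illustrative only; the vericoding benchmark itself uses Dafny.)
def double (n : Nat) : Nat :=
  n + n

-- Specification: double really does multiply its input by two.
theorem double_spec (n : Nat) : double n = 2 * n := by
  unfold double
  omega
```

The appeal of vericoding is that a model could generate both halves, the program and the proof, so a human would not need to read the code to trust it.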

And counterintuitively, the speed at which AI generates code could actually ease maintainability concerns. Alex Worden, principal engineer at the business software giant Intuit, notes that maintenance is often difficult because engineers reuse components across projects, creating a tangle of dependencies where one change triggers cascading effects across the code base. Reusing code used to save developers time, but in a world where AI can produce hundreds of lines of code in seconds, that imperative has gone, says Worden.

Instead, he advocates for “disposable code,” where each component is generated independently by AI without regard for whether it follows design patterns or conventions. They are then connected via APIs—sets of rules that let components request information or services from each other. Each component’s inner workings are not dependent on other parts of the code base, making it possible to rip them out and replace them without wider impact, says Worden. 
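A minimal sketch of that design, with invented names: the calling code depends only on a narrow interface, so the AI-generated implementation behind it can be thrown away and regenerated without touching anything else. Real systems would typically draw the boundary at a network API rather than a single class, but the principle is the same.

```python
# Minimal sketch of "disposable" components behind a stable interface.
# Names are invented for illustration; callers depend only on the Protocol,
# so the implementation can be regenerated wholesale without wider impact.
from typing import Protocol


class TaxCalculator(Protocol):
    def tax_due(self, amount_cents: int) -> int:
        """Return the tax owed, in cents, for a given amount."""
        ...


class FlatRateCalculator:
    """One AI-generated implementation; could be replaced entirely tomorrow."""

    def __init__(self, rate: float) -> None:
        self.rate = rate

    def tax_due(self, amount_cents: int) -> int:
        return round(amount_cents * self.rate)


def invoice_total(amount_cents: int, calc: TaxCalculator) -> int:
    """Caller code: it only knows the interface, never the implementation."""
    return amount_cents + calc.tax_due(amount_cents)


print(invoice_total(10_000, FlatRateCalculator(rate=0.08)))  # prints 10800
```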

“The industry is still concerned about humans maintaining AI-generated code,” he says. “I question how long humans will look at or care about code.”

A narrowing talent pipeline

For the foreseeable future, though, humans will still need to understand and maintain the code that underpins their projects. And one of the most pernicious side effects of AI tools may be a shrinking pool of people capable of doing so. 

Early evidence suggests that fears around the job-destroying effects of AI may be justified. A recent Stanford University study found that employment among software developers aged 22 to 25 fell nearly 20% between 2022 and 2025, coinciding with the rise of AI-powered coding tools.

Experienced developers could face difficulties too. Luciano Nooijen, an engineer at the video-game infrastructure developer Companion Group, used AI tools heavily in his day job, where they were provided for free. But when he began a side project without access to those tools, he found himself struggling with tasks that previously came naturally. “I was feeling so stupid because things that used to be instinct became manual, sometimes even cumbersome,” says Nooijen.

Just as athletes still perform basic drills, he thinks the only way to maintain an instinct for coding is to regularly practice the grunt work. That’s why he’s largely abandoned AI tools, though he admits that deeper motivations are also at play. 

Part of the reason Nooijen and other developers MIT Technology Review spoke to are pushing back against AI tools is a sense that they are hollowing out the parts of their jobs that they love. “I got into software engineering because I like working with computers. I like making machines do things that I want,” Nooijen says. “It’s just not fun sitting there with my work being done for me.”

A brief history of Sam Altman’s hype

Each time you’ve heard a borderline outlandish idea of what AI will be capable of, it often turns out that Sam Altman was, if not the first to articulate it, at least the most persuasive and influential voice behind it. 

For more than a decade he has been known in Silicon Valley as a world-class fundraiser and persuader. OpenAI’s early releases around 2020 set the stage for a mania around large language models, and the launch of ChatGPT in November 2022 granted Altman a world stage on which to present his new thesis: that these models mirror human intelligence and could swing the doors open to a healthier and wealthier techno-utopia.


This story is part of MIT Technology Review’s Hype Correction package, a series that resets expectations about what AI is, what it makes possible, and where we go next.


Throughout, Altman’s words have set the agenda. He has framed a prospective superintelligent AI as either humanistic or catastrophic, depending on what effect he was hoping to create, what he was raising money for, or which tech giant seemed like his most formidable competitor at the moment. 

Examining Altman’s statements over the years reveals just how much his outlook has powered today’s AI boom. Even among Silicon Valley’s many hypesters, he’s been especially willing to speak about open questions—whether large language models contain the ingredients of human thought, whether language can also produce intelligence—as if they were already answered. 

What he says about AI is rarely provable when he says it, but it persuades us of one thing: This road we’re on with AI can go somewhere either great or terrifying, and OpenAI will need epic sums to steer it toward the right destination. In this sense, he is the ultimate hype man.

To understand how his voice has shaped our understanding of what AI can do, we read almost everything he’s ever said about the technology (we requested an interview with Altman, but he was not made available). 

His own words trace how we arrived here.

In conclusion … 

Altman didn’t dupe the world. OpenAI has ushered in a genuine tech revolution, with increasingly impressive language models that have attracted millions of users. Even skeptics would concede that LLMs’ conversational ability is astonishing.

But Altman’s hype has always hinged less on today’s capabilities than on a philosophical tomorrow—an outlook that quite handily doubles as a case for more capital and friendlier regulation. Long before large language models existed, he was imagining an AI powerful enough to require wealth redistribution, just as he imagined humanity colonizing other planets. Again and again, promises of a destination—abundance, superintelligence, a healthier and wealthier world—have come first, and the evidence second. 

Even if LLMs eventually hit a wall, there’s little reason to think his faith in a techno-utopian future will falter. The vision was never really about the particulars of the current model anyway. 

The AI doomers feel undeterred

It’s a weird time to be an AI doomer.

This small but influential community of researchers, scientists, and policy experts believes, in the simplest terms, that AI could get so good it could be bad—very, very bad—for humanity. Though many of these people would be more likely to describe themselves as advocates for AI safety than as literal doomsayers, they warn that AI poses an existential risk to humanity. They argue that absent more regulation, the industry could hurtle toward systems it can’t control. They commonly expect such systems to follow the creation of artificial general intelligence (AGI), a slippery concept generally understood as technology that can do whatever humans can do, and better. 


Though this is far from a universally shared perspective in the AI field, the doomer crowd has had some notable success over the past several years: helping shape AI policy coming from the Biden administration, organizing prominent calls for international “red lines” to prevent AI risks, and getting a bigger (and more influential) megaphone as some of its adherents win science’s most prestigious awards.

But a number of developments over the past six months have put them on the back foot. Talk of an AI bubble has overwhelmed the discourse as tech companies continue to invest in multiple Manhattan Projects’ worth of data centers without any certainty that future demand will match what they’re building. 

And then there was the August release of OpenAI’s latest foundation model, GPT-5, which proved something of a letdown. Maybe that was inevitable, since it was the most hyped AI release of all time; OpenAI CEO Sam Altman had boasted that GPT-5 felt “like a PhD-level expert” in every topic and told the podcaster Theo Von that the model was so good, it had made him feel “useless relative to the AI.” 

Many expected GPT-5 to be a big step toward AGI, but whatever progress the model may have made was overshadowed by a string of technical bugs and the company’s mystifying, quickly reversed decision to shut off access to every old OpenAI model without warning. And while the new model achieved state-of-the-art benchmark scores, many people felt, perhaps unfairly, that in day-to-day use GPT-5 was a step backward.

All this would seem to threaten some of the very foundations of the doomers’ case. In turn, a competing camp of AI accelerationists, who fear AI is actually not moving fast enough and that the industry is constantly at risk of being smothered by overregulation, is seeing a fresh chance to change how we approach AI safety (or, maybe more accurately, how we don’t). 

This is particularly true of the industry types who’ve decamped to Washington: “The Doomer narratives were wrong,” declared David Sacks, the longtime venture capitalist turned Trump administration AI czar. “This notion of imminent AGI has been a distraction and harmful and now effectively proven wrong,” echoed the White House’s senior policy advisor for AI and tech investor Sriram Krishnan. (Sacks and Krishnan did not reply to requests for comment.) 

(There is, of course, another camp in the AI safety debate: the group of researchers and advocates commonly associated with the label “AI ethics.” Though they also favor regulation, they tend to think the speed of AI progress has been overstated and have often written off AGI as a sci-fi story or a scam that distracts us from the technology’s immediate threats. But any potential doomer demise wouldn’t exactly give them the same opening the accelerationists are seeing.)

So where does this leave the doomers? As part of our Hype Correction package, we decided to ask some of the movement’s biggest names to see if the recent setbacks and general vibe shift had altered their views. Are they angry that policymakers no longer seem to heed their threats? Are they quietly adjusting their timelines for the apocalypse? 

Recent interviews with 20 people who study or advocate AI safety and governance—including Nobel Prize winner Geoffrey Hinton, Turing Award winner Yoshua Bengio, and high-profile experts like former OpenAI board member Helen Toner—reveal that rather than feeling chastened or lost in the wilderness, they’re still deeply committed to their cause, believing that AGI remains not just possible but incredibly dangerous.

At the same time, they seem to be grappling with a near contradiction. While they’re somewhat relieved that recent developments suggest AGI is further out than they previously thought (“Thank God we have more time,” says AI researcher Jeffrey Ladish), they also feel frustrated that some people in power are pushing policy against their cause (Daniel Kokotajlo, lead author of a cautionary forecast called “AI 2027,” says “AI policy seems to be getting worse” and calls the Sacks and Krishnan tweets “deranged and/or dishonest”).

Broadly speaking, these experts see the talk of an AI bubble as no more than a speed bump, and disappointment in GPT-5 as more distracting than illuminating. They still generally favor more robust regulation and worry that progress on policy—the implementation of the EU AI Act; the passage of the first major American AI safety bill, California’s SB 53; and new interest in AGI risk from some members of Congress—has become vulnerable as Washington overreacts to what doomers see as short-term failures to live up to the hype. 

Some were also eager to correct what they see as the most persistent misconceptions about the doomer world. Though their critics routinely mock them for predicting that AGI is right around the corner, they claim that’s never been an essential part of their case: It “isn’t about imminence,” says Berkeley professor Stuart Russell, the author of Human Compatible: Artificial Intelligence and the Problem of Control. Most people I spoke with say their timelines to dangerous systems have actually lengthened slightly in the last year—an important change given how quickly the policy and technical landscapes can shift. 

“If someone said there’s a four-mile-diameter asteroid that’s going to hit the Earth in 2067, we wouldn’t say, ‘Remind me in 2066 and we’ll think about it.’”

Many of them, in fact, emphasize the importance of changing timelines. And even if they are just a tad longer now, Toner tells me that one big-picture story of the ChatGPT era is the dramatic compression of these estimates across the AI world. For a long while, she says, AGI was expected in many decades. Now, for the most part, the predicted arrival is sometime in the next few years to 20 years. So even if we have a little bit more time, she (and many of her peers) continue to see AI safety as incredibly, vitally urgent. She tells me that if AGI were possible anytime in even the next 30 years, “It’s a huge fucking deal. We should have a lot of people working on this.”

So despite the precarious moment doomers find themselves in, their bottom line remains that no matter when AGI is coming (and, again, they say it’s very likely coming), the world is far from ready. 

Maybe you agree. Or maybe you think this future is far from guaranteed. Or that it’s the stuff of science fiction. You may even think AGI is a great big conspiracy theory. You’re not alone, of course—this topic is polarizing. But whatever you think about the doomer mindset, there’s no getting around the fact that these people have a lot of influence. So here are some of the most prominent people in the space, reflecting on this moment in their own words.

Interviews have been edited and condensed for length and clarity. 


The Nobel laureate who’s not sure what’s coming

Geoffrey Hinton, winner of the Turing Award and the Nobel Prize in physics for pioneering deep learning

The biggest change in the last few years is that there are people who are hard to dismiss who are saying this stuff is dangerous. Like, [former Google CEO] Eric Schmidt, for example, really recognized this stuff could be really dangerous. He and I were in China recently talking to someone on the Politburo, the party secretary of Shanghai, to make sure he really understood—and he did. I think in China, the leadership understands AI and its dangers much better because many of them are engineers.

I’ve been focused on the longer-term threat: When AIs get more intelligent than us, can we really expect that humans will remain in control or even relevant? But I don’t think anything is inevitable. There’s huge uncertainty on everything. We’ve never been here before. Anybody who’s confident they know what’s going to happen seems silly to me. I think this is very unlikely but maybe it’ll turn out that all the people saying AI is way overhyped are correct. Maybe it’ll turn out that we can’t get much further than the current chatbots—we hit a wall due to limited data. I don’t believe that. I think that’s unlikely, but it’s possible. 

I also don’t believe people like Eliezer Yudkowsky, who say if anybody builds it, we’re all going to die. We don’t know that. 

But if you go on the balance of the evidence, I think it’s fair to say that most experts who know a lot about AI believe it’s very probable that we’ll have superintelligence within the next 20 years. [Google DeepMind CEO] Demis Hassabis says maybe 10 years. Even [prominent AI skeptic] Gary Marcus would probably say, “Well, if you guys make a hybrid system with good old-fashioned symbolic logic … maybe that’ll be superintelligent.” [Editor’s note: In September, Marcus predicted AGI would arrive between 2033 and 2040.]

And I don’t think anybody believes progress will stall at AGI. I think more or less everybody believes a few years after AGI, we’ll have superintelligence, because the AGI will be better than us at building AI.

So while I think it’s clear that the winds are getting more difficult, simultaneously, people are putting in many more resources [into developing advanced AI]. I think progress will continue just because there’s many more resources going in.

The deep learning pioneer who wishes he’d seen the risks sooner

Yoshua Bengio, winner of the Turing Award, chair of the International AI Safety Report, and founder of LawZero

Some people thought that GPT-5 meant we had hit a wall, but that isn’t quite what you see in the scientific data and trends.

There have been people overselling the idea that AGI is tomorrow morning, which commercially could make sense. But if you look at the various benchmarks, GPT-5 is just where you would expect the models at that point in time to be. By the way, it’s not just GPT-5, it’s Claude and Google models, too. In some areas where AI systems weren’t very good, like Humanity’s Last Exam or FrontierMath, they’re getting much better scores now than they were at the beginning of the year.

At the same time, the overall landscape for AI governance and safety is not good. There’s a strong force pushing against regulation. It’s like climate change. We can put our head in the sand and hope it’s going to be fine, but it doesn’t really deal with the issue.

The biggest disconnect with policymakers is a misunderstanding of the scale of change that is likely to happen if the trend of AI progress continues. A lot of people in business and governments simply think of AI as just another technology that’s going to be economically very powerful. They don’t understand how much it might change the world if trends continue, and we approach human-level AI. 

Like many people, I had been blinding myself to the potential risks to some extent. I should have seen it coming much earlier. But it’s human. You’re excited about your work and you want to see the good side of it. That makes us a little bit biased in not really paying attention to the bad things that could happen.

Even a small chance—like 1% or 0.1%—of creating an accident where billions of people die is not acceptable. 

The AI veteran who believes AI is progressing—but not fast enough to prevent the bubble from bursting

Stuart Russell, distinguished professor of computer science, University of California, Berkeley, and author of Human Compatible

I hope the idea that talking about existential risk makes you a “doomer” or is “science fiction” comes to be seen as fringe, given that most leading AI researchers and most leading AI CEOs take it seriously. 

There have been claims that AI could never pass a Turing test, or you could never have a system that uses natural language fluently, or one that could parallel-park a car. All these claims just end up getting disproved by progress.

People are spending trillions of dollars to make superhuman AI happen. I think they need some new ideas, but there’s a significant chance they will come up with them, because many significant new ideas have happened in the last few years. 

My fairly consistent estimate for the last 12 months has been that there’s a 75% chance that those breakthroughs are not going to happen in time to rescue the industry from the bursting of the bubble. Because the investments are consistent with a prediction that we’re going to have much better AI that will deliver much more value to real customers. But if those predictions don’t come true, then there’ll be a lot of blood on the floor in the stock markets.

However, the safety case isn’t about imminence. It’s about the fact that we still don’t have a solution to the control problem. If someone said there’s a four-mile-diameter asteroid that’s going to hit the Earth in 2067, we wouldn’t say, “Remind me in 2066 and we’ll think about it.” We don’t know how long it takes to develop the technology needed to control superintelligent AI.

Looking at precedents, the acceptable level of risk for a nuclear plant melting down is about one in a million per year. Extinction is much worse than that. So maybe set the acceptable risk at one in a billion. But the companies are saying it’s something like one in five. They don’t know how to make it acceptable. And that’s a problem.

The professor trying to set the narrative straight on AI safety

David Krueger, assistant professor in machine learning at the University of Montreal and Yoshua Bengio’s Mila Institute, and founder of Evitable

I think people definitely overcorrected in their response to GPT-5. But there was hype. My recollection was that there were multiple statements from CEOs at various levels of explicitness who basically said that by the end of 2025, we’re going to have an automated drop-in replacement remote worker. But it seems like it’s been underwhelming, with agents just not really being there yet.

I’ve been surprised how much these narratives predicting AGI in 2027 capture the public attention. When 2027 comes around, if things still look pretty normal, I think people are going to feel like the whole worldview has been falsified. And it’s really annoying how often when I’m talking to people about AI safety, they assume that I think we have really short timelines to dangerous systems, or that I think LLMs or deep learning are going to give us AGI. They ascribe all these extra assumptions to me that aren’t necessary to make the case. 

I’d expect we need decades for the international coordination problem. So even if dangerous AI is decades off, it’s already urgent. That point seems really lost on a lot of people. There’s this idea of “Let’s wait until we have a really dangerous system and then start governing it.” Man, that is way too late.

I still think people in the safety community tend to work behind the scenes, with people in power, not really with civil society. It gives ammunition to people who say it’s all just a scam or insider lobbying. That’s not to say that there’s no truth to these narratives, but the underlying risk is still real. We need more public awareness and a broad base of support to have an effective response.

If you actually believe there’s a 10% chance of doom in the next 10 years—which I think a reasonable person should, if they take a close look—then the first thing you think is: “Why are we doing this? This is crazy.” That’s just a very reasonable response once you buy the premise.

The governance expert worried about AI safety’s credibility

Helen Toner, acting executive director of Georgetown University’s Center for Security and Emerging Technology and former OpenAI board member

When I got into the space, AI safety was more of a set of philosophical ideas. Today, it’s a thriving set of subfields of machine learning, filling in the gulf between some of the more “out there” concerns about AI scheming, deception, or power-seeking and real concrete systems we can test and play with. 

“I worry that some aggressive AGI timeline estimates from some AI safety people are setting them up for a boy-who-cried-wolf moment.”

AI governance is improving slowly. If we have lots of time to adapt and governance can keep improving slowly, I feel not bad. If we don’t have much time, then we’re probably moving too slow.

I think GPT-5 is generally seen as a disappointment in DC. There’s a pretty polarized conversation around: Are we going to have AGI and superintelligence in the next few years? Or is AI actually just totally all hype and useless and a bubble? The pendulum had maybe swung too far toward “We’re going to have super-capable systems very, very soon.” And so now it’s swinging back toward “It’s all hype.”

I worry that some aggressive AGI timeline estimates from some AI safety people are setting them up for a boy-who-cried-wolf moment. When the predictions about AGI coming in 2027 don’t come true, people will say, “Look at all these people who made fools of themselves. You should never listen to them again.” That’s not the intellectually honest response, if maybe they later changed their mind, or their take was that they only thought it was 20 percent likely and they thought that was still worth paying attention to. I think that shouldn’t be disqualifying for people to listen to you later, but I do worry it will be a big credibility hit. And that’s applying to people who are very concerned about AI safety and never said anything about very short timelines.

The AI security researcher who now believes AGI is further out—and is grateful

Jeffrey Ladish, executive director at Palisade Research

In the last year, two big things updated my AGI timelines. 

First, the lack of high-quality data turned out to be a bigger problem than I expected. 

Second, the first “reasoning” model, OpenAI’s o1 in September 2024, showed reinforcement learning scaling was more effective than I thought it would be. And then months later, you see the o1 to o3 scale-up and you see pretty crazy impressive performance in math and coding and science—domains where it’s easier to sort of verify the results. But while we’re seeing continued progress, it could have been much faster.

All of this bumps up my median estimate to the start of fully automated AI research and development from three years to maybe five or six years. But those are kind of made up numbers. It’s hard. I want to caveat all this with, like, “Man, it’s just really hard to do forecasting here.”

Thank God we have more time. We have a possibly very brief window of opportunity to really try to understand these systems before they are capable and strategic enough to pose a real threat to our ability to control them.

But it’s scary to see people think that we’re not making progress anymore when that’s clearly not true. I just know it’s not true because I use the models. One of the downsides of the way AI is progressing is that how fast it’s moving is becoming less legible to normal people. 

Now, this is not true in some domains—like, look at Sora 2. It is so obvious to anyone who looks at it that Sora 2 is vastly better than what came before. But if you ask GPT-4 and GPT-5 why the sky is blue, they’ll give you basically the same answer. It is the correct answer. It’s already saturated the ability to tell you why the sky is blue. So the people who I expect to most understand AI progress right now are the people who are actually building with AIs or using AIs on very difficult scientific problems.

The AGI forecaster who saw the critics coming

Daniel Kokotajlo, executive director of the AI Futures Project; an OpenAI whistleblower; and lead author of “AI 2027,” a vivid scenario where—starting in 2027—AIs progress from “superhuman coders” to “wildly superintelligent” systems in the span of months

AI policy seems to be getting worse, like the “Pro-AI” super PAC [launched earlier this year by executives from OpenAI and Andreessen Horowitz to lobby for a deregulatory agenda], and the deranged and/or dishonest tweets from Sriram Krishnan and David Sacks. AI safety research is progressing at the usual pace, which is excitingly rapid compared to most fields, but slow compared to how fast it needs to be.

We said on the first page of “AI 2027” that our timelines were somewhat longer than 2027. So even when we launched AI 2027, we expected there to be a bunch of critics in 2028 triumphantly saying we’ve been discredited, like the tweets from Sacks and Krishnan. But we thought, and continue to think, that the intelligence explosion will probably happen sometime in the next five to 10 years, and that when it does, people will remember our scenario and realize it was closer to the truth than anything else available in 2025. 

Predicting the future is hard, but it’s valuable to try; people should aim to communicate their uncertainty about the future in a way that is specific and falsifiable. This is what we’ve done and very few others have done. Our critics mostly haven’t made predictions of their own and often exaggerate and mischaracterize our views. They say our timelines are shorter than they are or ever were, or they say we are more confident than we are or were.

I feel pretty good about having longer timelines to AGI. It feels like I just got a better prognosis from my doctor. The situation is still basically the same, though.

This story has been updated to clarify some of Kokotajlo’s views on AI policy.

Garrison Lovely is a freelance journalist and the author of Obsolete, an online publication and forthcoming book on the discourse, economics, and geopolitics of the race to build machine superintelligence (out spring 2026). His writing on AI has appeared in the New York Times, Nature, Bloomberg, Time, the Guardian, The Verge, and elsewhere.

The great AI hype correction of 2025

Some disillusionment was inevitable. When OpenAI released a free web app called ChatGPT in late 2022, it changed the course of an entire industry—and several world economies. Millions of people started talking to their computers, and their computers started talking back. We were enchanted, and we expected more.

We got it. Technology companies scrambled to stay ahead, putting out rival products that outdid one another with each new release: voice, images, video. With nonstop one-upmanship, AI companies have presented each new product drop as a major breakthrough, reinforcing a widespread faith that this technology would just keep getting better. Boosters told us that progress was exponential. They posted charts plotting how far we’d come since last year’s models: Look how the line goes up! Generative AI could do anything, it seemed.

Well, 2025 has been a year of reckoning. 


For a start, the heads of the top AI companies made promises they couldn’t keep. They told us that generative AI would replace the white-collar workforce, bring about an age of abundance, make scientific discoveries, and help find new cures for disease. FOMO across the world’s economies, at least in the Global North, made CEOs tear up their playbooks and try to get in on the action.

That’s when the shine started to come off. Though the technology may have been billed as a universal multitool that could revamp outdated business processes and cut costs, a number of studies published this year suggest that firms are failing to make the AI pixie dust work its magic. Surveys and trackers from a range of sources, including the US Census Bureau and Stanford University, have found that business uptake of AI tools is stalling. And when the tools do get tried out, many projects stay stuck in the pilot stage. Without broad buy-in across the economy it is not clear how the big AI companies will ever recoup the incredible amounts they’ve already spent in this race. 

At the same time, updates to the core technology are no longer the step changes they once were.

The highest-profile example of this was the botched launch of GPT-5 in August. Here was OpenAI, the firm that had ignited (and to a large extent sustained) the current boom, set to release a brand-new generation of its technology. OpenAI had been hyping GPT-5 for months: “PhD-level expert in anything,” CEO Sam Altman crowed. On another occasion Altman posted, without comment, an image of the Death Star from Star Wars, which OpenAI stans took to be a symbol of ultimate power: Coming soon! Expectations were huge.

And yet, when it landed, GPT-5 seemed to be—more of the same? What followed was the biggest vibe shift since ChatGPT first appeared three years ago. “The era of boundary-breaking advancements is over,” Yannic Kilcher, an AI researcher and popular YouTuber, announced in a video posted two days after GPT-5 came out: “AGI is not coming. It seems very much that we’re in the Samsung Galaxy era of LLMs.”

A lot of people (me included) have made the analogy with phones. For a decade or so, smartphones were the most exciting consumer tech in the world. Today, new products drop from Apple or Samsung with little fanfare. While superfans pore over small upgrades, to most people this year’s iPhone now looks and feels a lot like last year’s iPhone. Is that where we are with generative AI? And is it a problem? Sure, smartphones have become the new normal. But they changed the way the world works, too.

To be clear, the last few years have been filled with genuine “Wow” moments, from the stunning leaps in the quality of video generation models to the problem-solving chops of so-called reasoning models to the world-class competition wins of the latest coding and math models. But this remarkable technology is only a few years old, and in many ways it is still experimental. Its successes come with big caveats.

Perhaps we need to readjust our expectations.

The big reset

Let’s be careful here: The pendulum from hype to anti-hype can swing too far. It would be rash to dismiss this technology just because it has been oversold. The knee-jerk response when AI fails to live up to its hype is to say that progress has hit a wall. But that misunderstands how research and innovation in tech work. Progress has always moved in fits and starts. There are ways over, around, and under walls.

Take a step back from the GPT-5 launch. It came hot on the heels of a series of remarkable models that OpenAI had shipped in the previous months, including o1 and o3 (first-of-their-kind reasoning models that introduced the industry to a whole new paradigm) and Sora 2, which raised the bar for video generation once again. That doesn’t sound like hitting a wall to me.

AI is really good! Look at Nano Banana Pro, the new image generation model from Google DeepMind that can turn a book chapter into an infographic, and much more. It’s just there—for free—on your phone.

And yet you can’t help but wonder: When the wow factor is gone, what’s left? How will we view this technology a year or five from now? Will we think it was worth the colossal costs, both financial and environmental? 

With that in mind, here are four ways to think about the state of AI at the end of 2025: The start of a much-needed hype correction.

01: LLMs are not everything

In some ways, it is the hype around large language models, not AI as a whole, that needs correcting. It has become obvious that LLMs are not the doorway to artificial general intelligence, or AGI, a hypothetical technology that some insist will one day be able to do any (cognitive) task a human can.

Even an AGI evangelist like Ilya Sutskever, chief scientist and cofounder at the AI startup Safe Superintelligence and former chief scientist and cofounder at OpenAI, now highlights the limitations of LLMs, a technology he had a huge hand in creating. LLMs are very good at learning how to do a lot of specific tasks, but they do not seem to learn the principles behind those tasks, Sutskever said in an interview with Dwarkesh Patel in November.

It’s the difference between learning how to solve a thousand different algebra problems and learning how to solve any algebra problem. “The thing which I think is the most fundamental is that these models somehow just generalize dramatically worse than people,” Sutskever said.

It’s easy to imagine that LLMs can do anything because their use of language is so compelling. It is astonishing how well this technology can mimic the way people write and speak. And we are hardwired to see intelligence in things that behave in certain ways—whether it’s there or not. In other words, we have built machines with humanlike behavior and cannot resist seeing a humanlike mind behind them.

That’s understandable. LLMs have been part of mainstream life for only a few years. But in that time, marketers have preyed on our shaky sense of what the technology can really do, pumping up expectations and turbocharging the hype. As we live with this technology and come to understand it better, those expectations should fall back down to earth.  

02: AI is not a quick fix to all your problems

In July, researchers at MIT published a study that became a tentpole talking point in the disillusionment camp. The headline result was that a whopping 95% of businesses that had tried using AI had found zero value in it.  

The general thrust of that claim was echoed by other research, too. In November, a study by researchers at Upwork, a company that runs an online marketplace for freelancers, found that agents powered by top LLMs from OpenAI, Google DeepMind, and Anthropic failed to complete many straightforward workplace tasks by themselves.

This is miles off Altman’s prediction: “We believe that, in 2025, we may see the first AI agents ‘join the workforce’ and materially change the output of companies,” he wrote on his personal blog in January.

But what gets missed in that MIT study is that the researchers’ measure of success was pretty narrow. That 95% failure rate accounts for companies that had tried to implement bespoke AI systems but had not yet scaled them beyond the pilot stage after six months. It shouldn’t be too surprising that a lot of experiments with experimental technology don’t pan out straight away.

That number also does not include the use of LLMs by employees outside of official pilots. The MIT researchers found that around 90% of the companies they surveyed had a kind of AI shadow economy where workers were using personal chatbot accounts. But the value of that shadow economy was not measured.  

When the Upwork study looked at how well agents completed tasks together with people who knew what they were doing, success rates shot up. The takeaway seems to be that a lot of people are figuring out for themselves how AI might help them with their jobs.

That fits with something the AI researcher and influencer (and coiner of the term “vibe coding”) Andrej Karpathy has noted: Chatbots are better than the average human at a lot of different things (think of giving legal advice, fixing bugs, doing high school math), but they are not better than an expert human. Karpathy suggests this may be why chatbots have proved popular with individual consumers, helping non-experts with everyday questions and tasks, but they have not upended the economy, which would require outperforming skilled employees at their jobs.

That may change. For now, don’t be surprised that AI has not (yet) had the impact on jobs that boosters said it would. AI is not a quick fix, and it cannot replace humans. But there’s a lot to play for. The ways in which AI could be integrated into everyday workflows and business pipelines are still being tried out.   

03: Are we in a bubble? (If so, what kind of bubble?)

If AI is a bubble, is it like the subprime mortgage bubble of 2008 or the internet bubble of 2000? Because there’s a big difference.

The subprime bubble wiped out a big part of the economy, because when it burst it left nothing behind except debt and overvalued real estate. The dot-com bubble wiped out a lot of companies, which sent ripples across the world, but it left behind the infant internet—an international network of cables and a handful of startups, like Google and Amazon, that became the tech giants of today.  

Then again, maybe we’re in a bubble unlike either of those. After all, there’s no real business model for LLMs right now. We don’t yet know what the killer app will be, or if there will even be one. 

And many economists are concerned about the unprecedented amounts of money being sunk into the infrastructure required to build capacity and serve the projected demand. But what if that demand doesn’t materialize? Add to that the weird circularity of many of those deals—with Nvidia paying OpenAI to pay Nvidia, and so on—and it’s no surprise everybody’s got a different take on what’s coming. 

Some investors remain sanguine. In an interview with the Technology Business Programming Network podcast in November, Glenn Hutchins, cofounder of Silver Lake Partners, a major international private equity firm, gave a few reasons not to worry. “Every one of these data centers—almost all of them—has a solvent counterparty that is contracted to take all the output they’re built to suit,” he said. In other words, it’s not a case of “Build it and they’ll come”—the customers are already locked in. 

And, he pointed out, one of the biggest of those solvent counterparties is Microsoft. “Microsoft has the world’s best credit rating,” Hutchins said. “If you sign a deal with Microsoft to take the output from your data center, Satya is good for it.”

Many CEOs will be looking back at the dot-com bubble and trying to learn its lessons. Here’s one way to see it: The companies that went bust back then didn’t have the money to last the distance. Those that survived the crash thrived.

With that lesson in mind, AI companies today are trying to pay their way through what may or may not be a bubble. Stay in the race; don’t get left behind. Even so, it’s a desperate gamble.

But there’s another lesson too. Companies that might look like sideshows can turn into unicorns fast. Take Synthesia, which makes avatar generation tools for businesses. Nathan Benaich, cofounder of the VC firm Air Street Capital, admits that when he first heard about the company a few years ago, back when fear of deepfakes was rife, he wasn’t sure what its tech was for and thought there was no market for it.

“We didn’t know who would pay for lip-synching and voice cloning,” he says. “Turns out there’s a lot of people who wanted to pay for it.” Synthesia now has around 55,000 corporate customers and brings in around $150 million a year. In October, the company was valued at $4 billion.

04: ChatGPT was not the beginning, and it won’t be the end

ChatGPT was the culmination of a decade’s worth of progress in deep learning, the technology that underpins all of modern AI. The seeds of deep learning itself were planted in the 1980s. The field as a whole goes back at least to the 1950s. If progress is measured against that backdrop, generative AI has barely got going.

Meanwhile, research is at a fever pitch. There are more high-quality submissions to the world’s major AI conferences than ever before. This year, organizers of some of those conferences resorted to turning down papers that reviewers had already approved, just to manage numbers. (At the same time, preprint servers like arXiv have been flooded with AI-generated research slop.)

“It’s back to the age of research again,” Sutskever said in that Dwarkesh interview, talking about the current bottleneck with LLMs. That’s not a setback; that’s the start of something new.

“There’s always a lot of hype beasts,” says Benaich. But he thinks there’s an upside to that: Hype attracts the money and talent needed to make real progress. “You know, it was only like two or three years ago that the people who built these models were basically research nerds that just happened on something that kind of worked,” he says. “Now everybody who’s good at anything in technology is working on this.”

Where do we go from here?

The relentless hype hasn’t come just from companies drumming up business for their vastly expensive new technologies. There’s a large cohort of people—inside and outside the industry—who want to believe in the promise of machines that can read, write, and think. It’s a wild decades-old dream.

But the hype was never sustainable—and that’s a good thing. We now have a chance to reset expectations and see this technology for what it really is—assess its true capabilities, understand its flaws, and take the time to learn how to apply it in valuable (and beneficial) ways. “We’re still trying to figure out how to invoke certain behaviors from this insanely high-dimensional black box of information and skills,” says Benaich.

This hype correction was long overdue. But know that AI isn’t going anywhere. We don’t even fully understand what we’ve built so far, let alone what’s coming next.