SolCrys Logo

Strategy & Positioning

Your website has a third audience now — agents

Most product teams design for two audiences: humans (your users) and search engines (Google, Bing). In 2026 there's a third: AI agents — browser agents that operate websites on a user's behalf, voice agents that summarize what they read, MCP-connected tools that act through your application. They see what your real users see, your screen-reader users see, and your search crawlers see — but the failure modes are different in each case. This essay translates Google's web.dev guidance on agent-friendly websites into product decisions for B2B SaaS, with concrete UI examples, the five rules that move the needle, and the anti-patterns that quietly break agent usability without affecting human users.

Updated 2026-05-17

Questions this guide answers

  • How do I make my website agent-friendly?
  • What is an agent-friendly UI?
  • How do AI agents read websites?
  • What does Google say about agent-friendly websites?

Direct answer

Your product UI now serves three audiences. The first is humans — the users you've been designing for since you started. The second is search crawlers — the audience SEO has trained product and engineering teams to accommodate for two decades. The third, new in 2026, is agents: AI systems that operate websites on a user's behalf, summarize what they see, or call into your application through MCP and similar protocols.

Agent-friendly UX is not a new design discipline. It's the same accessibility and clarity practices that benefit screen-reader users, with one extra dimension: agents that work by screenshot rather than DOM see what's visually rendered; agents that work by accessibility tree see structural intent; agents that work by DOM see what's structurally present. The three views diverge in ways your human users never see, and that divergence is where agent UX breaks.

The three views agents read your UI through

Different agents read your UI different ways. Today's production agents tend to use one of three approaches, often in combination.

Visual (screenshot-based)

The agent takes a screenshot, identifies UI elements visually, and acts on what it sees. This is closest to how a human uses your product. Failure modes: text rendered inside an image, charts without titles, low-contrast labels, ambiguous icons without text — all of these are invisible or unreliable to the agent in the same ways they would be to a human with low vision.

Structural (DOM-based)

The agent walks the DOM, finds elements by HTML semantics or class names, and acts on the underlying structure. This is closest to how a search crawler reads your page. Failure modes: clickable `<div>` elements with `onclick` handlers instead of proper `<button>` elements; custom interactive widgets without ARIA roles; state changes signaled only by CSS classes the DOM doesn't reflect; content rendered into `<canvas>` elements the DOM can't introspect.

Semantic (accessibility-tree-based)

The agent reads the browser's accessibility tree, the same data path screen readers use. This gives the agent intent labels — "this is a primary submit button" — rather than just visual or structural data. Failure modes: missing accessible names on buttons, form inputs without associated labels, decorative images marked as content, modal dialogs without a `role="dialog"` attribute or proper focus management.

The five rules that move the needle

Google's web.dev guidance on building agent-friendly websites is straightforward: agent UX is accessibility and UX, applied with discipline. The five rules below capture most of the practical value. They are not new rules; they are the rules accessibility teams have been asking for since the W3C ARIA spec was published. Agents are the new constituency that makes the rules procurement-relevant.

  • Use real semantic HTML for interactive elements. Buttons should be `<button>` tags, links should be `<a>` tags, and inputs should have associated labels. A clickable `<div>` with an `onclick` handler may look right visually but carries no semantic role information — and will fail across all three agent views.
  • Give every interactive element an accessible name. The visible label is often enough; when it's not (icon-only buttons, ambiguous controls), an `aria-label` or `aria-labelledby` carries the intent. "Submit" beats "Click here"; both beat an icon with no text.
  • Make state visible in the DOM, not just in CSS. If a button is disabled, use `disabled` or `aria-disabled`; if a toggle is on, use `aria-pressed` or the equivalent. State that lives only in CSS class names is invisible to agents that read the DOM or accessibility tree.
  • Keep layout stable after first paint. Layout shift moves agent targets between the time the agent identifies them and the time it tries to click them, and the agent fails. The Core Web Vitals metric CLS (Cumulative Layout Shift) was designed for humans, but it captures the same failure mode.
  • Surface operations as visible affordances, not as hidden gestures. If an important action only happens on right-click or on a multi-touch gesture, agents that don't simulate those won't find it. Provide an explicit button or menu entry as the primary path; gestures can be a shortcut.

The anti-patterns that quietly break agent UX

Three patterns are common in modern B2B SaaS UIs that look fine to human users and quietly fail to agents.

1. Drag-and-drop without keyboard equivalents

Agents that operate via DOM or accessibility tree generally can't simulate mouse drag operations reliably. If your only way to reorder a list is drag-and-drop, an agent acting on a user request to reorder cannot complete the task. Provide a keyboard-equivalent or button-driven alternative.

2. Content hidden behind hover-only tooltips

If an important data field — a sample, a confidence interval, a definition — is visible only on hover, screenshot-based agents may not capture it and DOM-based agents may not find it without simulating hover. Move important content out of hover-only states or make it focusable via keyboard.

3. State implied by visual proximity

"The currently selected row is the one with the slightly darker background" is fine for humans, who notice. Agents reading the DOM see no `aria-selected` and no class name that says "selected"; they have to infer from visual styling, which is brittle. Mark selection state explicitly.

A one-week audit you can run

Five days, one rule per day, applied to the three highest-traffic flows in your product. End of week, you have an inventory of where agent UX fails and a prioritized list of fixes.

  • Day 1: Walk every clickable element in the priority flows. Inventory any rendered as a `<div>` instead of a `<button>` or an `<a>`. Replace.
  • Day 2: For every input field, confirm a visible or aria-associated label. For every button, confirm an accessible name. Fix gaps.
  • Day 3: For every stateful component (toggle, tab, selected row, disabled button), confirm DOM-visible state. Replace CSS-only state.
  • Day 4: Run the priority flows in a low-bandwidth Chrome window and watch for layout shift. Pin elements that shift.
  • Day 5: Audit hover-only and gesture-only interactions. Add visible alternatives for any operation that's currently gesture-only.

What you don't have to do

A few things tend to come up in agent-readiness conversations that you can safely skip. Google's May 2026 guide is explicit: there's no agent-specific schema, no special markup file, no agent-friendly content variant. The accessibility and clarity work above is what agents need. "Build a separate agent UI" is a category error.

Similarly, you don't need to rewrite content for agents. Agents read your product UI; the user's request flows through them. The content quality bar is the same one your human users hold you to.

Why this matters now

Agentic AI is moving from research demos to production in 2026. Anthropic's Claude operates browsers; OpenAI's Operator agent is generally available; the Apps SDK shipped; Google's Gemini agent capability has its own footprint. The number of times your product is operated by an agent on a user's behalf has gone from approximately zero in 2024 to a measurable share in mid-2026, and is growing.

If you wait until the share is large to start, you will have a backlog of work to fix and customers who already churned because their agent couldn't operate your product. The rules above are not expensive to apply early; they are expensive to retrofit later.

About the author

Eason Wang is Co-Founder & CPO of SolCrys, the AEO operating system for brands and agencies competing for visibility, citations, and recommendations across the major AI engines. PhD in Machine Learning. Eighteen-plus years building enterprise products; started at Microsoft Research Asia. Current focus: agentic AI workflows in production. Connect on LinkedIn.

Written May 2026 after a product review where SolCrys's own UI failed agent UX in a way our human users had never noticed. The rules above are the ones we run our product reviews against now.

FAQ

Is agent-friendly UX really just accessibility?

Mostly yes, plus a few things accessibility didn't historically prioritize — layout stability under varied screen sizes, DOM-visible state for components that look fine on screen, explicit affordances for operations that humans do with gestures. If your product is fully accessible at the WCAG 2.2 AA level, agent UX is probably 80% there. The remaining 20% is the layout-shift and gesture-equivalent work above.

Do agents really act on websites at any meaningful scale yet?

Yes, and growing. Anthropic, OpenAI, and Google all ship agent capabilities to their consumer products in 2026, and enterprise customers are running production agent workflows through MCP and similar protocols. Volumes are still small compared to human users for most products, but the curve is steep enough that planning for the year-out state matters.

What about MCP and direct API integrations — do those replace the UI-level work?

MCP and tool-use APIs are a parallel path: an agent that calls your product through MCP doesn't need to operate your UI. But not every interaction will be tool-use-shaped, and not every product can expose every operation as a tool. The UI-level agent readiness work is the broader-coverage path; MCP is a focused optimization for the highest-frequency workflows.

Does this conflict with the no-AI-specific-content rules from Google's guide?

No. Google's guide is about not creating AI-specific content variants, AI-specific schema, or AI-specific files. The agent-readiness work in this essay is about product UI clarity that benefits humans first and agents second — exactly the framing Google's own guidance uses for agent-friendly websites.

Related guides

Strategy & Positioning

Production Is Cheap. Trust Is Scarce.

AI made producing brand content nearly free, but made being cited, recommended, and trusted by AI engines radically more scarce and concentrated. A field note from SolCrys Co-Founder & CPO Eason Wang on what changes when production stops being the marketing bottleneck — and what the new substrate is.

Free AI visibility audit

Find out where your brand is missing, miscited, or misrepresented.

SolCrys maps high-intent prompts to mentions, citations, answer accuracy, and content gaps so your team can prioritize the next pages to ship.

Get a free audit