Google-Extended

Safe to block: Depends. Verified: Yes. Respects robots.txt: Yes.

Google control token used to manage whether crawled content may be used for Gemini model training and grounding.

Key facts

Operator: Google
Family: Google
Purpose: Control Token
User-Agent: Google-Extended
Should you block it? Depends
Verified: Yes
Respects robots.txt: Yes
Identity type: Official Documented
Confidence: High
Last verified: 2026-04-01
Last checked: 2026-04-01

Bot details

Identity

User-Agent: Google-Extended
robots.txt token: Google-Extended
Aliases: Google AI opt-out token

Ownership

Operator: Google
Family: Google
Type: AI
Purpose: Control Token

Verification and trust

Source type: Official
Confidence: High
Last verified: 2026-04-01
Last checked: 2026-04-01
Verification: Control token only; it relies on existing Google crawler traffic rather than a separate HTTP user-agent.

Blocking and detection

Robots

User-agent: Google-Extended
Disallow: /
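
To sanity-check the rule before deploying it, the sketch below parses the same two lines with Python's standard-library urllib.robotparser and asks whether each token may fetch a page. The helper name is_allowed and the example.com URL are illustrative assumptions, and Python's parser only approximates how Google interprets robots.txt, so treat the result as a rough check rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rule from this page, blocking only the Google-Extended token.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /
"""


def is_allowed(robots_txt: str, token: str, url: str) -> bool:
    """Return True if `token` may fetch `url` under the given robots.txt text.

    Hypothetical helper for illustration; it uses Python's parser, not Google's.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(token, url)


if __name__ == "__main__":
    url = "https://example.com/article"  # placeholder URL
    print("Google-Extended allowed:", is_allowed(ROBOTS_TXT, "Google-Extended", url))
    print("Googlebot allowed:", is_allowed(ROBOTS_TXT, "Googlebot", url))
```

Under this parser the Google-Extended check comes back disallowed while Googlebot stays allowed, which matches the note that the token does not affect Search crawling.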

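Note: because Google-Extended is a control token and not an HTTP user-agent, the user-agent rules below are not expected to match real Google crawler requests. The robots.txt rule above is the effective control.

Cloudflare

(http.user_agent contains "Google-Extended")

Advanced server rules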

Apache

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Google\-Extended [NC]
RewriteRule ^ - [F,L]

Nginx

if ($http_user_agent ~* "Google-Extended") { return 403; }

Notes

Google-Extended is a standalone product token, not a separate HTTP crawler. Google says publishers can use it to manage whether content crawled from their sites may be used for future Gemini model training and for grounding in certain Google AI products.

Blocking Google-Extended does not itself block Google Search crawling.
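
Because the token is not a separate HTTP crawler, user-agent matching of the kind shown in the Cloudflare, Apache, and Nginx rules is unlikely to catch anything in practice. The sketch below illustrates this with a case-insensitive pattern equivalent to those rules, tested against Googlebot's documented desktop user-agent string. The function name rule_matches is an assumption made for this example.

```python
import re

# Case-insensitive match on the token, mirroring the server rules on this page.
PATTERN = re.compile(r"Google-Extended", re.IGNORECASE)

# Googlebot's documented user-agent string; it does not contain the token.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def rule_matches(user_agent: str) -> bool:
    """Return True if a user-agent rule on "Google-Extended" would fire.

    Hypothetical helper for illustration only.
    """
    return PATTERN.search(user_agent) is not None


if __name__ == "__main__":
    print("Matches Googlebot UA:", rule_matches(GOOGLEBOT_UA))
    print("Matches literal token:", rule_matches("Google-Extended"))
```

The pattern fires only on a header that literally contains the token, which Google's crawlers do not send, so the robots.txt rule remains the control that matters.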
