Bot intelligence record

Google-Extended

Usually allow

Use the Google-Extended identifier to separate Google AI crawler, assistant, or AI search traffic from normal visitor requests in server logs.

Ai Ai Assistant Official Documented Confidence: High Verified: Yes robots.txt: Yes
Operator
Google
Family
Google
Type
Ai
Source type
Official
Last checked
2026-05-20

User-Agent Pattern

Google
Google-Extended
Verification note

User-agent strings are identification signals, not proof of identity. Confirm important allow, block, or rate-limit decisions with logs, DNS or IP evidence, request behavior, or operator documentation when available.

Robots.txt Snippet

Click snippet to copy
User-agent: Google-Extended Disallow: /

Click the snippet to copy it, or highlight the text manually.

Handling Guidance

Depends

This bot is usually safe to allow when the request source is verified and the traffic matches your site policy.

AI crawling, assistant retrieval, AI search, or model-supporting discovery depending on the user-agent.

Record Details

Structured data
Operator
Google
Family
Google
Type
Ai
Purpose
Ai Assistant
Identity type
Official Documented
Confidence
High
Last verified
2026-04-01
Last checked
2026-05-20
Source type
Official
Verification
Control token only; it relies on existing Google crawler traffic rather than a separate HTTP user-agent.
Spoofing risk
User-agent strings for Google-Extended can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.

Notes

Google-Extended is listed in the Botcrawl directory as an AI control token from Google. The primary identifier for log review is Google-Extended.

Identification

  • User-agent pattern: Google-Extended
  • Family: Google
  • Type: AI
  • Kind: Control Token

Common use

AI crawling, assistant retrieval, AI search, or model-supporting discovery depending on the user-agent.

Verification and handling

Control token only; it relies on existing Google crawler traffic rather than a separate HTTP user-agent.

Directory guidance marks the risk level as Safe and the blocking decision as Depends. Do not rely on the user-agent string alone because user-agent strings can be copied or spoofed.

Robots.txt handling: Yes.

Evidence and Source

  • Control token only; it relies on existing Google crawler traffic rather than a separate HTTP user-agent.
  • Match `Google-Extended` as a case-insensitive substring in HTTP user-agent logs. Review bot_aliases for alternate names or product labels. Do not treat a user-agent match alone as proof of identity for allow-listing.
  • AI crawling, assistant retrieval, AI search, or model-supporting discovery depending on the user-agent.
  • User-agent strings for Google-Extended can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.

Monitor This Bot In Edge

Botcrawl Edge

Use Botcrawl Edge to see matching traffic, create allow or block rules, and control this bot across connected sites.