ProjectShield Url Check
Project Shield, created by Google Cloud and Jigsaw and powered by Google Cloud Armor, provides free unlimited protection against DDoS attacks, a type of digital attack…
Google control token used to manage whether crawled content may be used for Gemini model training and grounding.
User-agent: Google-Extended
Disallow: /
(http.user_agent contains "Google-Extended")
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Google\-Extended [NC]
RewriteRule ^ - [F,L]
if ($http_user_agent ~* "Google-Extended") { return 403; }
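The Apache and nginx rules above both reduce to a case-insensitive search for the token inside the User-Agent request header. As a rough sketch of that matching logic, the Python below can be used to check which header values such a rule would catch. The function name `matches_google_extended` is hypothetical, and the second sample header is invented for illustration.

```python
import re

# Hypothetical helper mirroring the Apache [NC] and nginx ~* rules:
# a case-insensitive search for the token anywhere in the User-Agent header.
_PATTERN = re.compile(r"Google-Extended", re.IGNORECASE)


def matches_google_extended(user_agent: str) -> bool:
    """Return True if a rule like the ones above would match this header."""
    return _PATTERN.search(user_agent) is not None


if __name__ == "__main__":
    samples = [
        # Commonly documented Googlebot header value.
        "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
        # Invented header that carries the token, for illustration only.
        "ExampleClient/1.0 google-extended",
    ]
    for ua in samples:
        print(matches_google_extended(ua), ua)
```

A rule of this kind only fires when the token literally appears in the header, so a request that identifies itself as Googlebot is not matched.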
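Google-Extended is a standalone product token, not a separate HTTP crawler: it has no user agent string of its own, and crawling is done with Google's existing user agents. Google says publishers can use the token in robots.txt to manage whether content crawled from their sites may be used for future Gemini model training and for grounding in certain Google AI products. Server rules that look for "Google-Extended" in the User-Agent header are therefore not expected to match Google's requests; the robots.txt rule is the effective control.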
Blocking Google-Extended does not itself block Google Search crawling.
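To see how the robots.txt rule scopes to the token alone, here is a minimal sketch using Python's standard `urllib.robotparser`. It parses the two-line rule shown earlier and asks whether two agents may fetch a page. The `example.com` URL and the `can_fetch` wrapper are assumptions for illustration, and the standard-library parser only approximates how Google itself interprets robots.txt.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rule from this page, held in memory instead of fetched.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /
"""


def can_fetch(agent: str, url: str) -> bool:
    """Check a URL against ROBOTS_TXT for the given user-agent token."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    url = "https://example.com/page"  # placeholder URL
    print(can_fetch("Google-Extended", url))  # False: the token is disallowed
    print(can_fetch("Googlebot", url))        # True: no rule applies to Googlebot
```

Because the file contains no `User-agent: *` group and no Googlebot group, agents other than Google-Extended fall through to the default of being allowed.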
https://developers.google.com/crawling/docs/crawlers-fetchers/google-common-crawlers
Google Messages link-preview fetcher.
Generic Google crawler for public-content fetches outside core Search.
GoogleOther-Image is GoogleOther optimized for fetching publicly accessible image URLs.
GoogleOther-Video is GoogleOther optimized for fetching publicly accessible video URLs.
Google Pinpoint fetcher for user-supplied sources.
Google's main crawler for Search and related surfaces.
Google crawler used on owner request for Vertex AI Agents.
Apple control token for foundation-model training opt-out.
Webz.io extended web crawler that maintains a repository of web crawl data.
