PerplexityBot
Perplexity crawler used for search indexing, not model training.
Perplexity user-triggered fetcher used to visit pages when answering a user's question.
User-agent: Perplexity-User
Disallow: /
(http.user_agent contains "Perplexity-User")
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Perplexity\-User [NC]
RewriteRule ^ - [F,L]
if ($http_user_agent ~* "Perplexity-User") { return 403; }
Perplexity-User is used for user actions inside Perplexity. It is not used for automatic web crawling or training, and Perplexity says this user-requested fetcher generally ignores robots.txt rules.
Perplexity publishes IP ranges for Perplexity-User.
https://docs.perplexity.ai/docs/resources/perplexity-crawlers
Perplexity crawler used for search indexing, not model training.
Proxies user actions on Yandex services, including online translation and button-click responses.
Yandex Calendar robot that downloads calendar files by user request.
Google Pinpoint fetcher that requests URLs users specify as sources for Pinpoint collections.
Google user-triggered agent used by Google-hosted agents to navigate the web on a user’s request.
