Diffbot-User is listed in the Botcrawl directory as a Diffbot crawler used for scraping, SEO analysis, or data collection. The primary identifier for log review is the user-agent token Diffbot-User.
Identification
- User-agent pattern: Diffbot-User
- Family: Diffbot
- Type: Scraper
- Kind: Fetcher
Common use
Public web data collection, SEO analysis, content extraction, or third-party crawling activity.
Verification and handling
Confirm the user-agent against server logs and use published operator documentation, IP ranges, reverse DNS, or other trust signals when available.
Directory guidance marks the risk level as Neutral and the blocking decision as Depends. Do not rely on the user-agent string alone because user-agent strings can be copied or spoofed.
Robots.txt handling: Yes.
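The reverse DNS check mentioned above is often done as a forward-confirmed lookup: resolve the client IP to a hostname, check that the hostname falls under a domain the operator publishes, then resolve that hostname back and confirm it returns the same IP. The following is a minimal sketch of that idea, not a documented Diffbot procedure. The suffix `.crawler.example.com` and the function name `forward_confirmed` are hypothetical placeholders; substitute values from the operator's published documentation if it provides any.

```python
import socket
from typing import Callable, Iterable, List

# Hypothetical suffix for illustration only. Replace it with a domain taken
# from the operator's published documentation, if one is available.
TRUSTED_SUFFIXES = (".crawler.example.com",)


def _reverse(ip: str) -> str:
    """Return the PTR hostname for an IP address."""
    return socket.gethostbyaddr(ip)[0]


def _forward(hostname: str) -> List[str]:
    """Return the IPv4 addresses a hostname resolves to."""
    return socket.gethostbyname_ex(hostname)[2]


def forward_confirmed(
    ip: str,
    suffixes: Iterable[str] = TRUSTED_SUFFIXES,
    reverse: Callable[[str], str] = _reverse,
    forward: Callable[[str], List[str]] = _forward,
) -> bool:
    """Return True only if the PTR name is trusted and resolves back to ip."""
    try:
        hostname = reverse(ip).rstrip(".").lower()
        if not hostname.endswith(tuple(suffixes)):
            return False
        return ip in forward(hostname)
    except OSError:
        # Covers socket.herror and socket.gaierror (missing or failed lookups).
        return False
```

The resolver functions are passed in as parameters so the logic can be exercised without network access. Note that `socket.gethostbyname_ex` handles IPv4 only, so an IPv6 deployment would need a different forward lookup.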
Detection Notes
Match `Diffbot-User` as a case-insensitive substring in HTTP user-agent logs. Review bot_aliases for alternate names or product labels. Do not treat a user-agent match alone as proof of identity for allow-listing.
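As a rough illustration of the matching rule above, the sketch below flags user-agent strings that contain `Diffbot-User` in any letter case. The function name `matches_diffbot_user` and the sample strings in the usage note are made up for the example; a match here only marks a request for further review and does not establish identity.

```python
TOKEN = "diffbot-user"


def matches_diffbot_user(user_agent: str) -> bool:
    """Return True if the user-agent contains the token, ignoring case."""
    # Guard against missing headers, which some log parsers report as None.
    return TOKEN in (user_agent or "").lower()


def filter_matches(user_agents):
    """Return only the user-agent strings that contain the token."""
    return [ua for ua in user_agents if matches_diffbot_user(ua)]
```

For instance, `filter_matches(["DIFFBOT-USER", "SomeOtherBot/2.0"])` would keep only the first entry.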
Rules And Blocking Notes
User-agent: Diffbot-User
Disallow: /
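Before deploying the rule above, it can help to check how a standards-following parser reads it. The sketch below uses Python's standard `urllib.robotparser` module for that; the helper name `is_allowed`, the agent name `OtherBot`, and the `example.com` URLs are illustrative assumptions. Keep in mind that robots.txt is advisory, so this shows only what a compliant crawler would conclude, not what any given client will do.

```python
from urllib.robotparser import RobotFileParser

# The rule from this entry, written as it would appear in robots.txt.
RULES = """\
User-agent: Diffbot-User
Disallow: /
"""


def is_allowed(agent: str, url: str, rules: str = RULES) -> bool:
    """Return True if the given rules permit agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, url)
```

With these rules, `is_allowed("Diffbot-User", "https://example.com/page")` is false, while an agent with no matching group, such as the hypothetical `OtherBot`, is still allowed.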