omgili

Risk: Risky | Block: Depends | robots.txt: Unknown

omgili is a Webz.io web crawler. It feeds a repository of web crawl data that is sold to other companies.

Key facts

Operator
Webz.io
Family
Webz.io
Purpose
Site Owner Fetch
User-Agent
omgili
Should you block it?
Depends
Respects robots.txt
Unknown
Identity type
Unknown
Confidence
Unknown

Bot details

Identity

User-Agent
omgili
robots.txt token
omgili
HTTP user-agent
Mozilla/5.0 (compatible; omgili/0.5 +http://omgili.com)
Aliases
Webz.io Omgili

Ownership

Operator
Webz.io
Family
Webz.io
Type
AI
Purpose
Site Owner Fetch

Verification and trust

Source type
Unknown
Confidence
Unknown
Verification
Verify the exact user-agent against Webz.io's published crawler documentation.
Source URL
https://webz.io

Blocking and detection

Robots

User-agent: omgili
Disallow: /
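
The rule above can be checked offline before it is deployed. The following is a minimal sketch, assuming Python's standard-library urllib.robotparser and a placeholder URL on example.com (both are illustrative choices, not part of this profile). It only shows how a compliant parser reads the rule; whether omgili actually honors robots.txt is listed as Unknown here.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rule from this profile.
ROBOTS_TXT = """\
User-agent: omgili
Disallow: /
"""


def is_allowed(agent_token: str, url: str) -> bool:
    """Return True if a compliant parser would let agent_token fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent_token, url)


if __name__ == "__main__":
    # example.com is a placeholder host, used only for illustration.
    print("omgili allowed:", is_allowed("omgili", "https://example.com/page"))
    print("other bot allowed:", is_allowed("SomeOtherBot", "https://example.com/page"))
```

The check uses the robots.txt token (omgili), not the full HTTP user-agent string, because the token is what a parser matches against the User-agent line.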

Cloudflare

(http.user_agent contains "omgili")

Advanced server rules

Apache

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} omgili [NC]
RewriteRule ^ - [F,L]

Nginx

if ($http_user_agent ~* "omgili") { return 403; }
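
The Apache and Nginx rules above both perform a case-insensitive substring match on the User-Agent header and answer with 403. Where the block has to live in application code instead, the same check can be sketched as follows. This is an illustration under assumptions: the function names should_block and status_for are hypothetical, and the 200 fallback stands in for whatever the application would normally return.

```python
import re
from typing import Optional

# Case-insensitive substring match, mirroring the [NC] and ~* flags above.
OMGILI_PATTERN = re.compile(r"omgili", re.IGNORECASE)


def should_block(user_agent: Optional[str]) -> bool:
    """Return True when the User-Agent header mentions omgili."""
    if not user_agent:
        # A missing or empty header is not treated as omgili.
        return False
    return OMGILI_PATTERN.search(user_agent) is not None


def status_for(user_agent: Optional[str]) -> int:
    """Hypothetical helper: 403 for omgili, 200 for everything else."""
    return 403 if should_block(user_agent) else 200


if __name__ == "__main__":
    ua = "Mozilla/5.0 (compatible; omgili/0.5 +http://omgili.com)"
    print(status_for(ua))
    print(status_for("Mozilla/5.0 (X11; Linux x86_64)"))
```

A User-Agent header is set by the client and can be changed freely, so this match only catches requests that identify themselves as omgili.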

Notes

omgili is operated by Webz.io. Botcrawl classifies it as an AI bot whose primary purpose is site owner fetch.

Key profile signals: risk level: risky; verified: no.

Source documentation: https://webz.io

Evidence and source

  • Verify the exact user-agent against Webz.io's published crawler documentation.