Internet Archive

Browse bots in this family.

archive.org_bot Neutral

Internet Archive

The Internet Archive bot, also known as archive.org_bot, is the web crawler for the Internet Archive's Wayback Machine. It systematically crawls and preserves publicly accessible web pages.

Operator and family: Internet Archive
Type: Feed
Purpose: Feed Fetch
Verified robots.txt: Yes
Block: Depends
Verified Bot
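
Because the blocking recommendation for archive.org_bot is "Depends", a site operator may want to check how a given robots.txt treats this user agent before deploying it. The following is a minimal sketch using Python's standard urllib.robotparser module. The robots.txt content, the /private/ path, the example.com URLs, and the is_allowed helper are all hypothetical and chosen only for illustration; only the archive.org_bot token comes from the listing above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep archive.org_bot out of /private/ only,
# and leave every other crawler unrestricted.
ROBOTS_TXT = """\
User-agent: archive.org_bot
Disallow: /private/

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/public/page", "/private/page"):
        url = "https://example.com" + path
        print(path, is_allowed(ROBOTS_TXT, "archive.org_bot", url))
```

Parsing the rules from a string keeps the check offline, so the same helper can be run against a draft robots.txt before it is published.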
IABot Safe

InternetArchiveBot

InternetArchiveBot scans Wikipedia for URL references and checks whether each URL is still live or returns a 404 error.

Operator and family: Internet Archive
Type: Feed
Purpose: Feed Fetch
Verified robots.txt: No
Block: No
Verified Bot
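
The kind of liveness check described for InternetArchiveBot can be illustrated with a short sketch. This is not the bot's actual implementation; it only assumes that a link counts as dead when the server answers 404 (or 410) and as alive on a 2xx or 3xx response. The function names, the "unknown" category, and the User-Agent string are hypothetical.

```python
import urllib.error
import urllib.request


def classify_status(status: int) -> str:
    """Map an HTTP status code to a coarse link state (assumed rules)."""
    if 200 <= status < 400:
        return "alive"
    if status in (404, 410):
        return "dead"
    # Server errors, auth walls, rate limits: not enough evidence either way.
    return "unknown"


def check_url(url: str, timeout: float = 10.0) -> str:
    """Send a HEAD request and classify the response."""
    request = urllib.request.Request(
        url,
        method="HEAD",
        headers={"User-Agent": "link-check-sketch/0.1"},  # hypothetical UA
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return classify_status(response.status)
    except urllib.error.HTTPError as err:
        # urlopen raises HTTPError for 4xx/5xx; the code is still usable.
        return classify_status(err.code)
    except (urllib.error.URLError, TimeoutError):
        # DNS failure, refused connection, or timeout.
        return "unknown"
```

Separating classify_status from the network call keeps the decision rule easy to adjust, for example to treat repeated timeouts as dead after several retries.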