Bot intelligence record
TurnitinBot ContentIngest
Usually allowUse the TurnitinBot/ContentIngest identifier to separate Turnitin search indexing or content discovery traffic from normal visitor requests in server logs.
User-Agent Pattern
TurnitinTurnitinBot/ContentIngest
Robots.txt Snippet
Copy-readyUser-agent: Turnitin Disallow: /
Handling Guidance
DependsThis bot is usually safe to allow when the request source is verified and the traffic matches your site policy.
Search indexing, content discovery, rendering, or search-result freshness checks.
Record Details
Structured data- Operator
- Turnitin
- Family
- Turnitin
- Type
- Search
- Purpose
- Indexing
- Identity type
- Official Documented
- Confidence
- High
- Last verified
- 2026-04-01
- Last checked
- 2026-05-20
- Source type
- Official
- Spoofing risk
- User-agent strings for TurnitinBot/ContentIngest can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.
Notes
TurnitinBot ContentIngest is listed in the Botcrawl directory as a search crawler from Turnitin. The primary identifier for log review is TurnitinBot/ContentIngest.
Identification
- User-agent pattern:
TurnitinBot/ContentIngest - Family: Turnitin
- Type: Search
- Kind: Crawler
Common use
Search indexing, content discovery, rendering, or search-result freshness checks.
Verification and handling
Confirm the user-agent against server logs and use published operator documentation, IP ranges, reverse DNS, or other trust signals when available.
Directory guidance marks the risk level as Safe and the blocking decision as Depends. Do not rely on the user-agent string alone because user-agent strings can be copied or spoofed.
Robots.txt handling: No.
Evidence and Source
- Compare the observed user-agent against the documented TurnitinBot/ContentIngest pattern. Where available, confirm with operator documentation, published IP ranges, reverse DNS, signed-agent metadata, or published operator documentation, reverse DNS, published IP ranges, signatures, or other trust signals.
- Match `TurnitinBot/ContentIngest` as a case-insensitive substring in HTTP user-agent logs. Review bot_aliases for alternate names or product labels. Use bot_http_agent for full user-agent examples when the client sends a longer browser-like string. Do not treat a user-agent match alone as proof of identity for allow-listing.
- Search indexing, content discovery, rendering, or search-result freshness checks.
- User-agent strings for TurnitinBot/ContentIngest can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.
WordPress Bot Protection
Bot Blocker for WordPress
Monitor bot traffic, review live activity, and control AI crawlers, scrapers, scanners, spam bots, and fake trusted bots from one clean WordPress dashboard.
