Bot intelligence record

TurnitinBot ContentIngest

Usually allow

Use the TurnitinBot/ContentIngest identifier to separate Turnitin search indexing or content discovery traffic from normal visitor requests in server logs.

Search Indexing Official Documented Confidence: High Verified: Yes robots.txt: No

User-Agent Pattern

Turnitin
TurnitinBot/ContentIngest

Robots.txt Snippet

Copy-ready
User-agent: Turnitin Disallow: /

Handling Guidance

Depends

This bot is usually safe to allow when the request source is verified and the traffic matches your site policy.

Search indexing, content discovery, rendering, or search-result freshness checks.

Record Details

Structured data
Operator
Turnitin
Family
Turnitin
Type
Search
Purpose
Indexing
Identity type
Official Documented
Confidence
High
Last verified
2026-04-01
Last checked
2026-05-20
Source type
Official
Spoofing risk
User-agent strings for TurnitinBot/ContentIngest can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.

Notes

TurnitinBot ContentIngest is listed in the Botcrawl directory as a search crawler from Turnitin. The primary identifier for log review is TurnitinBot/ContentIngest.

Identification

  • User-agent pattern: TurnitinBot/ContentIngest
  • Family: Turnitin
  • Type: Search
  • Kind: Crawler

Common use

Search indexing, content discovery, rendering, or search-result freshness checks.

Verification and handling

Confirm the user-agent against server logs and use published operator documentation, IP ranges, reverse DNS, or other trust signals when available.

Directory guidance marks the risk level as Safe and the blocking decision as Depends. Do not rely on the user-agent string alone because user-agent strings can be copied or spoofed.

Robots.txt handling: No.

Evidence and Source

  • Compare the observed user-agent against the documented TurnitinBot/ContentIngest pattern. Where available, confirm with operator documentation, published IP ranges, reverse DNS, signed-agent metadata, or published operator documentation, reverse DNS, published IP ranges, signatures, or other trust signals.
  • Match `TurnitinBot/ContentIngest` as a case-insensitive substring in HTTP user-agent logs. Review bot_aliases for alternate names or product labels. Use bot_http_agent for full user-agent examples when the client sends a longer browser-like string. Do not treat a user-agent match alone as proof of identity for allow-listing.
  • Search indexing, content discovery, rendering, or search-result freshness checks.
  • User-agent strings for TurnitinBot/ContentIngest can be spoofed. Treat user-agent detection as a classification signal, then verify with published IP ranges, reverse DNS, signatures, operator documentation, or published operator documentation, IP ranges, reverse DNS, signatures, or other verified identity signals before allow-listing.

WordPress Bot Protection

Bot Blocker for WordPress

Monitor bot traffic, review live activity, and control AI crawlers, scrapers, scanners, spam bots, and fake trusted bots from one clean WordPress dashboard.