How robots.txt governs AI crawlers
robots.txt lives at the root of your domain and lists rules grouped by user-agent. Compliant AI crawlers read it before fetching and obey the most specific matching group. It’s a request, not a lock—well-behaved bots honor it; bad actors ignore it.
Anatomy
# https://example.com/robots.txt
User-agent: GPTBot
Disallow: /
User-agent: OAI-SearchBot
Allow: /
User-agent: *
Allow: /
Sitemap: https://example.com/sitemap.xmlSpecificity matters: a named user-agent group (e.g. GPTBot) overrides the wildcard (*) group for that bot. Bots do not merge the two.
