Glossary / AI Crawlers
AI Crawlers
GPTBot
OpenAI's web crawler that fetches content to train and update its models.
Definition
GPTBot is the user-agent string OpenAI uses to crawl publicly accessible web pages. It is used to collect training data for models like GPT-4 and to keep those models' knowledge current. Site owners can allow or block GPTBot in robots.txt using the user-agent directive `GPTBot`.
Why it matters for AI visibility
If GPTBot is blocked, OpenAI's models are less likely to have indexed your brand's most accurate, up-to-date pages — reducing the chance of appearing in ChatGPT-generated recommendations.
Related
Check your site
The free scan checks crawler access, robots.txt, sitemap, structured data, and discoverability — and turns the results into a prioritized fix list.