Glossary / AI Crawlers
AI Crawlers
Google-Extended
Google's opt-out user-agent for AI product training, separate from regular search crawling.
Definition
Google-Extended is a separate crawler user-agent Google introduced to give site owners control over whether their content is used to train Gemini models and improve AI features like AI Overviews. Blocking it does not affect standard Google Search indexing — that is controlled by Googlebot. Allowing Google-Extended is a distinct opt-in for AI product training.
Why it matters for AI visibility
If you block Google-Extended, your content may be underrepresented in Gemini training data and Google's generative AI answers. Since it is separate from Googlebot, you can allow one while restricting the other.
Related
Check your site
The free scan checks crawler access, robots.txt, sitemap, structured data, and discoverability — and turns the results into a prioritized fix list.