A public directory of known
bots used across the web
Powering Vercel Bot Protection to allow verified bots to bypass bot filters
- AdIdxBotVerified
AdIdxBot is the crawler used by Bing Ads for quality control of ads and their destination websites. It has multiple user agent variants including desktop, iPhone, and Windows Phone versions.
Search EngineAdvertising - AdsBot-GoogleVerified
AdsBot-Google is Google's web crawler used for quality control of Google Ads.
Search EngineAdvertising - AdsenseVerified
The AdSense crawler visits participating sites in order to provide them with relevant ads.
Search EngineAdvertising - Adyen WebhookVerified
Adyen’s webhooks (Notification API) send encrypted, real-time HTTP callbacks for key payment and account events—automating order fulfillment, settlement reconciliation, and risk-management workflows.
Webhook - AhrefsBotVerified
Powers the database for both Ahrefs, a marketing intelligence platform, and Yep, an independent, privacy-focused search engine.
Search Engine - AhrefsSiteAuditVerified
Powers Ahrefs’ Site Audit tool. Ahrefs users can use Site Audit to analyze websites and find both technical SEO and on-page SEO issues.
Analytics - AI2Bot
AI2Bot is operated by the Allen Institute for Artificial Intelligence (Ai2) to crawl the web for content to train open-source AI models. It is used to index academic publications and web content for research purposes.
AI Training - aiHitBot
aiHitBot collects and maintains historical information about companies. It gathers data from company websites to build comprehensive company profiles, including changes in company executives and other historical information.
AI Training - AlgoliaVerified
The Algolia Crawler extracts content from your site and makes it searchable.
Search Engine - Amazon KendraVerified
Amazon Kendra is a managed information retrieval and intelligent search service that uses natural language processing and advanced deep learning model.
AI SearchSearch Engine - Amazon QVerified
Amazon Q Business is a generative artificial intelligence (generative AI)-powered assistant that you can tailor to your business needs.
AI Search - AmazonbotVerified
Amazonbot is Amazon's web crawler used to improve our services, such as enabling Alexa to more accurately answer questions for customers.
AI SearchAI Training - Amazon Product Discovery
Amazon's web crawler used to collect publicly available product details from Amazon Selling Partner websites to help improve the accuracy and completeness of product information on Amazon. This helps ensure that Amazon customers see correct and complete information to help them in their shopping journey.
E-commerceSearch Engine - Amazon Seller Initiated Listing
Amazon's web crawler that helps sellers succeed by giving them the option to provide a URL to a website and create high-quality product pages in Amazon's store. This bot crawls seller-provided URLs to collect product information for listing creation.
E-commerceUser Initiated - APIs-GoogleVerified
Crawling preferences addressed to the APIs-Google user agent affect the delivery of push notification messages by Google APIs.
Search Engine - Apple PodcastsVerified
Apple Podcasts crawler that only accesses URLs associated with registered content on Apple Podcasts. Does not follow robots.txt.
User Initiated - ApplebotVerified
Applebot powers search features in Apple's ecosystem (Spotlight, Siri, Safari) and may be used to train Apple's foundation models for generative AI features.
Search EngineAI Training - Artemis Web CrawlerVerified
Artemis is a calm web reader with which you can follow websites and blogs.
Preview - Awario Bot
Awario's web crawler used to discover and collect new and updated web data for their social media monitoring and brand mention tracking platform. The crawler helps Internet marketers find who is mentioning their brand online.
Analytics - Awario RSS Bot
One of Awario's primary web crawlers specialized in collecting RSS feed data.
Analytics - Awario Smart Bot
One of Awario's primary web crawlers that discovers and collects new and updated web data.
Analytics - BaiduSpiderVerified
Baiduspider is Baidu’s web crawler that indexes websites for inclusion in its Chinese-market search results.
Search Engine - BarkrowlerVerified
Barkrowler is Babbar's web crawler that fuels and updates their graph representation of the web, providing SEO tools for the marketing community.
Search Engine - Better StackVerified
Better Stack is a platform for monitoring and alerting on your applications.
Monitoring - BingbotVerified
Bingbot is Microsoft's web crawler used for indexing websites for Bing Search.
Search Engine - BLEXBotVerified
BLEXBot is SE Ranking's web crawler that helps analyze websites for SEO purposes, including backlink analysis, rank tracking, and website auditing. The bot is part of SE Ranking's all-in-one SEO platform used by marketing professionals and agencies.
AnalyticsSearch Engine - BrightbotVerified
Brightbot is Bright Data's crawler layer that monitors the health of websites and enforces ethical web data collection. It prevents access to non-public information and blocks interactive endpoints that could be abused, acting as a guardian for ethical data collection.
MonitoringAnalytics - Bytespider
Bytespider is ByteDance's web crawler used to gather training data for their AI large language models. It's primarily used to scrape web content to train TikTok's AI features and other ByteDance AI products.
AI Training - CCBotVerified
CCBot is operated by the Common Crawl Foundation to crawl web content for AI training and research. Common Crawl is a non-profit organization that maintains an open repository of web crawl data that is universally accessible for research and analysis.
AI Training - CensysInspectBot
Censys Inspect is a web crawler operated by Censys that performs internet-wide scanning to discover, monitor, and analyze publicly accessible devices and services. The crawler follows best practices, only accesses public-facing services, and respects robots.txt directives.
MonitoringAnalytics - ChatGPT-UserVerified
Handles user-initiated requests in ChatGPT, accessing external content to provide real-time information; not used for automated crawling or AI training.
User InitiatedAI (No Training)AI User Initiated - ChecklyVerified
Checkly is a platform for monitoring and alerting on your applications.
Monitoring - Chrome LighthouseVerified
PageSpeed Insights (PSI) reports on the user experience of a page on both mobile and desktop devices, and provides suggestions on how that page may be improved.
Analytics - Chrome Privacy Preserving Prefetch ProxyVerified
Chrome's Privacy Preserving Prefetch Proxy service that fetches /.well-known/traffic-advice to enable privacy-preserving prefetch hints.
Preview - ClarityBot
ClarityBot is seoClarity's web crawler that performs technical SEO audits, analyzes content, and monitors website performance. The bot respects robots.txt directives and crawl delays, and can be configured by seoClarity clients to control crawl speed and frequency.
AnalyticsMonitoring - Claude-SearchBotVerified
Claude-SearchBot navigates the web to improve search result quality for users. It analyzes online content specifically to enhance the relevance and accuracy of search responses.
AI SearchAI (No Training) - Claude-UserVerified
Claude-User supports Claude AI users. When individuals ask questions to Claude, it may access websites using a Claude-User agent.
User InitiatedAI SearchAI (No Training)+1 - ClaudeBotVerified
ClaudeBot helps enhance the utility and safety of our generative AI models by collecting web content that could potentially contribute to their training.
AI Training - ContentKingBot
ContentKing (now Conductor Website Monitoring) is a website monitoring tool that continuously audits websites to help improve their performance and visibility. It makes HTTP GET requests to monitor websites' SEO, content changes, and technical health.
MonitoringAnalytics - CookiebotVerified
Cookiebot automates compliance with cookie laws and helps you manage your cookie consent preferences.
Monitoring - CookieScript
A cookie scanning bot that examines websites for cookie usage to help maintain GDPR and other privacy regulation compliance.
Monitoring - Cotoyogi
Cotoyogi is a web crawler operated by the Center for Research and Development on Data Lake, ROIS-DS (Research Organization of Information and Systems - Data Science) for collecting Japanese language data resources.
AI Training - Coveobot
Coveobot is a crawler operated by Coveo that indexes content for enterprise search, recommendations, and generative experience platforms. The bot crawls and analyzes both structured and unstructured content to enable unified search experiences across multiple data sources.
AI Search - CriteoBotVerified
CriteoBot is a crawler operated by Criteo that analyzes web content to serve relevant contextual ads. The bot respects robots.txt directives and crawl delays, and only accesses publicly available content.
AdvertisingAnalytics - Datadog Synthetic Monitoring RobotVerified
Datadog's automated monitoring service that performs synthetic tests to verify website availability and performance.
Monitoring - DataForSeoBotVerified
DataForSeoBot is a backlink checker bot operated by DataForSEO that crawls websites to build and maintain their backlink database. The bot respects robots.txt directives and crawl delays, and is used to provide SEO data and analytics services.
Analytics - DetectifyVerified
Detectify is a web security scanner that performs automated security tests on web applications and attack surface monitoring.
MonitoringUser Initiated - DigitalOceanUptimeBot
DigitalOcean Uptime is a monitoring service that checks the health of any URL or IP address. The probe performs checks from multiple global regions to monitor latency, uptime, and SSL certificates of websites and hosts.
Monitoring - Discord Bot
Discord's link preview bot that crawls URLs shared in Discord chats to generate rich previews.
Preview - DotBot
DotBot is a web crawler operated by Moz (formerly SEOmoz) that collects data for their Link Explorer tool and Links API. It helps build Moz's link intelligence database which powers their Domain Authority and Page Authority metrics.
Analytics - DuckAssistBotVerified
DuckAssistBot is a web crawler for DuckDuckGo Search that crawls pages in real-time for AI-assisted answers, which prominently cite their sources. This data is not used in any way to train AI models.
AI SearchAI (No Training) - DuckDuckBotVerified
DuckDuckBot is a web crawler for DuckDuckGo. DuckDuckBot’s job is to constantly improve search results and offer users the best and most secure search experience possible.
Search Engine - Facebook WebhooksVerified
Facebook's webhook service that delivers real-time event notifications for Meta platform events and changes.
Webhook - FacebookExternalHitVerified
Fetches content for shared links on Meta platforms to generate rich previews.
PreviewSocial Media - FeedfetcherVerified
Feedfetcher is used for crawling RSS or Atom feeds for Google News and PubSubHubbub.
Search Engine - GeedoProductSearchBotVerified
GeedoProductSearch is a web crawler operated by Geedo SIA that indexes product information from e-commerce websites. The crawler respects robots.txt directives and can be configured for crawl speed and behavior through standard crawl-delay settings.
E-commerce - GitHub CamoVerified
GitHub's image proxy service
Preview - GitHub HookshotVerified
GitHub's webhooks for events like push, pull request, etc.
Webhook - Google-CloudVertexBotVerified
Crawling preferences addressed to the Google-CloudVertexBot user agent affect crawls requested by the site owners' for building Vertex AI Agents. It has no effect on Google Search or other products.
AI Search - Google-ExtendedVerified
Google-Extended is a standalone product token that web publishers can use to manage whether their sites help improve Gemini Apps and Vertex AI generative APIs, including future generations of models that power those products. Grounding with Google Search on Vertex AI does not use web pages for grounding that have disallowed Google-Extended. Google-Extended does not impact a site's inclusion or ranking in Google Search.
AI Training - Google-InspectionToolVerified
Crawling preferences addressed to the Google-InspectionTool user agent affect Search testing tools such as the Rich Result Test and URL inspection in Search Console. It has no effect on Google Search or other products.
Monitoring - Google PageRendererVerified
Upon user request, Google Page Renderer fetches and renders web pages.
Preview - Google Publisher CenterVerified
Google Publisher Center fetches and processes feeds that publishers explicitly supplied for use in Google News landing pages.
Search Engine - Google Read AloudVerified
Upon user request, Google Read Aloud fetches and reads out web pages using text-to-speech (TTS).
User Initiated - Google-SafetyVerified
The Google-Safety user agent handles abuse-specific crawling, such as malware discovery for publicly posted links on Google properties. As such it's unaffected by crawling preferences.
Monitoring - Google Site VerifierVerified
Google Site Verifier fetches Search Console verification tokens.
Verification - Google StoreBotVerified
Crawling preferences addressed to the Storebot-Google user agent affect all surfaces of Google Shopping (for example, the Shopping tab in Google Search and Google Shopping).
Search EngineE-commerce - GooglebotVerified
Crawling preferences addressed to the Googlebot user agent affect Google Search (including Discover and all Google Search features), as well as other products such as Google Images, Google Video, Google News, and Discover.
Search Engine - GoogleOtherVerified
Crawling preferences addressed to the GoogleOther user agent don't affect any specific product. GoogleOther is the generic crawler that may be used by various product teams for fetching publicly accessible content from sites. For example, it may be used for one-off crawls for internal research and development. It has no effect on Google Search or other products.
Search Engine - GoogleStackdriverMonitoringBot
GoogleStackdriverMonitoringBot is operated by Google Cloud to perform uptime checks and monitor availability of services. The bot sends HTTP/HTTPS requests from multiple global locations to verify service health and responsiveness.
Monitoring - GPT-ActionsVerified
Enables ChatGPT to interact with external APIs and retrieve real-time information from the web in response to user-initiated requests; allows access to up-to-date content without being used for automated crawling or AI training.
User InitiatedAI (No Training)AI User Initiated - GPTBotVerified
Crawls web content to improve OpenAI's generative AI models; respects 'robots.txt' directives to exclude sites from training data.
AI Training - HetrixTools Uptime Monitoring BotVerified
HetrixTools Uptime Monitoring Bot is used by HetrixTools's monitoring services to perform various checks on websites, including uptime and performance monitoring.
Monitoring - HookdeckVerified
A reliable Event Gateway for event-driven applications
Webhook - HydrozenVerified
Hydrozen is a tool for monitoring availability of your websites, Cronjobs, APIs, Domains, SSL etc.
Monitoring - IASBot
IAS (Integral Ad Science) crawler, formerly known as AdmantX, is used for analyzing web content to ensure brand safety and suitability for advertisers. The crawler helps assess content quality, context, and safety for digital advertising campaigns.
AdvertisingAnalytics - ImagesiftBotVerified
ImageSiftBot is a web crawler that scrapes the internet for publicly available images to support Hive's suite of web intelligence products.
AI Training - InngestVerified
Inngest is a platform for building event-driven applications.
Webhook - InternetMeasurementBot
InternetMeasurementBot is operated by driftnet.io to discover and measure services that network owners and operators have publicly exposed. The bot performs network measurements and service discovery without attempting to log in to systems or send spam.
Monitoring - LinkedInBotVerified
LinkedInBot is a bot that renders links shared on LinkedIn.
PreviewSocial Media - LogRocketBot
LogRocket Asset Cacher is a bot that captures and caches web assets (CSS, JavaScript, images) to ensure proper playback of user sessions in LogRocket's session replay feature. The bot only accesses publicly available content when LogRocket needs to record sessions.
AnalyticsMonitoring - LumarVerified
The Lumar website intelligence platform is used by SEO, engineering, marketing and digital operations teams to monitor the performance of their site’s technical health, and ensure a high-performing, revenue-driving website.
Analytics - meta-externalagentVerified
The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.
AI Training - meta-externalfetcherVerified
The Meta-ExternalFetcher crawler performs user-initiated fetches of individual links to support specific product functions. Because the fetch was initiated by a user, this crawler may bypass robots.txt rules.
User Initiated - MicrosoftPreviewVerified
MicrosoftPreview generates page snapshots for Microsoft products. It has desktop and mobile variants, with Chrome version dynamically updated to match the latest Microsoft Edge version.
Preview - MJ12bot
MJ12bot is a web crawler operated by Majestic-12 Ltd, a UK-based company that builds a search engine focused on backlink analysis and web structure mapping. The crawler is part of a distributed community-based system that helps build Majestic's link intelligence database.
Search Engine - adsnaverVerified
Naver's ad crawler that periodically visits registered ad landing pages to collect on-page content for effective ad matching and ranking. It ignores robots.txt for URLs registered in the ad system.
Search EngineAdvertising - naver-bluenoVerified
Naver's preview-snippet crawler that fetches summary information (titles, descriptions, images) when users insert links in Naver services such as blogs or cafés. It operates on demand and respects robots.txt.
Preview - naverbotVerified
Naver's web crawler (also known as Yeti) is used by Naver, South Korea's largest search engine, to crawl and index web content.
Search Engine - OAI-SearchBotVerified
Indexes websites for inclusion in ChatGPT's search results; does not crawl content for AI model training.
AI SearchAI (No Training) - OhDearBot
OhDearBot is a monitoring bot operated by Oh Dear that performs uptime checks, broken link detection, and mixed content scanning. The bot follows standard crawling practices and throttles requests to minimize server impact.
Monitoring - PayPalVerified
PayPal delivers real-time event notifications for payments, subscriptions, and account updates.
Webhook - Perplexity-UserVerified
Handles user-initiated requests in Perplexity, accessing external content to provide real-time information; not used for automated crawling or AI training.
User InitiatedAI SearchAI (No Training)+1 - PerplexityBotVerified
Indexes websites for inclusion in Perplexity's search results; does not crawl content for AI model training.
AI SearchAI (No Training) - PetalBotVerified
PetalBot is a web crawler operated by Huawei's Petal Search engine. It crawls both PC and mobile websites to build an index database for Petal search engine and to provide content recommendations for Huawei Assistant and AI Search services.
Search EngineAI Search - Pingdom BotVerified
Pingdom Bot is used by Pingdom's monitoring services to perform various checks on websites, including uptime and performance monitoring.
Monitoring - Pinterest BotVerified
Pinterest's web crawler that indexes content for their platform. It crawls websites to collect metadata for Pins, including images, titles, descriptions, and prices. The crawler also helps maintain Pin data accuracy and detect broken links.
Social Media - ProximicBot
Proximic is Comscore's web crawler that performs contextual content analysis to help advertisers determine the best matching campaigns for a page's content. The bot respects robots.txt, only downloads static textual content, and crawls at a controlled rate.
AdvertisingAnalytics - PulsePoint CrawlerVerified
A web crawler used by PulsePoint, a digital advertising technology company, for content indexing and ads.txt verification.
Advertising - QStashVerified
QStash is a platform for building event-driven applications.
Webhook - Razorpay-WebhookVerified
Razorpay’s webhooks enable merchants to receive secure, real-time HTTP callbacks for key payment events—automating reconciliation, notifications, and downstream workflows.
Webhook - Amazon Route 53 Health Check ServiceVerified
Amazon Route 53 Health Check Service
Monitoring - Sanity WebhookVerified
Sanity's webhook service that delivers real-time event notifications for content changes and other events.
Webhook - SBIntuitionsBot
SBIntuitionsBot is a crawler operated by SB Intuitions Corp. that collects web data for AI development and information analysis. The bot follows RFC 9309 Robots Exclusion Protocol standards and can be controlled via robots.txt directives.
AI Training - ScreamingFrogBot
Screaming Frog SEO Spider is a website crawler used by SEO professionals for site audits and technical SEO analysis. It's a desktop-based tool that crawls websites' links, images, CSS, scripts and apps to evaluate onsite SEO. The crawler respects robots.txt and can be configured for crawl speed and behavior.
Analytics - SeekportBotVerified
SeekportBot is the web crawler for Seekport, a German search engine operated by SISTRIX. The bot crawls and indexes web content while respecting robots.txt directives and crawl delays.
Search Engine - SemanticScholarBot
The Semantic Scholar bot crawls domains to find academic PDFs. These PDFs are served on semanticscholar.org so researchers can discover and understand other academic accomplishments.
AI SearchAI Training - Semrush Site AuditVerified
Semrush Site Audit is a powerful website crawler that analyzes the health of a website by checking for on-page and technical SEO issues, including duplicate content, broken links, HTTPS implementation, hreflang attributes, and more.
Analytics - SemrushVerified
Semrush is a platform for SEO, content marketing, competitor research, PPC and social media marketing.
Search EngineMonitoringAnalytics - Sentry Uptime Monitoring BotVerified
Sentry's Uptime Monitoring Bot performs health checks on configured URLs to monitor the availability and reliability of web services.
Monitoring - SeznamBotVerified
SeznamBot is the web crawler operated by Seznam.cz, the leading Czech search engine. The bot crawls and indexes web content for Seznam's search results, respecting robots.txt directives and crawl delays.
Search Engine - SISTRIX Optimizer Uptime
SISTRIX Optimizer Uptime bot performs continuous monitoring of website availability by checking the startpage once per minute. It is part of SISTRIX's SEO and website monitoring platform.
Monitoring - Site24x7Verified
Site24x7 Bot is used by Site24x7's monitoring services to perform various checks on websites, including uptime and performance monitoring.
Monitoring - Sitebulb
Sitebulb is a desktop and cloud-based website crawler used by SEO professionals for technical SEO audits. It analyzes websites to find technical issues, opportunities for improvement, and provides detailed reports with visualizations and prioritized recommendations.
AnalyticsUser Initiated - SlackLinkExpandingBot
Slackbot Link Expanding is a bot operated by Slack that fetches metadata from shared links to create rich previews. The bot uses HTTP Range headers to efficiently fetch only necessary metadata like oEmbed and Open Graph tags, and caches responses globally for about 30 minutes.
Preview - Slackbot
Slackbot is Slack's default, general-purpose bot that handles various API requests and integrations. It is used for tasks not covered by specialized bots like ImgProxy or LinkExpanding, such as making API requests for service integrations or handling outgoing webhooks.
Webhook - Slack-ImgProxy
Slack-ImgProxy is a bot operated by Slack that fetches and caches images posted in Slack channels. The bot helps improve performance, ensures HTTPS delivery, and protects user privacy by hiding detailed referrer information.
Preview - SnapchatAdsBot
SnapchatAdsBot is a crawler operated by Snapchat that verifies and analyzes websites for their advertising platform. The bot helps ensure content quality and safety for Snapchat's advertising ecosystem.
AdvertisingAnalytics - SnapURLPreviewBot
SnapURLPreviewBot is a crawler operated by Snap Inc. that analyzes and generates previews of URLs shared on Snapchat and other Snap platforms. The bot helps ensure content quality and safety by validating URLs and generating preview metadata.
PreviewAnalyticsSocial Media - StatusCakeVerified
StatusCake is a website monitoring service that checks the uptime and performance of your website.
Monitoring - Stripe WebhooksVerified
Stripe's webhook service that delivers real-time event notifications for payment processing and account updates.
Webhook - svixVerified
svix is a webhook service for sending events to webhooks.
Webhook - TangibleeBot
TangibleeBot is a crawler operated by Tangiblee that collects product data from e-commerce websites to power their product visualization and virtual try-on services. The bot simulates single-visitor activity and crawls at an agreed-upon frequency to prevent disruption to website performance.
E-commerce - TikTokSpider
TikTokSpider is a web crawler used by TikTok/ByteDance to index and analyze web content for their platform. It helps in content discovery, link previews, and data collection for TikTok's services.
Social Media - TTD-Content
TTD-Content is a crawler operated by The Trade Desk that verifies content and quality of ad placements for their demand-side platform. The bot helps ensure brand safety and ad verification by analyzing webpage content where ads may be displayed.
AdvertisingVerification - TwitterbotVerified
Fetches content for shared links on X/Twitter to generate rich previews.
PreviewSocial Media - Uptime RobotVerified
Uptime Robot is a platform for monitoring and alerting on your applications.
Monitoring - UsercentricsBot
UsercentricsBot is operated by Usercentrics GmbH to scan websites for data processing services and third-party technologies. The bot helps ensure GDPR compliance by identifying services that need to be included in the website's Consent Management Platform (CMP).
Analytics - v0botVerified
Bot for v0 services.
Preview - Vercel Favicon BotVerified
Vercel Favicon Bot
Preview - vercelflagsVerified
vercel flags
Monitoring - Vercel Screenshot BotVerified
Vercel Screenshot Bot
Preview - verceltracingVerified
vercel tracing
Preview - Yahoo! SlurpVerified
Yahoo! Slurp is the web crawler (robot) used by Yahoo! Search to discover and index web pages for its search engine.
- YandexbotVerified
YandexBot is a web crawler operated by Yandex, a major Russian search engine.
Search Engine - YisouSpider
YisouSpider is a search engine crawler operated by Yisou that indexes web content for their search engine results. The crawler follows standard crawling practices and respects robots.txt directives.
Search Engine