About the ScoutJet web crawler

ScoutJet web crawler

ScoutJet is the web crawler for IBM Watson.
We are developing next generation search technology, and kindly request that you permit ScoutJet access to your site so that we may refine our relevance algorithms with the broadest variety of content available from the Internet.

ScoutJet obeys robots.txt

You can prevent ScoutJet from indexing all or part of your site by including the following lines in your http://www.yoursite.com/robots.txt file:

# Allow only specific directories
User-agent: ScoutJet
Disallow: /
Allow: /public
You can also limit the rate at which ScoutJet crawls your page using the Crawl-delay directive:
# Limit ScoutJet's crawl rate (example is to crawl no more than 1 page every 5 seconds)
User-agent: ScoutJet
Crawl-delay: 5
In addition, ScoutJet understands wildcards and Allow.
ScoutJet crawls from the following IP ranges:
199.87.248.*, 199.87.249.*, 199.87.250.*, 199.87.251.*, 199.87.252.*, 199.87.253.*, 199.87.254.*, 199.87.255.*
38.99.96.*, 38.99.97.*, 38.99.98.*, 38.99.99.*,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
ScoutJet tries its best to crawl politely. But if you do experience a problem with ScoutJet, please let us know at crawler (at) blekko (dot) com.