# This is a file retrieved by webwalkers a.k.a. spiders that # conform to a defacto standard. # Comments to the webmaster should be posted at # This file is used to allow crawlers to index our site. # # List of all web robots: http://www.robotstxt.org/wc/active/html/index.html # # Check robots.txt at: # http://www.searchengineworld.com/cgi-bin/robotcheck.cgi # # Details about Googlebot available at: http://www.google.com/bot.html # The Google search engine can see everything #User-agent: Googlebot # All other robots will be restricted from accessing the Google-specific index pages User-agent: * Disallow: /404/ Disallow: /common/ Disallow: /i/ Disallow: /cn/press/ Disallow: /press/