Note that there are some explanatory texts on larger screens.

plurals
  1. POhow to ban crawler 360Spider with robots.txt or .htaccess?
    primarykey
    data
    text
    <p>I've got a problems because of 360Spider: this bot makes too many requests per second to my VPS and slows it down (the CPU-usage becomes 10-70%, but usually i have 1-2%). I looked into httpd logs and saw there such lines:</p> <pre><code>182.118.25.209 - - [06/Sep/2012:19:39:08 +0300] "GET /slovar/znachenie-slova/42957-polovity.html HTTP/1.1" 200 96809 "http://www.hrinchenko.com/slovar/znachenie-slova/42957-polovity.html" "Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.8.0.11) Gecko/20070312 Firefox/1.5.0.11; 360Spider 182.118.25.208 - - [06/Sep/2012:19:39:08 +0300] "GET /slovar/znachenie-slova/52614-rospryskaty.html HTTP/1.1" 200 100239 "http://www.hrinchenko.com/slovar/znachenie-slova/52614-rospryskaty.html" "Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.8.0.11) Gecko/20070312 Firefox/1.5.0.11; 360Spider </code></pre> <p>etc.</p> <p>How can I block this spider completely via robots.txt? Now my robots.txt looks like this:</p> <pre><code>User-agent: * Disallow: /cgi-bin/ Disallow: /tmp/ User-agent: YoudaoBot Disallow: / User-agent: sogou spider Disallow: / </code></pre> <p>I've added lines:</p> <pre><code>User-agent: 360Spider Disallow: / </code></pre> <p>but that does not seem to work. How to block this angry bot?</p> <p>If you offer to block it via .htaccess, so mind that it looks now like this:</p> <pre><code># Turn on URL rewriting RewriteEngine On # Installation directory RewriteBase / SetEnvIfNoCase Referer ^360Spider$ block_them Deny from env=block_them # Protect hidden files from being viewed &lt;Files .*&gt; Order Deny,Allow Deny From All &lt;/Files&gt; # Protect application and system files from being viewed RewriteRule ^(?:application|modules|system)\b.* index.php/$0 [L] # Allow any files or directories that exist to be displayed directly RewriteCond %{REQUEST_FILENAME} !-f RewriteCond %{REQUEST_FILENAME} !-d # Rewrite all other URLs to index.php/URL RewriteRule .* index.php/$0 [PT] </code></pre> <p>And, in spite of presence of </p> <pre><code>SetEnvIfNoCase Referer ^360Spider$ block_them Deny from env=block_them </code></pre> <p>this bot still tries to kill my VPS and is logged in access logs.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload