wp-includeswp-includes is a website for every WordPress fan – Tutorials, news and database all related to WordPress!

Web Crawler & User Agent Blocking Techniques

Sucuri August 14, 2020

wp-includeswp-includes is a website for every WordPress fan – Tutorials, news and database all related to WordPress!

This is a simple script that allows hackers to block specific crawlers based upon website requests from specific user-agents. This is useful when you don’t want certain traffic from being able to load certain content – usually a phishing page or a malicious download.

if(preg_match(‘/bot|crawler|spider|facebook|alexa|twitter|curl/i’, $_SERVER[‘HTTP_USER_AGENT’])) {
logger(“[BOT] {$_SERVER[‘REQUEST_URI’]} – 500”);

WordPress Agency for Development

header(‘HTTP/1.1 500 Internal Server Error’);
exit();
}

Using preg_match, the script looks for certain known crawler strings in the user-agent.

Continue reading Web Crawler & User Agent Blocking Techniques at Sucuri Blog.

Source: Sucuri

Web Crawler & User Agent Blocking Techniques

You May Also Like

Top 10 Security Tips to Keep Your WordPress Site Healthy

How to Perform a Website Security Audit ( with Checklist)

How to Create a Website Maintenance Plan & Contract