Nginx常用模块的使用(五):对IP地址和浏览器的限制
对浏览器进行限制
配置文件的修改
server {
listen 80;
server_name www.pp.com alias web.pp.com;
location / {
root html/pp.com;
index index.html index.htm;
}
if ($http_user_agent !~* Chrome) {
return 404;
}
}
对IP地址的限制
配置文件的修改
```bash
server {
listen 80;
server_name www.pp.com alias web.pp.com;
location / {
root html/pp.com;
index index.html index.htm;
}
if ($remote_addr ~* 192.168.0.1[0-9]) {
return 404;
}
}
对爬虫进行限制
配置文件的修改
server {
listen 80;
server_name www.pp.com alias web.pp.com;
location / {
root html/pp.com;
index index.html index.htm;
}
if ($http_user_agent ~* python-requests) {
return 404;
}
}
python爬虫伪装成其他浏览器
>>> import requests
>>> headers = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/61.0.3163.100 Safari/537.36"}
>>> result = requests.get("http://www.pp.com", headers=headers)
>>> result.text
|