一、爬虫
data:image/s3,"s3://crabby-images/56d12/56d128e800f83ca6d0fff863561926b4af99f05a" alt="在这里插入图片描述" data:image/s3,"s3://crabby-images/0ced6/0ced65c1dd6b1fa50cc416c9cbbb99fcc7eb8bc6" alt="在这里插入图片描述"
二、Requests方法
1、requests.get()
import requests
res = requests.get('URL')
data:image/s3,"s3://crabby-images/3c11a/3c11a06d227926766ea2dadcfbfb87b0bc191432" alt="在这里插入图片描述"
2、案例:用requests.get下载小说《三国演义》
import requests
res = requests.get('https://localprod.pandateacher.com/python-manuscript/crawler-html/sanguo.md')
novel=res.text
print(novel[:800])
3、Response对象常用属性
data:image/s3,"s3://crabby-images/ee91c/ee91cbb814b5fc119b5492e6e4c4e9de612b826f" alt="在这里插入图片描述"
①、常见响应状态码
data:image/s3,"s3://crabby-images/a2f60/a2f60d6b53da110360e6f17bda0f58e9129e5b74" alt="在这里插入图片描述"
②、response.content
data:image/s3,"s3://crabby-images/b1c07/b1c078b44034db30096ec0910a1de6a827625cd3" alt="在这里插入图片描述"
import requests
res = requests.get('https://res.pandateacher.com/2018-12-18-10-43-07.png')
pic=res.content
photo = open('ppt.jpg','wb')
photo.write(pic)
photo.close()
③、将小说保存到本地成txt
import requests
res = requests.get('https://localprod.pandateacher.com/python-manuscript/crawler-html/sanguo.md')
novel=res.text
k = open('《三国演义》.txt','a+')
k.write(novel)
k.close()
4、总结
data:image/s3,"s3://crabby-images/f5c33/f5c33aee8806800fdd9a5bbd28f0d003667244a0" alt="在这里插入图片描述"
三、爬虫伦理(Robots协议)
data:image/s3,"s3://crabby-images/4c55b/4c55b0dffb914b4d561d7cd7b86bd0db26562352" alt="在这里插入图片描述" data:image/s3,"s3://crabby-images/27651/27651dd628974c8517474952c4bec31fe04a0c76" alt="在这里插入图片描述"
|