python中关于bs4获取链接的问题

2025-03-04 17:55:04
推荐回答(2个)
回答1:

import urllib2
import urllib
from lxml import etree
def tieba_spider(fullurl):
headers = {
"User-Agent":"Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.146 Safari/537.36"
}
request = urllib2.Request(fullurl,headers = headers)
html = urllib2.urlopen(request).read()

回答2:

获取a标签的href属性啊,也就是链接