Setting Request Headers on Scrapy Splash Requests

2022-05-05


1. Set it directly in the Lua script: splash:set_user_agent("Mozilla/5.0")
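As a minimal sketch of how that Lua call might be wired into a request, the snippet below embeds a small script in Python and builds the keyword arguments for scrapy-splash's `SplashRequest` with the `execute` endpoint. The helper name `splash_execute_kwargs`, the `user_agent` argument key, and the 0.5 second wait are assumptions made for illustration, not part of the original post.

```python
# Lua script run by Splash's "execute" endpoint.
# args.user_agent is a custom argument (an assumption for this sketch);
# args.url is filled in by scrapy-splash from the request URL.
LUA_SCRIPT = """
function main(splash, args)
    splash:set_user_agent(args.user_agent)
    assert(splash:go(args.url))
    assert(splash:wait(0.5))
    return splash:html()
end
"""


def splash_execute_kwargs(user_agent="Mozilla/5.0"):
    """Build keyword arguments for SplashRequest (hypothetical helper)."""
    return {
        "endpoint": "execute",
        "args": {"lua_source": LUA_SCRIPT, "user_agent": user_agent},
    }


# Inside a spider callback, usage would look roughly like:
#   from scrapy_splash import SplashRequest
#   yield SplashRequest(url, self.parse, **splash_execute_kwargs())
```

Passing the user agent through `args` rather than hard-coding it in the script keeps one script reusable across requests.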

2. Set them in a downloader middleware:

# UserAgent is assumed to come from the fake-useragent package.
from fake_useragent import UserAgent
from scrapy import signals


class HeaderMiddleware(object):

    @classmethod
    def from_crawler(cls, crawler):
        # This method is used by Scrapy to create your spiders.
        s = cls()
        crawler.signals.connect(s.spider_opened, signal=signals.spider_opened)
        return s

    def process_request(self, request, spider):
        # Overwrite the headers on every outgoing request.
        request.headers["user-agent"] = UserAgent().random
        request.headers["authority"] = 'www.ame.com'
        request.headers["accept-language"] = 'en;q=0.8'

    def spider_opened(self, spider):
        spider.logger.info('Spider opened: %s' % spider.name)

Then enable the middleware in the spider's custom_settings:
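The settings themselves are not shown here. As a minimal sketch, assuming the middleware lives in a hypothetical module `myproject.middlewares` and is given priority 543 (both are assumptions), the entry might look like the following. The scrapy-splash middlewares are repeated with the priorities from the scrapy-splash README, because a `DOWNLOADER_MIDDLEWARES` value in `custom_settings` replaces the project-level one instead of merging with it.

```python
# Hypothetical spider-level settings; assign this dict to the spider's
# custom_settings class attribute. The module path "myproject.middlewares"
# and the priority 543 are assumptions for illustration.
CUSTOM_SETTINGS = {
    "DOWNLOADER_MIDDLEWARES": {
        # Lower number = process_request runs earlier, so the headers
        # should already be set when SplashMiddleware handles the request.
        "myproject.middlewares.HeaderMiddleware": 543,
        "scrapy_splash.SplashCookiesMiddleware": 723,
        "scrapy_splash.SplashMiddleware": 725,
        "scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware": 810,
    },
}
```

In a spider this would be written as `custom_settings = CUSTOM_SETTINGS` in the class body.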

