My Scrapy spider uses a regular expression and fails with the error below. Could someone take a look at what is going wrong?
Spider code:
def parseContent(self, response):
    for sel in response.xpath('//div[@class="lefcon"]'):
        item = NovelspiderItem()
        content=sel.xpath('div[3]/div/p/text()').extract()
        item['content'] = re.search("正文:(.*?)</p",content,re.S).group(1)
        yield item
Error message:
File "D:\novelspider\novelspider\spiders\novspider.py", line 62, in parseContent
item['content'] = re.search("正文:(.*?)</p",content,re.S).group(1)
File "C:\Python27\lib\re.py", line 146, in search
return _compile(pattern, flags).search(string)
TypeError: expected string or buffer
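extract() returns a list, not a string, and re.search only accepts a string (or buffer), which is why you get the TypeError. Use .extract()[0] to pass the first extracted string instead.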
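To illustrate the fix without needing a running spider, here is a minimal sketch that uses a plain list of strings in place of what .extract() returns. The helper name first_match and the sample strings are made up for illustration. It also assumes the pattern is simplified to "正文:(.*)", because text() returns text nodes only, so the "</p" in the original pattern would probably never match. It joins all fragments rather than taking only [0], and it checks for None before calling group(1), since re.search returns None when nothing matches.

```python
# -*- coding: utf-8 -*-
import re


def first_match(fragments, pattern=u"正文:(.*)"):
    """Hypothetical helper: `fragments` stands in for the list from .extract()."""
    # re.search needs a single string, so join the list first.
    # (fragments[0] also works, but raises IndexError on an empty list.)
    text = u"".join(fragments)
    match = re.search(pattern, text, re.S)
    # re.search returns None when there is no match; guard before .group(1).
    return match.group(1) if match else None


# Made-up sample data standing in for sel.xpath(...).extract()
fragments = [u"正文:第一章", u" 内容"]
print(first_match(fragments))
```

In the spider this would be used roughly as item['content'] = first_match(content). If the page might not contain the marker, decide whether to skip the item or store None.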