http://www.xxx.com/test/test.html
http://www.xxx.com/test/test2.html
http://www.xxx.com/ok/ok.html
http://www.xxx.com/ok/ok2.html
http://www.xxx.com/ok/ok3.html
http://www.xxx.com/try/try.html
http://www.xxx.com/try/try2.html
http://www.xxx.com/try/try3.html
http://www.xxx.com/trial/trial.html
http://www.xxx.com/trial/trial2.html
……
假设文件里有这些url,需要匹配非test和非ok的其他地址(也就是除了前五条以外的其他地址),请问python怎么实现?
import re
text = """
http://www.xxx.com/test/test.html
http://www.xxx.com/test/test2.html
http://www.xxx.com/ok/ok.html
http://www.xxx.com/ok/ok2.html
http://www.xxx.com/ok/ok3.html
http://www.xxx.com/try/try.html
http://www.xxx.com/try/try2.html
http://www.xxx.com/try/try3.html
http://www.xxx.com/trial/trial.html
http://www.xxx.com/trial/trial2.html
"""
findList = \
re.findall("http:\/\/www\.xxx\.com\/(?!test)(?!ok)[a-z]*\/[a-z0-9]*\.html",
text)
for string in findList:
print(string)