The code I wrote below misses a lot of cases; I'd appreciate it if anyone could suggest a more comprehensive approach:
import re
reg = r'^https?:\/\/([a-z0-9\-\.]+)[\/\?]?'
url = 'http://.com/blog/biu/1190000000330941'
m = re.match(reg, url)
uri = m.groups()[0] if m else ''
print(uri[uri.rfind('.', 0, uri.rfind('.')) + 1:])
# Output: .com
Python has a built-in module dedicated to URL parsing, and it lets you specify a default scheme yourself:
https://docs.python.org/2.7/library/urlparse.html?highlight=urlparse#module-urlparse
Python 3.x has it too, as urllib.parse.
try:
    from urllib.parse import urlparse  # Python 3
except ImportError:
    from urlparse import urlparse  # Python 2
url = 'http://.com/blog/biu/1190000000330941'
r = urlparse(url)
print(r)
Output:
ParseResult(scheme='http', netloc='.com', path='/blog/biu/1190000000330941', params='', query='', fragment='')
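To get from the parse result back to what the question's code was after (the last two labels of the host), something like the following might work. This is only a rough sketch: the function name last_two_labels and the example.com URL are made up for illustration, and it assumes that "the last two labels" is really what you want. That assumption breaks for multi-part suffixes such as .co.uk, which would need a lookup against the Public Suffix List.

```python
try:
    from urllib.parse import urlparse  # Python 3
except ImportError:
    from urlparse import urlparse  # Python 2


def last_two_labels(url):
    """Return the last two dot-separated labels of the URL's host, or ''.

    Hypothetical helper for illustration; it is not part of urlparse.
    """
    # .hostname is lowercased and has any user info and port stripped;
    # it is None when the URL has no network location.
    host = urlparse(url).hostname or ''
    # Drop empty labels caused by leading or trailing dots.
    labels = [part for part in host.split('.') if part]
    return '.'.join(labels[-2:])


# Made-up URL, used only to show the call.
print(last_two_labels('https://User@Blog.Example.com:8080/a?b=1'))  # example.com
```

Compared with the regex, this also copes with uppercase hosts, ports and user info, because urlparse has already split those out.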