What I do is:
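    from urllib.parse import urljoin  # Python 2: from urlparse import urljoin
    ...
    def parse(self, response):
        ...
        # Strip whitespace from the extracted href, then resolve it against the page URL.
        urljoin(response.url, extractedLink.strip())
        ...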
Note the strip(): I sometimes run into odd links where the href value is wrapped in line breaks, for example:
<a href="
/MID_BRAND_NEW!%c2%a0MID_70006_Google_Android_2.2_7%22%c2%a0Tablet_PC_Silver/a904326516.html
">MID BRAND NEW! MID 70006 Google Android 2.2 7" Tablet PC Silver</a>