久久久久久久视色,久久电影免费精品,中文亚洲欧美乱码在线观看,在线免费播放AV片

<center id="vfaef"><input id="vfaef"><table id="vfaef"></table></input></center>

<p id="vfaef"><kbd id="vfaef"></kbd></p>

<pre id="vfaef"><u id="vfaef"></u></pre>

<thead id="vfaef"><input id="vfaef"></input></thead>

<abbr id="fq2v2"></abbr>

<p id="fq2v2"></p>

當(dāng)前位置：站長(zhǎng)資訊網(wǎng) > 編程知識(shí) > 正文

用Python獲取Amazon亞馬遜的商品信息

2022-10-08 分類：編程知識(shí) 閱讀(771) 評(píng)論(0)

引言

亞馬遜網(wǎng)站相較于國(guó)內(nèi)的購(gòu)物網(wǎng)站，可以直接使用python的最基本的requests進(jìn)行請(qǐng)求。訪問(wèn)不是過(guò)于頻繁，在未觸發(fā)保護(hù)機(jī)制的情況下，可以獲取我們想要的數(shù)據(jù)。本次通過(guò)以下三部分簡(jiǎn)單介紹下基本爬取流程：

使用requests的get請(qǐng)求，獲取亞馬遜列表和詳情頁(yè)的頁(yè)面內(nèi)容
使用css/xpath對(duì)獲取的內(nèi)容進(jìn)行解析，取得關(guān)鍵數(shù)據(jù)
動(dòng)態(tài)IP的作用及其使用方法

一、獲取亞馬遜列表頁(yè)的信息

以游戲區(qū)為例：

用Python獲取Amazon亞馬遜的商品信息

程序員必備接口測(cè)試調(diào)試工具：立即使用
Apipost = Postman + Swagger + Mock + Jmeter
Api設(shè)計(jì)、調(diào)試、文檔、自動(dòng)化測(cè)試工具
后端、前端、測(cè)試，同時(shí)在線協(xié)作，內(nèi)容實(shí)時(shí)同步

獲取列表內(nèi)能獲取到的商品信息，如商品名，詳情鏈接，進(jìn)一步獲取其他內(nèi)容。

用requests.get()獲取網(wǎng)頁(yè)內(nèi)容，設(shè)置好header，利用xpath選擇器選取相關(guān)標(biāo)簽的內(nèi)容：


import requests   from parsel import Selector   from urllib.parse import urljoin       spiderurl = 'https://www.amazon.com/s?i=videogames-intl-ship'   headers = {       "authority": "www.amazon.com",       "user-agent": "Mozilla/5.0 (iPhone; CPU iPhone OS 10_3_3 like Mac OS X) AppleWebKit/603.3.8 (KHTML, like Gecko) Mobile/14G60 MicroMessenger/6.5.19 NetType/4G Language/zh_TW",   }   resp = requests.get(spiderurl, headers=headers)   content = resp.content.decode('utf-8')   select = Selector(text=content)   nodes = select.xpath("http://a[@title='product-detail']")   for node in nodes:       itemUrl = node.xpath("./@href").extract_first()       itemName = node.xpath("./div/h2/span/text()").extract_first()       if itemUrl and itemName:           itemUrl = urljoin(spiderurl,itemUrl)#用urljoin方法湊完整鏈接           print(itemUrl,itemName)
登錄后復(fù)制

此時(shí)已經(jīng)獲取的當(dāng)前列表頁(yè)目前能獲得的信息：

用Python獲取Amazon亞馬遜的商品信息

二、獲取詳情頁(yè)信息

進(jìn)入詳情頁(yè)：

用Python獲取Amazon亞馬遜的商品信息

進(jìn)入詳情頁(yè)之后，能獲得

贊(1)

標(biāo)簽：AI Amazon app CPU css list php python 亞馬遜程序員購(gòu)物購(gòu)物網(wǎng)站

相關(guān)推薦

網(wǎng)站地圖滬ICP備18035694號(hào)-2

滬公網(wǎng)安備31011702889846號(hào)