I'm engaged in cross-border electricity company, often need to obtain a competitor's sales page data, I know python can do the work, specific steps are:
1, by rival goods commodity code (ASIN) - this is we give the ahead of time, you can get a competitor's sales page links, format: "https://www.amazon.com/dp/" + ASIN
In our original form holds a list of such ASIN data
2, after entering a competitor's sales page, I want to get them the following data: the brand - the title - price - comments - star - q&a - category - BSR1 - node - BSR2 (press F12 can get these elements corresponding code)
3, will get all the data imported into a excel worksheets in
4, need to traverse all competitors ASIN
It is important to note that amazon has a strong anti climb mechanism, how to reflect climb? (I saw some similar python program by delay) it
Thank you very much for the teacher is willing to provide valuable help
If you can, later there are many such work need help
CodePudding user response:
A little bit more simple of the user-agent + Referer testA bit difficult to change the IP, decryption, identification authentication code
CodePudding user response: