Asking for help: batch-crawling the annual reports in an Excel file with Python

Time:10-09

I'm a Python beginner. I've found plenty of scripts online, but they all fail with bugs when I run them.
I want to batch-download the company annual reports in an Excel file for recent years.

CodePudding user response:

Take a look at this https://www.cnblogs.com/insane-Mr-Li/p/9092619.html

CodePudding user response:

You need a target site first. Analyze its pages, find the node that holds the Excel download link, and download the file. Typically you use the requests library: request the page, locate the download node, extract the download link, then call requests.get on that link to fetch the file. For complex pages, use selenium + lxml + requests.
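
A minimal sketch of the steps in the reply above, assuming the download links are plain `<a href="...">` tags ending in `.xls` or `.xlsx`. The standard-library `html.parser` stands in for lxml here so the parsing part runs without extra installs. The `example.com` URLs, the sample HTML, and the names `ExcelLinkParser`, `find_excel_links`, and `download` are all made up for illustration; a real site will need its own selectors.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ExcelLinkParser(HTMLParser):
    """Collect href values of <a> tags that point at .xls/.xlsx files."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the file extension.
        if href and href.lower().split("?")[0].endswith((".xls", ".xlsx")):
            # Resolve relative links against the page URL.
            self.links.append(urljoin(self.base_url, href))


def find_excel_links(html, base_url):
    """Return absolute URLs of all Excel links found in the HTML text."""
    parser = ExcelLinkParser(base_url)
    parser.feed(html)
    return parser.links


def download(url, path):
    """Stream one file to disk with requests (not called in this sketch)."""
    import requests  # third-party: pip install requests

    with requests.get(url, stream=True, timeout=30) as resp:
        resp.raise_for_status()
        with open(path, "wb") as fh:
            for chunk in resp.iter_content(chunk_size=8192):
                fh.write(chunk)


# Hypothetical page content, standing in for requests.get(page_url).text
sample_html = """
<ul>
  <li><a href="/files/report_2019.xlsx">2019</a></li>
  <li><a href="about.html">About</a></li>
  <li><a href="https://example.com/data/report_2018.xls?v=2">2018</a></li>
</ul>
"""
links = find_excel_links(sample_html, "https://example.com/reports/index.html")
print(links)
```

With a real page you would pass `requests.get(page_url).text` to `find_excel_links` and then call `download` on each link. Pages that build their links with JavaScript will not expose them this way, which is where selenium comes in.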

CodePudding user response:

Basic process: simulate a login, collect the cookies returned after logging in, then send those cookies with the requests that fetch the Excel file.
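
The login-then-download flow above could look roughly like this, assuming the site uses an ordinary form login. A `requests.Session` keeps the cookies from the login response and sends them with later requests automatically. The function name `login_and_fetch`, the URLs, and the form field names are placeholders, not any real site's API.

```python
def login_and_fetch(session, login_url, credentials, file_url):
    """Log in with `session`, then fetch `file_url` using the same cookies.

    `session` is expected to behave like requests.Session: cookies set by
    the login response are stored on it and sent with later requests.
    """
    resp = session.post(login_url, data=credentials, timeout=30)
    resp.raise_for_status()

    resp = session.get(file_url, timeout=30)
    resp.raise_for_status()
    return resp.content  # raw bytes of the downloaded file


# Usage with the real library (hypothetical URLs and form field names):
#
#   import requests
#   with requests.Session() as session:
#       data = login_and_fetch(
#           session,
#           "https://example.com/login",
#           {"username": "me", "password": "secret"},
#           "https://example.com/files/report.xlsx",
#       )
#       with open("report.xlsx", "wb") as fh:
#           fh.write(data)
```

Many sites add a CSRF token or a captcha to the login form, in which case a plain POST like this will not be enough.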

CodePudding user response:

I have converted the Excel file to CSV and can read it successfully, but I don't really understand how to build the URLs for the downloads.
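
One way to approach the URL-building question, as a sketch only: read the codes from the CSV and substitute them into a URL pattern. The column name `code`, the sample rows, and `URL_TEMPLATE` are assumptions. The real pattern has to be found by looking at how the target site's download links are formed.

```python
import csv
import io

# Hypothetical pattern; replace it with the one the target site actually uses.
URL_TEMPLATE = "https://example.com/reports/{code}/{year}.pdf"


def read_codes(csv_file, column="code"):
    """Return the non-empty values of one column from an open CSV file."""
    reader = csv.DictReader(csv_file)
    codes = []
    for row in reader:
        value = (row.get(column) or "").strip()
        if value:
            codes.append(value)
    return codes


def build_urls(codes, years, template=URL_TEMPLATE):
    """Fill the template once for every (code, year) pair."""
    return [template.format(code=c, year=y) for c in codes for y in years]


# Sample data standing in for open("companies.csv", newline="", encoding="utf-8")
sample = io.StringIO("code,name\n000001,Company A\n600000,Company B\n")
codes = read_codes(sample)
urls = build_urls(codes, [2019, 2020])
for url in urls:
    print(url)
```

Reading the codes as text keeps leading zeros intact, which matters if the codes were numeric cells in the original spreadsheet.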

CodePudding user response:

Quoting water flows Dong spreading (2nd floor):
You need a target site first. Analyze its pages, find the node that holds the Excel download link, and download the file. Typically you use the requests library: request the page, locate the download node, extract the download link, then call requests.get on that link to fetch the file. For complex pages, use selenium + lxml + requests.


The page-analysis part is what I don't know how to write. Could you give an example?

CodePudding user response:

Quoting syl_222 (5th floor):
The page-analysis part is what I don't know how to write. Could you give an example?

Did you download your Excel file from the website? Or does the Excel file you are reading contain the companies' annual report information?

CodePudding user response:

Quoting water flows Dong spreading (6th floor):
Did you download your Excel file from the website? Or does the Excel file you are reading contain the companies' annual report information?

I want to download the annual reports for the companies in the Excel file. The Excel file was given to me.

CodePudding user response:

Quoting syl_222 (7th floor):
I want to download the annual reports for the companies in the Excel file. The Excel file was given to me.

Download what is in the Excel file? The company annual reports? You already have the Excel file, so this is about reading its contents. Use openpyxl to read and write Excel files. Please post a screenshot of the Excel file.
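
Reading a column with openpyxl might look like the sketch below. It assumes the company codes sit in the first column under one header row and are 6 digits wide. Both are guesses about the spreadsheet, as are the names `normalize_code` and `read_company_codes`. Excel often stores codes as numbers and drops leading zeros, so the helper pads them back.

```python
def normalize_code(value, width=6):
    """Turn a cell value into a zero-padded string code, or None if empty.

    `width=6` is an assumption about the code format.
    """
    if value is None:
        return None
    if isinstance(value, float) and value.is_integer():
        value = int(value)  # 600000.0 -> 600000
    text = str(value).strip()
    if not text:
        return None
    # Only pad purely numeric codes; leave anything else untouched.
    return text.zfill(width) if text.isdigit() else text


def read_company_codes(xlsx_path, column=1, header_rows=1):
    """Read one column of the active sheet and return cleaned codes."""
    from openpyxl import load_workbook  # third-party: pip install openpyxl

    wb = load_workbook(xlsx_path, read_only=True, data_only=True)
    try:
        ws = wb.active
        rows = ws.iter_rows(
            min_row=header_rows + 1,
            min_col=column,
            max_col=column,
            values_only=True,
        )
        codes = [normalize_code(row[0]) for row in rows]
    finally:
        wb.close()
    return [c for c in codes if c is not None]


print(normalize_code(1))         # numeric cell that lost its leading zeros
print(normalize_code(600000.0))  # numeric cell stored as a float
```

Calling `read_company_codes("companies.xlsx")` (a hypothetical file name) would then return the list of codes as strings.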

CodePudding user response:

Quoting water flows Dong spreading (8th floor):
Download what is in the Excel file? The company annual reports? You already have the Excel file, so this is about reading its contents. Use openpyxl to read and write Excel files. Please post a screenshot of the Excel file.

Part of the content is as follows (screenshot not preserved).

CodePudding user response:

So you want to read the information in this Excel file? What should the output be, and what do you want to save?

CodePudding user response:

Quoting water flows Dong spreading (10th floor):
So you want to read the information in this Excel file? What should the output be, and what do you want to save?

My rough idea: read the company codes from the Excel file, use each company code to look up the address of its annual report, then loop over the download links to download the files. Annual reports are generally PDFs, so saving them in that format is fine.
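
The plan described above could be wired together roughly as follows. This is a sketch under assumptions: `find_report_url` is a placeholder for whatever lookup the chosen site needs (a search request, an API call, or a URL pattern), and the function names and the `reports` output folder are invented for illustration.

```python
import os


def download_reports(codes, find_report_url, fetch, out_dir="reports"):
    """For each code: look up the report URL, fetch it, save it as a PDF.

    `find_report_url(code)` returns a URL and `fetch(url)` returns bytes.
    Both are passed in so the loop itself does not depend on any one site.
    Returns (saved_paths, failures), where failures holds (code, message).
    """
    os.makedirs(out_dir, exist_ok=True)
    saved, failed = [], []
    for code in codes:
        path = os.path.join(out_dir, f"{code}.pdf")
        if os.path.exists(path):
            saved.append(path)  # already downloaded on an earlier run
            continue
        try:
            url = find_report_url(code)
            data = fetch(url)
        except Exception as exc:
            # Record the failure and keep going with the next company.
            failed.append((code, str(exc)))
            continue
        with open(path, "wb") as fh:
            fh.write(data)
        saved.append(path)
    return saved, failed


def fetch_with_requests(url):
    """Download one URL and return its bytes (not called in this sketch)."""
    import requests  # third-party: pip install requests

    resp = requests.get(url, timeout=30)
    resp.raise_for_status()
    return resp.content
```

A real run would call `download_reports(codes, my_lookup, fetch_with_requests)`, where `my_lookup` is the site-specific function you write. Skipping files that already exist means an interrupted batch can simply be restarted.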

CodePudding user response:

Quoting syl_222 (11th floor):
My rough idea: read the company codes from the Excel file, use each company code to look up the address of its annual report, then loop over the download links to download the files. Annual reports are generally PDFs, so saving them in that format is fine.

In future, describe the problem clearly when you ask. You have an Excel file with some company information but no annual reports. The reports have to be looked up online from that company information and then downloaded. So: read the company information from your Excel file, find a site where the annual reports can be downloaded, and download the PDFs.