http://gzf.zfcxjw.cq.gov.cn:9090/site/cqgzf/queryresultpublic/applicationresultdetail/30
CodePudding user response:
Acquisition or crawl? Collection need interface, crawl more violenceCodePudding user response:
CodePudding user response:
WebclientCodePudding user response:
WebBrowser control simulating landing, landing information, assembly parameters, request to get the data, test data, deposited in the DBCodePudding user response:
The more difficult ah, layman don't understandCodePudding user response:
Is regular or watch TABLE or DIV, take TABLE data easily,CodePudding user response:
The second floor positive solution... Sites with json is simplerCodePudding user response:
The WebBrowserCodePudding user response:
To catch the first step to open the web page F12 data refresh the XHR...The results I didn't think.. The site instantly what all need not you can see a matched with "web data interface. The.
http://gzf.zfcxjw.cq.gov.cn:9090/site/cqgzf/queryresultpublic/getSqshjgAction
Look at the parameter
IsInit: true
PageNumber: 1
The prefix:
Xm:
Cnumber:
SQPQ:
Hx:
Code:
TableName: querynow28
Paging problem, it should not.
Then look at the data structure is simple...
The dataList: [,...].
MessageId: "success"
NoData: "N"
PageArray: [1, 2, 3, 4, 5, 6, 7, 8]
PageNumber: 1
TotalPage: 1871
So using webclient donwloadstring submitted once namevalue. After get json deserialized to go..
A few lines of code to.
CodePudding user response:
This are not collected, change the page number directly imported into your own applicationCodePudding user response:
First analysis of string, then extracted with regular HTML data