Home > Back-end >  The couple do web crawler, ask everybody to give directions
The couple do web crawler, ask everybody to give directions

Time:10-19

I am an undergraduate, only learned c + +, now need to do a web crawler from BBS fetching information, and to grab the information according to the BBS section division, also need to be able to analyze the information, such as to find the word of the day,
Daniel, please give directions, I really don't know how to begin now,

CodePudding user response:

First using the HTTP protocol to related web page download
HTML file and then analysis, find out the inside of things of interest, such as BBS structure, key words, links, and so on

CodePudding user response:

Need to use regular expressions such search technology

CodePudding user response:

BCB6 support regular?

CodePudding user response:

Step by step, don't come up to ask a general question, solve the problem of download page first,

CodePudding user response:

CodePudding user response:

Every step has many problems to deal with, is a system of work, can not be solved a post is,
If you can't decompose the problem into pieces of small knowledge, and to consult technical reference or to BBS to ask questions, then obviously you assume a mission impossible,

CodePudding user response:

Web page is not have set the key words, web pages are usually want to search engine (such as: baidu, GOOGLE, etc.), included, will set itself some, you know the first, and then this part of the extracted in the web page should be about the same,
  • Related