A Method of Topic Crawling Based on Improved Shark Search
A topic crawler and topic technology, which is applied in the direction of network data indexing, network data retrieval, and other database retrieval, etc., can solve problems such as unsatisfactory retrieval results and unretrievable data, so as to reduce error rate and improve crawling Coverage, the effect of solving myopia problems
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0082] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various aspects of the present invention All modifications of the valence form fall within the scope defined by the appended claims of the present application.
[0083] A topic crawling method based on improved shark search, which constructs topic word vectors by introducing word vectors and topic models, and expands the semantics of words. Combining the semi-structured features of the webpage to improve the TF-IDF algorithm and extract the keywords of the webpage, the correlation between the webpage and the topic is transformed into the correlation between the webpage keywords and the subject words. On this basis, the webpag...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


