A method and a system for constructing a classification corpus by means of the Internet
Corpus and Internet technology, applied to neural learning methods, text database clustering/classification, and text database indexing, that addresses problems such as poor accuracy and disregard for web page layout
Embodiment Construction
[0028] Specific embodiments of the present invention are described in further detail below with reference to the accompanying drawings.
[0029] As shown in Figure 1, the present invention provides a method for constructing a dynamic classification corpus using an Internet corpus, comprising the following steps. S1, setting the target category: the user sets the target category and a number of initial keywords. For a target category A, n keywords are set, n ≥ 1, K = {k_1, k_2, ..., k_n}; the keywords are mainly the characteristic words found in information of this category. S2, setting information sources: the user provides several information sources, or the initial keywords of the target category are submitted to a search engine and the first N retrieval results are used as Internet information sources. Each information source includes a website address and several information source description keywords, a...
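Steps S1 and S2 can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `TargetCategory`, `InformationSource`, `set_information_sources`, and `top_n` are hypothetical, the search engine is represented by a caller-supplied function because the text names no specific engine, and seeding each retrieved source's description keywords with the category keywords is an assumption made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Sequence


@dataclass
class TargetCategory:
    """Step S1: a user-defined category with initial keywords K = {k_1, ..., k_n}."""
    name: str
    keywords: List[str]

    def __post_init__(self) -> None:
        # The description requires n >= 1 keywords per category.
        if len(self.keywords) < 1:
            raise ValueError("a target category needs at least one keyword")


@dataclass
class InformationSource:
    """Step S2: a website address plus keywords describing the source."""
    url: str
    description_keywords: List[str] = field(default_factory=list)


def set_information_sources(
    category: TargetCategory,
    user_sources: Optional[Sequence[InformationSource]] = None,
    search: Optional[Callable[[str], Sequence[str]]] = None,
    top_n: int = 10,
) -> List[InformationSource]:
    """Return user-provided sources, or the first N search results for the keywords."""
    # Option 1: the user supplies the information sources directly.
    if user_sources:
        return list(user_sources)
    if search is None:
        raise ValueError("provide user_sources or a search callable")
    # Option 2: submit the initial keywords as a query and keep the first N results.
    # `search` is a hypothetical stand-in that maps a query string to result URLs.
    query = " ".join(category.keywords)
    urls = list(search(query))[:top_n]
    # Assumption: seed the description keywords with the category keywords.
    return [InformationSource(url, list(category.keywords)) for url in urls]
```

For example, calling `set_information_sources(TargetCategory("A", ["football", "league"]), search=my_search, top_n=20)` with any function `my_search` that returns a list of URLs yields at most 20 `InformationSource` objects. Passing the search function as a parameter keeps the sketch independent of any particular search engine API.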


