Method and device for labeling pages
A page labeling and page technology, applied in the Internet field, can solve the problems of high labor consumption, low accuracy and low processing efficiency, and achieve the effect of reducing labor consumption and improving efficiency and accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0057] See figure 1 As shown, in the embodiment of the present invention, a process of labeling a page is as follows:
[0058] Step 100: Determine the first keyword group and category of the page to be marked with a tag;
[0059] Step 110: Select a sub-tag library corresponding to the category of the page from the category tag library. Any sub-tag library in the category tag library includes each element used to represent the attribute of the sub-tag library from different elements, and each element Respectively corresponding element information;
[0060] Step 120: Check whether there is any element information that is the same as any keyword in the keyword group in the element information included in the selected subtag library;
[0061] Step 130: Mark the same element information as any keyword as the label of the page.
[0062] For web page texts in different categories, the corresponding tag libraries are also different. For example, the tag libraries of web pages that introduce mo...
Embodiment 2
[0128] Step 200: Use web crawler technology to generate a classification tag library;
[0129] In this step, the classification tag library includes a first subtag library corresponding to movies, a second subtag library corresponding to music, a third subtag library corresponding to news, and a fourth subtag library corresponding to travel. A subtag library includes each element, the element information included under each element, and the information entropy corresponding to each element information;
[0130] Step 210: Determine the category of the page to be labeled and the corresponding first keyword group;
[0131] In this step, the determined corresponding category is movie, and the corresponding first keyword group includes 5 keywords: Hong Kong, Chinese, Chen XX, Jiang X, comedy;
[0132] Step 220: For each of the 5 keywords, check whether there is element information that is the same as the keyword in the subtag library corresponding to the movie;
[0133] Step 230: Determine ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


