Method and device for labeling pages

A page-labeling technology, applied in the Internet field, that addresses the high labor consumption, low accuracy, and low processing efficiency of manual labeling. It reduces labor consumption and improves labeling efficiency and accuracy.

Active Publication Date: 2017-11-28
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a method and device for labeling web pages. They address a defect of the prior art, in which web pages are labeled manually, resulting in high manpower consumption, low processing efficiency, and low accuracy.

Examples

Embodiment 1

[0057] As shown in Figure 1, in this embodiment of the present invention, the process of labeling a page is as follows:

[0058] Step 100: Determine the first keyword group and the category of the page to be labeled;

[0059] Step 110: Select the sub-tag library corresponding to the category of the page from the classification tag library. Each sub-tag library in the classification tag library includes the elements used to represent the attributes of that sub-tag library, and the element information corresponding to each element;

[0060] Step 120: For each keyword in the keyword group, check whether the element information included in the selected sub-tag library contains an entry identical to that keyword;

[0061] Step 130: Use the element information that is identical to a keyword as a label of the page, and mark the page with it.
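
Steps 110 to 130 can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function name `label_page`, the nested-dictionary layout of the tag library, and all sample entries are assumptions, and Step 100 (determining the keywords and category) is taken as already done.

```python
from typing import Dict, List, Set

# Assumed layout: category -> element -> set of element information.
TagLibrary = Dict[str, Dict[str, Set[str]]]


def label_page(keywords: List[str], category: str, library: TagLibrary) -> List[str]:
    """Return the keywords that equal some element information in the category's sub-tag library."""
    # Step 110: select the sub-tag library for the page's category.
    sub_library = library.get(category, {})
    # Collect all element information in the selected sub-tag library.
    all_info: Set[str] = set().union(*sub_library.values())
    # Steps 120-130: every keyword identical to an element information becomes a label.
    return [kw for kw in keywords if kw in all_info]


# Illustrative data only; none of these entries come from the patent text.
library: TagLibrary = {
    "movie": {"genre": {"comedy", "drama"}, "region": {"Hong Kong"}},
    "music": {"genre": {"jazz", "rock"}},
}
tags = label_page(["Hong Kong", "comedy", "review"], "movie", library)
print(tags)  # ['Hong Kong', 'comedy']
```

Matching here is exact string equality, which is the literal reading of "the same as" in Step 120; any normalization (case folding, synonyms) would be an extension beyond what the steps state.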

[0062] For web page texts in different categories, the corresponding tag libraries are also different. For example, the tag libraries of web pages that introduce mo...

Embodiment 2

[0128] Step 200: Use web crawler technology to generate a classification tag library;

[0129] In this step, the classification tag library includes a first sub-tag library corresponding to movies, a second sub-tag library corresponding to music, a third sub-tag library corresponding to news, and a fourth sub-tag library corresponding to travel. Each sub-tag library includes its elements, the element information included under each element, and the information entropy corresponding to each piece of element information;
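
One possible in-memory shape for such a classification tag library is sketched below. The class name `SubTagLibrary`, its `add` method, the element names, and the entropy values are all hypothetical placeholders. The visible text does not say how the crawler populates the library or how the information entropy is computed, so both are left out and the entropy is simply stored as a supplied number.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class SubTagLibrary:
    """One category's sub-tag library: element -> {element information -> information entropy}."""
    category: str
    elements: Dict[str, Dict[str, float]] = field(default_factory=dict)

    def add(self, element: str, info: str, entropy: float) -> None:
        # Store the entropy alongside the element information it belongs to.
        self.elements.setdefault(element, {})[info] = entropy


# The classification tag library maps each of the four categories to its sub-tag library.
classification_library: Dict[str, SubTagLibrary] = {
    name: SubTagLibrary(name) for name in ("movie", "music", "news", "travel")
}

# Placeholder entries and entropy values, for illustration only.
movie = classification_library["movie"]
movie.add("region", "Hong Kong", 0.8)
movie.add("genre", "comedy", 1.2)

print(movie.elements["genre"]["comedy"])  # 1.2
```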

[0130] Step 210: Determine the category of the page to be labeled and the corresponding first keyword group;

[0131] In this step, the determined category is movie, and the corresponding first keyword group includes five keywords: Hong Kong, Chinese, Chen XX, Jiang X, and comedy;

[0132] Step 220: For each of the five keywords, check whether the sub-tag library corresponding to movies contains element information identical to the keyword;
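
The per-keyword check of Step 220 might look like the sketch below, using the five keywords from this embodiment. The contents of `movie_sub_library`, the helper `find_match`, and the entropy values are invented for illustration, so which keywords happen to match here says nothing about the outcome described in the patent.

```python
from typing import Dict, Optional, Tuple

# Hypothetical movie sub-tag library: element -> {element information -> entropy}.
movie_sub_library: Dict[str, Dict[str, float]] = {
    "region": {"Hong Kong": 0.8},
    "language": {"Chinese": 0.5},
    "genre": {"comedy": 1.2},
}


def find_match(
    keyword: str, sub_library: Dict[str, Dict[str, float]]
) -> Optional[Tuple[str, float]]:
    """Return (element, entropy) for the element information equal to keyword, else None."""
    for element, infos in sub_library.items():
        if keyword in infos:
            return element, infos[keyword]
    return None


keywords = ["Hong Kong", "Chinese", "Chen XX", "Jiang X", "comedy"]
matches = {kw: find_match(kw, movie_sub_library) for kw in keywords}
for kw, hit in matches.items():
    print(kw, "->", hit)
```

Returning the element and entropy together with each hit keeps that information available for whatever a later step does with it.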

[0133] Step 230: Determine ...


Abstract

The invention discloses a method and a device for tagging pages. The method includes: determining the category and the keyword group of a page to be tagged; selecting the sub-tag library corresponding to the category of the page from a classification tag library, wherein each sub-tag library in the classification tag library includes elements used to express the attributes of the sub-tag library and the element information corresponding to each element; for each keyword in the keyword group, checking whether the element information included in the selected sub-tag library contains an entry identical to the keyword; and using the element information identical to the keyword as a tag of the page and tagging the page with it. Because the method requires no manual operation, manpower consumption is reduced and tagging efficiency and accuracy are improved.

Description

Technical field

[0001] The present invention relates to the field of Internet technology, and in particular to a method and device for labeling pages.

Background technique

[0002] With the globalization of networks and the popularization of personal computers, the Internet has become the main way to obtain information and to communicate. On the Internet, more and more information is presented in the form of web pages, and web page design is becoming increasingly complex and diverse. As a result, finding target information on the Internet takes longer.

[0003] To find a target web page among the large number of available web pages more conveniently and to improve search efficiency, web pages can be marked with tags. Once web pages are tagged, they can be classified, and searches can be performed based on the tags, which improves search efficiency and accuracy.

[0004] Existing methods of manual labeling are used when l...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
CPC: G06F16/955, G06F16/958
Inventor: 卢汉
Owner: HUAWEI TECH CO LTD