Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for sorting information of namesake persons on Internet

A person information, Internet technology, applied in the field of Internet data processing, can solve the problems of too many natural languages, reduce the performance of Internet servers, and it is difficult to distinguish the webpage content of different people, so as to improve performance, distinguish efficiency and accuracy. , the effect of reducing the processing burden

Inactive Publication Date: 2015-03-25
FUJITSU LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because the same person may be involved in different events, and natural language has a variety of expressions and many synonyms, it is difficult to distinguish the web content of different people in the existing technology if the name of the person is only distinguished by words. Come, so it is difficult for web pages belonging to the same person to be grouped together
[0004] Further, if the distinction of the character name information is not accurate enough, the character who needs to obtain the accurate character name information will have to submit the query request repeatedly, which will cause the Internet server to continuously respond to the repeated request submitted by the character, thus It also increases the data processing burden of the Internet server and reduces the performance of the Internet server

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for sorting information of namesake persons on Internet
  • Method and device for sorting information of namesake persons on Internet
  • Method and device for sorting information of namesake persons on Internet

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Embodiments of the present invention will be described below with reference to the drawings.

[0034] Aiming at the problems in the prior art, the embodiment of the present invention provides the first method for classifying information on people with the same name on the Internet, see figure 1 , which can include:

[0035] S101: For the input person name information, search for relevant webpages including the person name information.

[0036] In this embodiment, when the person's name information is input through the browser, it is necessary to use the person's name information as a query keyword to search through a search engine, so as to obtain a relevant webpage including the person's name information. Wherein, the specific implementation of the search engine does not affect the implementation of the embodiment of the present invention, for example: if the input character name is "Li Xiang", a large number of webpages containing the name "Li Xiang" will be obtained...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a method and device for sorting information of namesake persons on the Internet. The method comprises the following steps: for input person name information, searching relevant webpages including the person name information; respectively extracting person attributive characters and webpage subject characters of the relevant webpages; performing generalization respectively on the person attributive characters and the webpage subject characters by using a hyponymy dictionary and / or a synonymy dictionary; acquiring an initial relation result of the relevant webpages according to the generalized person attributive characters, and acquiring an initial clustering result of the relevant webpages according to the generalized webpage subject characters; and fusing the initial relation result and the initial clustering result to obtain a final sorting result of the relevant webpages. By the method and device for sorting information of namesake persons on the Internet, different relevant webpages including a same person name can be clustered more precisely and accurately, and thus a more accurate sorting result of an actual persons is obtained.

Description

technical field [0001] The present invention generally relates to the technical field of Internet data processing, and in particular to a method and device for classifying information on people with the same name on the Internet. Background technique [0002] With the development of the Internet, more and more characters use the Internet for communication or business negotiation, etc., so the character information resources on the Internet are extremely rich. However, due to the phenomenon of duplicate names in actual applications, the phenomenon of duplicate names on the Internet is becoming more and more serious. Therefore, it is very important to use the data processing method to distinguish these characters with the same name on the Internet. [0003] In the current prior art, the schemes for classifying person name information all adopt the method of word-based webpage clustering, that is, similar webpages containing the same person name are clustered. However, becaus...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 贾文杰张姝王新文夏迎炬于浩
Owner FUJITSU LTD