Method and system for identifying Chinese full name based on Chinese shortened form of entity

A Chinese and abbreviated technology, applied in the field of recognition based on priority functions, can solve problems such as high citation rate and inability to obtain the original language in the first time, and achieve the effect of improving accuracy and facilitating retrieval

Active Publication Date: 2007-12-26
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF0 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On a local scale, the page citation rate of "Xiangshan International Conference" and "Xiangshan Hotel" may be higher than that of "Xiangshan Park", which makes it impossible to obtain the most possible original language at the first time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for identifying Chinese full name based on Chinese shortened form of entity
  • Method and system for identifying Chinese full name based on Chinese shortened form of entity
  • Method and system for identifying Chinese full name based on Chinese shortened form of entity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0050] Before explaining the method of the present invention, the formation rules and word-forming methods of Chinese abbreviations are sorted out and summarized. According to the form of word formation, Chinese abbreviations are divided into abbreviations, truncated forms, condensed forms, combined forms and special forms:

[0051] Abbreviation: Select one or more morphemes of each participle in the original language to form an abbreviation. Such as "Peking University", "Institute of Biology", etc.;

[0052] Abridged type: select and retain one or more morphemes of the original participle, and delete the abbreviation formed by the rest of the secondary participles. Such as "Tsinghua University" and "World War II";

[0053] Condensation: For parallel words with common morphemes, the morphemes of different components are reduced and combined, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method for identifying Chinese full name according to Chinese short form of entity includes screening out candidate primitive set from normally used entry-bank according to abbreviation sentence to be identified, utilizing multi-path priority function combination to screen said set, calculate priority of candidate primitive, holding candidate primitive with high priority and seeking out one candidate primitive with highest priority as final result. The system used for realizing said method is also disclosed.

Description

technical field [0001] The invention relates to the abbreviation recognition technology in the fields of Chinese information processing and information retrieval, in particular to a recognition method for context-independent abbreviations based on a priority function. Background technique [0002] Natural language processing is an important problem in the field of computer science and artificial intelligence. It studies various theories and methods that can realize effective communication between humans and computers using natural language. With the widespread application of computers and the Internet, the number of natural language texts that can be processed by computers has increased unprecedentedly, and the demand for text mining, information extraction, cross-language information processing, and human-computer interaction for massive information has grown rapidly. From small-scale constrained language processing to large-scale real text processing, its research will ha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 卢汉曹存根岳小莉
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products