Method and system for recognizing feature lexical items in Chinese named entities

A method and system for recognizing feature terms in named entities, applied in the field of word recognition, that addresses the lack of methods for automatically recognizing feature terms.

Active Publication Date: 2008-02-06
INST OF COMPUTING TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

The prior art lacks methods for using computers to automatically identify feature terms in Chinese named entities.




Embodiment Construction

[0075] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0076] The basic idea of the method for identifying feature terms in named entities is as follows: an existing word segmentation program segments the named entities; a dictionary and a word context dictionary are created from the segmentation results; the segmented candidate named entities are then processed using the dictionary and the word context dictionary to obtain the feature terms in the named entities, and the dictionary is expanded according to the processing results; finally, the feature terms are returned to the user.
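As an illustration of the dictionary-building step, the following minimal Python sketch derives a word dictionary and a word context dictionary from entities that have already been segmented. The excerpt does not define what the word context dictionary stores, so recording each word's (left neighbor, right neighbor) pair is an assumption made here; the function name `build_dictionaries` and the boundary markers `<s>` and `</s>` are likewise hypothetical, and the segmenter itself is not shown.

```python
from collections import Counter, defaultdict


def build_dictionaries(segmented_entities):
    """Build a word dictionary and a word context dictionary.

    segmented_entities: list of token lists, one per named entity, as
    produced by some existing word segmentation program (not shown).
    The (left, right) neighbor representation of "context" is an
    assumption; the source excerpt does not specify it.
    """
    dictionary = Counter()            # word -> number of occurrences
    contexts = defaultdict(Counter)   # word -> Counter of (left, right) neighbors
    for tokens in segmented_entities:
        # Hypothetical boundary markers so first/last words also get a context.
        padded = ["<s>"] + list(tokens) + ["</s>"]
        for i in range(1, len(padded) - 1):
            word = padded[i]
            dictionary[word] += 1
            contexts[word][(padded[i - 1], padded[i + 1])] += 1
    return dictionary, contexts


# Example with three pre-segmented entities (illustrative data only).
entities = [["北京", "大学"], ["清华", "大学"], ["北京", "饭店"]]
dictionary, contexts = build_dictionaries(entities)
print(dictionary.most_common())
print(dict(contexts["大学"]))
```

Counting contexts per word keeps the lookup cheap when candidate entities are processed later; any richer notion of context would replace only the tuple stored in `contexts`.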

[0077] Before explaining the above in detail, we first review the formation rules and word formation methods of Chinese named entities, and summarize the grammatical features and word formation rules of the feature terms in named entities, ...



Abstract

The present invention provides a method for recognizing feature lexical items in Chinese named entities. The method includes: performing a segmentation operation on the named entity to be recognized to obtain a candidate named entity; performing an initial operation on the candidate named entity to obtain a first result; establishing a dictionary based on the candidate named entity and the first result, and establishing a word context dictionary based on the first result, the dictionary and the word context dictionary together being called the dictionary database; repeatedly performing compounding processing on the first result with reference to the dictionary database; after each round of compounding processing, expanding the dictionary database based on the processing result, the expanded dictionary database serving as the reference for the next round of compounding processing; and obtaining the recognized feature lexical items from the results of the multiple rounds of compounding processing. The present invention also provides a system for recognizing feature lexical items in Chinese named entities. Without relying on context, the present invention can recognize and interpret the feature lexical items of Chinese named entities, and can improve the accuracy of natural language understanding and information retrieval.
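The loop structure described in the abstract (compound, expand the dictionary database, reference the expanded database in the next round) can be sketched as below. The excerpt does not say how a compounding step decides what to merge, so this sketch swaps in a simple frequency-threshold merge of adjacent tokens, similar in spirit to byte-pair merging; that rule, the names `build_db`, `compound_once`, and `iterative_compounding`, and the parameters `min_count` and `max_rounds` are all assumptions for illustration. The final step of selecting feature lexical items from the compounded results is not specified in the excerpt and is left out.

```python
from collections import Counter


def build_db(entities):
    """Dictionary database: word counts plus adjacent-pair counts.

    Adjacent-pair counts stand in for the word context dictionary here;
    this representation is an assumption.
    """
    words, pairs = Counter(), Counter()
    for tokens in entities:
        words.update(tokens)
        pairs.update(zip(tokens, tokens[1:]))
    return {"words": words, "pairs": pairs}


def compound_once(tokens, db, min_count):
    """One left-to-right pass merging adjacent tokens that the database
    has seen together at least min_count times (hypothetical rule)."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and db["pairs"][(tokens[i], tokens[i + 1])] >= min_count:
            out.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def iterative_compounding(entities, min_count=2, max_rounds=10):
    """Repeat compounding, expanding the database after every round."""
    db = build_db(entities)
    for _ in range(max_rounds):
        merged = [compound_once(tokens, db, min_count) for tokens in entities]
        if merged == entities:      # nothing changed: stop iterating
            break
        entities = merged
        expansion = build_db(entities)
        db["words"] |= expansion["words"]   # Counter union keeps the max count
        db["pairs"] |= expansion["pairs"]
    return entities, db


# Illustrative data only: three pre-segmented entities.
entities = [["中国", "科学", "院"], ["上海", "科学", "院"], ["中国", "银行"]]
result, db = iterative_compounding(entities)
print(result)
```

In this toy run the pair "科学" + "院" occurs twice and is merged into "科学院", which is then added to the database before the next round; the loop ends once a round produces no change.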

Description

Technical field

[0001] The invention relates to word recognition in the fields of Chinese information processing and information retrieval, and in particular to a method for recognizing feature terms in named entities and a corresponding system.

Background art

[0002] Natural language processing is an important problem in computer science and artificial intelligence. It studies the theories and methods that enable effective communication between humans and computers in natural language. With the widespread use of computers and the Internet, the volume of natural language text that computers can process has grown to an unprecedented scale, and demand for text mining, information extraction, cross-language information processing, and human-computer interaction over massive amounts of information has grown rapidly. From small-scale constrained language processing to large-scale real text processing, its research will have a profound impact...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
Inventors: 曹馨宇, 曹存根, 岳小莉
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI