Korean named entities recognition method based on maximum entropy model and neural network model

A neural network model and named entity recognition technology, applied in biological neural network models, neural learning methods, neural architectures, etc., can solve the problems of cumbersome process, complex model calculation process, poor generalization ability, etc.

Inactive Publication Date: 2017-11-24
GLOBAL TONE COMM TECH
View PDF3 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] To sum up, the problems existing in the existing technology are: the current named entity recognition process is cumbersome, expensive and poor in portability, the model calculation process is complicated, the generalization ability is poor, and it cannot handle unregistered words and other problems.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Korean named entities recognition method based on maximum entropy model and neural network model
  • Korean named entities recognition method based on maximum entropy model and neural network model
  • Korean named entities recognition method based on maximum entropy model and neural network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0063] The application principle of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0064] Such as figure 1 As shown, the Korean named entity recognition method based on maximum entropy and neural network model provided by the embodiment of the present invention includes the following steps:

[0065] S101: Construct a prefix tree dictionary, and when any template combining nouns and proper nouns matches in the input sentence, identify it as a target word;

[0066] S102: Obtain the target word through the target word selection module, search the target word from the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of named entities recognition, and discloses a Korean named entities recognition method based on a maximum entropy model and a neural network model. The method comprises the steps that a prefix tree dictionary is built, when any one combined noun template or any one proper noun template is matched in an input sentence, the combined noun template or the proper noun template are recognized into a target word; the target word is obtained in a target word selection module, the target word is searched in an entity dictionary, and when only one subclass is matched, the subclass serves as a tag of the target word; the maximum entropy model is adopted, and various linguistics information is utilized; a feedforward neural network model is constructed; adjacency words form an entity tag through a template selection rule. All data used in the method is extracted in a training corpus with tags and a field-independent entity dictionary, the data is very easily migrated to other application fields, and the performance cannot be reduced obviously.

Description

technical field [0001] The invention belongs to the technical field of named entity recognition, in particular to a Korean named entity recognition method based on maximum entropy and a neural network model. Background technique [0002] Named Entity Recognition (NER) is a basic task in the field of natural language processing. The named entity of its research subject generally includes 3 major categories (entity category, time category and number category) and 7 subcategories (person name, place name, organization name, time, date, currency and percentage). Time and digital entities can be identified through finite state machines, which is relatively simple. However, entity classes such as person names, place names, and organization names are open, and new named entities are constantly generated, and there are many ambiguities, which are difficult to solve by using the location method. Accurate labeling of named entity types often involves analysis at the semantic level, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/04
CPCG06N3/04G06F40/295G06N3/084G06F40/242G06F40/53G06N3/08G06F40/263
Inventor 程国艮李世奇
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products