Named entity recognizing method based on mixed model

A named entity recognition and hybrid model technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as improving the difficulty of identifying complex named entities, and achieve the effect of improving recognition accuracy and recognition recall rate

Active Publication Date: 2017-05-10
NORTHEASTERN UNIV
View PDF1 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This phenomenon of nesting among named entities greatly ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity recognizing method based on mixed model
  • Named entity recognizing method based on mixed model
  • Named entity recognizing method based on mixed model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The specific implementation manners of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0028] A named entity recognition method based on a mixed model proposed by the present invention can significantly improve the recognition accuracy rate and recognition recall rate of person name entities, place name entities and organization name entities.

[0029] Recognition accuracy and recognition recall are used to evaluate the quality of named entity recognition results. Recognition accuracy refers to the ratio of the number of retrieved relevant documents to the total number of retrieved documents, which measures the precision rate of the retrieval system; recognition recall The rate refers to the ratio of the number of relevant documents retrieved to the number of all relevant documents in the document library, which measures the recall rate of the retrieval system; the F value is the weighted harmonic average of the re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a named entity recognizing method based on a mixed model. The method comprises the following steps: pre-processing; by virtue of a self-adaptive selecting mode, in a hidden Markov model and a conditional random field model, selecting a model with a relatively high F value as a self-adaptive statistic recognizing model, initially recognizing the named entity for a recognized corpus to obtain an initial named entity recognizing result; constructing a basic dictionary formed by a knowledge base and a recognizing rule library; by virtue of the basic dictionary, performing secondary recognition on the initial named entity recognizing result by adopting the self-adaptive static recognizing model, and analyzing the F value of the secondary recognizing result, and updating the basic dictionary; and constructing the mixed model based on the basic dictionary and the self-adaptive statistic recognizing model, recognizing the to-be-recognized corpus to obtain a person name entity, a place name entity and an institute name entity in the to-be-recognized corpus, supplementing the recognizing result into the knowledge base, and updating the basic dictionary for recognition next time. According to the method provided by the invention, the recognizing accuracy and the recognizing recall rate are remarkably improved.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a named entity recognition method based on a mixed model. Background technique [0002] With the application of emerging networks such as the Internet, cloud computing, mobile media, and the Internet of Things, a large number of Web2.0 technologies that create content for users have been born, making Web applications enter the era of big data. A series of Internet services such as search engines, e-commerce, and social networking sites Derivative business developed rapidly. Big data in the modern era has four characteristics, namely large data volume, diverse data structures, fast data generation, and high commercial value. With large amounts of data, not all information is useful data. This leads to the coexistence of a large amount of invalid data and valuable data. Therefore, in the era of big data, how to find valuable data from huge data se...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
CPCG06F40/295
Inventor 信俊昌贾大宇王国仁聂铁铮
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products