Method and device for establishing named entity labeling resource library, storage medium and computer device

A named entity resource library technology, applied in the field of information processing, that addresses the problem of manually labeled resource libraries being unable to meet the needs of named entity recognition or to adapt to large-scale application scenarios.

Status: Inactive
Publication Date: 2017-11-07
深圳市牛鼎丰科技有限公司

AI Technical Summary

Problems solved by technology

The resources in manually labeled resource libraries are very limited and are insufficient for large-scale application scenarios such as machine translation. Moreover, as society develops, new named entities such as organization names, movie names, product names, and book titles are constantly emerging, so manually labeled resource libraries fall far short of the needs of named entity recognition.

Embodiment Construction

[0065] To make the above objects, features, and advantages of the present invention easier to understand, specific implementations of the invention are described in detail below with reference to the accompanying drawings. Numerous specific details are set forth in the following description to provide a thorough understanding of the invention. However, the invention can be implemented in many ways other than those described here, and those skilled in the art can make similar improvements without departing from its spirit, so the invention is not limited by the specific implementations disclosed below.

[0066] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention. The terms used herein in the description of the present invention are for the purpose of descr...

Abstract

The invention relates to a method and device for establishing a named entity labeling resource library, a storage medium, and a computer device. In each round of iteration, a resource library is formed from a small seed bank and unlabeled texts taken from an unlabeled text set. The average utility value of each named entity in those unlabeled texts is calculated to generate the seed bank for the next round. That seed bank, together with further unlabeled texts, forms the resource library of the next round, from which the following seed bank is calculated. The calculation continues until all unlabeled texts have been processed, new named entities have been found, and the named entity labeling resource library has been generated. The method is computationally simple, produces results with a high degree of confidence, and is suitable for large-scale text processing. Because text data is unstructured and evaluating the effectiveness of unstructured data is generally difficult, the method also makes it possible to evaluate named entities in text quantitatively.
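The iteration described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: this excerpt does not define how the utility value is computed, so `cooccurrence_utility`, the `threshold` cutoff, the `batch_size` parameter, and the representation of each text as a list of candidate strings are all assumptions made only for the example.

```python
from collections import defaultdict


def cooccurrence_utility(candidate, text, known):
    """Hypothetical utility: share of the other candidates in this text
    that are already known entities. The source does not specify the
    actual utility function; this is a stand-in."""
    others = [tok for tok in text if tok != candidate]
    if not others:
        return 0.0
    return sum(tok in known for tok in others) / len(others)


def build_resource_library(seeds, unlabeled_texts, batch_size=1,
                           threshold=0.5, utility=cooccurrence_utility):
    """Grow a seed bank round by round until every text is processed.

    seeds           -- initial small set of known named entities
    unlabeled_texts -- list of texts, each a list of candidate strings
    batch_size      -- assumed number of texts consumed per round
    threshold       -- assumed cutoff on the average utility value
    """
    library = set(seeds)
    for start in range(0, len(unlabeled_texts), batch_size):
        batch = unlabeled_texts[start:start + batch_size]
        scores = defaultdict(list)
        for text in batch:
            # Score only candidates that are not yet in the seed bank.
            for cand in set(text) - library:
                scores[cand].append(utility(cand, text, library))
        # Average each candidate's utility over the texts of this round.
        new_seeds = {c for c, vals in scores.items()
                     if sum(vals) / len(vals) >= threshold}
        # The enlarged seed bank feeds the next round.
        library |= new_seeds
    return library


texts = [["Google", "Alphabet"], ["Alphabet", "Waymo"], ["banana", "apple"]]
result = build_resource_library({"Google"}, texts)
```

In this toy run, "Alphabet" is admitted in the first round because it co-occurs with the seed "Google", which in turn lets "Waymo" be admitted in the second round; the third text contains no known entity, so nothing from it is added.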

Description

Technical field

[0001] The invention relates to the technical field of information processing, and in particular to a method, device, storage medium, and computer device for establishing a named entity labeling resource library.

Background

[0002] A named entity is a person's name, an organization's name, a place name, or any other entity identified by a name. In a broad sense, named entities also include numbers, dates, currencies, addresses, and so on. Named Entity Recognition (NER) is one of the basic technologies of natural language processing and plays an important role in improving the performance of many natural language processing applications. At present, NER mainly relies on statistical models such as the Hidden Markov Model (HMM) and the Conditional Random Field (CRF). These statistical models require a large amount of annotated resources as a training set. A manually ...

Application Information

IPC(8): G06F17/27
CPC: G06F40/216, G06F40/295
Inventors: 秦兴德, 秦祎晗, 刘奕慧, 郭玮
Owner: 深圳市牛鼎丰科技有限公司