Named entity recognition model training method and device and information extraction method and device

A named entity recognition model training technology, applied in the field of information extraction, that addresses problems such as high cost, failure to extract information, and information extraction errors, and that achieves strong versatility.

Pending Publication Date: 2019-08-16
THE FOURTH PARADIGM BEIJING TECH CO LTD

AI Technical Summary

Problems solved by technology

However, the rule-based approach requires a great deal of manual work to formulate the expert extraction rules, so it is not fundamentally different from the costly and inefficient practice of having people look up the entities in announcements by hand.
In addition, a large amount of manual work usually involves omissions, so the formulated rules are not completely accurate. For example, once an error occurs in formulating the entity rules for a company, entity recognition goes wrong, and in practical applications information extraction either fails or produces errors.



Embodiment Construction

[0038] The present invention may have various modifications and various embodiments, and it should be understood that the present invention is not limited to these embodiments but includes all modifications, equivalents, and substitutions within its spirit and scope. For example, the orders of operations described herein are examples only and are not limited to those set forth herein; apart from operations that must occur in a particular order, the order may be changed in ways that will be clear after an understanding of the disclosure of the present application. Also, descriptions of features known in the art may be omitted for increased clarity and conciseness. The terms used in the exemplary embodiments of the present invention are used to describe the particular embodiments only, and not to limit the exemplary embodiments. The singular forms used herein are intended to include the plural forms as well, unless the context clearly dictates o...


Abstract

The invention provides a named entity recognition model training method and device and an information extraction method and device. For the training texts in a training text set, entities in the semi-annotation information of each training text are matched with entities in the corresponding training text; based on the matching result, effective named entity labels for the corresponding training text are obtained; a vector representation of each training text in the training text set is obtained; and based on the vector representation of each training text in the training text set and the effective named entity labels, a named entity recognition model is trained using deep learning to obtain a target named entity recognition model.
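
The matching step in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function name `label_from_semi_annotation`, the representation of semi-annotation information as a mapping from entity string to entity type, exact substring matching, character-level BIO tags, and the rule that a text with no matched entity yields no effective label.

```python
from typing import Dict, List, Optional


def label_from_semi_annotation(
    text: str, semi_annotation: Dict[str, str]
) -> Optional[List[str]]:
    """Return one BIO tag per character, or None if no annotated entity occurs in the text.

    Hypothetical helper: the BIO scheme and the "drop unmatched texts" rule
    are assumptions, not details confirmed by the source.
    """
    tags = ["O"] * len(text)
    matched = False
    # Try longer entities first so a short entity cannot claim part of a longer one.
    for entity, entity_type in sorted(
        semi_annotation.items(), key=lambda kv: -len(kv[0])
    ):
        if not entity:
            continue
        start = text.find(entity)
        while start != -1:
            end = start + len(entity)
            # Only tag spans that are still completely unlabeled.
            if all(tag == "O" for tag in tags[start:end]):
                tags[start] = "B-" + entity_type
                for i in range(start + 1, end):
                    tags[i] = "I-" + entity_type
                matched = True
            start = text.find(entity, start + 1)
    return tags if matched else None


if __name__ == "__main__":
    sample = "Acme Corp acquired Beta Ltd"
    labels = label_from_semi_annotation(
        sample, {"Acme Corp": "ORG", "Beta Ltd": "ORG"}
    )
    for char, tag in zip(sample, labels or []):
        print(repr(char), tag)
```

In this sketch, the texts for which the function returns a tag sequence would be the ones passed on, together with their vector representations, to the deep learning training step; how the patent actually decides which labels are effective is not specified in the abstract.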

Description

Technical field

[0001] The following description relates to the field of information extraction, and more particularly, to a named entity recognition model training method and device, and an information extraction method and device.

Background

[0002] Information extraction is now a common problem faced by many industries. For example, in industry, when business personnel deal with various business problems, articles of many kinds are important reference materials. Business personnel need to dig important information out of articles every day, but the massive volume of articles imposes an overwhelming load. Taking the work of the Shenzhen Stock Exchange as an example, a total of 265,985 announcements were disclosed in 2016, and 291,607 announcements were disclosed in 2017. As the number of listed companies grows, this number will also increase year by year. The incre...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06N3/04, G06N3/08
CPC: G06N3/08, G06F40/295, G06N3/045
Inventor: 李楚桐, 胡楠
Owner: THE FOURTH PARADIGM BEIJING TECH CO LTD