Fine-grained numerical information extraction method based on joint learning model

A numerical information and learning model technology, applied in biological neural network models, other database retrieval, electrical digital data processing, etc., can solve problems such as flexibility and adaptability to restrict performance, and achieve error propagation avoidance, high accuracy, and universal good chemical performance

Active Publication Date: 2020-06-19
NANJING UNIV
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Rule- and semantic-analysis-based methods leverage expert knowledge to provide good interpretability, but their limited flexibility and adaptability somewhat constrain their performance on diverse real texts

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fine-grained numerical information extraction method based on joint learning model
  • Fine-grained numerical information extraction method based on joint learning model
  • Fine-grained numerical information extraction method based on joint learning model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these embodiments are only for illustrating the present invention and are not intended to limit the scope of use of the present invention, after having read the present invention, those skilled in the art will understand each aspect of the present invention The modifications of all equivalent forms all fall within the scope defined by the appended claims of the present application.

[0029] Such as figure 1 As shown, the implementation of the present invention is to perform specific preprocessing on the input text to facilitate subsequent recognition at the word level, and then use methods such as regular expressions and rules to recognize all possible numerical trigger words. Based on the recognition results, select words within a certain window size before and after each numerical trigger word, and construct the input for the deep...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fine-grained numerical information extraction method based on a joint learning model. The fine-grained numerical information extraction method comprises the following steps:preprocessing an input text; candidate numerical values are recognized according to the numerical value correlation regular expression, and the numerical values serve as trigger words and are splicedwith front and back lexical examples of the trigger words to serve as input of a joint learning model; accessing the bottom layer of the joint learning model to a word embedding layer, and adding theposition information of a numerical trigger word to the output of the word embedding layer to obtain the final distributed representation of each lexical example; accessing a subsequent feature extraction network based on distributed representation, and respectively accessing networks of specific tasks at a high level: accessing a classification network to obtain a semantic type of numerical information, and accessing a sequence labeling network to obtain a plurality of semantic roles related to numerical trigger words; combining outputs of specific tasks to obtain a numerical information unit; for a plurality of numerical information units, the composite relationship between the numerical information units is judged based on a statistical method. The method does not need artificial designfeatures, and is high in accuracy and strong in generalization ability.

Description

technical field [0001] The invention belongs to the technical field of information extraction in the field of natural language processing, and in particular relates to a method for extracting fine-grained numerical information based on a joint learning model. Background technique [0002] With the rapid growth of World Wide Web data, people expect to obtain information more efficiently and comprehensively from all kinds of data, so as to use it in various applications such as intelligent question answering and search recommendation. Among them, text data has the characteristics of a large amount of data and a large amount of information, and has become a very important data source in information extraction. Scientific researchers and the industry are very concerned about the research of related technologies, and have defined some classic information extraction tasks, such as: named entity recognition, relationship extraction, event extraction, etc. [0003] In addition to t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/31G06F16/903G06F40/289G06F40/30G06F16/35G06N3/04
CPCG06F16/313G06F16/90344G06F16/353G06N3/044G06N3/045Y02D10/00
Inventor 于佳婕丁文韬瞿裕忠
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products