Information extraction method and device based on pre-training model, equipment and storage medium

An information extraction and pre-training technology, applied in neural learning methods, biological neural network models, instruments, etc. It addresses problems such as word vectors that do not vary with context, poor adaptability, and reduced prediction accuracy of information extraction models, and provides stronger generalization ability, good prediction accuracy, and strong representation ability.

Pending Publication Date: 2021-02-05
SHANGHAI PUDONG DEVELOPMENT BANK

AI Technical Summary

Problems solved by technology

However, the word vectors obtained in this way do not vary with context, which is a particular problem for polysemous words: a polysemous word corresponds to a single fixed vector in the embedding matrix, but in practice the same word often has different meanings in different contexts.
In this wa

Embodiment Construction

[0056] To enable those skilled in the art to better understand the technical solutions in this specification, the technical solutions in the embodiments of this specification are described clearly and completely below with reference to the drawings in those embodiments. The described embodiments are only some of the embodiments in this specification, not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in this specification, without creative effort, shall fall within the protection scope of this specification.

[0057] This specification relates to information extraction techniques in natural language processing. It can be applied to any application scenario that needs to identify and extract specific elements from corpus texts. For example, in some embodiments of this specification, the information extraction technology may be applied to information extraction of fin...

Abstract

Embodiments of the invention provide an information extraction method, apparatus, device, and storage medium based on a pre-trained model. The method comprises obtaining a financial text to be processed, and extracting a target element from that financial text according to a preset information extraction model, where the information extraction model is obtained by training a context pre-training model. The embodiments of the specification can improve the prediction accuracy of the information extraction model.
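The two steps in the abstract (obtain a financial text, then extract target elements with a trained model) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the BIO tagging scheme, the `stub_tagger` lookup table, the label names `ORG` and `AMOUNT`, and the sample sentence are all assumptions made for the example. In practice the tagger would be a model fine-tuned from a context pre-training model.

```python
from typing import Callable, List, Tuple

# A tagger maps a token list to one BIO label per token.
# In a real system this would be the trained information extraction model.
Tagger = Callable[[List[str]], List[str]]

# Hypothetical lookup table standing in for model predictions.
_STUB_LABELS = {
    "Acme": "B-ORG",
    "Corp": "I-ORG",
    "5": "B-AMOUNT",
    "million": "I-AMOUNT",
}


def stub_tagger(tokens: List[str]) -> List[str]:
    """Placeholder tagger: looks tokens up in a fixed table, else 'O'."""
    return [_STUB_LABELS.get(token, "O") for token in tokens]


def decode_bio(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Turn token-level BIO tags into (element_type, text) pairs."""
    spans: List[Tuple[str, str]] = []
    current_type, current_tokens = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new element starts; close any element still open.
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = tag[2:], [token]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # Continuation of the element that is currently open.
            current_tokens.append(token)
        else:
            # 'O' or an inconsistent 'I-' tag: close the open element.
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = None, []
    if current_tokens:
        spans.append((current_type, " ".join(current_tokens)))
    return spans


def extract_elements(text: str, tagger: Tagger) -> List[Tuple[str, str]]:
    """Whitespace-tokenize the text, tag it, and decode the target elements."""
    tokens = text.split()
    return decode_bio(tokens, tagger(tokens))


elements = extract_elements("Acme Corp issued bonds worth 5 million", stub_tagger)
print(elements)
```

Passing the tagger in as a parameter keeps the decoding step independent of the model, so the stub can be swapped for a real model without changing `extract_elements`.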

Description

Technical field

[0001] This description relates to the technical field of natural language processing, and in particular to an information extraction method, apparatus, device, and storage medium based on a pre-trained model.

Background technique

[0002] Information Extraction (IE) mainly consists of extracting structured information from unstructured text, and it is a very important subfield of Natural Language Processing (NLP). Information extraction requires Named Entity Recognition (NER) on the content to be extracted, so NER is an essential part of information extraction. Its purpose is to identify named entities in the corpus (text), such as the names of people, places, and organizations.

[0003] At present, existing information extraction models generally use word embedding (Word Embedding) to produce word vectors during pre-training. However, the word vectors obtained in this way lack of chang...
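The limitation of a fixed embedding matrix can be illustrated with a toy example. In the sketch below, every value in `STATIC`, the two sample token lists, and the neighbour-averaging function `contextual_vector` are assumptions made for illustration. The averaging is only a crude stand-in for a contextual encoder, not the pre-training model described in this specification.

```python
from typing import List

# Toy static embedding matrix: exactly one fixed vector per word.
# The numbers are made up for the example.
STATIC = {
    "bank": [1.0, 0.0],
    "river": [0.0, 1.0],
    "loan": [1.0, 1.0],
}


def static_vector(tokens: List[str], index: int) -> List[float]:
    """Static lookup: the surrounding context is ignored entirely."""
    return STATIC[tokens[index]]


def contextual_vector(tokens: List[str], index: int, window: int = 1) -> List[float]:
    """Average the target word's vector with its neighbours' vectors.

    This is a deliberately simple stand-in for a contextual encoder: the
    result depends on the words around the target, not only on the target.
    """
    start = max(0, index - window)
    end = min(len(tokens), index + window + 1)
    vectors = [STATIC[token] for token in tokens[start:end]]
    dim = len(vectors[0])
    return [sum(vec[d] for vec in vectors) / len(vectors) for d in range(dim)]


geo = ["river", "bank"]      # "bank" as the side of a river
finance = ["loan", "bank"]   # "bank" as a financial institution

# The static lookup returns the same vector for both senses of "bank".
print(static_vector(geo, 1) == static_vector(finance, 1))

# The context-dependent vectors differ between the two sentences.
print(contextual_vector(geo, 1), contextual_vector(finance, 1))
```

With the static table, both senses of "bank" map to one vector, which is the problem the background section raises. Any representation that looks at the neighbouring words can separate them.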

Application Information

IPC(8): G06F40/30, G06F40/295, G06N3/08
CPC: G06F40/30, G06F40/295, G06N3/08
Inventor 杨洋, 万建伟, 潘仰耀, 张琛, 李锋, 万化
Owner SHANGHAI PUDONG DEVELOPMENT BANK