Unlock instant, AI-driven research and patent intelligence for your innovation.

Standard entity text determination method and device based on BiLSTM model and storage medium

A determination method and entity technology, applied to devices and storage media, in the field of standard entity text determination methods based on the BiLSTM model, can solve the problems of low applicability, low accuracy and low efficiency, achieve high consistency rate and improve efficiency , the effect of improving the accuracy

Active Publication Date: 2021-12-10
INST OF INFORMATION ON TRADITIONAL CHINESE MEDICINE CACMS
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of this, the embodiment of the present invention provides a method, device and storage medium for determining a standard entity text based on the BiLSTM model, to solve the existing candidate entity disambiguation method when determining the standard entity text in the standardization task of a medical entity, Its applicability is relatively low, the results obtained are far from the actual standard entity text, and the accuracy and efficiency of the standard entity text determination results are relatively low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Standard entity text determination method and device based on BiLSTM model and storage medium
  • Standard entity text determination method and device based on BiLSTM model and storage medium
  • Standard entity text determination method and device based on BiLSTM model and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] In the candidate entity recall stage, the candidate entity set is constructed based on the literal similarity of strings, text statistical features, and Elasticsearch search engine retrieval. Text matching at this stage is only equivalent to coarse screening. Non-standard text data has a variety of expressions. Different words in Chinese texts may have the same meaning, and some diagnostic original words with similar expressions have differences in word order. The text matching methods used in the coarse screening process have low accuracy and cannot meet our needs, so they can also be matched with semantic similarity. In the candidate entity disambiguation stage, using text semantic matching information can improve the quality of entity normalization.

[0056] At present, there are two main frameworks for semantic similarity matching based on deep learning. One is the Siamese twin network, which uses a parameter-sharing symmetric network to model input entity pairs, an...

Embodiment 2

[0139] Figure 9 It is a block diagram of a standard entity text determination device based on the BiLSTM model provided by the embodiment of the present invention. In this embodiment, the device is applied to figure 1 A standard entity text determination method based on the BiLSTM model is shown. The device includes at least the following modules:

[0140] A selection module 51, configured to select a corresponding candidate entity set for the received text entity to be matched;

[0141] The grouping module 52 is used for forming a text entity pair with the text entity to be matched for each candidate entity in the candidate entity set;

[0142] The feature vector module 53 is used for each text entity pair, using the preset neural matching neural network to calculate the first similarity feature vector of the text entity pair, and using the text statistical method and the fully connected network to calculate the second similarity of the text entity pair degree feature vec...

Embodiment 3

[0148] The embodiment of the present invention provides a standard entity text determination device based on the BiLSTM model, which is used for determining a standard entity text based on the BiLSTM model, such as Figure 10 As shown, the electronic device includes a processor 1001 and a memory 1002, wherein the processor 1001 and the memory 1002 can be connected through a bus or other methods, Figure 10 Take connection via bus as an example.

[0149] The processor 1001 can be a central processing unit (Central Processing Unit, CPU) or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), graphics processing units (Graphics Processing Unit, GPU), embedded neural network processing Neural-network Processing Unit (NPU) or other dedicated deep learning coprocessor, Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA) or other programmable logic Chips such as devices, discrete ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a standard entity text determination method and device based on a BiLSTM model and a storage medium. The method comprises the steps of selecting a candidate entity set corresponding to a received to-be-matched text entity for the to-be-matched text entity, for each candidate entity in the candidate entity set, forming a text entity pair with the to-be-matched text entity, for each text entity pair, calculating a first similarity feature vector of the text entity pair by adopting a preset nerve matching neural network, and calculating a second similarity feature vector of the text entity pair by adopting a text statistical method and a full connection network, adopting a splicing network to splice the first similarity feature vector and the second similarity feature vector of each text entity pair to form a similarity vector of each entity pair, and outputting the similarity of the two entity texts in each entity pair according to the similarity vector of each text entity pair, and determining the candidate text entity in the text entity pair with the highest similarity as a standard text entity corresponding to the to-be-matched text entity.

Description

technical field [0001] The present invention relates to the technical fields of natural language text information processing and medical big data mining, and in particular to a BiLSTM model-based standard entity text determination method, device and storage medium. Background technique [0002] The problem of entity name ambiguity exists in the process of natural language processing. The medical disease diagnosis record contains information such as the name of the main disease diagnosed by the patient, the name of the secondary diagnosis disease (that is, the name of the accompanying disease), and the operation for the diagnosis of the disease. For the same disease name, due to the variety of diseases and differences in doctor experience, there are often many different expressions for the same disease name, which brings great challenges to the standardization of medical electronic medical record data. However, since medical texts are mainly input by handwriting by doctors at...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F40/216G06F40/126G06N3/04G06K9/62G16H50/70
CPCG06F40/295G06F40/216G06F40/126G16H50/70G06N3/044G06N3/045G06F18/22
Inventor 文天才周雪忠诸强李明洋
Owner INST OF INFORMATION ON TRADITIONAL CHINESE MEDICINE CACMS