Entity relationship mining method based on biomedical literature

A biomedical and entity-relationship technology, applied in the fields of healthcare informatics, informatics, medical reporting, etc., can solve the problems of limited development, huge training data sets of deep learning methods, high cost of biomedical text training integration, and achieve optimal extraction order, the effect of good sorting effect

Active Publication Date: 2020-07-17
ZHEJIANG UNIV
View PDF4 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In recent years, deep learning models have achieved relatively good results in biomedical text mining tasks, but deep learning methods require huge training data sets
Due to the high cost of building a large biomedical text training set, the development of deep learning for biomedical text mining is limited.
Therefore, the current disease-related databases are generally collected manually and based on templates. They fail to make full use of deep learning models to mine entity relationships, and rely heavily on complex feature engineering of machine learning.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity relationship mining method based on biomedical literature
  • Entity relationship mining method based on biomedical literature
  • Entity relationship mining method based on biomedical literature

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0041] like figure 1 As shown, an entity relationship mining method based on biomedical literature includes: biomedical literature data acquisition, biomedical entity recognition, and entity relationship mining.

[0042] Preprocess the biomedical literature downloaded from public databases. Articles with categories matching appendices, errata, or retractions were discarded, and articles with abstracts that were too long or too short were removed. Some articles have redundant html tags, journal information, and experimental registration information. We use a rule-based method to delete these redundant and invalid information. Merge the title and abstract information of each document as raw unst...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an entity relationship mining method based on biomedical literature, which comprises the following steps: (1) querying disease-related biomedical literature in a public database, and obtaining biomedical text data after data preprocessing; (2) performing biomedical named entity recognition on the obtained biomedical text data in combination with a regular matching templateand a deep learning model; and (3) based on the entity recognition result, mining the entity relationship by adopting transfer learning and reinforcement learning methods. The biomedical nouns entities in the literature can be effectively identified by acquiring the biomedical literature related to the disease from the network, extracting the abstract and the title and carrying out entity recognition and relationship mining, and the hidden relationship among various entities can be mined.

Description

technical field [0001] The invention belongs to the technical field of text data mining, in particular to an entity relationship mining method based on biomedical literature. Background technique [0002] With the rapid development of biomedical technology, the amount of biomedical literature is currently exploding at an unprecedented rate. Biomedical researchers are faced with massive literature databases, and effective information acquisition has become an arduous task. Non-coding RNA and protein-coding genes are important objects in disease research. The potential relationship between genes, non-coding RNAs, proteins and diseases revealed in the research results can help biologists more effectively explore the mysteries of life generation, health maintenance and disease treatment. Most of the current databases mined from biomedical literature are manually compiled by domain experts. However, in the face of the exponentially increasing number of documents, there are gre...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/33G06F40/279G06F40/211G16H15/00
CPCG06F16/35G06F16/334G16H15/00G16B50/10G06F40/295G16H50/70
Inventor 陈铭陈琦周银聪胡大辉吴文怡
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products