Mesh-based similarity measurement method for medical literature collection

A similarity measurement and medical literature technology, applied in unstructured text data retrieval, text database clustering/classification, instruments, etc., can solve the problem of inability to mine hidden information similarity, achieve fast and efficient similarity calculation, The effect of saving human resources and accurate relationship mining

Active Publication Date: 2020-12-08
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In view of the above-mentioned shortcomings in the prior art, the MeSH-based similarity measurement method for medical literature sets provided by the present invention solves the problem that traditional methods only perceive the superficial meaning of documents and cannot mine the similarity of hidden information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mesh-based similarity measurement method for medical literature collection
  • Mesh-based similarity measurement method for medical literature collection
  • Mesh-based similarity measurement method for medical literature collection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The specific embodiments of the present invention are described below so that those skilled in the art can understand the present invention, but it should be clear that the present invention is not limited to the scope of the specific embodiments. For those of ordinary skill in the art, as long as various changes Within the spirit and scope of the present invention defined and determined by the appended claims, these changes are obvious, and all inventions and creations using the concept of the present invention are included in the protection list.

[0021] refer to figure 1 , figure 1 A flow chart showing a MeSH-based similarity measurement method for medical literature collections; as figure 1 As shown, the method 100 includes steps 101 to 106.

[0022] In step 101, search keywords related to diseases or genes are obtained. Taking cancer as an example, there are many types of cancer (BRCA, THCA, UCEC, BLCA...), some are caused by external factors, and some are caus...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for measuring the similarity of medical literature collections based on MeSH, which includes obtaining search subject words related to diseases or genes; retrieving documents related to the search subject words, and using the same search subject words to retrieve All documents form a subject term document set; using the weight value of the medical subject terms contained in each document in the subject term document set, the documents are mapped to the vector space to construct a MeSH space matrix; calculate the subject term document set A in the MeSH space matrix The weight value of the TCM subject term g; according to the weight value of the subject term document set in all medical subject terms, construct the vector formula of the subject term document set A; calculate the cosine similarity between the document set A and the document set B in the MeSH space matrix.

Description

technical field [0001] The invention relates to the calculation of similarity between documents, in particular to a method for measuring the similarity of medical document collections based on MeSH (Vetor Space Model, vector space model). Background technique [0002] The traditional method of calculating the similarity of medical literature collections is to convert the original medical literature data into the relationship between diseases and genes through manual calibration and record them in the database, and establish a genetic association database; The relationship of multiple genes creates a human disease network; and the disease-related gene network is obtained through data indicators such as eigenvector center and betweenness centrality. However, document relationship mining based on manual calibration requires energy to review and cannot meet the speed of new documents; semantic-based document mining involves natural language processing, and the amount of calculat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/194G06F40/216G06F16/35
CPCG06F40/194G06F40/216
Inventor 邹见效鲁文斌凡时财徐红兵
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products