A Keyword-Based Method for Extracting Key Semantic Information from TCM Disease Texts

A technology of semantic information and extraction methods, which is applied in the field of natural language processing, can solve the problems of many network parameters and slow operation efficiency, and achieve the effect of many network parameters, slow operation efficiency and reduced labor costs

Active Publication Date: 2020-10-13
ZHEJIANG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Generally, entity recognition is based on deep learning, with many network parameters and slow operation efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Keyword-Based Method for Extracting Key Semantic Information from TCM Disease Texts
  • A Keyword-Based Method for Extracting Key Semantic Information from TCM Disease Texts
  • A Keyword-Based Method for Extracting Key Semantic Information from TCM Disease Texts

Examples

Experimental program
Comparison scheme
Effect test

experiment example

[0050] Assume that the content of TCM condition text A is as follows: the patient developed cough, fever, and no sputum a week ago without obvious incentives.

[0051] Obtained after the above S101 clause and word segmentation, TCM condition text A: The patient developed cough, fever, and no sputum a week ago without obvious incentives

[0052] After the above S102, the result of the dependency syntax tree of the Chinese medical condition text A is obtained, such as figure 2 shown.

[0053] Obtain the key word queue Q of TCM disease text A through above-mentioned S103 as follows: Q=[“cough”, “fever”, “sputum”]

[0054] When using the above S104 to search upwards, there will be:

[0055] The word "cough" searches upwards for "occurs"

[0056] The word "fever" searched upwards is empty

[0057] The term "coughing up" searched upwards for "none"

[0058] When using the above S105 to search downwards, there will be: "cough", "fever", and "expectation" are all empty.

[0059...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for extracting key semantic information of TCM disease texts based on keywords, comprising the following steps: (1) performing sentence segmentation and word segmentation processing on TCM disease texts; (2) generating dependent syntax for the sentence and word segmentation processing results (3) Initialize the keywords in the TCM disease text and generate a keyword queue; (4) Based on the dependency syntax tree, take any word in the keyword queue as the starting point, and search upward and downward in the dependency syntax tree Search, and the searched words are marked as key semantic information. In this method, keywords are used as a feature to extract key semantic information, and the final result is obtained through the dependency syntax tree.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a method for extracting key semantic information of TCM disease texts based on keywords. Background technique [0002] Chinese medicine is the characteristic of medical science in our country. Currently. my country has made good progress in the informatization of traditional Chinese medicine, laying a good foundation for the intelligentization of traditional Chinese medicine. The construction of TCM informatization is mainly reflected in two aspects: 1) TCM literature informatization; in the 1980s, more than 10 TCM books such as "Huangdi Neijing Suwen" and "Compendium of Materia Medica" realized digital retrieval; 2) Construction of basic database of traditional Chinese medicine. Since 1998, the team led by Professor Wu Zhaohui has united more than 30 Chinese medicine research institutions across the country, and through the efforts of nearly 30...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/279G06F16/31G06F16/335
CPCG06F40/279
Inventor 姜晓红陈广吴健吴朝晖
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products