Text mining method based on online medical question and answer information

A text mining and medical technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as disease question and answer information that has not seen relevant research.

Active Publication Date: 2015-10-07
NANKAI UNIV
View PDF0 Cites 52 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, in the medical field, there have been a lot of entity recognition work on electronic medical records, various medical reports, med...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text mining method based on online medical question and answer information
  • Text mining method based on online medical question and answer information
  • Text mining method based on online medical question and answer information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] 101: Use the network data extraction method based on DOM and webpage templates to extract disease question and answer information from the acquired original webpage;

[0044] 102: Perform medical named entity recognition through the characteristics of the conditional random field model in the extracted disease question and answer information;

[0045] 103: Mining medical entity relationships through medical named entity recognition.

[0046] Before step 101 adopts the network data extraction method based on DOM and web page template to extract disease question and answer information from the acquired original web page, the text mining method also includes:

[0047] In the face of public web data, research medical related websites, analyze and determine the specific situation of crawling links and webpage data scale, and then use web crawlers to crawl webpage data.

[0048] Wherein, in step 101, the step of extracting disease question and answer information from the obt...

Embodiment 2

[0074] 201: Acquisition and content extraction of online disease question and answer data;

[0075] Since the medical information analysis and mining method is mainly oriented to online medical question answering, it tries to structure the medical knowledge contained in the question answering data. Therefore, the first task is to obtain the network disease question answering data.

[0076] The specific steps are as follows: Facing the public web data, first investigate medical related websites, analyze and determine the specific situation of crawling links and webpage data scale, and then use web crawlers to crawl webpage data. Subsequently, the network data extraction method based on DOM and web page templates was used to extract disease question and answer information from the acquired original web pages.

[0077] Among them, see figure 2 , network data extraction based on DOM and web page template mainly includes the following steps:

[0078] 1) Analyze the characteristi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text mining method based on online medical question and answer information. The text mining method comprises the following steps of: extracting disease question and answer information from an obtained original webpage by adopting a network data extracting mode based on DOM and a webpage template; carrying out medical named entity identification in the extracted disease question and answer information by virtue of characteristics of a conditional random field model; and mining a medical entity relationship by virtue of the medical named entity identification. The method can be used for effectively obtaining a potential association relationship among various entities. The method is suitable for mining work of all disease classes, and has certain expandability.

Description

technical field [0001] The invention relates to the field of text mining, in particular to a text mining method based on online medical question-and-answer information. Background technique [0002] Recently, with the rapid development of the Internet, various social media have sprung up. In the health-related medical field, many online disease question-and-answer websites have emerged, which provide patients with more diversified channels for obtaining medical information. These websites mainly focus on health knowledge, disease information, medical news, etc., and also provide users with an online disease question-and-answer function. In China, well-known websites such as Sina Health, Xunyiwenyao, Haodafu Online, and 39 Asking Doctors contain a lot of disease question-and-answer information, but these question-and-answer information are in an unstructured state in the text. In order to make full use of question-answering information and extract and mine useful medical kno...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/00G06F17/30
Inventor 刘杰苏娅黄亚楼
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products