Method for screening risk factors of sporadic colorectal adenoma based on directional weighted association rule model

A risk factor screening method that applies weighted association rule technology to medical data analysis. It addresses three shortcomings of existing risk factor analysis: reliance on a single method, incomplete results, and risk factors that are easily overlooked.

Active Publication Date: 2020-10-30
SHANGHAI MARITIME UNIVERSITY

AI Technical Summary

Problems solved by technology

However, existing risk factor analysis relies on too narrow a set of methods. These traditional methods work reasonably well for single-factor analysis, but they are incomplete: risk factors that occur with low probability yet are highly important are easily missed.



Examples


Embodiment 1

[0043] Embodiment 1. Referring to Figure 1, a risk factor screening method for sporadic colorectal adenoma based on a directional weighted association rule model includes the following specific steps:

[0044] S1. Preprocess the colorectal adenoma data: delete irrelevant data, redundant information, feature columns with more than 50% missing values, and dirty records with obvious abnormalities. The resulting standard dataset contains 234 cases, of which 62 were diagnosed with colorectal adenoma.
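
The cleaning rules in S1 can be sketched roughly as follows. This is only an illustration under stated assumptions: records are plain dictionaries, `None` marks a missing value, and the function names and sample columns (`age`, `smoker`, `rare_lab`, `adenoma`) are hypothetical and do not come from the patent.

```python
from typing import Dict, List, Optional

Row = Dict[str, Optional[object]]  # assumption: None marks a missing value


def drop_sparse_columns(rows: List[Row], max_missing: float = 0.5) -> List[Row]:
    """Remove feature columns whose share of missing values exceeds max_missing."""
    if not rows:
        return []
    n = len(rows)
    keep = [
        col for col in rows[0]
        if sum(1 for r in rows if r.get(col) is None) / n <= max_missing
    ]
    return [{col: r.get(col) for col in keep} for r in rows]


def drop_duplicate_rows(rows: List[Row]) -> List[Row]:
    """Remove exact duplicate records, keeping the first occurrence."""
    seen = set()
    unique = []
    for r in rows:
        key = tuple(sorted(r.items(), key=lambda kv: kv[0]))
        if key not in seen:
            seen.add(key)
            unique.append(r)
    return unique


# Hypothetical toy records, not data from the patent.
records = [
    {"age": 61, "smoker": 1, "rare_lab": None, "adenoma": 1},
    {"age": 54, "smoker": 0, "rare_lab": None, "adenoma": 0},
    {"age": 54, "smoker": 0, "rare_lab": None, "adenoma": 0},  # redundant copy
    {"age": 47, "smoker": None, "rare_lab": 3.2, "adenoma": 0},
]
cleaned = drop_duplicate_rows(drop_sparse_columns(records))
```

Here `rare_lab` is missing in three of four records, so it is dropped, while `smoker` (one missing value) is kept. Detecting "obviously abnormal" values depends on domain-specific ranges that the text does not give, so that step is left out of the sketch.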

[0045] Referring to Figure 2, feature selection is performed using the random forest mean decrease in impurity.

[0046] (1) Calculate the information entropy of the original data, and the initial information entropy is:

[0047]

[0048] (2) Compute the information entropy H2 after splitting on a feature, taking the splits on features 7 and 24 as examples:

[0049] H2 (classified by feature 7) = 0.8283984298779227;

[0050] H2 (classified by feature 24) = 0.79...
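
The entropy quantities in steps (1) and (2) can be sketched with the standard Shannon definitions. This is an assumption-laden illustration: it treats the 62 adenoma cases out of 234 from [0044] as the class distribution for the initial entropy, and the toy feature columns are hypothetical. It does not reproduce the patent's elided formula or its per-feature values.

```python
import math
from collections import Counter
from typing import Dict, Hashable, List, Sequence


def entropy(labels: Sequence[Hashable]) -> float:
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def conditional_entropy(feature: Sequence[Hashable],
                        labels: Sequence[Hashable]) -> float:
    """Entropy of the labels after splitting on a feature, weighted by group size."""
    n = len(labels)
    groups: Dict[Hashable, List[Hashable]] = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    return sum(len(g) / n * entropy(g) for g in groups.values())


def information_gain(feature: Sequence[Hashable],
                     labels: Sequence[Hashable]) -> float:
    """Reduction in label entropy obtained by splitting on the feature."""
    return entropy(labels) - conditional_entropy(feature, labels)


# Assumed class distribution: 62 adenoma cases among 234 records ([0044]).
labels = [1] * 62 + [0] * 172
initial_entropy = entropy(labels)

# Hypothetical features: one perfectly informative, one constant.
perfect_feature = list(labels)
constant_feature = ["same"] * len(labels)
gain_perfect = information_gain(perfect_feature, labels)
gain_constant = information_gain(constant_feature, labels)
```

A split is more useful the further its conditional entropy falls below the initial entropy, which is how information gain ranks candidate split nodes.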


Abstract

The invention discloses a method for screening risk factors of sporadic colorectal adenoma based on a directional weighted association rule model, and belongs to the field of data mining. The method comprises the following steps: first, the data are preprocessed; second, features are selected using the random forest mean decrease in impurity, with information gain used to determine the optimal split node, yielding an optimal feature set; then, the optimal feature set is input into the directional weighted association rule model to generate strong association rules; finally, the risk factors contained in the strong association rules are added to a risk factor set and reviewed with experts. Compared with the prior art, the method screens colorectal adenoma risk factors with a directional weighted association rule model, confirms the significance of lifestyle and dietary habits in the etiology of colorectal adenoma, identifies high-risk factors not found in previous research, and provides a reference method for discovering colorectal adenoma risk factors.
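
The rule-generation step can be illustrated with a small sketch. The visible text does not define the model's weighting scheme, so everything here is an assumption: "directional" is read as fixing the rule consequent to the outcome (`adenoma`), weighted support is taken as the mean item weight of the antecedent times the fraction of records containing antecedent and outcome, and the function name, weights, thresholds, and toy records are all hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple

Rule = Tuple[Tuple[str, ...], float, float]  # (antecedent, weighted support, confidence)


def directed_weighted_rules(transactions: List[Set[str]],
                            weights: Dict[str, float],
                            target: str,
                            min_wsup: float,
                            min_conf: float,
                            max_len: int = 2) -> List[Rule]:
    """Enumerate rules 'antecedent -> target' that pass both thresholds."""
    n = len(transactions)
    items = sorted({i for t in transactions for i in t if i != target})
    rules: List[Rule] = []
    for k in range(1, max_len + 1):
        for antecedent in combinations(items, k):
            a = set(antecedent)
            n_a = sum(1 for t in transactions if a <= t)
            if n_a == 0:
                continue
            n_at = sum(1 for t in transactions if a <= t and target in t)
            # Assumed weighting: mean item weight scales the plain support.
            mean_w = sum(weights.get(i, 1.0) for i in a) / len(a)
            wsup = mean_w * n_at / n
            conf = n_at / n_a
            if wsup >= min_wsup and conf >= min_conf:
                rules.append((antecedent, wsup, conf))
    # Strongest rules (highest weighted support) first.
    return sorted(rules, key=lambda r: -r[1])


# Hypothetical toy records and weights, not data from the patent.
transactions = [
    {"smoking", "red_meat", "adenoma"},
    {"smoking", "adenoma"},
    {"red_meat"},
    {"smoking", "low_fiber", "adenoma"},
    {"low_fiber"},
    {"red_meat", "low_fiber"},
]
weights = {"smoking": 1.0, "red_meat": 0.8, "low_fiber": 0.6}
rules = directed_weighted_rules(transactions, weights, "adenoma",
                                min_wsup=0.2, min_conf=0.7)
```

Under this reading, item weights let a factor that experts consider important clear the support threshold even when it appears in few records, which is one way a weighted model can retain low-probability risk factors that plain support would prune.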

Description

Technical field

[0001] The invention relates to medical data analysis, in particular to a risk factor screening method for sporadic colorectal adenoma based on a directional weighted association rule model.

Background technique

[0002] Sporadic colorectal adenomas (CRAs) are benign glandular tumors of the colon and rectum that are precursors to colorectal cancer. Early detection and timely treatment can effectively reduce the probability of canceration, which is of great significance for prolonging patient survival. Investigations have found that CRA is closely related to lifestyle and dietary habits, and that 66% to 78% of colorectal adenomas can be avoided through healthy living habits. However, some important risk factors are still overlooked or have not yet been discovered, so patients cannot be guided effectively toward a healthier lifestyle.

[0003] In recent years, more and more researchers have realized the importance of lifestyle and dietary ha...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G16H50/70
CPC: G16H50/70
Inventor: 余盖青, 高俊波, 程陈, 费若岚, 王长静
Owner: SHANGHAI MARITIME UNIVERSITY