Unlock instant, AI-driven research and patent intelligence for your innovation.

Endometrial tumor classification marking method based on random forest

A technology of endometrium and random forest, applied in the field of data processing, can solve problems such as inability to handle continuous, discrete and mixed large data sets, algorithm accuracy is not very ideal, and accuracy is reduced

Pending Publication Date: 2020-10-30
WENZHOU UNIVERSITY
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of the classification models used in data prediction are KNN algorithm, neural network algorithm, Bayesian algorithm, etc., but the accuracy of these algorithms is not very ideal, and they cannot handle continuous, discrete, and mixed large data sets, especially when missing data is large. In the case of many, the accuracy will decrease rapidly with missing data
[0004] Therefore, there is an urgent need for an effective algorithm to analyze endometrial tumor data, which can handle continuous, discrete and mixed large data sets, and can overcome the problem of rapid decrease in accuracy when there is a lot of data shortage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Endometrial tumor classification marking method based on random forest
  • Endometrial tumor classification marking method based on random forest
  • Endometrial tumor classification marking method based on random forest

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0042] Such as figure 1 As shown, in the embodiment of the present invention, a random forest-based endometrial tumor classification and marking method proposed includes the following steps:

[0043] Step S1. Obtain endometrial malignant tumor data and endometrial benign tumor data to form sample data, and perform normalization processing on the obtained sample data, and further divide the normalized sample data into test sets and multiple training sets;

[0044] The specific process is as follows: firstly, the data of malignant endometrial tumors and benign endometrial tumors are collected. The above-mentioned data come from patients with tumors found in the ovarian endometrium during the operation.

[0045] Secondly, the endometrial malignant tumor data and en...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an endometrial tumor classification marking method based on a random forest, and the method comprises the steps of obtaining endometrial malignant and benign tumor data as sample data, carrying out the normalization processing, and dividing a test set and a plurality of training sets; performing decision tree training on each training set to obtain a corresponding CART decision tree model; evaluating all features of each CART decision tree model to obtain a corresponding feature set; selecting an optimal feature from each feature set through Gini index comparison to perform branching processing to obtain a decision tree and form a random forest model; optimizing the random forest model by adopting a particle swarm algorithm and importing the random forest model intoa test set to obtain a trained random forest model; acquiring endometrial tumor data to be detected, importing the endometrial tumor data into the trained random forest model, and distinguishing benign or malignant tumor data. By implementing the invention, continuous, discrete and mixed endometrial tumor data sets can be processed, and the problem that the accuracy is quickly reduced under the condition that more data is lacked can be solved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for classifying and marking endometrial tumors based on random forests. Background technique [0002] In the rapidly developing Internet era, the emergence and application of machine learning is profoundly changing the medical industry. Prior to this, the collection and analysis of medical data was a challenging and difficult task. But now, through the analysis and processing of data, machine learning can accurately and clearly realize the set plan and deliver the results. [0003] At present, there are relatively few related studies on endometrial tumor data at home and abroad. Most of the classification models used in data prediction are KNN algorithm, neural network algorithm, Bayesian algorithm, etc., but the accuracy of these algorithms is not very ideal, and they cannot handle continuous, discrete, and mixed large data sets, especially when missing data i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/214G06F18/24323
Inventor 唐震洲周铭琰李方靖林凤金楚许方怡易新凯王岩孔令剑
Owner WENZHOU UNIVERSITY