A binary classification method based on seepage analysis

A binary classification and percolation technology, applied in the intersection of machine learning and network science, it can solve problems such as slow scoring, inability to handle a large number of multi-class features or variables, and increased computational costs.

Active Publication Date: 2019-02-22
BEIHANG UNIV
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In 1958, the famous British statistician David Cox discussed the logistic regression model, which is mainly applied to classification problems with linear features. The model has limitations in adapting to data and scenarios. When the dimension of the feature space is large, The performance of logistic regression is poor, and it cannot handle a large number of multi-category features or variables well; in addition, in 1963, VladimirN. In constructing hyperplane or hyperplane set, it is used for classification, regression or other tasks, such as: anomaly detection, etc., but the SVM algorithm is difficult to implement for large-scale training samples, because SVM uses quadratic programming to solve support vectors, and solving two The second plan will involve the calculation of the m-order matrix (m is the number of samples). When the number of m is large, the storage and calculation of the matrix will consume a lot of machine memory and computing time; K-means clustering (k-means) initially Proposed by Stuart Lloyd in 1957, as a pulse code modulation technology, with the development of information technology, it is widely used in data clustering analysis. This algorithm needs to continuously adjust the sample classification and continuously calculate the adjusted new cluster. Class center, so when the amount of data is very large, the time overhead of the algorithm is very large; K-neighborhood algorithm (KNN) is mainly applied to data mining and classification. This algorithm is a lazy algorithm. When the amount of calculation for classification is large, it is necessary to scan all The calculation distance of training samples has a large memory overhead and slow scoring speed, resulting in increased calculation costs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A binary classification method based on seepage analysis
  • A binary classification method based on seepage analysis
  • A binary classification method based on seepage analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0106] The present invention will now be explained in detail in conjunction with the embodiments and the accompanying drawings.

[0107] A binary classification method based on seepage analysis in the embodiment of the present invention, see Figure 6 As shown, its specific implementation steps are as follows:

[0108] Step 1, the data that the embodiment of the present invention uses is provided by the statistics of XX center of XX hospital, and research object is children's blood index, is respectively WBC (white blood cell), RBC (red blood cell), HGB (hemoglobin), PLT (platelet), HCT (red blood cell Compaction) 5 indicators, that is, N=5. During the research process, 10,000 data were randomly selected from the data, and individual units were abstracted into nodes, each node represented a 5-dimensional vector, the initial number of nodes was N=10,000, and the number of nodes was sequentially numbered from 1 to 10,000, according to the European style The distance formula ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a binary classification method based on seepage analysis, which comprises the following steps: 1, constructing an index network based on a data vector; 2, seepage analysis of index network; 3, constructing a likelihood function and determining a key threshold; 4. evaluation and validation of the model. Through the above steps, the invention further provides technical supportand theoretical support for large-scale, high-dimensional and high-complexity system group classification and evaluation classification effect based on seepage analysis; in addition, the background knowledge is expressed by graph model and the network analysis is carried out by using the seepage theory based on phase change, which reduces the computational complexity, converges quickly and is suitable for large-scale computation and reduces the computational cost.

Description

technical field [0001] The present invention proposes a binary classification method based on seepage analysis, constructs an index network based on the degree of correlation between individual units, constructs a binary classifier through network seepage analysis, and uses a confusion matrix for model evaluation, which belongs to machine learning and network Interdisciplinary fields of science. Background technique [0002] Binary classification problems are widely used in fields such as medicine, industry, and social analysis. With the advent of the era of big data, data has become an indispensable part of people's lives, forming a way of life from data to data; the emergence of mobile Internet has largely enriched the way people generate data At the same time, with the emergence of modern theories and technologies such as artificial intelligence, machine learning and cloud computing, it provides a strong guarantee for the solution of classification problems, such as: log...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F17/18
CPCG06F17/18G06F18/29G06F18/214
Inventor 李大庆郑参
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products