Model detection method based on feature selection

A technology of model detection and feature selection, applied in character and pattern recognition, instruments, computer components, etc., can solve the problems of large amount of calculation of Internet data, difficulty in obtaining stable data, frequent data changes, etc., to improve generalization ability, The effect of reducing the scale and improving the correctness of classification

Inactive Publication Date: 2016-03-30
CTRIP COMP TECH SHANGHAI
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is to provide a model detection method based on feature selection in order to overcome the defects of large amount of Internet data calculation, frequent data changes and difficulty in obtaining a stable data model in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model detection method based on feature selection
  • Model detection method based on feature selection
  • Model detection method based on feature selection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The present invention is further illustrated below by means of examples, but the present invention is not limited to the scope of the examples.

[0026] Such as figure 1 Shown, the model detection method based on feature selection of the present invention comprises the following steps:

[0027] Step 101, randomly split the original data set, and put the split data into the original training set, verification set and test set;

[0028] Step 102, using the original training set to train a model;

[0029] Step 103, using the model to predict the original training set and verification set, and obtain the prediction error of the original training set and the prediction error of the verification set respectively;

[0030] Step 104, delete the jth feature in the original training set to obtain a new training set, use the model to predict the new training set, and obtain the prediction error of the new training set, wherein the initial value of j is 1;

[0031] Step 105, ass...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a model detection method based on feature selection, and the method comprises the steps: cutting an original data set; training a model through employing a training set; predicting the training set and a verification set through employing the model, and obtaining a prediction error; deleting features, obtaining a new training set, predicting the new training set through the model, and obtaining a prediction error; assigning j to (j+1), returning to the last step, and executing the next step till the value is C; calculating prediction error distances between the new training set and the training set; ordering the distances, and searching features corresponding to G minimum distances; storing sequence numbers of the features into a feature deleting feature sequence, and deleting features in the training set and a verification set; assigning C to (C-G), returning to a second step, and executing the next step till C is not greater than G; obtaining a sequence number K according to the prediction error, and deleting former (K-1) features from the training set and a testing set; training the new model through employing the training set with the features being deleted, predicting the testing set with the features being deleted through employing the new model, and obtaining the prediction error.

Description

technical field [0001] The invention relates to technical fields such as data mining, machine learning, big data, cloud computing, and the Internet, and in particular to a model detection method based on feature selection. Background technique [0002] With the rapid development of the Internet, massive amounts of data have been accumulated, which has also brought many problems to data analysis and data mining: [0003] Large amount of data and high dimensionality: a data set consists of the number of instances P and the number of features N, and the combination of the two brings a huge amount of calculation to the algorithm; [0004] Frequent data changes: Rapid changes on the Internet also produce changing data, which requires regenerating the data model; [0005] Noisy data and missing data: Internet data lacks strict conventions, the data is uneven, some algorithms are sensitive to noisy data, and it is difficult to obtain a stable data model. Contents of the inventio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2411
Inventor 何鸣杨琪吴鹏越
Owner CTRIP COMP TECH SHANGHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products