Input data processing method and device for categorical data mining model
A technology for classifying data and inputting data, which is applied in the field of data processing and can solve problems such as low efficiency and low popularity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach 2
[0097] A chi-square test is performed on each variable, and variables that do not meet the chi-square test are eliminated;
[0098] Calculate the correlation coefficient between each variable and the target variable;
[0099] According to the magnitude of the correlation coefficient, the top N variables with the highest correlation coefficient are selected; N≥1.
[0100] In this embodiment, the amount of the degree of linear correlation between the research variables is generally represented by the letter r. Due to the different research objects, there are many ways to define the correlation coefficient, and the Pearson correlation coefficient is more commonly used.
[0101] For example: the calculation of the correlation coefficient can be through the following formula 3):
[0102]
[0103] Among them, X and Y are two different variables, Cov(X,Y) is the covariance of X and Y, Var[X] is the variance of X, and Var[Y] is the variance of Y.
[0104] In this embodiment, the...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com