Data processing method and device based on principal component analysis and storage medium

A principal component analysis and data processing technology, applied in the field of data processing, can solve the problems of many redundant features and low model prediction efficiency, and achieve the effects of improving prediction efficiency, fast budgeting speed, and reducing the amount of calculation

Pending Publication Date: 2020-07-31
MIGU CO LTD +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The inventors found that there are at least the following problems in the prior art: analyzing the characteristics of the sample data according to the above formula, many redundant features are obtained, resulting in low prediction efficiency of the model trained by using the sample data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device based on principal component analysis and storage medium
  • Data processing method and device based on principal component analysis and storage medium
  • Data processing method and device based on principal component analysis and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, various implementation modes of the present invention will be described in detail below in conjunction with the accompanying drawings. However, those of ordinary skill in the art can understand that in each implementation manner of the present invention, many technical details are proposed in order to enable readers to better understand the present invention. However, even without these technical details and various changes and modifications based on the following implementation modes, the technical solution claimed in the present invention can also be realized.

[0022] The first embodiment of the present invention relates to a data processing method based on principal component analysis, the specific process is as follows figure 1 shown, including:

[0023] S101: Perform dimensionality reduction processing on the initial sample data to obtain sampl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention relates to the field of software defect prediction, and discloses a data processing method and device based on principal component analysis, and a computer readable storage medium, and the method comprises the steps: carrying out the dimension reduction of initial sample data, and obtaining the sample data of a preset dimension; acquiring a plurality of features ofthe sample data, and calculating the relevancy between each feature and a preset category, the preset category being one of a plurality of categories of the sample data; and removing the features of which the relevancy is less than the preset relevancy in the plurality of features, and taking the remaining features as identification features of the sample data. According to the data processing method and device based on principal component analysis and the computer readable storage medium provided by the invention, redundant features in the sample data can be removed, and the sample data withhigh discrimination is obtained, so that the prediction efficiency is improved.

Description

technical field [0001] The embodiments of the present invention relate to the field of data processing, and in particular to a principal component analysis-based data processing method, device, and computer-readable storage medium. Background technique [0002] Information entropy is a measure of the amount of information required to eliminate uncertainty, that is, the amount of information that an unknown event may contain. An event or a system, to be precise, is a random variable with certain uncertainties. The uncertainty of some random variables is very high. To eliminate this uncertainty, a lot of information needs to be introduced, and the measurement of this much information is expressed by "information entropy". The more information that needs to be introduced to eliminate uncertainty, the higher the information entropy, and vice versa. If a situation has high certainty, little information needs to be introduced, so the information entropy is very low. According t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62
CPCG06V40/172G06F18/2135G06F18/214
Inventor 奚晓钰
Owner MIGU CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products