Mutual information based real-time property extracting method

An extraction method and real-time feature technology, applied in the fields of instruments, character and pattern recognition, computer parts, etc., can solve the problems of sparse value density, high analysis difficulty, and relatively high analysis accuracy requirements, and achieve rapid dimensionality reduction and improvement. The effect of precision

Inactive Publication Date: 2018-09-14
LIAONING UNIVERSITY
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It uses various variable parameters collected in the industrial production process to strictly monitor the production process, and through the analysis of production data, it improves the production process, optimizes the production process, and reduces energy consumption. However, due to its large volume and multi-source characteristics such as strong nonlinearity, and sparse value density make its analysis very difficult, and the requirements for analysis accuracy are relatively high. At the same time, the high-speed operation of the production line also requires higher real-time data processing
Therefore, in order to mine the information hidden in industrial big data better and faster, and accurately predict and analyze it, it is necessary to remove redundant attributes in the data, reduce the workload in the subsequent mining process, and improve the efficiency of data mining. In order to improve efficiency and performance, feature extraction must be performed on industrial big data. Traditional feature extraction algorithms have very strict requirements on data, and have certain requirements on the distribution and internal structure of data samples, and there are certain requirements on processing speed. Therefore, this paper proposes a real-time feature extraction algorithm based on mutual information (Feature Extraction Algorithm Based OnMutual Information, MIFE); this algorithm uses mutual information as the correlation coefficient for principal component analysis, and uses sliding windows to dynamically update data; then the current The combination of window data and new window data completes the feature extraction of the overall data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mutual information based real-time property extracting method
  • Mutual information based real-time property extracting method
  • Mutual information based real-time property extracting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0093] A feature selection method for high-dimensional incomplete data. First, it is judged whether the current window is the first window; if it is the first window, a feature extraction strategy based on mutual information is adopted. If it is not the first window, an incremental data extraction strategy based on sliding windows needs to be adopted.

[0094] The feature extraction strategy based on mutual information specifically includes the following:

[0095] Step 1: Assume the feature space R m×n On the sample data set X, each piece of data X i It consists of n-dimensional feature vectors, that is, (x i1 ,x i2 ,...x in ) First, according to the probability distribution of the statistical features of the data sample, and then calculate the information entropy H(x j ), and then calculate the mutual information between each feature according to formula (2) to form a mutual information matrix

[0096]

[0097] where p(x i ) is the probability of occurrence of eac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A mutual information based real-time property extracting method comprises the following steps: 1) determining whether a current window is a first window; if so, entering the step 2); if not so, entering the step 3); 2) calculating the mutual information between two dimensions and forming a mutual information matrix; calculating the unit matrix of the mutual information matrix; performing propertydecomposition on the mutual information matrix; sequencing the property values and the property vectors; calculating the rate of contribution of each property vector; determining the component decision matrix of the front k property vectors with the accumulative rate of contribution of 85-95%; and mapping the data in the current window on the decision matrix; and 3) calculating the mutual information matrix in the current window; projecting to the unit matrix of the mutual information matrix in the previous window; performing property decomposition on the mutual information matrix; obtaining the component decision matrix of the front k property vectors with the accumulative rate of contribution of 85-95%; and projecting the data to the decision matrix to realize property extraction.

Description

technical field [0001] The invention relates to a real-time feature extraction method based on mutual information, which belongs to the technical fields of machine learning and data mining. Background technique [0002] In the past two years, the United States, Germany and China have successively proposed the "Advanced Manufacturing Partnership" report 2.0, "Industry 4.0 R&D White Paper", and "Made in China 2025", which opened the prelude to the transformation of the manufacturing industry to intelligent manufacturing. At the same time, the new concept of industrial big data has also appeared in front of people. Industrial big data is a new concept in the era of intelligent manufacturing. It generally refers to big data in the industrial field, including various data inside and outside the enterprise. With the successive implementation of intelligent transformation strategies, industrial big data has increasingly become the main driving force for the global manufacturing in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2135
Inventor 王妍李俊吴阳李玉诺
Owner LIAONING UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products