Unlock instant, AI-driven research and patent intelligence for your innovation.

A Constrained Bicluster Mining and Missing Value Prediction Method Based on Preserving Submatrix

A forecasting method and bi-clustering technology, applied in database models, instruments, calculations, etc., can solve problems such as not solving sparse matrices, not taking into account overlapping or coverage constraints, and not being able to expand well

Active Publication Date: 2021-05-14
SOUTH CHINA UNIV OF TECH
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods do not scale well when the data matrix becomes very large
Furthermore, no overlapping or coverage constraints are taken into account during pattern mining of order-preserving submatrixes, nor does it address the issue of sparse matrices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Constrained Bicluster Mining and Missing Value Prediction Method Based on Preserving Submatrix
  • A Constrained Bicluster Mining and Missing Value Prediction Method Based on Preserving Submatrix
  • A Constrained Bicluster Mining and Missing Value Prediction Method Based on Preserving Submatrix

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070]The present invention will be described in more detail below with reference to the embodiments and the drawings, but the embodiments of the present invention are not limited thereto.

[0071]Based on the conventional dual clustering method, the two restriction conditions of the coverage constraint and overlapping constraints are added, first defined a sparse predefined sequence matrix model, and then repeat the data matrix, constantly search for excavation. Compliance with the dual clustering mode of the predefined sparse sequence matrix model, and in the mining process, it is preferred to extend the rows and columns of less overlapping or overlapping, if they cannot be excavated. More eligible modes to override new rows and columns, which means that the row coverage has converges, and the scanning data matrix will stop, and finally calculate the missing value in each dual cluster mode according to the linear fitting method. Accordingly, the present invention is applicable to the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a constrained bi-clustering mining and missing value prediction method based on an order-preserving sub-matrix, comprising the following steps: S1, receiving a request and mining the bi-clustering mode of the data matrix; S2, calculating the overlapping degree of rows and columns and coverage; S3, judging whether the coverage of rows and columns converges, if not, proceed to step S4, otherwise proceed to step S9; S4, randomly disrupt the order of rows and columns in the data matrix, and sort the set of candidate columns; S5, judge the set of candidate columns Whether it is empty; S6. Judging whether there are qualified columns in the candidate column set; S7. Expanding the column set and row set of the double-clustering mode according to the newly selected qualified columns; S8. Judging whether the current double-clustering mode conforms to the sparse order Requirements of the sub-matrix model; S9, calculating the missing value of each bi-clustering pattern; S10, returning the final bi-clustering set and missing value. Therefore, the present invention effectively improves the quality of the biclustering model and the accuracy of missing value prediction.

Description

Technical field[0001]The present invention relates to the field of dual clustering mode excavation, and more particularly to a constrained dual clustering mining and lack value prediction method based on a sequestrative sub-matrix.Background technique[0002]The general clustering is based on all attributes of the data, and the data cluster is called conventional clusters. Traditional clusters can only look for global information and cannot find local information, while a large amount of interesting information is hidden in these local information. In order to better search local information in the data matrix, people present a double clustering concept. The so-called dual cluster, also called sub-matrix mode, is a set of row sets with local similarities in the data matrix, a sub-matrix of a set of column subset, which means that this group is set on this set of columns. Performance has some local similarity. The dual cluster excavation is to capture interesting local similarity in th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2458G06F16/28
CPCG06F16/2465G06F16/285
Inventor 钟佳琪李东方琼
Owner SOUTH CHINA UNIV OF TECH