Bidirectional clustering method for risk control system data complementation

A system data, bidirectional clustering technology, applied in the field of cluster analysis, can solve the problems of insufficient speed and efficiency of missing data completion, and achieve the effect of good noise robustness, improved accuracy, and excellent completion effect.

Pending Publication Date: 2021-03-12
京科互联科技(山东)有限公司
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to overcome the deficiencies in the prior art, and provide a two-way clustering method for wind control system data completion to solve the problems of insufficient speed and insufficient efficiency of missing data completion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bidirectional clustering method for risk control system data complementation
  • Bidirectional clustering method for risk control system data complementation
  • Bidirectional clustering method for risk control system data complementation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0032] Such as Figure 1-3 As shown, a two-way clustering method for wind control system data completion, including example clustering, attribute clustering, local matrix construction, local matrix filling, matrix filling five steps, in which:

[0033] The example clustering is to assign sample points to different clusters 1. The centroids of each cluster 1 are different. The centroids of each cluster 1 are obtained by updating formula 1. The similarity in the example clustering is calculated by distance calculation formula 1. Distance calculation formula 1 for Among them, D represents the number of attributes of the data object, and the formula of the subset c assigned to the cluster in the example cluster is

[0034] Attribute clustering is to cluster the data obtained by example clustering in the attribute dimension and assign them to different clusters 2. The centroids of each cluster 2 are different. The centroids of each cluster 2 are obtained by updating formula 2. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of clustering analysis, in particular to a bidirectional clustering method for risk control system data complementation, example clustering mainly takes intra-cluster high similarity and inter-cluster low similarity as targets, sample points are distributed to different clusters, attribute clustering performs attribute dimension clustering on centroidsobtained by example clustering, information of example dimensions and attribute dimensions is fully considered, clustering is combined to effectively capture potential rules between rows and columns,a local matrix is constructed accordingly, users in the local matrix have high correlation with projects, the local matrix is filled with a potential factor model, and the method has the advantage ofbeing good in noise robustness through bidirectional clustering and improving the accuracy of the processing result by capturing the features of multiple dimensions,.

Description

technical field [0001] The invention relates to the technical field of cluster analysis, in particular to a two-way clustering method for wind control system data completion. Background technique [0002] With the development of information technology and the Internet, more and more machine learning algorithms are applied to the traditional financial field. In the traditional financial field, how to carry out financial risk control through big data combined with machine learning has attracted much attention. Most traditional risk control models are based on labeled supervised learning tasks. However, as the amount of data continues to increase, most of the collected data is incomplete due to reasons such as storage errors, unreliable collection equipment, unstable network status, or malicious fraud by users. And these incomplete data may be redundant, noisy or missing, etc. Data loss is a common phenomenon in risk control systems, and the amount of lost data increases exp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/23213
Inventor 郑小禄诸葛天心刘羽中胡亮仵伟强尹昌
Owner 京科互联科技(山东)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products