Unlock instant, AI-driven research and patent intelligence for your innovation.

Financial missing data processing method based on clustering

A missing data and processing method technology, applied in the field of data processing, can solve the problems of missing data filling and missing information errors, etc., to achieve accurate filling and reduce errors

Pending Publication Date: 2022-05-06
JIANGNAN UNIV +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The above clustering-based methods either only consider the partial situation of the missing data, or start from the whole without considering the error caused by the missing information, so that the missing data cannot be accurately filled.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Financial missing data processing method based on clustering
  • Financial missing data processing method based on clustering
  • Financial missing data processing method based on clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0038] refer to figure 1 , as an embodiment of the present invention, provides a clustering-based financial missing data processing method, including:

[0039] S1: Acquire financial datasets.

[0040] It should be noted that the financial data set acquired in this embodiment is a data set provided by a certain company.

[0041] S2: Two-step processing on the financial dataset.

[0042] It should be noted that the two-step process includes:

[0043] One step is to not handle missing values ​​in the dataset, and one step is to split the dataset into missing and complete datasets.

[0044] S3: Perform a clustering operation on the data set obtained after the two-step processing, and integrate the clustered clusters.

[0045] It should be noted that missing values ​​in the dataset that are not handled include:

[0046] Carry out k-means clustering processing on financial missing data sets;

[0047] The k-means clustering process is as follows:

[0048] The number k of the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a clustering-based financial missing data processing method. The method comprises the steps of obtaining a financial data set; performing two-step processing on the financial data set; clustering operation is carried out on the data set obtained through two-step processing, and clusters obtained after clustering are integrated; missing data objects are divided into the most expected clusters through similarity measurement, and filling is carried out through information in the clusters. According to the financial missing data processing method based on clustering, overall and local combination is achieved, the distribution situation of original samples is reserved to a great extent, errors caused by missing data are reduced, and the missing data are filled more accurately.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for processing missing financial data based on clustering. Background technique [0002] For the method of processing missing financial data, Chinese patent CN201810215615.4 uses a sliding window to dynamically evaluate whether the data is missing, and then uses the internal time and space characteristics of the data to fill in the missing data; Chinese patent CN202110588570.7, by corresponding The spatial feature vector of the previous historical data is combined with the context vector related to the previous historical data to fill in the missing data; Chinese patent CN201810996476.3 compares the similarity between the matrix where the missing data is located and the adjacent matrix, and selects the non-missing items with high similarity The value is used as the interpolation value of missing data; Chinese patent CN112732685A uses KNN nearest neighbor filling ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/215G06K9/62
CPCG06F16/215G06F18/23213
Inventor 陈丽芳李晓婉谢振平刘渊崔乐乐宋设杨宝华
Owner JIANGNAN UNIV