Unlock instant, AI-driven research and patent intelligence for your innovation.

Abnormal data identification and correction method, system and device and readable storage medium

A technology of abnormal data and normal data, applied in the field of data processing, can solve problems such as decision-making mistakes, reduced work efficiency, and abnormal data cannot be effectively identified, and achieve the effect of improving the efficiency and accuracy of identification

Active Publication Date: 2021-04-13
平安消费金融有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In view of this, the purpose of the embodiments of the present invention is to provide a method, system, device, and readable storage medium for identifying and correcting abnormal data, so as to solve the problem that abnormal data cannot be effectively identified, resulting in wrong decision-making and reduced work efficiency. And other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Abnormal data identification and correction method, system and device and readable storage medium
  • Abnormal data identification and correction method, system and device and readable storage medium
  • Abnormal data identification and correction method, system and device and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] refer to figure 1 , shows a flow chart of the steps of the method for identifying and correcting abnormal data according to Embodiment 1 of the present invention. It can be understood that the flowchart in this method embodiment is not used to limit the sequence of execution steps. An exemplary description is given below taking the computer device 2 as the execution subject. details as follows.

[0045] Step S100, acquiring user data, where the user data includes transaction data.

[0046] Specifically, user data can be credit data, which is an important indicator for macroeconomic research and analysis, including transaction data of each business scenario in credit data, such as user ID, transaction amount, repayment amount, etc. User data can be obtained from various banks or third-party platforms.

[0047] Step S120, calculating the minimum number of clustering points of the transaction data based on the linear regression model, where the minimum number of cluste...

Embodiment 2

[0096] read on figure 2 , shows a schematic diagram of program modules of Embodiment 2 of the abnormal data identification and correction system of the present invention. In this embodiment, the abnormal data identification and correction system 20 may include or be divided into one or more program modules, and one or more program modules are stored in a storage medium and executed by one or more processors , so as to complete the present invention, and realize the identification and correction method of the above-mentioned abnormal data. The program module referred to in the embodiment of the present invention refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable than the program itself to describe the execution process of the abnormal data identification and correction system 20 in the storage medium. The following description will specifically introduce the functions of each program module of the prese...

Embodiment 3

[0136] refer to image 3 , is a schematic diagram of the hardware architecture of the computer device according to Embodiment 3 of the present invention. In this embodiment, the computer device 2 is a device capable of automatically performing numerical calculation and / or information processing according to preset or stored instructions. The computer device 2 may be a rack server, a blade server, a tower server or a cabinet server (including an independent server, or a server cluster composed of multiple servers) and the like. Such as image 3 As shown, the computer device 2 at least includes, but is not limited to, a memory 21 , a processor 22 , a network interface 23 , and an abnormal data identification and correction system 20 that can communicate with each other through a system bus. in:

[0137] In this embodiment, the memory 21 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an abnormal data identification and correction method comprising the steps of acquiring user data, wherein the user data comprises transaction data; calculating a minimum clustering point number of the transaction data based on a linear regression model, wherein the minimum clustering point number is the number of clustering centers of a clustering algorithm; calculating sub-data of the transaction data through the clustering algorithm and the minimum clustering point number, and taking the sub-data of which the distance is greater than a preset weight as abnormal data; and identifying attribute data of the abnormal data in the user data, and modifying the abnormal data in the attribute data through a correction algorithm to obtain normal data. The invention further discloses an abnormal data identification and correction system and device and a readable storage medium. The method has the advantages that the K value needed by the clustering algorithm is calculated through linear regression, the defect that K value selection of the K-means clustering algorithm is often difficult to estimate is overcome, and the recognition efficiency and accuracy of abnormal data are improved.

Description

technical field [0001] The embodiments of the present invention relate to the field of data processing, and in particular to a method, system, device, and readable storage medium for identifying and correcting abnormal data. Background technique [0002] With the continuous development of informatization and the wide application of the Internet, a large amount of data is generated, and it is necessary to effectively manage the large amount of measurement data generated, and there are many abnormal data in a large amount of historical data and newly generated data. The inventor found that there are too many fields in the credit reporting business, and the error information in each field cannot be modified in real time. It is necessary to check, intercept and repair the data of thousands of dimensions. The abnormal data in the current multi-dimensional data is difficult to identify. Contents of the invention [0003] In view of this, the purpose of the embodiments of the pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/215G06F16/28G06K9/62
CPCG06F16/215G06F16/285G06F18/23G06F18/2433G06F18/10
Inventor 刘栩良
Owner 平安消费金融有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More