Multi-source risk control data cleaning processing method

A data cleaning and processing method applied in the fields of electrical digital data processing, natural language data processing, and special data processing applications. It addresses the problems of heterogeneous data types, a very large volume of data to analyze, and low data analysis efficiency.

Pending Publication Date: 2020-08-21
中建材信息技术股份有限公司 +1

AI Technical Summary

Problems solved by technology

[0002] Risks exist in all business activities. The sources of risks are different, and the strategies, data, and models needed to assess risks will vary widely. Therefore, the way to avoid risks is to analyze risks, master their laws, and realize risk control. Risk control requires the coll...

Examples

Embodiment 1

[0085] As shown in Figures 1-6, one purpose of this embodiment is to provide a method for cleaning and processing multi-source risk control data, with the following steps:

[0086] (1) Multi-source data collection stage:

[0087] S1.1. Historical data collection: the system imports historical risk control data using a full import;

[0088] S1.2. Real-time data collection: the system obtains incremental changes by analyzing database logs to achieve real-time data synchronization;

[0089] (2) Data integration stage:

[0090] S1.3. Pattern matching: pattern matching is performed on the data based on the similarity of attributes;

[0091] S1.4. Semantic conversion: the attribute data of the heterogeneous data sources is converted into standard data;

[0092] (3) Data cleaning stage:

[0093] S1.5. Invalid information filtering: erroneous data and duplicate data are identified and eliminated;

[0094] S1.6. Data encryption: encrypting the...
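The integration and cleaning steps above (S1.3 to S1.5) can be illustrated with a minimal Python sketch. It is not the patented implementation: the standard attribute names, the 0.6 similarity threshold, and the use of string similarity from `difflib` as the measure of "similarity of attributes" are all assumptions made for illustration, and the source does not define what counts as erroneous data (here, a missing value).

```python
from difflib import SequenceMatcher

# Hypothetical standard schema; the source does not list attribute names.
STANDARD_ATTRS = ["customer_id", "amount", "currency"]


def match_attribute(name, threshold=0.6):
    """S1.3 sketch: return the standard attribute most similar to `name`.

    Similarity is plain string similarity of attribute names (an assumption).
    Returns None when no standard attribute reaches the threshold.
    """
    best, best_score = None, 0.0
    for std in STANDARD_ATTRS:
        score = SequenceMatcher(None, name.lower(), std).ratio()
        if score > best_score:
            best, best_score = std, score
    return best if best_score >= threshold else None


def to_standard(record):
    """S1.4 sketch: rename a source record's attributes to the standard ones."""
    out = {}
    for key, value in record.items():
        std = match_attribute(key)
        if std is not None:  # attributes with no match are dropped
            out[std] = value
    return out


def clean(records):
    """S1.5 sketch: drop records with missing values and exact duplicates."""
    seen = set()
    result = []
    for rec in records:
        if any(rec.get(attr) is None for attr in STANDARD_ATTRS):
            continue  # treated as erroneous data (assumption)
        key = tuple(rec[attr] for attr in STANDARD_ATTRS)
        if key in seen:
            continue  # duplicate of an earlier record
        seen.add(key)
        result.append(rec)
    return result


# Two hypothetical sources describing the same transaction with different names.
source_a = {"CustomerID": "c1", "Amount": 100, "Currency": "CNY"}
source_b = {"cust_id": "c1", "amt": 100, "curr": "CNY"}
broken = {"cust_id": "c2", "amt": None, "curr": "CNY"}

cleaned = clean([to_standard(r) for r in (source_a, source_b, broken)])
```

In this sketch both sources map onto the same standard record, so the second copy is removed as a duplicate and the record with a missing amount is filtered out. A production system would likely compare value distributions or data types as well as names when matching attributes.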

Abstract

The invention relates to the technical field of risk control data processing, in particular to a multi-source risk control data cleaning processing method. The method comprises the following steps: collecting historical data, in which the system imports risk control historical data by full import; collecting real-time data, in which the system acquires incremental changes by analyzing database logs to achieve real-time data synchronization; pattern matching, in which the data is matched on the basis of the similarity of its attributes; semantic conversion, in which the attribute data of each heterogeneous data source is converted into standard data; filtering invalid information, in which erroneous data and duplicate data are identified and eliminated; data encryption, in which the original data is encrypted; and data compression, in which lossless compression is applied to the original data. The method collects data from multiple sources and unifies it, which facilitates data processing, reduces invalid data, and improves data processing efficiency.
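The lossless compression step named in the abstract can be sketched as follows. This is an illustration under assumptions, not the method of the patent: the source does not name a compression algorithm or a serialization format, so DEFLATE via Python's standard `zlib` module and JSON are used here as stand-ins, and the function names are hypothetical. Encryption is not shown because the Python standard library has no symmetric cipher.

```python
import json
import zlib


def compress_records(records):
    """Serialize records to JSON and compress them losslessly with zlib."""
    raw = json.dumps(records, ensure_ascii=False, sort_keys=True).encode("utf-8")
    return zlib.compress(raw, 9)  # 9 = highest compression level


def decompress_records(blob):
    """Invert compress_records and return the original records."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))


# Hypothetical, highly repetitive sample data.
records = [{"customer_id": "c1", "amount": 100, "currency": "CNY"}] * 50
blob = compress_records(records)
restored = decompress_records(blob)
```

Because the compression is lossless, `restored` equals `records` exactly, and for repetitive data such as this sample the compressed blob is much smaller than the serialized input.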

Description

Technical field

[0001] The present invention relates to the technical field of risk control data processing, in particular to a method for cleaning and processing multi-source risk control data.

Background

[0002] Risks exist in all business activities. The sources of risk differ, and the strategies, data, and models required to evaluate risks vary widely. The way to avoid risks is therefore to analyze them, understand their patterns, and implement risk control. Risk control requires the collection of multi-source data. Because the data sources differ, the data types differ, and unified analysis cannot be performed. The data also contains a large amount of useless data, so the volume of data to analyze is huge and data analysis efficiency is low.

Summary of the invention

[0003] The purpose of the present invention is to provide a method for cleaning and processing multi-source risk control data, so as to solve the problems r...

Application Information

IPC(8): G06F16/18, G06F16/16, G06F16/215, G06F16/22, G06F16/242, G06F21/60, G06F40/151
CPC: G06F16/1815, G06F16/162, G06F16/16, G06F40/151, G06F21/602, G06F16/2433, G06F16/215, G06F16/2246
Inventor: 刘庆, 王伟
Owner: 中建材信息技术股份有限公司