Data cleaning method, device and equipment, and storage medium

A data cleaning technology, applied in the field of data processing, that addresses the following problems: code standardization and consistency are hard to ensure, optimization experience is difficult to promote on a large scale, and discarding irregular data hampers data tracking and error checking. It aims to achieve high data cleaning efficiency and more accurate, consistent results.

Pending Publication Date: 2019-08-20
PINGAN PUHUI ENTERPRISE MANAGEMENT CO LTD
3 Cites, 9 Cited by

AI Technical Summary

Problems solved by technology

However, writing cleaning code by hand has low output efficiency. The running efficiency of the code depends on the individual engineer's ability, and optimization experience is difficult to promote on a large scale. Irregular data is discarded directly, which is an irreversible operation that hampers both data tracking and error checking. At the same time, it is not easy to ensure the standardization and consistency of the code.




Embodiment Construction

[0048] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0049] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

[0050] Figure 1 is an overall flowchart of a ...



Abstract

The invention relates to the technical field of data processing, in particular to a data cleaning method, device and equipment, and a storage medium. The data cleaning method comprises the steps of: obtaining to-be-cleaned data, converting the to-be-cleaned data into decimal data, and generating, from the decimal to-be-cleaned data, a to-be-cleaned data table corresponding to a data source terminal; calling a preset data cleaning rule file, extracting the data cleaning rules corresponding to the table name of the to-be-cleaned data table, and generating a corresponding data cleaning execution code for each data cleaning rule; labeling each piece of to-be-cleaned data in the to-be-cleaned data table, and matching a corresponding data cleaning rule to it; analyzing the label of each piece of to-be-cleaned data, executing the data cleaning execution code, and cleaning the to-be-cleaned data to obtain to-be-stored data; and storing the to-be-stored data in the form of binary data. According to the technical scheme, the accuracy and efficiency of data cleaning are improved, and storage space is saved.
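The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the rule file layout (`RULE_FILE`), the rule names, the field names, and the helper functions are all hypothetical, incoming records are assumed to be JSON-encoded bytes, "decimal data" is modeled simply as decoded Python values, and "binary data" as UTF-8 encoded JSON. Each row is labeled with the rules whose fields it contains, the generated checks for those labels are executed, and failing rows are returned alongside the stored bytes rather than silently dropped.

```python
import json
import operator

# Hypothetical rule file, keyed by table name (layout is an assumption).
RULE_FILE = {
    "orders": [
        {"name": "pay_not_before_create", "left": "pay_time",
         "op": ">=", "right_field": "create_time"},
        {"name": "amount_non_negative", "left": "amount",
         "op": ">=", "right_value": 0},
    ]
}

OPS = {">=": operator.ge, "<=": operator.le, "==": operator.eq}


def required_fields(rule):
    """Fields a row must contain for the rule to apply to it."""
    fields = {rule["left"]}
    if "right_field" in rule:
        fields.add(rule["right_field"])
    return fields


def build_check(rule):
    """Generate the executable check for one declarative rule."""
    compare = OPS[rule["op"]]
    left = rule["left"]
    if "right_field" in rule:
        right_field = rule["right_field"]
        return lambda row: compare(row[left], row[right_field])
    constant = rule["right_value"]
    return lambda row: compare(row[left], constant)


def clean_table(table_name, raw_records, rule_file):
    """Decode, label, check and re-encode the records of one table."""
    rules = rule_file.get(table_name, [])
    checks = {rule["name"]: build_check(rule) for rule in rules}
    needs = {rule["name"]: required_fields(rule) for rule in rules}

    kept, rejected = [], []
    for raw in raw_records:
        row = json.loads(raw)  # bytes -> decoded values
        # Label the row with every rule whose fields it carries.
        labels = [name for name in checks if needs[name].issubset(row)]
        # Execute only the checks named by the labels.
        failed = [name for name in labels if not checks[name](row)]
        if failed:
            rejected.append((row, failed))
        else:
            kept.append(row)

    stored = json.dumps(kept).encode("utf-8")  # store as bytes
    return stored, rejected
```

Keeping the rules declarative and generating the checks from them means the cleaning logic for a table can change by editing the rule file alone, which is the standardization benefit the summary points to.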

Description

Technical field

[0001] The present application relates to the technical field of data processing, and in particular to a data cleaning method, device, equipment and storage medium.

Background

[0002] With the rapid development of computer technology and communication technology, people can obtain more and more digital information, but at the same time they need to invest more time in sorting and organizing it. For example, business systems often generate irregular data due to factors such as code defects, changes in business definitions, and network delays. An order whose payment time is earlier than its creation time is one such record: it does not conform to business logic. Before performing statistical analysis on the data, it is necessary to filter out these irregular records to ensure the accuracy of the statistics. Data cleaning is a process of reducing data errors and inconsistencies. The ...
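The order example from the background can be written as a single consistency check. The sketch below is illustrative only: the field names `created_at` and `paid_at`, the ISO 8601 timestamp format, and both function names are assumptions. Inconsistent orders are set aside in a separate list instead of being deleted, so they remain available for tracing and error checking.

```python
from datetime import datetime


def is_consistent(order):
    """True when the order was paid at or after its creation time."""
    created = datetime.fromisoformat(order["created_at"])
    paid = datetime.fromisoformat(order["paid_at"])
    return paid >= created


def partition(orders):
    """Split orders into consistent ones and ones set aside for review."""
    valid, quarantined = [], []
    for order in orders:
        if is_consistent(order):
            valid.append(order)
        else:
            quarantined.append(order)
    return valid, quarantined
```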


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/215, G06F16/25
CPC: G06F16/215, G06F16/258
Inventor: 宫雪
Owner: PINGAN PUHUI ENTERPRISE MANAGEMENT CO LTD