A data cleaning processing method and system

A data cleaning and processing method technology, applied in the field of data processing, can solve the problems of inability to realize the reuse of cleaning and processing rules and low degree of automation, and achieve good promotion and application value, improve operational efficiency, and reduce the effect of configuration work

Pending Publication Date: 2019-04-26
INSPUR QILU SOFTWARE IND
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method cannot realize the reuse of cleaning and processing rules. When the new data format needs to be cleaned, the cleaning and processing rules cannot be reused and need to be re-developed and configured. From raw data in various formats from multiple sources to standardized cleaning result data, The whole process is not highly automated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data cleaning processing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0035] like figure 1 As shown, in the data cleaning and processing method of the present invention, the method first establishes a relational map between the data model and the data cleaning and processing rules, and uses the relational map to complete the automatic processing from the original data to the standardized cleaning and processing result data, and completes the data cleaning. processing. With the relationship map between the data model and the cleaning and processing rules as the core, it can solve the whole-process automatic processing of the standardized cleaning and processing result data of raw data in various formats from multiple sources, and can improve the operating efficiency of the cleaning and processing rules to a greater extent. Improve the reusability of cleaning processing rules.

[0036] The data cleaning and processing method specifically includes the following steps:

[0037] S1: The definition of the entity physical model of multi-source data, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data cleaning processing method and system, and belongs to the technical field of data processing. According to the data cleaning and processing method, firstly, a relation graph between a data model and data cleaning and processing rules is established, full-automatic processing from original data to standardized cleaning and processing result data is completed through the relation graph, and data cleaning and processing are completed. According to the data cleaning processing method, the operation efficiency of the cleaning processing rule can be improved, the reusedegree of the cleaning processing rule is improved to a greater extent, and the popularization and application value is very high.

Description

technical field [0001] The invention relates to the technical field of data processing, and specifically provides a data cleaning and processing method and system. Background technique [0002] With the continuous progress of society, the social economy has developed rapidly, and science and technology have also achieved rapid development. The advancement of science and technology has brought about the development of various industries in the social field. The development of the industry will inevitably bring about the generation of data in each industry, and the rapid development of the industry will also bring more and more data. After the data is generated, the data needs to be processed in order to make better use of the data in the subsequent work process, and the data processing method becomes the top priority. [0003] Before the emergence of massive data processing methods, massive data processing was basically carried out in their respective source business proces...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/215
Inventor 王乐张辉
Owner INSPUR QILU SOFTWARE IND
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products