Dirty data identification method and device, data clean method and device, and controller

A technology for data cleaning and identification methods, which is applied in the field of data processing and can solve the problems of low efficiency of data cleaning methods

Active Publication Date: 2018-12-11
NIO ANHUI HLDG CO LTD
View PDF14 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The object of the present invention is to provide a method for identifying dirty data and a method for cleaning data. What is to be solved is to improve the technical problem of low efficiency of existing data cleaning methods, and improve the efficiency and accuracy of data cleaning.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dirty data identification method and device, data clean method and device, and controller
  • Dirty data identification method and device, data clean method and device, and controller
  • Dirty data identification method and device, data clean method and device, and controller

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to further explain the technical means and effects of the present invention to achieve the intended purpose of the invention, the specific implementation methods, features and methods of the dirty data identification method and data cleaning method proposed according to the present invention will be described below in conjunction with the accompanying drawings and preferred embodiments. Its effect is described in detail below.

[0053] see figure 1 , the present invention relates to a dirty data identification method, comprising the following steps:

[0054] Step S1, defining a domain rule base for data cleaning.

[0055] Taking Sanfang pile data as an example, according to business requirements, the fields that need to be cleaned in each tripartite pile data record (identified by a unique pile id) include: address information field, business (service) time field, etc. According to the characteristics of the tripartite pile data, for each field that needs dat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a dirty data identification method and device, a data cleaning method and device, and a controller. The dirty data identification method comprises the following steps: extracting a domain rule base, wherein the domain rule base comprises one or a plurality of first decision rules and one or a plurality of second decision rules; identifying dirty data in a data fieldaccording to the domain rule filtering the data field by the first decision rule, and judging the data field as dirty data if the data field matches any one of the first decision rules; if the first decision rule is not matched, filtering the data field using the second decision rule. The dirty data identification method of the invention improves data cleaning efficiency and cleaning accuracy.

Description

technical field [0001] The invention relates to a data processing method, in particular to a dirty data identification method and device, a data cleaning method and device, and a controller. Background technique [0002] Charging piles are an integral part of electric vehicle charging equipment. In the charging resource management and monitoring system, it is often necessary to access the data of the charging piles of the third-party platform (referred to as the three-party pile) to meet the needs of the business, and the quality of the three-party pile data has a direct impact on the upper-level business, so in the three-party pile Data cleaning is required before entering the charging resource management and monitoring system. Data cleaning can be described as using a series of logical operations to detect dirty data from a large amount of raw data and repair or discard it. If the data cleaning of the three party piles is done entirely by manual assistance, the efficienc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 许伟佳
Owner NIO ANHUI HLDG CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products