Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Power data standardization cleaning method and device under multi-source data access

A technology for power data and multi-source data, which is applied in the field of data processing and can solve the problems of data quality restricting data application and processing, and data difficulties.

Pending Publication Date: 2021-05-11
GUANGDONG POWER GRID CO LTD DONGGUAN POWER SUPPLY BUREAU
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, it is very difficult to manually process these huge and messy data, and data quality has become one of the bottlenecks restricting data application and processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Power data standardization cleaning method and device under multi-source data access
  • Power data standardization cleaning method and device under multi-source data access
  • Power data standardization cleaning method and device under multi-source data access

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described in detail below with reference to the accompanying drawings and in combination with embodiments.

[0034] Such as figure 1 As shown, a method for standardized cleaning of power data under multi-source data access, including:

[0035] S10. Preliminary clustering processing of electric power data, using the K-means algorithm to read the collected data, and classifying the collected data according to the attribute value characteristics of the data. The processing complexity of this work can be expressed as A(n), and The collected data is represented in the form of character strings, and the computational complexity of data clustering can be expressed as A(m·I), where m represents the number of data with different attributes, and I represents the number of data with the same attribute. The feasibility of clustering, setting the constraints in the preliminary clustering process is expressed as:

[0036]

[0037] In the formula, S ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a power data standardization cleaning method under multi-source data access, which comprises: S10, carrying out data preliminary clustering processing: reading acquired data by using a K-means algorithm, classifying the acquired data according to attribute value characteristics of the data, and storing the classified data in a database; and S20, enabling multi-source data cleaning to adopt the data subjected to clustering processing as a data source of data cleaning, setting the processed data to be in a database form, and completing the multi-source data cleaning work by adopting an existing data cleaning tool. The method has the beneficial effects that the collected data is classified according to the attribute value characteristics of the data, the data subjected to clustering processing is adopted as a data source for data cleaning, the processed data is set to be in a database form, the multi-source data cleaning work is completed by adopting an existing data cleaning tool, and the data cleaning efficiency is improved, the accuracy of the database data processing result is improved, and then the accuracy of data cleaning is improved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method and device for standardized cleaning of power data under multi-source data access. Background technique [0002] People can use more and more data resources, but massive data does not necessarily have real value. The value of data comes from its quality, and the quality of data mining directly affects the quality of decision-making. However, it is very difficult to manually process these huge and messy data, and data quality has become one of the bottlenecks restricting data application and processing. Correcting quality problems in data, avoiding decision-making mistakes, and reducing decision-making risks are important links in data processing. In previous studies, the data cleaning system was used to complete data cleaning. However, due to the increase in the amount of data, the emergence of multi-source data has an impact on the performance of the system, e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/215G06F16/28G06K9/62
CPCG06F16/215G06F16/285G06F18/22G06F18/23213G06F18/24
Inventor 周立德黎鸣陈凤超梅傲琪胡润锋钟志明邱泽坚何毅鹏黄达区饶欢张锐刘沛林徐睿烽鲁承波
Owner GUANGDONG POWER GRID CO LTD DONGGUAN POWER SUPPLY BUREAU
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products