A data cleaning method and a device for data cleaning
A data cleaning and data technology, applied in the field of data processing, can solve problems such as low execution efficiency, large amount of transmitted data, and high pressure of data processing, and achieve the effects of improving efficiency, saving time, and reducing the pressure of data cleaning
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] At present, after the data center server obtains the source data from the client, it cleans and integrates the source data to obtain the target data. Since there may be a large amount of invalid or data information that needs to be cleaned in the source data, this will result in a large amount of transmitted data. Especially when the data center server is connected to multiple clients, it will cause the problems of high execution pressure and low execution efficiency of the data center server. In order to solve this problem, the present invention provides a data cleaning method, which moves data cleaning integration from the server to the client, fully utilizes the computing resources of the client, reduces the network transmission volume, reduces the processing pressure of the server, and expands The overall throughput of the data cleaning and integration system is improved, and the overall operating efficiency of the system is improved. On the other hand, the data cl...
Embodiment 2
[0096] figure 1 The method of data cleaning shown is explained from the client side, see below figure 2 , explaining the method for data cleaning of the present invention from the server side, the method for data cleaning specifically includes the following steps:
[0097] Step 20: The server sends the first cleaning strategy to the client, and receives the first cleaning data and / or the summary of the first cleaning data after the client cleans the source data according to the first cleaning strategy.
[0098] In this embodiment, the server sends the first cleaning strategy to the client, wherein the first cleaning strategy includes a preset cleaning rate threshold to control the data cleaning time. Wherein, the preset cleaning rate threshold is set by the server according to actual conditions, and is not specifically limited here.
[0099] The client cleans the source data according to the first cleaning strategy to obtain the first cleaning data and / or the summary of the...
Embodiment 3
[0112] see image 3 , image 3 It is a schematic structural diagram of a device for data cleaning provided by an embodiment of the present invention. The device for data cleaning in this embodiment includes one or more processors 31 and memory 32 . in, image 3 A processor 31 is taken as an example.
[0113] Processor 31 and memory 32 can be connected by bus or other means, image 3 Take connection via bus as an example.
[0114]The memory 32, as a non-volatile computer-readable storage medium based on data cleaning, can be used to store non-volatile software programs, non-volatile computer-executable programs and modules, such as Embodiment 1 and / or Embodiment 2 The method of data cleaning in and the corresponding program instructions. The processor 31 executes various functional applications and data processing of the data cleaning method by running the non-volatile software programs, instructions and modules stored in the memory 32, so as to realize the data cleaning ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap