Data cleaning method and system
A data cleaning technology, applied in the computer field, that addresses the problems of data that machines cannot recognize or correct, high cost, and low efficiency.
Examples
Embodiment 1
[0021] With reference to Figure 1, the data cleaning method provided in this embodiment includes:
[0022] Step S1: Divide the data that needs to be manually cleaned into tasks;
[0023] Step S2: Publish the divided tasks to the crowdsourcing platform;
[0024] Step S3: Receive the manual cleaning result data returned by the task recipients through the crowdsourcing platform, and integrate the manual cleaning result data with the machine cleaning result data.
[0025] The data cleaning method provided by this embodiment of the present invention divides the data that needs to be manually cleaned into tasks and sends the divided tasks, through the crowdsourcing platform, to a large, non-fixed pool of task recipients. Completing the data cleaning in this way can improve data cleaning efficiency and reduce costs.
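Steps S1 and S3 can be illustrated with a minimal Python sketch. It is not the implementation described in the patent: the function names `segment_tasks` and `integrate`, the fixed `task_size`, the `id` key, and the rule that a manual result overrides a machine result for the same record are all assumptions made for illustration.

```python
from typing import Dict, List

Record = Dict[str, str]


def segment_tasks(records: List[Record], task_size: int) -> List[List[Record]]:
    """Step S1 (sketch): split the records that need manual cleaning
    into tasks of at most `task_size` records each."""
    if task_size <= 0:
        raise ValueError("task_size must be positive")
    return [records[i:i + task_size] for i in range(0, len(records), task_size)]


def integrate(
    machine_cleaned: List[Record],
    manual_cleaned: List[Record],
    key: str = "id",
) -> List[Record]:
    """Step S3 (sketch): merge manual results into machine results.

    Assumption: when both sources contain the same key, the manual
    result replaces the machine result.
    """
    merged = {record[key]: record for record in machine_cleaned}
    for record in manual_cleaned:
        merged[record[key]] = record
    return list(merged.values())
```

Under these assumptions, five records with `task_size=2` yield three tasks of sizes 2, 2, and 1, each of which could be published separately in Step S2.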
[0026] Preferably, the data that needs to be manually cleaned includes abnormal data submitted by the machine during the data cleaning process. These abnormal data include data...
Embodiment 2
[0035] With reference to Figure 2, the data cleaning system provided by this embodiment of the present invention includes: task segmentation module 1, used to divide the data that needs to be manually cleaned into tasks; task release module 2, used to publish the divided tasks to the crowdsourcing platform; and data integration module 3, used to receive the manual cleaning result data returned by the task recipients through the crowdsourcing platform and to integrate the manual cleaning result data with the machine cleaning result data.
[0036] The data cleaning system provided by this embodiment of the present invention divides the data that needs to be manually cleaned into tasks and sends the divided tasks, through the crowdsourcing platform, to a large, non-fixed pool of task recipients. Completing the data cleaning in this way can improve data cleaning efficiency and reduce costs.
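One way the three modules might be wired together is sketched below in Python. Everything here is hypothetical: `InMemoryPlatform` is a stand-in that simulates task recipients with a callable and does not model any real crowdsourcing API, and the class names, `run` methods, and `id` key are illustrative choices rather than details from the patent.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]
Task = List[Record]


class InMemoryPlatform:
    """Hypothetical stand-in for a crowdsourcing platform (not a real API)."""

    def __init__(self, worker: Callable[[Record], Record]) -> None:
        self._worker = worker  # simulates a task recipient cleaning one record
        self._queue: List[Task] = []

    def publish(self, task: Task) -> None:
        self._queue.append(task)

    def collect(self) -> List[Record]:
        results = [self._worker(r) for task in self._queue for r in task]
        self._queue.clear()
        return results


class TaskSegmentationModule:
    """Module 1 (sketch): divides records into fixed-size tasks."""

    def __init__(self, task_size: int) -> None:
        self.task_size = task_size

    def run(self, records: List[Record]) -> List[Task]:
        step = self.task_size
        return [records[i:i + step] for i in range(0, len(records), step)]


class TaskReleaseModule:
    """Module 2 (sketch): publishes each task to the platform."""

    def __init__(self, platform: InMemoryPlatform) -> None:
        self.platform = platform

    def run(self, tasks: List[Task]) -> None:
        for task in tasks:
            self.platform.publish(task)


class DataIntegrationModule:
    """Module 3 (sketch): collects manual results and merges them
    with machine results; manual results win on a key conflict (assumption)."""

    def __init__(self, platform: InMemoryPlatform, key: str = "id") -> None:
        self.platform = platform
        self.key = key

    def run(self, machine_cleaned: List[Record]) -> List[Record]:
        merged = {r[self.key]: r for r in machine_cleaned}
        for r in self.platform.collect():
            merged[r[self.key]] = r
        return list(merged.values())
```

Keeping the platform behind a small `publish`/`collect` surface means the stub could later be swapped for a client of an actual platform without changing the three modules.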
[0037] Preferably, the data that needs to be manually cleaned includes abnormal data submitted by the...