
Rapid data aggregation method for big data cleaning

A data aggregation technique applied to big data cleaning. It addresses uncorrelated or unsatisfactory aggregation results and the inability of existing methods to meet the needs of big data cleaning, with the aim of improving accuracy and reducing calculation time.

Active Publication Date: 2019-09-03
JILIN UNIV
Cites: 8; Cited by: 1

AI Technical Summary

Problems solved by technology

However, when processing big data, existing data aggregation methods cannot produce results within a practical time, owing to their algorithmic time complexity and accuracy, and so cannot meet the needs of big data cleaning.
[0003] The problem can be summarized as two questions. First, how to improve the accuracy of aggregation: dividing the data according to traditional index marks is convenient and regular, but it makes the aggregation of text unsatisfactory, with grouped items showing no correlation.



Examples


Embodiment Construction

[0037] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the present invention. The components of the embodiments generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making...


Abstract

The invention discloses a rapid data aggregation method for big data cleaning. The method comprises the following steps: data reading, in which the original data is stored in Excel, the data in the Excel file is read as a file stream, the read data is stored in a record list according to its format, and the record list is returned; segmenting the big data text; performing text similarity comparison; and displaying and modifying the aggregation result, in which the form to be displayed is printed out and provided to the user to modify and delete, and the form is downloaded after the modification is complete.
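The steps in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say which segmenter or similarity measure is used, so this example assumes regex word splitting and Jaccard similarity on token sets, reads CSV text from a stream in place of an Excel file, and groups records greedily against a fixed threshold. The function names (`read_records`, `segment`, `jaccard`, `aggregate`), the `threshold` parameter, and the sample data are all hypothetical. Chinese text would need a real word segmenter, since `\w+` treats an unbroken run of characters as a single token.

```python
import csv
import io
import re


def read_records(stream):
    """Data reading: load rows from a file stream into a record list."""
    return list(csv.DictReader(stream))


def segment(text):
    """Text segmentation (assumed): lowercase word tokens as a set."""
    return set(re.findall(r"\w+", text.lower()))


def jaccard(a, b):
    """Similarity comparison (assumed measure): Jaccard index of two sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def aggregate(records, field, threshold=0.5):
    """Greedily add each record to the first group whose first record
    is similar enough on `field`; otherwise start a new group."""
    groups = []
    for rec in records:
        tokens = segment(rec[field])
        for group in groups:
            if jaccard(tokens, segment(group[0][field])) >= threshold:
                group.append(rec)
                break
        else:
            groups.append([rec])
    return groups


# Hypothetical sample data standing in for the Excel source.
SAMPLE = """id,name
1,Jilin University School of Computer Science
2,School of Computer Science Jilin University
3,Changchun Institute of Optics
"""

records = read_records(io.StringIO(SAMPLE))
groups = aggregate(records, "name")

# Display the aggregation result for review.
for group in groups:
    print([rec["id"] for rec in group])
```

With this sample, records 1 and 2 contain the same words in a different order and fall into one group, while record 3 forms its own. The greedy pass compares each record only against the first member of each group, which keeps the sketch short but makes the result depend on input order.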

Description

Technical field

[0001] The invention relates to the technical field of big data cleaning, in particular to a fast data aggregation method for big data cleaning.

Background technique

[0002] In the era of big data, data is one of the most valuable assets of an enterprise. The data quality of an enterprise is directly related to its business performance: every business decision, customer management activity, and business investment of an enterprise is based on data analysis. Data cleaning can greatly improve the quality of enterprise data, help enterprises make more reasonable decisions, and further reduce costs and improve income and competitiveness. Data cleaning is the final procedure for finding and correcting identifiable errors in data files, including checking data consistency and handling invalid and missing values. However, data mining based on big data faces many problems, such as data availability. Research on big data cleaning technology is ...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F17/24, G06F17/27, G06F16/35
CPC: G06F16/35, G06F40/18, G06F40/289, Y02D10/00
Inventor: 周柚, 王康平, 时小虎, 吴春国, 耿昭阳, 王依章
Owner: JILIN UNIV