Method for cleaning junk data

A junk data cleaning method, applied in the computer field, that addresses the low efficiency of checking databases for junk data, the failure to identify such data promptly and completely, and the cumbersome manual work involved.

Inactive Publication Date: 2013-02-06
浙江图讯科技股份有限公司

AI Technical Summary

Problems solved by technology

[0004] However, manual deletion requires the operator to understand the database well and to be able to distinguish junk data from normal data. The operator has to check and proofread the database from time to time, which is cumbersome and inefficient. The aging mechanism periodically ages out and deletes unnecessary or low-frequency data, but it cannot promptly and completely identify junk data caused by input errors, malicious entries, and repeated storage, so checking the database for junk data remains inefficient.



Examples

Embodiment 1

[0025] Embodiment 1: As shown in Figure 1, data processing is carried out in a data processing center. The data processing center is connected to the database, stores the data extracted from the database, and performs the cleaning of junk data.

[0026] Table 1 and Table 2 show a small sample of the data in the personnel information table. Each column in the table holds one type of data, and each type is labelled with a field ID. The field IDs are personnel identification (FID_), person name (FUSERNAME_FID_), ID number (IDCARD_FID_), gender (SEX_FID_), date of birth (BIRTHDATE_FID_), and registered residence (HOME_FID_). A data source database is set up using these field IDs:

[0027]

FID_  FUSERNAME_FID_  IDCARD_FID_         SEX_FID_  BIRTHDATE_FID_  HOME_FID_
A     Zhang San       310107193002120111  male      1930-02-12      Shanghai
B     Li Si           110224189005210324  female    1890-05-21      Beijing
C     Wang Wu         510202197010013478  male      1970-10-01      Chongqing
D     Wang Wu         51020219...
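
The sample rows above can be loaded into a small script to illustrate the two kinds of junk data the method targets: erroneous data and duplicate data. The following is only a sketch under stated assumptions, not the patent's implementation. The `Person` class, the two checks, and the `earliest_year` threshold are all hypothetical. Row D is truncated in the source, so it is left out; a clearly labelled made-up row `X` stands in to show a repeat. The consistency check relies on the fact that characters 7 to 14 of an 18-digit PRC ID number encode the date of birth.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Person:
    fid: str          # FID_
    name: str         # FUSERNAME_FID_
    id_card: str      # IDCARD_FID_
    sex: str          # SEX_FID_
    birthdate: date   # BIRTHDATE_FID_
    home: str         # HOME_FID_


ROWS = [
    Person("A", "Zhang San", "310107193002120111", "male", date(1930, 2, 12), "Shanghai"),
    Person("B", "Li Si", "110224189005210324", "female", date(1890, 5, 21), "Beijing"),
    Person("C", "Wang Wu", "510202197010013478", "male", date(1970, 10, 1), "Chongqing"),
    # Hypothetical row, NOT from the source table: a repeat of row C.
    Person("X", "Wang Wu", "510202197010013478", "male", date(1970, 10, 1), "Chongqing"),
]


def find_errors(rows, earliest_year=1900):
    """Return {fid: [reasons]} for rows that fail either hypothetical check."""
    errors = {}
    for p in rows:
        reasons = []
        # Characters 7-14 of the ID number should equal the birthdate as YYYYMMDD.
        if p.id_card[6:14] != p.birthdate.isoformat().replace("-", ""):
            reasons.append("birthdate does not match ID number")
        # Assumed plausibility threshold; the source does not specify one.
        if p.birthdate.year < earliest_year:
            reasons.append("implausible birth year")
        if reasons:
            errors[p.fid] = reasons
    return errors


def find_duplicates(rows):
    """Group rows by ID number and return the FID groups that occur more than once."""
    groups = defaultdict(list)
    for p in rows:
        groups[p.id_card].append(p.fid)
    return [fids for fids in groups.values() if len(fids) > 1]


print(find_errors(ROWS))
print(find_duplicates(ROWS))
```

Grouping on the ID number alone is a design choice for this sketch; a real rule might compare several fields before treating two rows as the same person.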


Abstract

The invention relates to information maintenance technology in the computer field, and in particular to a method for cleaning junk data. The method comprises establishing a junk data cleaning rule system and detecting erroneous data and duplicate data in a data source, so as to filter the information in a database. The method has the following benefits: (1) the rules are extensible, so new rules can be added at any time; (2) rules and data form a many-to-many mapping, so one data item can correspond to multiple rules and one rule can correspond to multiple data items; (3) the rules are semantic: established rules are presented in an easily understood form and stored in a computer language; (4) the junk data handling scheme is flexibly configurable and supports multiple user-defined handling modes, including deleting, skipping, and manual processing; and (5) multiple databases are supported, including mainstream databases such as Oracle, MySQL, and DB2.
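
One possible way to picture the rule system described in the abstract is sketched below. It is an illustration under stated assumptions, not the patent's actual design: the `Rule`, `RuleSystem`, and `Action` names, the two example rules, and the sample record are all hypothetical. The sketch shows rules being added at any time, a many-to-many binding between rules and field IDs, a readable rule description, and a configurable handling mode per rule.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, Dict, List


class Action(Enum):
    """Handling modes named in the abstract."""
    DELETE = "delete"
    SKIP = "skip"
    MANUAL = "manual"


@dataclass
class Rule:
    name: str                     # readable description of the constraint
    check: Callable[[str], bool]  # returns True when the value violates the rule
    action: Action                # what to do with a violating value


@dataclass
class RuleSystem:
    rules: Dict[str, Rule] = field(default_factory=dict)
    bindings: Dict[str, List[str]] = field(default_factory=dict)  # field ID -> rule names

    def add_rule(self, rule, field_ids):
        """Register a rule and bind it to one or more field IDs."""
        self.rules[rule.name] = rule
        for fid in field_ids:
            self.bindings.setdefault(fid, []).append(rule.name)

    def evaluate(self, record):
        """Return (field ID, rule name, action) for every violated rule."""
        findings = []
        for fid, value in record.items():
            for rule_name in self.bindings.get(fid, []):
                rule = self.rules[rule_name]
                if rule.check(value):
                    findings.append((fid, rule.name, rule.action))
        return findings


system = RuleSystem()
# One rule bound to two fields.
system.add_rule(
    Rule("value must not be empty", lambda v: v.strip() == "", Action.MANUAL),
    ["FUSERNAME_FID_", "IDCARD_FID_"],
)
# A second rule on IDCARD_FID_, so that field now has two rules.
system.add_rule(
    Rule("ID number must have 18 characters", lambda v: len(v) != 18, Action.DELETE),
    ["IDCARD_FID_"],
)

record = {"FUSERNAME_FID_": "Wang Wu", "IDCARD_FID_": ""}
findings = system.evaluate(record)
for finding in findings:
    print(finding)
```

Keeping the bindings separate from the rules is what lets one rule serve several fields and one field carry several rules without duplicating either.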

Description

Technical field

[0001] The invention relates to information maintenance technology in the computer field, and in particular to a method for cleaning junk data.

Background

[0002] During the operation of a computer, a large amount of application data needs to be invoked and executed, and some application data is either stored or deleted after execution.

[0003] With the development of computer technology, there are more and more types of application information, and more and more data is stored in databases. However, because capacity is limited, a database cannot store unlimited data, and once the amount of data reaches a certain level it is likely to reduce the computer's working efficiency and affect the work process. Generally, data is deleted manually: the operator manually deletes unnecessary or erroneous data. Another method is to periodically age out and delete some unnecessary or low-frequency d...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 方绪群, 张峰生, 王斌
Owner: 浙江图讯科技股份有限公司