Data-table classification system and method based on association rules

A data table classification system and method, applicable to data mining, database models, multidimensional databases, and related fields, that addresses the high error rate of existing classification results and makes the classification operation more convenient.

Active Publication Date: 2017-11-17
UNIV OF ELECTRONICS SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

[0005] To address the above technical problems, the present invention provides a data table classification system and method based on association rules. It solves the technical problem that current classification methods rely only on the physical attributes of the database and ignore the data content stored in it, which leads to large errors in the classification results.


Examples


Specific Embodiment

[0074] Step 1: Define two data table categories, "Personally Identifiable Information" and "Financial Salary Information", and manually collect two sets of data tables whose content belongs to these categories.

[0075] Step 2: Use the data table information reading unit to read the field content of the two sets of data tables. Some of the data table fields are shown in Table 1, where tables T1-T30 belong to the "Personally Identifiable Information" category and the tables from T31 onward belong to the "Financial Salary Information" category.
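As a rough illustration of what reading the field content of a set of data tables might look like, the sketch below collects column names per table. It assumes the tables live in a SQLite database, which the source does not state; the function name `read_table_fields` and the example column names are hypothetical.

```python
import sqlite3

def read_table_fields(conn, table_names):
    """Return {table_name: [field names]} for the given tables (hypothetical helper)."""
    fields = {}
    for name in table_names:
        # PRAGMA table_info yields one row per column; index 1 holds the column name.
        # The table name is quoted inline because PRAGMA arguments cannot be bound.
        rows = conn.execute(f'PRAGMA table_info("{name}")').fetchall()
        fields[name] = [row[1] for row in rows]
    return fields

# Illustrative tables only; the real content of Table 1 is not reproduced here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE T1 (name TEXT, id_number TEXT, phone TEXT)")
conn.execute("CREATE TABLE T31 (employee TEXT, base_salary REAL, bonus REAL)")
table_fields = read_table_fields(conn, ["T1", "T31"])
```

Only field names are read in this sketch; a reading unit that also samples cell values would need an additional query per table.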

[0076] Table 1


[0079] Step 3: For the "Personally Identifiable Information" category, traverse data tables T1-T30. First, add all the fields in data table T1 to the category space of the "Personally Identifiable Information" class as the category elements of that space; in data table T2, the "name" field is a synonym for the category elemen...
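The traversal in Step 3 can be sketched as follows. Because the paragraph is truncated, the handling of synonyms here is an assumption: a field that is a synonym of an existing category element is mapped to that element instead of being added again. The function name, the synonym map, and the field names are all hypothetical.

```python
def build_category_space(tables, synonyms):
    """Build an ordered category space from a list of field-name lists.

    tables:   field lists in traversal order (T1 first).
    synonyms: assumed mapping from a field name to the category element it duplicates.
    """
    space = []
    for fields in tables:
        for field in fields:
            # Replace a synonymous field with its existing category element (assumption).
            element = synonyms.get(field, field)
            if element not in space:
                space.append(element)
    return space

# Illustrative data: T1 contributes all of its fields, T2 adds only what is new.
t1 = ["full_name", "id_number"]
t2 = ["name", "phone"]
category_space = build_category_space([t1, t2], {"name": "full_name"})
# category_space is ['full_name', 'id_number', 'phone']
```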


Abstract

The invention discloses a data table classification system and method based on association rules, and belongs to the technical field of data table classification. The method comprises the following steps: a plurality of training set data tables, which include data tables of each category, are manually collected, and the category space of each category is constructed from the training set data tables; the training set data tables are preprocessed according to the category space; association rule analysis is performed on the preprocessed training set data tables, and the resulting rules are screened to obtain the association rules of each category; the data tables to be identified are preprocessed and matched against the association rules of each category to obtain their category information. Because the data tables are classified by their content, the accuracy of data table classification is effectively increased.
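A minimal sketch of the mining, screening, and matching steps described in the abstract is given below. The abstract does not specify the rule form, the thresholds, or the matching strategy, so the following are assumptions: each preprocessed table is a set of category elements with a category label, rules take the form "element set implies category", screening uses minimum support and confidence, and a table to be identified receives the category of its highest-confidence matching rule. All names and data are hypothetical.

```python
from itertools import combinations

def mine_class_rules(tables, min_support=0.3, min_confidence=0.8, max_len=2):
    """tables: list of (set_of_elements, label). Returns [(antecedent, label, confidence)]."""
    n = len(tables)
    items = sorted({e for elements, _ in tables for e in elements})
    labels = sorted({label for _, label in tables})
    rules = []
    for size in range(1, max_len + 1):
        for combo in combinations(items, size):
            antecedent = frozenset(combo)
            # Labels of every training table that contains the whole antecedent.
            covered = [label for elements, label in tables if antecedent <= elements]
            if not covered:
                continue
            for label in labels:
                both = covered.count(label)
                support = both / n
                confidence = both / len(covered)
                # Screening step: keep only rules above both thresholds.
                if support >= min_support and confidence >= min_confidence:
                    rules.append((antecedent, label, confidence))
    return rules

def classify(elements, rules):
    """Return the label of the highest-confidence rule whose antecedent matches, else None."""
    matches = [(conf, label) for ante, label, conf in rules if ante <= elements]
    return max(matches)[1] if matches else None

PII = "Personally Identifiable Information"
PAY = "Financial Salary Information"
training = [
    ({"name", "id_number", "phone"}, PII),
    ({"name", "id_number", "address"}, PII),
    ({"name", "salary", "bonus"}, PAY),
    ({"name", "salary", "tax"}, PAY),
]
rules = mine_class_rules(training)
predicted = classify({"name", "id_number", "email"}, rules)
```

In this toy data the element "name" appears in both categories, so no rule built on it alone passes the confidence threshold, which illustrates why the screening step matters. The brute-force enumeration is only practical for small category spaces; an Apriori-style search would be the usual replacement.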

Description

Technical field

[0001] The invention relates to a data table classification system and method, in particular to a data table classification system and method based on association rules.

Background

[0002] In recent years, with the continuous advancement of social informatization, enterprise data has not only grown in volume but has also become diverse in category, frequently changing, and subject to complex environments. Most enterprise data, including high-value and sensitive data, is stored in different data warehouses on the internal network, which makes standardized data management difficult. For example, it is difficult for managers to fully control the distribution of data. However, the storage form, distribution status, type, and sensitivity of data in the internal network are extremely important for managers, because this information can help them discover potential risks, respond to the supervision of releva...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F2216/03, G06F16/2282, G06F16/2465, G06F16/283
Inventor: 张小松, 牛伟纳, 宋珺
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA