Unlock instant, AI-driven research and patent intelligence for your innovation.

Establishment method of text data classification and classification model for data sharing and exchange

A text data, data-oriented technology, applied in text database clustering/classification, unstructured text data retrieval, electronic digital data processing, etc., to achieve the effect of high degree of automation and high accuracy

Active Publication Date: 2022-04-22
NO 30 INST OF CHINA ELECTRONIC TECH GRP CORP
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to overcome the above-mentioned shortcomings of the prior art, the present invention provides a method for establishing a text data classification and classification model oriented to data sharing and exchange, aiming at the fine-grained protection requirements existing in the existing data sharing and exchange process, the main technical problems to be solved as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Establishment method of text data classification and classification model for data sharing and exchange
  • Establishment method of text data classification and classification model for data sharing and exchange
  • Establishment method of text data classification and classification model for data sharing and exchange

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] A method for establishing a text data classification and classification model for data sharing and exchange, such as figure 1 shown.

[0019] Include the following steps:

[0020] 1) Through the text vectorization technology, the text data is quantified and identified mainly by keywords to form a structured description of the data;

[0021] 1.1) input text set P={P 1 ,P 2 ,...,P t ,...,P M}, where P t is the tth text, and M is the total number of texts. Set the number of keywords for each text extraction to M key , damping factor d, sliding window width ω, iteration stop threshold σ, maximum number of iterations G max ;

[0022] 1.2) Segment each text in the text set P using a word segmentation algorithm and remove stop words;

[0023] 1.3) Utilize the TF-IDF algorithm to calculate the TF-IDF value of each word corresponding in the text set P;

[0024] 1.4) For text P t Perform vectorization, t=1, 2, ..., M;

[0025] 1.4.1) Based on the width ω, the sliding...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for establishing a text data classification and grading model oriented to data sharing and exchange. The hybrid classification model classifies the vectorized data; the third step is to classify the classified data into security levels; the fourth step is to establish a data classification classification model. Compared with the prior art, the positive effects of the present invention are: data classification is carried out with artificial intelligence technology combined with artificial subjective judgment, data classification is carried out from the perspective of security, and fine-grained protection is carried out for different categories or levels of data during the sharing and exchange process. Provide more appropriate and accurate security policies, improve data security protection and improve the efficiency of data protection, its processing process is highly automated, and the accuracy of classification results is strong.

Description

technical field [0001] The invention relates to a method for establishing a text data classification and classification model facing data sharing and exchange. Background technique [0002] Our country is currently implementing the information security level protection system, and the idea of ​​"sub-area, sub-key" protection proposed is an effective means to solve the current information security problems. In the current process of data sharing and exchange, data management is relatively chaotic. The same or similar protection measures are adopted for different data, and the protection granularity is relatively coarse, which brings great hidden dangers to the security of data sharing and exchange. Once some sensitive data falls into the hands of people with ulterior motives Hands will seriously affect the interests of enterprises and even national security. Therefore, fine-grained data protection is an important part of information security, and there are few domestic and f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/289G06F40/30G06N3/04
CPCG06F16/35G06F16/355G06N3/045
Inventor 颜亮姬少培董贵山刘栋
Owner NO 30 INST OF CHINA ELECTRONIC TECH GRP CORP