Archive processing method based on association rule mining

An archive processing method, applied in special data processing applications, text database querying, and unstructured text data retrieval. It addresses the problems that existing similarity comparison methods serve a single function, are insensitive to numerical data, and cannot compare texts in all respects.

Active Publication Date: 2021-10-19
中盾创新数字科技(北京)有限公司
Cites: 6 · Cited by: 5

AI Technical Summary

Problems solved by technology

[0004] However, Jaccard similarity is insensitive to how often a shared word occurs, while cosine similarity is sensitive to the direction of the vectors but not to the magnitude of the numerical data.
[0005] Existing text similarity comparison methods are therefore each limited to a single function: each can judge accurately in only one respect and cannot compare texts in all respects, which makes the final result inaccurate.
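
The two limitations described in [0004] can be illustrated with a small sketch. It is not taken from the patent: the whitespace tokenization, the toy documents, and the function names `jaccard` and `cosine` are assumptions made for illustration, and reading "digital data" as the magnitude of the term counts is one interpretation.

```python
import math
from collections import Counter


def jaccard(a_tokens, b_tokens):
    """Jaccard similarity over the sets of distinct tokens (counts are ignored)."""
    a, b = set(a_tokens), set(b_tokens)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cosine(a_tokens, b_tokens):
    """Cosine similarity over term-frequency vectors (only direction matters)."""
    ca, cb = Counter(a_tokens), Counter(b_tokens)
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy documents: same vocabulary, different word frequencies.
doc_a = "archive archive archive record".split()
doc_b = "archive record record record".split()
# Same proportions as doc_a, but every count is ten times larger.
doc_c = doc_a * 10

# Jaccard sees identical token sets and ignores the frequency difference.
print(jaccard(doc_a, doc_b))
# Cosine does register the frequency difference between doc_a and doc_b ...
print(cosine(doc_a, doc_b))
# ... but it cannot tell doc_a from doc_c, whose counts differ only in scale.
print(cosine(doc_a, doc_c))
```

In this sketch, Jaccard rates `doc_a` and `doc_b` as identical even though their word frequencies differ, and cosine rates `doc_a` and `doc_c` as (numerically) identical even though their raw counts differ by a factor of ten. Neither measure alone covers both aspects, which is the gap [0005] points to.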



Detailed Description of Embodiments

[0044] To make the purposes, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0045] To facilitate understanding of the embodiments of the present invention, they are further explained below through specific embodiments in conjunction with the accompanying drawings. These specific embodiments do not limit the embodiments of the present invention.

[0046] Figure 1 is a workin...


Abstract

The method comprises the following steps: extracting elements of multiple dimensions from the archives; applying a corresponding similarity measurement method to the archive to be tracked and a recorded archive, based on the cloud model sequence obtained after linear regression, and calculating the similarity between the dimension attributes of the cloud model archives; judging whether the recorded archive is a parent archive of the archive to be tracked according to the content relevance between the two; and, for archives whose parent-child relationship has been obtained, predicting future archives and verifying by clustering the validity of the relationship between the archive to be tracked and the recorded archive. With the archive processing method based on association rule mining provided by the invention, relationships among archives can be tracked and searched through multi-layer screening and filtering based on association rules, so that the relationship between archives can be determined. Through operations such as prediction and classification, the relationship determination method can also be verified for validity and used to mine potential relationships with future archives.

Description

Technical field

[0001] Embodiments of the present invention relate to the field of text data processing, and in particular to an archive processing method based on association rule mining.

Background

[0002] Text traceability is mainly used in fields such as academic integrity detection and search engine optimization. Its purpose is to determine whether the content of a text is copied or adapted from one or more other texts. Homologous texts are identified mainly by comparing text similarity. Meanwhile, with the widespread use of databases and the gradual rise of data sharing, data leakage is becoming an increasingly serious problem. Because data is often shared with multiple parties, it can be difficult to trace the source of a leak. A widely used method for tracing a leak back to its source would act as a deterrent to leakers and thereby mitigate the growing problem of data leakage....


Application Information

IPC(8): G06F16/33, G06F16/35, G06K9/62
CPC: G06F16/3344, G06F16/35, G06F18/23213
Inventor: 李帅
Owner: 中盾创新数字科技(北京)有限公司