Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

File classification method and device

A file classification and target file technology, applied in the computer field, can solve the problems of limited marking file types and relying on manual marking.

Pending Publication Date: 2020-10-30
北京天空卫士网络安全技术有限公司
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the embodiment of the present invention provides a document classification method and device, which can at least solve the problem in the prior art that the types of marked documents are limited and rely on manual marking

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File classification method and device
  • File classification method and device
  • File classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0046] The label in this solution refers to the category to which the file belongs. For example, if the content of the file is a financial-related document, the corresponding label is "finance". It is understandable that a file can belong to multiple categories, that is, have multiple labels, such as file It belongs to development documents and product requirements documen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a file classification method and device, and relates to the technical field of computers. One specific embodiment of the method comprises the steps of obtaining a file fingerprint of a target file in response to a tag query operation on the target file, and determining similar file fingerprints of which the similarity with the file fingerprint exceeds a preset similarity threshold in a local fingerprint library; obtaining meta-information corresponding to the similar file fingerprint, and determining a label according to a label identifier in the meta-information to obtain a first label set; transmitting the file fingerprint to a server for label query so as to receive a second label set returned by the server; and obtaining a union set of the first label set and the second label set to obtain a labeled set of the target file, and determining a category to which the target file belongs according to labels in the labeled set. According to the embodiment, the filefingerprint is only associated with the file content, and the limitation of only specific types of files is broken through; and the files are marked through the labels of the files associated with the files, so that the file classification correctness is improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a file classification method and device. Background technique [0002] In recent years, the computer security industry has gradually developed from early network security to data security. One direction of data security is data classification, which divides data into categories with different security levels, and adopts different security policies for different levels to manage data. On this basis, many data classification tools have been produced, such as non-user-driven machine learning (classification algorithm, clustering algorithm), user-driven file labeling / marking, etc. [0003] This solution mainly involves user-driven file tags / marks, and manages files according to the existing tags on the files. Currently, the methods for manipulating tags on files include adding, deleting, and updating tags. [0004] In the process of realizing the present invention, the inventor fi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/16G06F16/18G06F21/16
CPCG06F16/16G06F16/162G06F16/1815G06F21/16
Inventor 陈少涵胡立中李仕毅
Owner 北京天空卫士网络安全技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products