Computer file similarity identification system and method based on image analysis
This technology applies image analysis and similarity recognition in the computer field. It addresses the lack of methods for identifying similar files and the low efficiency of comparing image files, and it achieves high recognition accuracy and improved efficiency.
Examples
Embodiment 1
[0031] As shown in Figure 1 and Figure 3, a computer file similarity recognition system based on image analysis includes: a file attribute data extraction unit, configured to extract the basic attributes of two target files for comparison, the target files being a first target file and a second target file, and the basic attributes including at least: file name, file type, file size, file location, file creation time and file modification time; a file content extraction unit, configured to open the first target file and the second target file, extract the content of both files, and temporarily store the extracted file content; a file content conversion unit, configured to convert the extracted file content into corresponding image content; and a file similarity recognition unit, which includes a first similarity recognition unit, a second similarity recognition unit, and a result generation unit; the first similarity recognition unit is configured to deter...
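The attribute extraction and content conversion units described above can be illustrated with a minimal Python sketch. The function names `extract_basic_attributes` and `content_to_image`, the fixed row width of 64, and the byte-to-grayscale-pixel mapping are all assumptions made for illustration; the source does not state how file content is converted into image content.

```python
import math
import os
from datetime import datetime


def extract_basic_attributes(path):
    """Collect the basic attributes listed in the embodiment for one file.

    Hypothetical helper: the dictionary keys are illustrative names.
    """
    info = os.stat(path)
    name = os.path.basename(path)
    return {
        "file_name": name,
        "file_type": os.path.splitext(name)[1].lower(),
        "file_size": info.st_size,
        "file_location": os.path.dirname(os.path.abspath(path)),
        # st_ctime is the creation time on Windows but the time of the
        # last metadata change on most Unix systems.
        "creation_time": datetime.fromtimestamp(info.st_ctime),
        "modification_time": datetime.fromtimestamp(info.st_mtime),
    }


def content_to_image(data, width=64):
    """Map raw bytes to rows of grayscale pixel values (0-255).

    Assumed conversion: one byte becomes one pixel, and the last row is
    padded with zeros so that every row has the same width.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    height = max(1, math.ceil(len(data) / width))
    padded = data + bytes(height * width - len(data))
    return [list(padded[r * width:(r + 1) * width]) for r in range(height)]
```

Under these assumptions, the attribute dictionaries of the two target files would feed the first similarity recognition unit, and the two pixel grids would feed the second.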
Embodiment 2
[0034] On the basis of the previous embodiment, the first similarity recognition unit judges the similarity of the two files according to their basic attributes, and obtains the first judgment result by executing the following steps: the file name, file type, file size, file location, file creation time, and file modification time of the first target file and the second target file are matched and recognized; the matching recognition method is: for each character in the item to be matched, obtain the keyword to which the character belongs and the index position of the character in that keyword according to the keyword set; judge, from the index position of the character in the keyword to which it belongs, whether the character is the first character of the keyword; if the character is the first character of the keyword, record the keyword to which the character belongs in the matching information set, and mark in the record that the first character of the key...
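The character-by-character keyword lookup described above can be sketched roughly as follows. The source text is truncated after the first-character step, so everything past that point is an assumption: here a keyword counts as matched once each of its characters has been seen consecutively and in order. The names `build_char_index` and `find_keywords` are hypothetical.

```python
from collections import defaultdict


def build_char_index(keywords):
    """Map each character to the (keyword, index position) pairs it occupies."""
    index = defaultdict(list)
    for kw in keywords:
        for pos, ch in enumerate(kw):
            index[ch].append((kw, pos))
    return index


def find_keywords(text, keywords):
    """Return (keyword, start offset) pairs for keywords found in text.

    Assumed continuation of the truncated method: a record opened by a
    first character advances only while the next characters follow
    consecutively, and the keyword is reported at its last character.
    """
    index = build_char_index(keywords)
    pending = set()  # (keyword, next expected index position)
    found = []
    for offset, ch in enumerate(text):
        next_pending = set()
        for kw, pos in index.get(ch, ()):
            # pos == 0: the character is the first character of the keyword,
            # so a new record is opened. Otherwise the character must
            # continue a record opened earlier.
            if pos == 0 or (kw, pos) in pending:
                if pos + 1 == len(kw):
                    found.append((kw, offset - pos))
                else:
                    next_pending.add((kw, pos + 1))
        pending = next_pending
    return found


matches = find_keywords("report_final.docx", ["report", "final", "doc"])
print(matches)  # [('report', 0), ('final', 7), ('doc', 13)]
```

One possible use, again an assumption, is to run this over the same attribute of both target files (for example the two file names) and compare the resulting keyword sets.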
Embodiment 3
[0039] Referring to Figure 4 and Figure 5, on the basis of the previous embodiment, the second similarity recognition unit includes: a local probability model estimation subunit, configured to calculate the probability of each local area of the image content using a formula [formula not reproduced], in which i is the index of each local area, n is the number of local areas, σ(x_i) represents the probability of local area x_i, each local area x_i is a matrix whose transpose appears in the formula, w_i is the preset template matrix, b_i is the adjustment value corresponding to the matrix, with a value range of 5-10, and m is the probability adjustment value, with a value range of 0.2-0.6; a local area weight calculation subunit, configured to calculate the weight value of each local area according to the probability of that local area; an image segmentation subunit, configured to segment the image content of the second target file into unit domains; the unit...