Method and apparatus for ascertaining similar documents
A file-comparison technique, applied in fields such as electrical components, user identity/authority verification, and transmission systems. It addresses problems such as slow download speed, high server load, and files that cannot be downloaded, while avoiding cumbersome operations and improving efficiency.
Embodiment Construction
[0036] In this embodiment of the present invention, the content signatures corresponding to two files are compared. If the comparison finds the signatures consistent, the two files are determined to contain at least some data with the same content, and the two files are therefore determined to be similar. When one of the files needs to be downloaded, part of its data can then be obtained from the other file, so the download draws on more data sources and the efficiency of downloading files improves.
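The comparison described in this paragraph can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the claimed method: the `ContentSignatures` record and the functions `shared_blocks` and `are_similar` are hypothetical names, signatures are treated as opaque strings, and "consistent" is assumed to mean either equal file signatures or at least one equal block signature.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class ContentSignatures:
    """Hypothetical record holding the signatures of one file."""
    file_signature: str                # signature of the entire file
    block_signatures: tuple[str, ...]  # one signature per block, in order


def shared_blocks(wanted: ContentSignatures,
                  other: ContentSignatures) -> dict[int, int]:
    """Map each block index of `wanted` to a block index of `other`
    that carries the same signature (assumed to mean the same content)."""
    index_by_signature: dict[str, int] = {}
    for j, sig in enumerate(other.block_signatures):
        # Keep the first block of `other` that has this signature.
        index_by_signature.setdefault(sig, j)
    return {
        i: index_by_signature[sig]
        for i, sig in enumerate(wanted.block_signatures)
        if sig in index_by_signature
    }


def are_similar(a: ContentSignatures, b: ContentSignatures) -> bool:
    """Assumed similarity rule: identical file signatures, or at least
    one block signature in common."""
    return a.file_signature == b.file_signature or bool(shared_blocks(a, b))
```

Under these assumptions, the mapping returned by `shared_blocks` tells a downloader which blocks of the wanted file could be fetched from a source that holds the other file.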
[0037] The files in this embodiment of the present invention include text files, audio files, video files, and compressed files. The content signatures corresponding to a file comprise the content signature of the file itself and the content signatures of its block data. The file content signature corresponds to the data of the entire file, and each block content signature corresponds to its own block of data. The content signature is the information data obtained...
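A minimal sketch of producing the two kinds of signature named above, one for the whole file and one per block. The source text is truncated before it states how a content signature is obtained, so the use of SHA-256 here is an assumption, as are the fixed block size `BLOCK_SIZE` and the function name `content_signatures`.

```python
import hashlib

BLOCK_SIZE = 1024 * 1024  # hypothetical block size (1 MiB), not from the source


def content_signatures(data: bytes,
                       block_size: int = BLOCK_SIZE) -> tuple[str, list[str]]:
    """Return (file signature, list of block signatures) for `data`.

    Assumption: a signature is the SHA-256 digest of the bytes it covers.
    """
    # The file signature covers the data of the entire file.
    file_signature = hashlib.sha256(data).hexdigest()
    # Each block signature covers only its own fixed-size block.
    block_signatures = [
        hashlib.sha256(data[offset:offset + block_size]).hexdigest()
        for offset in range(0, len(data), block_size)
    ]
    return file_signature, block_signatures
```

With this layout, two files that differ overall can still share individual block signatures, which is what a block-level comparison would rely on.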