Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and apparatus for ascertaining similar documents

A file and algorithm technology, applied in electrical components, user identity/authority verification, transmission systems, etc., can solve problems such as slow download speed, high server pressure, and inability to download files, avoiding cumbersome operations and high efficiency.

Active Publication Date: 2012-03-07
SHENZHEN THUNDER NETWORK TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are some defects in this method. When the user downloads a certain file intensively, the pressure on the server is too great; if the specified file on the server is moved or deleted, or the server cannot be connected temporarily, the file cannot be downloaded; or, when the bandwidth of the server is insufficient or Downloads are very slow when the network is busy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for ascertaining similar documents
  • Method and apparatus for ascertaining similar documents
  • Method and apparatus for ascertaining similar documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In the embodiment of the present invention, by comparing the content signatures corresponding to the two files, if the comparison results are determined to be consistent, it is determined that the two files contain data with at least part of the same content, and the two files are determined to be similar. When one of the files needs to be downloaded, part of the data can be obtained from another file, realizing downloading from more data sources, and improving the efficiency of downloading files.

[0037] The files in the embodiment of the present invention include text files, audio files, video files, and compressed files. The content signature corresponding to the file includes the content signature of the file and the content signature of the block data. The file content signature corresponds to the data of the entire file, and the content signature of the block data corresponds to the corresponding block data. The content signature is the information data obtained...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for determining similar documents, which is used for obtaining similar documents and necessary data from the similar documents. The invention comprises following steps: getting relevant information on two documents and confirming that the content and data of the two documents are similar; getting the corresponding content signature of the data of the two documents in the same length respectively; comparing the content signature corresponding to one document and the content signature corresponding to the obtained other document, and confirming the content signature with consistent comparative result; determining the two documents are similar. The invention also discloses the method for applying the similar documents in the process of data download, and discloses the device for the methods.

Description

technical field [0001] The invention relates to the fields of computer and communication, in particular to a method and device for determining similar files. Background technique [0002] One of the main applications of the Internet is resource sharing, and users can obtain the information and data they need through the Internet. [0003] One of the prior art is single resource downloading. Early download software, such as the file download function built into the product Microsoft Internet Explorer, can only be downloaded from a single address. For example, the user clicks a Uniform Resource Locator (Uniform Resource Locator, URL) address http: / / down.XXX.net / file1 on the webpage to download the file1. Then the download software will only try to connect to the server down.XXX.net based on HyperText Transfer Protocol (HyperText Transfer Protocol, http) and obtain the data of file 1 on the server. When all the data of the file is obtained, the download is successful. There ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L29/06H04L9/32
Inventor 陈涛
Owner SHENZHEN THUNDER NETWORK TECH