Duplicate file detection method, terminal and server

A technology of duplicate files and detection method, applied in the field of computer network, can solve the problem of not being able to detect whether a user file is a duplicate file in time, etc.

Active Publication Date: 2019-08-06
BEIJING QIYI CENTURY SCI & TECH CO LTD
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the inventor found in the process of implementing the present invention that at least the following problems exist in the prior art: the prior art judges whether it is a duplicate file by calculating the hash value of the uploaded file, and cannot detect whether the file uploaded by the user in time for duplicate files

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Duplicate file detection method, terminal and server
  • Duplicate file detection method, terminal and server
  • Duplicate file detection method, terminal and server

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.

[0073] In the prior art, the process of judging whether the uploaded file is a duplicate file by calculating the hash value of the uploaded file cannot timely detect whether the file uploaded by the user is a duplicate file.

[0074] In order to solve the above problems, the present invention provides a method for detecting duplicate files, which can be applied to a terminal and a server respectively, the terminal and the server communicate through a network, and the terminal can be a browser or other terminals.

[0075] The terminal can acquire the files to be processed that the user needs to upload to the server, and send the files to be processed to the server. When the terminal sends the file to be processed to the server, the terminal can also obtain the size of the file to be processed, and detect...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a duplicate file detection method, a terminal and a server. The method comprises the following steps: when a to-be-processed file needing to be uploaded to a server by a user is sent to the server, the terminal obtains the size of the to-be-processed file, detects a target numerical value interval to which the size of the to-be-processed file belongs, calculates a hash value of the to-be-processed file according to a file hash value calculation mode corresponding to the target numerical interval, and sends information containing the hash value of the to-be-processed file is sent to a server, the server determines whether the to-be-processed file is a duplicate file or not according to the sending information and sends a response result to a terminal, and the response result comprises information that the to-be-processed file is the duplicate file or information that the to-be-processed file is not the duplicate file. Based on the processing, theserver can obtain the hash value of the to-be-processed file without waiting for the end of the transmission of all the to-be-processed files, so that the server can early determine whether the to-be-processed file is a duplicate file or not.

Description

technical field [0001] The invention relates to the technical field of computer networks, in particular to a method for detecting duplicate files, a terminal and a server. Background technique [0002] With the rapid development of computer network technology, users can not only easily watch their favorite videos online through video terminals, but also upload the videos they shoot or obtain through other channels to the video server, so as to share the videos they upload. for other users to watch. As the server receives more and more files such as videos uploaded by users, these files will inevitably be duplicated. In order to avoid storing duplicate files, the server needs to check the files uploaded by the user one by one to determine whether they are duplicate files. [0003] Therefore, in order to avoid storing duplicate files, the existing technology judges the uploaded file by calculating the hash (hash) value of the uploaded file after the file upload is completed,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/174G06F16/13G06F16/61G06F16/71
CPCG06F16/137G06F16/1748G06F16/61G06F16/71
Inventor 李春平杨鹏飞
Owner BEIJING QIYI CENTURY SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products