Processing method and system for carrying out reduplication removal on Internet resources

A technology of Internet resources and processing methods, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of inability to guarantee accuracy, massive information calibration, and a large amount of manpower

Active Publication Date: 2012-11-21
深圳宜搜天下科技股份有限公司
View PDF2 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem solved by the present invention is to provide a processing method and system for deduplication of Internet resources, so

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Processing method and system for carrying out reduplication removal on Internet resources
  • Processing method and system for carrying out reduplication removal on Internet resources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the technical problems, technical solutions and beneficial effects to be solved by the present invention clearer and clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0038] Such as figure 1 As shown, it is a flowchart of the first embodiment of the present invention, which provides a processing method for deduplication of Internet resources. In this embodiment, the Internet resources refer to the installation package of an android mobile phone. The method runs on a computer and uses the computer's High-speed computing functions and automatic functions are completed. This method also requires the support of the network and database programs, as well as the jdk that supports the java language. The method specifically...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a processing method for carrying out reduplication removal on Internet resources, which comprises the following steps of: downloading resources and description information of the resources from the Internet; placing the description information of the resources into a database and carrying out corresponding storage on resource packages, wherein only one piece of description information is stored for the resources with the same website source and the same resource name and resource version; extracting out the description information in the resource packages by an extraction program and updating the extracted description information of the resource packages into the database; scoring the informativeness of each resource by a scoring program and giving a corresponding score; combining the resources of which the names are the same with those of the resource packages into one group by a grouping program; and carrying out selection on the same resources according to scores of the resources by a prepotency program and providing the selected resources for a user. The invention also provides a processing system for carrying out reduplication removal on the Internet resources. Due to the adoption of the scheme, the duplication of the resources is lowered and the user is prevented from downloading wrong resources.

Description

technical field [0001] The invention relates to network search technology, in particular to a processing method and system for deduplication of Internet resources. Background technique [0002] At present, there are 300 million Android devices in the world. In mid-December 2010, the average number of activated devices per day was only 700,000. By 2011, the Android operating system had increased by 250%, with an average of 850,000 new devices activated every day. The number of activated devices per week is 3.7 million. At the same time, the average monthly download of Android applications is as high as 1 billion times, and the number of applications in the Android market has exceeded 450,000. Android has become a rapidly growing ecosystem. [0003] With the increase of applications, the search engine will include all the resources of different android manufacturers and different android providers, including various resources on the android website, so there will be a large n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 李锦根张云飞黄兴红
Owner 深圳宜搜天下科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products