Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for avoiding repeated storage of content in distributed cache collaboration

A distributed cache and distributed hash technology, applied in transmission systems, electrical components, etc., can solve the problems of low cache utilization and difficult identification.

Active Publication Date: 2018-10-16
NANJING NORMAL UNIVERSITY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the current content is identified by URL in the cache, and the same content will be identified by different URLs at different content providers, making it difficult for cache nodes to identify objects with the same actual content based on the URL, which will be repeated in the cache storage, resulting in low cache utilization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for avoiding repeated storage of content in distributed cache collaboration
  • Method and device for avoiding repeated storage of content in distributed cache collaboration
  • Method and device for avoiding repeated storage of content in distributed cache collaboration

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] This embodiment discloses a method for avoiding duplicate storage of content based on distributed cache collaboration, including the following steps:

[0057] S101. Use a binary identification method of provider-dependent identification and provider-independent identification to identify the content. Wherein, the provider-dependent identifier means that the identifier of the content is related to the provider, so the same content data block may have different identifiers in different content providers, such as URL identifiers; the independent identifier of the provider refers to the identifier of the content It has nothing to do with the provider, so the same content data block has the same content identification. For example, existing hash algorithms such as MD2, MD4, MD5 and SHA-1 can be used to perform hash operations on content data blocks, and the obtained hash summary Can be independently identified as a provider.

[0058] S102. Establish a two-level logical mapp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for avoiding repeated storage of content in distributed cache cooperation. The method comprises the steps as follows: using a binary identifying method to perform binary identification on content data blocks; establishing two-level logic mapping tables based on the binary identification of the content data blocks to organize caching content, and performing cache cooperation on the caching content based on distributed Hash operation; receiving a request for a certain content data block from the external by an edge content router, forwarding the request to an internal node which is in charge of storing the content via two-stage routing, transmitting the data block with the node which is in charge of the corresponding content data block to the edge content router, and updating cache replacement control state information of corresponding items of the two-level mapping tables; and receiving a certain data block from the external by the edge content router, forwarding the data block to an internal node which is in charge of storing the content through two-stage routing, and establishing corresponding two-level mapping relationships in the routing forwarding process.. The method of the invention could avoid repeated storage of the content, thereby reducing network flow and improving user experience.

Description

technical field [0001] The present invention relates to the technical field of network storage and content distribution, in particular to a method and device for avoiding duplicate storage of content in distributed cache collaboration. Background technique [0002] At present, users' access to the Internet has changed from point-to-point communication to content acquisition. The traditional TCP / IP network only transmits content, but does not perceive content, resulting in a large amount of redundant traffic transmission on the network. In order to solve the content explosion caused by content acquisition, whether it is the current Internet or the future network proposed by the research community, caching is used as a basic means to satisfy users' asynchronous access to content with heavy tail characteristics. For example, transparent Web Cach adopted by the Internet, PPCache in P2P content distribution network, content cache in CDN, and information / content-centric network N...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L29/08
Inventor 张国强
Owner NANJING NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products