Network flow index retrieving and compressing method based on inverted list

An index retrieval and inverted-index technique, applicable to data exchange networks, special-purpose data processing applications, instruments, and similar fields. It addresses under-utilization in existing index compression schemes and provides efficient indexing.

Status: Inactive. Publication date: 2014-08-27
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

[0026] The code pattern of the relative-10 algorithm still leaves 2 bits unused.

Examples

Embodiment Construction

[0054] The preferred embodiments will be described in detail below with reference to the accompanying drawings. It should be emphasized that the following description is exemplary only, and is not intended to limit the scope of the invention and its application.

[0055] The approach of the present invention has two main parts. The first part builds a dictionary index of IP addresses for the network flow information; the second part applies an index compression algorithm to the data in the inverted list. The first part specifically includes: converting numbers into dictionaries; building an inverted index from the IP offset dictionary; and taking fixed-length integer sequences and compressing them into an index file. The index compression algorithms in the second part include the pForDelta, simple9, and carryover-12 algorithms.
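The first part can be illustrated with a minimal sketch. The record layout (an IP string paired with a byte offset into a flow file), the function names, and the use of gap (delta) encoding before compression are all assumptions made for illustration; the excerpt does not specify how the patent lays out its IP offset dictionary.

```python
from collections import defaultdict
import ipaddress


def build_inverted_index(flows):
    """Map each IP address (as an integer key) to a sorted list of record offsets.

    `flows` is a hypothetical iterable of (ip_string, record_offset) pairs.
    """
    postings = defaultdict(list)
    for ip, offset in flows:
        key = int(ipaddress.ip_address(ip))  # the number used as the dictionary key
        postings[key].append(offset)
    return {key: sorted(offsets) for key, offsets in postings.items()}


def to_gaps(sorted_offsets):
    """Replace each offset by its distance from the previous one (assumed step)."""
    gaps, prev = [], 0
    for off in sorted_offsets:
        gaps.append(off - prev)
        prev = off
    return gaps


def from_gaps(gaps):
    """Invert to_gaps by accumulating the differences."""
    offsets, total = [], 0
    for gap in gaps:
        total += gap
        offsets.append(total)
    return offsets
```

Gap encoding keeps the integers in each inverted list small, which is what lets word-aligned compressors such as simple9 pack many of them into one machine word.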

[0056] The following takes the Internet traffic big data retrie...

Abstract

The invention discloses a network flow index retrieval and compression method based on an inverted list, in the intersecting field of computer networks and big data analysis. The method addresses problems in current research on network flow index retrieval and compression. It comprises the following steps. First, an index is built from IP offset addresses: numbers are converted into a dictionary, and an inverted index is then built from the IP offset dictionary. Second, the data in the inverted list are compressed; the compression algorithms include the simple9, carryover-12, and pForDelta algorithms. Third, decompression and retrieval are carried out: the decoder matching the compression algorithm is selected and decodes the compression units, and the decoded numbers are converted back into the dedicated data structures, such as the inverted index and the dictionary. Finally, the information of all flow packets is obtained from the inverted index. The method realizes efficient indexing and index compression, so that massive network flow data can be retrieved effectively.
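As a rough illustration of the compression and decoding steps, the sketch below implements the widely described Simple9 layout: each 32-bit word holds a 4-bit selector and 28 data bits, and the selector picks one of nine ways to split those bits. It is a generic rendering of Simple9, not the patent's own encoder; the function names and the low-bits-first packing order are assumptions.

```python
# (count, bits) for the nine Simple9 layouts of the 28 data bits.
SIMPLE9_LAYOUTS = [(28, 1), (14, 2), (9, 3), (7, 4), (5, 5),
                   (4, 7), (3, 9), (2, 14), (1, 28)]


def simple9_encode(values):
    """Pack non-negative integers below 2**28 into 32-bit words."""
    words = []
    pos = 0
    while pos < len(values):
        # Try the densest layout first; fall back to wider fields.
        for selector, (count, bits) in enumerate(SIMPLE9_LAYOUTS):
            chunk = values[pos:pos + count]
            if len(chunk) == count and all(0 <= v < (1 << bits) for v in chunk):
                word = selector << 28  # selector sits in the top 4 bits
                for i, v in enumerate(chunk):
                    word |= v << (i * bits)
                words.append(word)
                pos += count
                break
        else:
            raise ValueError("value is negative or does not fit in 28 bits")
    return words


def simple9_decode(words):
    """Read the selector of each word and unpack its fields in order."""
    values = []
    for word in words:
        count, bits = SIMPLE9_LAYOUTS[word >> 28]
        mask = (1 << bits) - 1
        for i in range(count):
            values.append((word >> (i * bits)) & mask)
    return values
```

The encoder only accepts a layout when enough values remain to fill it, so the decoder needs no separate length field: every word yields exactly the number of values its selector announces.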

Description

Technical field

[0001] The invention relates to the intersecting field of computer networks and big data analysis, and in particular to a method for retrieving and compressing a network flow index based on an inverted list.

Background art

[0002] 1. Network traffic

[0003] When information is transferred across computer networks, a single piece of information is divided into multiple data blocks, each sent as a transmission unit. Each block may travel along a different path through one or more networks and is reassembled at the destination; these blocks are "network packets". In the Transmission Control Protocol / Internet Protocol (TCP/IP) protocol suite, network packets can be divided, according to the information they contain, into Internet Protocol (IP) network packets, transport-layer Transmission Control Protocol / User Datagram Protocol (TCP/UDP) network packets and appl...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L29/06, H04L12/24, G06F17/30
Inventors: 陈震, 刘洪健, 马戈, 曹军威
Owner: TSINGHUA UNIV