Network transmission method for large extensible markup language (XML) document

A network transmission technology, applied to the network transmission of large XML files, that addresses the high cost, low overall performance, and unsatisfactory compression rate of existing approaches. It improves transmission efficiency and reduces both the number of transmissions and the transmission time.

Status: Inactive
Publication Date: 2012-07-11
PEKING UNIV FOUNDER GRP CO LTD +2
5 Cites, 27 Cited by

AI Technical Summary

Problems solved by technology

The Burrows-Wheeler transform used by bzip2 achieves a high compression rate for strings that occur frequently in the source document, but its compression rate for loosely contiguous strings is not ideal, and its overall performance is low.
The PPM algorithm is based on context modeling and prediction, which inevitably consume a large amount of computing resources. Although PPM is simple and achieves a high compression rate, it is a relatively expensive compression algorithm.
Although, in theory, schema-dependent compression can achieve a higher compression rate than schema-independent compression, it also ha




Embodiment Construction

[0064] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0065] As shown in Figure 1, a network transmission method for large XML files comprises the following steps:

[0066] S1: read the XML document;

[0067] S2: preprocess all node tags of the XML document;

[0068] S3: process redundant tags in the XML document;

[0069] S4: complete the compression processing and fragmented transmission of the XML document.

[0070] Wherein, step S2 further includes:

[0071] S21: find all node tag information in the XML document, such as node names and node attribute values, save this information in a node data table, and assign each entry in the table a unique number that serves as its index value in the table;

[0072] S22: after the node data table has been built, the corresponding index number of each node is used to process th...
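
The node data table of step S21 can be illustrated with a minimal Python sketch. It assumes the table holds one entry per distinct tag name and per distinct attribute name/value pair, numbered in order of first appearance. The function names `build_node_table` and `encode_tags` are hypothetical, and because the text of step S22 is truncated in the source, `encode_tags` is only one plausible reading of how the index numbers might stand in for tag names, not the patented procedure.

```python
import xml.etree.ElementTree as ET


def build_node_table(xml_text):
    """Map each distinct tag name and attribute pair to a unique index.

    Assumption: entries are numbered in order of first appearance.
    """
    table = {}
    root = ET.fromstring(xml_text)
    for elem in root.iter():  # document order, root included
        entries = [("tag", elem.tag)]
        entries += [("attr", f"{k}={v}") for k, v in elem.attrib.items()]
        for entry in entries:
            if entry not in table:
                table[entry] = len(table)
    return table


def encode_tags(xml_text, table):
    """Hypothetical use of the table: replace each tag name by its index."""
    root = ET.fromstring(xml_text)
    return [table[("tag", elem.tag)] for elem in root.iter()]


sample = (
    '<books><book id="1"><title>A</title></book>'
    '<book id="2"><title>B</title></book></books>'
)
node_table = build_node_table(sample)
tag_indices = encode_tags(sample, node_table)
```

Repeated tags such as `book` and `title` share a single table entry, so a long tag name is stored once and each further occurrence costs only a small integer.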


Abstract

The invention discloses a network transmission method for large extensible markup language (XML) documents. The method comprises the following steps: 1, reading the XML document; 2, preprocessing all node tags in the XML document; 3, processing redundant tags in the XML document; and 4, completing the compression processing and fragmented transmission of the XML document. Step 4 further comprises: 41, setting an initial fragment size, Initial_Size; 42, extracting data to form a data packet of size Initial_Size for compression processing and network transmission; 43, assessing the current network load and obtaining the value of the maximum transmission unit (MTU); 44, re-determining the fragment size, New-Size, according to the current network load and the MTU value; 45, extracting a new data packet of size New-Size for compression processing and network transmission; and 46, repeating steps 43 to 45.
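
The adaptive fragmentation loop of steps 41 to 46 can be sketched in Python as follows. The abstract does not give the rule for deriving New-Size from the network load and the maximum transmission unit (MTU), so the policy in `next_fragment_size` (halve under heavy load, double under light load, round down to a multiple of the MTU) is purely an illustrative assumption. `zlib` stands in for whatever compressor the method actually uses, and `probe_network` is a hypothetical callback that returns a load estimate between 0 and 1 together with the MTU in bytes.

```python
import zlib


def next_fragment_size(current_size, load, mtu, min_size=512, max_size=65536):
    """Hypothetical policy for step 44; thresholds are assumptions."""
    if load > 0.8:        # heavy load: send smaller fragments
        size = current_size // 2
    elif load < 0.3:      # light load: send larger fragments
        size = current_size * 2
    else:
        size = current_size
    size = max(min_size, min(size, max_size))
    # Round down to a whole number of MTU-sized units, never below one MTU.
    return max(mtu, (size // mtu) * mtu)


def fragment_and_compress(data, initial_size, probe_network):
    """Yield compressed fragments, re-sizing after each one (steps 41-46)."""
    size = initial_size                      # step 41
    offset = 0
    while offset < len(data):
        chunk = data[offset:offset + size]   # steps 42 and 45
        yield zlib.compress(chunk)           # compress; caller transmits it
        offset += len(chunk)
        load, mtu = probe_network()          # step 43
        size = next_fragment_size(size, load, mtu)  # step 44


document = b"<item>value</item>" * 2000
packets = list(fragment_and_compress(document, 8192, lambda: (0.9, 1500)))
restored = b"".join(zlib.decompress(p) for p in packets)
```

Each fragment is compressed independently here, so the receiver can decompress packets as they arrive; this costs some compression ratio compared with compressing the whole document at once.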

Description

Technical field

[0001] The invention relates to the transmission of XML files, and in particular to a method for the network transmission of large XML files.

Background

[0002] XML (Extensible Markup Language) is a markup language with the following advantages: (1) it is self-describing; (2) it supports validation of document content; (3) it allows markup languages for different fields to coexist; (4) it supports advanced collection. Documents in XML format are therefore widely used in today's Internet applications. Because Internet bandwidth is limited, effective XML compression and a well-designed transmission mechanism are very important when a system transmits large XML files.

[0003] There are currently two approaches to transferring large XML files: (1) file-structure-aware; (2) non-file-structure-aware. The file-structure-aware approach is further divided into schema-based and non-schema-based.

[0004] Considering that XML files are stored in the form of ...


Application Information

IPC(8): H04L29/08
Inventor: 孙伟丰, 李浩, 王恒, 程仁波, 刘钰, 王文军, 郑程光, 赖铮, 罗正海, 徐邵稀, 胡奎, 关健, 赵伟
Owner: PEKING UNIV FOUNDER GRP CO LTD