Methods and devices for text data compression and decompression

A technology of text data and compression method, which is applied in the computer field, can solve the problems of large transmission volume and low transmission efficiency of text data, and achieve the effect of solving excessive transmission volume

Inactive Publication Date: 2016-08-17
ADVANCED NEW TECH CO LTD
View PDF5 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present application provides a text data compression method to solve the problems of excessive text data transmission and low transmission efficiency in the prior art
[0005] The embodiment of the present application provides a text data decompression method to solve the problems of excessive text data transmission and low transmission efficiency in the prior art
[0006] The embodiment of the present application provides a text data compression device to solve the problems of excessive text data transmission and low transmission efficiency in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods and devices for text data compression and decompression
  • Methods and devices for text data compression and decompression
  • Methods and devices for text data compression and decompression

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] figure 1 The process flow of the text data compression method provided in Embodiment 1 of the present application mainly refers to re-customizing the 62 characters of 0-9, A-Z, and a-z involved in the data packet and forming a corresponding American Standard for Information Interchange code (American Standard Code for Information Interchange, ASCII) dictionary table, and then compress the useful low-order (Binary Digit, Bit) value in one byte of custom encoding to less byte space, specifically including the following steps:

[0034] S101: Extract characters in the data packet.

[0035] The number of ASCII-encoded characters exceeds 128, and 8 Bit values ​​are required for encoding in binary, and a maximum of 256 characters can be encoded. In some specific application scenarios, the number of characters to be encoded is less than 128 or less. For example, in some RPC call scenarios, the characters to be encoded only involve 62 characters of 0-9, A-Z, and a-z. The data ...

Embodiment 2

[0063] figure 2 The process flow of the text data decompression method provided in Embodiment 2 of the present application mainly refers to decompressing the text data compressed in Embodiment 1, including the following steps:

[0064] S201: Receive compressed data packets.

[0065] The compressed data packet received in this step refers to the compressed data packet in Embodiment 1. The decompression of the data packet mainly refers to the decompression processing of the received compressed data packet, and the operation is performed at the receiving end of the data packet, such as at the web page end and the smart terminal.

[0066] S202: sequentially extract the Bit value representing a character in the compressed data packet as the custom code of the character.

[0067] The Bit value representing a character refers to a six-bit Bit value representing a custom code of a character. The characters in the compressed data packet are compressed and grouped according to Embod...

Embodiment 3

[0085] image 3 Shown is a schematic structural diagram of the text data compression device provided in Embodiment 3 of the present application, specifically including:

[0086] An extracting unit 301, configured to extract characters in the data packet, wherein the characters occupy a certain number of bytes;

[0087] The conversion unit 302 is configured to obtain a custom code corresponding to the character in a specific manner according to the standard code of the character, and the number of bits encoded by the custom code is smaller than the number of bits encoded by the standard code;

[0088] Compression unit 303, configured to compress the useful low bit value of the self-defined code into a certain byte space, wherein the certain byte space is less than the byte space occupied by the character standard code;

[0089] A sending unit 304, configured to send the compressed data packet.

[0090] The number of ASCII-encoded characters exceeds 128, and 8 Bit values ​​are...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for text data compression, comprising: extracting characters in a data packet; according to the standard codes of the characters, obtaining user-defined codes corresponding to the characters through a special method, the coded Bit number of the user-defined code being smaller than the coded Bit number of the standard code; compressing available low-order Bit value of the user-defined code to a certain byte space, wherein the certain byte space is smaller than the byte space occupied by the standard codes of the characters. Through user-defined coding on few characters, the user-defined code Bit value is compressed in relatively few bytes, and problems in the prior art that transmission amount of text data is excessive and transmission efficiency is low are solved. The invention also discloses a method for text data decompression, a device for text data compression, and a device for text data decompression.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method and device for compressing and decompressing text data. Background technique [0002] Currently, there are multiple standards for character encoding, such as the American Standard Code for Information Interchange (ASCII) that uses 8-bit codes to encode 255 characters, and the Unicode (Universal Code) that uses 16-bit standards to solve the problem of insufficient capacity. Multiple-Octet Coded Character Set, Unicode, UCS), etc. When a character occupies more digits, it means that it occupies a larger amount of storage, and it also requires more bandwidth and more transmission time during the file transfer process. [0003] In some application scenarios, such as the RPC remote service call process of massive data requests, the network transmission data packets are not compressed, and in high concurrency and large data volume scenarios, the data transmission v...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 王俊翔
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products