Chinese data compression method and Chinese data decompression method and related devices

A data compression and decompression technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of insufficient compression rate and inability to greatly compress Chinese data, and achieve rapid decompression and large-scale compression The effect of Chinese data

Active Publication Date: 2010-06-23
ALIBABA (CHINA) CO LTD
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

After analyzing, experimenting and comparing various lossless compression techniques, the inventors found that although the above-mentioned lossless compression techniques all have the ability to compress Chinese data, they are not suitable for use in environments w

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese data compression method and Chinese data decompression method and related devices
  • Chinese data compression method and Chinese data decompression method and related devices
  • Chinese data compression method and Chinese data decompression method and related devices

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions provided by the embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0038] The invention provides a kind of Chinese data compression method, and this method comprises the steps:

[0039] Step A, read the Chinese data to be compressed;

[0040] Step B, performing word segmentation on the Chinese data to obtain a word segmentation set that forms the Chinese data;

[0041] Step C, read the participle from the participle set, if the participle is composed of more than two Chinese characters, then search whether there is the participle in the preset participle encoding library, and if so, obtain the encoding of the participle , and store the code into the compressed data, the code occupies at most (at most) two bytes of storage space of the compressed data;

[0042...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese data compression and a Chinese data decompression method and related devices. The Chinese data compression method comprises the following steps: in the step A, reading Chinese data to be compressed; in the step B, carrying out word segmenting on the Chinese data to obtain a word set composing the Chinese data; in the step C, reading a word from the word set, if the word consists of more than two Chinese characters, searching the word in a preset word code library, if the word is searched, acquiring a code of the word from the word code library and storing the code into compression data, wherein the code at most occupies two bytes of storage space in the compression data; and repeating the step C until all the word in the word set are processed. The Chinese data compression technology provided by the invention can realize great compression of the Chinese data. The decompression method for the Chinese compression data, which is provided by the invention, can realize rapid decompression of the compression data.

Description

technical field [0001] The invention relates to the technical field of data compression, in particular to a Chinese data compression and decompression method and related equipment. Background technique [0002] The information age has brought about an "information explosion". The surge in data volume requires effective data compression for transmission or storage. Especially with the widespread application of embedded terminals such as PDAs, mobile phones, and navigators, due to the relatively low hardware conditions of these terminals, they cannot meet the storage requirements of massive data. Therefore, the demand for compressing and storing massive data is more urgent and urgent. strict. [0003] Since Shannon proposed information entropy theory and a simple encoding method - Shannon encoding in 1948, data compression technology has experienced a rapid development stage. Existing data compression techniques are mainly divided into two categories: lossy compression techn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/22
Inventor 吴跃进
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products