Compression, search and decompression of log messages
By using tokenization and lossless compression of log messages, the problem of high storage and search costs for massive log data is solved. It achieves efficient lossless compression and fast search, supports complex queries and custom analysis, and avoids dependence on source code.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- 源维科技
- Filing Date
- 2020-11-16
- Publication Date
- 2026-06-16
AI Technical Summary
Existing technologies incur high storage and search costs when processing massive amounts of log data. Furthermore, compressed logs cannot be searched efficiently. Conventional compression tools result in slow decompression speeds, making it impossible to directly analyze compressed logs. Moreover, access to the source code of the program that generates the logs is required, leading to security and commercial limitations.
Log messages are tokenized and categorized into numeric and non-numeric expressions. They are then stored as compressed log messages using lossless compression technology, including timestamps, a dictionary of non-numeric expressions, and a dictionary of log types. The system supports searches without decompression and provides an API for custom analysis.
It significantly reduces storage space requirements, increases search speed by 100-1000 times, supports complex queries and custom analysis, saves computing resources, and does not require access to the source code of the program that generates logs.
Smart Images

Figure CN112800008B_ABST