A Genetic Quantification and Representation Method for Chinese Documents Based on Numeric-String Mixed Coding
A numeric-string mixed coding technique, applied to the genetic quantification and representation of Chinese documents, addresses the problem of low matching accuracy. It improves document protection, facilitates storage and matching, and prevents unauthorized reading.
Embodiment Construction
[0039] In order to make the above objects, features and advantages of the present invention more obvious and understandable, the present invention will be further described below through specific embodiments and accompanying drawings.
[0040] Figure 1 is a schematic diagram of the constituent elements of a document gene. A document gene is composed of document carrier features, document attribute features and document content features. Document carrier features consist of file name, file size, file creation time, file modification time and file hash values (including MD5, SHA1, SHA256 and SHA512). Document attribute features consist of inherent attributes and statistical attributes: inherent attributes include document type, document title, document category, document notes, document author, document revision number and the user who last saved the document; statistical attributes include document word count, document sentence count and document paragraph count; do...
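As an illustration only, the carrier features and statistical attributes listed above could be collected along the following lines. This is a minimal sketch, not the patented method: the names `CarrierFeatures`, `carrier_features` and `statistical_attributes` are hypothetical, and the counting rules (one CJK character counted as one word, sentences split on 。！？!?, paragraphs taken as non-empty lines) are assumptions, since the source does not define them.

```python
import hashlib
import os
import re
from dataclasses import dataclass


@dataclass
class CarrierFeatures:
    """Hypothetical container for the carrier features named in [0040]."""
    file_name: str
    file_size: int
    created: float   # seconds since the epoch
    modified: float  # seconds since the epoch
    hashes: dict     # algorithm name -> hex digest


def carrier_features(path: str) -> CarrierFeatures:
    """Collect file name, size, timestamps and four hash values for a file."""
    st = os.stat(path)
    # Assumption: use st_birthtime where the platform provides it. Otherwise
    # fall back to st_ctime, which is the creation time on Windows but the
    # metadata-change time on most Unix systems.
    created = getattr(st, "st_birthtime", st.st_ctime)

    digests = {name: hashlib.new(name)
               for name in ("md5", "sha1", "sha256", "sha512")}
    with open(path, "rb") as fh:
        # Read in chunks so large documents are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(65536), b""):
            for digest in digests.values():
                digest.update(chunk)

    return CarrierFeatures(
        file_name=os.path.basename(path),
        file_size=st.st_size,
        created=created,
        modified=st.st_mtime,
        hashes={name: d.hexdigest() for name, d in digests.items()},
    )


def statistical_attributes(text: str) -> dict:
    """Count words, sentences and paragraphs under the stated assumptions."""
    # Assumption: each CJK unified ideograph counts as one word.
    words = re.findall(r"[\u4e00-\u9fff]", text)
    # Assumption: sentences end at Chinese or Western terminal punctuation.
    sentences = [s for s in re.split(r"[。！？!?]+", text) if s.strip()]
    # Assumption: each non-empty line is one paragraph.
    paragraphs = [p for p in text.splitlines() if p.strip()]
    return {
        "word_count": len(words),
        "sentence_count": len(sentences),
        "paragraph_count": len(paragraphs),
    }
```

Computing all four digests in a single pass avoids reading the file four times. The inherent attributes (title, author, revision number and so on) depend on the document format and are left out of this sketch.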