The invention discloses
a DNA storage coding method for optimizing Chinese storage, which comprises the following steps: 1) inputting a Chinese text, and recoding first-level
Chinese characters or first-level and second-level
Chinese characters according to the type of contained characters and the GB2312-80 standard; 2) counting the occurrence frequency of the segmented words in the text, multiplying the occurrence frequency by the length of the segmented words, sorting the products, and encoding the segmented words ranked in the front column,3) converting all characters into a binary sequence, and carrying out
Huffman coding compression, 4) converting into
a DNA sequence, and adding an address code and an RS error
correction code, 5) the decoding process being an encoding reverse process,firstly carrying outerror correction , then carrying out the sequence splicing, and converting the
DNA sequence into a binary sequence. According to the method, the redundancy of the Chinese text isreduced, the
DNA storage coding compression effect is improved, and extremely high Chinese coding potential is obtained.