Method and system for processing text based on DNA sequences

A DNA sequence-based text processing technology, applied to text processing methods and systems, that addresses the problems of existing tools: each performs only a single task, the tools cannot exchange data with each other, and execution efficiency is low.
CN102200967A · Inactive · Publication Date: 2011-09-28 · INST OF RADIATION MEDICINE ACAD OF MILITARY MEDICAL SCI OF THE PLA

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
INST OF RADIATION MEDICINE ACAD OF MILITARY MEDICAL SCI OF THE PLA
Publication Date
2011-09-28
Estimated Expiration
Not applicable · inactive patent


Abstract

The invention provides a method and system for processing text based on DNA sequences. The method comprises the following steps: allocating DNA sequence codes to the characters of two or more texts; and performing similarity analysis on the DNA-encoded texts by using a DNA sequence processing method. The characters are one or more kinds of digits, letters, words or symbols, and the letters or words may belong to one or more languages. The DNA sequence codes are allocated to the characters by the following steps: allocating a decimal number to each character of the texts; converting the decimal numbers into quaternary numbers; making the digits 0, 1, 2 and 3 of the quaternary numbers each correspond to one of the four deoxyribonucleotides; and converting the quaternary numbers into DNA sequence codes. The invention also provides a system for implementing the method. The method and system do not depend on an existing database or on keyword extraction, place no restriction on the number of characters or character combinations, and can analyze text information efficiently and comprehensively.
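The encoding steps in the abstract (decimal number, then quaternary number, then nucleotide letters) can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the Unicode code point serves as each character's decimal number, that the digits 0, 1, 2, 3 map to A, C, G, T in that order, and that every character is padded to a fixed width of 11 nucleotides (enough for any code point, since 4^11 exceeds 0x10FFFF). The function names `char_to_dna`, `text_to_dna` and `dna_to_text` are hypothetical.

```python
NUCLEOTIDES = "ACGT"  # assumed mapping: 0 -> A, 1 -> C, 2 -> G, 3 -> T
WIDTH = 11            # assumed fixed width; 4**11 covers every Unicode code point


def char_to_dna(ch: str, width: int = WIDTH) -> str:
    """Encode one character as a fixed-width DNA string."""
    code = ord(ch)  # assumption: the code point is the character's decimal number
    letters = []
    for _ in range(width):
        code, digit = divmod(code, 4)  # peel off the lowest quaternary digit
        letters.append(NUCLEOTIDES[digit])
    if code:
        raise ValueError(f"{ch!r} does not fit in {width} quaternary digits")
    return "".join(reversed(letters))  # most significant digit first


def text_to_dna(text: str, width: int = WIDTH) -> str:
    """Concatenate the DNA codes of all characters in the text."""
    return "".join(char_to_dna(ch, width) for ch in text)


def dna_to_text(dna: str, width: int = WIDTH) -> str:
    """Invert text_to_dna by reading the sequence in width-sized chunks."""
    chars = []
    for start in range(0, len(dna), width):
        value = 0
        for letter in dna[start:start + width]:
            value = value * 4 + NUCLEOTIDES.index(letter)
        chars.append(chr(value))
    return "".join(chars)


if __name__ == "__main__":
    encoded = text_to_dna("DNA")
    print(encoded)
    print(dna_to_text(encoded))
```

A fixed width keeps character boundaries recoverable without separators, at the cost of longer sequences for texts that use only low code points.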

Description

Technical field

[0001] The present invention relates to an information processing method and system, in particular to a DNA sequence-based text processing method and system.

Background art

[0002] Spectral characterization, similarity comparison and cluster analysis of text are routine analysis methods in text processing. At present, there are many kinds of text processing systems, but most of them complete only one of these tasks. For example, the academic paper detection system of China National Knowledge Infrastructure (CNKI) and the ROST anti-plagiarism system developed by Associate Professor Shen Yang of Wuhan University and his team perform only similarity comparison of texts.

[0003] The spectral characterization of text refers to analyzing one or more texts at the character level (single characters or multi-character combinations) by fixing all possible characters or character combinations on the abscissa and then counting their presence in the text o...
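The counting step described above can be illustrated with a short sketch. It covers only what the paragraph states (counting how often each character or character combination occurs in a text) and assumes overlapping combinations of a fixed length `n`; the function name `ngram_spectrum` is hypothetical and not taken from the patent.

```python
from collections import Counter


def ngram_spectrum(text: str, n: int = 1) -> Counter:
    """Count every overlapping run of n characters in the text.

    The keys play the role of the abscissa (characters or character
    combinations); the values are their occurrence counts.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


if __name__ == "__main__":
    sample = "abab"
    print(ngram_spectrum(sample, 1))  # single characters
    print(ngram_spectrum(sample, 2))  # two-character combinations
```

Computing the spectrum with the same `n` for two texts yields count vectors over a shared set of keys, which is one possible starting point for comparing them.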
