Method and system for processing text based on DNA sequences
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- INST OF RADIATION MEDICINE ACAD OF MILITARY MEDICAL SCI OF THE PLA
- Publication Date
- 2011-09-28
- Estimated Expiration
- Not applicable · inactive patent
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The present invention relates to an information processing method and system, in particular to a DNA sequence-based text processing method and system. Background technique
[0002] Spectrum description, similarity comparison and cluster analysis of text are routine analysis methods in text processing. At present, there are many kinds of text processing systems, but most of them only complete one of the tasks, such as the academic paper detection system of China National Knowledge Infrastructure (CNKI) and the ROST anti-plagiarism system developed by Associate Professor Shen Yang of Wuhan University and his team. In order to complete the similarity comparison of texts.
[0003] The spectral characterization of text refers to analyzing one or more texts from the level of characters (single character or multi-character combination), by fixing all possible characters or character combinations on the abscissa, and then counting their presence in the text o...