Method for ordering and seeking character strings

A string and character technology, applied in the field of character processing, can solve problems such as low processing efficiency and large time overhead, and achieve the effects of speeding up efficiency, improving search efficiency, and improving creation efficiency

Inactive Publication Date: 2010-06-23
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the defects of low processing efficiency and large time overhead due to the need to process each character in the string separately in the existing string processing method, thereby providing a method based on Godel encoding to realize String handling methods for

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for ordering and seeking character strings
  • Method for ordering and seeking character strings
  • Method for ordering and seeking character strings

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0037] In one embodiment of the present invention, the establishment of a peptide sequence dictionary for a protein database is taken as an example to illustrate the specific implementation and application of the method of the present invention.

[0038] A large amount of data about protein sequences are stored in a protein database. A protein sequence is composed of multiple amino acids, and its subsequence is called a peptide sequence. Since amino acids are usually represented by English letters in the prior art, a peptide sequence composed of amino acids is usually represented by a string of English letters. For example, "AAIK", "GK", etc.

[0039] The process of building a peptide sequence dictionary from the protein database can refer to figure 1 , which mainly includes: simulating the enzymatic digestion process in biology to split the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for ordering and seeking character strings, which comprises the following steps of: sorting characters in all the character strings needed to be ordered and assigning a numerical value for one type of characters, wherein the assigned numerical values of different types of characters are different; by combining the assigned numerical value of each character, adopting a Godel coding method to code each character string needed to be ordered respectively, wherein each character string obtains one Godel coding value represented by a number; and comparing the Godel coding values of all the character strings needed to be ordered and ordering the character strings according to the magnitude of the Godel coding values. The method adopts the Godel coding method to map the character strings into the Godel coding values represented by floating numbers and orders the character strings by ordering the Godel coding values so as to enhance the ordering efficiency.

Description

technical field [0001] The invention relates to the field of character processing, in particular to a method for sorting and searching character strings. Background technique [0002] Currently, performing processing operations on computer strings, including sorting and searching, has a wide range of demands in daily work, study or research. To give a simple example, in an Excel document, the user may need to sort the strings in the table. In a relatively complicated example, to establish a corresponding peptide sequence database based on a protein database, it is also necessary to perform a sorting operation on character strings (usually English letter strings) used to represent peptide sequences. In addition, in operations such as establishing electronic dictionaries in various languages ​​and looking up names in the phone book, strings also need to be processed. [0003] In the prior art, the processing of character strings on computers usually takes the character strin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 李由贺思敏付岩袁作飞迟浩王海鹏王乐珩孙瑞祥
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products