Method and device for determining importance degree of sequence site, equipment and storage medium
A technology of importance and sequence, applied in the determination method of sequence site importance, equipment and storage media, and device fields, can solve problems affecting the prediction accuracy of transcription factor binding sites, and achieve the effect of ensuring accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] figure 1 A schematic flow chart of a method for determining the importance of a sequence site provided in Embodiment 1 of the present invention, the method is applicable to the case of determining the importance of a sequence site in a sequence characteristic string of a transcription factor, and the method can be determined by the sequence site The importance determination device is implemented, wherein the device can be implemented by software and / or hardware, and is generally integrated into computer equipment.
[0029] Such as figure 1 As shown, a method for determining the importance of a sequence site provided by Embodiment 1 of the present invention includes the following operations:
[0030] It should be noted that when the transcription factor binding site is predicted based on the sequence signature string in the transcription factor based on the existing prediction method, because the different importance of each sequence site of the sequence signature strin...
Embodiment 2
[0051] figure 2 It is a schematic flowchart of a method for determining the importance of sequence sites provided by Embodiment 2 of the present invention. Embodiment 2 of the present invention is optimized on the basis of the above-mentioned embodiments. In this embodiment, each of the sites is further initialized The specific optimization of the weight vector is: randomly select the initial component value of each component in the weight vector of each site within a set value range, wherein the set value range is (0,1).
[0052] Further, in this embodiment, each initial site weight vector is iteratively processed based on the selected optimal solution search algorithm to obtain the target site weight vector. The specific optimization is as follows: each of the initial site weight vectors is selected as Individuals of the current population in the genetic algorithm; determine the fitness value of each individual in the current population relative to the set of equal-length s...
Embodiment 3
[0109] image 3 A structural block diagram of a device for determining the importance of sequence sites provided in Embodiment 3 of the present invention. The device is suitable for determining the importance of the sequence site in the sequence feature string of the transcription factor, and the device can be implemented by software and / or hardware, and is generally integrated into computer equipment. Such as image 3 As shown, the device includes: a vector generation module 31 , a vector initialization module 32 , a vector processing module 33 and an importance determination module 34 .
[0110] Wherein, the vector generation module 31 is used to determine the number of sequence points that the sequence feature string has in the set of fixed-length sequence strings, and generate a set number of dimensions as the position weight vector of the number of sequence points;
[0111] A vector initialization module 32, configured to initialize each of the site weight vectors, and ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


