Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for determining importance degree of sequence site, equipment and storage medium

A technology of importance and sequence, applied in the determination method of sequence site importance, equipment and storage media, and device fields, can solve problems affecting the prediction accuracy of transcription factor binding sites, and achieve the effect of ensuring accuracy

Active Publication Date: 2017-12-26
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when using existing calculation methods to predict transcription factor binding sites for a given sequence signature string, it is usually carried out under the premise that each sequence site in the default sequence signature string has the same importance, which greatly affects Accuracy of predictions for transcription factor binding sites

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for determining importance degree of sequence site, equipment and storage medium
  • Method and device for determining importance degree of sequence site, equipment and storage medium
  • Method and device for determining importance degree of sequence site, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] figure 1 A schematic flow chart of a method for determining the importance of a sequence site provided in Embodiment 1 of the present invention, the method is applicable to the case of determining the importance of a sequence site in a sequence characteristic string of a transcription factor, and the method can be determined by the sequence site The importance determination device is implemented, wherein the device can be implemented by software and / or hardware, and is generally integrated into computer equipment.

[0029] Such as figure 1 As shown, a method for determining the importance of a sequence site provided by Embodiment 1 of the present invention includes the following operations:

[0030] It should be noted that when the transcription factor binding site is predicted based on the sequence signature string in the transcription factor based on the existing prediction method, because the different importance of each sequence site of the sequence signature strin...

Embodiment 2

[0051] figure 2 It is a schematic flowchart of a method for determining the importance of sequence sites provided by Embodiment 2 of the present invention. Embodiment 2 of the present invention is optimized on the basis of the above-mentioned embodiments. In this embodiment, each of the sites is further initialized The specific optimization of the weight vector is: randomly select the initial component value of each component in the weight vector of each site within a set value range, wherein the set value range is (0,1).

[0052] Further, in this embodiment, each initial site weight vector is iteratively processed based on the selected optimal solution search algorithm to obtain the target site weight vector. The specific optimization is as follows: each of the initial site weight vectors is selected as Individuals of the current population in the genetic algorithm; determine the fitness value of each individual in the current population relative to the set of equal-length s...

Embodiment 3

[0109] image 3 A structural block diagram of a device for determining the importance of sequence sites provided in Embodiment 3 of the present invention. The device is suitable for determining the importance of the sequence site in the sequence feature string of the transcription factor, and the device can be implemented by software and / or hardware, and is generally integrated into computer equipment. Such as image 3 As shown, the device includes: a vector generation module 31 , a vector initialization module 32 , a vector processing module 33 and an importance determination module 34 .

[0110] Wherein, the vector generation module 31 is used to determine the number of sequence points that the sequence feature string has in the set of fixed-length sequence strings, and generate a set number of dimensions as the position weight vector of the number of sequence points;

[0111] A vector initialization module 32, configured to initialize each of the site weight vectors, and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a device for determining the importance degree of sequence site, equipment and a storage medium. The method comprises the following steps that: determining a sequence site number owned by a sequence feature string in a fixed length sequence string set, and generating a set quantity of dimensions as the site weight vector of the sequence site number; initializing each site weight vector to obtain a set quantity of initial site weight vectors with initial component values; on the basis of a selected optimal solution search algorithm, carrying out iterative processing on each initial site weight vector to obtain a target site weight vector; and correspondingly determining each target component value in the target site weight vector as the importance degree of each sequence site in the sequence feature string. By use of the method, the importance degree of each sequence site in the sequence feature string can be quickly determined, effective prediction information is provided for subsequently predicting the transcription factor binding site of the sequence feature string so as to guarantee the accuracy of the prediction processing of the transcription factor binding site.

Description

technical field [0001] The present invention relates to the technical field of computer equipment, in particular to a method, device, equipment and storage medium for determining the importance of sequence sites. Background technique [0002] Transcription is the first stage of gene expression in organisms. DNA transcription requires the regulation of transcription factors. Among them, transcription must be bound to DNA to regulate the transcription process. The part of DNA that binds to transcription factors is called transcription factor binding Site, generally, a transcription factor binding site is a sequence feature string, which is equivalent to a plurality of sequence sites. [0003] The prediction and determination of whether the sequence feature string in a transcription factor is a transcription factor binding site is helpful for understanding the transcription regulation mechanism and the growth process of cells, and is of great significance for determining drug t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/20G06F19/24
CPCG16B25/00G16B40/00
Inventor 赵苗苗陈世雄林闯李光林
Owner SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI