Automatic parallelization knockout strategy sequence repeatability analysis method and system.
An analysis method and sequence repetition technology, applied in the field of gene knockout, can solve the problems of low accuracy, large amount of data, confusion and errors, etc., and achieve the effect of improving analysis and evaluation efficiency and reducing analysis time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0034] Embodiment 1: refer to figure 2 , the first embodiment of the present invention provides an automatic parallelized knockout strategy sequence repeatability analysis method, including: step S1, according to the preset fragment length, divide the knockout strategy data information corresponding to each knockout strategy, and obtain The continuous sub-fragments in the knockout strategy data information; and, using the continuous sub-fragments containing repeating units as repeating sequences;
[0035] It should be noted that in a mouse gene sequence, there may be multiple different knockout strategies, so an in-depth analysis of all possible knockout strategies is required to obtain the optimal knockout strategy. Sequence repeatability analysis for the region corresponding to the knockout strategy of the gene sequence is an essential link to determine whether each knockout strategy is applicable. After the gene is knocked out according to the selected knockout strategy, ...
Embodiment 2
[0045] Embodiment 2: refer to Figure 3-5 , the second embodiment of the present invention provides an automatic parallelized knockout strategy sequence repeatability analysis method, based on the above-mentioned embodiment 1, the step S2, "according to the repeat sequence, determine the sequence repetition corresponding to the knockout strategy degrees" include:
[0046] Step S21, determining the base composition of the repeating unit, the interval type corresponding to the repeating unit, and the number of occurrences of the repeating unit in the knockout strategy data information;
[0047] Among them, the base refers to derivatives of purine and pyrimidine, which are components of nucleic acid, nucleoside, and nucleotide. The major bases of DNA and RNA are slightly different, with an important difference: Thymine is the major pyrimidine base of DNA and is extremely rare in RNA; conversely, uracil is the major pyrimidine base of RNA and is rare in DNA . The base includes ...
Embodiment 3
[0092] Embodiment 3: refer to Figure 6 , the third embodiment of the present invention provides an automatic parallelized knockout strategy sequence repeatability analysis method, based on the above-mentioned embodiment 2, the step S2, "according to the repeat sequence, determine the sequence repetition corresponding to the knockout strategy degree, so as to use the sequence repeatability to carry out sequence repeatability analysis", also include:
[0093] Step S3, using the sequence repetition degree as the score assignment result of the knockout strategy, and obtaining the knockout strategy data information with the assigned score according to the score assignment result, and extracting all the knockout strategy data information less than the preset preferred threshold. The knockout strategy corresponding to the sequence repetition degree is used as a screening set;
[0094] Step S4, taking the knockout strategy with the lowest sequence repeatability in the screening set ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


