Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

128 results about "Reference genome sequence" patented technology

A reference genome is the initial sequence to which all subsequent sequences are ultimately compared, and therefore must be as complete as possible given fiscal and technical constraints. Reference genome sequences result from the de novo sequencing and assembly of a haploid complement of an organism’s genome.

Method for detecting mononucleotide polymorphism

The invention is applicable to the field of biological engineering and provides a method for detecting mononucleotide polymorphism. The method comprises the following steps: sequencing fragments obtained by high throughput sequencing technology are compared on a referenced genome sequence; the likelihood ratio of various genotypes of the corresponding sites on the genome to be tested is obtained according to the sequencing mass fraction of each basic group in the genome to be tested and obtained by sequencing; the posterior probability of each genotype of each site on the referenced genome is calculated according to the likelihood ratio and the prior probability preset for each genotype, and the genotype which has the highest posterior probability is determined as the most likely right genotype of the corresponding sites on the genome to be tested to obtain the consistent sequence of the genome to be tested; and the sites of the genome to be tested, which are inconsistent with the sequence of the referenced genome in the consistent sequence are detected to obtain the polymorphism sites of the genome to be tested. The embodiment of the invention can achieve a more accurate test result as the influence of the prior probability on mononucleotide polymorphism test result is considered in advance.
Owner:WUHAN BGI CLINICAL LAB CO LTD

Detection method of chromosome copy number variation

InactiveCN105349678AOvercoming the pitfalls of noninvasive testingThe result is accurateMicrobiological testing/measurementReference genome sequenceAbsolute volume
The invention provides a detection method of chromosome copy number variation. The detection method comprises the following steps: preparing cell-free DNA from body fluid of a parent body with an appendage and taking the cell-free DNA as a sample to be detected; preparing cell-free DNA from body fluid of a normal parent body without appendages and taking the cell-free DNA as a check sample; comparing to a reference genome sequence after all sequencing is finished; counting matching number and calculating z value; determining that cnv originates from the appendage if the length of the sample to be detected is smaller than the absolute value of the z value of a DNA sequence of N bp and greater than the absolute value of the z value of all DNA sequences of the sample to be detected in the same window; and determining that the cnv originates from the parent body if the length of the sample to be detected is not smaller than the absolute value of the z value of the DNA sequence of N bp and not greater than the absolute value of the z value of all the DNA sequences of the sample to be detected in the same window. By the detection method, whether the cnv originates from the parent body or the appendage can be judged effectively, accuracy of the result of existing non-invasive detection can be the same with that of the result of the traditional intrusion detection, loss caused by false positive is avoided effectively, and meanwhile, the detection method is easy to operate and has application value.
Owner:SHANGHAI MAJORBIO BIO PHARM TECH

Method for detecting variable spliceosome in third generation full-length transcriptome

ActiveCN105389481AEfficient access to shear structuresPerfect commentSequence analysisSpecial data processing applicationsReference genome sequenceGene model
The invention discloses a method for detecting a variable spliceosome in a third generation full-length transcriptome. The method comprises the following steps: merging original annular test sequences with joints removed to form a monomolecular transcript sequence, and screening a third generation full-length transcript sequence; comparing the third generation full-length transcript sequence with a reference genome sequence, and screening a third generation full-length transcript sequence having coverage and similarity with the reference genome sequence larger than preset thresholds; carrying out splicing false positive filtration and DNA contamination filtration on the screened third generation full-length transcript sequence; and carrying out gene annotation and variable spliceosome annotation on the filtered third generation full-length transcript sequence. An overlong read length of a third generation sequencing technology mentioned in the method disclosed by the invention is large enough to cover most RNA, the third generation full-length transcript sequence can be obtained by SMRT sequencing transcriptomes without being assembled, and a splicing structure of a gene can be effectively obtained by third generation transcriptome sequencing, and more perfect gene model annotation can be constructed.
Owner:嘉兴菲沙基因信息有限公司

Processing method and processing device of high-throughput sequencing data, storage medium and processor

The invention provides a processing method and a processing device of high-throughput sequencing data, a storage medium and a processor. The processing method includes: acquiring secondary sequencingsequences, wherein the secondary sequencing sequences are sequencing sequences which are in the high-throughput sequencing data, can be recognized by amplification primers of a target fragment and areafter removing corresponding amplification primers; carrying out alignment on the secondary sequencing sequences and a reference genome sequence to obtain a primary variation result; and utilizing mutation data in known mutation data to correct the primary variation result to obtain a processing result. False positive processing results caused by primer mutation existing in amplification productoverlapping regions are reduced through partially removing the primers in all the sequences from the original data, which are obtained by high-throughput sequencing, according to known primer information. Certain erroneously-amplified sequences in the high-throughput sequencing data can also be removed, accuracy of subsequent analysis is improved, and decreasing of an overall data amount and improving of analysis efficiency are also facilitated.
Owner:BEIJING ACCB BIOTECH

Wheat BSR-Seq gene locating method

The invention discloses a wheat BSR-Seq gene locating method. The wheat BSR-Seq gene locating method comprises the following steps of: constructing and sequencing mixing pools, performing quality variation mining, screening a transcript tightly linked to a target gene, developing and locating a molecular marker, etc. The next-generation transcriptome sequencing technology (transcriptome sequencing, RNA-Seq) and a mixing pool technology (Bulked Segregant Analysis BSA) are combined; a wheat sequencing draft sequence is used as a reference sequence at first; then, a lot of high-quality SNP heritable variations on the transcript is mined at high throughput by adopting the next-generation sequencing technology; the allele frequency is precisely calculated in combination with the mixing pool technology, so that the transcript tightly linked to the target gene possibly can be rapidly screened out; false positive is precisely checked and controlled through Fish; the method is independent of a reference genomic sequence, low in cost, rapid and high in precision; the wheat gene locating efficiency and precision are increased; the wheat polymorphic molecular marker developing cost is reduced; therefore, the wheat gene fine locating working time is reduced to several months from several years; the locating precision is reduced to a fraction or 0 cM from several cM; and the fine locating cost is reduced to several thousands from several ten thousands.
Owner:北京麦美瑞生物科技有限公司

Polymorphic gene typing and somatic change detection using sequencing data

A system and method for determining the exact pair of alleles corresponding to polymorphic genes from sequencing data and for using the polymorphic gene information in formulating an immunogenic composition. Reads from a sequencing data set mapping to the target polymorphic genes in a canonical reference genome sequence, and reads mapping within a defined threshold of the target gene sequence locations are extracted from the sequencing data set. Additionally, all reads from the set data set are matched against a probe reference set, and those reads that match with a high degree of similarity are extracted. Either one, or a union of both these sets of extracted reads are included in a final extracted set for further analysis. Ethnicity of the individual may be inferred based on the available sequencing data which may then serve as a basis for assigning prior probabilities to the allele variants. The extracted reads are aligned to a gene reference set of all known allele variants. The allele variant that maximizes a first posterior probability or posterior probability derived score is selected as the first allele variant. A second posterior probability or posterior probability derived score is calculated for reads that map to one or more other allele variants and the first allele variant using a weighting factor. The allele that maximizes the second posterior probability or posterior probability score is selected as the second allele variant.
A system and method for identifying somatic changes in polymorphic loci using WES data. The exact pair of alleles corresponding to the polymorphic gene are determined as described using a normal or germline sample from an individual. A tumor or otherwise diseased sample is also retrieved from the individual and the corresponding WES data is generated. Reads corresponding to the polymorphic gene are extracted as described in the paragraph above. These reads are then aligned to the inferred pair of allele sequences. The alignment of the germline or normal reads to the inferred pair of alleles, along with the alignment of the tumor or diseased reads to the inferred pair of alleles are simultaneously used as inputs to somatic change detection algorithms to identify somatic changes with greater precision and sensitivity.
Owner:DANA FARBER CANCER INST INC +2

Kit and probe for measuring instability of genome and application of kit and probe

The invention provides a kit for measuring genome instability and application thereof, and a probe set contained in the kit is determined by the following steps: dividing a reference genome sequence into a plurality of first-level regions, wherein the first-level regions contain at least one known SNP site; performing SNP filtering on each of the plurality of primary regions; selecting a notch region; dividing the gap region into at least one secondary region based on an expected interval; for each of the at least one secondary region, respectively carrying out secondary high-frequency SNP search: carrying out extension treatment from the central point of the secondary region to two sides for at least one time, and after extending for a preset length for at least one time, searching for the SNP with the highest frequency in the obtained region, and determining a third-level region on the basis of taking the summary of the first-level high-frequency SNP and the second-level high-frequency SNP as the starting point SNP; and based on the third-level region, constructing a probe for specifically identifying the third-level region. By utilizing the method, the probe combination which can be effectively used for analyzing the instability of the genome can be effectively obtained.
Owner:BGI GENOMICS CO LTD +2
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products