Metagenome gene analysis method, device and equipment and storage medium

A technology of metagenomics and gene analysis, which is applied in the interdisciplinary field of computer science and bioinformatics, can solve problems such as dependence, low accuracy of metagenomic gene analysis, and inability to analyze unknown biological genes, so as to achieve the goal of improving accuracy and precision Effect

Active Publication Date: 2018-03-23
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The object of the present invention is to provide a gene analysis method, device, equipment and storage medium of a metagenomic group, aiming to solve the problem that the gene analysis of the existing metagenomic group needs to rely on the reference genome, and the unknown biological genes in the metagenomic group cannot be analyzed, resulting in The problem of low accuracy of metagenomic gene analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metagenome gene analysis method, device and equipment and storage medium
  • Metagenome gene analysis method, device and equipment and storage medium
  • Metagenome gene analysis method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0025] figure 1 It shows the implementation process of the metagenomic gene analysis method provided by the first embodiment of the present invention. For the convenience of explanation, only the parts related to the embodiment of the present invention are shown, and the details are as follows:

[0026] In step S101, the metagenomic sequencing data input by the user is received, each set of sequencing data is screened and assembled, and assembly data corresponding to each set of sequencing data is generated.

[0027] In the embodiment of the present invention, DNA extraction and sequencing are performed on environmental samples to obtain metagenomic sequencing data, each set of sequencing data contains a plurality of read length data (reads), and the length of the read length data in the sequencing data is relatively short , the read length data in each set of sequencing data needs to be assembled to obtain a longer assembly sequence, and these assembly sequences constitute th...

Embodiment 2

[0037] figure 2 The structure of the metagenomic gene analysis device provided by Embodiment 2 of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, including:

[0038] The sequencing assembly unit 21 is configured to receive the metagenomic sequencing data input by the user, screen and assemble each set of sequencing data, and generate assembly data corresponding to each set of sequencing data.

[0039]In the embodiment of the present invention, each set of sequencing data contains multiple pieces of read length data (reads), and the length of the read length data in the sequencing data is relatively short. It is necessary to assemble the read length data in each set of sequencing data to obtain Longer assembly sequences, these assembly sequences constitute the assembly data corresponding to each set of sequencing data. As an example, the read length data in the sequencing data can...

Embodiment 3

[0056] Figure 4 The structure of the computing device provided by the third embodiment of the present invention is shown, and for the convenience of description, only the parts related to the embodiment of the present invention are shown.

[0057] The computing device 4 of the embodiment of the present invention includes a processor 40 , a memory 41 and a computer program 42 stored in the memory 41 and operable on the processor 40 . When the processor 40 executes the computer program 42, the steps in the above-mentioned method embodiments are realized, for example figure 1 Steps S101 to S104 are shown. Alternatively, when the processor 40 executes the computer program 42, the functions of the units in the above-mentioned device embodiments are realized, for example figure 2 Function of units 21 to 24 shown.

[0058] In the embodiment of the present invention, each set of sequencing data of the metagenomic group is screened, assembled and gene predicted, and the predicted ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is applicable to the technical field of cross-computer science and bioinformatics, and provides a metagenome gene analysis method, device and equipment and a storage medium. The method comprises the steps of screening and assembling each set of received sequencing data, generating assembling data corresponding to the sequencing data, generating predicted genes of each set of sequencing data by performing gene prediction on the assembling data, constructing a gene dictionary according to the predicted genes, clustering the gene dictionary according to the abundance of each gene inthe gene dictionary, classifying and assembling each set of screened sequencing data according to clustered gene clusters, and generating and outputting genes of metagenomes. Accordingly, on the condition that no existing biological gene is adopted as a reference genome, gene analysis of the metagenome is achieved, the limit caused by the reference genome is avoided, and the gene analysis accuracy of the genome is effectively improved.

Description

technical field [0001] The invention belongs to the cross technical field of computer science and bioinformatics, and in particular relates to a metagenomic gene analysis method, device, equipment and storage medium. Background technique [0002] Metagenomics is a method of microbial research that does not require microbial cultivation. It can directly analyze and study microbial samples taken from the environment. The emergence of this technology has made it possible for people to study the non-culturable microorganisms that account for 99% of microorganisms. . Metagenome technology first needs to extract and sequence genes from environmental samples, and then analyze the sequencing results through computer software. Since the genes extracted in the metagenomics are a mixture of multiple genes from multiple species (including unknown species), it is a very complicated task to find out the genome sequence of each species in the metagenomic data. [0003] At present, the an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/18G06F19/24
CPCG16B20/00G16B40/00
Inventor 郭宁魏彦杰张慧玲葛健秋滕彦宁
Owner SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products