Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Data analysis method, system and equipment of virus database and storage medium

A data analysis and database technology, applied in the field of systems, equipment and storage media, and virus database data analysis methods, can solve problems such as low reliability, deviation of results, and inaccurate virus classification, so as to improve accuracy and effectiveness Effect

Pending Publication Date: 2021-10-22
明科生物技术(杭州)有限公司
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, unlike other microorganisms such as bacteria or fungi, viruses have general-purpose marker genes that can be studied as a whole. Therefore, the corresponding viral community classification and analysis cannot be carried out by means of tag sequence amplicon sequencing, and systematic Obtain the compositional diversity of viruses within a sample and the functions they perform
The existing gene macro virus analysis method is based on the analysis method of the macro genome, and some of them are further identified according to the software developed by themselves. The comparison database is also compared based on the NR total database, which results in virus classification. Inaccurate, low reliability, deviation of results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data analysis method, system and equipment of virus database and storage medium
  • Data analysis method, system and equipment of virus database and storage medium
  • Data analysis method, system and equipment of virus database and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] This embodiment provides a data analysis method for a virus database, please refer to figure 1 , including the following steps:

[0056] Step S101, based on the virus data of the sample, compare the virus data with the host genome after quality control, remove host contamination, and obtain high-quality virus data to be analyzed after screening;

[0057] Step S102, assembling a macro virus from the virus data to be analyzed to obtain a virus contig;

[0058] Step S103, evaluating and screening the virus contig, removing false positive viruses, and obtaining the virus contig screening result;

[0059] Step S104, performing virus classification on the virus contig screening results to obtain virus contig groups;

[0060] Step S105, comparing the virus contig group to a known database, and judging whether the virus contig group is in the known database; if so, then end the analysis; if not, proceed to step S106;

[0061] Step S106, if not, write the virus contig group i...

Embodiment 2

[0089] Embodiment 2 discloses a kind of data analysis system of a kind of virus database of embodiment 1, comprises:

[0090] The preprocessing module preprocesses the virus data based on the virus data of the sample, removes the virus data contaminated by the host, and obtains the virus data to be analyzed;

[0091] The assembly module is used to assemble the macro virus from the virus data to be analyzed to obtain the virus contig;

[0092] The screening module is used to evaluate and screen virus contig, remove false positive viruses, and obtain virus contig screening results;

[0093] The classification module is used to classify the virus contig screening results to obtain virus contig groups;

[0094] Contrast module, is used for comparing virus contig group with known virus library, judges whether virus contig group exists in known virus library;

[0095] The update module is set to: if not, write the virus contig group into the known virus database based on the functio...

Embodiment 3

[0097] Embodiment 3 provides a kind of data analysis electronic equipment of the virus database of embodiment 1, comprises: processor, memory, input device and output device; The quantity of processor in the computer equipment can be one or more, in this embodiment A processor is adopted; the processor, memory, input device and output device in the electronic device are connected through a bus or other methods, and the connection through a bus is taken as an example in this embodiment.

[0098] As a computer-readable storage medium, the memory can be used to store software programs, computer-executable programs and modules, such as program instructions corresponding to the data analysis method of the virus database in the embodiment of the present invention. The processor executes various functional applications and data processing of the electronic device by running the software programs, instructions and modules stored in the memory, that is, realizes the data method of the v...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data analysis method, system and equipment of a virus database and a storage medium, and the method comprises the following steps: based on virus data of a sample, comparing a host genome after quality control of the virus data, and removing host pollution to obtain screened to-be-analyzed virus data; assembling a macro virus for the virus data to be analyzed to obtain a virus contig; evaluating and screening the virus contig, removing false positive viruses, and obtaining a virus contig screening result; carrying out virus classification on the virus contig screening result to obtain a virus contig class group; comparing the virus contig class group to a known virus database, and judging whether the virus contig class group is in the known virus database or not; if not, writing the virus contig class group into the known virus database based on the functional classification and the virus abundance of the virus contig class group to update the known virus database. According to the data analysis method, the validity and accuracy of virus data can be improved, and the known virus database can be expanded.

Description

technical field [0001] The invention relates to the technical field of gene detection, in particular to a data analysis method, system, equipment and storage medium of a virus database. Background technique [0002] Macrovirome is a new branch of metagenomics. It takes the genetic material of all viruses in the environment as the research object, and identifies all the virus components in the environment. The research scope is in the human or animal intestines or the ocean, soil, etc. , to tap potential hazards to humans and the environment. [0003] However, unlike other microorganisms such as bacteria or fungi, viruses have general-purpose marker genes that can be studied as a whole. Therefore, the corresponding viral community classification and analysis cannot be carried out by means of tag sequence amplicon sequencing, and systematic The compositional diversity of viruses within a sample and the functions performed are obtained. The existing gene macro virus analysis ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B50/30G16B40/20
CPCG16B50/30G16B40/20
Inventor 刘国琦韩长春陈华
Owner 明科生物技术(杭州)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products