Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Biological sequence processing and model training method

A model training and biological sequence technology, applied in the field of biological sequence processing and model training, can solve the problem of large time cost of identifying and classifying genes, and achieve the effect of increasing scale and improving performance.

Pending Publication Date: 2022-08-09
ZHEJIANG UNIV CITY COLLEGE
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because the genes to be classified and identified need to be carefully compared with certain genes in the database, the BLAST method takes a relatively long time to identify classified genes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Biological sequence processing and model training method
  • Biological sequence processing and model training method
  • Biological sequence processing and model training method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make those skilled in the art better understand the technical solutions of the present invention, the preferred embodiments of the present invention will be described below with reference to specific examples, but it should be understood that the accompanying drawings are only used for exemplary descriptions, and should not be construed as comprehension of the present invention. Limitation; in order to better illustrate this embodiment, some parts of the drawings will be omitted, enlarged or reduced, which do not represent the size of the actual product; for those skilled in the art, some well-known structures and their descriptions in the drawings may be The omission is understandable. The positional relationships described in the drawings are only for exemplary illustration, and should not be construed as limiting the present invention.

[0039]The present invention will be further described below in conjunction with the accompanying drawings and embodimen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a biological sequence processing and model training method. The method comprises the following steps: S1, acquiring data of a biological gene sequence and integrating the data; s2, preprocessing the data, traversing the read biological gene sequences, and filtering out biological gene sequences meeting requirements; s3, constructing a data set required by the training model, and finely adjusting the data set according to the number of each category of data in the data set to ensure that the scales of the various categories of data in the data set are approximately equal; s4, carrying out quantity balance and gene data length balance processing on the data of the data set to obtain a training set; and S5, training a model with a reverse complementary network by using the training set. The method provided by the invention can save time on the basis that the accuracy of the method is similar to that of a traditional gene classification and recognition method, and can correctly predict part of genes which cannot be correctly classified by a traditional biological method.

Description

technical field [0001] The invention relates to the technical field of computer processing of biological gene sequences, in particular to a biological sequence processing and model training method. Background technique [0002] The pneumonia epidemic caused by the new coronavirus has been threatening the health and safety of human beings for a long time. In fact, the new coronavirus is only a very common type of viruses that have appeared in human history. The viruses that are still raging around the world include Influenza virus, HIV virus, liver disease virus, etc. Viruses have always existed in the world, and with the development of mankind, it has also been constantly evolving and updating, and it was not until the late 19th century that people were first recognized as this tiny pathogen. These invisible viruses are affecting people's health all the time. [0003] For a newly discovered virus, trying to figure out its source requires a lot of resources to use tradition...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G16B30/00
CPCG16B30/00G06F18/241G06F18/214
Inventor 明朝燕陈湘竣
Owner ZHEJIANG UNIV CITY COLLEGE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products