HLA sequencing peak graph identification method

A map recognition and sequencing technology, applied in the field of peak map recognition of HLA next-generation sequencing, can solve problems such as inability to accurately identify sequences, and achieve the effects of long time-consuming, high recognition accuracy, and strong ease of use.

Active Publication Date: 2019-05-14
银丰基因科技有限公司 +1
View PDF13 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of the above-mentioned prior art, in order to solve the problem that the peak map of HLA first-generation sequencing cannot accurately identify its sequence in the traditional method, the present invention provides a method for identifying the peak map of HLA sequencing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • HLA sequencing peak graph identification method
  • HLA sequencing peak graph identification method
  • HLA sequencing peak graph identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0030] Example HLA sequencing peak map recognition method

[0031] The steps are as follows (the flow chart is as follows figure 1 shown):

[0032] (1) Constructing a electropherogram recognition model:

[0033] ①Collect a large amount of artificially identified data and import it into the preprocessing module of the electropherogram identification system, such as image 3 shown;

[0034] ②Preprocess the imported HLA electropherogram data, complete the information extraction of the binary ab1 file, the original sequence comparison, sequence segmentation and dislocation repair;

[0035] The "preprocessing" includes multiple processing of electropherogram data (flow chart as figure 2 Shown): electropherogram reading, sequence comparison, dislocation repair, data arrangement; in the electropherogram reading stage, the data identification and reading of off-machine sequencing files are mainly completed (converting binary electropherogram data files into ordinary text data ,S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an HLA sequencing peak graph identification method, which comprises the following steps of: (1) constructing a peak graph identification model: (1) collecting machine peak graph data under the existing HLA, and (2) pre-processing to complete information extraction, original sequence comparison, sequence segmentation and dislocation repair work of a binary ab1 file; 3, feature extraction; (4) collecting a large amount of artificially recognized data, training the data by using a random forest algorithm, and constructing a peak graph recognition model; (2) carrying out base recognition on the to-be-detected HLA first-generation sequencing original off-machine data by utilizing a peak diagram recognition model; (3) arranging the recognized base sequence, and reassembling the single-chain and double-chain part sequences; and (4) outputting an identification result. According to the recognition method, the peak diagram sequence information can be accurately obtained,the overall accuracy is 99.5% or above, and the working efficiency of HLA data interpretation personnel is greatly improved.

Description

technical field [0001] The invention relates to an HLA sequencing peak pattern identification method, which is applied to the peak pattern identification of HLA first-generation (Sanger) sequencing. Background technique [0002] At present, the development of peak map recognition technology is one of the current research hotspots, and researchers have developed a variety of peak map recognition technologies, for example: Chinese invention patent CN 102676657 B discloses a sequencing image recognition system and method, which is based on Image recognition is a recognition system for judging base types. Chinese invention patent application CN 108351917 A discloses a system and method for identifying variants with high precision, which is a method for matching and typing based on patient sequence reads and reference sequences of known HLA alleles; in addition , uTYPE HLA Sequencing Software adopts the method of setting a threshold on the electropherogram to identify the base s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00
Inventor 丛华剑王连水徐明张倩李庆林张琛齐效乾
Owner 银丰基因科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products