Method for correcting and standardizing TCR beta high-throughput sequencing data based on template sequence and reference cell

A template sequence, sequencing data technology, applied in sequence analysis, character and pattern recognition, instrumentation, etc., can solve the problem that the sequencing data is not well resolved, and the estimation of the diversity of the T cell pool is affected.

Pending Publication Date: 2022-06-07
ARMY MEDICAL UNIV
View PDF1 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, during the deep sequencing of the T cell receptor repertoire, sequencing errors seriously affect the estimation of the diversity of the T cell repertoire
In addition, in the process of building a library, multiple PCR amplifications are required for the CDR3 sequences in the samples. The interference between multiple primers and the difference in amplification efficiency will cause amplification bias.
Therefore, it is necessary to establish an effective method to correct amplification bias, PCR and sequencing errors, so as to promote the research of T cell receptor library.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for correcting and standardizing TCR beta high-throughput sequencing data based on template sequence and reference cell
  • Method for correcting and standardizing TCR beta high-throughput sequencing data based on template sequence and reference cell
  • Method for correcting and standardizing TCR beta high-throughput sequencing data based on template sequence and reference cell

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] Construction and sequencing of TCRβ library of mouse spleen CD3+ T cells.

[0053] 1. RNA extraction

[0054] Sort 1000,000 mouse spleen CD3+ T cells, add 800ul Trizol (TRIzol Reagent, Invitrogen, 15596018), mix by pipetting, let stand for 5 min at room temperature, add 200ul Trizol solution in which 200 2B4 hybridoma cells are dissolved; add 200 ul chloroform, invert and mix for 30 s, place at room temperature for 3 min; centrifuge at 12,000 g for 15 min at 4°C; transfer the upper aqueous phase to another EP tube; add isopropanol at 0.5 ml isopropanol / ml Trizol, invert and mix, and place at room temperature for 10 min ; 4°C, centrifuge at 12000g for 10min; add 75% ethanol according to 1ml 75% ethanol / ml Trizol, shake gently to suspend the precipitate; 4°C, centrifuge at 8000g for 5min, suck off the supernatant; air dry at room temperature for 5-10min; Dissolve in RNase-free water to obtain mouse total RNA, and conduct concentration determination and quality control. ...

Embodiment 2

[0090] Quantitative analysis of TCRβ in mouse spleen CD3+ T cells.

[0091] The number of TCRs measured by evaluating the external reference cells was used as a reference for the number of T cells in the sample; sequencing errors were corrected by the template sequence. Using the assumption that "high frequency sequences are more likely to be the original correct sequences", the stepwise extraction clustering method was used to correct sequencing errors. Use the molecular barcode in the template sequence to separate the template sequence from the sequencing sample, count the number of sequencing reads of different V template sequences, investigate the law of the amplification bias after mixing the sample, and explore the method of correcting the amplification bias to correct Sequencing errors caused by base mutation bias.

[0092] This example is based on mouse spleen CD3+ T cell sequencing data (see Example 1) to study the TCRβ repertoire characteristics of mouse spleen. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of biology, and particularly discloses a TCR beta high-throughput sequencing data correction and standardization method based on a template sequence and a reference cell, which comprises the following steps: doping a fixed number of external reference cells and a fixed number of synthetic templates into a sample to construct a TCR beta high-throughput sequencing library, a high-throughput sequencing platform is used for sequencing; analyzing an amplification bias rule by using the added template sequence; a sequencing error caused by base mutation bias is corrected; standardizing the sample sequencing data by using the external reference cells; the T cell receptors in the sample are accurately quantified. According to the method, TCR beta high-throughput sequencing data can be corrected and standardized through amplification bias correction, sequencing error correction and sample standardization in a high-throughput sequencing mode by adopting a standardized process, and finally, accurate and real TCR beta library distribution is obtained.

Description

technical field [0001] The invention relates to the field of biotechnology, in particular to a method for correcting and normalizing TCRβ high-throughput sequencing data based on a template sequence and a reference cell. Background technique [0002] T cell receptors (TCRs) are specific receptors on the surface of T cells, responsible for recognizing antigens presented by the major histocompatibility complex (MHC) and mediating immune responses. Understanding the diversity of the T cell receptor repertoire will help us understand the immune status of the body, and then clarify the internal motivation of the occurrence and development of immune diseases, and provide help for the development of related vaccines and the treatment of diseases. Complementarities determining region 3 (CDR3) on the T cell receptor β subunit is a very important region on the TCR receptor. This region has the strongest binding ability to antigenic peptides and is also the region with the highest dive...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B30/00G16B40/00G16B40/30G06K9/62
CPCG16B30/00G16B40/00G16B40/30G06F18/23213
Inventor 万瑛于海礼倪青山邹丽云韩清娟陈钢黄毅曾聪
Owner ARMY MEDICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products