Large-batch unicellular ATAC-seq data quality controlling and analyzing method

A quality control method and data quality technology, applied in the field of biology, can solve problems such as the development of a quality control analysis process for single-cell epigenome sequencing data

Inactive Publication Date: 2017-11-21
浙江青元生物科技有限公司
View PDF1 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Currently no dedicated quality control and analysis pipelines have been developed for single-cell epigenome sequencing data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Large-batch unicellular ATAC-seq data quality controlling and analyzing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to understand the technical content of the present invention more clearly, the following examples are given in detail.

[0025] Hereinafter, the present invention will be more fully described and illustrated by using exemplary embodiments of the present invention with reference to the accompanying drawings, but it does not mean that the present invention is limited thereto.

[0026] Such as figure 1 Shown is a flow chart of the steps of the scATAC-seq data quality control and data analysis process control method of the present invention.

[0027] In one embodiment, the quality control and analysis method uses the scATAC-seq data set, the data comes from the NCBI GEO database (GSE65360), and the data comes from a total of 288 scATAC-seq data sets of three cell types combined into one set scATAC-seq data. Such as figure 1 shown, including the following steps:

[0028] The first step is to process the data file: the FASTQ format of the original sequencing fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a large-batch unicellular ATAC-seq data quality controlling and analyzing method. The unicellular ATAC-seq data quality controlling and analyzing method is characterized by comprising the first step of regarding a FASTQ format of an original sequencing file or a SAM/BAM format which is already compared as an input file; the second step of conducting quality control over a sequencing section level and a multicellular level; the third step of conducting quality control over a single cell layer; the fourth step of detecting cell clusters and cell specific peaks; the fifth step of providing a report file about quality control for a user. According to the large-batch unicellular ATAC-seq data quality controlling and analyzing method, scATAC-seq data research serves as a starting point, a systematic and comprehensive quality control and analysis procedure is constructed, specificity quality control and analysis of unicellular data of different types and visualization of unsupervised cell clusters can be systematically generated, and the true cognition of unicellular epigenetic groups is reinforced. The large-batch unicellular ATAC-seq data quality controlling and analyzing method can be applicable to unicellular ATAC-seq data of different types.

Description

technical field [0001] The invention belongs to the technical field of biology, in particular to the technical field of biological information analysis based on large batches of single-cell ATAC-seq data, and explores and develops the quality control and data analysis process of single-cell ATAC-seq data. Background technique [0002] In recent years, researchers have developed single-cell-based sequencing technologies to study epigenetic phenomena. The emergence of single-cell transcriptome sequencing technology has largely solved the problems that existed in the previous second-generation sequencing. Compared with the vigorous development and application of scRNA-seq technology, the single-cell genome and epigenome sequencing technology Progress has been relatively slow. An important reason for this phenomenon is that most genome and epigenome sequencing technologies require DNA pretreatment. For example, the amplification of DNA fragments; the conversion of bisulfite ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/16G06F19/24G06F19/28
CPCG16B15/00G16B40/00G16B50/00
Inventor 张勇张超施威扬王璐莹
Owner 浙江青元生物科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products