Data processing method for chromatin immunoprecipitation high-throughput sequencing

A technology for processing chromatin immunoprecipitation sequencing data, applied in the fields of electrical digital data processing, special data processing applications, and instruments. It addresses the lack of an integrated solution that technical practitioners can refer to, streamlining data quality assessment and improving work efficiency.

Status: Inactive
Publication Date: 2014-06-11
Owner: FENGHE SHANGHAI INFORMATION TECH

AI Technical Summary

Problems solved by technology

However, as to how to integrate these small tools into a ChIP-seq data analysis workflow, there ...



Embodiment Construction

[0022] To make the technical content of the present invention clearer, the following embodiments are described in detail.

[0023] Figure 1 shows a flow chart of the steps of the chromatin immunoprecipitation high-throughput sequencing data processing method of the present invention.

[0024] In one embodiment, the chromatin immunoprecipitation high-throughput sequencing data processing method, as shown in Figure 1, includes the following steps:

[0025] (1) The system obtains the initial chromatin immunoprecipitation high-throughput sequencing sequence file in FASTQ format;

[0026] (2) The system removes low-quality sequence data from the initial sequence file to obtain the screened sequence data;

[0027] (3) The system aligns the screened sequence data to the reference genome and, according to the alignment result, retains the sequence data that has a unique alignment positio...
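
Steps (1) to (3) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the mean Phred threshold of 20, the Phred+33 quality encoding, and the rule that a read counts as uniquely aligned when it has exactly one mapped record in SAM-format aligner output are all assumptions made for illustration.

```python
from collections import Counter
from typing import Iterable, Iterator, List, Set, Tuple

FastqRecord = Tuple[str, str, str]  # (header, sequence, quality string)


def parse_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield (header, sequence, quality) from 4-line FASTQ records."""
    rows = [ln.rstrip("\n") for ln in lines if ln.strip()]
    for i in range(0, len(rows) - 3, 4):
        # rows[i + 2] is the '+' separator line and is skipped
        yield rows[i], rows[i + 1], rows[i + 3]


def mean_phred(qual: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string (Phred+33 assumed)."""
    if not qual:
        return 0.0
    return sum(ord(c) - offset for c in qual) / len(qual)


def filter_low_quality(records: Iterable[FastqRecord],
                       min_mean_q: float = 20.0) -> List[FastqRecord]:
    """Keep reads whose mean quality reaches the assumed threshold."""
    return [r for r in records if mean_phred(r[2]) >= min_mean_q]


def unique_read_names(sam_lines: Iterable[str]) -> Set[str]:
    """Names of reads with exactly one mapped SAM record.

    Assumption: the aligner reports every alignment of a read, so a
    read with a single mapped record has a unique alignment position.
    """
    hits: Counter = Counter()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # SAM header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        flag = int(fields[1])
        if flag & 0x4:
            continue  # read is unmapped
        hits[fields[0]] += 1
    return {name for name, n in hits.items() if n == 1}
```

In practice, `parse_fastq` would be fed an open file handle, and the reads kept by `filter_low_quality` would be written out and passed to an aligner before `unique_read_names` is applied to its output. Real pipelines often use per-base trimming or mapping-quality cutoffs instead; the simple rules here only make the data flow concrete.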


Abstract

The invention relates to a data processing method for chromatin immunoprecipitation high-throughput sequencing and belongs to the technical field of molecular biology. The method comprises the following steps: first removing low-quality sequence data from an initial sequence file; then aligning the filtered sequence data to a reference genome; counting the number and density distribution of signal peaks in different regions according to the classification of the reference genome; determining the neighboring genes of each signal peak for gene ontology functional enrichment analysis; and finally generating a text file of the gene ontology functional enrichment results and a corresponding graphical file. The method provides an efficient, high-throughput data analysis workflow in which each processing step is integrated, helping researchers complete the early-stage sequence quality control and sequence filtering of high-throughput data efficiently. Statistics on the aligned sequences reflect the quality of a chromatin immunoprecipitation high-throughput sequencing experiment as well as the distribution of the sequences along the chromosomes, which greatly improves the efficiency of sequencing work.
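
The peak counting and neighboring-gene steps named in the abstract can be sketched as follows. This is an illustrative example under stated assumptions, not the method claimed in the patent: the tuple layouts, the function names, the use of the peak midpoint, and the choice of the nearest transcription start site (TSS) on the same chromosome as the "neighboring gene" are all hypothetical.

```python
from collections import Counter
from typing import List, Optional, Tuple

Peak = Tuple[str, int, int]         # (chrom, start, end)
Gene = Tuple[str, str, int]         # (name, chrom, tss); assumed layout
Region = Tuple[str, int, int, str]  # (chrom, start, end, class label)


def peak_midpoint(peak: Peak) -> int:
    """Integer midpoint of a peak interval."""
    return (peak[1] + peak[2]) // 2


def nearest_gene(peak: Peak, genes: List[Gene]) -> Optional[str]:
    """Name of the gene whose TSS is closest to the peak midpoint.

    Returns None when no gene is annotated on the peak's chromosome.
    """
    mid = peak_midpoint(peak)
    candidates = [g for g in genes if g[1] == peak[0]]
    if not candidates:
        return None
    return min(candidates, key=lambda g: abs(g[2] - mid))[0]


def count_peaks_by_region(peaks: List[Peak],
                          regions: List[Region]) -> Counter:
    """Count peaks per region class, using half-open [start, end) regions."""
    counts: Counter = Counter()
    for peak in peaks:
        mid = peak_midpoint(peak)
        label = next(
            (r[3] for r in regions
             if r[0] == peak[0] and r[1] <= mid < r[2]),
            "unannotated",  # fallback label, an assumption of this sketch
        )
        counts[label] += 1
    return counts
```

The gene names returned by `nearest_gene` would form the input gene list for the enrichment analysis, and dividing each count from `count_peaks_by_region` by the total length of its region class would give a peak density. The linear scans keep the sketch short; an interval index would be the usual choice for genome-scale data.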

Description

Technical field

[0001] The invention relates to the technical field of molecular biology, in particular to the analysis of chromatin sequencing data, and specifically to a method for processing chromatin immunoprecipitation high-throughput sequencing data.

Background technique

[0002] The emergence of a new generation of high-throughput sequencing technology has greatly expanded the options for using molecular biology methods to study changes in cells. At present, many high-throughput sequencing technologies, such as ChIP-seq, RNA-seq, ChIRP-seq, Hi-C, MeDIP-seq, and DNA-seq, have been applied in molecular biology and basic medical research. Among them, ChIP-seq is a high-throughput data analysis method whose main purpose is to study the interaction between proteins and chromosomal DNA. Its experimental part mainly includes chromatin immunoprecipitation (ChIP) sample preparation and deep sequencing ...


Application Information

IPC(8): G06F19/18
Inventor: 王立山, 曹鑫恺, 臧卫东, 王媛媛
Owner: FENGHE SHANGHAI INFORMATION TECH