Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Sequencing data analyzing and monitoring system based on distributed computing

A technology of distributed computing and sequencing data, applied in computing, hardware monitoring, electrical digital data processing, etc., can solve the problems of task management and difficult archiving of task analysis results

Pending Publication Date: 2022-01-21
江西烈冰生物科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the analysis of biological mass data, there will be a large number of computing and data analysis tasks running on the cluster. At present, a core issue of high-throughput sequencing is data analysis. For example, a data needs to go through dozens of steps of data analysis before it can be obtained. The analysis results, including a series of operations such as data cleaning, data filtering, data comparison, data deduplication, data correlation analysis, database comparison and dimensionality reduction analysis, and some problems caused by this: for example, the sample size is huge , the user requires to get the analysis results within a limited time, and when the sample size increases, hundreds or even thousands of tasks will be performed at the same time, resulting in the difficulty of task management and archiving of task analysis results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sequencing data analyzing and monitoring system based on distributed computing
  • Sequencing data analyzing and monitoring system based on distributed computing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0014] Such as figure 1 A sequencing data analysis and monitoring system based on distributed computing is shown, including a log monitoring module, a system parameter monitoring module, a real-time command monitoring module and a storage space statistics module, the log monitoring module is connected to a database, and the system parameter monitoring module is connected to A CPU module, the real-time command monitoring module is connected to the database and the CPU module, and the storage space statistics module is connected to the database.

[0015] Preferably, the log monitoring module includes a log retrieval module and a log random read module, the log retrieval module can retrieve external task log files in the running state, and the log random read module is connected to the database. When switching to task completion, the log file information of the task is copied to the database, and the log random reading module can read the completed task log information stored in ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a sequencing data analyzing and monitoring system based on distributed computing. The system comprises: a log monitoring module, a system parameter monitoring module, a real-time command monitoring module, and a storage space statistics module, wherein the log monitoring module is connected with a database, the system parameter monitoring module is connected with a CPU module, the real-time command monitoring module is connected with the database and the CPU module, and the storage space statistics module is connected with the database. The system can conveniently monitor all analysis tasks delivered on a cluster, can organize the analysis tasks in an organized form, and on the basis of running tasks delivered on the cluster, can perform statistics on samples, perform statistics on running sample tasks on the cluster, perform statistics on tasks, and perform statistics on delivery task workflows; by means of the technical solution, the CPU, the memory, and the cluster resource use condition of the delivery tasks can be visually obtained in a very simple mode, and analysis logs of system tasks and the space occupied by analysis can be output in real time.

Description

technical field [0001] The invention relates to the field of analysis of biological massive data, in particular to a sequencing data analysis and monitoring system based on distributed computing. Background technique [0002] In the analysis of biological mass data, there will be a large number of computing and data analysis tasks running on the cluster. At present, a core issue of high-throughput sequencing is data analysis. For example, a data needs to go through dozens of steps of data analysis before it can be obtained. The analysis results, including a series of operations such as data cleaning, data filtering, data comparison, data deduplication, data correlation analysis, database comparison and dimensionality reduction analysis, and some problems caused by this: for example, the sample size is huge , the user requires to get the analysis results within a limited time, and when the sample size increases, hundreds or even thousands of tasks will be performed at the sam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30G06F9/48G16B20/30
CPCG06F11/3006G06F9/4843G16B20/30
Inventor 宗杰樊小龙
Owner 江西烈冰生物科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products