Data processing method and system for drug research and development

A data processing and data technology, applied in the field of data processing methods and systems for drug research and development, can solve problems such as time-consuming, charge-key level errors, and large scale, so as to improve flexibility, reduce operation and maintenance complexity, and improve fault tolerance The effect of capacity and stability

Pending Publication Date: 2021-01-01
SHENZHEN JINGTAI TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0019] Cleaning the aggregated data generally requires a series of data cleaning methods to obtain information that is ultimately beneficial to drug development, such as molecular deduplication, charge bond level error processing, chiral molecule processing, etc., each update of these processing methods or Any new addition may require recalculation of the data collected and cleaned in the past. The main problem of this part is the large scale and time-consuming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and system for drug research and development
  • Data processing method and system for drug research and development

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Such as figure 1 and figure 2 As shown, the data processing method for drug research and development of an embodiment of the present invention includes:

[0052] Step S101, data integration: build a variety of data integrators, use matching data access methods according to different data, obtain data, serialize the obtained data into strings and push them to the data collection pipeline, and the data collection pipeline will obtain the data in batches , Store in the data warehouse in an asynchronous manner, and calibrate a unique identifier for each stored data record, and the stored data at this time is the original data;

[0053] Step S103, data processing: send the unique identifier of the original data stored in the data warehouse to the data cleaning pipeline through the trigger, and the data cleaning subscriber processes the data in the data cleaning pipeline, processes the data, and passes the unique identifier during the cleaning process Access the content of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data processing method and system for drug research and development, and the method comprises the steps of building a plurality of data integrators, obtaining data in a matched data access mode according to different data, pushing the data to a data collection pipeline in a serialization mode, enabling the data collection pipeline to store the obtained data in a data warehouse in a batch and asynchronous mode, and storing the data in a database; calibrating a unique identifier for each data record; enablinig the trigger to send the unique identifier of the data storedin the data warehouse to the data cleaning pipeline; allowing the data cleaning subscriber to process the data in the data cleaning pipeline, process the data, store the processed data in the data warehouse, add a new identifier, analyze the data in the data warehouse, and store an analysis result in a knowledge base; according to the data processing method and system for drug research and development, data information of different data sources is connected, original data is stored, cleaned and recalculated through batch data processing and persistence technologies, and then a knowledge baseoriented to domain problems is constructed according to needs.

Description

technical field [0001] The invention relates to an auxiliary method for drug research and development, in particular to a data processing method and system for drug research and development. Background technique [0002] In the existing drug R&D process, the collection, arrangement and analysis of drug data are important steps throughout the drug R&D process. Commonly used drug R&D information collection generally includes the following categories of data: [0003] Data based on drug target information: [0004] Including the biological function of the target and the indications related to clinical molecules, the epidemiology of the indications, and the clinical needs to be met, etc. Commonly used public data sources include: Pubmed, Google Scholar, HowNet, etc. [0005] Data based on drug and protein structure information: [0006] Target-related information can be queried through websites such as Uniprot, and protein crystal structure information corresponding to the tar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16C20/70
CPCG16C20/70
Inventor 吴楚楠徐旻张佩宇马健温书豪赖力鹏
Owner SHENZHEN JINGTAI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products