Compound image molecular structural formula extraction method based on adversarial learning
A technology of molecular structure and extraction method, applied in the direction of neural learning method, neural architecture, character and pattern recognition, etc., can solve the problems of low recognition rate and accuracy, high adaptability and generalization ability, low resolution, etc. , to achieve the effect of improving recognition rate, high adaptive and generalization ability, improving accuracy and robustness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0030] Aiming at extracting molecular structural formulas of compound images from existing journal databases, this embodiment provides a method for extracting molecular structural formulas of compound images based on adversarial learning.
[0031] combine figure 1 , a method for extracting molecular structural formulas from compound images based on adversarial learning, comprising the following steps:
[0032] S1. Build a data set;
[0033] S101, using the molecular formula SMILES codes of 300,000 compounds in the compound image generation tool RDkit database as the input SMILES code database;
[0034] S102, use RDkit to generate 2D compound structure images for all SMILES codes in the database, and perform preprocessing;
[0035] S103, correspond one-to-one with 300,000 SMILES codes and compound images, and form data pair as a data set.
[0036] Further, all molecular structure images of compounds need to be preprocessed, specifically including: grayscale processing, norm...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


