Chemical reaction validity predicting method

A technique for chemical reactions and prediction methods, applied at the intersection of computer science and chemical organic synthesis

Pending Publication Date: 2019-07-16
SHANGHAI JIAO TONG UNIV
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The traditional organic synthesis mode is not only a challenge to the cost and physical streng

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chemical reaction validity predicting method
  • Chemical reaction validity predicting method
  • Chemical reaction validity predicting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the above technical solutions are further described below.

[0019] A method for predicting the legitimacy of chemical reactions considers each chemical element as a word and builds an element table containing M words. In order to enable the model to automatically segment elements, manually labeled segmentation data can be used to pre-train the segmentation model to segment elements. The splitting method includes splitting by periodic table elements and splitting by compounds.

[0020] After the elements are segmented, use word embedding (Word Embedding) method to map each element to a vector space of specific dimension N, and at this time obtain an M×N mapping space (Embedding). Assuming that a response contains K elements, the response is expressed as a vector of (K, M, N) dimensions.

[0021] The existing chemical reactions are divided in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a chemical reaction validity predicting method. The method comprises the specific steps of segmentation, specifically, the chemical reaction is subjected to information acquisition, the elements in the chemical equation are segmented through a machine learning module, and independent elements are obtained; vectorization, specifically, the elements after being segmented serve as vocabulary units to built an element table, the elements are mapped into the vector space of specific dimension through a word embedding method till all elements are represented by vectors; result prediction, specifically, compound feature representation and chemical reaction internal feature representation are conducted by the machine learning module, the feature space representing the result is enabled to pass through the fully connected layer to acquire feature representation of an initial compound and a target compound, validity judgment is conducted by calculating the distance between vectors, and the success rate prediction result is acquired and output; and sorting and updating, specifically, the history data of the chemical reaction is divided by the machine learning module into a positive example and a negative example according to success or failure, the system parameters are updated in combination with loss function and penalty term, so that the result tends to be moreand more precise.

Description

technical field [0001] The invention belongs to the intersecting field of computer science and chemical organic synthesis, and relates to a method for feature extraction and analysis of chemical reactions based on machine learning technology, for verifying the legality of organic synthesis, and for predicting the success rate of unknown chemical reactions. Background technique [0002] Machine learning shines in many fields such as biopharmaceuticals and medical diagnosis. It has changed traditional research methods, improved scientific research efficiency, and prompted changes in many industries. By learning the deep information hidden in the data and mining the internal correlations to make predictions and judgments, the machine learning system has extremely effective insights and high efficiency, and can achieve capabilities comparable to or even surpassing humans in vertical fields. [0003] The traditional organic synthesis mode is not only a challenge to the cost and p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16C20/10
CPCG16C20/10
Inventor 张倬胜赵海姜舒李江彤杨旸
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products