A Molecular Graph Representation Learning Method Based on Contrastive Learning

A graph representation and molecular technology, applied in the field of graph representation learning, can solve problems such as molecular attribute prediction, and achieve the effect of enriching molecular structure information

Active Publication Date: 2022-03-15
ZHEJIANG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The invention provides a molecular graph representation learning method based on comparative learning, which can obtai...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Molecular Graph Representation Learning Method Based on Contrastive Learning
  • A Molecular Graph Representation Learning Method Based on Contrastive Learning
  • A Molecular Graph Representation Learning Method Based on Contrastive Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0046] The molecular graph representation learning method based on comparative learning provided by the present invention can be used in application scenarios such as chemical molecular attribute prediction, virtual screening, etc., using the similarity of molecular fingerprints as a basis to select positive and negative samples, and comparing them with molecular data in the feature space, And directly encode the knowledge of functional groups in the chemical domain into the representation of molecules to obtain a discriminative molecular graph representation with chemical domain knowledge. The invention solves the problem of insufficient labeling data existing in supervised learning, and fully u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a molecular graph representation learning method based on comparative learning, which includes: obtaining the molecular fingerprint representation of each molecule, and calculating the similarity between every two molecular fingerprints; Each atom matches the corresponding functional group; the molecular graph is modeled with the heterogeneous graph; the representation of each atom in the molecule and the representation of its functional group are encoded using the RGCN in the structure-aware molecular encoder, and the molecule is mapped to the feature through the aggregation function According to the fingerprint similarity between molecules, select positive and negative samples, and carry out comparative learning in the feature space; use the method of comparative learning to train on large sample molecular data sets, and obtain Structure-aware molecular encoders for downstream molecular property prediction tasks. The invention helps to capture richer molecular structure information and solve the problem of molecular attribute prediction.

Description

technical field [0001] The invention belongs to the field of graph representation learning, in particular to a molecular graph representation learning method based on contrastive learning. Background technique [0002] Over the past few years, Graph Representation Learning has become a popular research area for analyzing graph-structured data. Graph representation learning aims to learn an encoding function that takes full advantage of graph data to transform graph data with complex structures into dense representations in low-dimensional spaces that preserve diverse graph properties and structural features. [0003] The traditional unsupervised graph representation learning method uses the random walk method to convert the graph into a node sequence, and models the co-occurrence relationship between the central node and the neighbor nodes. However, this type of learning framework has two obvious disadvantages: one is the lack of parameter sharing between encoders, which wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16C20/70G16C20/20G16C20/80G06N3/04G06N3/08G06N20/00
CPCG16C20/70G06N3/042G06N3/0464G06N3/0895G06N3/0985G06N3/084G06N3/08
Inventor 陈华钧杨海宏方尹庄祥
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products