Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

97 results about "Molecular descriptor" patented technology

Molecular descriptors play a fundamental role in chemistry, pharmaceutical sciences, environmental protection policy, and health researches, as well as in quality control, being the way molecules, thought of as real bodies, are transformed into numbers, allowing some mathematical treatment of the chemical information contained in the molecule.

Method for predicting n-octyl alcohol air distribution coefficient (KOA) at different temperatures through quantitative structure-activity relationship and solvent model

The invention discloses a theoretical prediction method for organic chemical n-octyl alcohol/ air distribution coefficient (KOA) and belongs to the field of ecological risk assessment testing strategy. The method comprises the following steps of: establishing a quantitative structure-activity relationship (QSAR) based on a molecular Dragon descriptor of a compound and calculating free melting energy based on a thermodynamic principle by adopting an open source solvent model, and transforming to obtain the KOA according to a thermodynamic principle formula of logKOA=-deltaGOA/2.303RT. A general strategy of predicting the KOA of the compound is provided based on the method, namely whether the molecule is in the application range is judged according to the Dragon descriptor, if so, a QSAR model is preferentially adopted (QSAR-T is adopted at different temperatures), otherwise, the compound is predicted by adopting an SM8AD solvent model. The method and strategy are adopted and accorded, the KOA of different compounds at different temperatures can be rapidly and effectively predicted, lots of manpower, material resources and financial resources are saved, and important essential data is provided for large-scale ecological risk assessment and environment supervision of chemicals.
Owner:DALIAN UNIV OF TECH

Method for predicting fish bio-concentration factors of organic chemicals by quantitative structure-activity relationship

ActiveCN103761431ATransparent prediction rulesEasy to understand and analyzeSpecial data processing applicationsDensity functional theoryOrganic compound
The invention discloses a method for predicting fish bio-concentration factors of organic chemicals by the quantitative structure-activity relationship, and belongs to the field of ecological risk assessment and test strategies. According to the method, bio-concentration factor data of 780 types of organic compounds are collected from public databases or published papers; molecular structures of the organic compounds are optimized according to the density functional theory, and 4885 types of molecule descriptors of the organic compounds are preliminarily screened on the basis of the optimized molecular structures to acquire 3480 molecule descriptors; the organic compounds are divided into a training set and a verification set according to a ratio of 4:1, the training set is used for creating a predication model, and the verification set is used for external verification after model creation. The method has the advantages that the model is clear in application field and covers new pollutants, has good imitative effect, robustness and predication capability, and can effectively predict bio-concentration factors of different types of organic compounds; predication results of the method can provide important data support for risk assessment and management of the organic chemicals and are of great significance in ecological risk assessment.
Owner:DALIAN UNIV OF TECH

Method for adopting quantitative structure-activity relationship model to predicting soil or sediment adsorption coefficients of organic compound

The invention discloses a method for adopting a quantitative structure-activity relationship model to predicting soil or sediment adsorption coefficients of an organic compound. On the basis that a molecular structure of the organic compound is known, the soil or sediment adsorption coefficients of the organic compound can be quickly and efficiently predicted only by calculating a molecular descriptor with the molecular structure and applying a built QSAR (quantitative structure-activity relationship) model. The method is simple, quick, low in cost and capable of saving manpower, material resources and financial resources needed for experiment testing. Modeling is performed according to guidelines on building and using the QSAR model of the Organization for Economic Cooperation and Development, and a simple and transparent multiple linear regression analysis method is applied, so that easiness in understanding and applying is realized; the method has clear application domain, good fitting capacity, robustness and predicting capability; by the method, the soil or sediment adsorption coefficients of the organic compound in the application domain can be effectively predicted, and necessary basic data are provided for ecological risk evaluation and management of the compound; the method has important significance.
Owner:DALIAN UNIV OF TECH

Modeling method and device of compound toxicity prediction model and application of compound toxicity prediction model

The invention provides a modeling method of a compound toxicity prediction model. The modeling method at least comprises the following steps of: S101, providing toxicity classification labels of candidate modeling compounds; S102, providing a molecular descriptor of each candidate modeling compound; S103, providing a target protein descriptor of each candidate modeling compound; S104, providing aquantitative high-throughput screening analysis descriptor of each candidate modeling compound, wherein the quantitative high-throughput screening analysis descriptor is a PubChem activity score of aspecified amount of high-throughput screening; and S105, constructing and training a compound toxicity prediction model. According to the method, physicochemical properties, biological activity and target protein action properties of drug candidate compounds can be fully utilized, and a drug toxicity prediction system is constructed by utilizing statistical modeling advantages of a machine learning algorithm based on ensemble learning, so that the model has interpretability and prediction performance, and has the better physicochemical and biological significance and research value.
Owner:上海尔云信息科技有限公司

Medicament module pharmacokinetic property and toxicity predicting method based on capsule network

The invention provides a medicament module pharmacokinetic property and toxicity predicting method based on a capsule network. After a comprehensive module fingerprint and a module descriptor are constructed and early-period preparing operation for establishing model is performed, a low-grade characteristic content of a molecule is extracted from an upper-layer low-grade characterized through convolutional or restricted Boltzmann machine operation; then a capsule network method is used for abstracting the high-grade characteristic of the molecule in a lower-layer high-grade characteristic; a relation between the high-grade characteristic and an active label is fit through a dynamic routing algorithm, thereby predicting the pharmacokinetic property and the toxicity class of an unknown smallmolecule. The method does not require collection of large scale datasets for training, optimization is performed on input through end-to-end and furthermore automatic dimension reduction is realized.A coupling coefficient is updated through iterating a dynamic routing process. The dynamic routing conveys all characteristics of an upper-layer capsule to a random lower-layer capsule, thereby greatly reserving a hierarchical position relation. The method realizes a better predicting effect than that of a traditional machine learning method.
Owner:SICHUAN UNIV

Method for predicting inhibiting concentration of pyridazine HCV NS5B polymerase inhibitor based on particle swarm optimization support vector machine

The invention discloses a method for predicting inhibiting concentration of a pyridazine HCV NS5B polymerase inhibitor based on a particle swarm optimization (PSO) support vector machine (SVM). The method comprising the following steps: establishing and optimizing a sample set, calculating inhibitor molecule descriptor, preprocessing a molecule descriptor data set, rescaling the inhibitor molecule descriptor data set, dividing the data set, optimizing support vector machine parameters, establishing a model and predicting. The method for predicting the inhibiting concentration of the pyridazine HCV NS5B polymerase inhibitor based on the particle swarm optimization support vector machine is an SVM parameter selecting method based on a PSO algorithm, the optimization of the model is achieved by using the high global searching ability of the PSO, a relational model is established through the structure and the inhibiting concentration of the pyridazine HCV NS5B polymerase inhibitor, accurate predicting is conducted on the inhibiting concentration of the pyridazine HCV NS5B polymerase inhibitor, effectiveness of the method is verified, an excellent method for predicting the inhibiting concentration of other inhibitors is provided, and predicting conducted on unknown output can be accurate as far as possible.
Owner:NORTHWEST NORMAL UNIVERSITY

Computer aided design system for predicting energetic molecule based on machine learning performance

The invention discloses a computer aided design system for predicting energetic molecules based on machine learning performance, and belongs to the technical field of computer aided design systems. The system comprises: a molecule rapid generation module, which is used for generating all possible molecular structural formulas according to permutation and combination of a molecular mother ring anda substituent group input by a user, and removing a repeated structure; a molecular data set module used for recording reported energetic molecular performance data, and the energetic molecular performance data being mainly used as a training set of the machine learning model training module; a molecular descriptor generation module used for calculating molecular descriptors of molecules generatedby the molecular rapid generation module or molecules in the molecular data set module or molecules input by a user; a machine learning model training module used for training and storing a model byadopting a machine learning algorithm according to the data of the molecular descriptor set; and a performance prediction module used for reading the model and carrying out performance prediction on the molecules generated by the molecule rapid generation module or the molecules input by the user.
Owner:INST OF CHEM MATERIAL CHINA ACADEMY OF ENG PHYSICS

Novel non-standard-dependence quantitative analysis method based on study on homologous/similar compound structure-mass-spectrum response relationship

InactiveCN103018317AMass Spec Response High and LowQuantitatively accurateMaterial analysis by electric/magnetic meansMethodological researchCompound structure
The invention belongs to the field of analysis, relates to a quantitative analysis method for homologous / similar compounds, and particularly relates to a quantitative compound analysis method for a complicated matrix sample which does not contain a standard substance. The method comprises the following steps of: (1) selecting a series of homologous / similar compounds, and carrying out mass-spectrum quantitative methodological study and textual research on the homologous / similar compounds; (2) carrying out zero-intercept linear fitting according to established standard curves of all the compounds; (3) carrying out structural optimization on the compounds by using molecular simulation software, and calculating related molecular descriptors; (4) carrying out establishment and verification on the relationship between the structures and mass-spectrum responses of the compounds by using the molecular simulation software; (5) carrying out qualitative analysis on the complicated matrix sample by employing related mass spectrometry technologies; (6) calculating the slope coefficients of linear fit standard curves of the verified series compounds according to a structure-mass-spectrum response relationship equation established previously; and (7) obtaining the concentrations of the compounds in the complicated matrix sample by using the fit standard curves of the series compounds. Thus, the non-standard-dependence quantitative analysis is realized.
Owner:CHINA PHARM UNIV

Machine learning estimation method for sensitivity and mechanical properties of energetic substances and relationship between the sensitivity and the mechanical properties of the energetic substances

The invention belongs to the technical field of compound performance evaluation, and discloses a machine learning estimation method for sensitivity and mechanical properties of energetic substances and a relationship between the sensitivity and the mechanical properties of the energetic substances. The estimation method comprises the following steps: constructing a quantitative structure-activityrelationship model of impact sensitivity and bulk modulus of seven nitro energetic substances based on an artificial neural network and a method for determining independent screening and sparse operators by taking a molecular descriptor and molecular structure information calculated by E-Dragon as characteristics; and determining the relationship between the impact sensitivity and the mechanical property of the nitro energetic substance and the quantitative relationship between the impact sensitivity and the mechanical property of the nitro energetic substances respectively with the molecularstructure by utilizing the constructed quantitative structure-activity relationship model of the impact sensitivity and the bulk modulus of the nitro energetic substances. According to the method, seven QSPR models of the impact sensitivity and the bulk modulus of the nitro nitro energetic substances are established on the basis of molecular descriptors calculated by EDragon and several common molecular structure information, so that the process of experimental research on energetic materials is shortened, and design and comprehensive evaluation of novel energetic compounds are facilitated.
Owner:SICHUAN UNIV

Method for selecting an optimally diverse library of small molecules based on validated molecular structural descriptors

The use for biological screening purposes of a subset (library) of a large combinatorially accessible chemical universe increases the efficiency of the screening process only if the subset contains members representative of the total diversity of the universe. In order to insure inclusion in the subset of molecules representing the total diversity of the universe under consideration, valid molecular descriptors which quantitatively reflect the diversity of the molecules in the universe are required. A unique validation method is used to examine both a new three dimensional steric metric and some prior art metrics. With this method, the relative usefulness / validity of individual metrics can be ascertained from their application to randomly selected literature data sets. By the appropriate application of validated metrics, the method of this invention selects a subset of a combinatorial accessible chemical universe such that the molecules of the subset are representative of all the diversity present in the universe and yet do not contain multiple members which represent the same diversity (oversample). The use of the neighborhood definition of a validated metric may also be used to combine (without oversampling the same diversity) any number of combinatorial screening libraries.
Owner:CERTARA

Drug oral availability and toxicity prediction method based on graph convolutional neural network

The invention discloses a graph convolutional neural network-based drug oral availability and toxicity prediction method. The method comprises the steps of S1, preparing an initial training set; s2, establishing a graph model of drugs, and obtaining a training set; s3, training a graph convolutional neural network and a full-connection neural network by using the training set, and fitting a molecular descriptor of the drug and a mapping relationship between a graph model and oral availability and toxicity of the drug; s4, performing numerical modification on each molecular descriptor feature in the training data, predicting the modified training data by using a neural network, and determining a corresponding predicted value error; s5, sorting all the molecular descriptor features of the medicine, calibrating the molecular descriptor features located in the preorder, deleting the molecular descriptor features of the medicine which are not calibrated, and updating the training data; and S6, retraining the graph convolutional neural network and the full-connection neural network constructed in the step S3. According to the method, the drug oral availability and toxicity prediction model with high prediction precision can be obtained.
Owner:NAT INNOVATION INST OF DEFENSE TECH PLA ACAD OF MILITARY SCI

Quantitative structure activity relationship model for predicting water-phase reaction rate constant of organic matter and sulfuric acid free radical in water phase

The invention discloses a method for predicting the water-phase reaction rate constant of organic matter and a sulfuric acid free radical in a water phase through quantitative structure activity relationship. Based on a compound structure, a molecular descriptor with a structural feature is calculated, and by the adoption of a multiple linear regression method, a QSAR model of an organic compound kso4- is constructed. Establishment of the model strictly confirms to the constructing and using guideline of the QSAR model from organization for economic cooperation and development, the constructed model has a definite application field, contains 197 kinds of organic compounds of different structures, and contains compounds containing carbon-carbon double bonds, carbon-carbon triple bonds, a hydroxyl group, phenolic hydroxyl, a carbonyl group, an aldehyde group, a carboxy group, an ester group, an amide group, nitro, an amino group, a cyano-group, an ether bond, a disulfide bond, fluorine, chlorine, bromine, iodine, arsenium and other radical groups. The mode has good fitting capacity, robustness and predicting capacity, can rapidly and accurately predict the kso4- value of the organic compound, and reference is provided for application of an advanced oxidation process based on a sulfuric acid free radical.
Owner:DALIAN UNIV OF TECH

Method for predicting toxicity of chemicals taking zebra fish embryos as receptors by establishing QSAR model

The invention discloses a method for predicting toxicity of chemicals taking zebra fish embryos as receptors by establishing a QSAR model. On the basis of a known compound molecular structure, the half lethal concentration of the compound taking zebra fish embryos as receptors can be quickly and efficiently predicted only by calculating molecular descriptors with structural characteristics and applying the constructed QSAR model, and the method is simple, quick, low in cost and capable of saving manpower, material resources and financial resources required by experimental testing. According tothe method, modeling is carried out according to the construction of the QSAR model and the use guide rule of the economic cooperation and development organization, and a simple and transparent multiple linear regression analysis method is applied, so the method is easy to understand and apply; the method has a clear application domain and good fitting ability, robustness and prediction ability,can effectively predict the median lethal concentration of the compound in the application domain by taking zebra fish embryos as receptors, provides necessary basic data for ecological risk evaluation and management of the compound, and has important significance.
Owner:DALIAN UNIV OF TECH

Computer-implemented method of merging libraries of molecules using validated molecular structural descriptors and neighborhood distances to maximize diversity and minimize redundancy

The use for biological screening purposes of a subset (library) of a large combinatorially accessible chemical universe increases the efficiency of the screening process only if the subset contains members representative of the total diversity of the universe. In order to insure inclusion in the subset of molecules representing the total diversity of the universe under consideration, valid molecular descriptors which quantitatively reflect the diversity of the molecules in the universe are required. A unique validation method is used to examine both a new three dimensional steric metric and some prior art metrics. With this method, the relative usefulness/validity of individual metrics can be ascertained from their application to randomly selected literature data sets. By the appropriate application of validated metrics, the method of this invention selects a subset of a combinatorial accessible chemical universe such that the molecules of the subset are representative of all the diversity present in the universe and yet do not contain multiple members which represent the same diversity (oversample). The use of the neighborhood definition of a validated metric may also be used to combine (without oversampling the same diversity) any number of combinatorial screening libraries.
Owner:CERTARA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products