Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

67 results about "Protein Databases" patented technology

Protein structure database. In biology, a protein structure database is a database that is modeled around the various experimentally determined protein structures. The aim of most protein structure databases is to organize and annotate the protein structures, providing the biological community access to the experimental data in a useful way.

Protein secondary mass spectrometric identification method based on probability statistic model

InactiveCN102495127AMore identificationThe result of the identification method is excellentMaterial analysis by electric/magnetic meansProtein DatabasesMass number
The invention discloses a protein secondary mass spectrometric identification method based on a probability statistic model. The method comprises the following steps of: firstly, virtualizing an enzymolysis protein database array, and establishing a peptide section database and a peptide section database index for peptide sections processed by the enzymolysis according to the mass number of the peptide sections; secondly, finding out standby peptide sections meeting the requirements from the peptide section database according to a nuclear-cytoplasmic ratio of parent ions in an experiment map to be analyzed, and generating a theoretical map meeting the requirements by all the standby peptide sections; thirdly, removing isotopes and noises from the experiment map to be analyzed; matching the processed experiment map to be analyzed and the theoretical map of each standby peptide section and grading, and selecting the standby peptide section with the highest score as an identification result of the experiment map; and finally, carrying out whole false positive control according to all the experiment map identification results. According to the invention, the quantity of effective massspectrums and the quantity of the protein peptide sections are higher than those of an existing algorithm; and the method has the advantages of capability of dynamically selecting peaks and fast operation speed.
Owner:JINAN UNIVERSITY

Protein second-level mass spectrum identification method based on peak intensity recognition capability

The invention discloses a protein second-level mass spectrum identification method based on peak intensity recognition capability. The method comprises the following steps: firstly, virtualizing enzymatically hydrolyzed protein database sequence, establishing a peptide fragment database and a peptide fragment database index for peptide fragments subjected to enzymatic hydrolysis according to the mass number of the peptide fragments; then, finding out candidate peptide fragments conforming to the requirement from the established peptide fragment database according to the mass number of parent ions without charges in a to-be-analyzed experiment spectrum; then removing an isotopic peak and selecting an effective peak from the to-be-analyzed experiment spectrum so as to generate a theory spectrum of the candidate peptide fragments conforming to the requirement, counting peak intensity information of different ions, calculating the peak intensity recognition capability of different types of ions at different intervals, marking each candidate peptide fragment based on the peak intensity recognition capability, and selecting the peptide fragment with the highest mark as the authentication result of the experiment spectrum; and finally, performing quality control on the authentication result. The number of valid mass spectra and the number of valid protein peptide fragments, which are authenticated by the method, are both higher than those obtained by an existing algorithm; peaks can be selected dynamically; the running speed is high.
Owner:广州辉骏生物科技股份有限公司

Index acceleration method and corresponding system in scale protein identification

The invention provides an index acceleration method in scale protein identification, which comprises the following steps of: setting quality intervals for peptide sequences; setting the size of counting windows, and setting the number of the counting windows and the range of each counting window by combining the quality intervals; performing simulated enzyme digestion on protein database, and calculating the quantity of the peptide sequences in each counting window according to the quality of the peptide sequences obtained through the simulated enzyme digestion; obtaining the quantity of the peptide sequences which can be processed once in the memory of a computer according to the capacity of the memory of the computer, and obtaining a quality range section of the peptide sequences which can be processed once in the memory of the computer by combining the quantity of the peptide sequences in each counting window; performing the simulated enzyme digestion on the protein database, saving the obtained peptide sequences in one quality range section in the memory of the computer, and finishing the operations of sequencing, redundancy removal, and dictionary and inverted list establishment on the saved peptide sequences in the memory of the computer; and establishing a dictionary and an inverted list for each quality range section.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Antibacterial peptide prediction method and device based on protein pre-training representation learning

The invention discloses an antibacterial peptide prediction method and device based on protein pre-training representation learning; the method comprises the following steps: S1, employing a pre-training strategy to carry out the word segmentation and covering of a label-free protein sequence from a protein database, and obtaining a pre-training representation learning model, carrying out pre-training of two tasks of covering a language model and sentence continuity prediction, capturing expressions of a word level and a sentence level, and helping the model to learn general structural features of a protein sequence; S2, for the antibacterial peptide pre-recognition and prediction task, changing an output layer of a pre-training model, and performing fine adjustment on the model by using an antibacterial peptide data set with a label to generate an antibacterial peptide prediction model; and S3, according to the antibacterial peptide pre-identification and prediction task, adopting an antibacterial peptide prediction model for identification, and outputting a prediction result. Pre-training is applied to the field of antibacterial peptide recognition and prediction, and an efficient antibacterial peptide prediction model is established based on a known antibacterial peptide sequence with small data volume and unbalanced distribution.
Owner:XIAMEN UNIV

High-throughput retrieval method for drug targets

ActiveCN105205351ASolve the problem of different parameters and different processing methods for different peopleUniversally applicableSpecial data processing applicationsProtein DatabasesProtein target
The invention relates to a high-throughput retrieval method for drug targets, and belongs to the field of bioinformatics. The high-throughput retrieval method includes the steps that a drug and target complex serves as reference, a drug combining bag is defined, all fragments in the combining bag are represented with protein structural fingerprints, and the protein structural fingerprints include amino acid sequences, protein folding shape codes, physicochemical properties and vector coupling; the digital drug combining bag is input, a global known protein structure database is retrieved to perform fingerprint comparison and quantitative evaluation, and protein structures are arrayed in the sequence of fingerprint similarity from high to low; structural protein is selected as possible target spot regions, wherein similarity scores of the protein folding codes and similarity scores of the amino acid physicochemical properties reach top two thousand at the same time, and possible target protein of drugs is analyzed and predicted. The high-throughput retrieval method can be applied to secondary development and research of the drugs, and new effects of the approved clinic drugs are developed by finding the new targets.
Owner:MICRO PHARMATECH

Peptide identification method based on subset error rate estimation

The invention relates to a peptide identification method based on subset error rate estimation. The peptide identification method comprises the following steps: 1, analyzing a peptide sample to be identified by a mass spectrometer to generate a tandem mass spectrum; 2, searching a target-bait protein database containing a target peptide sequence in the tandem mass spectrum, and sorting obtained peptide identification results according to scores from high to low; 3, setting a score threshold value x, and estimating the error rate FDRk(x) of a type k peptide identification subset, the score of which is higher than x, by a transferring FDR (False Discovery Rate) method; 4, finding the minimum value of x by adjusting the score threshold value x to enable the estimated FDRk(x) to be less than a given error rate control level alpha, so that the obtained type k peptide identification result with the score higher than x serves as an acceptable reliable identification result. The peptide identification method provided by the invention estimates the subset error rate through the transferring FDR method and obtains the reliable peptide identification result through the subset error rate, thus having high identification accuracy.
Owner:ACAD OF MATHEMATICS & SYSTEMS SCIENCE - CHINESE ACAD OF SCI

Method for rapidly identifying strain-level pathogenic bacteria in food through double adsorption

InactiveCN108020674AHigh-resolutionStrengthen the effect of microwave wall breakingBiological testingProtein DatabasesRibosomal protein E-L30
The invention discloses a method for rapidly identifying strain-level pathogenic bacteria in food through double adsorption. The method comprises main steps as follows: a) strain-level pathogenic bacteria in liquid food are adsorbed by MIL-101 magnetic particles, and wall breaking is performed under microwave assistance; after wall breaking, to-be-identified pathogenic bacterium protein is rapidlyenriched through the MIL-101 magnetic particles; b) a mass spectrum graph of the pathogenic bacterium protein is collected through MALDI / TOF MS (matrix-assisted laser desorption ionization / time of flight mass spectrometry); c) a Tagident search tool is used for performing protein database searching on mass spectrum peak, and ribosomal protein is selected; d) a rapid microorganism identification database is sought by use of ribosomal protein obtained through searching, and the attributes of the strain-level pathogenic bacteria are determined. The method has the advantages of simplicity and rapidness. Enrichment of pathogenic bacteria, wall breaking and mycoprotein enrichment are integrated, finally, a ribosomal protein database is used for rapidly identifying the pathogenic bacteria, and the strain level of pathogenic bacteria in the liquid food is rapidly identified.
Owner:TIANJIN MODERN VOCATIONAL TECH COLLEGE

Method of anticipating interaction between proteins

InactiveCN1416549ALibrary screeningPeptide preparation methodsProtein DatabasesAmino acid sequence alignment
The present invention relates a method for predicting a protein or polypeptide (B) that interacts with a specific protein or polypeptide (A), wherein the method is characterized by comprising: 1) decomposing the amino acid sequence of protein or polypeptide (A) into a series of oligopeptides having a pre-determined length as sequence information; 2) searching, within a database of protein or polypeptide amino acid sequences, for a protein or polypeptide (C) comprising an amino acid sequence for each member of the series or for a protein or polypeptide (D) comprising an amino acid sequence homologous to an amino acid sequence for each member of the series; 3) carrying out local amino acid sequence alignment between said protein or polypeptide (A) and the detected protein or polypeptide (C) or detected protein or polypeptide (D); and 4) predicting whether the detected protein or polypeptide (C) and / or protein or polypeptide (D) is a protein or polypeptide (B) that interacts with the protein or polypeptide (A) based on the results of the local amino acid sequence alignment and a value calculated from a frequency of amino acids and / or a frequency of said oligopeptides in said amino acid sequence database; and to a recording medium for carrying out the above method, a device comprising the recording medium, and proteins obtained thereby.
Owner:DAIICHI SEIYAKU CO LTD +1

Protein structure prediction method and device based on multi-task time domain convolutional neural network

The invention relates to a protein structure prediction method and device based on a multi-task time domain convolutional neural network. The method comprises the steps of: obtaining a target gene sequence and a protein database; establishing a DNA RNAamino acid ternary sequence data set corresponding to each protein according to the genetic code table and a protein database; establishing a multiple regression equation according to the residue depth and physicochemical properties of amino acids in the protein database to obtain statistical depth characteristics of each protein; clustering theternary sequence data set and mapping the ternary sequence data set into a multi-dimensional feature vector; taking the multi-dimensional feature vector and the statistical depth feature of the protein as the input of a multi-task time domain convolutional neural network, and training the multi-task time domain convolutional neural network; and predicting the protein structure by utilizing the statistical depth characteristics of the protein. According to the invention, the statistical depth characteristics of the protein are combined with the multi-task time domain convolutional neural network, so that the complexity of the model is reduced, and the generalization and the fitting degree are improved.
Owner:WUHAN GENECREATE BIOLOGICAL ENG CO LTD

Protein structure prediction method, protein structure prediction device and medium

The invention provides a protein structure prediction method, a protein structure prediction device and a medium. The protein structure prediction method is applied to the computer equipment, the computer equipment comprises a CPU and at least one GPU, and the method comprises the following steps: obtaining a target protein sequence of a to-be-predicted protein structure. And in the CPU, according to the sequence length of the target protein sequence, determining an alignment quantity threshold value of a matching sequence corresponding to the target protein sequence. And comparing the target protein sequence with a plurality of protein sequences in a preset protein sequence library according to the comparison quantity threshold, and determining a matching sequence corresponding to the target protein sequence. And determining a matching structure corresponding to the matching sequence in a preset protein structure database. And inputting the matching sequence and the matching structure into a protein structure prediction model preset in a GPU for protein structure prediction, and obtaining a protein prediction structure corresponding to the target protein sequence. The memory occupation of the GPU can be reduced, the operation speed of the GPU is improved, and the prediction rate is accelerated.
Owner:SUZHOU LANGCHAO INTELLIGENT TECH CO LTD

Method for carrying out large-scale proteomics identification based on silkworm tissue sample

The invention relates to a method for carrying out large-scale proteomics identification based on a silkworm tissue sample. The method comprises the following steps: pre-treating a domestic silkworm proteomics sample, carrying out graded peptide fragment mass spectrum online detection and constructing a silkworm protein database. By optimizing a protein extraction method, a peptide fragment is divided into 8 grades by adopting a high pH (Potential of Hydrogen) grading method and silkworm fat body samples can be extracted as many as possible, so that the protein identification quantity is improved; a condition that a sample spraying needle is blocked, caused by the fact that a feeding amount is too great, is prevented through optimizing a sample feeding amount and chromatography gradient time; the detection time is shortened through optimizing the chromatography gradient time; a Streamline database containing 21,878 protein sequences is established; the database can be used for identifying more protein quantity and redundant sequences are removed by the database; later-period proteomics data analysis is facilitated. According to the method provided by the invention, a stable and efficient domestic silkworm proteomics identification platform is established and the method has important meaning on silkworm proteomics large-scale identification.
Owner:SOUTHWEST UNIVERSITY

Index acceleration method and corresponding system in scale protein identification

The invention provides an index acceleration method in scale protein identification, which comprises the following steps of: setting quality intervals for peptide sequences; setting the size of counting windows, and setting the number of the counting windows and the range of each counting window by combining the quality intervals; performing simulated enzyme digestion on protein database, and calculating the quantity of the peptide sequences in each counting window according to the quality of the peptide sequences obtained through the simulated enzyme digestion; obtaining the quantity of the peptide sequences which can be processed once in the memory of a computer according to the capacity of the memory of the computer, and obtaining a quality range section of the peptide sequences which can be processed once in the memory of the computer by combining the quantity of the peptide sequences in each counting window; performing the simulated enzyme digestion on the protein database, saving the obtained peptide sequences in one quality range section in the memory of the computer, and finishing the operations of sequencing, redundancy removal, and dictionary and inverted list establishment on the saved peptide sequences in the memory of the computer; and establishing a dictionary and an inverted list for each quality range section.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Method for predicting antibacterial peptides of lactic acid bacteria based on graph neural network

The invention discloses a method for predicting antibacterial peptides of lactic acid bacteria based on a graph neural network. The method comprises the following steps: establishing a positive sample by searching known antibacterial peptides of lactic acid bacteria, establishing a negative sample by collecting sequences with the length of 5 to 255 from a protein database, and removing redundant sequences and similarities; performing feature extraction according to the positive and negative samples to obtain a feature vector and an initial input graph, and establishing a graph neural network model on the basis; through training, evaluation and loop optimization of the graph neural network model, determining parameters such as the optimal layer number, the optimal training round number and the learning rate of the graph neural network; and finally, predicting data of strains suspected to have antibacterial activity according to the graph neural network model. By adopting the method for predicting the antibacterial peptides of the lactic acid bacteria, wet experiment screening in a laboratory is replaced by computer model prediction, the judgment time of the protein sequence of the antibacterial peptides of the lactic acid bacteria is shortened, accurate and efficient batch identification is realized, and an effective alternative method is provided for screening lactic acid bacteria strains with antibacterial characteristics.
Owner:INNER MONGOLIA AGRICULTURAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products