A method of identifying protein compounds by using a fruit fly optimization method

A protein complex and Drosophila optimization technology, applied in the field of bioinformatics, can solve the problem of low recognition accuracy of protein complexes, the inability to consider the internal structure of protein complexes with global characteristics and local characteristics, and failure to consider protein interactions Network characteristics and other issues to achieve high accuracy

Inactive Publication Date: 2016-08-17
SHAANXI NORMAL UNIV
View PDF0 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But they generate protein complexes mainly through some heuristic rules without taking into account the network properties of the entire protein interaction
[0009] The shortcomings of the above clustering methods do not take into account the dynamics of the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method of identifying protein compounds by using a fruit fly optimization method
  • A method of identifying protein compounds by using a fruit fly optimization method
  • A method of identifying protein compounds by using a fruit fly optimization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0064] Taking 12 dynamic protein networks as an example, the steps to identify protein complexes using the Drosophila optimization method are as follows:

[0065] In this embodiment, the yeast data set (DIP 20140427 version) collected from the DIP database is used as the simulation data set. The DIP data contains 4995 proteins and 21554 interaction relationships. The gene expression dataset is collected from the yeast metabolic expression dataset GSE3431 in the GEO database, which includes 6777 genes, gene values ​​at 36 time points in 3 cycles, covering 95% of the proteins in DIP. Using gene expression values ​​to create 12 dynamic protein interaction networks. The experimental platform is Windows 7 operating system, Intel Core 2 Duo 3.1GHz processor, 4GB physical memory, and realizes the FOCA method of the present invention with Matlab R2010b software.

[0066] 1. Transform the protein interaction network into an undirected graph

[0067] Transform the protein interaction ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method of identifying protein compounds by using a fruit fly optimization method. The method comprises the steps of converting a protein-protein interaction network into a undirected graph, performing pretreatment on the edges and nodes of the protein-protein interaction network, establishing a dynamic protein-protein interaction network, setting parameters, forming fruit fly positions, matching fruit flies with the protein-protein interaction network, determining initialization fruit fly positions, determining the fruit fly odor concentration, updating the fruit fly positions, generating a protein compound, and filtering the protein compound. The method gives full consideration to the dynamic nature of the protein network, the protein compound inner core-attachment structure and the locality and wholeness of the protein-protein interaction network and can identify protein compounds accurately. The results of simulation experiments show that the performance of the indexes such as the accuracy and the recall ratio are excellent. Compared with other clustering methods, the method, based on the characteristics of the protein network and the protein compounds, realizes the protein compound identification process and improves the protein compound identification accuracy.

Description

technical field [0001] The invention belongs to the field of biological information, in particular to a method for identifying protein complexes in a dynamic protein interaction network. Background technique [0002] At present, with the birth of high-throughput technology, a large amount of protein interaction (protein interaction) data has been detected, and it is becoming more and more important to detect protein complexes by computer to understand the function of unknown proteins and predict diseases. The interaction between proteins changes differently as cells enter different life cycles, so constructing a network that can more realistically simulate the dynamic interactions between proteins before clustering plays an important role in the identification of protein complexes. important role. Most of the current methods for identifying protein complexes through computer clustering work on static protein interaction networks, ignoring the dynamic changes in the connecti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/18
CPCG16B20/00
Inventor 雷秀娟丁玉连吴振强裘国永
Owner SHAANXI NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products