1 / 49

Modularity in Biological networks

Modularity in Biological networks. Traditional view of modularity:. Modularity in Cellular Networks. Hypothesis: Biological function are carried by discrete functional modules. Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W., Nature , 1999.

arvid
Download Presentation

Modularity in Biological networks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Modularity in Biological networks

  2. Traditional view of modularity: Modularity in Cellular Networks • Hypothesis: Biological function are carried by discrete functional modules. • Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W., Nature, 1999. • Question: Is modularity a myth, or a structural property of biological networks? (are biological networks fundamentally modular?)

  3. Modularity in cell biology

  4. Definition of a module • Loosely linked island of densely connected nodes • Groups of co-expressed genes

  5. Concept of modules in a network

  6. Concept of modules in a network

  7. Definition of a module

  8. Computational analysis of modular structuresData clustering approach

  9. Concept of data clustering analysis • Partitioning a data set into groups so that points in one group are similar to each other and are as different as possible from the points in other groups. • The validity of a clustering is often in the eye of beholder.

  10. Concept of data clustering analysis • In order to describe two data points are similar or not, we need to define a similarity measure. • We also need a score function for our objectives. • A clustering algorithm can be used to partition the data set with optimized score function.

  11. Types of clustering algorithms • Partition-based clustering algorithms • Hierarchical clustering algorithms • Probabilistic model-based clustering algorithms

  12. Partitioning problem • Given the set of n nodes network D={x(1),x(2),∙∙∙,x(n)}, our task is to find K clusters C={C1,C2,∙∙∙,CK} such that each node x(i) is assigned to a unique cluster Ck with optimized score function S(C1,C2,∙∙∙,CK).

  13. Community structure of biological network Community 2 Community 1 Community 3

  14. Score function for network clustering • To maximize the intra group connections as many as possible and to minimize the inter group connection as few as possible.

  15. Spectral analysis clustering algorithm

  16. Adjacency Matrix • Aij= 1 if ith protein interacts with jth protein • Aij=0 otherwise • Aij=Aji (undirected graph) • Aij is a sparse matrix, most elements of Aij are zero

  17. Spectral analysis

  18. Algorithm (Spectral analysis) • Randomly assign a vector X=(X1,X2,…,Xn) • Iterate X(k+1)=AX(k) untill it converges • Try another vector which is perpendicular to previous found eigenspace

  19. Topological Structure Hidden Topological Structure Original Network

  20. An example Protein-protein interaction network of Saccharomyces cerevisiae

  21. Data source Assign 80000 interactions of 5400 yeast proteins a confidence value We take 11855 interactions with high and medium confidence among 2617 proteins with 353 unknown function proteins.

  22. Quasi-bipartite Quasi-clique negative eigenvalue Positive eigenvalue

  23. With the spectral analysis, we obtain 48 quasi-cliques and 6 quasi-bipartites. • There are annotated proteins, unannotated and unknown proteins within a quasi-clique

  24. Application—function prediction

  25. Hierarchical clustering algorithm • A similarity distance measure between node i and j, d(i,j) • The similarity measure can be let the network to be a weighted network Wij.

  26. Types of hierarchical clustering • Agglomerative hierarchical clustering • Divisive hierarchical clustering

  27. Properties of similarity measure • d(i,j)≥0 • d(i,j)=d(j,i) • d(i,j)≤d(i,k)+d(k,j)

  28. Similarity measure for agglomerative clustering • Correlation • Shortest path length • Edge betweenness

  29. How good is agglomerative clustering ?

  30. Hierarchical tree (Dendrogram) threshold

  31. Distance between clusters Cluster 2 Cluster 1 Single link

  32. Distance between clusters Cluster 2 Cluster 1 Complete link

  33. Single link 1.5 2.0 2.2 3.5 x2 x3 x1 x4 x5

  34. Divisive hierarchical clustering M.E.J., Newman and M. Girvan, Phys. Rev. E 69, 026113, (2004)

  35. Definition of edge betweeness

  36. Definition of edge betweeness

  37. Calculation of edge betweenness

  38. Quantitative measurement of network modularity Modularity Q

  39. Threshold selection

  40. Karate club network

  41. Karate club network

  42. Examples of agglomerative hierarchical clustering

  43. Can we identify the modules? J(i,j): # of nodes both i and j link to; +1 if there is a direct (i,j) link

  44. Modules in the E. coli metabolism E. Ravasz et al., Science, 2002 Pyrimidine metabolism

  45. Yeast signaling proteins in MIPS PNAS, vol.100, pp.1128, (2003).

  46. Spotted microarray for Saccharomyces cerevisiae Similarity measure

  47. Regulatory module network

  48. Genome Biology, 9, R2, (2008).

More Related