Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size
Jingshan Zhang1, Sergei Maslov2 & Eugene I Shakhnovich1
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
- Department of Condensed Matter Physics and Materials Science, Brookhaven National Laboratory, Upton, NY, USA
Correspondence to: Eugene I Shakhnovich1 Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 2138, USA. Tel.: +617 495 4130; Fax: +617 384 9228; Email: eugene@belok.harvard.edu
Received 7 February 2008; Accepted 21 June 2008; Published online 5 August 2008
Article highlights
- Crowded intracellular environments present a challenge for proteins to form functional specific complexes while reducing nonfunctional interactions with promiscuous nonspecific partners.
- These requirements impose limit on concentrations and diversity (number of types) of proteins, which can be co-expressed in cells' compartments, while cells evolve towards greater complexity manifested in diversity of their proteomes.
- Cytoplasm and other compartments of Baker's yeast evolved to the limit of highest complexity-number of types of proteins co-expressed -consistent with the constraints that functional protein interactions are dominant and that majority of proteins are not sequestered into random, nonfunctional complexes.
Synopsis
Most proteins need to bind their specific protein partners to properly perform their biological function. When proteins diffuse in crowded intracellular environments searching for these functional partners, they randomly encounter functionally unrelated proteins and form transient non-functional complexes. If too many protein types coexist in a cellular compartment, the waste of resource to such non-functional interactions would significantly impair the functions of proteins. We hypothesize that the need to reduce such inefficiency down to an acceptable level sets an upper limit to the proteome diversity.
Among the
4500 types of baker's yeast proteins simultaneously expressed under the log-growth conditions (Ghaemmaghami et al, 2003; Huh et al, 2003), about 1800 types are known to be co-localized in the cytoplasm (Huh et al, 2003). How much do non-functional interactions affect normal protein functioning in the cytoplasm and other compartments of yeast cells? To check our hypothesis, we sketch the phase diagram in Figure 3 and find that the cytoplasm, nucleus and mitochondria are all located near a special corner of the phase diagram. The shadowed region is a biologically problematic one in which the ability of proteins to form a sufficient number of functional specific complexes is significantly impaired. Namely, if the number m of protein types co-expressed and co-localized in a compartment was to increase considerably, the system would enter the shadowed undesirable region where a typical protein spends more time tied up inside non-functional complexes than in the functionally important ones. On the other hand, if the average concentration of individual proteins
was to decrease, the system would once again enter the undesirable region where proteins spend most of their time in a monomeric, unbound state.
Figure 3
The phase diagram for baker's yeast. The area in shade is the 'dead zone'. Its boundaries, defined by Equations (13) and (15), are blurred to indicate they are crossover rather than sharp transitions. The colours indicate the average fraction of proteins tied up inside non-functional complexes in the yeast cytoplasm as defined in Equation (12). The white circle shows the
and m for a group of proteins coexpressed and colocalized in the yeast cytoplasm, whereas the white star and cross represents these parameters for the cell nucleus and mitochondria.
Although it is advantageous for compartments of a real cell to avoid the dangerous 'shadowed' region, why are they positioned close to the upper limit of m and the lower limit of
? First, we conjecture that, in general, the biological evolution pushes an organism towards higher protein diversity, as this allows the organism to perform a broader spectrum of functionally important tasks. This explains our observation that m is close to the upper limit. Secondly, given the value of m, the organism decreases
all the way down to its lower bound to reduce the fraction of time proteins waste inside non-functional complexes. Our analysis shown in colour in Figure 3 indicates that a typical protein localized in yeast cytoplasm wastes 20% of the searching time tied up in non-functional complexes. Owing to the variation of concentrations and other parameters of individual proteins, for some proteins the fraction of time wasted searching for their specific partners is considerably larger. Thus, a substantial increase in
could impair the temporal efficiency of biochemical processes.
The major quantities required to produce Figure 3 are the concentrations and subcellular localizations of individual proteins, as well as the typical binding (free) energy of their functional (specific) and non-functional (non-specific) interactions. For protein concentrations, we use the large-scale experimental data on expression levels and localizations of nearly all yeast proteins (Ghaemmaghami et al, 2003; Huh et al, 2003; Belle et al, 2006), and average compartment volumes reported by Visser et al (1995) and Jorgensen et al (2007). The median binding energy of specific interactions is taken from the PINT (Protein Interaction Thermodynamic) Database (Kumar and Gromiha, 2006). Binding energies of non-functional interactions are not readily available from the literature. To obtain a rough estimate, we adopt a simple model (Deeds et al, 2006), which assumes that the free energy of binding between two functionally unrelated proteins is given by a linear function of the Gaussian-distributed hydrophobic coverage of their surfaces. As large-scale yeast two-hybrid experiments check for interactions between all possible pairs of highly overexpressed bait-and-prey proteins, we expect relatively strong non-functional interactions to be amply represented among the results of such assays. Hence, the fraction of interacting pairs in large-scale yeast two-hybrid core data sets (Ito et al, 2001; Uetz et al, 2000), which report all PPIs with affinities of the order of Kd*
1
M (Estojak et al, 1995) and stronger, allows us to estimate the typical strength of non-functional interactions and its distribution across the whole proteome.
Acknowledgements
We are grateful to Paul Choi, Eric Deeds, Lucas Nivon and Boris Shakhnovich for discussions, and Orr Ashenberg for sharing hydrophobicity data. This work was supported by the National Institutes of Health. Work at Brookhaven National Laboratory was carried out under Division of Material Science, US Department of Energy Contract DE-AC02-98CH10886.
References
- Aloy P, Russell RB (2006) Structural systems biology: modelling protein interactions. Nat Rev Mol Cell Biol 7: 188–197 | Article | PubMed | ISI | ChemPort |
- Bogan AA, Thorn KS (1998) Anatomy of hot spots in protein interfaces. J Mol Biol 280: 1–9 | Article | PubMed | ISI | ChemPort |
- Deane CM, Salwinski L, Xenarios I, Eisenberg D (2002) Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics 1: 349–356 | Article | PubMed | ISI | ChemPort |
- Deeds EJ, Ashenberg O, Gerardin J, Shakhnovich EI (2007) Robust protein protein interactions in crowded cellular environments. Proc Natl Acad Sci USA 104: 14952–14957 | Article | PubMed | ChemPort |
- Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E, Vijayadamodar G, Pochart P, Machineni H, Welsh M, Kong Y, Zerhusen B, Malcolm R, Varrone Z, Collis A, Minto M et al (2003) A protein interaction map of Drosophila melanogaster. Science 302: 1727–1736 | Article | PubMed | ISI | ChemPort |
- Glyakina AV, Garbuzynskiy SO, Lobanov MY, Galzitskaya OV (2007) Different packing of external residues can explain differences in the thermostability of proteins from thermophilic and mesophilic organisms. Bioinformatics 23: 2231–2238 | Article | PubMed | ChemPort |
- Huang H, Jedynak BM, Bader JS (2007) Where have all the interactions gone? Estimating the coverage of two-hybrid protein interaction maps. PLoS Comput Biol 3: e214 | Article | PubMed | ChemPort |
- Janin J (1995) Protein–protein recognition. Prog Biophys Mol Biol 64: 145–166 | Article | PubMed | ChemPort |
- Maslov S, Ispolatov I (2007) Propagation of large concentration changes in reversible protein-binding networks. Proc Natl Acad Sci USA 104: 13655–13660 | Article | PubMed | ChemPort |
- Noskov SY, Lim C (2001) Free energy decomposition of protein-protein interactions. Biophys J 81: 737–750 | PubMed | ChemPort |


