<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>gb-2010-11-9-r98</ui><ji>GBJ</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>The rate of the molecular clock and the cost of gratuitous protein synthesis</p>
</title>
<aug>
<au id="A1"><snm>Plata</snm><fnm>Germ&#225;n</fnm><insr iid="I1"/><insr iid="I2"/><email>gap211811@columbia.edu</email></au>
<au id="A2"><snm>Gottesman</snm><mi>E</mi><fnm>Max</fnm><insr iid="I3"/><insr iid="I4"/><email>meg811@columbia.edu</email></au>
<au ca="yes" id="A3"><snm>Vitkup</snm><fnm>Dennis</fnm><insr iid="I1"/><insr iid="I5"/><email>dv2121@columbia.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Center for Computational Biology and Bioinformatics, Columbia University, 1130 St Nicholas Ave, New York City, NY 10032, USA</p></ins>
<ins id="I2"><p>Integrated Program in Cellular, Molecular, Structural, and Genetic Studies, Columbia University, 1130 St Nicholas Ave, New York City, NY 10032, USA</p></ins>
<ins id="I3"><p>Department of Microbiology and Immunology, Columbia University, 701 W. 168 St, New York City, NY 10032, USA</p></ins>
<ins id="I4"><p>Department of Biochemistry and Molecular Biophysics, Columbia University, 701 W. 168 St, New York City, NY 10032, USA</p></ins>
<ins id="I5"><p>Department of Biomedical Informatics, Columbia University, 1130 St Nicholas Ave, New York City, NY 10032, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2010</pubdate>
<volume>11</volume>
<issue>9</issue>
<fpage>R98</fpage>
<url>http://genomebiology.com/2010/11/9/R98</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-9-r98</pubid><pubid idtype="pmpid">20920270</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>8</day><month>6</month><year>2010</year></date></rec><revrec><date><day>3</day><month>9</month><year>2010</year></date></revrec><acc><date><day>29</day><month>9</month><year>2010</year></date></acc><pub><date><day>29</day><month>9</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Plata et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>The nature of the protein molecular clock, the protein-specific rate of amino acid substitutions, is among the central questions of molecular evolution. Protein expression level is the dominant determinant of the clock rate in a number of organisms. It has been suggested that highly expressed proteins evolve slowly in all species mainly to maintain robustness to translation errors that generate toxic misfolded proteins. Here we investigate this hypothesis experimentally by comparing the growth rate of <it>Escherichia coli </it>expressing wild type and misfolding-prone variants of the LacZ protein.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We show that the cost of toxic protein misfolding is small compared to other costs associated with protein synthesis. Complementary computational analyses demonstrate that there is also a relatively weaker, but statistically significant, selection for increasing solubility and polarity in highly expressed <it>E. coli </it>proteins.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Although we cannot rule out the possibility that selection against misfolding toxicity significantly affects the protein clock in species other than <it>E. coli</it>, our results suggest that it is unlikely to be the dominant and universal factor determining the clock rate in all organisms. We find that in this bacterium other costs associated with protein synthesis are likely to play an important role. Interestingly, our experiments also suggest significant costs associated with volume effects, such as jamming of the cellular environment with unnecessary proteins.</p>
</sec>
</sec>
</abs>
</fm><meta>
<classifications>
<classification id="30010001" subtype="man_spc_id" type="BMC">Biochemistry and structural biology</classification>
<classification id="30010002" subtype="man_spc_id" type="BMC">Bioinformatics</classification>
<classification id="30010008" subtype="man_spc_id" type="BMC">Evolution</classification>
</classifications>
</meta><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Once the first protein sequences became available, their comparison led to the conclusion that the number of accumulated substitutions between orthologs was mainly a function of the evolutionary time elapsed since the last common ancestor of corresponding species <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
</abbrgrp>. Consequently, orthologous proteins accumulate substitutions at an approximately constant rate over long evolutionary intervals. This observation suggests that one can use available protein sequences as a molecular clock to estimate divergence times between different species <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Further studies revealed that while the pace of the molecular clock is similar for orthologous proteins in different lineages, it varies by several orders of magnitude across non-orthologous proteins <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>.</p>
<p>For several decades the dominant hypothesis explaining the large variability of the molecular clock rate between non-orthologous proteins was based on the concept of functional protein density: the higher the fraction of protein residues directly involved in its function, the slower the protein molecular clock <abbrgrp>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. It was not until high-throughput genomics data became widely available that multiple molecular and genetic variables were used to investigate the dominant factors influencing the molecular clock rates of different proteins. Surprisingly, such features as gene essentiality <abbrgrp>
<abbr bid="B8">8</abbr>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>, the number of protein-protein interactions <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp>, and specific functional roles <abbrgrp>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>, have been shown to have, on average, either non-significant or significant but relatively weak correlations with protein evolutionary rates. On the other hand, quantities directly related to gene expression, such as codon bias, mRNA expression, and protein abundance, showed the strongest correlation with the rate of protein evolution <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>. For example, expression alone explains about a third of the variance in the substitution rates in several microbial species <abbrgrp>
<abbr bid="B14">14</abbr>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
</abbrgrp> and about a quarter of the variance in <it>Caenorhabditis elegans </it>
<abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>. In these and many other organisms, highly expressed genes accept significantly less synonymous and non-synonymous (amino acid changing) substitutions than genes with low expression levels <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>.</p>
<p>Considering the major role played by expression in setting the rate of amino acid substitutions, it is important to understand the main molecular mechanisms of this effect <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. A popular theory by Drummond <it>et al. </it>
<abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp> suggests that highly expressed proteins may evolve slowly in all organisms, from microbes to human <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, due to the selection against toxicity associated with protein misfolding. The logic behind this interesting hypothesis is that a significant fraction (&gt; 10%) of cellular proteins may contain translation errors <abbrgrp>
<abbr bid="B24">24</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp> that could cause cytotoxic protein misfolding. If misfolded proteins indeed incur substantial toxicity costs, greater pressure to avoid misfolding will affect highly expressed genes since they generate relatively more misfolded proteins <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. Consequently, adaptive pressure will maintain sequences of highly expressed proteins robust to translation errors, which will in turn slow the amino acid substitution rate, that is, the protein molecular clock. The misfolding toxicity hypothesis was supported by the results of computer simulations <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, but to the best of our knowledge, it has never been tested experimentally.</p>
<p>In this study we specifically investigated whether the toxicity of misfolded proteins or other costs associated with protein synthesis make a dominant contribution to cellular fitness (growth rate), and consequently constrain the molecular clock in <it>Escherichia coli</it>. To test this, we used wild type (WT) and misfolding-prone variants of the <it>E. coli </it>&#946;-galactosidase gene, <it>lacZ</it>. We also computationally analyzed the contribution of other related factors, such as protein stability and solubility.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>The native biological function of the LacZ protein is to cleave lactose for use as a source of carbon and energy <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>; in the absence of lactose, &#946;-galactosidase does not participate in <it>E. coli </it>carbon metabolism. Therefore, we used <it>lacZ </it>expression in a lactose-free medium to measure the cost of gratuitous protein expression <abbrgrp>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
</abbrgrp>. To compare that expression cost to the cost of potentially toxic protein misfolding, we used site-directed mutagenesis to engineer several destabilizing single-residue substitutions into LacZ. Single amino acid substitutions should serve as a good model for translational errors because only rarely, in about 10% of the proteins that contain translation errors, two or more residues will be simultaneously mistranslated in the same protein. We expressed the misfolding-prone mutants at the same level as the WT protein. Because the misfolded LacZ proteins are both potentially toxic and also devoid of biological function, the comparison of the growth rates of bacteria carrying the WT and each of the destabilized mutants allowed us to evaluate the additional fitness cost specifically arising from misfolding toxicity.</p>
<sec>
<st>
<p>Destabilizing mutations in <it>lacZ </it>yield aggregated and partialy soluble proteins</p>
</st>
<p>Amino acid substitutions in protein cores are significantly more destabilizing than substitutions on protein surfaces <abbrgrp>
<abbr bid="B29">29</abbr>
<abbr bid="B30">30</abbr>
</abbrgrp>. Therefore, we selected five buried residues encoding non-polar amino acids that could be mutated to polar residues with single nucleotide substitutions while maintaining a similar level of codon preference (Table <tblr tid="T1">1</tblr>). We used the DPX server <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> to identify buried residues of the LacZ protein based on its crystal structure (Protein Data Bank (PDB) code <ext-link ext-link-id="1dp0" ext-link-type="pdb">1dp0</ext-link>). We then applied the I-Mutant2.0 algorithm <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> to confirm that the selected substitutions would be indeed destabilizing. Using site-directed mutagenesis, the five selected substitutions were introduced separately into plasmids containing <it>lacZ </it>under transcriptional control of the isopropyl &#946;-D-1-thiogalactopyranoside (IPTG)-inducible <it>lac </it>promoter <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. We then used a &#946;-galactosidase assay <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp> to experimentally confirm reductions in the catalytic activity of LacZ in all of the generated mutants (Table <tblr tid="T1">1</tblr>).</p>
<tbl hint_layout="double" id="T1"><title><p>Table 1</p></title><caption><p>Characteristics of destabilizing mutations engineered into <it>E. coli </it>&#946;-galactosidase</p></caption><tblbdy cols="6">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center" cspan="5">
            <p>
               <b>Mutant</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>V567D</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>F758S</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>I141N</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>G353D</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>A880E</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Predicted &#916;&#916;G (kcal/mol)</p>
         </c>
         <c ca="center">
            <p>-2.6</p>
         </c>
         <c ca="center">
            <p>-2.9</p>
         </c>
         <c ca="center">
            <p>-2.4</p>
         </c>
         <c ca="center">
            <p>-1.6</p>
         </c>
         <c ca="center">
            <p>-0.6</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Relative protein activity (%)</p>
         </c>
         <c ca="center">
            <p>31</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>17</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>61</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Codon substitution (WT/mutant)</p>
         </c>
         <c ca="center">
            <p>GTC/GAC</p>
         </c>
         <c ca="center">
            <p>TTT/TCT</p>
         </c>
         <c ca="center">
            <p>ATT/AAT</p>
         </c>
         <c ca="center">
            <p>GGC/GAC</p>
         </c>
         <c ca="center">
            <p>GCG/GAG</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Codon preference % (WT/mutant)</p>
         </c>
         <c ca="center">
            <p>13.5/53.9</p>
         </c>
         <c ca="center">
            <p>29.0/32.4</p>
         </c>
         <c ca="center">
            <p>33.5/17.3</p>
         </c>
         <c ca="center">
            <p>42.8/53.9</p>
         </c>
         <c ca="center">
            <p>32.3/24.7</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Found in inclusion bodies (see Figure 1a)</p>
         </c>
         <c ca="center">
            <p>No</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>No</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>In the table, &#916;&#916;G values represent destabilizing effects predicted by the I-Mutant2.0 server <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The experimentally determined enzymatic activities of the mutants (in percentages) are shown in the table relative to WT.</p>
   </tblfn></tbl>
<p>To determine whether the destabilized proteins tended to aggregate, we separated soluble proteins and proteins in inclusion bodies (see Materials and methods) and analyzed them by SDS-PAGE (Figure <figr fid="F1">1a</figr>). The three mutants with the lowest catalytic activity (F758S, I141N and G353D) were found in inclusion bodies (Table <tblr tid="T1">1</tblr>), the remaining two mutants (V567D and A880E) and WT proteins were found mainly in the soluble protein fraction. Next, by inspecting total cell extracts at different time points after IPTG induction, we confirmed that the total amount of protein synthesized in each mutant strain was similar to that in the WT. As shown in Figure <figr fid="F1">1b</figr>, similar amounts of LacZ are produced in the WT and either soluble (V567D) or insoluble (F758S) mutants. Quantitative analysis of the Coomasie stained bands also did not reveal any significant difference between the LacZ synthesis rates in WT and mutant strains (Figure <figr fid="F1">1c</figr>). Finally, because expression of misfolded proteins is expected to generate a heat shock response <abbrgrp>
<abbr bid="B35">35</abbr>
<abbr bid="B36">36</abbr>
</abbrgrp>, we used western blots to monitor the amount of the GroEL heat shock protein in induced and un-induced cells carrying WT and mutant <it>lacZ </it>(Figure <figr fid="F1">1d</figr>). In cells carrying WT <it>lacZ</it>, the concentration of GroEL increased when IPTG was added. However, in both the V567D and F758S mutants, the levels of GroEL in either induced or uninduced cells were equal or higher than that in induced WT cells.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Expression of destabilizing mutants and wild-type LacZ</p></caption><text>
   <p><b>Expression of destabilizing mutants and wild-type LacZ</b>. <b>(a) </b>SDS-PAGE of soluble and insoluble fractions of cells expressing WT LacZ and five destabilizing mutants induced with 10 &#956;M IPTG. <b>(b) </b>Total &#946;-galactosidase at different times after IPTG induction. The LacZ band is indicated by the black arrow. <b>(c) </b>Relative synthesis rate of &#946;-galactosidase. <it>P</it>-values were obtained using a <it>t</it>-test of the linear regression slopes based on quantification of the gel images. Error bars represent the standard error of the regression slopes. <b>(d) </b>GroEL western blots in cells exprerssing WT and LacZ mutants. S, soluble fraction; I, insoluble fraction; '-', no IPTG; '+', 20 &#956;M IPTG; &#916;, heat shock (1 h shift from 37 to 42&#176;C).</p>
</text><graphic file="gb-2010-11-9-r98-1" hint_layout="double"/></fig>
<p>Overall, the results described in this section demonstrate that: all engineered mutants have significantly reduced catalytic activities; soluble and insoluble mutants are expressed at the same level as WT; and the mutants induce a heat shock response, and in some cases aggregate in inclusion bodies.</p>
</sec>
<sec>
<st>
<p>Misfolded proteins are no more toxic than wild-type proteins</p>
</st>
<p>The synthesis of WT or mutant &#946;-galactosidase was initially induced by adding 10 &#956;M IPTG. Using WT LacZ activity as a reference <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>, we estimated that about 30,000 molecules of &#946;-galactosidase were present in each bacterial cell at this induction level. This approximately corresponds to half of the protein molecules expressed by a fully induced WT <it>lacZ </it>operon <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. Cells expressing WT LacZ grew 13.5% slower on glycerol as the sole carbon source compared to uninduced cells (Figure <figr fid="F2">2a</figr>). If misfolded proteins indeed impose a significant extra cost on the bacterium, then similarly expressed mutant strains with destabilizing substitutions should lead to a more pronounced growth decrease compared to the one observed with WT LacZ. However, as shown in Figure <figr fid="F2">2a</figr>, the mutant strains grew as well as cells expresing WT LacZ, and, despite inclusion body formation, two of the mutants even grew significantly faster (see Discussion).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Comparison of the growth rates for wild-type and misfolding-prone LacZ</p></caption><text>
   <p><b>Comparison of the growth rates for wild-type and misfolding-prone LacZ</b>. <b>(a) </b>Growth rates of cells expressing WT LacZ relative to uninduced cells and cells expressing each of the five destabilizing mutants (10 &#956;M IPTG). Mann-Whitney U <it>P</it>-value: *0.02; **8 &#215; 10<sup>-4</sup>. <b>(b) </b>Growth rates of cells expressing WT LacZ and two mutants at different induction (IPTG) levels; the growth rate of cells carrying an empty plasmid is also shown for comparison. <b>(c) </b>Growth rates of cells expressing LacZ and two destabilizing mutants on acetate and glycerol as the main carbon source; in both cases expression was induced with 10 &#956;M ITPG). Error bars represent the standard error of the mean calculated based on triplicate experiments.</p>
</text><graphic file="gb-2010-11-9-r98-2" hint_layout="double"/></fig>
<p>To further explore the potential toxicity of the destabilized proteins, we focused on two mutants (F758S and V567D). These mutants are examples of a completely aggregated and a soluble but destabilized LacZ protein, respectively. By varying the concentration of IPTG, we monitored the growth of cells with different levels of expressed LacZ proteins (Figure <figr fid="F2">2b</figr>). Importantly, no additional growth decrease was observed in the mutant strains compared to the WT at all IPTG induction levels. When no IPTG was added, resulting in a low expression level from the un-induced promoter, we also observed the same growth rate reduction in all constructs relative to cells carrying an empty pBR322 plasmid (Figure <figr fid="F2">2b</figr>).</p>
<p>We investigated the possibility that the toxicity of misfolded proteins was more pronounced on a relatively poor carbon source by measuring the growth of the <it>E. coli </it>V567D and F758S mutants and the WT on acetate. Although the overall growth rate on acetate was only about 60% of that on glycerol, we again did not observe any additional fitness (growth) decrease due to the destabilizing mutations (Figure <figr fid="F2">2c</figr>). This experiment confirmed that the observed results are not specific to a particular carbon source.</p>
</sec>
<sec>
<st>
<p>Nucleotide level selection, protein solubility, and stability in <it>E. coli</it>
</p>
</st>
<p>Nucleotide sequences of highly expressed genes are significantly constrained by selection for amino acid codons corresponding to abundant tRNAs <abbrgrp>
<abbr bid="B38">38</abbr>
<abbr bid="B39">39</abbr>
<abbr bid="B40">40</abbr>
</abbrgrp>. A recent experimental analysis by Kudla <it>et al. </it>
<abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> suggests that non-optimal codons can directly influence <it>E. coli </it>growth (fitness). Using 154 variants of GFP with multiple random synonymous substitutions, these authors found a significant positive correlation between codon optimality and bacterial growth rate. An important role played by the nucleotide-level selection in evolution of <it>E. coli </it>proteins is also supported by a high correlation between the rates of non-synonymous (Ka) and synonymous (Ks) substitutions (Figure <figr fid="F3">3b</figr>; Spearman's rank correlation r = 0.66, <it>P</it>-value &lt; 10<sup>-10</sup>). In addition, the partial correlation between Ka and mRNA expression, controlling for Ks, is small (r = -0.14, <it>P </it>= 7 &#215; 10<sup>-9</sup>), whereas the partial correlation between Ks and expression, controlling for Ka, is significantly higher (r = -0.38, <it>P </it>&lt; 10<sup>-10</sup>).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Correlation of <it>E. coli </it>mRNA expression with Ka, protein solubility, and the fraction of charged residues</p></caption><text>
   <p><b>Correlation of <it>E. coli </it>mRNA expression with Ka, protein solubility, and the fraction of charged residues</b>. <b>(a) </b>Correlation between expression and the rate of non-synonymous substitutions (Ka; Spearman's r = -0.45, <it>P </it>&lt; 10<sup>-10</sup>). <b>(b) </b>Correlation between Ka and the rate of synonymous substitutions (Ks; r = 0.66, <it>P </it>&lt; 10<sup>-10</sup>). <b>(c) </b>Correlation between expression and protein solubility measured <it>in vitro </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp> (r = 0.27, <it>P </it>&lt; 10<sup>-10</sup>). <b>(d) </b>Correlation between expression and the fraction of charged residues (r = 0.28, <it>P </it>&lt; 10<sup>-10</sup>). The red lines on each panel represent a 200-point moving average.</p>
</text><graphic file="gb-2010-11-9-r98-3" hint_layout="double"/></fig>
<p>Although selection for optimal codons at the nucleotide level should significantly affect the rates of both synonymous and non-synonymous substitutions <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>, there are additional constraints specifically acting on non-synonymous sites <abbrgrp>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
</abbrgrp>. Many of these additional constraints affect the propensity of proteins to misfold and aggregate. For example, it has been reported that highly expressed <it>E. coli </it>proteins are more soluble than proteins with lower expression <abbrgrp>
<abbr bid="B44">44</abbr>
<abbr bid="B45">45</abbr>
<abbr bid="B46">46</abbr>
</abbrgrp>. It is likely that the observed increase in solubility is necessary to avoid protein aggregation and non-functional binding <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> mediated by non-specific hydrophobic interactions. Using the genome-wide protein solubility data for <it>E. coli </it>proteins obtained by Niwa <it>et al</it>. <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp>, we indeed observed a significant correlation between solubility and expression (Figure <figr fid="F3">3c</figr>; Spearman's r = 0.27, <it>P </it>&lt; 10<sup>-10</sup>). Importantly, the observed selection for solubility does not explain the correlation between the protein evolutionary rate and expression (Figure <figr fid="F3">3a</figr>; r = -0.45, <it>P </it>&lt; 10<sup>-10</sup>); the partial correlation between Ka and expression, controlling either for solubility or for the fraction of charged residues, is still significant (r = -0.42 and -0.41, respectively; <it>P </it>&lt; 10<sup>-10</sup>).</p>
<p>The positive correlation between solubility and expression is in agreement with an increase in the fraction of charged residues (Figure <figr fid="F3">3d</figr>; r = 0.28, <it>P </it>&lt; 10<sup>-10</sup>) and a simultaneous decrease in the fraction of hydrophobic residues (r = -0.16, <it>P </it>&lt; 10<sup>-10</sup>) in highly expressed <it>E. coli </it>proteins. We observed similar results by analyzing <it>E. coli </it>protein duplicates (paralogs) with different expression levels. By directly comparing duplicates expressed at different levels, many confounding factors, such as differences in folding topology or protein secondary structure, are removed. The analysis of 370 <it>E. coli </it>paralogs (see Materials and methods) demonstrated a decrease in the fraction of hydrophobic residues (paired Wilcoxon signed rank test, <it>P </it>= 7 &#215; 10<sup>-4</sup>) and a simultaneous increase in the fraction of charged residues (<it>P </it>= 7 &#215; 10<sup>-6</sup>) in the duplicates with higher expression levels.</p>
<p>The analysis of 602 <it>E. coli </it>protein structures currently available in the PDB (see Materials and methods) confirmed a significant increase in the fraction of solvent-exposed charged residues in highly expressed proteins (r = 0.18, <it>P </it>= 6 &#215; 10<sup>-6</sup>). While such an increase may lead to higher protein stabilities <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp>, a proposed consequence of selection for translational robustness <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, we did not detect strong correlations between mRNA expression and other structural features usually associated with increased protein stability <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>. For example, we did not observe a significant increase in the fraction of buried hydrophobic residues (r = 0.06, <it>P </it>= 0.13) <abbrgrp>
<abbr bid="B50">50</abbr>
<abbr bid="B51">51</abbr>
<abbr bid="B52">52</abbr>
</abbrgrp> or an increase in the average number of contacts per residue (contact density) in highly expressed <it>E. coli </it>proteins (r = 0.02, <it>P </it>= 0.96). Neither did we find a decrease in the fraction of residues in loops or unstructured protein regions (r = 0.07, <it>P </it>= 0.06) <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>. Our analysis of experimentally determined <it>E. coli </it>protein stabilities assembled in the ProTherm database <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp> also failed to reveal any significant correlation between protein stability, measured either by protein melting temperature (r = -0.14, <it>P </it>= 0.46) or folding free energy (&#916;G, r = -0.08, <it>P </it>= 0.70), and mRNA expression level (Figure <figr fid="F4">4a,b</figr>). We also did not detect significant changes in the contact order, a structural measure strongly associated with folding speed <abbrgrp>
<abbr bid="B55">55</abbr>
<abbr bid="B56">56</abbr>
</abbrgrp>, in highly expressed bacterial proteins (r = -0.01, <it>P </it>= 0.8).</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Relationship between protein stability and mRNA expression</p></caption><text>
   <p><b>Relationship between protein stability and mRNA expression</b>. The experimentally measured stability data were obtained from the ProTherm database <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, and the expression data for <it>E. coli </it>were obtained from the study by Lu <it>et al</it>. <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. <b>(a) </b>Correlation between mRNA expression and melting temperature for 28 proteins (r = -0.14, <it>P </it>= 0.45). <b>(b) </b>Correlation between mRNA expression and folding free energy for 23 proteins (r = -0.08, <it>P </it>= 0.70). The dashed red line represents the linear regression between each variable and the natural logarithm of the expression values.</p>
</text><graphic file="gb-2010-11-9-r98-4" hint_layout="double"/></fig>
<p>Overall, the computational analysis described above suggests that, at least based on the currently available datasets, an increase in folding speed and/or protein stability for highly expressed bacterial proteins is unlikely to play a major role in constraining the protein molecular clock in <it>E. coli</it>.</p>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<p>The results presented here demonstrate that, at least in <it>E. coli</it>, the cost associated with the gratuitous expression of a protein is significantly higher than the additional toxicity cost incurred by destabilization or misfolding of the same amount of protein; by 'gratuitous' we imply here that the protein has no effect on fitness through its biological function. It is important to emphasize that our growth measurements are not sensitive enough to detect small fitness effects - for example, decreases in the growth rate on the order of 1% or less - and consequently we cannot rule out additional costs specifically related to misfolding toxicity <abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp>. In fact, a detailed study by Lindner <it>et al</it>. <abbrgrp>
<abbr bid="B58">58</abbr>
</abbrgrp> using time-lapse microscopy showed that the presence of protein aggregates in <it>E. coli </it>has an effect on growth rate at the level of individual cells. Nevertheless, our experiments do show that the misfolding toxicity cost is significantly smaller than other costs associated with protein expression.</p>
<p>We believe that the main expression costs specifically in this bacterium are related to translational efficiency and jamming of the cell's cytoplasm with useless proteins. Importantly, expression costs associated with amino acid waste, or the energy required for gratuitous expression, were recently shown by Stoebel <it>et al</it>. <abbrgrp>
<abbr bid="B59">59</abbr>
</abbrgrp> to play a relatively minor role. On the other hand, both gratuitous protein expression and suboptimal codons can significantly slow bacterial growth, for instance, by reducing the pool of free ribosomes in the cell <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B41">41</abbr>
</abbrgrp>. This effect will preferentially affect highly expressed genes bound by a relatively larger number of ribosomes. A gene with non-optimal codons will slow the rate of translation (speed of ribosomal motion) and thus titrate more ribosomes. A reduced pool of free ribosomes will necessarily slow expression of all bacterial genes and thus decrease the rate of biomass synthesis <abbrgrp>
<abbr bid="B60">60</abbr>
</abbrgrp>.</p>
<p>Interestingly, we observed that bacteria expressing two of the mutants (F758S and G353D) grew significantly faster than cells expressing native LacZ protein (Figure <figr fid="F2">2a</figr>), although still not as fast as uninduced <it>E. coli</it>. This intriguing result demonstrates that titration of ribosomes cannot be the only explanation for the costs associated with gratuitous protein synthesis. The F758S and G353D proteins had the lowest catalytic activities of all constructs (Table <tblr tid="T1">1</tblr>) and both mutants, as well as I141N, were found mostly in inclusion bodies. It is likely that the localization of the LacZ proteins to inclusion bodies prevents jamming of the cytoplasm and relieves effects associated with non-functional binding. It was previously shown that an asymmetric partition of inclusion bodies during cell division may result in a cell rejuvenation phenotype <abbrgrp>
<abbr bid="B58">58</abbr>
</abbrgrp>. We would like to emphasize that this result does not support the misfolding toxicity hypothesis, as these mutants grew faster than the strain expressing WT LacZ. Based on the growth rates of mutants primarily localized to inclusion bodies (V567D, F758S, I141N; average growth decrease 6.7%) and the proteins remaining in the cytoplasm (WT, V567D, A880E; average growth decrease 14%), one can conclude that effects of jamming and translational efficiency make approximately similar contributions to fitness.</p>
<p>An important separate question in the context of the mistranslation-induced misfolding hypothesis is whether phenotypic (transcriptional or translational) mutations can cause enough protein misfolding to be significantly cytotoxic. Although suboptimal codons are expected to substantially increase the translational error rate <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>, no correlation was observed between codon optimization and the fraction of properly folded GFP by Kudla <it>et al. </it>
<abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>. Even if relatively rare, phenotypic mutations can still be significantly damaging if they occur in functionally and structurally important sties. This may explain a well-established correlation between codon optimization and evolutionary conservation of corresponding protein sites <abbrgrp>
<abbr bid="B61">61</abbr>
<abbr bid="B62">62</abbr>
<abbr bid="B63">63</abbr>
</abbrgrp>. This correlation is not necessarily a consequence of selection against mistranslation-induced toxicity, and again may be primarily related to the loss of functional proteins and the cost of additional protein synthesis necessary to compensate for the misfolding. In fact, it has been reported that essential bacterial proteins have lower aggregation propensities than those predicted for non-essential proteins <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>.</p>
<p>While our study demonstrates that misfolding toxicity is unlikely to be a universally dominant factor connecting expression and the protein molecular clock in all species, we cannot rule out the possibility that toxicity may play an important role in other species. We note, however, that in higher organisms the correlations between mRNA expression and the protein molecular clock are generally much weaker than in some microbes. For example, Liao <it>et al</it>. <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp> demonstrated that expression plays a relatively minor role in constraining the molecular clock in mammalian species. Also, by comparing evolutionary rate of separate and fused protein domains in human and <it>Arabidopsis</it>, Wolf <it>et al</it>. <abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp> found a comparable contribution from expression and structural-functional constraints.</p>
<p>A number of elegant experimental studies have demonstrated a cytotoxic effect of several misfolded or marginally stable proteins in higher organisms <abbrgrp>
<abbr bid="B66">66</abbr>
<abbr bid="B67">67</abbr>
</abbrgrp>. For instance, several hundred mutations in the SOD1 protein were shown to result in aggregates associated with amyotrophic lateral sclerosis in humans <abbrgrp>
<abbr bid="B68">68</abbr>
</abbrgrp>; also, non-natural peptides have been used to induce cytotoxic aggregates of GFP in <it>C. elegans </it>
<abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp>. Although these studies directly demonstrate the importance of misfolding and aggregation for some specific proteins, the extent to which these effects dominate the molecular clock for all proteins in these and other species needs to be investigated and again compared to other contributing factors.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Our experimental results suggest that selection against toxic protein misfolding is unlikely to be the universal and dominant factor determining the rate of the protein molecular clock in all species. We demonstrate that, at least in <it>E. coli</it>, other factors associated with gratuitous protein synthesis, such as translational efficiency and possibly jamming of the cytoplasm, are likely to be the primary constraints. Our computational analyses also suggest a relatively weaker, but statistically significant, selection for increasing solubility and polarity in highly expressed <it>E. coli </it>proteins.</p>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>Strains and mutant generation</p>
</st>
<p>
<it>E. coli </it>K12 strain GP4 (W3102, XA 21Z, <it>lac</it>I<sup>q</sup>) was used in all experiments. <it>lacZ </it>was expressed from the IPTG-inducible <it>lac </it>promoter in plasmid PIV18 <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>; PIV18 is a pBR322 derivative that carries a mutation in the Shine Dalgarno sequence of the <it>lacZ </it>transcript that increases translation efficiency. Site directed mutagenesis was carried out using Stratagene's QuikChange Lightning kit (Stratagene, Cedar Creek, TX, USA). pBR322 was used as the empty plasmid control.</p>
</sec>
<sec>
<st>
<p>Growth curve analysis</p>
</st>
<p>For each construct, a sweep of colonies was grown overnight on Luria-Bertani (LB) liquid media supplemented with 100 &#956;g/ml ampicillin. Overnight cultures were diluted by a 1:100 factor and grown on M9 minimal media suplemented with 0.5% casaminoacids, 0.25 &#956;g/ml thiamine, 100 &#956;g/ml ampicillin and either 0.4% glycerol or acetate as carbon sources. We transfered 300 &#956;l of cells with an OD600 of 0.5 to flasks containing 5.5 ml of prewarmed media suplemented with the appropiate amount of IPTG. Two hours after induction, OD600 was measured every 45 minutes. Growth rate was determined as the regression line slope of time and the logarithm of OD600.</p>
</sec>
<sec>
<st>
<p>SDS-PAGE and western blotting</p>
</st>
<p>The equivalent of 200 &#956;l of cells at an OD600 of 0.7 was collected by centrifugation and lysed using Novagen's BugBuster (primary amine-free) Protein Extraction Reagent (Novagen, Merck, Darmstadt, Germany). Soluble proteins were retrieved after centrifugation of the lysed cells and aggregated proteins were then harvested following instructions for inclusion body purification described in the BugBuster reagent manual. Both fractions were saved in a 50 &#956;l volume including 10 &#956;l 4&#215; SDS loading buffer, boiled, and electrophoresed on a 10% SDS polyacrylamide gel. Gels were stained with Coomassie blue and scanned for analysis. For the analysis of total protein, cells were lysed in BugBuster reagent containing rLysozyme and boiled after addition of 4&#215; SDS loading buffer. Bands were quantified using the ImageJ program <abbrgrp>
<abbr bid="B70">70</abbr>
</abbrgrp>.</p>
<p>Protein samples separated by SDS-PAGE as described above were blotted overnight onto a nitrocellulose membrane and incubated with Anti-GroEL antibody produced in rabbit 1:10 000 (Sigma Aldrich, St Louis, MO, USA). Blots were blocked with 5% non-fat dry milk, incubated with 1:3,000 anti-rabbit horseradish peroxidase conjugate antibody and visualized with Amersham's ECL Plus Western Blotting Reagent (GE Healthcare, Munich, Germany).</p>
</sec>
<sec>
<st>
<p>Structural analysis of <it>E. coli </it>proteins</p>
</st>
<p>In the analysis we used 602 <it>E. coli </it>protein structures currently available in the PDB <abbrgrp>
<abbr bid="B71">71</abbr>
</abbrgrp>. To prevent sampling biases, we filtered available PDB entries so that no two protein structures used in the calculations had sequence identity higher than 90%; similar results were obtained without filtering. We defined buried residues as those with a solvent accessible area smaller than 16% <abbrgrp>
<abbr bid="B72">72</abbr>
<abbr bid="B73">73</abbr>
</abbrgrp>. Solvent accessibility was calculated by the DSSP <abbrgrp>
<abbr bid="B74">74</abbr>
</abbrgrp> program. The fraction of protein residues in loops was also calculated using DSSP. Two non-adjacent protein residues were considered to be in contact if any two of their non-hydrogen atoms were closer than 4.5 &#197; <abbrgrp>
<abbr bid="B75">75</abbr>
</abbrgrp>. The protein contact density was defined as the average number of non-adjacent contacts per residue. Contact order was calculated as (L &#215; N)<sup>-1 </sup>&#215; &#931;&#916;S<sub>ij</sub>, where N is the total number of contacts, L is the total number of residues in the protein and &#916;S<sub>ij</sub>, which is summed over all contacts, is the number of amino acids separating contacting residues <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp>. <it>In vitro </it>solubility data for <it>E. coli </it>proteins was obtained directly from the study of Niwa <it>et al</it>. <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Correlation of the synonymous (Ks) and non-synonymous (Ka) substitution rates with expression</p>
</st>
<p>Orthologous open reading frames and protein sequences from <it>E. coli </it>and <it>Salmonella enterica </it>were used to calculate Ks and Ka values. The <it>E. coli-Salmonella </it>orthologs were determined as bi-directional best hits using protein BLAST <abbrgrp>
<abbr bid="B76">76</abbr>
</abbrgrp>. Ka and Ks values were calculated using the maximum likelihood method implemented in the PAML package <abbrgrp>
<abbr bid="B77">77</abbr>
</abbrgrp>. The mRNA expression data reported by Lu <it>et al</it>. <abbrgrp>
<abbr bid="B78">78</abbr>
</abbrgrp> were used to calculate the correlations. For the analysis of duplicated genes, we defined duplicates as pairs of <it>E. coli </it>proteins having more than 40% sequence identity that could be aligned for at least 80% of their total length using BLAST. In the analysis of duplicates, we used expression data from 466 experiments in the Many Microbes Microarrays Database <abbrgrp>
<abbr bid="B79">79</abbr>
</abbrgrp>. We selected for the analysis only the pairs for which one paralog had higher expression values in more than 80% of the reported experiments.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>GF: green fluorescent protein; IPTG: isopropyl &#946;-D-1-thiogalactopyranoside; Ka: the rate of non-synonymous substitutions; Ks: the rate of synonymous substitutions; PDB: Protein Data Bank; WT: wild type.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>GP carried out the experiments and computational analyses. DV, MG and GP conceived the experiments and analyzed the results. DV and GP wrote the paper with revisions by MG. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We would like to thank Steen Pedersen (University of Copenhagen) for kindly providing the WT LacZ and pBR322 plasmids. We also thank Barry Honig for many helpful discussions on protein folding and stability. This work was supported in part by NIGMS grant GM079759 to DV, and the National Centers for Biomedical Computing U54CA121852 grant to Columbia University.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Evolutionary divergence and convergence in proteins.</p></title><aug><au><snm>Zuckerkandl</snm><fnm>E</fnm></au><au><snm>Pauling</snm><fnm>L</fnm></au></aug><source>Evolving Genes and Proteins</source><publisher>New York: Academic Press</publisher><editor>Bryson V, Vogel H</editor><pubdate>1965</pubdate><fpage>97</fpage><lpage>166</lpage></bibl><bibl id="B2"><title><p>Primary structure and evolution of cytochrome C.</p></title><aug><au><snm>Margoliash</snm><fnm>E</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1963</pubdate><volume>50</volume><fpage>672</fpage><lpage>679</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.50.4.672</pubid><pubid idtype="pmcid">221244</pubid><pubid idtype="pmpid">14077496</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Paleontological evidence to date the tree of life.</p></title><aug><au><snm>Benton</snm><fnm>MJ</fnm></au><au><snm>Donoghue</snm><fnm>PC</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>26</fpage><lpage>53</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msl150</pubid><pubid idtype="pmpid" link="fulltext">17047029</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><aug><au><snm>Gillespie</snm><fnm>JH</fnm></au></aug><source>The Causes of Molecular Evolution</source><publisher>New York: Oxford University Press</publisher><pubdate>1991</pubdate></bibl><bibl id="B5"><title><p>Molecular clock mirages.</p></title><aug><au><snm>Ayala</snm><fnm>FJ</fnm></au></aug><source>Bioessays</source><pubdate>1999</pubdate><volume>21</volume><fpage>71</fpage><lpage>75</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1521-1878(199901)21:1&lt;71::AID-BIES9&gt;3.0.CO;2-B</pubid><pubid idtype="pmpid" link="fulltext">10070256</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Evolutionary processes and evolutionary noise at the molecular level. I. Functional density in proteins.</p></title><aug><au><snm>Zuckerkandl</snm><fnm>E</fnm></au></aug><source>J Mol Evol</source><pubdate>1976</pubdate><volume>7</volume><fpage>167</fpage><lpage>183</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF01731487</pubid><pubid idtype="pmpid">933174</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Biochemical evolution.</p></title><aug><au><snm>Wilson</snm><fnm>AC</fnm></au><au><snm>Carlson</snm><fnm>SS</fnm></au><au><snm>White</snm><fnm>TJ</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>1977</pubdate><volume>46</volume><fpage>573</fpage><lpage>639</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.bi.46.070177.003041</pubid><pubid idtype="pmpid" link="fulltext">409339</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Why Is the correlation between gene importance and gene evolutionary rate so weak?</p></title><aug><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Zhang</snm><fnm>JZ</fnm></au></aug><source>PLoS Genet</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000329</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1000329</pubid><pubid idtype="pmcid">2605560</pubid><pubid idtype="pmpid">19132081</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Influence of metabolic network structure and function on enzyme evolution.</p></title><aug><au><snm>Vitkup</snm><fnm>D</fnm></au><au><snm>Kharchenko</snm><fnm>P</fnm></au><au><snm>Wagner</snm><fnm>A</fnm></au></aug><source>Genome Biol</source><pubdate>2006</pubdate><volume>7</volume><fpage>R39</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2006-7-5-r39</pubid><pubid idtype="pmcid">1779518</pubid><pubid idtype="pmpid">16684370</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Essential genes are more evolutionarily conserved than are nonessential genes in bacteria.</p></title><aug><au><snm>Jordan</snm><fnm>IK</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Genome Res</source><pubdate>2002</pubdate><volume>12</volume><fpage>962</fpage><lpage>968</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1383730</pubid><pubid idtype="pmpid">12045149</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Protein dispensability and rate of evolution.</p></title><aug><au><snm>Hirsh</snm><fnm>AE</fnm></au><au><snm>Fraser</snm><fnm>HB</fnm></au></aug><source>Nature</source><pubdate>2001</pubdate><volume>411</volume><fpage>1046</fpage><lpage>1049</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35082561</pubid><pubid idtype="pmpid" link="fulltext">11429604</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly.</p></title><aug><au><snm>Jordan</snm><fnm>IK</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2003</pubdate><volume>3</volume><fpage>1</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-3-5</pubid><pubid idtype="pmpid">12515583</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Evolutionary rate in the protein interaction network.</p></title><aug><au><snm>Fraser</snm><fnm>HB</fnm></au><au><snm>Hirsh</snm><fnm>AE</fnm></au><au><snm>Steinmetz</snm><fnm>LM</fnm></au><au><snm>Scharfe</snm><fnm>C</fnm></au><au><snm>Feldman</snm><fnm>MW</fnm></au></aug><source>Science</source><pubdate>2002</pubdate><volume>296</volume><fpage>750</fpage><lpage>752</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1068696</pubid><pubid idtype="pmpid" link="fulltext">11976460</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>An analysis of determinants of amino acids substitution rates in bacterial proteins.</p></title><aug><au><snm>Rocha</snm><fnm>EP</fnm></au><au><snm>Danchin</snm><fnm>A</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2004</pubdate><volume>21</volume><fpage>108</fpage><lpage>116</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh004</pubid><pubid idtype="pmpid" link="fulltext">14595100</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Systemic determinants of gene evolution and function.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Mol Syst Biol</source><pubdate>2005</pubdate><volume>1</volume><fpage>2005 0021</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/msb4100029</pubid><pubid idtype="pmcid">1681469</pubid><pubid idtype="pmpid">16729056</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Integrated assessment of genomic correlates of protein evolutionary rate.</p></title><aug><au><snm>Xia</snm><fnm>Y</fnm></au><au><snm>Franzosa</snm><fnm>EA</fnm></au><au><snm>Gerstein</snm><fnm>MB</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000413</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000413</pubid><pubid idtype="pmcid">2688033</pubid><pubid idtype="pmpid">19521505</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Highly expressed genes in yeast evolve slowly.</p></title><aug><au><snm>Pal</snm><fnm>C</fnm></au><au><snm>Papp</snm><fnm>B</fnm></au><au><snm>Hurst</snm><fnm>LD</fnm></au></aug><source>Genetics</source><pubdate>2001</pubdate><volume>158</volume><fpage>927</fpage><lpage>931</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1461684</pubid><pubid idtype="pmpid">11430355</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Why highly expressed proteins evolve slowly.</p></title><aug><au><snm>Drummond</snm><fnm>DA</fnm></au><au><snm>Bloom</snm><fnm>JD</fnm></au><au><snm>Adami</snm><fnm>C</fnm></au><au><snm>Wilke</snm><fnm>CO</fnm></au><au><snm>Arnold</snm><fnm>FH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2005</pubdate><volume>102</volume><fpage>14338</fpage><lpage>14343</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0504070102</pubid><pubid idtype="pmcid">1242296</pubid><pubid idtype="pmpid">16176987</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Synonymous codon usage, accuracy of translation, and gene length in <it>Caenorhabditis elegans</it>.</p></title><aug><au><snm>Marais</snm><fnm>G</fnm></au><au><snm>Duret</snm><fnm>L</fnm></au></aug><source>J Mol Evol</source><pubdate>2001</pubdate><volume>52</volume><fpage>275</fpage><lpage>280</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11428464</pubid></xrefbib></bibl><bibl id="B20"><title><p>The quest for the universals of protein evolution.</p></title><aug><au><snm>Rocha</snm><fnm>EP</fnm></au></aug><source>Trends Genet</source><pubdate>2006</pubdate><volume>22</volume><fpage>412</fpage><lpage>416</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2006.06.004</pubid><pubid idtype="pmpid" link="fulltext">16808987</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>An integrated view of protein evolution.</p></title><aug><au><snm>Pal</snm><fnm>C</fnm></au><au><snm>Papp</snm><fnm>B</fnm></au><au><snm>Lercher</snm><fnm>MJ</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><fpage>337</fpage><lpage>348</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1838</pubid><pubid idtype="pmpid">16619049</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution.</p></title><aug><au><snm>Drummond</snm><fnm>DA</fnm></au><au><snm>Wilke</snm><fnm>CO</fnm></au></aug><source>Cell</source><pubdate>2008</pubdate><volume>134</volume><fpage>341</fpage><lpage>352</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2008.05.042</pubid><pubid idtype="pmcid">2696314</pubid><pubid idtype="pmpid">18662548</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>The evolutionary consequences of erroneous protein synthesis.</p></title><aug><au><snm>Drummond</snm><fnm>DA</fnm></au><au><snm>Wilke</snm><fnm>CO</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2009</pubdate><volume>10</volume><fpage>715</fpage><lpage>724</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2662</pubid><pubid idtype="pmcid">2764353</pubid><pubid idtype="pmpid">19763154</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>The frequency of translational misreading errors in <it>E. coli </it>is largely determined by tRNA competition.</p></title><aug><au><snm>Kramer</snm><fnm>EB</fnm></au><au><snm>Farabaugh</snm><fnm>PJ</fnm></au></aug><source>RNA</source><pubdate>2007</pubdate><volume>13</volume><fpage>87</fpage><lpage>96</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.294907</pubid><pubid idtype="pmcid">1705757</pubid><pubid idtype="pmpid">17095544</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Missense translation errors in <it>Saccharomyces cerevisiae</it>.</p></title><aug><au><snm>Stansfield</snm><fnm>I</fnm></au><au><snm>Jones</snm><fnm>KM</fnm></au><au><snm>Herbert</snm><fnm>P</fnm></au><au><snm>Lewendon</snm><fnm>A</fnm></au><au><snm>Shaw</snm><fnm>WV</fnm></au><au><snm>Tuite</snm><fnm>MF</fnm></au></aug><source>J Mol Biol</source><pubdate>1998</pubdate><volume>282</volume><fpage>13</fpage><lpage>24</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1998.1976</pubid><pubid idtype="pmpid" link="fulltext">9733638</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>The lac Operon: A Short History of a Genetic Paradigm.</p></title><aug><au><snm>M&#252;ller-Hill</snm><fnm>B</fnm></au></aug><publisher>New York: Walter de Gruyter</publisher><pubdate>1996</pubdate></bibl><bibl id="B27"><title><p>Gratuitous overexpression of genes in <it>Escherichia coli </it>leads to growth inhibition and ribosome destruction.</p></title><aug><au><snm>Dong</snm><fnm>HJ</fnm></au><au><snm>Nilsson</snm><fnm>L</fnm></au><au><snm>Kurland</snm><fnm>CG</fnm></au></aug><source>J Bacteriol</source><pubdate>1995</pubdate><volume>177</volume><fpage>1497</fpage><lpage>1504</lpage><xrefbib><pubidlist><pubid idtype="pmcid">176765</pubid><pubid idtype="pmpid">7883706</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Optimality and evolutionary tuning of the expression level of a protein.</p></title><aug><au><snm>Dekel</snm><fnm>E</fnm></au><au><snm>Alon</snm><fnm>U</fnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>436</volume><fpage>588</fpage><lpage>592</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature03842</pubid><pubid idtype="pmpid" link="fulltext">16049495</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Genetic analysis of protein stability and function.</p></title><aug><au><snm>Pakula</snm><fnm>AA</fnm></au><au><snm>Sauer</snm><fnm>RT</fnm></au></aug><source>Annu Rev Genet</source><pubdate>1989</pubdate><volume>23</volume><fpage>289</fpage><lpage>310</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.ge.23.120189.001445</pubid><pubid idtype="pmpid" link="fulltext">2694933</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Structural and genetic analysis of the folding and function of T4 lysozyme.</p></title><aug><au><snm>Matthews</snm><fnm>BW</fnm></au></aug><source>FASEB J</source><pubdate>1996</pubdate><volume>10</volume><fpage>35</fpage><lpage>41</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">8566545</pubid></xrefbib></bibl><bibl id="B31"><title><p>CX, DPX and PRIDE: WWW servers for the analysis and comparison of protein 3D structures.</p></title><aug><au><snm>Vlahovicek</snm><fnm>K</fnm></au><au><snm>Pintar</snm><fnm>A</fnm></au><au><snm>Parthasarathi</snm><fnm>L</fnm></au><au><snm>Carugo</snm><fnm>O</fnm></au><au><snm>Pongor</snm><fnm>S</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><volume>33</volume><fpage>W252</fpage><lpage>W254</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki362</pubid><pubid idtype="pmcid">1160123</pubid><pubid idtype="pmpid">15980464</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure.</p></title><aug><au><snm>Capriotti</snm><fnm>E</fnm></au><au><snm>Fariselli</snm><fnm>P</fnm></au><au><snm>Casadio</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><volume>33</volume><fpage>W306</fpage><lpage>W310</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki375</pubid><pubid idtype="pmcid">1160136</pubid><pubid idtype="pmpid">15980478</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Synthesis of proteins in <it>Escherichia coli </it>is limited by the concentration of free ribosomes. Expression from reporter genes does not always reflect functional mRNA levels.</p></title><aug><au><snm>Vind</snm><fnm>J</fnm></au><au><snm>Sorensen</snm><fnm>MA</fnm></au><au><snm>Rasmussen</snm><fnm>MD</fnm></au><au><snm>Pedersen</snm><fnm>S</fnm></au></aug><source>J Mol Biol</source><pubdate>1993</pubdate><volume>231</volume><fpage>678</fpage><lpage>688</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1993.1319</pubid><pubid idtype="pmpid" link="fulltext">7685825</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><aug><au><snm>Sambrook</snm><fnm>J</fnm></au><au><snm>Russell</snm><fnm>DW</fnm></au></aug><source>Molecular Cloning: A Laboratory Manual</source><publisher>Woodbury, NY; CSHL Press</publisher><pubdate>2001</pubdate></bibl><bibl id="B35"><title><p>Gene expression response to misfolded protein as a screen for soluble recombinant protein.</p></title><aug><au><snm>Lesley</snm><fnm>SA</fnm></au><au><snm>Graziano</snm><fnm>J</fnm></au><au><snm>Cho</snm><fnm>CY</fnm></au><au><snm>Knuth</snm><fnm>MW</fnm></au><au><snm>Klock</snm><fnm>HE</fnm></au></aug><source>Protein Eng</source><pubdate>2002</pubdate><volume>15</volume><fpage>153</fpage><lpage>160</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/protein/15.2.153</pubid><pubid idtype="pmpid" link="fulltext">11917152</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Induction of a heat shock-like response by unfolded protein in <it>Escherichia coli</it>: Dependence on protein level not protein degradation.</p></title><aug><au><snm>Parsell</snm><fnm>DA</fnm></au><au><snm>Sauer</snm><fnm>RT</fnm></au></aug><source>Genes Dev</source><pubdate>1989</pubdate><volume>3</volume><fpage>1226</fpage><lpage>1232</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.3.8.1226</pubid><pubid idtype="pmpid" link="fulltext">2676724</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Sizing the holin lesion with an endolysin-beta-galactosidase fusion.</p></title><aug><au><snm>Wang</snm><fnm>IN</fnm></au><au><snm>Deaton</snm><fnm>J</fnm></au><au><snm>Young</snm><fnm>R</fnm></au></aug><source>J Bacteriol</source><pubdate>2003</pubdate><volume>185</volume><fpage>779</fpage><lpage>787</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.185.3.779-787.2003</pubid><pubid idtype="pmcid">142811</pubid><pubid idtype="pmpid">12533453</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Codon preferences in free-living microorganisms.</p></title><aug><au><snm>Andersson</snm><fnm>SG</fnm></au><au><snm>Kurland</snm><fnm>CG</fnm></au></aug><source>Microbiol Rev</source><pubdate>1990</pubdate><volume>54</volume><fpage>198</fpage><lpage>210</lpage><xrefbib><pubidlist><pubid idtype="pmcid">372768</pubid><pubid idtype="pmpid">2194095</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>The selection-mutation-drift theory of synonymous codon usage.</p></title><aug><au><snm>Bulmer</snm><fnm>M</fnm></au></aug><source>Genetics</source><pubdate>1991</pubdate><volume>129</volume><fpage>897</fpage><lpage>907</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1204756</pubid><pubid idtype="pmpid">1752426</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Gene expression and molecular evolution.</p></title><aug><au><snm>Akashi</snm><fnm>H</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>2001</pubdate><volume>11</volume><fpage>660</fpage><lpage>666</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0959-437X(00)00250-1</pubid><pubid idtype="pmpid" link="fulltext">11682310</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Coding-sequence determinants of gene expression in <it>Escherichia coli</it>.</p></title><aug><au><snm>Kudla</snm><fnm>G</fnm></au><au><snm>Murray</snm><fnm>AW</fnm></au><au><snm>Tollervey</snm><fnm>D</fnm></au><au><snm>Plotkin</snm><fnm>JB</fnm></au></aug><source>Science</source><pubdate>2009</pubdate><volume>324</volume><fpage>255</fpage><lpage>258</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1170160</pubid><pubid idtype="pmpid" link="fulltext">19359587</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>An integrated view of protein evolution.</p></title><aug><au><snm>Pal</snm><fnm>C</fnm></au><au><snm>Papp</snm><fnm>B</fnm></au><au><snm>Lercher</snm><fnm>MJ</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><fpage>337</fpage><lpage>348</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1838</pubid><pubid idtype="pmpid" link="fulltext">16619049</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Constraints and plasticity in genome and molecular-phenome evolution.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2010</pubdate><volume>11</volume><fpage>487</fpage><lpage>498</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2810</pubid><pubid idtype="pmpid" link="fulltext">20548290</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Protein misfolding, evolution and disease.</p></title><aug><au><snm>Dobson</snm><fnm>CM</fnm></au></aug><source>Trends Biochem Sci</source><pubdate>1999</pubdate><volume>24</volume><fpage>329</fpage><lpage>332</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0968-0004(99)01445-0</pubid><pubid idtype="pmpid" link="fulltext">10470028</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>A relationship between mRNA expression levels and protein solubility in <it>E. coli</it>.</p></title><aug><au><snm>Tartaglia</snm><fnm>GG</fnm></au><au><snm>Pechmann</snm><fnm>S</fnm></au><au><snm>Dobson</snm><fnm>CM</fnm></au><au><snm>Vendruscolo</snm><fnm>M</fnm></au></aug><source>J Mol Biol</source><pubdate>2009</pubdate><volume>388</volume><fpage>381</fpage><lpage>389</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jmb.2009.03.002</pubid><pubid idtype="pmpid" link="fulltext">19281824</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Protein aggregation profile of the bacterial cytosol.</p></title><aug><au><snm>de Groot</snm><fnm>NS</fnm></au><au><snm>Ventura</snm><fnm>S</fnm></au></aug><source>PLoS One</source><pubdate>2010</pubdate><volume>5</volume><fpage>e9383</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0009383</pubid><pubid idtype="pmcid">2828471</pubid><pubid idtype="pmpid">20195530</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Constraints imposed by non-functional protein-protein interactions on gene expression and proteome size.</p></title><aug><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Maslov</snm><fnm>S</fnm></au><au><snm>Shakhnovich</snm><fnm>EI</fnm></au></aug><source>Mol Syst Biol</source><pubdate>2008</pubdate><volume>4</volume><fpage>210</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/msb.2008.48</pubid><pubid idtype="pmcid">2538908</pubid><pubid idtype="pmpid">18682700</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of <it>Escherichia coli </it>proteins.</p></title><aug><au><snm>Niwa</snm><fnm>T</fnm></au><au><snm>Ying</snm><fnm>BW</fnm></au><au><snm>Saito</snm><fnm>K</fnm></au><au><snm>Jin</snm><fnm>W</fnm></au><au><snm>Takada</snm><fnm>S</fnm></au><au><snm>Ueda</snm><fnm>T</fnm></au><au><snm>Taguchi</snm><fnm>H</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>4201</fpage><lpage>4206</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0811922106</pubid><pubid idtype="pmcid">2657415</pubid><pubid idtype="pmpid">19251648</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Mutation of exposed hydrophobic amino acids to arginine to increase protein stability.</p></title><aug><au><snm>Strub</snm><fnm>C</fnm></au><au><snm>Alies</snm><fnm>C</fnm></au><au><snm>Lougarre</snm><fnm>A</fnm></au><au><snm>Ladurantie</snm><fnm>C</fnm></au><au><snm>Czaplicki</snm><fnm>J</fnm></au><au><snm>Fournier</snm><fnm>D</fnm></au></aug><source>BMC Biochem</source><pubdate>2004</pubdate><volume>5</volume><fpage>9</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2091-5-9</pubid><pubid idtype="pmcid">479692</pubid><pubid idtype="pmpid">15251041</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Energetics of protein folding.</p></title><aug><au><snm>Baldwin</snm><fnm>RL</fnm></au></aug><source>J Mol Biol</source><pubdate>2007</pubdate><volume>371</volume><fpage>283</fpage><lpage>301</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jmb.2007.05.078</pubid><pubid idtype="pmpid" link="fulltext">17582437</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Dominant forces in protein folding.</p></title><aug><au><snm>Dill</snm><fnm>KA</fnm></au></aug><source>Biochemistry</source><pubdate>1990</pubdate><volume>29</volume><fpage>7133</fpage><lpage>7155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/bi00483a001</pubid><pubid idtype="pmpid">2207096</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Free energy balance in protein folding.</p></title><aug><au><snm>Honig</snm><fnm>B</fnm></au><au><snm>Yang</snm><fnm>AS</fnm></au></aug><source>Adv Protein Chem</source><pubdate>1995</pubdate><volume>46</volume><fpage>27</fpage><lpage>58</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmpid">7771321</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Transproteomic evidence of a loop-deletion mechanism for enhancing protein thermostability.</p></title><aug><au><snm>Thompson</snm><fnm>MJ</fnm></au><au><snm>Eisenberg</snm><fnm>D</fnm></au></aug><source>J Mol Biol</source><pubdate>1999</pubdate><volume>290</volume><fpage>595</fpage><lpage>604</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1999.2889</pubid><pubid idtype="pmpid" link="fulltext">10390356</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>ProTherm, version 4.0: thermodynamic database for proteins and mutants.</p></title><aug><au><snm>Bava</snm><fnm>KA</fnm></au><au><snm>Gromiha</snm><fnm>MM</fnm></au><au><snm>Uedaira</snm><fnm>H</fnm></au><au><snm>Kitajima</snm><fnm>K</fnm></au><au><snm>Sarai</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><fpage>D120</fpage><lpage>121</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh082</pubid><pubid idtype="pmcid">308816</pubid><pubid idtype="pmpid">14681373</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Contact order revisited: influence of protein size on the folding rate.</p></title><aug><au><snm>Ivankov</snm><fnm>DN</fnm></au><au><snm>Garbuzynskiy</snm><fnm>SO</fnm></au><au><snm>Alm</snm><fnm>E</fnm></au><au><snm>Plaxco</snm><fnm>KW</fnm></au><au><snm>Baker</snm><fnm>D</fnm></au><au><snm>Finkelstein</snm><fnm>AV</fnm></au></aug><source>Protein Sci</source><pubdate>2003</pubdate><volume>12</volume><fpage>2057</fpage><lpage>2062</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1110/ps.0302503</pubid><pubid idtype="pmcid">2324001</pubid><pubid idtype="pmpid">12931003</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Contact order, transition state placement and the refolding rates of single domain proteins.</p></title><aug><au><snm>Plaxco</snm><fnm>KW</fnm></au><au><snm>Simons</snm><fnm>KT</fnm></au><au><snm>Baker</snm><fnm>D</fnm></au></aug><source>J Mol Biol</source><pubdate>1998</pubdate><volume>277</volume><fpage>985</fpage><lpage>994</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1998.1645</pubid><pubid idtype="pmpid" link="fulltext">9545386</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Marginal fitness contributions of nonessential genes in yeast.</p></title><aug><au><snm>Thatcher</snm><fnm>JW</fnm></au><au><snm>Shaw</snm><fnm>JM</fnm></au><au><snm>Dickinson</snm><fnm>WJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1998</pubdate><volume>95</volume><fpage>253</fpage><lpage>257</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.95.1.253</pubid><pubid idtype="pmcid">18192</pubid><pubid idtype="pmpid">9419362</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Asymmetric segregation of protein aggregates is associated with cellular aging and rejuvenation.</p></title><aug><au><snm>Lindner</snm><fnm>AB</fnm></au><au><snm>Madden</snm><fnm>R</fnm></au><au><snm>Demarez</snm><fnm>A</fnm></au><au><snm>Stewart</snm><fnm>EJ</fnm></au><au><snm>Taddei</snm><fnm>F</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>3076</fpage><lpage>3081</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0708931105</pubid><pubid idtype="pmcid">2268587</pubid><pubid idtype="pmpid">18287048</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>The cost of expression of <it>Escherichia coli </it>lac operon proteins is in the process, not in the products.</p></title><aug><au><snm>Stoebel</snm><fnm>DM</fnm></au><au><snm>Dean</snm><fnm>AM</fnm></au><au><snm>Dykhuizen</snm><fnm>DE</fnm></au></aug><source>Genetics</source><pubdate>2008</pubdate><volume>178</volume><fpage>1653</fpage><lpage>1660</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.107.085399</pubid><pubid idtype="pmcid">2278061</pubid><pubid idtype="pmpid">18245823</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Translation efficiency is determined by both codon bias and folding energy.</p></title><aug><au><snm>Tuller</snm><fnm>T</fnm></au><au><snm>Waldman</snm><fnm>YY</fnm></au><au><snm>Kupiec</snm><fnm>M</fnm></au><au><snm>Ruppin</snm><fnm>E</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2010</pubdate><volume>107</volume><fpage>3645</fpage><lpage>3650</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0909910107</pubid><pubid idtype="pmcid">2840511</pubid><pubid idtype="pmpid">20133581</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Synonymous codon usage in <it>Drosophila melanogaster</it>: natural selection and translational accuracy.</p></title><aug><au><snm>Akashi</snm><fnm>H</fnm></au></aug><source>Genetics</source><pubdate>1994</pubdate><volume>136</volume><fpage>927</fpage><lpage>935</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1205897</pubid><pubid idtype="pmpid">8005445</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Selection for minimization of translational frameshifting errors as a factor in the evolution of codon usage.</p></title><aug><au><snm>Huang</snm><fnm>Y</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au><au><snm>Przytycka</snm><fnm>TM</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>6799</fpage><lpage>6810</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp712</pubid><pubid idtype="pmcid">2777431</pubid><pubid idtype="pmpid">19745054</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Synonymous codon usage in <it>Escherichia coli</it>: selection for translational accuracy.</p></title><aug><au><snm>Stoletzki</snm><fnm>N</fnm></au><au><snm>Eyre-Walker</snm><fnm>A</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>374</fpage><lpage>381</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msl166</pubid><pubid idtype="pmpid" link="fulltext">17101719</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Impacts of gene essentiality, expression pattern, and gene compactness on the evolutionary rate of mammalian proteins.</p></title><aug><au><snm>Liao</snm><fnm>BY</fnm></au><au><snm>Scott</snm><fnm>NM</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2006</pubdate><volume>23</volume><fpage>2072</fpage><lpage>2080</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msl076</pubid><pubid idtype="pmpid" link="fulltext">16887903</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution.</p></title><aug><au><snm>Wolf</snm><fnm>MY</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2008</pubdate><volume>3</volume><fpage>40</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-3-40</pubid><pubid idtype="pmcid">2572155</pubid><pubid idtype="pmpid">18840284</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Inherent toxicity of aggregates implies a common mechanism for protein misfolding diseases.</p></title><aug><au><snm>Bucciantini</snm><fnm>M</fnm></au><au><snm>Giannoni</snm><fnm>E</fnm></au><au><snm>Chiti</snm><fnm>F</fnm></au><au><snm>Baroni</snm><fnm>F</fnm></au><au><snm>Formigli</snm><fnm>L</fnm></au><au><snm>Zurdo</snm><fnm>J</fnm></au><au><snm>Taddei</snm><fnm>N</fnm></au><au><snm>Ramponi</snm><fnm>G</fnm></au><au><snm>Dobson</snm><fnm>CM</fnm></au><au><snm>Stefani</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2002</pubdate><volume>416</volume><fpage>507</fpage><lpage>511</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/416507a</pubid><pubid idtype="pmpid" link="fulltext">11932737</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>Progressive disruption of cellular protein folding in models of polyglutamine diseases.</p></title><aug><au><snm>Gidalevitz</snm><fnm>T</fnm></au><au><snm>Ben-Zvi</snm><fnm>A</fnm></au><au><snm>Ho</snm><fnm>KH</fnm></au><au><snm>Brignull</snm><fnm>HR</fnm></au><au><snm>Morimoto</snm><fnm>RI</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>311</volume><fpage>1471</fpage><lpage>1474</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1124514</pubid><pubid idtype="pmpid" link="fulltext">16469881</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Exposure of hydrophobic surfaces initiates aggregation of diverse ALS-causing superoxide dismutase-1 mutants.</p></title><aug><au><snm>Munch</snm><fnm>C</fnm></au><au><snm>Bertolotti</snm><fnm>A</fnm></au></aug><source>J Mol Biol</source><pubdate>2010</pubdate><volume>399</volume><fpage>512</fpage><lpage>525</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jmb.2010.04.019</pubid><pubid idtype="pmcid">2927901</pubid><pubid idtype="pmpid">20399791</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Conversion of green fluorescent protein into a toxic, aggregation-prone protein by C-terminal addition of a short peptide.</p></title><aug><au><snm>Link</snm><fnm>CD</fnm></au><au><snm>Fonte</snm><fnm>V</fnm></au><au><snm>Hiester</snm><fnm>B</fnm></au><au><snm>Yerg</snm><fnm>J</fnm></au><au><snm>Ferguson</snm><fnm>J</fnm></au><au><snm>Csontos</snm><fnm>S</fnm></au><au><snm>Silverman</snm><fnm>MA</fnm></au><au><snm>Stein</snm><fnm>GH</fnm></au></aug><source>J Biol Chem</source><pubdate>2006</pubdate><volume>281</volume><fpage>1808</fpage><lpage>1816</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M505581200</pubid><pubid idtype="pmpid" link="fulltext">16239215</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Image Processing with ImageJ.</p></title><aug><au><snm>Abramoff</snm><fnm>MD</fnm></au><au><snm>Magelhaes</snm><fnm>PJ</fnm></au><au><snm>Ram</snm><fnm>SJ</fnm></au></aug><source>Biophotonics Int</source><pubdate>2004</pubdate><volume>11</volume><fpage>36</fpage><lpage>42</lpage></bibl><bibl id="B71"><title><p>The Protein Data Bank.</p></title><aug><au><snm>Berman</snm><fnm>HM</fnm></au><au><snm>Westbrook</snm><fnm>J</fnm></au><au><snm>Feng</snm><fnm>Z</fnm></au><au><snm>Gilliland</snm><fnm>G</fnm></au><au><snm>Bhat</snm><fnm>TN</fnm></au><au><snm>Weissig</snm><fnm>H</fnm></au><au><snm>Shindyalov</snm><fnm>IN</fnm></au><au><snm>Bourne</snm><fnm>PE</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><fpage>235</fpage><lpage>242</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.235</pubid><pubid idtype="pmcid">102472</pubid><pubid idtype="pmpid">10592235</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Impact of residue accessible surface area on the prediction of protein secondary structures.</p></title><aug><au><snm>Momen-Roknabadi</snm><fnm>A</fnm></au><au><snm>Sadeghi</snm><fnm>M</fnm></au><au><snm>Pezeshk</snm><fnm>H</fnm></au><au><snm>Marashi</snm><fnm>SA</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>357</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-357</pubid><pubid idtype="pmcid">2553345</pubid><pubid idtype="pmpid">18759992</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Conservation and prediction of solvent accessibility in protein families.</p></title><aug><au><snm>Rost</snm><fnm>B</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>Proteins</source><pubdate>1994</pubdate><volume>20</volume><fpage>216</fpage><lpage>226</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/prot.340200303</pubid><pubid idtype="pmpid">7892171</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.</p></title><aug><au><snm>Kabsch</snm><fnm>W</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>Biopolymers</source><pubdate>1983</pubdate><volume>22</volume><fpage>2577</fpage><lpage>2637</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bip.360221211</pubid><pubid idtype="pmpid">6667333</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Structural determinants of the rate of protein evolution in yeast.</p></title><aug><au><snm>Bloom</snm><fnm>JD</fnm></au><au><snm>Drummond</snm><fnm>DA</fnm></au><au><snm>Arnold</snm><fnm>FH</fnm></au><au><snm>Wilke</snm><fnm>CO</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2006</pubdate><volume>23</volume><fpage>1751</fpage><lpage>1761</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msl040</pubid><pubid idtype="pmpid" link="fulltext">16782762</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au><au><snm>Schaffer</snm><fnm>AA</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><fpage>3389</fpage><lpage>3402</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.17.3389</pubid><pubid idtype="pmcid">146917</pubid><pubid idtype="pmpid">9254694</pubid></pubidlist></xrefbib></bibl><bibl id="B77"><title><p>PAML: a program package for phylogenetic analysis by maximum likelihood.</p></title><aug><au><snm>Yang</snm><fnm>ZH</fnm></au></aug><source>Comput Appl Biosci</source><pubdate>1997</pubdate><volume>13</volume><fpage>555</fpage><lpage>556</lpage><xrefbib><pubid idtype="pmpid">9367129</pubid></xrefbib></bibl><bibl id="B78"><title><p>Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation.</p></title><aug><au><snm>Lu</snm><fnm>P</fnm></au><au><snm>Vogel</snm><fnm>C</fnm></au><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Yao</snm><fnm>X</fnm></au><au><snm>Marcotte</snm><fnm>EM</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2007</pubdate><volume>25</volume><fpage>117</fpage><lpage>124</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt1270</pubid><pubid idtype="pmpid" link="fulltext">17187058</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>Many Microbe Microarrays Database: uniformly normalized Affymetrix compendia with structured experimental metadata.</p></title><aug><au><snm>Faith</snm><fnm>JJ</fnm></au><au><snm>Driscoll</snm><fnm>ME</fnm></au><au><snm>Fusaro</snm><fnm>VA</fnm></au><au><snm>Cosgrove</snm><fnm>EJ</fnm></au><au><snm>Hayete</snm><fnm>B</fnm></au><au><snm>Juhn</snm><fnm>FS</fnm></au><au><snm>Schneider</snm><fnm>SJ</fnm></au><au><snm>Gardner</snm><fnm>TS</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><fpage>D866</fpage><lpage>870</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkm815</pubid><pubid idtype="pmcid">2238822</pubid><pubid idtype="pmpid">17932051</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>