Genome Biology

official impact factor 6.89

Open Access Research

Optimality of the genetic code with respect to protein stability and amino-acid frequencies

Dimitri Gilis1*, Serge Massar3, Nicolas J Cerf2 and Marianne Rooman3,1

Author Affiliations

1 Biomolecular Engineering, Université Libre de Bruxelles, ave F D Roosevelt, 1050 Bruxelles, Belgium

2 Ecole Polytechnique, Université Libre de Bruxelles, ave F D Roosevelt, 1050 Bruxelles, Belgium

3 Service de Physique Théorique, Université Libre de Bruxelles, Boulevard du Triomphe, 1050 Bruxelles, Belgium

For all author emails, please log on.

Genome Biology 2001, 2:research0049-research0049.12 doi:10.1186/gb-2001-2-11-research0049

Published: 24 October 2001

Abstract

Background

The genetic code is known to be efficient in limiting the effect of mistranslation errors. A misread codon often codes for the same amino acid or one with similar biochemical properties, so the structure and function of the coded protein remain relatively unaltered. Previous studies have attempted to address this question quantitatively, by estimating the fraction of randomly generated codes that do better than the genetic code in respect of overall robustness. We extended these results by investigating the role of amino-acid frequencies in the optimality of the genetic code.

Results

We found that taking the amino-acid frequency into account decreases the fraction of random codes that beat the natural code. This effect is particularly pronounced when more refined measures of the amino-acid substitution cost are used than hydrophobicity. To show this, we devised a new cost function by evaluating in silico the change in folding free energy caused by all possible point mutations in a set of protein structures. With this function, which measures protein stability while being unrelated to the code's structure, we estimated that around two random codes in a billion (109) are fitter than the natural code. When alternative codes are restricted to those that interchange biosynthetically related amino acids, the genetic code appears even more optimal.

Conclusions

These results lead us to discuss the role of amino-acid frequencies and other parameters in the genetic code's evolution, in an attempt to propose a tentative picture of primitive life.