Email updates

Keep up to date with the latest news and content from Genome Biology and BioMed Central.

Open Access Research

Functional associations of proteins in entire genomes by means of exhaustive detection of gene fusions

Anton J Enright and Christos A Ouzounis*

Author Affiliations

Computational Genomics Group, European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK

For all author emails, please log on.

Genome Biology 2001, 2:research0034-research0034.7  doi:10.1186/gb-2001-2-9-research0034

Published: 28 August 2001

Abstract

Background

It has recently been shown that the detection of gene fusion events across genomes can be used for predicting functional associations of proteins, including physical interaction or complex formation. To obtain such predictions we have made an exhaustive search for gene fusion events within 24 available completely sequenced genomes.

Results

Each genome was used as a query against the remaining 23 complete genomes to detect gene fusion events. Using an improved, fully automatic protocol, a total of 7,224 single-domain proteins that are components of gene fusions in other genomes were detected, many of which were identified for the first time. The total number of predicted pairwise functional associations is 39,730 for all genomes. Component pairs were identified by virtue of their similarity to 2,365 multidomain composite proteins. We also show for the first time that gene fusion is a complex evolutionary process with a number of contributory factors, including paralogy, genome size and phylogenetic distance. On average, 9% of genes in a given genome appear to code for single-domain, component proteins predicted to be functionally associated. These proteins are detected by an additional 4% of genes that code for fused, composite proteins.

Conclusions

These results provide an exhaustive set of functionally associated genes and also delineate the power of fusion analysis for the prediction of protein interactions.