Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessResearch

The genome of Rhizobium leguminosarum has recognizable core and accessory components

J Peter W Young1 email, Lisa C Crossman2 email, Andrew WB Johnston3 email, Nicholas R Thomson2 email, Zara F Ghazoui1 email, Katherine H Hull1 email, Margaret Wexler3 email, Andrew RJ Curson3 email, Jonathan D Todd3 email, Philip S Poole4 email, Tim H Mauchline4 email, Alison K East4 email, Michael A Quail2 email, Carol Churcher2 email, Claire Arrowsmith2 email, Inna Cherevach2 email, Tracey Chillingworth2 email, Kay Clarke2 email, Ann Cronin2 email, Paul Davis2 email, Audrey Fraser2 email, Zahra Hance2 email, Heidi Hauser2 email, Kay Jagels2 email, Sharon Moule2 email, Karen Mungall2 email, Halina Norbertczak2 email, Ester Rabbinowitsch2 email, Mandy Sanders2 email, Mark Simmonds2 email, Sally Whitehead2 email and Julian Parkhill2 email

1Department of Biology, University of York, York, UK

2The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK

3School of Biological Sciences, University of East Anglia, Norwich, UK

4School of Biological Sciences, University of Reading, Reading, UK

author email corresponding author email

Genome Biology 2006, 7:R34doi:10.1186/gb-2006-7-4-r34

Published: 26 April 2006

Subject areas: Genome studies, Microbiology and parasitology

Abstract

Background

Rhizobium leguminosarum is an α-proteobacterial N2-fixing symbiont of legumes that has been the subject of more than a thousand publications. Genes for the symbiotic interaction with plants are well studied, but the adaptations that allow survival and growth in the soil environment are poorly understood. We have sequenced the genome of R. leguminosarum biovar viciae strain 3841.

Results

The 7.75 Mb genome comprises a circular chromosome and six circular plasmids, with 61% G+C overall. All three rRNA operons and 52 tRNA genes are on the chromosome; essential protein-encoding genes are largely chromosomal, but most functional classes occur on plasmids as well. Of the 7,263 protein-encoding genes, 2,056 had orthologs in each of three related genomes (Agrobacterium tumefaciens, Sinorhizobium meliloti, and Mesorhizobium loti), and these genes were over-represented in the chromosome and had above average G+C. Most supported the rRNA-based phylogeny, confirming A. tumefaciens to be the closest among these relatives, but 347 genes were incompatible with this phylogeny; these were scattered throughout the genome but were over-represented on the plasmids. An unexpectedly large number of genes were shared by all three rhizobia but were missing from A. tumefaciens.

Conclusion

Overall, the genome can be considered to have two main components: a 'core', which is higher in G+C, is mostly chromosomal, is shared with related organisms, and has a consistent phylogeny; and an 'accessory' component, which is sporadic in distribution, lower in G+C, and located on the plasmids and chromosomal islands. The accessory genome has a different nucleotide composition from the core despite a long history of coexistence.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.