Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessResearch

The proteome of Toxoplasma gondii: integration with the genome provides novel insights into gene expression and annotation

Dong Xia1 email, Sanya J Sanderson1 email, Andrew R Jones1 email, Judith H Prieto2 email, John R Yates2 email, Elizabeth Bromley3 email, Fiona M Tomley3 email, Kalpana Lal4 email, Robert E Sinden4 email, Brian P Brunk5 email, David S Roos5 email and Jonathan M Wastling1,6 email

1Department of Pre-clinical Veterinary Science, Faculty of Veterinary Science, University of Liverpool, Liverpool L69 7ZJ, UK

2Department of Cell Biology, The Scripps Research Institute, North Torrey Pines Road, La Jolla, CA 92037, USA

3Division of Microbiology, Institute for Animal Health, Compton, Berkshire, RG20 7NN, UK

4The Division of Cell and Molecular Biology, Imperial College London, London, SW7 2AZ, UK

5Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA

6Veterinary Pathology, Faculty of Veterinary Science, University of Liverpool, Liverpool L69 7ZJ, UK

author email corresponding author email

Genome Biology 2008, 9:R116doi:10.1186/gb-2008-9-7-r116

Published: 21 July 2008

Subject areas: Genome studies, Microbiology and parasitology, Model organisms

Abstract

Background

Although the genomes of many of the most important human and animal pathogens have now been sequenced, our understanding of the actual proteins expressed by these genomes and how well they predict protein sequence and expression is still deficient. We have used three complementary approaches (two-dimensional electrophoresis, gel-liquid chromatography linked tandem mass spectrometry and MudPIT) to analyze the proteome of Toxoplasma gondii, a parasite of medical and veterinary significance, and have developed a public repository for these data within ToxoDB, making for the first time proteomics data an integral part of this key genome resource.

Results

The draft genome for Toxoplasma predicts around 8,000 genes with varying degrees of confidence. Our data demonstrate how proteomics can inform these predictions and help discover new genes. We have identified nearly one-third (2,252) of all the predicted proteins, with 2,477 intron-spanning peptides providing supporting evidence for correct splice site annotation. Functional predictions for each protein and key pathways were determined from the proteome. Importantly, we show evidence for many proteins that match alternative gene models, or previously unpredicted genes. For example, approximately 15% of peptides matched more convincingly to alternative gene models. We also compared our data with existing transcriptional data in which we highlight apparent discrepancies between gene transcription and protein expression.

Conclusion

Our data demonstrate the importance of protein data in expression profiling experiments and highlight the necessity of integrating proteomic with genomic data so that iterative refinements of both annotation and expression models are possible.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.