Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessMethod

Clustering analysis of SAGE data using a Poisson approach

Li Cai1* email, Haiyan Huang2,5* email, Seth Blackshaw3,6, Jun S Liu4, Connie Cepko3 and Wing H Wong2,4 email

Department of Research Computing, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA

Department of Biostatistics, Harvard School of Public Health, 66 Huntington Avenue, Boston, MA 02115, USA

Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, USA

Department of Statistics, Harvard University, Science Center, 1 Oxford Street, Cambridge, MA 02138, USA

Current address: Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, CA 94720, USA

Current address: Department of Neuroscience, Johns Hopkins University School of Medicine, 773 N Broadway Ave, Baltimore, MD 21287, USA

author email corresponding author email* Contributed equally

Genome Biology 2004, 5:R51doi:10.1186/gb-2004-5-7-r51

Published: 29 June 2004

Subject areas: Bioinformatics, Methods, Genome studies, Neurobiology

Abstract

Serial analysis of gene expression (SAGE) data have been poorly exploited by clustering analysis owing to the lack of appropriate statistical methods that consider their specific properties. We modeled SAGE data by Poisson statistics and developed two Poisson-based distances. Their application to simulated and experimental mouse retina data show that the Poisson-based distances are more appropriate and reliable for analyzing SAGE data compared to other commonly used distances or similarity measures such as Pearson correlation or Euclidean distance.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.