Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessMethod

Identifying repeat domains in large genomes

Degui Zhi1 email, Benjamin J Raphael2 email, Alkes L Price3 email, Haixu Tang4 email and Pavel A Pevzner2 email

Bioinformatics Program, University of California, San Diego, CA 92093-0419, USA

Department of Computer Science and Engineering, University of California, San Diego, CA 92093-0114, USA

Department of Genetics, Harvard Medical School, Boston, MA 02115, USA

School of Informatics and Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN 47408, USA

author email corresponding author email

Genome Biology 2006, 7:R7doi:10.1186/gb-2006-7-1-r7

Published: 31 January 2006

Subject areas: Bioinformatics, Evolution

Abstract

We present a graph-based method for the analysis of repeat families in a repeat library. We build a repeat domain graph that decomposes a repeat library into repeat domains, short subsequences shared by multiple repeat families, and reveals the mosaic structure of repeat families. Our method recovers documented mosaic repeat structures and suggests additional putative ones. Our method is useful for elucidating the evolutionary history of repeats and annotating de novo generated repeat libraries.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.