Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessResearch

Motif composition, conservation and condition-specificity of single and alternative transcription start sites in the Drosophila genome

Elizabeth A Rach1 email, Hsiang-Yu Yuan1 email, William H Majoros2 email, Pavel Tomancak3 email and Uwe Ohler2,4,5 email

Program in Computational Biology and Bioinformatics, Duke University, Science Drive, Durham, NC 27708, USA

Institute for Genome Sciences and Policy, Duke University, Science Drive, Durham, NC 27708, USA

Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse, Dresden 01307, Germany

Department of Biostatistics and Bioinformatics, Duke University, Duke University School of Medicine, Erwin Road, Durham NC 27710, USA

Department of Computer Science, Duke University, Durham, NC 27708, USA

author email corresponding author email

Genome Biology 2009, 10:R73doi:10.1186/gb-2009-10-7-r73

Published: 9 July 2009

Subject areas: Bioinformatics, Development, Molecular biology

Abstract

Background

Transcription initiation is a key component in the regulation of gene expression. mRNA 5' full-length sequencing techniques have enhanced our understanding of mammalian transcription start sites (TSSs), revealing different initiation patterns on a genomic scale.

Results

To identify TSSs in Drosophila melanogaster, we applied a hierarchical clustering strategy on available 5' expressed sequence tags (ESTs) and identified a high quality set of 5,665 TSSs for approximately 4,000 genes. We distinguished two initiation patterns: 'peaked' TSSs, and 'broad' TSS cluster groups. Peaked promoters were found to contain location-specific sequence elements; conversely, broad promoters were associated with non-location-specific elements. In alignments across other Drosophila genomes, conservation levels of sequence elements exceeded 90% within the melanogaster subgroup, but dropped considerably for distal species. Elements in broad promoters had lower levels of conservation than those in peaked promoters. When characterizing the distributions of ESTs, 64% of TSSs showed distinct associations to one out of eight different spatiotemporal conditions. Available whole-genome tiling array time series data revealed different temporal patterns of embryonic activity across the majority of genes with distinct alternative promoters. Many genes with maternally inherited transcripts were found to have alternative promoters utilized later in development. Core promoters of maternally inherited transcripts showed differences in motif composition compared to zygotically active promoters.

Conclusions

Our study provides a comprehensive map of Drosophila TSSs and the conditions under which they are utilized. Distinct differences in motif associations with initiation pattern and spatiotemporal utilization illustrate the complex regulatory code of transcription initiation.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.