Genome Biology

official impact factor 6.89

Open Access Highly Access Research

Computational discovery of sense-antisense transcription in the human and mouse genomes

Jay Shendure and George M Church*

Author Affiliations

Department of Genetics, Harvard Medical School, Boston, MA 02115, USA

For all author emails, please log on.

Genome Biology 2002, 3:research0044-research0044.14 doi:10.1186/gb-2002-3-9-research0044

Published: 22 August 2002

Abstract

Background

Overlapping but oppositely oriented transcripts have the potential to form sense-antisense perfect double-stranded (ds) RNA duplexes. Over recent years, the number and variety of examples of mammalian gene-regulatory phenomena in which endogenous dsRNA duplexes have been proposed or demonstrated to participate has greatly increased. These include genomic imprinting, RNA interference, translational regulation, alternative splicing, X-inactivation and RNA editing. We computationally mined public mouse and human expressed sequence tag (EST) databases to search for additional examples of bidirectionally transcribed genomic regions.

Results

Our bioinformatics approach identified over 217 candidate overlapping transcriptional units, almost all of which are novel. From experimental validation of a subset of our predictions by orientation-specific RT-PCR, we estimate that our methodology has a specificity of 84% or greater. In many cases, regions of sense-antisense overlap within the 5'- or 3'-untranslated regions of a given transcript correlate with genomic patterns of mouse-human conservation.

Conclusions

Our results, in conjunction with the literature, bring the total number of predicted and validated examples of overlapping but oppositely oriented transcripts to over 300. Several of these cases support the hypothesis that a subset of the instances of substantial mouse-human conservation in the 5' and 3' UTRs of transcripts might be explained in part by functionality of an overlapping transcriptional unit.