Genome Biology

official impact factor 6.89

This article has not been peer reviewed.

Deposited research article

Improving SAGE di-tag processing

Jacques Colinge and Georg Feger*

Author Affiliations

Serono Pharmaceutical Research Institute, Ch. des Aulx 14, CH-1228 Plan-les-Ouates, Switzerland

For all author emails, please log on.

Genome Biology 2001, 2:preprint0002-preprint0002.10 doi:10.1186/gb-2001-2-3-preprint0002


This is the first version of this article to be made available publicly, and no other version is available at present. The article was submitted to Genome Biology for peer review.

Published: 22 February 2001

Abstract

Background

SAGE is a genome-wide method for obtaining gene expression profiles. It generates tags of 10 nucleotides in length, which are assumed to determine the corresponding gene transcript. In practice however, this is not always sufficient for uniquely identifying a gene.

Results

We propose an improved processing of SAGE sequences that allows us to obtain one extra base for reasonably abundant tags. This method includes a statistical test for controlling the relevance of extra base predictions.

Conclusions

The improved SAGE sequence processing we present reduces the uncertainty in SAGE tag to gene mapping and can be applied to any SAGE library.