Genome Biology

official impact factor 6.89

Open Access Highly Access Research

Promoter features related to tissue specificity as measured by Shannon entropy

Jonathan Schug1*, Winfried-Paul Schuller2, Claudia Kappen2, J Michael Salbaum2, Maja Bucan3 and Christian J Stoeckert1

Author Affiliations

1 Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104, USA

2 Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE 68198, USA

3 Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA

For all author emails, please log on.

Genome Biology 2005, 6:R33 doi:10.1186/gb-2005-6-4-r33

Published: 29 March 2005

Additional files

Additional File 1:

A table showing H and Q values for all normal human tissues in the GNF-GEA dataset. H and Q values for all normal tissues in the GNF-GEA dataset for human using both the original MAS4 quantification and our RMA re-quantification. The RMA data were normalized to yield common medians of 3.75 prior to the H and Q calculation. The data for each tissue are placed in separate worksheets. Each worksheet contains H- and Q-values, the expression value of the gene in the worksheet's tissue, and its maximum expression across all tissues in the file, the gene symbol, RefSeq, SwissProt, and Unigene ID, and a description. The rows in each worksheet are sorted by increasing values of Q using the RMA data. Thus the top of each worksheet displays the genes most specific to that worksheet's tissue.

Format: XLS Size: 69.9MB Download file

This file can be viewed with: Microsoft Excel Viewer

Open Data

Additional File 2:

A table showing H and Q values for all normal mouse tissues in the GNF-GEA dataset. H and Q values for all normal tissues in the GNF-GEA dataset for mouse using both the original MAS4 quantification and our RMA re-quantification. The RMA data were normalized to yield common medians of 3.22 prior to the H and Q calculation. The data for each tissue are placed in separate worksheets. Each worksheet contains H- and Q-values, the expression value of the gene in the worksheet's tissue, and its maximum expression across all tissues in the file, the gene symbol, RefSeq, SwissProt, and Unigene ID, and a description. The rows in each worksheet are sorted by increasing values of Q using the RMA data. Thus the top of each worksheet displays the genes most specific to that worksheet's tissue.

Format: XLS Size: 105.6MB Download file

This file can be viewed with: Microsoft Excel Viewer

Open Data