Genome Biology

official impact factor 6.89

Open Access Method

A method to assess compositional bias in biological sequences and its application to prion-like glutamine/asparagine-rich domains in eukaryotic proteomes

Paul M Harrison* and Mark Gerstein

Author Affiliations

Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520-8114, USA

For all author emails, please log on.

Genome Biology 2003, 4:R40 doi:10.1186/gb-2003-4-6-r40

Published: 30 May 2003

Additional files

Additional data file 1:

The abundance of biases counted up in different ways for different bias probability thresholds

Format: TXT Size: 3KB Download file

Open Data

Additional data file 2:

A table showing the number of biased regions for all the eukaryotes (for a uniform probability model)

Format: TXT Size: 1KB Download file

Open Data

Additional data file 3:

The coordinates of S. cerevisiae (gln+asn)-rich domains

Format: TAB Size: 6KB Download file

Open Data

Additional data file 4:

The coordinates of S. pombe (gln+asn)-rich domains

Format: TAB Size: 1KB Download file

Open Data

Additional data file 5:

The coordinates of C. elegans (gln+asn)-rich domains

Format: TAB Size: 12KB Download file

Open Data

Additional data file 6:

The coordinates of Arabidopsis (gln+asn)-rich domains

Format: TAB Size: 8KB Download file

Open Data

Additional data file 7:

The coordinates of Drosophila (gln+asn)-rich domains

Format: TAB Size: 61KB Download file

Open Data

Additional data file 8:

The coordinates of human (gln+asn)-rich domains

Format: TAB Size: 9KB Download file

Open Data

Additional data file 9:

The sequence of the S. cerevisiae proteome

Format: FASTA Size: 3.1MB Download file

Open Data

Additional data file 10:

The sequence of the S. pombe proteome

Format: FASTA Size: 2.7MB Download file

Open Data

Additional data file 11:

The sequence of the C.elegans proteome

Format: FASTA Size: 10.2MB Download file

Open Data

Additional data file 12:

The sequence of the Arabidopsis> proteome

Format: FASTA Size: 11.7MB Download file

Open Data

Additional data file 13:

The sequence of the Drosophila proteome

Format: FASTA Size: 8.2MB Download file

Open Data

Additional data file 14:

The sequence of the human proteome

Format: FASTA Size: 11.9MB Download file

Open Data