Table 2

Classification of GenBank ESTs with respect to their genome mapping coordinates in relation to the set of non-redundant spliced RefSeq sequences


EST clusters with overlap to exons of RefSeq genes*
EST clusters wholly intronic to RefSeq genes
EST clusters mapped outside of RefSeq genes
Total

Spliced EST contigs
16,241
8,013
10,144
34,398
Number of exons of spliced EST contigs (median)
10
2
3

Total number of spliced ESTs in contigs
3,616,644
162,841
241,049
4,020,534
Number of spliced ESTs per contig (median)
91
3
4

Unspliced EST contigs
4,030
55,139
29,198
88,367
Total number of unspliced ESTs in contigs
56,752
190,583
140,091
387,426
Number of unspliced ESTs per contig (median)
4
2
2

Spliced EST singlets
1,053
6,205
6,631
13,889
Unspliced EST singlets
3,539
121,091
71,662
196,292
Total non-redundant EST clusters (contigs + singlets)
24,863
190,448
117,635
332,946
Total ESTs
3,677,988
480,720
459,433
4,618,141

*The reference dataset comprises 15,783 spliced non-redundant RefSeq units plus the evidence of additional splice variants obtained for each transcriptional unit from all mRNA sequences mapping to the same locus.

Nakaya et al. Genome Biology 2007 8:R43   doi:10.1186/gb-2007-8-3-r43