Table 3

Comparison of duplications and potential sequence misassignment errors in genome assemblies


December 2001
April 2002
June 2002





Length
Duplications
Errors
Length
Duplications
Errors
Length
Duplications
Errors

Chr1
2,564
99
115
2,459
68
60
2,469
71
44
Chr2
2,413
70
45
2,468
79
57
2,407
69
23
Chr3
2,048
49
90
2,047
29
73
1,949
31
40
Chr4
1,914
39
44
1,970
51
49
1,920
41
25
Chr5
1,848
55
90
1896
55
112
1,810
45
23
Chr6
1,783
58
56
1,828
43
153
1,703
29
6
Chr7
1,638
130
48
1,605
119
27
1,574
101
2
Chr8
1,457
35
66
1,484
33
43
1,439
26
40
Chr9
1,330
83
38
1,291
75
27
1,324
83
16
Chr10
1,421
74
51
1,385
72
39
1,344
63
13
Chr11
1,414
51
84
1,341
43
36
1,374
44
20
Chr12
1,396
30
83
1,342
24
32
1,313
28
34
Chr13
1,151
29
21
1,136
22
15
1,134
19
1
Chr14
1,065
27
8
1,054
23
10
1,043
13
0
Chr15
991
62
30
1,000
54
20
992
56
17
Chr16
938
65
44
932
67
32
817
60
21
Chr17
839
66
46
811
46
29
801
53
21
Chr18
818
16
59
809
12
32
775
12
14
Chr19
769
45
28
730
34
12
600
32
3
Chr20
630
10
5
628
12
4
628
11
1
Chr21
446
18
3
446
16
2
446
15
0
Chr22
478
28
0
477
29
1
477
28
0
ChrX
1,517
54
40
1,518
61
23
1,492
55
22
ChrY
584
86
2
584
95
2
584
85
1
ChrUn
74
10
1
125
11
43
14
4
1
Total
31,526
1,290
1,097
31,366
1,175
932
30,431
1,074
389






























% range*

Duplication
Error

Duplication
Error

Duplication
Error
90-92%

135
0

137
0

117
0
92-94%

334
0

334
0

311
0
94-96%

391
0

382
0

367
0
96-98%

451
0

444
0

418
0
98-100%

884
1,097

724
932

665
389

*All numbers shown in the table are × 100 kb. *Sequence similarity between duplication by five levels of percent identity.

Cheung et al. Genome Biology 2003 4:R25   doi:10.1186/gb-2003-4-4-r25

Open Data