Table 3

Comparison of duplications and potential sequence misassignment errors in genome assemblies

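| Chromosome | December 2001 Length | December 2001 Duplications | December 2001 Errors | April 2002 Length | April 2002 Duplications | April 2002 Errors | June 2002 Length | June 2002 Duplications | June 2002 Errors |
|---|---|---|---|---|---|---|---|---|---|
| Chr1 | 2,564 | 99 | 115 | 2,459 | 68 | 60 | 2,469 | 71 | 44 |
| Chr2 | 2,413 | 70 | 45 | 2,468 | 79 | 57 | 2,407 | 69 | 23 |
| Chr3 | 2,048 | 49 | 90 | 2,047 | 29 | 73 | 1,949 | 31 | 40 |
| Chr4 | 1,914 | 39 | 44 | 1,970 | 51 | 49 | 1,920 | 41 | 25 |
| Chr5 | 1,848 | 55 | 90 | 1,896 | 55 | 112 | 1,810 | 45 | 23 |
| Chr6 | 1,783 | 58 | 56 | 1,828 | 43 | 153 | 1,703 | 29 | 6 |
| Chr7 | 1,638 | 130 | 48 | 1,605 | 119 | 27 | 1,574 | 101 | 2 |
| Chr8 | 1,457 | 35 | 66 | 1,484 | 33 | 43 | 1,439 | 26 | 40 |
| Chr9 | 1,330 | 83 | 38 | 1,291 | 75 | 27 | 1,324 | 83 | 16 |
| Chr10 | 1,421 | 74 | 51 | 1,385 | 72 | 39 | 1,344 | 63 | 13 |
| Chr11 | 1,414 | 51 | 84 | 1,341 | 43 | 36 | 1,374 | 44 | 20 |
| Chr12 | 1,396 | 30 | 83 | 1,342 | 24 | 32 | 1,313 | 28 | 34 |
| Chr13 | 1,151 | 29 | 21 | 1,136 | 22 | 15 | 1,134 | 19 | 1 |
| Chr14 | 1,065 | 27 | 8 | 1,054 | 23 | 10 | 1,043 | 13 | 0 |
| Chr15 | 991 | 62 | 30 | 1,000 | 54 | 20 | 992 | 56 | 17 |
| Chr16 | 938 | 65 | 44 | 932 | 67 | 32 | 817 | 60 | 21 |
| Chr17 | 839 | 66 | 46 | 811 | 46 | 29 | 801 | 53 | 21 |
| Chr18 | 818 | 16 | 59 | 809 | 12 | 32 | 775 | 12 | 14 |
| Chr19 | 769 | 45 | 28 | 730 | 34 | 12 | 600 | 32 | 3 |
| Chr20 | 630 | 10 | 5 | 628 | 12 | 4 | 628 | 11 | 1 |
| Chr21 | 446 | 18 | 3 | 446 | 16 | 2 | 446 | 15 | 0 |
| Chr22 | 478 | 28 | 0 | 477 | 29 | 1 | 477 | 28 | 0 |
| ChrX | 1,517 | 54 | 40 | 1,518 | 61 | 23 | 1,492 | 55 | 22 |
| ChrY | 584 | 86 | 2 | 584 | 95 | 2 | 584 | 85 | 1 |
| ChrUn | 74 | 10 | 1 | 125 | 11 | 43 | 14 | 4 | 1 |
| Total | 31,526 | 1,290 | 1,097 | 31,366 | 1,175 | 932 | 30,431 | 1,074 | 389 |

| % range* | December 2001 Duplication | December 2001 Error | April 2002 Duplication | April 2002 Error | June 2002 Duplication | June 2002 Error |
|---|---|---|---|---|---|---|
| 90-92% | 135 | 0 | 137 | 0 | 117 | 0 |
| 92-94% | 334 | 0 | 334 | 0 | 311 | 0 |
| 94-96% | 391 | 0 | 382 | 0 | 367 | 0 |
| 96-98% | 451 | 0 | 444 | 0 | 418 | 0 |
| 98-100% | 884 | 1,097 | 724 | 932 | 665 | 389 |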

All numbers shown in the table are in units of 100 kb.
*Sequence similarity between duplications, grouped into five levels of percent identity.
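To compare the three assemblies on a common scale, the Total row can be expressed as a share of each assembly's total length. The following is a minimal sketch, not part of the original analysis: the names `TOTALS`, `percent_of_length`, and `summarize` are illustrative assumptions, and the figures are copied from the Total row of the table (units of 100 kb).

```python
# Total row of Table 3, in units of 100 kb: (length, duplications, errors).
# Values are transcribed from the table; the names below are hypothetical.
TOTALS = {
    "December 2001": (31526, 1290, 1097),
    "April 2002": (31366, 1175, 932),
    "June 2002": (30431, 1074, 389),
}


def percent_of_length(part, length):
    """Return `part` as a percentage of `length`."""
    return 100.0 * part / length


def summarize(totals):
    """Map each assembly to (duplication %, error %) of its total length."""
    return {
        name: (percent_of_length(dup, length), percent_of_length(err, length))
        for name, (length, dup, err) in totals.items()
    }


if __name__ == "__main__":
    for name, (dup_pct, err_pct) in summarize(TOTALS).items():
        print(f"{name}: duplications {dup_pct:.2f}%, errors {err_pct:.2f}%")
```

Normalizing by length is only one way to read the table. It is used here because the three assemblies differ slightly in total length, so raw counts are not directly comparable.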

Cheung et al. Genome Biology 2003 4:R25   doi:10.1186/gb-2003-4-4-r25
