Table 1 |
|||
|
Types of error |
|||
|
Error type |
Number of occurrences |
Percent of errors |
Error rate |
|
|
|||
|
Insertions |
58,337 |
36% |
0.18% |
|
Homopolymer extension and CAFIE |
32,858 |
20% |
0.10% |
|
Not associated with homopolymers |
25,479 |
16% |
0.08% |
|
Deletions |
43,107 |
27% |
0.13% |
|
Incomplete homopolymer extension |
13,868 |
9% |
0.04% |
|
Not associated with homopolymers |
29,239 |
18% |
0.09% |
|
Mismatches |
25,281 |
16% |
0.08% |
|
Homopolymer extension and CAFIE |
16,725 |
10% |
0.05% |
|
Not associated with homopolymers |
8,556 |
5% |
0.03% |
|
Ambiguous base calls (N) |
34,184 |
21% |
0.10% |
|
|
|||
|
Read errors |
Number of occurrences |
Cumulative percent of reads |
Percent of reads |
|
|
|||
|
Reads with no errors (perfect match) |
279,468 |
82% |
82% |
|
Reads with no more than one error |
35,813 |
93% |
11% |
|
Reads with no more than two errors |
11,651 |
96% |
3% |
|
Reads with more than two errors |
13,218 |
100% |
4% |
|
|
|||
|
Errors were classified as insertions, deletions, mismatches (substitutions) and ambiguous base calls (Ns). We further classified insertions, deletions and mismatches by their association with homopolymer effects. Deletions corresponding to an adjacent base are considered incomplete extension. Insertions and mismatches of the same base as an adjacent base are extensions. Insertions and mismatches that are the same as an upcoming homopolymer with no more than two intervening bases are considered carry forward errors. |
|||
|
Huse et al. Genome Biology 2007 8:R143 doi:10.1186/gb-2007-8-7-r143 |
|||