Table 1

Data used for genome assembly and scaffolding
Insert size (bp) Read length (bp) Raw data (Gb) Coverage (X) Data after filtering (Gb) Coverage (x) GC content (%)
200 100 8.28 19.90 7.11 17.09 40.25
500 100 14.36 34.51 9.64 23.17 39.85
800 100 8.06 19.36 5.74 13.80 42.12
2 kb 49 5.65 13.59 4.70 11.30 45.35
5 kb 49 6.77 16.30 5.61 13.49 45.90
10 kb 49 10.50 25.14 7.01 16.80 43.68
Total - 53.62 128.80 39.81 95.65 42.86

DNA was sequenced on an Illumina HiSeq 2000 (100 bp read lengths) or on the Illumina GAIIx (49 bp reads). Libraries were constructed across a range of insert sizes, from 200 bp to 10 kb. The final assembly after filtering consisted of 39.81 Gb of data with 95× coverage of the genome.

Kocher et al.

Kocher et al. Genome Biology 2013 14:R142   doi:10.1186/gb-2013-14-12-r142

Open Data