Genome Biology

Open Access, Highly Accessed, Method

Prediction of effective genome size in metagenomic samples

Jeroen Raes1, Jan O Korbel1,2, Martin J Lercher1, Christian von Mering1,3 and Peer Bork1*

  • * Corresponding author: Peer Bork bork@embl.de

  • † Equal contributors

Author Affiliations

1 European Molecular Biology Laboratory, Meyerhofstrasse 1, D-69117 Heidelberg, Germany

2 Molecular Biophysics & Biochemistry Department, Yale University, Whitney Avenue, New Haven, Connecticut, USA

3 Institute of Molecular Biology, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland

Genome Biology 2007, 8:R10 doi:10.1186/gb-2007-8-1-r10

Published: 15 January 2007

Additional files

Additional data file 1:

A description of methods supplementary to the manuscript.

Format: PDF, Size: 124KB

Additional data file 2:

A figure showing the error distribution for EGS prediction on simulated reads.

Format: PDF, Size: 91KB

Additional data file 3:

A figure showing that EGS predictions work well when analyzing mixtures of different species or read lengths.

Format: PDF, Size: 71KB

Additional data file 4:

A table summarizing estimated genome sizes for available unfinished genomic sequencing project datasets.

Format: PDF, Size: 88KB

Additional data file 5:

A figure showing the error distribution for EGS prediction on simulated reads, using the bacteria-specific version of the prediction formula.

Format: PDF, Size: 65KB

Additional data file 6:

A figure showing the error distribution for EGS prediction on real reads, using the bacteria-specific version of the prediction formula.

Format: PDF, Size: 67KB

Additional data file 7:

A table summarizing OG markers.

Format: PDF, Size: 41KB

Additional data file 8:

A table showing randomly selected genomes and read lengths used for calibration.

Format: PDF, Size: 55KB

Additional data file 9:

A table summarizing the shotgun sequencing projects used to estimate cloning bias.

Format: PDF, Size: 52KB

Additional data file 10:

A table summarizing the data statistics for available environmental shotgun sequencing datasets (measured after quality clipping).

Format: PDF, Size: 52KB

Additional data file 11:

A figure showing the error distribution for EGS prediction on real reads.

Format: PDF, Size: 66KB