<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-8-229</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Minireview</dochead>
		<bibl>
			<title>
				<p>Metagenomics for studying unculturable microorganisms: cutting the Gordian knot</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Schloss</snm>
					<mi>D</mi>
					<fnm>Patrick</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A2" ca="yes">
					<snm>Handelsman</snm>
					<fnm>Jo</fnm>
					<insr iid="I1"/>
					<email>joh@plantpath.wisc.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Plant Pathology, University of Wisconsin, Madison, WI 53706, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>8</issue>
			<fpage>229</fpage>
			<url>http://genomebiology.com/2005/6/8/229</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">16086859</pubid><pubid idtype="doi">10.1186/gb-2005-6-8-229</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>1</day>
					<month>8</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>BioMed Central Ltd</collab>
		</cpyrt>
		<shorttitle>
			<p>Metagenomics for studying unculturable microorganisms</p>
		</shorttitle>
		<shortabs>
			<p>Recent advances in shotgun sequencing and computational methods for genome assembly have advanced the field of metagenomics, the culture-independent cloning and analysis of microbial DNA extracted directly from an environmental sample, to provide glimpses into the life of uncultured microorganisms.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>More than 99% of prokaryotes in the environment cannot be cultured in the laboratory, a phenomenon that limits our understanding of microbial physiology, genetics, and community ecology. One way around this problem is metagenomics, the culture-independent cloning and analysis of microbial DNA extracted directly from an environmental sample. Recent advances in shotgun sequencing and computational methods for genome assembly have advanced the field of metagenomics to provide glimpses into the life of uncultured microorganisms.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010007">Ecology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p/>
			</st>
			<p>The estimate that fewer than 1% of the prokaryotes in most environments can be cultivated in isolation <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> has produced a quandary: what is the significance of the field of modern microbial genomics if it is limited to culturable organisms? Until recently, this limitation meant that the genomes of most microbial life could not be dissected because more than half of the known bacterial phyla contain no cultured representatives, and the archaeal kingdoms are likewise dominated by uncultured members. The problem can be likened to the Gordian knot of Greek legend, which was impossible to unravel. The knot, which was constructed with interwoven strands with no ends exposed, served as a source of great pride of the citizens of Gordium where it was displayed. It was Alexander the Great who finally cut the massive knot and called the act his greatest victory. One strategy to expose the rest of the microbial world to the eye of the microbiologist - analogous to attempting to untie the knot - is to coax more bacteria into pure culture. The alternative approach - which could cut through it as Alexander the Great did - is metagenomics.</p>
			<p>Metagenomics is the culture-independent analysis of a mixture of microbial genomes (termed the metagenome) using an approach based either on expression or on sequencing <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Recent studies in the Sargasso Sea <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, acid mine drainage <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, soil <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, and sunken whale skeletons <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> have used the shotgun-sequencing approach to sample the genomic content of these varied environments. In each study, environmental samples were obtained and the microbial DNA was extracted directly from the sample, sheared, cloned into <it>Escherichia coli</it>, and random clones were sequenced. In some of the studies sequence overlaps were then used to assemble contigs or scaffolds of genomic sequence. The Sargasso Sea study <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> resulted in nearly 2,000,000 random sequence reads, a massive total <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>; the acid-mine-drainage community sequence, more modest in size but impressive in the analytical insights gained, was based on 100,000 sequence reads (Table <tblr tid="T1">1</tblr>). Assembling so many sequence reads, while simultaneously accounting for heterogeneities between genomes, introduced unique challenges for each study. In the hyper-diverse soil metagenomic sequencing project, fewer than 1% of the 150,000 sequence reads could be assigned to a contig <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, whereas the acid-mine-drainage sequencing project successfully assigned 85% of the sequence reads to one of 1,183 scaffolds <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The genome sequences of uncultured microorganisms residing in mixed communities can now realistically be determined.</p>
			<tbl id="T1">
				<title>
					<p>Table 1</p>
				</title>
				<caption>
					<p>Summary of metagenomic sequencing projects</p>
				</caption>
				<tblbdy cols="5">
					<r>
						<c ca="left">
							<p>Community</p>
						</c>
						<c ca="left">
							<p>Estimated species richness</p>
						</c>
						<c ca="left">
							<p>Thousands of sequence reads</p>
						</c>
						<c ca="left">
							<p>Total DNA sequenced (Mbp)</p>
						</c>
						<c ca="left">
							<p>Sequence reads in contigs (%)</p>
						</c>
					</r>
					<r>
						<c cspan="5">
							<hr/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Acid mine drainage</p>
						</c>
						<c ca="left">
							<p>6</p>
						</c>
						<c ca="left">
							<p>100</p>
						</c>
						<c ca="left">
							<p>76</p>
						</c>
						<c ca="left">
							<p>85</p>
						</c>
					</r>
					<r>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Deep sea whale fall</p>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Sample 1</p>
						</c>
						<c ca="left">
							<p>150</p>
						</c>
						<c ca="left">
							<p>38</p>
						</c>
						<c ca="left">
							<p>25</p>
						</c>
						<c ca="left">
							<p>43</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Sample 2</p>
						</c>
						<c ca="left">
							<p>50</p>
						</c>
						<c ca="left">
							<p>38</p>
						</c>
						<c ca="left">
							<p>25</p>
						</c>
						<c ca="left">
							<p>32</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Sample 3</p>
						</c>
						<c ca="left">
							<p>20</p>
						</c>
						<c ca="left">
							<p>40</p>
						</c>
						<c ca="left">
							<p>25</p>
						</c>
						<c ca="left">
							<p>47</p>
						</c>
					</r>
					<r>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Sargasso Sea</p>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Samples 1-4</p>
						</c>
						<c ca="left">
							<p>300 per sample</p>
						</c>
						<c ca="left">
							<p>1,662</p>
						</c>
						<c ca="left">
							<p>1,361</p>
						</c>
						<c ca="left">
							<p>61</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Sample 5-7</p>
						</c>
						<c ca="left">
							<p>300 per sample</p>
						</c>
						<c ca="left">
							<p>325</p>
						</c>
						<c ca="left">
							<p>265</p>
						</c>
						<c ca="left">
							<p>&lt;1</p>
						</c>
					</r>
					<r>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
						<c>
							<p/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>Minnesota farm soil</p>
						</c>
						<c ca="left">
							<p>&gt;3,000</p>
						</c>
						<c ca="left">
							<p>150</p>
						</c>
						<c ca="left">
							<p>100</p>
						</c>
						<c ca="left">
							<p>&lt;1</p>
						</c>
					</r>
				</tblbdy>
			</tbl>
		</sec>
		<sec>
			<st>
				<p>A simple oceanic community</p>
			</st>
			<p>The most extensive metagenomic sequencing effort has been the attempt by Venter <it>et al. </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> to sequence the prokaryotic genomes in the water of the Sargasso Sea, a well characterized region of the Atlantic near Bermuda that has unusually low nutrient levels; this study has already spawned numerous other meta-analyses (for example, <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>). Among one billion nucleotides of sequenced DNA, Venter <it>et al. </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> identified more than 1.2 million open reading frames (ORFs), including 782 that had significant similarity to rhodopsin-like proteins. This was a surprise because the rhodopsins were previously thought to be present in only a small group of organisms, and the Sargasso Sea study broadened the spectrum of species known to have them. One intriguing problem in metagenomics is that most ORFs cannot be assigned to gene families of known function <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In the Sargasso Sea sequences, for instance, 69% of the ORFs had no known function <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. This analysis points to a major limitation in annotating sequences from uncultured microorganisms: if no relative of the organism being sequenced has ever been sequenced, then the likelihood of matching each of the newly identified genes to genes of known function is low. The choice of database used for comparison determines the answer, as demonstrated by the identification by Venter <it>et al. </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> of 16S rRNA sequences from the Sargasso Sea by querying a database containing only 16S rRNA gene sequences from genome sequences of Bacteria and Archaea. As they limited the comparative database to cultured microorganisms, it was not surprising that they did not identify any 16S rRNA gene fragments from any phyla with no cultured representatives. A further limitation of this study was presented by Delong <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, who pointed out that the two genomes that Venter <it>et al. </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> were able to complete were probably contaminants in the sea-water sample. Obvious examples of assembly error (for example, contigs containing bacterial 5S and 23S rRNA genes adjacent to an archaeal 16S rRNA gene) suggest an insidious assembly problem throughout the sequence collection <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Perhaps the next stage of the project will profit from the mistakes of this 'pilot' sequencing attempt <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
		</sec>
		<sec>
			<st>
				<p>An even simpler biofilm community</p>
			</st>
			<p>Although the nutrient-limited Sargasso Sea was selected for metagenomics because it was thought to contain a simple community <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, the community was not simple enough to allow assembly of most of the sequence reads into contigs. Tyson <it>et al. </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp> selected a far simpler community, that of a biofilm found in the very acidic waste water from an iron mine (termed acid mine drainage), which contains three bacterial and three archaeal lineages. By grouping the assembled contigs into 'bins' according to their GC content and the number of reads per contig, they were able to assign each bin to an organism. The near-complete genome sequences (ten-fold coverage) of <it>Ferroplasma </it>type II and <it>Leptospirillum </it>group II members enabled Tyson <it>et al. </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp> conceptually to model the metabolic processes that each genome contributes to the broader community.</p>
			<p>This thorough sequencing and metabolic analysis provided the starting point for a 'proteogenomic' analysis. Protein was extracted from biofilms found in the acid mine drainage and digested with trypsin <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Applying shotgun mass spectrometry to the fragmented proteins, Ram <it>et al. </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp> obtained a sequence of part of the proteome. By combining the proteome and metagenome sequences <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, they linked one or more peptide sequences to approximately 49% of the ORFs from the five dominant genomes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The most powerful outcome of this analysis was the identification from the <it>Leptospirillum </it>group II sequences of a novel acid-stable iron-oxidizing c-type cytochrome with an adsorption maximum wavelength at 579 nm (Cyt<sub>579</sub>). Cyt<sub>579 </sub>is the primary iron-oxidizing enzyme in the microbial community and mediates the rate-limiting step in acid production. In this relatively simple community, the proteogenomic approach enabled Ram <it>et al. </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp> to quantify protein production from each ORF, validate the DNA-derived metabolic model, and identify a process that potentially acts as a keystone for the whole ecosystem.</p>
		</sec>
		<sec>
			<st>
				<p>First metagenomic analyses of complex microbial communities</p>
			</st>
			<p>A fundamental challenge in understanding microbial communities is to chronicle genetic conservation across time and location and to delineate the smallest complement of genes conserved in genomes across different communities <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Tringe <it>et al. </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp> tackled this problem by sequencing microbial communities sampled both from soil from a Minnesota farm and from three deep-sea communities living on sunken whale skeletons ('whale-fall') and comparing them with the Sargasso Sea sequence collection. ORFs from each metagenomic sequence were assigned to clusters of orthologous genes (COGs <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>), operons, pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, and COG functional categories <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
			<p>The relative enrichment found using each of the four annotation methods between the Sargasso Sea, deep-sea whale fall, and Minnesota farm soil was then determined, resulting, in essence, in an <it>in silico </it>subtractive hybridization. The over-representation of rhodopsin ORFs in the Sargasso Sea and ORFs encoding cellobiose phosphorylase in the Minnesota farm soil make biological sense, because marine microorganisms are more likely to use light-driven energy transduction systems and soil microorganisms are more likely to encounter plant-derived oligosaccharides such as cellobiose. The large number of ORFs of no known function that were over-represented in each community may indicate as-yet unknown functional systems. Generating copious sequence information from a community is intrinsically valuable, but this comparative analysis <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> is a worthy example of how metagenomics may move beyond descriptive, annotation-based analyses toward meaningful inference about ecological phenomena.</p>
		</sec>
		<sec>
			<st>
				<p>Dealing with complexity and contamination</p>
			</st>
			<p>Application of molecular biology methods to cultured organisms has led to striking insights into the life of microbes in mono-species culture. But genomics has failed to elucidate the functions of microbial communities, where most microorganisms on Earth spend most of their time and that provide the platform from microorganisms shape plant, animal, environmental and human health. Metagenomics, coupled with gene arrays, proteomics, expression-based analyses, and microscopy, will give insights into problems such as genome evolution and the membership of particular niches that are currently hindered by our inability to culture most microorganisms in pure culture <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. To realize the full potential of metagenomics, however, a number of obstacles need to be overcome. Perhaps the most significant of these is the microbial complexity in most communities. The successful analysis of the acid mine drainage community was predicated on its simplicity. In contrast, the Minnesota farm soil probably contains more than 5,000 species and 10<sup>4</sup>-10<sup>5 </sup>strains, making it inevitable that the over 150,000 sequence reads could not be assembled into contigs <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> (Table <tblr tid="T1">1</tblr>). It is likely that 2-5 gigabase-pairs of sequence are necessary to obtain eight-fold coverage of the dominant species in the community, suggesting that inventive approaches are needed to enrich DNA sequences from less abundant organisms or from members that are unique to a community <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
			<p>Another focus for improvement in metagenomics is the use of robust sampling and DNA-extraction procedures. Methodology that guards against contamination such as that revealed in the Sargasso Sea samples is essential. Making the metagenomic studies ecologically meaningful will require sampling strategies that account for spatial and temporal variability, thereby enabling comparisons between communities <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. These comparisons will also require standardized and aggressive methods for extracting DNA. It is unfortunate that all of the large metagenomic sequencing projects used chemical extraction methods to obtain DNA, whereas the technique of 'bead beating', which applies high shear forces to cells, is more effective than chemical lysis methods at breaking tough cells (for example, <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>). The studies that used chemical lysis methods therefore include DNA from only a subset of the organisms that can be accessed by modern methods.</p>
			<p>This is an exciting time for metagenomics, as many projects are underway to sequence the metagenomes of biologically interesting environments. The US Joint Genome Institute (JGI) has essentially sequenced the metagenomes of the microbial communities associated with two extinct ancient cave bears, which contained less than 2 and 6% cave bear DNA, respectively <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The JGI is also currently sequencing metagenomic DNA for more than ten studies through their scientific Community Sequencing Program <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and the J. Craig Venter Foundation is sequencing the metagenomes of samples taken along a path intended to simulate the voyage of Darwin's ship <it>The Beagle</it>, as well as samples of New York City's air <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. A future prospect is completing the human genome by sequencing the metagenome of the 10<sup>12 </sup>microbial cells that are associated with the human body <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Each of these studies will unearth secrets unique to the environment being examined, and comparison of results of these studies will provide a meta-understanding of the recurrent and unique themes in community structure and function.</p>
		</sec>
	</bdy>
	<bm>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Combination of 16S rRNA targeted oligonucleotide probes with flow-cytometry for analyzing mixed microbial populations.</p>
				</title>
				<aug>
					<au>
						<snm>Amann</snm>
						<fnm>RI</fnm>
					</au>
					<au>
						<snm>Binder</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Olson</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Chisholm</snm>
						<fnm>SW</fnm>
					</au>
					<au>
						<snm>Devereux</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Stahl</snm>
						<fnm>DA</fnm>
					</au>
				</aug>
				<source>Appl Environ Microbiol</source>
				<pubdate>1990</pubdate>
				<volume>56</volume>
				<fpage>1919</fpage>
				<lpage>1925</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">184531</pubid>
						<pubid idtype="pmpid">2200342</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Metagenomics: genomic analysis of microbial communities.</p>
				</title>
				<aug>
					<au>
						<snm>Riesenfeld</snm>
						<fnm>CS</fnm>
					</au>
					<au>
						<snm>Schloss</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Handelsman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Annu Rev Genet</source>
				<pubdate>2004</pubdate>
				<volume>38</volume>
				<fpage>525</fpage>
				<lpage>552</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.genet.38.072902.091216</pubid>
						<pubid idtype="pmpid" link="fulltext">15568985</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Biotechnological prospects from metagenomics.</p>
				</title>
				<aug>
					<au>
						<snm>Schloss</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Handelsman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Curr Opin Biotechnol</source>
				<pubdate>2003</pubdate>
				<volume>14</volume>
				<fpage>303</fpage>
				<lpage>310</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0958-1669(03)00067-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12849784</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Environmental genome shotgun sequencing of the Sargasso Sea.</p>
				</title>
				<aug>
					<au>
						<snm>Venter</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Remington</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Heidelberg</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Halpern</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Rusch</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Paulsen</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>KE</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>W</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>304</volume>
				<fpage>66</fpage>
				<lpage>74</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1093857</pubid>
						<pubid idtype="pmpid" link="fulltext">15001713</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Community structure and metabolism through reconstruction of microbial genomes from the environment.</p>
				</title>
				<aug>
					<au>
						<snm>Tyson</snm>
						<fnm>GW</fnm>
					</au>
					<au>
						<snm>Chapman</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hugenholtz</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Allen</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Ram</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>PM</fnm>
					</au>
					<au>
						<snm>Solovyev</snm>
						<fnm>VV</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>Rokhsar</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Banfield</snm>
						<fnm>JF</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>428</volume>
				<fpage>37</fpage>
				<lpage>43</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02340</pubid>
						<pubid idtype="pmpid" link="fulltext">14961025</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Comparative metagenomics of microbial communities.</p>
				</title>
				<aug>
					<au>
						<snm>Tringe</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>von Mering</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Kobayashi</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Salamov</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Chang</snm>
						<fnm>HW</fnm>
					</au>
					<au>
						<snm>Podar</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Short</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Mathur</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Detter</snm>
						<fnm>JC</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>308</volume>
				<fpage>554</fpage>
				<lpage>557</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1107851</pubid>
						<pubid idtype="pmpid" link="fulltext">15845853</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Metagenomics or megagenomics?</p>
				</title>
				<aug>
					<au>
						<snm>Handelsman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nat Rev Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>457</fpage>
				<lpage>458</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1038/nrmicro1185</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Chitinase gene sequences retrieved from diverse aquatic habitats reveal environment-specific distributions.</p>
				</title>
				<aug>
					<au>
						<snm>LeCleir</snm>
						<fnm>GR</fnm>
					</au>
					<au>
						<snm>Buchan</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hollibaugh</snm>
						<fnm>JT</fnm>
					</au>
				</aug>
				<source>Appl Environ Microbiol</source>
				<pubdate>2004</pubdate>
				<volume>70</volume>
				<fpage>6977</fpage>
				<lpage>6983</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">535185</pubid>
						<pubid idtype="pmpid" link="fulltext">15574890</pubid>
						<pubid idtype="doi">10.1128/AEM.70.12.6977-6983.2004</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Alternative oxidase and plastoquinol terminal oxidase in marine prokaryotes of the Sargasso Sea.</p>
				</title>
				<aug>
					<au>
						<snm>McDonald</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Vanlerberghe</snm>
						<fnm>GC</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2005</pubdate>
				<volume>349</volume>
				<fpage>15</fpage>
				<lpage>24</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.gene.2004.12.049</pubid>
						<pubid idtype="pmpid" link="fulltext">15777727</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness.</p>
				</title>
				<aug>
					<au>
						<snm>Schloss</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Handelsman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Appl Environ Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>71</volume>
				<fpage>1501</fpage>
				<lpage>1506</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1128/AEM.71.3.1501-1506.2005</pubid>
						<pubid idtype="pmpid" link="fulltext">15746353</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>The microbial selenoproteome of the Sargasso Sea.</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Fomenko</snm>
						<fnm>DE</fnm>
					</au>
					<au>
						<snm>Gladyshev</snm>
						<fnm>VN</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>R37</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2005-6-4-r37</pubid>
						<pubid idtype="pmpid" link="fulltext">15833124</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Microbial community genomics in the ocean.</p>
				</title>
				<aug>
					<au>
						<snm>Delong</snm>
						<fnm>EF</fnm>
					</au>
				</aug>
				<source>Nat Rev Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>459</fpage>
				<lpage>469</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrmicro1158</pubid>
						<pubid idtype="pmpid" link="fulltext">15886695</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Community proteomics of a natural microbial biofilm.</p>
				</title>
				<aug>
					<au>
						<snm>Ram</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Verberkmoes</snm>
						<fnm>NC</fnm>
					</au>
					<au>
						<snm>Thelen</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Tyson</snm>
						<fnm>GW</fnm>
					</au>
					<au>
						<snm>Baker</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Blake</snm>
						<fnm>RC</fnm>
						<suf>2nd</suf>
					</au>
					<au>
						<snm>Shah</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hettich</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Banfield</snm>
						<fnm>JF</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>308</volume>
				<fpage>1915</fpage>
				<lpage>1920</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science. 1109070</pubid>
						<pubid idtype="pmpid" link="fulltext">15879173</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>COGs - Clusters of orthologous groups</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/Class/NAWBIS/Modules/Genomes2Other/genomes41.html</url>
			</bibl>
			<bibl id="B15">
				<title>
					<p>KEGG: Kyoto encyclopedia of genes and genomes</p>
				</title>
				<url>http://www.genome.jp/kegg/</url>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Community genomics in microbial ecology and evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Allen</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Banfield</snm>
						<fnm>JF</fnm>
					</au>
				</aug>
				<source>Nat Rev Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>489</fpage>
				<lpage>498</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrmicro1157</pubid>
						<pubid idtype="pmpid" link="fulltext">15931167</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Breakthrough of the year: the runners-up.</p>
				</title>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<fpage>2013</fpage>
				<lpage>2017</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15604368</pubid>
						<pubid idtype="doi">10.1126/science.306.5704.2013</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Evaluation and optimization of DNA extraction and purification procedures for soil and sediment samples.</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>DN</fnm>
					</au>
					<au>
						<snm>Bryant</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Madsen</snm>
						<fnm>EL</fnm>
					</au>
					<au>
						<snm>Ghiorse</snm>
						<fnm>WC</fnm>
					</au>
				</aug>
				<source>Appl Environ Microbiol</source>
				<pubdate>1999</pubdate>
				<volume>65</volume>
				<fpage>4715</fpage>
				<lpage>4724</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">91634</pubid>
						<pubid idtype="pmpid" link="fulltext">10543776</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Genomic sequencing of pleistocene cave bears.</p>
				</title>
				<aug>
					<au>
						<snm>Noonan</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Hofreiter</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Priest</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Rohland</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Rabeder</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Krause</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Detter</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Paabo</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>EM</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>309</volume>
				<fpage>597</fpage>
				<lpage>599</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1113485</pubid>
						<pubid idtype="pmpid" link="fulltext">15933159</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>DOE Joint Genome Institute</p>
				</title>
				<url>http://jgi.doe.gov</url>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Life in the Air.</p>
				</title>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>307</volume>
				<fpage>1558d</fpage>
				<xrefbib>
					<pubid idtype="doi">10.1126/science.307.5715.1558d</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>The meaning and impact of the human genome sequence for microbiology.</p>
				</title>
				<aug>
					<au>
						<snm>Relman</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Falkow</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Trends Microbiol</source>
				<pubdate>2001</pubdate>
				<volume>9</volume>
				<fpage>206</fpage>
				<lpage>208</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0966-842X(01)02041-8</pubid>
						<pubid idtype="pmpid" link="fulltext">11336835</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
