<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2003-4-5-402</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Correspondence</dochead>
      <bibl>
         <title>
            <p>Beyond 100 genomes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Janssen</snm>
               <fnm>Paul</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
            </au>
            <au id="A2">
               <snm>Audit</snm>
               <fnm>Benjamin</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Cases</snm>
               <fnm>Ildefonso</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Darzentas</snm>
               <fnm>Nikos</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A5">
               <snm>Goldovsky</snm>
               <fnm>Leon</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A6">
               <snm>Kunin</snm>
               <fnm>Victor</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A7">
               <snm>Lopez-Bigas</snm>
               <fnm>Nuria</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A8">
               <snm>Peregrin-Alvarez</snm>
               <mnm>Manuel</mnm>
               <fnm>Jos&#233;</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A9">
               <snm>Pereira-Leal</snm>
               <mi>B</mi>
               <fnm>Jos&#233;</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A10">
               <snm>Tsoka</snm>
               <fnm>Sophia</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A11" ca="yes">
               <snm>Ouzounis</snm>
               <mi>A</mi>
               <fnm>Christos</fnm>
               <insr iid="I2"/>
               <email>ouzounis@ebi.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Centre d'Ing&#233;nierie des Prot&#233;ines (CIP), Universit&#233; de Li&#232;ge, 4000 Li&#232;ge, Belgium</p>
            </ins>
            <ins id="I2">
               <p>Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK</p>
            </ins>
            <ins id="I3">
               <p>Laboratory of Microbiology, Belgian Nuclear Research Centre, SCK/CEN, Boeretang 200, B-2400-MOL, Belgium</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2003</pubdate>
         <volume>4</volume>
         <issue>5</issue>
         <fpage>402</fpage>
         <url>http://genomebiology.com/2003/4/5/402</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2003-4-5-402</pubid>
               <pubid idtype="pmpid">12734008</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>28</day>
               <month>4</month>
               <year>2003</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2003</year>
         <collab>BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>Beyond 100 genomes</p>
      </shorttitle>
      <shortabs>
         <p>By the end of 2002, we witnessed the landmark submission of the 100th complete genome sequence in the databases. An overview of these genomes reveals certain interesting trends and provides valuable insights into possible future developments.</p>
      </shortabs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>Since the publication of the first entire genome sequence seven years ago <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, a multitude of other genomes have been - or are in the process of being - sequenced <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. By the end of 2002, we witnessed the landmark submission of the 100th complete genome sequence in the databases <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. There are now 106 complete genomes in the public domain, thanks to advances in sequencing technology and sustained funding. An overview, and in particular the rank ordering, of these genomes reveals certain interesting trends and provides valuable insights into possible future developments.</p>
         <p>First, the contribution of genome sequencing projects in terms of actual protein sequence entries has been staggering. There are 433,238 protein sequences derived exclusively from entire genomes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>), out of a total of a million protein sequences known to date. In contrast, there are only 101,602 entries in Swiss-Prot (release 40), underlining the significant effort that is required for high-quality annotation <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The growth of protein sequence data coming from entire genomes is expected to reach over 1 million entries in two years' time (Figure <figr fid="F1">1</figr>). Given that approximately 40% of genes in any organism cannot be assigned a specific functional role <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, this suggests that in just a few years hundreds of thousands of sequences will be uncharacterized. While the large-scale characterization of protein function obtained from high-throughput experimental techniques <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> will alleviate some of the above problems, it is clear that to capitalize on the information explosion in genome biology, more research should also be devoted to the development of intelligent automated genome-annotation systems that are able to predict functional properties of protein sequences <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Cumulative number of protein sequence entries (<it>y</it>-axis) in completed genomes (CoGenT, in blue) and Swiss-Prot (in red) as a function of time (<it>x</it>-axis)</p>
            </caption>
            <text>
               <p>Cumulative number of protein sequence entries (<it>y</it>-axis) in completed genomes (CoGenT, in blue) and Swiss-Prot (in red) as a function of time (<it>x</it>-axis).</p>
            </text>
            <graphic file="gb-2003-4-5-402-1"/>
         </fig>
         <p>Second, in addition to the well-defined collection of 106 completed and published genomes, there are another 544 ongoing projects, covering a large number of taxa. Yet, the known taxa of Bacteria and Archaea are far better represented among the completed genome projects compared to the Eukarya (Figure <figr fid="F2">2</figr>). Using comparative genomics we have already obtained a glimpse of the bewildering biological diversity of the prokaryotic world <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Very soon, a similar trend might emerge for the Eukarya: 208 out of the 544 ongoing genome projects are dedicated to eukaryotic species. However, many eukaryotic taxa are still not represented (Figure <figr fid="F2">2</figr>). A better sampling of phylogenetic diversity might be required, to fully explore the genomes of eukaryotic cells.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Phylogenetic distribution of genome sequencing projects</p>
            </caption>
            <text>
               <p>Phylogenetic distribution of genome sequencing projects. Archaea and Bacteria are shown to the phylum level and Eukarya to their first taxonomic branching, with the exception of Metazoa and Fungi. The numbers in parentheses represent the number of completed, published (red) and ongoing (blue) genome projects. The tree is based on the taxonomy database from the National Center for Biotechnology Information (NCBI). Information about ongoing genome projects has been obtained from the Genomes OnLine Database (GOLD) <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, as of 22 January 2003.</p>
            </text>
            <graphic file="gb-2003-4-5-402-2"/>
         </fig>
         <p>Third, over time, both the range of sequenced genome sizes and the selection of species on the basis of their social impact has expanded <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> (Figure <figr fid="F3">3</figr>). Sequenced genome sizes range from 0.5 to 300 Megabases (Mb), with the exception of the human and mouse genomes, which span 2,900 and 2,500 Mb respectively (and together constitute almost 90% of the data in the 106 available DNA sequences). Although species of medical and academic interest were initially the main targets of genome projects, there has been a recent trend to sequence genomes from species with impact on agriculture, environmental sciences or industrial processes. In addition, a growing number of genomes are being sequenced in order to provide a better perspective for the structure and function of evolutionarily related genes and genomes through comparative analysis.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Representation of completed genome sequences over time (<it>x</it>-axis) and size (<it>y</it>-axis, in Mb, logarithmic scale) labeled according to their social impact</p>
            </caption>
            <text>
               <p>Representation of completed genome sequences over time (<it>x</it>-axis) and size (<it>y</it>-axis, in Mb, logarithmic scale) labeled according to their social impact. Genomes from Archaea (squares), Bacteria (circles) and Eukarya (triangles) are colored according to their academic (blue), medical (pink), agricultural (light green), ecological (dark green) and industrial (black) relevance.</p>
            </text>
            <graphic file="gb-2003-4-5-402-3"/>
         </fig>
         <p>Thus, 10 years after the computational analysis of the first eukaryotic chromosome <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and seven years after an exhaustive analysis of the first complete genome <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B12">12</abbr></abbrgrp>, genomic science has become a stand-alone discipline, and genome sequencing and computational analysis have become mutually dependent, intertwined in a fascinating interplay. Not so long ago it would have been unthinkable that from a set of DNA fragments, it would be possible to assemble a single genome, find the genes, translate them into proteins, identify their potential functional roles and ultimately integrate all this structural and functional information into complex biochemical networks <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Although there are still significant challenges, these technologies, along with scientific advances, have now come of age and are expected to have a growing impact on various aspects of human welfare.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Whole-genome random sequencing and assembly of <it>Haemophilus influenzae </it>Rd.</p>
            </title>
            <aug>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Merrick</snm>
                  <fnm>JM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>269</volume>
            <fpage>496</fpage>
            <lpage>512</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102476</pubid>
                  <pubid idtype="pmpid">7542800</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Status of genome projects for nonpathogenic bacteria and archaea.</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>1049</fpage>
            <lpage>1054</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11017041</pubid>
                  <pubid idtype="doi">10.1038/80235</pubid>
                  <pubid idtype="pmcid">150431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genome sequence of the endocellular obligate symbiont of tsetse flies, <it>Wigglesworthia glossinidia</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Akman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wataname</snm>
                  <fnm>HOK</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aksoy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>402</fpage>
            <lpage>407</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12219091</pubid>
                  <pubid idtype="doi">10.1038/ng986</pubid>
                  <pubid idtype="pmcid">29859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Complete Genome Tracking Database.</p>
            </title>
            <url>http://maine.ebi.ac.uk:8000/services/cogent</url>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.</p>
            </title>
            <aug>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>45</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102476</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592178</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.45</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genome sequences and great expectations.</p>
            </title>
            <aug>
               <au>
                  <snm>Iliopoulos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tsoka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Janssen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Audit</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tramontano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Leroy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2000</pubdate>
            <volume>2</volume>
            <fpage>interactions0001.1</fpage>
            <lpage>0001.3</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11178275</pubid>
                  <pubid idtype="doi">10.1186/gb-2000-2-1-interactions0001</pubid>
                  <pubid idtype="pmcid">150431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A biochemical genomics approach for identifying genes by the activity of their products.</p>
            </title>
            <aug>
               <au>
                  <snm>Martzen</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>McCraith</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Spinelli</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Torres</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grayhack</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Phizicky</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>1153</fpage>
            <lpage>1155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10550052</pubid>
                  <pubid idtype="doi">10.1126/science.286.5442.1153</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Automated genome sequence analysis and annotation.</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Leroy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hoersch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>de Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Franchini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tamames</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>391</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10366660</pubid>
                  <pubid idtype="doi">10.1093/bioinformatics/15.5.391</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Prokaryotic diversity - magnitude, dynamics, and controlling factors.</p>
            </title>
            <aug>
               <au>
                  <snm>Torsvik</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ovreas</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Thingstad</snm>
                  <fnm>TF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>1064</fpage>
            <lpage>1066</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12004116</pubid>
                  <pubid idtype="doi">10.1126/science.1071698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Biodiversity: microbial genomes multiply.</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>RF</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>416</volume>
            <fpage>697</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11961543</pubid>
                  <pubid idtype="doi">10.1038/416697a</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>What's in a genome?</p>
            </title>
            <aug>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scharf</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1992</pubdate>
            <volume>358</volume>
            <fpage>287</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/358287a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">1641000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Challenging times for bioinformatics.</p>
            </title>
            <aug>
               <au>
                  <snm>Casari</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Boyle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tamames</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1995</pubdate>
            <volume>376</volume>
            <fpage>647</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/376647a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">7651513</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Protein function in the post-genomic era.</p>
            </title>
            <aug>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Xenarios</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>TO</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>823</fpage>
            <lpage>826</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10866208</pubid>
                  <pubid idtype="doi">10.1038/35015694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genomes OnLine Database (GOLD): a monitor of genome projects world-wide.</p>
            </title>
            <aug>
               <au>
                  <snm>Bernal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ear</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>126</fpage>
            <lpage>127</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11125068</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.126</pubid>
                  <pubid idtype="pmcid">29859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
