<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2000-2-1-interactions0001</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Open letter</dochead>
      <bibl>
         <title>
            <p>Genome sequences and great expectations</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Iliopoulos</snm>
               <fnm>Ioannis</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Tsoka</snm>
               <fnm>Sophia</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3">
               <snm>Andrade</snm>
               <mi>A</mi>
               <fnm>Miguel</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Janssen</snm>
               <fnm>Paul</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A5">
               <snm>Audit</snm>
               <fnm>Benjamin</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A6">
               <snm>Tramontano</snm>
               <fnm>Anna</fnm>
               <insr iid="I3"/>
            </au>
            <au id="A7">
               <snm>Valencia</snm>
               <fnm>Alfonso</fnm>
               <insr iid="I4"/>
            </au>
            <au id="A8">
               <snm>Leroy</snm>
               <fnm>Christophe</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A9">
               <snm>Sander</snm>
               <fnm>Chris</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A10" ca="yes">
               <snm>Ouzounis</snm>
               <mi>A</mi>
               <fnm>Christos</fnm>
               <insr iid="I1"/>
               <email>ouzounis@ebi.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Computational Genomics Group, Research Programme, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge, CB10 1SD, UK</p>
            </ins>
            <ins id="I2">
               <p>Biological Structures and BioComputing Programme, EMBL, Meyerhofstrasse 1, Heidelberg, Germany</p>
            </ins>
            <ins id="I3">
               <p>Department of Computational Biology and Chemistry, IRBM, Via Pontina, Pomezia, Rome, Italy</p>
            </ins>
            <ins id="I4">
               <p>Protein Design Group, National Center for Biotechnology, Cantoblanco, Madrid, Spain</p>
            </ins>
            <ins id="I5">
               <p>MIT Center for Genome Research, One Kendall Square, Cambridge, MA 02139, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2000</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>interactions0001.1</fpage>
         <lpage>interactions0001.3</lpage>
         <url>http://genomebiology.com/2000/2/1/interactions/0001</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2000-2-1-interactions0001</pubid>
               <pubid idtype="pmpid">11178275</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>29</day>
               <month>12</month>
               <year>2000</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2000</year>
         <collab>GenomeBiology.com</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>To assess how automatic function assignment will contribute to genome annotation in the next five years, we have performed an analysis of 31 available genome sequences. An emerging pattern is that function can be predicted for almost two-thirds of the 73,500 genes that were analyzed. Despite progress in computational biology, there will always be a great need for large-scale experimental determination of protein function.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Letter</p>
         </st>
         <p>The completion of a first draft for the human genome sequence represents an outstanding scientific achievement of our times. Despite the unprecedented hype in the popular media, it is evident that the completion of this phase is just the beginning of a long march towards an understanding of the structure and function of our genetic blueprint [<abbr bid="B1">1</abbr>]. What is to be expected from the analysis of the human genome five years from now? One guide in answering this question is a thorough assessment of our capability to analyze the genomes that have already been sequenced in the past five years, starting in 1995 with the genome of <it>Haemophilus influenzae</it> [<abbr bid="B2">2</abbr>,<abbr bid="B3">3</abbr>]. Since then, 31 entire genome sequences have been made available to the public domain (as of September 2000) [<abbr bid="B4">4</abbr>]. To obtain a snapshot of the maximum possible set of database-driven sequence annotations across species, we have used G<smcaps>ene</smcaps>Q<smcaps>uiz</smcaps>, a system for automated large-scale sequence analysis [<abbr bid="B3">3</abbr>,<abbr bid="B5">5</abbr>].</p>
         <p>Automated genome sequence analysis and annotation has the advantage that the analysis strategy is uniformly applied to all genome sequences against the same database, rendering results comparable [<abbr bid="B5">5</abbr>]. The updated annotations also form a resource for the scientific community for further analysis and assessment. The set of annotations we obtained contains an additional 5,534 novel protein function assignments, a 15% increase over the original genome sequence publications (Figure <figr fid="F1">1a</figr>). Evaluation of the results has shown that the agreement of automated function prediction reaches 95%, when compared to expert manual analysis [<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>]. Through this procedure, patterns that are not discernible from individual genomes become apparent, with interesting implications for the future.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>A summary of the annotation levels for 31 genomes.</p>
            </caption>
            <text>
               <p>A summary of the annotation levels for 31 genomes. Annotations for all genomes (for 73,500 unique genes, 134,000 annotations in total - approximately a twofold annotation coverage) are available on the world wide web at the European Bioinformatics Institute Computational Genomics Group Services page [<abbr bid="B15">15</abbr>] - then point and click at 'GeneQuiz'. Total computation required 2,400 CPU-hrs on a 16-CPU SGI Power Challenge and 68GB of storage. Results for other genomes will be made available at the same URL as they are completed. <b>(a)</b> Information snapshot for 31 entire genomes and a eukaryotic chromosome (<it>Plasmodium falciparum</it>, chromosome 2). For species (and strain) name abbreviations, please refer to the website [<abbr bid="B15">15</abbr>]. Bacteria are shown in black, Archaea in red and Eukarya in blue. Percentages for proteins with homologs of known structure (pink) or function (blue), hypothetical proteins (dark brown) and unique proteins (light brown) are shown. Species are sorted according to the sum of structure and function information; the horizontal line represents the average of known/predicted functions across species. Diamonds (bottom panel) represent the percentage increase in new findings over the original (or public database) annotations (except <it>Drosophila melanogaster,</it> for which such comparison is not currently possible). This percentage range, ranging from 0 to 20, is indicated in brackets. <b>(b)</b> An 'information clock' for the genome of <it>Haemophilus influenzae</it>, showing the relative levels of annotation over time, reflecting a general increase of information in the public databases. Colours are used as in (a).</p>
            </text>
            <graphic file="gb-2000-2-1-interactions0001-1"/>
         </fig>
         <p>First, the detection of proteins of known structure and/or function has increased over time (Figure <figr fid="F1">1b</figr>). This is especially true for homologs of known structure, a trend that should be further enhanced during the next five years as a result of the recent initiatives in structural genomics [<abbr bid="B7">7</abbr>,<abbr bid="B8">8</abbr>]. The total number of homologs of known function has also increased, but at a slower pace (Figure <figr fid="F1">1b</figr>). This may be attributable to two factors: either high-throughput experimental characterization has not yet provided genome-wide functional information or that this type of functional annotation has not yet been adequately transferred into the public sequence databases (Figure <figr fid="F1">1b</figr>) [<abbr bid="B9">9</abbr>]. It is imperative that provisions be made to start recording this type of functional information for the human genome [<abbr bid="B10">10</abbr>].</p>
         <p>Second, there is some degree of variability for the 31 genomes that we have analyzed. On average, function is known or can be predicted for 62% of the proteins encoded in each genome (Figure <figr fid="F1">1a</figr>). Of these, a significant proportion has homologs in the structure database (19% on average) and the remaining majority (43% on average) have homologs of known function. The total fraction of proteins of predicted structure/function varies significantly across species, ranging from 31% for <it>Aeropyrum pernix</it> (9% structure and 22% function) to 80% for <it>Escherichia coli</it> (21% and 59%, respectively; see Figure <figr fid="F1">1a</figr>). It remains to be seen how the human genome ranks within this range of functional assignments.</p>
         <p>Third, there is a substantial number of uncharacterized proteins, including hypothetical proteins (with homologs of unknown function; 26% on average) or 'unique' proteins (without homologs; 12% on average). Hypothetical proteins range from 13% for <it>E. coli</it> and 14% for <it>Rickettsia prowazekii</it> to 40% for <it>Pyrococcus abyssii</it> and 47% for <it>Plasmodium falciparum</it> (chromosome 2). These protein families represent excellent candidate targets for functional genomics. Species with high percentages of hypothetical proteins demarcate taxa whose properties generally remain unknown. This group includes Archaea (34% on average), <it>Mycobacterium tuberculosis</it> (33%), <it>Helicobacter pylori</it> and <it>Chlamydia pneumoniae</it> strains (all at 30%) and yeast (24%). At the other end of the spectrum, certain species are sufficiently covered by their relatives (e.g. <it>H. influenzae</it> at 14%).</p>
         <p>Finally, the percentage of 'unique' proteins delineates the number of uncharacterized protein families. This number varies widely, ranging from 0% for <it>Mycoplasma genitalium</it> and 4% for two <it>Chlamydia trachomatis</it> strains to 39% for <it>A. pernix.</it> A few of these genes may encode taxon-specific proteins or may represent false gene predictions. It is interesting that species with few unique proteins are bacterial (Figure <figr fid="F1">1a</figr>), an indication that many protein families in their phylogenetic neighbourhood have been detected. At the other extreme, species with many unique sequences also include <it>Treponema pallidum,</it> the two <it>Neisseria meningitidis</it> strains (all at 22%) and <it>Deinococcus radiodurans</it> (24%). It is expected that the human genome will also contain a high number of unique proteins, because it currently represents the only vertebrate genome that has been fully sequenced.</p>
         <p>The annotation level for each species crucially depends on the existence of homologs of the proteins that are potentially encoded in the genome sequence. Automatic function assignment also depends on the quality of the underlying databases and the availability of proteins of known structure or function. Another factor that significantly influences annotation quality is the definition of gene structure, an exceedingly difficult task for eukaryotic genomes [<abbr bid="B11">11</abbr>]. The difficulties of eukaryotic genome annotation have been recently reviewed [<abbr bid="B12">12</abbr>], with specific details on the Ensembl and GadFly projects for the human and the fruitfly genomes, respectively.</p>
         <p>The present study suggests that a great deal should be expected from the analysis of the human genome. On the one hand, using up-to-date databases and automatic approaches, a considerable proportion of the human genome will be annotated, to guide further experimentation and discovery. On the other hand, no matter how much the database increases over time, there will still be a great need for functional experiments, to detect the cellular roles of novel proteins. In spite of progress in the field of bioinformatics [<abbr bid="B13">13</abbr>], filling this information gap is going to be the next major challenge for genomics research, where large-scale experimentation will lead the way [<abbr bid="B14">14</abbr>].</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank members of the Systems Group at the European Bioinformatics Institute for help. This work was fully supported by the TMR Programme of the European Commission (DG-XII - Science, Research and Development).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Draft data leave geneticists with a mountain still to climb.</p>
            </title>
            <aug>
               <au>
                  <snm>Butler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smaglik</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>984</fpage>
            <lpage>985</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35016703</pubid>
                  <pubid idtype="pmpid" link="fulltext">10890409</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Whole-genome random sequencing and assembly of <it>Haemophilus influenzae</it> Rd.</p>
            </title>
            <aug>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Merrick</snm>
                  <fnm>JM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>269</volume>
            <fpage>496</fpage>
            <lpage>512</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7542800</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Challenging times for bioinformatics.</p>
            </title>
            <aug>
               <au>
                  <snm>Casari</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Boyle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tamames</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1995</pubdate>
            <volume>376</volume>
            <fpage>647</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/376647a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">7651513</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genomes OnLine Database (GOLD 1.0): a monitor of complete and ongoing genome projects worldwide.</p>
            </title>
            <aug>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>NC</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>773</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.9.773</pubid>
                  <pubid idtype="pmpid" link="fulltext">10498782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Automated genome sequence analysis and annotation.</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Leroy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hoersch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>de Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Franchini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tamames</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>391</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.5.391</pubid>
                  <pubid idtype="pmpid" link="fulltext">10366660</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Sequence analysis of the <it>Methanococcus jannaschii</it> genome and the prediction of protein function.</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Casari</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>de Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tamames</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>481</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9283767</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Structural genomics: beyond the human genome project.</p>
            </title>
            <aug>
               <au>
                  <snm>Burley</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Almo</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Bonanno</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Capel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chance</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sali</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Studier</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Swaminathan</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>23</volume>
            <fpage>151</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/13783</pubid>
                  <pubid idtype="pmpid" link="fulltext">10508510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Structural genomics and its importance for gene function analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Skolnick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fetrow</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Kolinski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>283</fpage>
            <lpage>287</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/73723</pubid>
                  <pubid idtype="pmpid" link="fulltext">10700142</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Recent developments and future directions in computational genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Tsoka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2000</pubdate>
            <volume>480</volume>
            <fpage>42</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(00)01776-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">10967327</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>One-stop shop for microarray data.</p>
            </title>
            <aug>
               <au>
                  <snm>Brazma</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>699</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001676</pubid>
                  <pubid idtype="pmpid" link="fulltext">10693778</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Gene-finding approaches for eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Stormo</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>394</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.10.4.394</pubid>
                  <pubid idtype="pmpid" link="fulltext">10779479</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Annotating eukaryotic genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reese</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>349</fpage>
            <lpage>354</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(00)00095-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10851187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Protein function in the post-genomic era.</p>
            </title>
            <aug>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Xenarios</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>TO</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>823</fpage>
            <lpage>826</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35015694</pubid>
                  <pubid idtype="pmpid" link="fulltext">10866208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genomics, gene expression and DNA arrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Winzeler</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>827</fpage>
            <lpage>836</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35015701</pubid>
                  <pubid idtype="pmpid" link="fulltext">10866209</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The European Bioinformatics Institute Computational Genomics Group - Services</p>
            </title>
            <note><url>http://www.ebi.ac.uk/research/cgg/services/</url> &#160;<url>http://www.genomes.org</url></note>
         </bibl>
      </refgrp>
   </bm>
</art>
