<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-6-reviews1015</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Minireview</dochead>
      <bibl>
         <title>
            <p>Insights into cereal genomes from two draft genome sequences of rice</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Bancroft</snm>
               <fnm>Ian</fnm>
               <insr iid="I1"/>
               <email>ian.bancroft@bbsrc.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>John Innes Centre, Norwich Research Park, Colney, Norwich NR4 7UH, UK</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>6</issue>
         <fpage>reviews1015.1</fpage>
         <lpage>reviews1015.3</lpage>
         <url>http://genomebiology.com/2002/3/6/reviews/1015</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-6-reviews1015</pubid>
               <pubid idtype="pmpid">12093379</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>28</day>
               <month>5</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>BioMed Central Ltd</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Draft genome sequences have been reported for two subspecies of rice. The drafts include the sequences of an estimated 99% of all rice genes and provide major advances in our understanding of the content and complexity of cereal genomes in general and the rice genome in particular.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>A third of the human population depends on rice as a staple food [<abbr bid="B1">1</abbr>]. Rice is also an important model for other cereal crops that, along with rice, account for more than 60% of worldwide agricultural production [<abbr bid="B2">2</abbr>]. Its small genome and long history of genetic studies led to rice being an early target for complete genome sequencing. Draft sequence has recently been reported for two subspecies of rice: <it>japonica</it> [<abbr bid="B3">3</abbr>], which is widely grown in Japan, and <it>indica</it> [<abbr bid="B4">4</abbr>], which is widely grown in China and elsewhere. These sequence data can now be analyzed and compared with the published genome sequence of the widely adopted model species for dicotyledonous flowering plants, <it>Arabidopsis thaliana</it> [<abbr bid="B5">5</abbr>].</p>
      </sec>
      <sec>
         <st>
            <p>Gene content of rice</p>
         </st>
         <p>The rice draft genomic sequences were generated by whole-genome shotgun (WGS) sequencing, largely starting from plasmid clones of nuclear DNA. This strategy contrasts with that employed by the International Rice Gene Sequencing Project (IRGSP) [<abbr bid="B6">6</abbr>], which produces high-quality sequence from large-insert clones that are individually selected from a complete physical map.WGS sequencing rapidly and cost-effectively generates sequence from nearly all of the genes in the genome, but the lengths of contiguous sequence are shorter, errors are more numerous and integration with genetic maps is poorer. The <it>japonica</it> WGS sequencing [<abbr bid="B3">3</abbr>] was conducted to 6X redundancy - that is, each base was sequenced an average of six times. Repeated sequences (which are numerous in the rice genome) were removed, permitting the assembly of 42,109 contiguous sequences, or contigs, representing 390 megabases of total sequence (thus, the mean length of contigs was 9.2 kb). The <it>indica</it> WGS sequencing [<abbr bid="B4">4</abbr>] was conducted to 4X redundancy. In this case, repeated sequences were masked, permitting the assembly of 127,550 sequence contigs, representing 361 Mb total length (the mean length of contigs was thus 3.5 kb). The total size of the <it>indica</it> genome was estimated as 466 Mb [<abbr bid="B4">4</abbr>] and that of <it>japonica</it> should be very similar. On the basis of comparisons with individually sequenced rice genes, finished sequence from the IRGSP, and other rice sequences, the <it>japonica</it> and <it>indica</it> sequences provide 99% and 92% coverage of genes, respectively [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>]. The sequence probably contains relatively few mis-assemblies; this was tested for the <it>indica</it> data by using cDNA sequences to screen for artificial exon-sized rearrangements, and putative mis-assemblies were identified in only 1.1% of the genes tested [<abbr bid="B4">4</abbr>]. Thus, although highly fragmented, the draft sequence generated by WGS allows us to gain early insights into many characteristics of the rice genome.</p>
         <p>Automated gene prediction within the rice genome sequence is complicated by gradients in patterns of the usage of codons and amino acids. This phenomenon was analyzed in the <it>indica</it> sequence [<abbr bid="B4">4</abbr>], where the 5' ends of genes were found to have a higher GC content than the 3' ends by up to 25%. This difference extended approximately a kilobase from the 5' ends of genes. These gradients are not observed in <it>A. thaliana,</it> so assessing total gene numbers in rice is more difficult than in <it>A. thaliana,</it> and the numbers reported should be considered very much as approximations. Using various measures of estimation of true gene numbers, the range for <it>japonica</it> was reported as being 32,000-50,000 [<abbr bid="B3">3</abbr>] and for <it>indica</it> as 46,000-56,000 [<abbr bid="B4">4</abbr>]. Both estimates are considerably more than the 25,500 or so genes predicted in the <it>A. thaliana</it> genome. They are also greater than for <it>Caenorhabditis elegans</it> and <it>Drosophila melanogaster</it> (which have around 19,100 and 13,500 genes, respectively) [<abbr bid="B7">7</abbr>,<abbr bid="B8">8</abbr>] and, probably, than for human (which has perhaps 38,000) [<abbr bid="B9">9</abbr>]. The relatively larger number of genes encoded by plant genomes is likely to be a consequence of the frequency with which polyploidy occurs and the apparent selective advantage of polyploidy.</p>
      </sec>
      <sec>
         <st>
            <p>The functions of rice genes</p>
         </st>
         <p>The proteins encoded by the predicted genes in both the <it>indica and japonica</it> sequences were analyzed and classified using the InterPro [<abbr bid="B10">10</abbr>,<abbr bid="B11">11</abbr>] and Gene Ontology (GO) [<abbr bid="B12">12</abbr>,<abbr bid="B13">13</abbr>] databases. The results are not directly comparable with the previously published classification of <it>A. thaliana</it> proteins [<abbr bid="B5">5</abbr>], which used InterPro and PENDANT [<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>]. But when 25,426 <it>A. thaliana</it> genes were classified using GO, along with 53,398 predicted complete <it>indica</it> genes, allowing direct comparison of their functional classifications, the proportions of genes in each functional category were found to be almost identical between the two plants [<abbr bid="B4">4</abbr>]. The major difference, as shown in Figure <figr fid="F1">1</figr>, is that a larger proportion of rice genes remain unclassified. The classified predicted proteins (10,893 from <it>indica</it> and 9,230 for <it>A. thaliana</it>) are very likely to represent the products of genuine genes, but what about the genes predicted to encode unclassifiable proteins? For the predicted <it>A. thaliana</it> proteins, 80-85% (around 21,000 genes) have homologous predicted genes in the rice <it>indica</it> and <it>japonica</it> sequences [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>]. This includes about 8,000 proteins predicted to be in rice but not in <it>D. melanogaster, C. elegans, Saccharomyces cerevisiae</it> or sequenced bacterial genomes, so these probably represent the plant-specific set of genes. The approximately 4,000 predicted proteins of <it>A. thaliana</it> that did not have homologies in rice are either artefacts or are unique to dicotyledonous plants. When the predicted rice gene sequences are compared with the <it>A. thaliana</it> genome, less than half (about 30,000) have significant homologs. This represents the minimum number of genes in rice, therefore, although it should be recognized that they are not necessarily all functional. The predicted rice proteins with no homologs in <it>A. thaliana</it> are either artefacts of the automated annotation (probably most of them) or are unique to monocotyledonous plants. Although the number of genes in rice is relatively large, the number of distinct gene families that appear to be present (15,000 reported for <it>japonica</it> [<abbr bid="B3">3</abbr>]) is similar to the number for <it>A. thaliana, D. melanogaster</it> and <it>C. elegans</it> (13,382, 10,736 and 14,177, respectively [<abbr bid="B5">5</abbr>]). The increased number of genes per family in plants is clearly a consequence of gene duplication, but it is not necessarily the case that the functions of members of a family are redundant.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Comparison of the functional classifications of predicted rice and <it>A. thaliana</it> proteins</p>
            </caption>
            <text>
               <p>Comparison of the functional classifications of predicted rice and <it>A. thaliana</it> proteins (see text for further details; the figure is redrawn from [3,4]).</p>
            </text>
            <graphic file="gb-2002-3-6-reviews1015-1"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Comparative analyses</p>
         </st>
         <p>Analysis of the <it>japonica</it> sequence has revealed extensive gene duplication in rice [<abbr bid="B3">3</abbr>]. The sequences of more than 2,000 mapped rice cDNA markers were used to identify homologs in the genome sequence. Using a threshold value of 80% identity over 100 base-pairs, the mean number of homologous loci per cDNA was over 1.94. Evidence of extensive duplication of genomic segments was also detected. Most of the segments identified were very small (four markers or fewer), suggesting that extensive recombination and/or rearrangement has occurred since the duplication event(s). Dating of the duplications suggested that whole-genome duplication occurred 40 to 50 million years ago. The genome of <it>A. thaliana</it> also appears to be the result of whole genome duplication - a tetraploidy event - in its ancestry, followed by extensive rearrangement [<abbr bid="B16">16</abbr>]. This underlines the importance of polyploidy in plant evolution and as a key feature of plant genomes to be recognized when planning comparative analyses.</p>
         <p>There is extensive conservation in the <it>japonica</it> genome sequence [<abbr bid="B3">3</abbr>] of genes previously identified in other cereals. The alignment of cereal genomes using sequence-based markers from cereals other than rice showed extensive conservation of gene order (conserved synteny), confirming previous results [<abbr bid="B17">17</abbr>]. But can the draft rice sequence be used to assess the extent of conservation of synteny between the rice and <it>A. thaliana</it> genomes? Although the fragmented nature of WGS-derived sequence data makes such data poorly suited to the comparative analysis of distantly related genomes, conservation of genome structure was identified between <it>japonica</it> and <it>A. thaliana</it> [<abbr bid="B3">3</abbr>]. At least 137 blocks of conserved synteny (defined as three or more <it>A. thaliana</it> genes from the same chromosome mapping to one rice bacterial artificial chromosome, or BAC, contig) were identified. This finding supports the general applicability of previous data from a specific region of the rice genome for which conserved genome microstructure could be identified in <it>A. thaliana</it> [<abbr bid="B18">18</abbr>]. Several of the syntenic blocks in <it>japonica</it> aligned to multiple regions of the <it>A. thaliana</it> genome, providing further confirmation for observations that the <it>A. thaliana</it> genome is the result of multiple rounds of duplication [<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>]. More extensive analyses were reported [<abbr bid="B3">3</abbr>] for adjacent pairs of genes present in <it>A. thaliana,</it> for which the positions in the <it>japonica</it> sequence were assessed. This analysis showed that many pairs of genes show synteny, but most pairs had intervening genes in rice. This pattern of interspersion of conserved and non-conserved genes confirms previous similar findings in comparisons of the microstructure of the genomes of rice and <it>A. thaliana</it> [<abbr bid="B18">18</abbr>,<abbr bid="B20">20</abbr>] and provides support for the generality of this feature of plant genome microstructure [<abbr bid="B21">21</abbr>].</p>
      </sec>
      <sec>
         <st>
            <p>Towards a high-quality rice genome sequence</p>
         </st>
         <p>The published draft rice genome sequences contain at least parts of almost all rice genes. The annotation of those genes is presently poor, and their number could be anywhere in the range 30,000 to 60,000. Future rice genome sequencing plans include the integration of the draft sequence data recently reported [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>] with data being generated by the IRGSP [<abbr bid="B6">6</abbr>], which already incorporates sequence data generated by Monsanto [<abbr bid="B22">22</abbr>]. This will allow the most rapid progress towards the completion of fully contiguous, high quality genome sequence, which will be needed to underpin comparative genomics and gene discovery in cereals. Verification and functional analysis of the predicted genes will be important if they are to fulfil their potential for aiding the development of our understanding of plant science and the advancement of agricultural production worldwide.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Origin, dispersal, cultivation and variation of rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Khush</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>35</volume>
            <fpage>25</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1005810616885</pubid>
                  <pubid idtype="pmpid">9291957</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The Living Fields: Our Agricultural Heritage.</p>
            </title>
            <aug>
               <au>
                  <snm>Harlan</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>New York: Cambridge University Press;</source>
            <pubdate>1995</pubdate>
            <fpage>30</fpage>
            <lpage>31</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A draft sequence of the rice genome (<it>Oryza sativa</it> L. <it>ssp. japonica</it>).</p>
            </title>
            <aug>
               <au>
                  <snm>Goff</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Ricke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Presting</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Glazebrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sessions</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oeller</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Varma</snm>
                  <fnm>H</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>92</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068275</pubid>
                  <pubid idtype="pmpid" link="fulltext">11935018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A draft sequence of the rice genome (<it>Oryza sativa</it> L. <it>ssp. indica</it>).</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>79</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068037</pubid>
                  <pubid idtype="pmpid" link="fulltext">11935017</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <cnm>The Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>International Rice Genome Sequencing Project: the effort to completely sequence the rice genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Burr</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2000</pubdate>
            <volume>3</volume>
            <fpage>138</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5266(99)00047-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10712951</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Sequence and analysis of the genome of <it>C. elegans:</it> a platform for investigating biology.</p>
            </title>
            <aug>
               <au>
                  <cnm>The C. elegans Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>2012</fpage>
            <lpage>2018</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.282.5396.2012</pubid>
                  <pubid idtype="pmpid" link="fulltext">9851916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The genome sequence of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Celniker</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanatides</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Hoskins</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Galle</snm>
                  <fnm>RF</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2185</fpage>
            <lpage>2195</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2185</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731132</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The sequence of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Mural</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <fpage>1304</fpage>
            <lpage>1351</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1058040</pubid>
                  <pubid idtype="pmpid" link="fulltext">11181995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>InterPro - an integrated documentation resource for protein families, domains and functional sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Biswas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bucher</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Corpet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Croning</snm>
                  <fnm>MD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>1145</fpage>
            <lpage>1150</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.12.1145</pubid>
                  <pubid idtype="pmpid" link="fulltext">11159333</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>InterPro</p>
            </title>
            <url>http://www.ebi.ac.uk/interpro/</url>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Annotating eukaryote genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reese</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>349</fpage>
            <lpage>354</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(00)00095-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10851187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Gene Ontology Consortium</p>
            </title>
            <url>http://www.geneontology.org/</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Functional and structural genomics using PEDANT.</p>
            </title>
            <aug>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Albermann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hani</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Heumann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Metanomski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zollner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>BioInformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>44</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.1.44</pubid>
                  <pubid idtype="pmpid" link="fulltext">11222261</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>PEDANT</p>
            </title>
            <url>http://pedant.gsf.de/</url>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Extensive duplication and reshuffling in the <it>Arabidopsis</it> genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Blanc</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barakat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Guyot</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Delseny</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>1093</fpage>
            <lpage>1101</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149051</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899976</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.7.1093</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Comparative genetics in the grasses.</p>
            </title>
            <aug>
               <au>
                  <snm>Gale</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Devos</snm>
                  <fnm>KM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>1971</fpage>
            <lpage>1974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33824</pubid>
                  <pubid idtype="pmpid" link="fulltext">9482816</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.5.1971</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Conservation of microstructure between a sequenced region of the genome of rice and multiple segments of the genome of <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mayer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tarchini</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wambutt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Volckaert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dusterhoft</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stiekema</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Entian</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Terryn</snm>
                  <fnm>N</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1167</fpage>
            <lpage>1174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1617R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11435398</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Comparing sequenced segments of the tomato and <it>Arabidopsis</it> genomes: Large-scale duplication followed by selective gene loss creates a network of synteny.</p>
            </title>
            <aug>
               <au>
                  <snm>Ku</snm>
                  <fnm>H-M</fnm>
               </au>
               <au>
                  <snm>Vision</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tanksley</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>9121</fpage>
            <lpage>9126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">16832</pubid>
                  <pubid idtype="pmpid" link="fulltext">10908680</pubid>
                  <pubid idtype="doi">10.1073/pnas.160271297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Identification and analysis of homoeologous segments of the genomes of rice and <it>Arabidopsis thaliana.</it></p>
            </title>
            <aug>
               <au>
                  <snm>van Dodeweerd</snm>
                  <fnm>A-M</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Bent</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Bevan</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bancroft</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Genome</source>
            <pubdate>1999</pubdate>
            <volume>42</volume>
            <fpage>887</fpage>
            <lpage>892</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1139/gen-42-5-887</pubid>
                  <pubid idtype="pmpid" link="fulltext">10584310</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Duplicate and diverge: the evolution of plant genome microstructure.</p>
            </title>
            <aug>
               <au>
                  <snm>Bancroft</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>89</fpage>
            <lpage>93</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02179-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11173118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The use of the Monsanto draft rice genome sequence in research.</p>
            </title>
            <aug>
               <au>
                  <snm>Barry</snm>
                  <fnm>GF</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>125</volume>
            <fpage>1164</fpage>
            <lpage>1165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.125.3.1164</pubid>
                  <pubid idtype="pmpid" link="fulltext">11244095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
