<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-10-research0053</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Long terminal repeat retrotransposons of <it>Oryza sativa</it></p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>McCarthy</snm>
               <mi>M</mi>
               <fnm>Eugene</fnm>
               <insr iid="I1"/>
               <email>gm@uga.edu</email>
            </au>
            <au id="A2">
               <snm>Liu</snm>
               <fnm>Jingdong</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Lizhi</snm>
               <fnm>Gao</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4">
               <snm>McDonald</snm>
               <mi>F</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Genetics, University of Georgia, Athens, GA 30602, USA</p>
            </ins>
            <ins id="I2">
               <p>Monsanto, St. Louis, MO 63198, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>10</issue>
         <fpage>research0053.1</fpage>
         <lpage>research0053.11</lpage>
         <url>http://genomebiology.com/2002/3/10/research/0053</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-10-research0053</pubid>
               <pubid idtype="pmpid">12372141</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>12</month>
               <year>2001</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>11</day>
               <month>3</month>
               <year>2002</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>9</day>
               <month>7</month>
               <year>2002</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>9</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>McCarthy et al., licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>Long terminal repeat retrotransposons of <it>Oryza sativa</it></p>
      </shorttitle>
      <shortabs>
         <p>A new data-mining program, LTR_STRUC, was used to mine the GenBank rice (<it>Oryza sativa</it>) database as well as the more extensive Monsanto rice dataset for long terminal repeat retrotransposons. Each of the major clades of rice LTR retrotransposons is more closely related to elements present in other species than to the other clades of rice elements, suggesting that horizontal transfer may have occurred.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Long terminal repeat (LTR) retrotransposons constitute a major fraction of the genomes of higher plants. For example, retrotransposons comprise more than 50% of the maize genome and more than 90% of the wheat genome. LTR retrotransposons are believed to have contributed significantly to the evolution of genome structure and function. The genome sequencing of selected experimental and agriculturally important species is providing an unprecedented opportunity to view the patterns of variation existing among the entire complement of retrotransposons in complete genomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using a new data-mining program, LTR_STRUC, (LTR retrotransposon structure program), we have mined the GenBank rice (<it>Oryza sativa</it>) database as well as the more extensive (259 Mb) Monsanto rice dataset for LTR retrotransposons. Almost two-thirds (37) of the 59 families identified consist of <it>copia</it>-like elements, but <it>gypsy</it>-like elements outnumber <it>copia</it>-like elements by a ratio of approximately 2:1. At least 17% of the rice genome consists of LTR retrotransposons. In addition to the ubiquitous <it>gypsy</it>- and <it>copia</it>-like classes of LTR retrotransposons, the rice genome contains at least two novel families of unusually small, non-coding (non-autonomous) LTR retrotransposons.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Each of the major clades of rice LTR retrotransposons is more closely related to elements present in other species than to the other clades of rice elements, suggesting that horizontal transfer may have occurred over the evolutionary history of rice LTR retrotransposons. Like LTR retrotransposons in other species with relatively small genomes, many rice LTR retrotransposons are relatively young, indicating a high rate of turnover.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Retrotransposons are mobile genetic elements that make up a large fraction of most eukaryotic genomes. They are particularly abundant in plants, where they are often a principal component of nuclear DNA. In maize 50-80%, and in wheat fully 90%, of the genome is made up of retrotransposons [<abbr bid="B1">1</abbr>,<abbr bid="B2">2</abbr>]. In animals this percentage is generally lower than in plants but can still be large. For example, more than 40% of the human genome is now known to be composed of retroelements [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>].</p>
         <p>All retrotransposons are distinguished by a life cycle involving an RNA intermediate. The RNA genome of a retroelement is copied into a double-stranded DNA molecule by reverse transcriptase and is subsequently integrated into the host's genome. Retrotransposons fall into two main categories, those with long terminal repeats (LTRs), such as retroviruses and LTR retrotransposons, and those that lack such repeats, (for example, long interspersed nuclear elements or LINEs).</p>
         <p>Our laboratory is in the process of screening the GenBank rice (<it>Oryza sativa</it>) database (GBRD) and the Monsanto rice dataset (MRD) for the presence of LTR retrotransposons. We have chosen to scan the rice genome because, as the most important food crop in the world, much of its sequence data is already available. With a haploid content of 430 million base pairs (Mbp), the rice genome is the smallest among cultivated cereals [<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>] and only about three times larger than the smallest known genome among angiosperms, that of <it>Arabidopsis thaliana</it> (~130 Mbp). <it>O. sativa</it> has one of the smallest genomes among grasses as a whole [<abbr bid="B6">6</abbr>]. Genomes of other cereals are far larger. For example, the maize (<it>Zea mays</it>) genome is 2,500 million base pairs (2.5 Gbp) and that of wheat (<it>Triticum aestivum</it>), 16 Gbp. The molecular genetic resources for rice are excellent, including detailed physical and genetic maps, large YAC and BAC libraries, an efficient transformation system, and an extensive collection of expressed sequence tags (ESTs).</p>
         <p>We have used a new search program, LTR_STRUC (LTR retrotransposon structure program; E.M.M. and J.F.M., unpublished work), as the initial data-mining tool in our survey. Structural features important to the algorithm on which LTR_STRUC is based include two sites critical to replication, the primer-binding site (PBS) and polypurine tract (PPT), as well as the presence of canonical dinucleotides at the ends of each LTR (typically TG and CA). Particularly important are the direct or 'target-site' repeats (TSRs). When an LTR retrotransposon inserts itself into host DNA, a short (usually 4-6 bp) segment of host DNA is replicated at the site of insertion. This feature allows LTR_STRUC to make an exact demarcation of the limits of a putative element. Because it searches for retroelements on the basis of their generic structure, LTR_STRUC eliminates much of the bias inherent in BLAST searches based on a known retroelement query. After elements were initially identified using LTR_STRUC, sequence analyses were carried out to identify open reading frames (ORFs) encoding reverse transcriptase (RT) and other retrotransposon proteins. Subsequent RT sequence alignments were carried out, followed by construction of phylogenetic trees.</p>
         <p>RTs from elements identified in our survey fall into numerous distinct families, where 'family' is defined as a group of elements with RTs having mutual similarity of at least 90% at the amino-acid level [<abbr bid="B7">7</abbr>]. In addition, four types of non-autonomous elements discussed here lack RT sequences (<it>Osr25, Osr37/Rire4, Osr43</it>, and <it>Osr44</it>), and were classified as distinct families on the basis of their unique structures (see below).</p>
         <p>Currently, there is no consensus with respect to rice retrotransposon nomenclature. In our method of nomenclature, rice LTR retrotransposons are specified by the appellation <it>Osr</it> (<it>Oryza sativa</it> retrotransposon). Distinct families are indicated by number (for example, <it>Osr1, Osr2, Osr3</it>, . . .). There have been four different nomenclatures previously used in reference to rice LTR retrotransposons: <it>Tos</it> (transposon <it>Oryza sativa</it>) [<abbr bid="B8">8</abbr>], <it>Rire</it> (rice retrotransposon) [<abbr bid="B9">9</abbr>] <it>Rrt</it> (rice retrotransposon) (S. Wang, submission to EMBL database: <it>Rtr3</it> (accession number T03666), <it>Rrt5</it> (T03669), and <it>Rrt8</it> (T03671)), and <it>Osr</it> (<it>Oryza sativa</it> retrotransposon) (N. Jwa, submission to GenBank: <it>Osr1</it> (AB046118)). We have chosen to adopt the <it>Osr</it> nomenclature in this study because it is consistent with the systematic logic (indicative of genus and species of host organism) used in previous genomic studies of LTR retrotransposons and includes the letter 'r' to indicate retrotransposon. However, in every case where we use the <it>Osr</it> acronym in this paper to refer to a previously named family, we also include any pre-existing name(s) for the family (for example, <it>Osr15/Tos12, Osr26/Rire2</it>).</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>As is the case for most eukaryotic species analyzed to date, rice LTR retrotransposons fall, for the most part, into two major categories, <it>gypsy</it>-like and <it>copia</it>-like (two exceptions are discussed below). <it>Copia</it>-like elements in the rice genome are usually 5-6 kb in length; however, certain families are composed of longer elements so that the mean length is around 6.2 kb. For example, elements in <it>Osr</it>7 and <it>Osr</it>8 are about 9,000 bp in length. Results of our study indicate that the TSRs of all rice LTR retrotransposons are 5 bp long (Table <tblr tid="T1">1</tblr>). The dinucleotides terminating the LTRs are similarly invariant: across all families, the 5' nucleotide pair is consistently TG, and the 3' end, consistently CA (except for a few mutated copies). In the rice genome, normal <it>gypsy</it>-like elements (that is, those that lack a deletion or insertion) are typically in the 10 to 13 kb range, but some do bear large insertions or internal deletions. Their mean length of 11.7 kb is larger than that of typical <it>gypsy</it>-like elements in other species, which are usually in the range of 7-8 kb [<abbr bid="B7">7</abbr>,<abbr bid="B10">10</abbr>]. The reason for this larger mean length of <it>O. sativa</it> LTR retrotransposons is presently unknown. Duplication of retroelement sequences during the process of reverse transcription has been previously observed in mammalian systems [<abbr bid="B11">11</abbr>] and nested insertions of transposons into LTR retrotransposons are not uncommon in plants [<abbr bid="B12">12</abbr>]. However, none of the full-length LTR retrotransposons reported here has a substructure consistent with nested LTR retrotransposon insertions. For example, none of the elements we report in Table <tblr tid="T1">1</tblr> encode more than one region of RT homology and none contain nested pairs of putative LTRs. Of course, we cannot eliminate the possibility that the larger size of <it>O. sativa gypsy</it>-like elements is, at least in part, due to insertions of unrecognized elements or ancient insertions of known elements that can no longer be recognized. Whatever, the reason for the exceptional size of <it>O. sativa gypsy</it>-like elements, it apparently does not inhibit function, as sequence analysis (see below) indicates that the majority of these elements have transposed in the recent evolutionary past. <it>Gypsy</it>-like elements in <it>O. sativa</it> also have larger LTRs than <it>copia</it>-like elements, many with lengths in excess of 3,000 bp (mean ~1,000 bp), whereas the typical <it>copia</it>-like LTR is around 500 bp long.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Summary of rice LTR retrotransposons characterized in this study</p>
            </caption>
            <tblbdy cols="10">
               <r>
                  <c ca="left">
                     <p>Family</p>
                  </c>
                  <c ca="left">
                     <p>Pre-existing name(s)</p>
                  </c>
                  <c ca="left">
                     <p>Accession number of exemplar</p>
                  </c>
                  <c ca="left">
                     <p>Location</p>
                  </c>
                  <c ca="left">
                     <p>Chromosome number</p>
                  </c>
                  <c ca="left">
                     <p>LTR length (bp)</p>
                  </c>
                  <c ca="left">
                     <p>Inserted element length</p>
                  </c>
                  <c ca="left">
                     <p>TSR</p>
                  </c>
                  <c ca="left">
                     <p>%LNI (mean for family)</p>
                  </c>
                  <c ca="left">
                     <p>Approximate copy number (haploid genome)<sup>&#8225;</sup></p>
                  </c>
               </r>
               <r>
                  <c cspan="10">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr1</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos14/Rire15</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AC023240</p>
                  </c>
                  <c ca="left">
                     <p>100410-106807</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>965</p>
                  </c>
                  <c ca="left">
                     <p>6,398</p>
                  </c>
                  <c ca="left">
                     <p>AGTCC</p>
                  </c>
                  <c ca="left">
                     <p>98.1</p>
                  </c>
                  <c ca="center">
                     <p>250</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr2</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AL442110</p>
                  </c>
                  <c ca="left">
                     <p>95121-100070</p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
                  <c ca="left">
                     <p>267</p>
                  </c>
                  <c ca="left">
                     <p>4,950</p>
                  </c>
                  <c ca="left">
                     <p>ATATT</p>
                  </c>
                  <c ca="left">
                     <p>98.5</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr3</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AF458765</p>
                  </c>
                  <c ca="left">
                     <p>51-5250</p>
                  </c>
                  <c ca="left">
                     <p>?</p>
                  </c>
                  <c ca="left">
                     <p>146</p>
                  </c>
                  <c ca="left">
                     <p>5,200</p>
                  </c>
                  <c ca="left">
                     <p>CATTC</p>
                  </c>
                  <c ca="left">
                     <p>99.3</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr4</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AB026295</p>
                  </c>
                  <c ca="left">
                     <p>160208-165872</p>
                  </c>
                  <c ca="left">
                     <p>6</p>
                  </c>
                  <c ca="left">
                     <p>350</p>
                  </c>
                  <c ca="left">
                     <p>5,665</p>
                  </c>
                  <c ca="left">
                     <p>GTTAC</p>
                  </c>
                  <c ca="left">
                     <p>98.9</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr5</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC021891</p>
                  </c>
                  <c ca="left">
                     <p>56044-62135</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
                  <c ca="left">
                     <p>477</p>
                  </c>
                  <c ca="left">
                     <p>6,092</p>
                  </c>
                  <c ca="left">
                     <p>TACAG</p>
                  </c>
                  <c ca="left">
                     <p>96.2</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr6</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP001366</p>
                  </c>
                  <c ca="left">
                     <p>57569-62773</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>440</p>
                  </c>
                  <c ca="left">
                     <p>5,205</p>
                  </c>
                  <c ca="left">
                     <p>ACCTG</p>
                  </c>
                  <c ca="left">
                     <p>99.8</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr7</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP002538</p>
                  </c>
                  <c ca="left">
                     <p>44996-53915</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>1608</p>
                  </c>
                  <c ca="left">
                     <p>8,920</p>
                  </c>
                  <c ca="left">
                     <p>AGTTT</p>
                  </c>
                  <c ca="left">
                     <p>98.8</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr8</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC021891</p>
                  </c>
                  <c ca="left">
                     <p>65191-74406</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
                  <c ca="left">
                     <p>1220</p>
                  </c>
                  <c ca="left">
                     <p>9,216</p>
                  </c>
                  <c ca="left">
                     <p>TAAAT</p>
                  </c>
                  <c ca="left">
                     <p>97.2</p>
                  </c>
                  <c ca="center">
                     <p>1100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr9</it>
                        <sup>*</sup>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP000969</p>
                  </c>
                  <c ca="left">
                     <p>25869-28634</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr10</it>
                        <sup>*</sup>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC069324</p>
                  </c>
                  <c ca="left">
                     <p>137920-139740</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>400</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr11</it>
                        <sup>*</sup>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire1</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP003853</p>
                  </c>
                  <c ca="left">
                     <p>96975-98088</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr12</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC073166</p>
                  </c>
                  <c ca="left">
                     <p>104289-109024</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>221</p>
                  </c>
                  <c ca="left">
                     <p>4,736</p>
                  </c>
                  <c ca="left">
                     <p>AGAAG</p>
                  </c>
                  <c ca="left">
                     <p>99.7</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr13</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos5</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AC073405</p>
                  </c>
                  <c ca="left">
                     <p>72924-79364</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>968</p>
                  </c>
                  <c ca="left">
                     <p>6,441</p>
                  </c>
                  <c ca="left">
                     <p>TATGT</p>
                  </c>
                  <c ca="left">
                     <p>99.6</p>
                  </c>
                  <c ca="center">
                     <p>650</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr14</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos1/Tos4</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AC069324</p>
                  </c>
                  <c ca="left">
                     <p>8821-17191</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>319</p>
                  </c>
                  <c ca="left">
                     <p>8,371</p>
                  </c>
                  <c ca="left">
                     <p>CTCCC</p>
                  </c>
                  <c ca="left">
                     <p>97.6</p>
                  </c>
                  <c ca="center">
                     <p>350</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr15</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos12</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP002867</p>
                  </c>
                  <c ca="left">
                     <p>127118-132180</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>262</p>
                  </c>
                  <c ca="left">
                     <p>5,062</p>
                  </c>
                  <c ca="left">
                     <p>GCTTC</p>
                  </c>
                  <c ca="left">
                     <p>94.5</p>
                  </c>
                  <c ca="center">
                     <p>250</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr16</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos6</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP002845</p>
                  </c>
                  <c ca="left">
                     <p>42644-49551</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>300</p>
                  </c>
                  <c ca="left">
                     <p>6,908</p>
                  </c>
                  <c ca="left">
                     <p>TGCTT</p>
                  </c>
                  <c ca="left">
                     <p>97.9</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr17</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC018727</p>
                  </c>
                  <c ca="left">
                     <p>102539-96583</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>501</p>
                  </c>
                  <c ca="left">
                     <p>5,957</p>
                  </c>
                  <c ca="left">
                     <p>TCATC</p>
                  </c>
                  <c ca="left">
                     <p>99.6</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr18</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC068654</p>
                  </c>
                  <c ca="left">
                     <p>23423-25036</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr19</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC069300</p>
                  </c>
                  <c ca="left">
                     <p>73013-77731</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>205</p>
                  </c>
                  <c ca="left">
                     <p>4,719</p>
                  </c>
                  <c ca="left">
                     <p>GGGAC</p>
                  </c>
                  <c ca="left">
                     <p>99.5</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr20</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC084406</p>
                  </c>
                  <c ca="left">
                     <p>8749-14200</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
                  <c ca="left">
                     <p>286</p>
                  </c>
                  <c ca="left">
                     <p>5,452</p>
                  </c>
                  <c ca="left">
                     <p>TTATA</p>
                  </c>
                  <c ca="left">
                     <p>97.9</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr21</it>
                        <sup>*</sup>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Tos17</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AC087545</p>
                  </c>
                  <c ca="left">
                     <p>81711-84269</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr22</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC074283</p>
                  </c>
                  <c ca="left">
                     <p>24546-19810</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>191</p>
                  </c>
                  <c ca="left">
                     <p>4,647</p>
                  </c>
                  <c ca="left">
                     <p>GAACC</p>
                  </c>
                  <c ca="left">
                     <p>97.9</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr23</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP002843</p>
                  </c>
                  <c ca="left">
                     <p>144255-139782</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>209</p>
                  </c>
                  <c ca="left">
                     <p>4,774</p>
                  </c>
                  <c ca="left">
                     <p>AGGAT</p>
                  </c>
                  <c ca="left">
                     <p>99.5</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr24</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC016781</p>
                  </c>
                  <c ca="left">
                     <p>25997-30858</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>221</p>
                  </c>
                  <c ca="left">
                     <p>4,852</p>
                  </c>
                  <c ca="left">
                     <p>CCGAG</p>
                  </c>
                  <c ca="left">
                     <p>98.6</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr25</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP001278</p>
                  </c>
                  <c ca="left">
                     <p>28729 35569</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>417</p>
                  </c>
                  <c ca="left">
                     <p>6,841</p>
                  </c>
                  <c ca="left">
                     <p>TCGAG</p>
                  </c>
                  <c ca="left">
                     <p>98.9</p>
                  </c>
                  <c ca="center">
                     <p>500<sup>&#167;</sup></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr26</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire2</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP001111</p>
                  </c>
                  <c ca="left">
                     <p>59274-70587</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>440</p>
                  </c>
                  <c ca="left">
                     <p>11,314</p>
                  </c>
                  <c ca="left">
                     <p>GATAT</p>
                  </c>
                  <c ca="left">
                     <p>97.9</p>
                  </c>
                  <c ca="center">
                     <p>500</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr27</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire9</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP000399</p>
                  </c>
                  <c ca="left">
                     <p>75139-88038</p>
                  </c>
                  <c ca="left">
                     <p>6</p>
                  </c>
                  <c ca="left">
                     <p>1087</p>
                  </c>
                  <c ca="left">
                     <p>12,900</p>
                  </c>
                  <c ca="left">
                     <p>AATAT</p>
                  </c>
                  <c ca="left">
                     <p>99.0</p>
                  </c>
                  <c ca="center">
                     <p>900</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr28</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP002539</p>
                  </c>
                  <c ca="left">
                     <p>139654-121650</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>2195</p>
                  </c>
                  <c ca="left">
                     <p>18,005</p>
                  </c>
                  <c ca="left">
                     <p>GTTAT</p>
                  </c>
                  <c ca="left">
                     <p>99.0</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr29</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP002747</p>
                  </c>
                  <c ca="left">
                     <p>78609-87615</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>656</p>
                  </c>
                  <c ca="left">
                     <p>9,007</p>
                  </c>
                  <c ca="left">
                     <p>GGAAC</p>
                  </c>
                  <c ca="left">
                     <p>96.0</p>
                  </c>
                  <c ca="center">
                     <p>550</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr30</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC078891</p>
                  </c>
                  <c ca="left">
                     <p>52683-65684</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>1507</p>
                  </c>
                  <c ca="left">
                     <p>13,002</p>
                  </c>
                  <c ca="left">
                     <p>ACTTT</p>
                  </c>
                  <c ca="left">
                     <p>97.2</p>
                  </c>
                  <c ca="center">
                     <p>1500</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr31</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire7</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP003054</p>
                  </c>
                  <c ca="left">
                     <p>102778-110180</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>787</p>
                  </c>
                  <c ca="left">
                     <p>7,403</p>
                  </c>
                  <c ca="left">
                     <p>AAACC</p>
                  </c>
                  <c ca="left">
                     <p>99.9</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr32</it>
                        <sup>*</sup>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP002820</p>
                  </c>
                  <c ca="left">
                     <p>111559-12278</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="left">
                     <p>ND</p>
                  </c>
                  <c ca="center">
                     <p>50-100</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr33</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire8</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AP002864</p>
                  </c>
                  <c ca="left">
                     <p>35539-47557</p>
                  </c>
                  <c ca="left">
                     <p>6</p>
                  </c>
                  <c ca="left">
                     <p>3009</p>
                  </c>
                  <c ca="left">
                     <p>12,009</p>
                  </c>
                  <c ca="left">
                     <p>CACAC</p>
                  </c>
                  <c ca="left">
                     <p>99.1</p>
                  </c>
                  <c ca="center">
                     <p>550</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr34</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AF111709</p>
                  </c>
                  <c ca="left">
                     <p>25889-38685</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>3292</p>
                  </c>
                  <c ca="left">
                     <p>12,797</p>
                  </c>
                  <c ca="left">
                     <p>AGAAA</p>
                  </c>
                  <c ca="left">
                     <p>99.4</p>
                  </c>
                  <c ca="center">
                     <p>450</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr35</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC068924</p>
                  </c>
                  <c ca="left">
                     <p>94924-100611</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>423</p>
                  </c>
                  <c ca="left">
                     <p>5,688</p>
                  </c>
                  <c ca="left">
                     <p>CTAAT</p>
                  </c>
                  <c ca="left">
                     <p>98.3</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr36</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP001551</p>
                  </c>
                  <c ca="left">
                     <p>59722-64876</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>319</p>
                  </c>
                  <c ca="left">
                     <p>5,155</p>
                  </c>
                  <c ca="left">
                     <p>GGTCA</p>
                  </c>
                  <c ca="left">
                     <p>98.4</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr37</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Rire4?</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>AC068654</p>
                  </c>
                  <c ca="left">
                     <p>2534-6969</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
                  <c ca="left">
                     <p>794</p>
                  </c>
                  <c ca="left">
                     <p>4,436</p>
                  </c>
                  <c ca="left">
                     <p>CTTGA</p>
                  </c>
                  <c ca="left">
                     <p>98.9</p>
                  </c>
                  <c ca="center">
                     <p>600</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr38</it>
                        <sup>&#8224;</sup>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AF458766</p>
                  </c>
                  <c ca="left">
                     <p>31-5535</p>
                  </c>
                  <c ca="left">
                     <p>?</p>
                  </c>
                  <c ca="left">
                     <p>332</p>
                  </c>
                  <c ca="left">
                     <p>5,525</p>
                  </c>
                  <c ca="left">
                     <p>TGAGG</p>
                  </c>
                  <c ca="left">
                     <p>96.2</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr39</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AF458767</p>
                  </c>
                  <c ca="left">
                     <p>51-5267</p>
                  </c>
                  <c ca="left">
                     <p>?</p>
                  </c>
                  <c ca="left">
                     <p>368</p>
                  </c>
                  <c ca="left">
                     <p>5,217</p>
                  </c>
                  <c ca="left">
                     <p>CAAAG</p>
                  </c>
                  <c ca="left">
                     <p>97.6</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr40</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AC020666</p>
                  </c>
                  <c ca="left">
                     <p>65731-77151</p>
                  </c>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>564</p>
                  </c>
                  <c ca="left">
                     <p>11,421</p>
                  </c>
                  <c ca="left">
                     <p>ACATG</p>
                  </c>
                  <c ca="left">
                     <p>98.3</p>
                  </c>
                  <c ca="center">
                     <p>600</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr41</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP003631</p>
                  </c>
                  <c ca="left">
                     <p>27347-43001</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>518</p>
                  </c>
                  <c ca="left">
                     <p>15,655</p>
                  </c>
                  <c ca="left">
                     <p>GGTTC</p>
                  </c>
                  <c ca="left">
                     <p>97.7</p>
                  </c>
                  <c ca="center">
                     <p>300</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr42</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AF458768</p>
                  </c>
                  <c ca="left">
                     <p>51-5655</p>
                  </c>
                  <c ca="left">
                     <p>?</p>
                  </c>
                  <c ca="left">
                     <p>358</p>
                  </c>
                  <c ca="left">
                     <p>5,605</p>
                  </c>
                  <c ca="left">
                     <p>ATGTC</p>
                  </c>
                  <c ca="left">
                     <p>99.9</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr43</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP000815</p>
                  </c>
                  <c ca="left">
                     <p>77117-78910</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>291</p>
                  </c>
                  <c ca="left">
                     <p>1,794</p>
                  </c>
                  <c ca="left">
                     <p>CTGAT</p>
                  </c>
                  <c ca="left">
                     <p>98.6</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Osr44</it>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>AP000364</p>
                  </c>
                  <c ca="left">
                     <p>41541-42747</p>
                  </c>
                  <c ca="left">
                     <p>8</p>
                  </c>
                  <c ca="left">
                     <p>148</p>
                  </c>
                  <c ca="left">
                     <p>1,207</p>
                  </c>
                  <c ca="left">
                     <p>AACAA</p>
                  </c>
                  <c ca="left">
                     <p>99.9</p>
                  </c>
                  <c ca="center">
                     <p>&lt;50</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>*</sup>Location given is for an example RT in the GBRD (no full-length element was identified for this family). <sup>&#8224;</sup>As a full-length element is known in the MRD, the TSR and lengths of the LTR and element (columns 5-7) are taken from an element in the MRD while the location (if given) in columns 2-4 refers to an RT in the GBRD. <sup>&#8225;</sup>Percentages based on number of hits using a sample LTR from each family as query to search the MRD. <sup>&#167;</sup>N. Jiang and S.R. Wessler (unpublished work) suggest that if pericentric DNA (which is largely heterochromatic) is taken into account, <it>Osr25</it> elementsexist at a higher copy number (~1,000 copies in the entire genome) than our survey, based largely on euchromatic sequences, would suggest. ND, not determined.</p>
            </tblfn>
         </tbl>
         <p>Our survey has identified numerous LTR retrotransposon families that have not been described previously. These findings show that at least 59 distinct LTR retrotransposon families exist in the rice genome. This result compares with an earlier family estimate of 32 based on screening genomic libraries [<abbr bid="B8">8</abbr>]. <it>Copia</it>-like elements are less numerous than <it>gypsy</it>-like elements in the rice genome, but they still comprise more than half the families, a total of 37. In addition to 57 families of <it>copia</it>- and <it>gypsy</it>-like elements, we have identified two families of LTR retrotransposons (<it>Osr43</it> and <it>Osr44</it>) that show no significant sequence similarity to any known transposon.</p>
         <p>For the purposes of this analysis, a 'full-length element' is defined as one that has two complete and recognizable LTRs. Any other LTR retrotransposon sequence is here defined as a 'fragment'. The results of our survey of the GBRD and MRD suggest that there are in the order of 450 full-length <it>copia</it>-like elements in the entire rice genome. We found full-length <it>copia</it>-like elements both with and without RT domains. We estimate the total copy number (including fragmentary copies) at 3,500, or about 3% of the genome. BLAST searches with representative LTR queries from each of the rice LTR-retrotransposon families against the MDR indicate that <it>gypsy</it>-like elements are twice as common (total copy number ~7,000; ~1,400 full-length). Previous estimates of this ratio have been somewhat higher [<abbr bid="B13">13</abbr>]. Owing in part to their large LTRs, <it>gypsy</it>-like elements in rice are twice as long as <it>copia</it>-like elements (11.7 kb versus 6.2 kb) and so make up a proportionately larger fraction of the genome (~14%). That is, a total of about 17% of the genome is composed of LTR retrotransposon sequences. This estimate exceeds those of previous workers [<abbr bid="B8">8</abbr>,<abbr bid="B13">13</abbr>,<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>]. For example, using a variety of RT probes Wang <it>et al.</it> [<abbr bid="B14">14</abbr>] estimated that around 100 copies of <it>copia</it>-like elements are present in the entire haploid genome. This estimate did not discriminate between full-length and fragmentary copies. From our examination of the searchable portion of the GBRD alone (which represented at the time approximately 10% of the rice genome), we have identified the actual sequences for 46 separate full-length <it>copia</it>-like elements. This implies that the number of full-length <it>copia</it>-like elements in the whole genome should be about ten times higher, that is, around 450 to 500 elements. In an analysis of 340 kb around the <it>Adh1-Adh2</it> region of the rice genome, Tarchini <it>et al.</it> [<abbr bid="B16">16</abbr>] reported that 14.4% of this region consisted of LTR retrotransposons. This value is in reasonably good agreement with our estimate of about 17%. Mao <it>et al.</it> [<abbr bid="B15">15</abbr>] give a lower figure (9.3%) but we believe our higher figure is more accurate because their study sought homology to known retrotransposon sequences and such homology would be undetectable for the many new families of retrotransposons presented here. Similarly, they give a higher ratio of <it>gypsy</it>- to <it>copia</it>-like elements, but they may not have been aware that <it>gypsy</it>-like elements are significantly larger in rice, which would inflate their estimate of this ratio.</p>
         <p>The previous low estimates of copy number given for rice LTR retrotransposons are probably attributable to three factors. First, these earlier studies used an incomplete set of RTs as probes for hybridization (or as queries for BLAST). For example, <it>Osr8</it>, a high copy <it>copia</it>-like family, was not recognized in previous studies. Second, a number of rice LTR retrotransposons lack an RT ORF and would thus go undetected in studies using RT probes. In particular, no member of families <it>Osr25</it> and <it>Osr37/Rire4</it> seem to have an RT (yet these two families have a total copy number of around 900 elements). Third, data-mining with LTR_STRUC (see Materials and methods) allows a higher degree of assurance that the putative RTs detected in the survey actually are RTs because it places putative polyproteins in the context of a canonical retroviral structure. Such is not the immediate result of a simple BLAST with an RT query. Our estimate that LTR retrotransposons make up 17% of the rice genome is conservative, inasmuch as our study was based primarily on euchromatic sequences and did not include elements present within the traditionally retrotransposon-rich heterochromatin [<abbr bid="B14">14</abbr>,<abbr bid="B17">17</abbr>]. Thus, our results bring the rice genome closer to the LTR retrotransposon densities reported for other cereals.</p>
         <sec>
            <st>
               <p>Intra-element percent LTR nucleotide identity</p>
            </st>
            <p>Because of the replication process characteristic of LTR retrotransposons, the LTRs of a given retroelement are sequentially identical at the time the element inserts into the host genome [<abbr bid="B18">18</abbr>]. Thereafter, as an element accumulates mutations, its LTRs become increasing different from each other as substitutions specific for each of the two LTRs increase in number. The level of nucleotide identity seen between LTRs of a particular element, usually referred to as intra-element percent LTR nucleotide identity (%LNI), can be used in determining the relative ages of LTR retrotransposon families [<abbr bid="B7">7</abbr>]. In rice, comparison of the two LTRs of the same element often showed the presence of a 10 to 30 bp regional duplication present in one LTR but not the other. In calculating %LNI, we have considered such duplications as single mutation events.</p>
            <p>As the neutral nucleotide substitution rate has yet to be computed for rice, we cannot presently equate %LNI with a divergence time in years. However, the generally low level of sequence divergence between flanking LTRs of rice LTR retrotransposons (1.7%) indicates that most of the euchromatic full-length LTR retrotransposons in rice are relatively young, although significantly older elements were also identified. The seeming preponderance of young full-length LTR retrotransposons in the euchromatin of rice is similar to previous reports on yeast [<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>], <it>Caenorhabditis elegans</it> [<abbr bid="B7">7</abbr>], <it>A. thaliana</it> [<abbr bid="B21">21</abbr>] and <it>Drosophila melanogaster</it> [<abbr bid="B12">12</abbr>]. This contrasts with findings in <it>Z. mays</it> [<abbr bid="B12">12</abbr>] and humans [<abbr bid="B22">22</abbr>].</p>
         </sec>
         <sec>
            <st>
               <p><it>Copia</it>-like families</p>
            </st>
            <p>To date, 23 families of <it>copia</it>-like elements have been reported for rice (S. Wang, submission to EMBL, N. Jwa, submission to GenBank, and [<abbr bid="B8">8</abbr>,<abbr bid="B9">9</abbr>,<abbr bid="B19">19</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>]). Several have been described under more than one name. For example, the amino-acid sequence given for <it>Tos4</it> in Hirochika <it>et al.</it> [<abbr bid="B23">23</abbr>] is the same as that given for <it>Tos1</it> in GenBank (accession number S22455) so they are really the same. <it>Rire5</it> described by Kumekawa <it>et al.</it> [<abbr bid="B25">25</abbr>] is the same family as <it>Tos14</it> previously described by Hirochika <it>et al.</it> [<abbr bid="B23">23</abbr>]. The equivalence between <it>Tos14</it> and <it>Rire5</it> became evident when we found the LTR sequence reported by Kumekawa <it>et al.</it> in elements that also contained the RT sequence given by Hirochika for <it>Tos14.</it> In our survey of GenBank and MRDB, we have identified an additional 16 <it>copia</it>-like families that have not been described by previous workers. In addition, exemplars for each of the previously identified families were found (except in the case of certain families that exist at such low copy numbers that no full-length element exists in GenBank or MRDB).</p>
         </sec>
         <sec>
            <st>
               <p>The largest <it>copia</it>-like family</p>
            </st>
            <p>One of the most interesting new finds in our survey was <it>Osr8</it>, one of the oldest families of LTR retrotransposons in the rice genome. On the basis of a survey of the available portion of the GBRD and MRD, we estimate the copy number of <it>Osr8</it> to be around 1,100 (more than any other <it>copia</it>-like family). <it>Osr8</it> elements exist far more frequently as fragments (ratio of 10:1) and they display relatively low levels of %LNI in their full-length copies (mean %LNI for the five full-length <it>Osr8</it> elements present in the GBRD is 97.2%). The RT of <it>Osr8</it> is 60% similar to an unnamed polyprotein in <it>Z. mays</it> (AAD20307). A closely related family, <it>Osr10</it> has two full-length copies in the GBRD but scans of the MRD suggest this element, also previously unrecognized, has the third highest copy number (~400) among <it>copia</it>-like elements. Outside rice, the RT of <it>Osr10</it> shows highest similarity (~65%) to that of the maize retrotransposon <it>Opie-2</it> (T04112). The broader clade that includes <it>Osr7, Osr8, Osr9</it>, and <it>Osr10</it> is closely related to <it>Endovir1-1</it> (AAG52949) of <it>Arabidopsis</it> (Figure <figr fid="F1">1</figr>, Table <tblr tid="T2">2</tblr>). These elements are also related (~60% similar) to maize's <it>PREM-2</it> as well as to tomato's <it>ToRTL1.</it> Both <it>Osr7</it> and <it>Osr9</it> are present in very low copy number (one full-length and a few fragments in the GBRD).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>RT-based neighbor-joining tree for <it>copia</it>-like retrotransposons</p>
               </caption>
               <text>
                  <p>RT-based neighbor-joining tree for <it>copia</it>-like retrotransposons. Distances (uncorrected <it>p</it>) appear next to each branch. RT sequences from plant species other than rice are included for comparison.</p>
               </text>
               <graphic file="gb-2002-3-10-research0053-1"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Non-rice RTs used in phylogenies</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Name of retrotransposon</p>
                     </c>
                     <c ca="left">
                        <p>Accession number</p>
                     </c>
                     <c ca="left">
                        <p>Host organism</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Opie-2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T04112</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Z. mays</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Hopscotch</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T02087</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Z. mays</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Fourf</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AAK73108</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Z. mays</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sto-4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T17429</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Z. mays</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Zmr-1</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>S27768</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Z. mays</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Endovir1-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AAG52949</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ta1-2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>S23315</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-1</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NP_175303</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-2</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T01860</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-3</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NP_178752</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-4</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NP_174802.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-5</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AAF13073.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Atr-6</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NP_179047</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>A. thaliana</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Retrosor1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AAD19359</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Sorghum bicolor</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Retrosor3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AAD22153</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. bicolor</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Daniela</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AF326781<sup>&#8224;</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Triticum aestivum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Acr-1</it>
                           <sup>*</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CAA73042</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ananas comosus</it>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>*</sup>Previously unnamed RT found by BLAST searches of the GBRD, using rice RTs found in our study as queries. <it>Acr</it>, <it>Ananas comosus</it> retrotransposon; <it>Atr</it>, <it>A. thaliana</it> retrotransposon; <it>Zmr</it>, <it>Z. mays</it> retrotransposon.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p><it>Osr14/Tos1/Tos4, Osr15/Tos12</it> and <it>Osr53/Tos18</it></p>
            </st>
            <p>Although it is present at only about a quarter of the copy number of <it>Osr8</it>, the unrelated <it>Osr14/Tos1/Tos4</it> is also composed primarily of highly fragmented elements. Those that are full length have low %LNI (family mean 97.6%). Thus, <it>Osr14/Tos1/Tos4</it> and <it>Osr8</it> seem to be of similar age and to have followed a similar evolutionary pattern, albeit with less intense amplification in the case of <it>Osr14/Tos1/Tos4. Osr14/Tos1/Tos4, Osr15/Tos12</it>, and <it>Osr53/Tos18</it> form a well defined clade and are more closely related to <it>Ta1-2</it> (S23315) of <it>Arabidopsis</it> than to any other rice retroelement family outside their clade (Figure <figr fid="F1">1</figr>, Table <tblr tid="T2">2</tblr>). <it>Osr15/Tos12</it> and <it>Osr53</it> are only just sufficiently different to constitute distinct families.</p>
         </sec>
         <sec>
            <st>
               <p>A quartet of closely allied families</p>
            </st>
            <p><it>Osr1/Tos14/Rire5, Osr13/Tos5, Osr51/Tos15</it>, and <it>Osr52/Tos16</it> have been described as distinct families but, inasmuch as their RTs are all 85% similar to each other, these groups are only marginally distinct. Searches of GenBank show that elements in this group are much more closely related to (75-80% at the amino-acid level) to maize retrotransposon <it>Fourf</it> (AAK73108) than to any rice LTR retrotransposon outside their clade. If the elements belonging to this group were considered to be a single family, it would be almost as large (~900 elements) as <it>Osr8.</it> In the GBRD the majority of these elements are fragmentary, but the estimated copy number of full-length elements in the rice genome for this quartet still exceeds 100.</p>
         </sec>
         <sec>
            <st>
               <p>A <it>Hopscotch</it>-like clade of fragmented elements</p>
            </st>
            <p><it>Osr18, Osr19, Osr20, Osr22, Osr23, Osr24, Osr45/Tos7</it>, and <it>Osr46/Tos8</it> form a clade of low copy number families composed primarily of fragmentary copies. Our results suggest that each of these families has a copy number in the range of 50-100 elements. Members of this clade are closely related to maize's <it>Hopscotch</it> element (T04112) (Figure <figr fid="F1">1</figr>, Table <tblr tid="T2">2</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Low copy number <it>copia</it>-like families</p>
            </st>
            <p><it>Osr2</it> and <it>Osr12</it> are low-copy families and are represented in the GBRD by two and three copies respectively, all of which are full length (although one copy of <it>Osr12</it> contains a large internal deletion), suggesting that these elements may have recently invaded the rice genome. The high level of LTR nucleotide identity (&#8805; 99%) seen in these elements is consistent with this recent invasion hypothesis. Members of <it>Osr12</it> and <it>Osr2</it> are potentially active because they have large, intact polyprotein ORFs, usually in excess of 1,000 amino acids. All three <it>Osr12</it> elements detected in the GBRD are on chromosome 10. Similarly, both <it>Osr2</it> elements are inserted within 50 kb of each other on chromosome 4. Nonetheless, these two families are not closely related (their RT sequences are only ~50% similar at the amino-acid level). <it>Osr12</it> RTs differ from those of all other rice <it>copia</it>-like elements by 50%. And yet RT sequences of elements in <it>Osr12</it> are 60% similar to certain elements in the maize genome (<it>Zmr1</it> (S27768) and <it>mzecopia</it> (M94481.1)).</p>
            <p>One full-length, and one fragmented copy of <it>Osr6</it> are present in the GBRD. <it>Osr5</it> is slightly more common than <it>Osr6</it>, to which it is most closely related, but it is currently represented in the GBRD by only a single full-length copy and a few fragments. <it>Osr5</it> is 60% similar to the tobacco retrotransposon <it>Tnt1-94</it> at the amino-acid level (RT comparison). <it>Osr4</it> is another low-copy family. It has several fragmented representatives in the GBRD, and is probably somewhat older than <it>Osr12</it> and <it>Osr2</it>, but it has only three full-length copies in the GBRD, <it>Osr4</it> elements have an exceptionally large polyprotein ORF (~1,600 amino acids). The RT of <it>Osr4</it> shows 50% similarity to that of retroelements in the <it>Arabidopsis</it> genome (for example, BAB01972, NP_175303).</p>
            <p>Although the RT of <it>Osr3</it> was detected during our survey, elements in this family are fragments with ill defined LTRs. TBLASTN reveals the RT of <it>Osr3</it> to be the single representative of its type in the GBRD. Both <it>Osr3</it> and the equally aberrant <it>Osr21/Tos17</it> differ from those of other <it>copia</it>-like elements found in our study by about 55%. <it>Osr11/Rire1</it> is a low-copy family closely related (75% similarity) to a retroelement in the <it>Arabidopsis</it> genome (<it>Atr-2</it>, T01860). Two other closely related families are <it>Osr16/Tos6</it> and <it>Osr17</it>, both of which are similar to <it>Sto-4</it> (T17429) of maize (Figure <figr fid="F1">1</figr>, Table <tblr tid="T2">2</tblr>). Nine additional low-copy families identified by earlier workers are <it>Osr47/Tos9</it>, <it>Osr48/Tos10, Osr49/Tos11, Osr50/Tos13, Osr54/Tos19, Osr55/Tos20, Osr57/Rtr3, Osr58/Rrt5</it>, and <it>Osr59/Rrt8.</it> Source references for each of these nine families are given in Table <tblr tid="T3">3</tblr>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Previously named low-copy families for which a full-length exemplar has not been presented in this paper</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Family</p>
                     </c>
                     <c ca="left">
                        <p>Pre-existing family name</p>
                     </c>
                     <c ca="left">
                        <p>Accession number (or source) of sequence</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr45</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03709</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr46</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos8</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03704</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr47</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03705</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr48</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos10</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03706</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr49</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos11</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03707</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr50</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos13</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Hirochika <it>et al.</it> [23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr51</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos15</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03711</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr52</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos16</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03712</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr53</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos18</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03716</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr54</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos19</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03721</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr55</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tos20</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03723</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr56</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rire3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Kumekawa <it>et al.</it> [25]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr57</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rtr3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03666</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr58</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rrt5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03669</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Osr59</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rrt8</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>T03671</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p><it>Gypsy</it>-like families predominate in <it>O. sativa</it></p>
            </st>
            <p><it>Osr27/Rire9</it> [<abbr bid="B26">26</abbr>] is the third largest family in the rice genome, with an estimated copy number of 900 elements, mostly full length. Li <it>et al.</it> [<abbr bid="B26">26</abbr>] estimated the copy number of this family at 1,600. The typical <it>Osr27/Rire9</it> element is quite large (~12.8 kb total length). Having intact polyprotein ORFs and high mean %LNI (99%), these elements probably are, or recently have been, actively transposing. Yet the presence of a few members of this family that are more mutated (short ORFs, low LTR-LTR nucleotide identity) suggests that this may also be an ancient family. Two other families, <it>Osr40</it> and <it>Osr41</it>, are also members of the same clade as <it>Osr27/Rire9, Osr25</it> and <it>Osr26/Rire2</it> (<it>Osr25</it> and <it>Osr26/Rire2</it> are discussed below), but both have RTs that are about 30% different from those of <it>Osr26/Rire2</it> and <it>Osr27/Rire9.</it> Neither <it>Osr40</it> nor <it>Osr41</it> has been previously identified, but with approximate copy numbers of 600 and 300, respectively, these are both large families. The RTs of members of this clade show about 60% similarity to that of <it>Retrosor1</it> (<it>Sorghum bicolor</it>, AAD19359).</p>
            <p>With approximately 1,500 elements, <it>Osr30</it> constitutes 14% of all LTR retrotransposons in the rice genome. Although <it>Osr30</it> is the largest family of LTR retroelements in the genome, it has not been previously named. These elements are slightly larger (~13.1 kb) than those of <it>Osr27/Rire9.</it> A higher proportion of fragmented copies and lower level of LTR-LTR nucleotide identity suggest that <it>Osr30</it> is older than <it>Osr27/Rire9</it>. <it>Osr29</it>, which is closely allied to <it>Osr30</it>, is also a large family with more than 500 member elements. Taken together, the elements of the <it>Osr29</it> and <it>Osr30</it> clade are unusual, because they are as closely related to other major rice clades as they are to any elements outside rice. <it>Osr28</it> is a low-copy family that is most closely related to <it>Osr29</it> and <it>Osr30</it> (Figure <figr fid="F2">2</figr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>RT-based neighbor-joining tree for <it>gypsy</it>-like retrotransposons</p>
               </caption>
               <text>
                  <p>RT-based neighbor-joining tree for <it>gypsy</it>-like retrotransposons. Distances (uncorrected <it>p</it>) appear next to each branch. RT sequences from plant species other than rice are included for comparison.</p>
               </text>
               <graphic file="gb-2002-3-10-research0053-2"/>
            </fig>
            <p>Two other large <it>gypsy</it>-like families are <it>Osr33/Rire8</it> [<abbr bid="B25">25</abbr>] and <it>Osr34.</it> These two families each have copy numbers of approximately 500. Two low-copy families belonging to the same clade are <it>Osr32</it> and <it>Osrs6/Rire3</it> [<abbr bid="B27">27</abbr>] (Figure <figr fid="F2">2</figr>). Members of these families have large LTRs, typically in the range 3,000-3,500 bp. RTs of families in this clade show high sequence similarity to an LTR retrotransposon in pineapple (~70% to <it>Acr-1;</it> CAA73042) and to one in <it>Sorghum bicolor</it> (~77% to <it>Retrosor3</it>, AAD221153) (Figure <figr fid="F2">2</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Low-copy <it>gypsy</it>-like elements</p>
            </st>
            <p><it>Osr31/Rire7</it> is an aberrant low-copy family that is much more closely related (77% similarity) to an <it>Arabidopsis</it> element, <it>Atr-4</it> (see Table <tblr tid="T2">2</tblr>), than to any other LTR retroelement families in the rice genome (Figure <figr fid="F2">2</figr>). In the clade of five low-copy families, composed of <it>Osr35, Osr36, Osr38, Osr39</it>, and <it>Osr42</it>, an RT was found in the GBRD for only two families, <it>Osr35</it> and <it>Osr36.</it> The other elements were identified in scans of the MRD and their full sequences have since been submitted to GenBank (for accession numbers, see Table <tblr tid="T1">1</tblr>). This clade is closely related to <it>Arabidopsis</it> element <it>Atr-5</it> (Figure <figr fid="F2">2</figr>, Table <tblr tid="T2">2</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Families of non-autonomous elements</p>
            </st>
            <p>Members of family <it>Osr25</it> are all internally deleted and thus non-autonomous (mean length 4.3 kb). Although <it>Osr25</it> elements have typical LTRs, PBS, and PPT, the inter-LTR region contains only non-coding, repetitive DNA. The LTRs of <it>Osr25</it> display 65-70% sequence similarity to the autonomous elements of the <it>gypsy</it>-like family <it>Osr26/Rire2.</it> Elements with LTRs having such a high degree of similarity are usually considered members of the same family. Nevertheless, because members of <it>Osr26/Rire2</it> have the usual coding structure typical of other <it>gypsy</it>-like elements (while <it>Osr25</it> elements entirely lack typical retroviral genes) and members of these two families fall into two sharply distinct, non-overlapping clades, we report these two types of elements as separate families. Estimates based on scans of the MRD and the GBRD suggest that the rice genome contains about 500 copies each of <it>Osr25</it> and <it>Osr26/Rire2</it>. <it>Osr25</it> and <it>Osr26/Rire2</it> display 98.9 and 97.9% LNI respectively.</p>
            <p><it>Osr37/Rire4</it> is also aberrant compared to other rice LTR retrotransposon families. The typical element in this family is 4.4 kb long, about the same length as <it>Osr25</it> elements. Members of <it>Osr37/Rire4</it> usually carry a large ORF (up to 600 amino acids) just upstream of the 3' LTR. This ORF shows no significant similarity to any known RT sequence. Up to the present in the GBRD, where these ORFs are generally identified simply as hypothetical proteins, the large ORF of <it>Osr37/Rire4</it> seems not to have been recognized as a retroviral gene. This ORF may serve an integrase function as BLAST searches show that it has low homology to a putative integrase in <it>A. thaliana</it> (28%; AC005171). There are about 600 copies of <it>Osr37/Rire4</it> in the entire rice genome.</p>
            <p>In addition to the foregoing <it>copia</it>- and <it>gypsy</it>-like families, our scans identified two families, <it>Osr43</it> and <it>Osr44</it>, of small elements (overall length &lt; 2,000 bp). With LTRs only 148 bp long and an overall length of 1,207 bp, <it>Osr44</it> elements are especially small. Members of <it>Osr43</it> and <it>Osr44</it> are unique because, although they possess all of the canonical LTR-retrotransposon structural features (LTRs, PBS, PPT, and TSRs), they are internally deleted and either completely lack or encode only very small ORFs with no similarity to any known protein. Both families contain on the order of 100 copies genome-wide.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Rice LTR retrotransposons are a significant component of the rice genome. We estimate that LTR retrotransposons constitute at least 17% of the <it>O. sativa</it> genome. Although this value is lower than the estimated percentage of LTR retrotransposons in the genomes of other cereal plants [<abbr bid="B2">2</abbr>,<abbr bid="B12">12</abbr>], it is more than tenfold greater than the estimated percentage of LTR retrotransposons in <it>A. thaliana</it>, a species with a genome one-third the size of the rice genome [<abbr bid="B21">21</abbr>]. This disproportionate increase in the percentage of LTR retrotransposons as a function of genome size is consistent with the view that genome size variability in plants is often heavily dependent on variation in LTR retrotransposon content [<abbr bid="B27">27</abbr>,<abbr bid="B28">28</abbr>].</p>
         <p>We have determined that individual full-length LTR-retrotransposons present in the sequenced euchromatic regions of the rice genome are all relatively young, displaying, on average, greater than 98% sequence identity between their LTRs. Comparative genomic studies of LTR retrotransposons in both plants and animals have revealed that species with smaller genomes [<abbr bid="B7">7</abbr>,<abbr bid="B10">10</abbr>,<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>,<abbr bid="B21">21</abbr>] do not harbor older families of LTR retrotransposons, as do species with larger genomes [<abbr bid="B12">12</abbr>,<abbr bid="B22">22</abbr>]. It has been hypothesized that the rate of turnover of retroelements may be higher in small genomes as a result of the presence of less effective epigenetic silencing mechanisms [<abbr bid="B10">10</abbr>]. It remains to be determined whether or not this hypothesis is an adequate explanation of the apparent lack of older full-length LTR retrotransposons in the euchromatic portion of the rice genome.</p>
         <p>In general, the major clades of rice LTR retrotransposons are more closely related to elements present in other species than to the other clades of rice elements, suggesting that horizontal transfer may have occurred over the evolutionary history of rice LTR retrotransposons. Further analysis is required to definitively test the horizontal transfer hypothesis.</p>
         <p>The newly developed search algorithm (LTR_STRUC) we have used in this study to initially identify LTR retrotransposons in the rice genome is not dependent upon sequence homology as are standard search methods such as BLAST. As a consequence, we identified several previously unreported families of rice LTR retrotransposons consisting of non-coding and, in some cases, repeating, sequence motifs. LTR retrotransposons of similar structure have recently been identified within the genomes of both monocotyledonous and dicotyledonous plants [<abbr bid="B29">29</abbr>]. Preliminary evidence suggests that these elements may have a significant role in restructuring plant genomes over evolutionary time [<abbr bid="B29">29</abbr>].</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Automated characterization of LTR retrotransposons using LTR_STRUC</p>
            </st>
            <p>LTR_STRUC identifies new LTR retrotransposons on the basis of the presence of characteristic retroelement features (E.M.M. and J.F.M., unpublished work). It scans nucleotide sequence data for putative LTR pairs, aligns the putative pairs, and scores them on the basis of the presence/absence of expected motifs such as TSRs, canonical dinucleotides, PBS, PPT, and so on. When a given pair receives a score above a (user-specified) cut-off, an output record is generated that specifies salient information about the putative element, such as the length of the transposon and its LTRs, its position within the contig, an alignment of its LTRs, the nucleotide sequence of the transposon, its LTRs and target-site repeats, as well as a file listing all ORFs. In our study, once putative elements were identified, sequence analysis was carried out on the individual output files to identify those that described actual LTR retrotransposons. Additional elements were identified by BLAST searches using elements located by LTR_STRUC as queries.</p>
         </sec>
         <sec>
            <st>
               <p>Datasets scanned</p>
            </st>
            <p>Initial scans with LTR_STRUC were conducted on a dataset consisting of the 29.8 Mb of <it>O. sativa</it> BAC-derived sequence data available in GenBank at the time of the initial scan (December 2000). This dataset (TDS) was obtained from the TIGR website [<abbr bid="B30">30</abbr>]. Subsequently, LTR_STRUC was used to scan the non-redundant MRD, a product of the Monsanto Rice Genome Sequencing Project. The MRD is based on an initial dataset of 3,391 BACs distributed across the genome of <it>O. sativa</it> cv. Nipponbare - the same cultivar used by the International Rice Genome Sequencing Project. Removal of contaminants and redundancies from this initial dataset produced the MRD (consisting of 52,202 contigs, totaling 259 Mb of the 430-Mb rice genome). More recently, in an effort to determine the relative copy numbers of the various families and identify additional elements not picked up in our initial survey with LTR_STRUC, we have used representative sequences from each retrotransposon family identified in this study as queries to conduct BLAST searches against both the MRD and the GBRD. Thus, the results reported here constitute a reasonably unbiased survey of LTR-retrotransposon diversity in rice. Both the MRD and GBRD are heavily weighted toward euchromatic sequences. The amount of data scanned was significantly less than the total amount of nucleotide sequence contained in the MRD and GBRD. Much of the MRD (~36%) is composed of contigs that are less than 10 kb long and are therefore of limited utility for the LTR_STRUC program, which finds only full-length elements (rice <it>gypsy</it>-like elements are typically longer than 10 kb and are not entirely contained in such short contigs). In the case of the GBRD, the amount of rice nucleotide sequence available for search was less than one-third of the 174 Mb released to the public (because of 15% redundancy, the GBRD sequences amounted to a total of only about 150 Mb, of which only some 50 Mb were actually available for BLAST search because most of these sequences were in the process of being 'finished'). RT sequences were identified according to previously described criteria [<abbr bid="B31">31</abbr>,<abbr bid="B32">32</abbr>].</p>
         </sec>
         <sec>
            <st>
               <p>Multiple sequence alignments and phylogenetic analyses</p>
            </st>
            <p>The RT domains of the <it>Osr</it> elements were aligned with previously reported RT sequences (Table <tblr tid="T2">2</tblr>). The ClustalW analysis [<abbr bid="B33">33</abbr>] extension to MacVector 7.0 was used to generate two amino-acid alignments, one for <it>gypsy</it>-like, and one for <it>copia</it>-like elements. Draw N-J Tree and Bootstrap N-J commands of ClustalW were then used to generate non-bootstrapped and bootstrapped trees, respectively.</p>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by grant DBI-0077709 from the National Science Foundation. We thank Rebecca McCarthy for editorial assistance and Eric Ganko for constructive criticism.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Nested retrotransposons in the intergenic regions of the maize genome.</p>
            </title>
            <aug>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tikhanov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>YK</fnm>
               </au>
               <au>
                  <snm>Motchoulskaia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zakharov</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Melake-Berhan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>274</volume>
            <fpage>765</fpage>
            <lpage>768</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.274.5288.765</pubid>
                  <pubid idtype="pmpid" link="fulltext">8864112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Repetitive DNA and chromosome evolution in plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Flavell</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>1986</pubdate>
            <volume>312</volume>
            <fpage>227</fpage>
            <lpage>242</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2870519</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Cytosine methylation and the ecology of intragenomic parasites.</p>
            </title>
            <aug>
               <au>
                  <snm>Yoder</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bestor</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>335</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(97)01181-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">9260521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Interspersed repeats and other mementos of transposable elements in mammalian genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Smit</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>657</fpage>
            <lpage>663</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00031-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607616</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Nuclear DNA content of some important plant species.</p>
            </title>
            <aug>
               <au>
                  <snm>Arumuganathan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Earle</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol Rep</source>
            <pubdate>1991</pubdate>
            <volume>9</volume>
            <fpage>208</fpage>
            <lpage>218</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Plant DNA C-values database</p>
            </title>
            <url>http://www.rbgkew.org.uk/cval/searchguide.html</url>
         </bibl>
         <bibl id="B7">
            <title>
               <p><it>Drosophila</it> euchromatic LTR retrotransposons are much younger than the host species in which they reside.</p>
            </title>
            <aug>
               <au>
                  <snm>Bowen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>11</volume>
            <fpage>1527</fpage>
            <lpage>1540</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.164201</pubid>
                  <pubid idtype="pmpid" link="fulltext">11544196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Retrotransposon families in rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirochika</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fukuchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1992</pubdate>
            <volume>233</volume>
            <fpage>209</fpage>
            <lpage>216</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1376404</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Identification and characterization of two tandem repeat sequences (<it>TrsB</it> and <it>TrsC</it>) and a retrotransposon (<it>Rire1</it>) as genome-general sequences in rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Noma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohtsubo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ohtsubo</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genes Genet Syst</source>
            <pubdate>1996</pubdate>
            <volume>71</volume>
            <fpage>373</fpage>
            <lpage>382</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9080684</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genomic analysis of <it>Caenorhabditis elegans</it> reveals ancient families of retroviral-like elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Bowen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>924</fpage>
            <lpage>935</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.9.10.924</pubid>
                  <pubid idtype="pmpid" link="fulltext">10523521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>High rates of frameshift mutations within homo-oligomeric runs during a single cycle of retroviral replication.</p>
            </title>
            <aug>
               <au>
                  <snm>Burns</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Temin</snm>
                  <fnm>HM</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>1994</pubdate>
            <volume>68</volume>
            <fpage>4196</fpage>
            <lpage>4203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">236342</pubid>
                  <pubid idtype="pmpid">7515970</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The paleontology of intergene retrotransposons of maize.</p>
            </title>
            <aug>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1998</pubdate>
            <volume>20</volume>
            <fpage>43</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/1695</pubid>
                  <pubid idtype="pmpid" link="fulltext">9731528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Survey of transposable elements from rice genomic sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Turcotte</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2001</pubdate>
            <volume>25</volume>
            <fpage>169</fpage>
            <lpage>180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.2001.00945.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11169193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The distribution and copy number of <it>copia</it>-like retrotransposons in rice (<it>Oryza sativa</it> L.) and their implications in the organization and evolution of the rice genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>QF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>6824</fpage>
            <lpage>6828</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">22000</pubid>
                  <pubid idtype="pmpid" link="fulltext">10359797</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.12.6824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Rice transposable elements: a survey of 73,000 sequence-tagged-connectors.</p>
            </title>
            <aug>
               <au>
                  <snm>Mao</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Budiman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Woo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sasinowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Presting</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Frisch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Goff</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>982</fpage>
            <lpage>990</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.10.7.982</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The complete sequence of 340 kb of DNA around the rice <it>Adh1-Adh2</it> region reveals interrupted co-linearity with maize chromosome 4.</p>
            </title>
            <aug>
               <au>
                  <snm>Tarchini</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Biddle</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wineland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>381</fpage>
            <lpage>392</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139838</pubid>
                  <pubid idtype="pmpid" link="fulltext">10715324</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.3.381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The chromosomal distributions of Ty1-copia group retrotransposable elements in higher plants and their implications for genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Heslop-Harrison</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Brandes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Taketa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vershinin</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Alkhimova</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Karum</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Doudrick</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Scwarzacher</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Katsiotis</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genetica</source>
            <pubdate>1997</pubdate>
            <volume>100</volume>
            <fpage>197</fpage>
            <lpage>204</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1018337831039</pubid>
                  <pubid idtype="pmpid">9440273</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Retrotransposons, endogenous retroviruses and the evolution of retroviruses.</p>
            </title>
            <aug>
               <au>
                  <snm>Boeke</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Stoye</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>In Retroviruses,</source>
            <publisher>Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press;</publisher>
            <editor>Coffin J, Hughes S, Varmus H</editor>
            <pubdate>1997</pubdate>
            <fpage>343</fpage>
            <lpage>435</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Tempo and mode of evolution in <it>Saccharomyces cerevisiae</it> genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Jordan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Mc