<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-5-r49</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Origin of nascent lineages and the mechanisms used to prime second-strand DNA synthesis in the R1 and R2 retrotransposons of <it>Drosophila</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Stage</snm>
               <mi>E</mi>
               <fnm>Deborah</fnm>
               <insr iid="I1"/>
               <email>dstage@mail.rochester.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Eickbush</snm>
               <mi>H</mi>
               <fnm>Thomas</fnm>
               <insr iid="I1"/>
               <email>eick@mail.rochester.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Biology Department, University of Rochester, 213 Hutchison, Rochester NY, 14627-0211, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>5</issue>
         <fpage>R49</fpage>
         <url>http://genomebiology.com/2009/10/5/R49</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19416522</pubid>
               <pubid idtype="doi">10.1186/gb-2009-10-5-r49</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>27</day>
               <month>3</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>5</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>5</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Stage and Eickbush; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Evolution and integration of R1 and R2 retrotransposons</p>
      </shorttitle>
      <shortabs>
         <p>Comparative analysis of 12 Drosophila genomes reveals insights into the evolution and mechanism of integration of R1 and R2 retrotransposons.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Most arthropods contain R1 and R2 retrotransposons that specifically insert into the 28S rRNA genes. Here, the sequencing reads from 12 <it>Drosophila </it>genomes have been used to address two questions concerning these elements. First, to what extent is the evolution of these elements subject to the concerted evolution process that is responsible for sequence homogeneity among the different copies of rRNA genes? Second, how precise are the target DNA cleavages and priming of DNA synthesis used by these elements?</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Most copies of R1 and R2 in each species were found to exhibit less than 0.2% sequence divergence. However, in many species evidence was obtained for the formation of distinct sublineages of elements, particularly in the case of R1. Analysis of the hundreds of R1 and R2 junctions with the 28S gene revealed that cleavage of the first DNA strand was precise both in location and the priming of reverse transcription. Cleavage of the second DNA strand was less precise within a species, differed between species, and gave rise to variable priming mechanisms for second strand synthesis.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>These findings suggest that the high sequence identity amongst R1 and R2 copies is because all copies are relatively new. However, each active element generates its own independent lineage that can eventually populate the locus. Independent lineages occur more often with R1, possibly because these elements contain their own promoter. Finally, both R1 and R2 use imprecise, rapidly evolving mechanisms to cleave the second strand and prime second strand synthesis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transposable elements (TEs) are ubiquitous components and extensive manipulators of eukaryotic genomes. Because TEs constitute a significant mutation source and their remnants often comprise the majority of genomes, they are usually regarded as genomic parasites that are occasionally co-opted for host benefits <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. While tracing the evolution of any genome should include a description of the natural history of its transposable elements, the diversity of TEs and their histories are so extensive that even with the advent of genome sequencing and assembly it remains challenging to follow the interplay between TEs and their host.</p>
         <p>The rRNA genes provide a microcosm within the genome that is amenable to a detailed description of the interactions between TEs and their host. In eukaryotes these genes are organized into one or more loci, the rDNA loci, containing hundreds to thousands of copies of the 18S, 5.8S and 28S genes (Figure <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. A number of TEs specifically insert into the 28S genes of different animals <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The most extensively studied of these elements are the non-long terminal repeat (non-LTR) retrotransposable elements R1 and R2 of arthropods <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. These two elements appear to have been inserting in the 28S genes of most arthropods since the origin of this phylum <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. R2 elements have also been identified in a variety of other animal lineages <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The retrotransposition mechanism of R2 elements has been studied in detail <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The current model for their integration, called target primed reverse transcription (TPRT), has four basic steps: first, the bottom DNA strand of the target site is cleaved; second, the released 3' hydroxyl is used to prime cDNA synthesis by the element's reverse transcriptase; third, the top DNA strand is cleaved; and fourth, the released 3' hydroxyl is used to prime second-strand DNA synthesis <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. This basic mechanism is likely used by R1 <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> and most other non-LTR retrotransposons <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>The rDNA loci of <it>Drosophila </it>species</p>
            </caption>
            <text>
               <p>The rDNA loci of <it>Drosophila </it>species. Each rDNA transcription unit (diagramed in detail) consists of the 18S, 5.8S, 2S and 28S genes, the external transcribed spacer (ETS) and internal transcribed spacers (ITS1, ITS2a and ITS2). The location of the R1 and R2 insertion sites are indicated with arrowheads. Transcription units are separated by an internally repetitive intergenic spacer (IGS). The rDNA loci are usually, but not always, located on the X and Y chromosomes and typically contain hundreds of copies of the rDNA unit arranged in tandem arrays.</p>
            </text>
            <graphic file="gb-2009-10-5-r49-1"/>
         </fig>
         <p>Evolution of the rDNA locus is known to be dominated by concerted evolution, a recombinational process involving unequal crossovers and gene conversions that maintain near identity among repeats within a species while allowing those repeats to diverge between species <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Abundant evidence corroborates the extremely low sequence variation present among the many copies of the rDNA unit <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Sequence variants present at the lowest frequencies are equally distributed between the coding and non-coding regions of the unit. In contrast, the rare variants present at higher frequencies are greatly enriched in non-coding regions, indicating that selective pressures guide the extent of standing variation within the locus <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>In arthropods from a few percent to over 50% of the rDNA units are inserted by R1 or R2 elements <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and those units are thus prevented from producing functional 28S rRNA <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Within a species these many copies of R1 and R2 elements also exhibit low levels of sequence variation <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Surprisingly, divergent lineages of R1 or R2 are frequently found in a species, which cannot be explained by horizontal transfers between species <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. This suggests that divergent lineages of elements must be able to form within a species.</p>
         <p>The rDNA locus is not assembled as part of genome projects because of the highly repetitive nature of the rDNA locus. Thus, in this report we used the original sequencing reads generated from the 12 <it>Drosophila </it>genomes project <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> to address specific questions concerning the evolution and mechanism of integration of R1 and R2 elements. Can different lineages of R1 and R2 arise within a species despite concerted evolution maintaining sequence homogeneity among the rRNA genes? What is the location of second-strand DNA cleavage? How is this site used to prime second-strand synthesis in the retrotransposition reaction?</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>The phylogenetic relationships among the 12 <it>Drosophila </it>species used in this report are shown in Figure <figr fid="F2">2a</figr>. This phylogeny, based on the complete sequences of the18S and 28S genes, is consistent with the species relationships obtained with many other gene sequences <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In eight of the <it>Drosophila </it>species a complete R2 element could be assembled (Figure <figr fid="F2">2b</figr>; Additional data files 1, 2, 3, 4, 5, 6, 7 and 8). The structure of these elements conformed to previously identified R2 elements <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and dN/dS analysis indicated that the assembled R2 elements had undergone purifying selection (mean dN/dS = 0.24 with a standard deviation of 0.321). In a ninth species, <it>D. mojavensis</it>, R2 sequences were identified but too few copies existed to assemble a complete sequence. R2 elements have been previously documented in several species groups of the <it>Drosophila </it>subgenus <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>; however, our failure to detect R2 sequences in <it>D. virilis </it>and <it>D. grimshawi </it>suggests R2 elements are frequently lost from this subgenus. The only example of R2 loss in the Sophophora subgenus, <it>D. erecta</it>, had been previously noted <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Phylogenetic relationships among the 12 sequenced <it>Drosophila </it>species and structures of R1 and R2 elements</p>
            </caption>
            <text>
               <p>Phylogenetic relationships among the 12 sequenced <it>Drosophila </it>species and structures of R1 and R2 elements. <b>(a) </b>Phylogenetic relationships of the species based on maximum likelihood trees of their consensus 18S and 28S rRNA gene sequences. <b>(b) </b>Structures of the R1 and R2 elements found in each species. The 'A' and 'B' designations refer to the two divergent R1 lineages that are present among <it>Drosophila </it>species <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Filled rectangles correspond to the 5' and 3' untranslated regions (UTRs). Open rectangles correspond to the open reading frames (ORFs). R1 elements have two overlapping ORFs in different frames. <it>D. mojavensis </it>contains R2 elements but a complete sequence could not be assembled. No trace of R2 elements could be identified in <it>D. erecta</it>, <it>D. virilis </it>and <it>D. grimshawi</it>.</p>
            </text>
            <graphic file="gb-2009-10-5-r49-2"/>
         </fig>
         <p>We also searched in all species for R2 copies that might be present outside the rDNA locus. We found no extra-rDNA R2 copies in <it>D. melanogaster</it>, as previously reported <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, or in <it>D. ananassae </it>or <it>D. persimilis</it>. <it>D. pseudoobscura</it>, <it>D. sechellia</it>, <it>D. simulans</it>, <it>D. willistoni</it>, and <it>D. yakuba </it>each had R2 copies not inserted in a 28S gene. These copies were frequently incomplete and all contained sequences that were from 1% to 2% divergent from those R2 copies within the rDNA locus. Thus, these non-rDNA copies of R2 could not have given rise to the current populations of R2 insertions in the rDNA locus. Finally, in <it>D. simulans </it>a fusion of the 5' end of an R1 element with the 3' end of an R2 element was identified as a tandem array outside the rDNA locus.</p>
         <p>Complete R1 elements were assembled in all 12 sequenced genomes (Figure <figr fid="F2">2b</figr>; Additional data files 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 and 21). The coding capacity of all R1 ORFs was consistent with previously characterized R1 elements <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. A test of selection by dN/dS analysis indicated that the assembled R1 elements had undergone purifying selection (R1A, mean dN/dS = 0.30 with standard deviation of 0.376; R1B, mean dN/dS = 0.27 with standard deviation of 0.348). Previous analyses of R1 elements in <it>Drosophila </it>have suggested there are two distinct lineages of elements, A and B, that separated well before the origin of this genus and are differentially retained in the various species lineages <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Eleven of the sequenced <it>Drosophila </it>species contained a single R1 family of either the R1A or R1B lineage, while <it>D. ananassae </it>contained both lineages (Figure <figr fid="F2">2b</figr>). The only consistent difference in structure between the two lineages was that the two ORFs in the R1A lineage overlapped by 7 bp with a corresponding frame shift of -2, while the ORFs in the R1B lineage had a frameshift of -1 and overlapped from 14 bp in <it>D. ananassae </it>to 59 bp in <it>D. grimshawi</it>. As will be described below, in most species multiple examples were also identified of R1 insertions in non-28S gene locations.</p>
         <sec>
            <st>
               <p>R1 and R2 intraspecies sequence variation</p>
            </st>
            <p>The average levels of sequence variation among the elements within each species are shown in Table <tblr tid="T1">1</tblr>. Because R1 insertions were found in genomic locations outside the 28S gene, we focused our analysis on the first and last 400 bp of each element and 100 bp of their flanking sequence to insure that all sequences were derived from copies located in the 28S rRNA genes. Except in the specific examples described below, the R1 and R2 elements in each species were extremely uniform, averaging less than 0.2% divergence from the consensus sequence. Because R2 elements are seldom present outside the locus, we also monitored nucleotide variation within internal regions of R2 elements in some species. Sequence divergence for central, coding regions of R2 were estimated at less than 0.1%, similar to or slightly lower than the 5' and 3' untranslated regions (UTRs; not shown).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Variation in the 5' and 3' ends of R1 and R2 elements</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Major copy type: mean divergence* (maximum)</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Atypical sequences: number (divergence)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>5' end</p>
                     </c>
                     <c ca="center">
                        <p>3' end</p>
                     </c>
                     <c ca="center">
                        <p>Variant copies<sup>&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>Variant 5' ends<sup>&#8225;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>R1 elements</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. simulans </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.003)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>4 (0.01-0.03)</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. sechellia </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>&lt;0.001 (0.003)</p>
                     </c>
                     <c ca="center">
                        <p>&lt;0.001 (0.002)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. melanogaster </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.008)</p>
                     </c>
                     <c ca="center">
                        <p>0.003 (0.015)</p>
                     </c>
                     <c ca="center">
                        <p>2 (0.07)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. yakuba </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.005)</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0.02)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. erecta </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.005)</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.007)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. ananassae </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.013 (0.043)</p>
                     </c>
                     <c ca="center">
                        <p>&lt;0.001 (0.003)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1 (0.08-0.11)</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. ananassae </it>R1B</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.005)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>3 (0.15-0.22)</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. pseudoobscura </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.010)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1 (0.04)</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. persimilis </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0.02)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. willistoni </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.005)</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.003)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. mojavensis </it>R1A</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. virilis </it>R1B</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.008)</p>
                     </c>
                     <c ca="center">
                        <p>&lt;0.001 (0.003)</p>
                     </c>
                     <c ca="center">
                        <p>6 (0.01-0.05)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. grimshawi </it>R1B<sub>1</sub></p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.005)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. grimshawi </it>R1B<sub>2</sub></p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>R2 elements</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. simulans </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.008)</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. sechellia </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>0.000 (0.000)</p>
                     </c>
                     <c ca="center">
                        <p>&lt;0.001 (0.003)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. melanogaster </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.005)</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.015)</p>
                     </c>
                     <c ca="center">
                        <p>6 (0.02-0.05)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. yakuba </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.013)</p>
                     </c>
                     <c ca="center">
                        <p>0.006 (0.018)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. ananassae </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>0.004 (0.008)</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.005)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. persimilis </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. pseudoobscura </it>R2</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>0.005 (0.010)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. willistoni </it>R2 sub1</p>
                     </c>
                     <c ca="center">
                        <p>0.001 (0.003)</p>
                     </c>
                     <c ca="center">
                        <p>0.002 (0.008)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>D. willistoni </it>R2 sub2<sup>&#167;</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.003 (0.006)</p>
                     </c>
                     <c ca="center">
                        <p>0.015 (0.028)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Average nucleotide divergence from the consensus. The maximum divergence observed is shown in parentheses. <sup>&#8224;</sup>Divergence detected at both the 5' and 3' ends except for the insertion in <it>D. yakuba</it>, where divergence was detected only at the 3' end. <sup>&#8225;</sup>The number of distinct 5' ends excludes the majority (consensus) sequence used in the previous columns. The divergence of additional 5' ends is calculated from the consensus. <sup>&#167;</sup>There may be only two copies of this lineage. ND indicates that these numbers were not included because sequencing reads with variant sequences had poor quality scores.</p>
               </tblfn>
            </tbl>
            <p>In Figure <figr fid="F3">3</figr> the level of nucleotide variation for the 5' and 3' ends of R1 and R2 shown in Table <tblr tid="T1">1</tblr> are compared to the levels of nucleotide variation previously found in the 28S genes and internal transcribed spacer (ITS)1 regions of the rDNA units <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The levels of variation present in R1 and R2 were much higher than that of the 28S gene, and similar to that of the ITS1 region. We have previously shown that the level of nucleotide variation for different regions of the rDNA unit was proportional to the rate at which each region diverged between species <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. This correlation is expected if all regions of the transcribed rDNA unit undergo similar levels of concerted evolution, because increased selective constraints on a sequence removes more variants that arise by mutation, which in turn enables fewer neutral variants to become fixed in all rDNA units (diverge over time). Also shown in Figure <figr fid="F3">3</figr> (gray bars) are the nucleotide divergence rates of R1 and R2 compared to those for the 28S gene and the ITS1 region. These divergence rates were determined by comparing the consensus sequences of each region from <it>D. melanogaster</it>, <it>D. sechellia</it>, <it>D. simulans </it>and <it>D. yakuba</it>. The relationship between the levels of variation within a species and divergence rates between species that was observed for regions of the rDNA unit was not observed for the R1 and R2 sequences. For example, the 5' end of R1 evolved at four times the rate of the R1 3' end, yet had similar levels of nucleotide variation. The 5' and 3' ends of R2 evolved at one-half the rate of the ITS1 sequences, yet had two to four times the level of nucleotide variation within a species. In this latter example, the slower rate of divergence suggests that the R2 sequences are under greater selective pressure than the ITS1 sequences. Therefore, the finding that the R2 sequences have greater levels of variation suggests that they are not undergoing concerted evolution as effectively as the ITS1 sequences.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Average level of within-species sequence variation for R1 and R2</p>
               </caption>
               <text>
                  <p>Average level of within-species sequence variation for R1 and R2. Sequence variation in the 400 bp at the 5' and 3' ends of the R1 and R2 elements from the 12 <it>Drosophila </it>species, the entire 28S rRNA gene and the internal transcribed spacer (ITS)1 region are shown (black bars). All values are calculated as the divergence from the consensus sequence for the species. Grey bars indicate the rates of nucleotide divergence (percent divergence per million years (myr)) of these same regions. Standard deviations are given for all values. The high standard deviation of the R1 5' end is a result of several species with no variation. The divergence data estimates were derived from comparison of the consensus sequences from <it>D. simulans</it>, <it>D. sechellia</it>, <it>D. melanogaster </it>and <it>D. yakuba </it>(divergence times: <it>simulans </it>versus <it>sechellia</it>, 0.25 myr; <it>simulans </it>or <it>sechellia </it>versus <it>melanogaster</it>, 3 myr; <it>simulans</it>, <it>sechellia </it>or <it>melanogaster </it>versus <it>yakuba</it>, 8 myr). Nucleotide variation data for 28S and ITS1 regions are derived from Supplemental Table 2 of <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2009-10-5-r49-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Nascent subfamilies of R1 and R2</p>
            </st>
            <p>In addition to the many highly uniform copies of R1 and R2, five <it>Drosophila </it>species had one or more copies of R1 or R2 with nucleotide divergence of 1% to 7% from the consensus, clearly outside the range of divergences seen for the remaining R1 and R2 copies. The number and level of divergence of these atypical copies are listed in Table <tblr tid="T1">1</tblr>. Among these copies two had premature stop codons, indicating that they were inactive, while the remaining copies appeared to have intact ORFs. Because most divergent copies were not inserted into 28S genes that were also divergent, these R1 and R2 copies could represent distinct retrotranspositionally competent lineages of elements. However, the number of trace reads suggested these divergent R1s and R2s were at single copy levels, and thus it was likely that they had not recently been active.</p>
            <p>Stronger evidence for the formation of nascent subfamilies was found in the examples of distinct 5' ends for the R1 elements of three species ('Variant 5' ends' column in Table <tblr tid="T1">1</tblr>). In <it>D. simulans </it>there were five distinct sequence classes of R1 5' ends, with each class representing from 11% to 26% of the total number of copies. There was no nucleotide divergence within each class, while divergence between classes ranged from 1% to 3%. In the case of <it>D. pseudoobscura </it>there were two distinct 5' ends. One-third of the R1 copies had 5' ends with over 4% nucleotide divergence from the remaining two-thirds of the R1 copies. Finally, R1 elements in <it>D. ananassae </it>showed the greatest tendency to diverge into subclasses with distinct 5' ends. The R1A elements could be separated into two classes that differed by 10% in nucleotide sequence, while the R1B elements could be separated into four classes that differed by 15% to 22% in sequence.</p>
            <p>The separate lineages of the R1 5' ends observed in these three species were not apparent at the 3' ends of the R1 elements (that is, there was one class of 3' ends with mean levels of divergence less than 0.2%). Previous authors have suggested that new sublineages of transposable elements can arise within the same species by the acquisition of new promoter sequences <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Thus, one possibility is that the different 5' ends of R1 elements in a species correspond to rapidly evolving promoter sequences. R1 elements have been suggested to contain their own promoters because in some insect lineages R1 inserts in the opposite orientation in the 28S gene, or even outside the rDNA locus <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B31">31</abbr></abbrgrp>. R2 elements on the other hand appear to be co-transcribed with the 28S gene and thus do not have their own promoter <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. No evidence of divergent 5' ends was found for the R2 elements of any <it>Drosophila </it>species.</p>
            <p>Finally, Figure <figr fid="F4">4</figr> summarizes two examples where the formation of nascent families involves sequence divergence of the entire R1 and R2 elements. In <it>D. grimshawi </it>two equally represented groups of R1B elements were detected that had 21% nucleotide divergence at their 5' ends and lower levels in other regions of the element (Figure <figr fid="F4">4a</figr>; Additional data files 12 and 22). The level of divergence for most regions of the two families was less than the divergence between the R1 elements of <it>D. melanogaster </it>and <it>D. simulans</it>, also shown in Figure <figr fid="F4">4a</figr>, suggesting the two R1B subfamilies in <it>D. grimshawi </it>are not as old as the estimated 3 million year separation between <it>D. melanogaster </it>and <it>D. simulans </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The 5' ends of the subfamilies have undergone accelerated rates of divergence, similar to that described for the different 5' ends of R1 elements in <it>D. ananassae</it>, <it>D. simulans </it>and <it>D. pseudoobscura </it>(Table <tblr tid="T1">1</tblr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Summary of the nascent lineages of R1 and R2 in two species</p>
               </caption>
               <text>
                  <p>Summary of the nascent lineages of R1 and R2 in two species. In both panels the elements are diagramed as in Figure 2b. Values below the element diagrams are nucleotide divergences, while values above the diagrams are amino acid (aa) divergences. <b>(a) </b>Nascent lineages of R1 elements in <it>D. grimshawi</it>. For comparison, the relative level of sequence divergence between the R1 elements of <it>D. melanogaster </it>and <it>D. simulans </it>are also shown. The estimated time of separation of these two species is 3 million years <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. <b>(b) </b>Nascent lineages of R2 elements in <it>D. willistoni</it>. The divergence between the R2 elements of <it>D. melanogaster </it>and <it>D. simulans </it>are shown.</p>
               </text>
               <graphic file="gb-2009-10-5-r49-4"/>
            </fig>
            <p>A second example of the presence of subfamilies within a species was found for the R2 elements in <it>D. willistoni</it>. In this case one subfamily, R2.1, was highly abundant while the R2.2 subfamily (Additional data file 23) was present in only a few copies. As shown in Figure <figr fid="F4">4b</figr> the subfamilies have diverged by 4.7% in their 5' UTRs and 3.6% in their 3' UTRs, similar to the divergence between the R2 elements of <it>D. melanogaster </it>and <it>D. simulans</it>. The amino acid divergence of the ORF from the two subfamilies (2.6%) was also similar to the divergence between <it>D. simulans </it>and <it>D. melanogaster </it>(2.4%), suggesting the divergence time between the <it>D. willistoni </it>subfamilies is similar to the time of divergence of <it>D. melanogaster </it>and <it>D. simulans</it>.</p>
            <p>Unequal crossover events occurring within R1 (or R2) elements would homogenize their sequences, thus preventing the separation of two distinct lineages. Because the nucleotide divergence for most of the region encoding ORF2 of the R1B.1 and R1B.2 elements in <it>D. grimshawi </it>was less than 1%, and thus could still undergo recombination, we looked for evidence of such events. Blast searches were conducted using a query from the end of each subfamily. We then examined the sequence trace from the other end of each approximately 3.5 kb plasmid to determine whether it contained sequence from the same or opposite subfamily. Of 115 informative plasmid ends examined, only one pair indicated recombination between the subfamilies. This paucity of recombination can explain how these nascent subfamilies are able to avoid concerted evolution and remain independent lineages.</p>
         </sec>
         <sec>
            <st>
               <p>Mechanism of R2 retrotransposition</p>
            </st>
            <sec>
               <st>
                  <p>Analysis of R2 junctions</p>
               </st>
               <p>As shown in Figure <figr fid="F5">5a</figr>, when viewed from their 3' junctions with the 28S gene, all R2 copies present in the sequenced <it>Drosophila </it>genomes were inserted into the same site as previously characterized R2 elements in all animals <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B35">35</abbr></abbrgrp>. This location corresponds to the site of bottom strand DNA cleavage by the R2 endonuclease from <it>Bombyx mori </it>(Figure <figr fid="F5">5c</figr>). This cleavage site serves as the primer for reverse transcription of the element RNA <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. For about 1% of the R2 insertions identified in the <it>Drosophila </it>genomes, bottom-strand cleavage appears to have occurred 1 or 2 bp downstream of this usual site. The uncertainty in cleavage location is because the second nucleotide downstream of the typical cleavage site is an 'A' and all <it>Drosophila </it>R2s end in a variable length poly-A tail (Figure <figr fid="F5">5a</figr>).</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Junction sequences of the R2 elements with the 28S gene</p>
                  </caption>
                  <text>
                     <p>Junction sequences of the R2 elements with the 28S gene. <b>(a) </b>3' junction sequences. All <it>Drosophila </it>R2 elements contain 3' poly(A) tails. Most R2 insertions are consistent with the location of the R2 DNA cleavage sites on the bottom strand (see panel (c)) and its use in priming reverse transcription. <b>(b) </b>Representative examples of the 5' junctions of R2 elements with the 28S gene. All full-length examples are from <it>D. melanogaster</it>. R2 sequences are boxed, 28S sequences are in bold, non-templated sequences are in plain text, and duplications of 28S sequences are underlined. Boxed residues shaded grey correspond to microhomologies: sequences that could correspond to either the 28S sequence or the R2 element. <b>(c) </b>Location of the probable cleavage sites on the 28S gene. Arrows show cleavage locations determined <it>in vitro </it>for the R2 endonuclease from <it>B. mori </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>; the arrow head topped by '0' shows the location of the top-strand cleavage site inferred after analysis of the <it>Drosophila </it>R2 5' junctions. The bottom diagram shows a hypothetical intermediate in the integration reaction after first-strand synthesis (boxed nucleotides) and second strand cleavage. The terminal two nucleotides of the cDNA are proposed to anneal to the top strand of the cleaved target site. This microhomology allows precise priming of second-strand DNA synthesis and the generation of the precise junctions seen in example a in panel (b).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-5"/>
               </fig>
               <p><it>In vitro </it>studies with the <it>B. mori </it>R2 endonuclease suggested the location of top-strand cleavage occurred 2 bp upstream of the bottom-strand site (Figure <figr fid="F5">5c</figr>) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Previous analyses of a few R2 5' junctions from each of several <it>Drosophila </it>species, as well as other insect species, were interpreted in a manner that was consistent with such a cleavage <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. However, there is significant variation at the 5' junctions of R2 elements, and the comprehensive analysis of this variation made possible using the genomic sequences suggested a reevaluation of this second-strand cleavage location was needed.</p>
               <p>Similar to most non-LTR retrotransposons, R2 insertions can be full-length or contain truncations of their 5' ends. The 5' truncations have been suggested to be due to the failure of the reverse transcriptase to copy the entire RNA template, degradation of the RNA template, or the initiation of second-strand synthesis before reverse transcription is completed <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Figure <figr fid="F5">5b</figr> shows representative examples of full-length and 5'-truncated R2 elements. All full-length examples are from <it>D. melanogaster </it>but are representative of the R2 elements observed in all <it>Drosophila </it>species. Almost two-thirds of the full-length insertions have 5' junctions that include the 28S sequence to the position across from the site of bottom-strand cleavage, position '0' (examples a, d and e), with the remaining third containing variable deletions of the upstream 28S sequence (examples b and c). Many junctions contain additional bases at the junction. In some cases the additional bases represented duplications of the 28S gene (example e), while for other junctions the origin of the additional bases could not be identified, here called non-templated bases (example d). In the case of the 5' truncated elements most junctions contain from one to five bases at the precise 28S/R2 junction that may be assigned to either 28S or R2 (examples of these microhomologies are indicated by the shaded bases in Figure <figr fid="F5">5</figr>). R2 5' truncated junctions can also be associated with deletions of upstream 28S sequences (examples g and h), and non-templated additions (example i).</p>
               <p>Figure <figr fid="F6">6</figr> is an attempt to summarize the 5' junction data for the R2 copies in several species. Plotted in these figures is the last contiguous nucleotide of the 28S gene found for each R2 insertion. For most full-length copies the last 28S nucleotide corresponded to the position opposite the bottom-strand cleavage site (Figures <figr fid="F5">5c</figr> and <figr fid="F6">6a</figr>). In the case of the 5' truncated R2 elements (Figure <figr fid="F6">6b</figr>), more copies are associated with deletions of 28S sequences, but again the most frequent final base of the 28S gene is opposite bottom-strand cleavage.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Probable top-strand cleavage sites for the R2 element insertions</p>
                  </caption>
                  <text>
                     <p>Probable top-strand cleavage sites for the R2 element insertions. Dots indicate for all R2 elements the last nucleotide at the 5' junction that corresponds to the upstream 28S sequence. In instances where multiple copies of an element within a species had identical junctions, the number of genomic copies was estimated by dividing the number of traces by the fold coverage of the genome sequencing project. Arrows show the location of the insertion site (bottom-strand cleavage used for TPRT). The data were obtained from the following species: <it>D. ananassae</it>, <it>D. melanogaster</it>, <it>D. pseudoobscura</it>, <it>D. sechellia</it>, <it>D. simulans </it>and <it>D. yakuba</it>. <b>(a) </b>Full-length R2 elements. <b>(b) </b>5' truncated R2 elements.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-6"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>R2 retrotransposition model</p>
               </st>
               <p>This analysis of the 5' junctions of R2 insertions supports the following additions to the TPRT model for R2 retrotransposition. First, the most frequently used top-strand cleavage site in <it>Drosophila </it>is directly opposite the bottom-strand site, rather than 2 bp upstream (position -2) as suggested from the <it>in vitro </it>studies with the R2 endonuclease from <it>B. mori </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Cleavage opposite the bottom-strand site readily explains junctions such as examples d, e and i in Figure <figr fid="F5">5b</figr>, junctions difficult to explain if top-strand cleavage was at position -2. Second, we suggest that full-length R2 RNA transcripts in <it>Drosophila </it>contain G residues at their 5' end. All integrated full-length R2 copies in <it>D. melanogaster </it>begin with four G residues (examples a to e in Figure <figr fid="F5">5b</figr>), and similar analysis of the other <it>Drosophila </it>species indicated that the full-length R2 elements in these species contained at least two terminal G residues (data not shown). No conservation of R2 5' end sequences is found beyond these two Gs.</p>
               <p>To explain the most frequently observed R2 full length junctions (example a in Figure <figr fid="F5">5b</figr>), we suggest that two terminal C residues on the cDNA strand made from the R2 transcript anneal to the cleaved target site (see Figure <figr fid="F5">5c</figr> for a diagram). A tendency to anneal a few nucleotides of the cDNA strand to the top strand of the target DNA before initiating second-strand DNA synthesis would explain the frequent 'microhomologies' between internal R2 sequences and upstream 28S sequences that are found associated with 5' truncated R2 insertions. The non-templated nucleotides found at the 5' junctions of full-length and truncated copies of R2 are suggested to result when the annealing of microhomologies does not occur. <it>In vitro </it>studies have shown that the R2 polymerase adds non-templated nucleotides before initiating from a primer that is not annealed to the template <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, as well as when it 'runs-off' the end of a template <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. These non-templated nucleotides can be of any sequence and thus could also lead to microhomologies used to initiate polymerization of the top DNA strand. Microhomologies between these non-templated nucleotides and the 28S gene would go undetected by our analysis. Finally, the R2 junctions with deletions of the 28S gene, as well as the few with duplications of the 28S gene, could represent top-strand cleavages outside the preferred site.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Mechanism of R1 retrotransposition</p>
            </st>
            <sec>
               <st>
                  <p>R1 junction sequences</p>
               </st>
               <p>Based on their 3' junctions with the 28S gene, all R1 elements within the 28S gene are located 60 bp downstream of the R2 insertion site. <it>In vitro </it>studies with the <it>B. mori </it>R1 endonuclease suggest this site corresponds to the location of the bottom-strand DNA cleavage site used to prime a TPRT reaction <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. As in the case of R2, the 5' ends of both full-length and truncated R1 copies showed significant variation (Figure <figr fid="F7">7</figr>).</p>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>Full length and 5' truncated junctions of the <it>Drosophila </it>R1 insertions</p>
                  </caption>
                  <text>
                     <p>Full length and 5' truncated junctions of the <it>Drosophila </it>R1 insertions. Shown at the top is the sequence of the 28S gene insertion site. Various regions of the sequence have been indicated with colors to allow the 5' and 3' junctions of the R1 insertions to be summarized. Position +1 corresponds to the position of bottom-strand cleavage based on the 3' junctions of all R1 elements as well as from <it>in vitro </it>studies of the R1 endonuclease <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Positions -9 and +14 correspond to the inferred most frequent sites of top-strand cleavage. Shown at the bottom are diagrams of the 5' and 3' junctions of R1 insertions. Full-length as well as 5' truncated insertions of the ancestral type R1s have 14 bp target site duplications (left side). The bracketed region of the junctions exhibited sequence variation. This variation can correspond to non-templated nucleotides (sequences corresponding to neither the 28S gene nor the R1 element), or microhomologies (1 to 5 nucleotides that could correspond to either the 28S gene or the R1 element). Melanogaster group R1s have three classes of junctions (right side): full length insertions with a precise 9 bp target site deletion; full length insertions with an insertion site rearrangement (ISR); and 5' truncated insertions. Sequence variation at these junctions is limited to the bracketed region and corresponds to the variation seen in the ancestral type R1s. The tables at the bottom show the fraction of copies observed at the bracketed site that are precise, contain non-templated nucleotides (extra nt) or microhomologies (micro).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-7"/>
               </fig>
               <p>Based on their 5' junctions the R1 elements could be divided into two groups. For R1B elements and the R1A elements outside the melanogaster species group, the upstream 28S gene sequences typically extended to a position 14 bp downstream of the bottom-strand site (+14), and thus a 14 bp target site duplication (TSD) flanks the R1 insertions (Figure <figr fid="F7">7</figr>, left side). Because similar length TSDs flank the R1 elements in most other arthropods <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>, this group is called the 'ancestral type' R1 insertions. All sequence variation associated with the 5' junctions of both full-length and 5' truncated elements of the ancestral type were located at the end of the TSD (bracketed region in Figure <figr fid="F7">7</figr>). As with the R2 elements, these 5' junctions could be classified as precise, containing microhomologies, or containing non-templated nucleotides. Most full-length ancestral type R1 insertions were precise, with the remainder containing non-templated nucleotides. The 5' truncated ancestral R1s are more broadly distributed between precise, non-templated and microhomology junctions.</p>
               <p>In contrast to the ancestral type R1s, many copies of the R1A elements in the melanogaster species group (<it>D. ananassae</it>, <it>D. erecta</it>, <it>D. melanogaster</it>, <it>D. sechellia</it>, <it>D. simulans</it>, and <it>D. yakuba</it>) contained upstream 28S gene sequences that extended only to position -9. The relative proportion varied from species to species, but altogether 75% of the full-length 'melanogaster-type' R1 insertions contained this 9 bp deletion. No 5' sequence variation was associated with these insertions. The remaining full-length insertions contained variable-length TSDs up to 17 bp in length and an unusual duplication of 28S sequences that we called insertion site rearrangements (ISRs). These duplications of the 28S sequence extended for 16 to 27 nucleotides upstream of position -9. Microhomologies and non-templated nucleotides were common at these junctions and were always located between the TSD and the ISR (bracketed region). The 5' truncated insertions of the melanogaster-type R1s also contained precise, non-templated and microhomology junctions.</p>
               <p>Figure <figr fid="F8">8</figr> is again a plot of the last contiguous nucleotide of the 28S gene found at the 5' end of the R1 elements in several species. In the case of the ancestral type R1s, the full-length and 5' truncated junctions are consistent with a top-strand cleavage at position +14. Most exceptions to this cleavage location were in <it>D. ananassae</it>, where R1B insertions made an 11 or 13 bp TSD, and in <it>D. willistoni </it>where some R1B insertions contained an 8 bp TSD. The probable locations of the top-strand cleavage for the melanogaster-type full-length R1 elements (Figure <figr fid="F8">8b</figr>) were clustered about two locations. Precise full-length insertions had top-strand cleavage at position -9. For the full-length ISR elements and the 5' truncated elements top-strand cleavages were less specific but clustered around position +14.</p>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>Probable top-strand cleavage sites for the R1 element insertions</p>
                  </caption>
                  <text>
                     <p>Probable top-strand cleavage sites for the R1 element insertions. Dots indicate for all R1 elements the last nucleotide at the 5' junction that corresponds to the upstream 28S sequence. In instances where multiple copies of an element within a species had identical junctions, the number of genomic copies was estimated by dividing the number of traces by the fold coverage of the genome sequencing project. Arrows show the location of the insertion site (bottom-strand cleavage used for TPRT). The different types of junctions diagrammed in Figure 7 are given different symbols. <b>(a) </b>Ancestral type R1 elements. Data derived from <it>D. ananassae A</it>, <it>D. mojavensis</it>, <it>D. pseudoobscua </it>and <it>D. willistoni</it>. <b>(b) </b>Melanogaster-type R1 elements. Data derived from <it>D. melanogaster</it>, <it>D. sechellia</it>, <it>D. simulans </it>and <it>D. yakuba</it>.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-8"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Tandem R1 arrays</p>
               </st>
               <p>In seven of the <it>Drosophila </it>species R1 elements were found organized as tandem arrays (Table <tblr tid="T2">2</tblr>). Evaluation of the sequencing reads at the other end of clones containing tandem R1s revealed that many, and perhaps all, of these tandem R1 elements were located within the rDNA loci. These tandem arrays were located at the normal R1 insertion site with the individual R1 copies separated by the 14 bp 28S gene sequence corresponding to the typical TSD. Such R1 tandem arrays have been previously described in <it>D. virilis </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Because R1 insertions were never found inserted upstream of R1 insertions without the TSD in these species, the mechanism of tandem R1 formation appears to be the insertion of additional R1 elements into the TSD present at the 5' end of R1 elements already inserted into a 28S gene. Consistent with this, melanogaster-type R1 elements have no or few tandem R1 insertions (Table <tblr tid="T2">2</tblr>, but see the legend). The highest levels of tandem insertions were in <it>D. pseudoobscura </it>and <it>D. ananassae </it>R1B where each R1-inserted rDNA unit contained an average of two to three R1 elements.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Ratio of R1 elements in the canonical 28S site, tandem arrays and non-28S locations</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="left">
                           <p>Species</p>
                        </c>
                        <c ca="center">
                           <p>Fraction of rDNA units</p>
                        </c>
                        <c ca="center">
                           <p>Tandem*</p>
                        </c>
                        <c ca="center">
                           <p>Non-28S*</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. simulans</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. sechellia</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.62</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. melanogaster</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.11</p>
                        </c>
                        <c ca="center">
                           <p>-<sup>&#8224;</sup></p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. yakuba</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.11</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. erecta</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.36</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>D. ananassae </it>R1A</p>
                        </c>
                        <c ca="center">
                           <p>0.17</p>
                        </c>
                        <c ca="center">
                           <p>++<sup>&#8225;</sup></p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>D. ananassae </it>R1B</p>
                        </c>
                        <c ca="center">
                           <p>0.13</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. pseudoobscura</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.40</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. persimilis</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.26</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. willistoni</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.10</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. mojavensis</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.68</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. virilis</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.75</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. grimshawi</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.20</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>*Plus signs indicate: +, less than 0.4 of those in 28S; ++, from 0.4 to 0.9 of those in 28S; +++, as many or more than those in the 28S; -, indicates absence. <sup>&#8224;</sup>Unlike the tandem R1s in the ancestral group, the sequence between the many tandem-like, consecutive R1 elements in <it>D. melanogaster </it>is typical of that found in an insertion site rearrangement (ISR) junction. Unlike the highly variable ISR junctions found in the locus, the ISRs between these tandem-like R1s are the same. Thus, these elements are likely the result of amplification of an original unique event. <sup>&#8225;</sup>All found upstream of R1B.</p>
                  </tblfn>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>R1 insertions into non-28S locations</p>
               </st>
               <p>In most <it>Drosophila </it>species copies of R1 were also identified that had inserted into sequences outside the 28S gene (Table <tblr tid="T2">2</tblr>). Frequent R1 insertions outside the 28S gene have also been reported in <it>B. mori </it><abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. The <it>Drosophila </it>species with the most abundant examples of non-28S R1 insertions were <it>D. ananassae </it>and <it>D. pseudoobscura</it>, the two species with the highest levels of tandem duplications. This may suggest that R1 insertions in these species are either less specific or retrotranspose more often. We identified 118 unique examples of non-28S gene R1 insertions that had intact 3' junctions, suggesting that they represented authentic retrotransposition events, not segments of R1 sequence that had been displaced by recombination to locations outside the rDNA locus. The insertion sites for these copies frequently corresponded to repeated DNA sequences, probably in the pericentromeric or telomeric regions of the genome. An exception was in <it>D. ananassae</it>, where a 22 bp region of the 28S gene corresponding to the R1 28S insertion site was found in the IGS region of the rDNA unit. One R1A element and seven R1B elements were found in rDNA units containing this unusual insertion of 28S sequences within the IGS.</p>
               <p>Because the insertions had occurred into repeated DNA sequences we were able to identify the likely target sequences for some of the R1 insertions. Figure <figr fid="F9">9</figr> shows examples in which we were able to identify either or both 5' and 3' ends of the insertions and their likely target sequences. In all cases where both junctions were recovered (Figure <figr fid="F9">9a</figr>), the target sites contained from 4 to 9 bp of sequence identity to the 28S target site centered near the lower strand cleavage site, and the insertions generated TSDs of from 3 to 14 bp.</p>
               <fig id="F9">
                  <title>
                     <p>Figure 9</p>
                  </title>
                  <caption>
                     <p>Junction sequences and the probable target sites of R1 elements inserted outside the 28S genes</p>
                  </caption>
                  <text>
                     <p>Junction sequences and the probable target sites of R1 elements inserted outside the 28S genes. The inferred uninserted target sequence is shown at the top of each example. Boxed sequences indicate R1, bold uppercase sequences are identical to 28S sequences, and bold lower case nucleotides are non-templated nucleotides. <b>(a) </b>Insertion sites in which both 5' and 3' junctions of the R1 element were recovered; 2 to14 bp target site duplications were created by the insertions and are delineated by vertical dotted lines. <b>(b) </b>Insertion sites in which only the 5' junction with R1 was recovered. <b>(c) </b>Insertion sites in which only the 3' junction with R1 was recovered. Junctions come from the following species: <it>D. pseudoobscura</it>, examples 1, 4 and 11; <it>D. mojavensis</it>, example 2; <it>D. persimilis</it>, examples 3 and 12; <it>D. sechellia</it>, examples 5, 8, 9 and 10; <it>D. yakuba</it>, example 6; <it>D. ananassae A</it>, example 7; <it>D. virilis</it>, example 13. M, melanogaster-type R1 elements; A, ancestral type R1 elements.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-9"/>
               </fig>
               <p>The 5' junctions of all melanogaster-type full length R1s outside the rDNA array contained upstream 28S gene sequences ending at position -9 and extending upstream for 7 to 28 bp (Figure <figr fid="F9">9</figr>, examples 5 to 10). The 5' ends of the ancestral type R1 insertions showed no evidence for the insertion of flanking 28S rRNA sequence (examples 1 to 4 and 11). The 3' junctions of both the ancestral and melanogaster-type R1s were similar, with most beginning at the precise 3' junction of the R1. However, some insertions showed the incorporation of up to 14 nucleotides of downstream 28S sequences (example 13) or the presence of non-templated nucleotides (example 12).</p>
            </sec>
            <sec>
               <st>
                  <p>R1 retotransposition model</p>
               </st>
               <p>The analysis of the R1 5' junctions with the 28S genes, as well as the insertions into non-28S gene locations, enables us to propose several steps involved in the R1 retrotransposition reaction. These suggestions are based on the standard TPRT mechanism of R2 and other non-LTR retrotransposons and the ability of their polymerases to add non-templated nucleotides at the ends of synthesized DNA strands. First, because no variation was detected within a species or between species for the location of the bottom-strand cleavage used to prime reverse transcription, like R2, this is the most conserved step in the retrotransposition reaction of R1. The analysis of R1 insertions outside the 28S gene site revealed that downstream 28S gene sequences are sometimes also inserted with the R1 element. In addition, many of these non-28S gene target sites contain bases corresponding to the 28S gene (examples 1 to 6 in Figure <figr fid="F9">9</figr>). Together these data suggest that the 3' end of the R1 transcript used for retrotransposition often contains flanking 28S sequences. These flanking 28S sequences may anneal to the bottom strand of the target site and thus account for why there is little sequence variation associated with the 3' junctions of R1.</p>
               <p>For all ancestral type R1s, top strand DNA cleavage is predominantly located 14 bp downstream of the bottom-strand cleavage. This top-strand cleavage location, among others, was also detected for the R1 endonuclease of <it>B. mori </it><abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The use of this cleavage to prime second strand synthesis results in 14 bp target site duplications (Figure <figr fid="F10">10</figr>, left side). The only deviation in this 'ancestral' cleavage location appeared in <it>D. ananassae </it>R1B elements, which had 11 and 13 bp TSDs, and in <it>D willistoni </it>R1B elements where several examples of elements with 8 bp TSDs were detected. The priming of second-strand synthesis appears less precise than first-strand synthesis, sometimes involving the addition of non-templated nucleotides. This suggests that the cDNA strand does not efficiently anneal to the cleaved top strand, suggesting few if any nucleotides of upstream 28S sequence are included at the 5' end of the R1 transcript. Consistent with this suggestion, no flanking 28S sequences were associated with the ancestral-type non-28S R1 insertions.</p>
               <fig id="F10">
                  <title>
                     <p>Figure 10</p>
                  </title>
                  <caption>
                     <p>R1 retrotransposition models based on the standard target primed reverse transcription reaction</p>
                  </caption>
                  <text>
                     <p>R1 retrotransposition models based on the standard target primed reverse transcription reaction. The uninserted 28S gene is shown at the top. The various regions upstream and downstream of the target site are colored as in Figure 7. Left side: ancestral type R1 transcripts (wavy line) do not contain upstream 28S gene sequences. Ancestral type R1s cleave the top DNA strand 14 bp downstream of the bottom cleavage site. Nucleotide variation at the 5' junctions corresponds to the imprecise nature by which the R1 polymerase uses the top strand of the DNA target to prime second-strand DNA synthesis. Right side: full length melanogaster-type R1 transcripts include 28S sequences starting upstream of position -9. Cleavage of the top strand occurs at one of two sites. If top-strand cleavage occurs 9 bp upstream of the bottom-strand site, then the upstream RNA sequences can anneal to the end of the cDNA strand, resulting in a precise 9 bp deletion of the target site. If top-strand cleavage occurs downstream of the bottom-strand site, then the annealing of cDNA to the target site is not possible, generating variation at the junction of the target site duplication.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r49-10"/>
               </fig>
               <p>R1A lineage elements in the melanogaster species group appear to have evolved changes in the TPRT reaction. In these R1s, top-strand cleavage frequently occurs 9 bp upstream of bottom-strand cleavage, resulting in a 9 bp target site deletion. Priming of second-strand synthesis from this site is highly precise, that is, non-templated sequences were never observed, suggesting 28S gene sequences are included at the 5' end of the RNA transcripts used for retrotransposition. This flanking 28S sequence allows the reverse transcribed strand to anneal to the target site (Figure <figr fid="F10">10</figr>, middle). Consistent with this suggestion, from 7 to 28 bp of flanking 28S sequences ending at position -9 are associated with the non-28S gene insertions of the melanogaster-type R1. The R1A elements in the melanogaster group also appear able to cleave the top DNA strand in a less specific manner around position +14 (Figure <figr fid="F8">8b</figr>). Such cleavages result in ISR junctions in which a typical TSD is present followed by 16 to 27 bp of the 28S gene ending at position -9 (Figure <figr fid="F10">10</figr>, right side). Considerable variation is detected in these junctions between the TSD and the duplicated upstream 28S sequences, presumably because this downstream cleavage eliminates the ability of cDNA sequences to anneal to the top strand in the initiation of second-strand synthesis. An intriguing aspect of these melanogaster-type R1 elements is that all 5' truncated insertions have TSDs (Figure <figr fid="F8">8</figr>), suggesting they can only arise from downstream cleavage of the top strand. One possibility is that the default cleavage site for the R1 endonuclease is downstream of the insertion site, but the 5' end of the full-length R1 transcripts acts as a signal directing the cleavage site to the position at -9.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <sec>
            <st>
               <p>Origin of nascent lineages</p>
            </st>
            <p>The availability of whole genome shotgun sequences has enabled us to evaluate the level of sequence variation of the R1 and R2 elements in 12 species of <it>Drosophila</it>. The level of nucleotide divergence for most copies was typically less than 0.2%, suggesting either the elements are subject to the same concerted evolution mechanisms that enable the rRNA genes themselves to remain nearly identical, or that the R1 and R2 elements are gained and lost rapidly from the locus (that is, all copies are recent insertions). All previous analysis suggested the latter explanation. Analyses of the 5' truncated copies of R1 and R2 in several species have suggested that these elements do turnover rapidly. Different animals from the same population were found to have different collections of 5' truncated copies <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp> and most 5' truncated elements within an animal had not undergone duplication by recombination <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Thus, the individual copies of R1 and R2 do not appear to remain in the rDNA locus for long enough periods to be substantially influenced by the recombinations leading to the concerted evolution of the locus.</p>
            <p>One puzzling aspect concerning the evolution of R1 and R2 was the presence of multiple families in some species, yet no evidence for the origin of these lineages by horizontal transfer <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B48">48</abbr></abbrgrp>. The rapid turnover of individual R1 and R2 elements suggests that each active copy can generate its own lineage, which over time should accumulate sequence variation. Thus, while separate lineages of R1 and R2 should be able to arise, the question remained as to whether they could be maintained within a species. In this report we have for the first time detected these nascent lineages of R1 and R2 elements within a species. Two distinct subfamilies of R1B elements were detected in <it>D. grimshawi </it>and two distinct subfamilies of R2 elements in <it>D. willistoni </it>(Figure <figr fid="F4">4</figr>). In addition to these distinct lineages, other species contained individual copies of R1 or R2 that had from 1% to 7% nucleotide divergence from the majority of elements in the species. Many of these divergent copies had intact ORFs, and thus were potentially active. Another frequent finding was the presence of distinct 5' UTRs for the R1 elements (Table <tblr tid="T1">1</tblr>). This was most widespread in <it>D. ananassae </it>where the R1A elements could be divided into two groups that diverged 10% in their 5' UTR sequences, while the R1B elements could be divided into four groups with 5' UTRs that diverged from 15% to 22%. We suggest this accelerated divergence of the 5' UTRs represents the evolution of new promoter sequences driving the transcription of the elements. Once a new promoter is formed, copies containing this new promoter may differ in their expression, thus giving rise to new lineages of elements.</p>
            <p>Similar examples of the rapid evolution of the 5' UTRs of R2s were not detected. We have previously suggested that R2 elements do not contain their own promoter but are co-transcribed with the 28S gene <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. This co-transcription may make it more difficult for independent R2 lineages to evolve. A single lineage of R2 elements has been found in the <it>Drosophila </it>genus <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, while four distinct lineages of <it>Drosophila </it>R1 elements exist <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. It is also possible that the more frequent loss of R2 from a <it>Drosophila </it>species (3 out of 12 species) compared to R1 (no losses among the 12 species) could also be the result of R2's reliance upon co-transcription with the rDNA units in which they reside.</p>
         </sec>
         <sec>
            <st>
               <p>Second-strand DNA synthesis</p>
            </st>
            <p>Many of the initial steps involved in the cleavage of the target site bottom strand and its use to prime reverse transcription have been characterized using purified R2 protein <it>in vitro </it><abbrgrp><abbr bid="B10">10</abbr><abbr bid="B38">38</abbr><abbr bid="B49">49</abbr></abbrgrp>. The recent discoveries that sequences near the 5' end of the RNA transcript regulate top-strand cleavage <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, and that the R2 polymerase can efficiently displace an RNA strand while using a DNA strand as template <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, suggest that the R2 protein also synthesizes the second DNA strand. However, many questions remain concerning whether and how top-strand cleavage of the target DNA is used to prime second-strand synthesis.</p>
            <p>Because of an unusual template jumping ability of the R2 polymerase <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, we previously suggested that during reverse transcription the R2 polymerase jumps from the R2 transcript onto the upstream DNA target sequences <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. The hundreds of R2 5' junctions analyzed here suggest a different model. In <it>Drosophila </it>we propose that the 5' ends of the R2 RNA transcripts contain terminal G residues that, after reverse transcription and top-strand cleavage, enable the terminal C residues to anneal to the G residues of the top DNA strand after cleavage (Figure <figr fid="F5">5c</figr>). This model would explain the most common full-length R2 insertions in all <it>Drosophila </it>species. Similarly, microhomologies between internal R2 sequences and the upstream 28S gene sequences can explain the priming of second-strand DNA synthesis in many 5' truncated R2 insertions. When annealing of microhomologies does not occur, non-templated residues can be observed that were either added during first-strand synthesis when the R2 polymerase runs off an RNA template <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> or during second-strand synthesis before the R2 polymerase engages the cDNA <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>.</p>
            <p>The only aspect of this integration model that does not agree with previous biochemical studies is the location of the top-strand cleavage site. The experiments using the <it>B. mori </it>R2 protein suggested this cleavage was two nucleotides upstream of the bottom-strand site <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> while our analysis of <it>Drosophila </it>R2 junctions suggest it is opposite the bottom-strand site (Figure <figr fid="F5">5c</figr>). We suggest this represents an evolved difference between the R2 elements in these divergent insects. We have analyzed the 5' junctions of over 40 <it>B. mori </it>R2 insertions and found their structure is consistent with the location of top-strand cleavage determined with the purified protein <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. We suggest that cleavage of the top strand by the R2 endonuclease is not rigidly determined, and thus its location can vary.</p>
            <p>Much less was previously known about the R1 retrotransposition mechanism. <it>In vitro </it>studies of the R1 endonuclease, again from <it>B. mori</it>, revealed a bottom-strand cleavage site in a position consistent with its use to prime reverse transcription <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. All <it>Drosophila </it>R1 3' junctions, as well as 3' junctions in other insect species <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, are consistent with this cleavage site. As in the case of R2, the R1 5' junctions suggest cleavage of the top strand is less precise and subject to evolutionary changes. For many <it>Drosophila </it>R1 insertions top-strand cleavage was proposed to be at a site 14 bp downstream of the bottom strand, again consistent with the biochemical studies. Use of this site to prime second-strand synthesis results in a 14 bp TSD. Minor changes in this cleavage location were found in <it>D. ananassae</it>, where the R1B insertions have 11 or 13 bp TSDs, and in <it>D. willistoni</it>, where some ancestral type R1A insertions have an 8 bp TSD. Annealing of cDNA sequences to the upstream target site presumably does not occur as significant nucleotide variation is observed at the junctions.</p>
            <p>A radical change in top-strand cleavage site preference is proposed for the R1A elements of the melanogaster species group. In these species, many R1 insertions result in a 9 bp target site deletion, suggesting cleavage frequently occurs 9 bp upstream of the bottom-strand site (Figure <figr fid="F10">10</figr>). Associated with this remarkable change in cleavage site preference also appears to be a change in the nature of the R1 RNA transcript used for retrotransposition. Based on the structure of the ISR full-length insertions and of non-28S insertions, these melanogaster-type R1 transcripts contain from 6 to 27 nucleotides of 28S sequence ending 9 bp upstream of the integration site. These upstream 28S sequences on full length R1 transcripts appear to anneal to the target site, giving rise to highly precise 5' junctions. Interestingly, these melanogaster-type R1s retain the ability to cleave downstream of the integration site, resulting in the ISRs seen in some full-length insertions and the TSDs observed for all 5' truncated insertions. Because annealing of the upstream 28S sequences is not possible with these downstream cleavage sites, significant variation is observed at these junctions. We searched for evidence of these additional 28S sequence tags among available expressed sequence tag sequences from <it>D. melanogaster</it>. However, all R1 sequences contained extensive upstream 28S sequences or were transcripts of R1 tandem arrays (data not shown).</p>
            <p>Thus, the integration mechanisms used by the R1 and R2 elements are quite similar. Cleavage of the bottom strand and its use to prime first-strand DNA synthesis is rigidly determined, and no variation was observed between species. Cleavage of the top strand and its use to prime second-strand synthesis is flexible, which results in different junctions both within and between species. R1 and R2 are highly successful in the 28S niche. Whether change in the location of top-strand cleavage occurs simply because it is neutral or because it increases insertion efficiency is unclear. Nevertheless, the ability of R1 and R2 to explore the top-strand cleavage site suggests that variations of the retrotransposition mechanism can evolve among arthropods that could affect both the elements and the rDNA loci in which they reside.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Species and databases</p>
            </st>
            <p>Original sequencing reads from the whole genome shotgun sequencing projects were accessed by Blast search (version 2.2.17) in the trace archives at NCBI <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The 12 <it>Drosophila </it>species and their sequencing fold coverage were: <it>D. ananassae </it>(9-fold), <it>D. erecta </it>(10-fold), <it>D. grimshawi </it>(8-fold), <it>D. melanogaster </it>(12-fold), <it>D. mojavensis </it>(8-fold), <it>D. persimilis </it>(4-fold), <it>D. pseudoobscura </it>(9-fold), <it>D. sechellia </it>(5-fold), <it>D. simulans </it>(4-fold; white 501 strain only), <it>D. virilis </it>(8-fold), <it>D. willistoni </it>(8.4-fold) and <it>D. yakuba </it>(9-fold).</p>
         </sec>
         <sec>
            <st>
               <p>Assembling R1 and R2 elements</p>
            </st>
            <p>Downstream 28S sequences flanking the known R1 and R2 insertion sites were used as Blast queries using default parameters except for requesting the maximum of 20,000 target sequences. The reads were collected, trimmed upstream of the query and aligned in ClustalX <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Those sequences upstream of the query that were not 28S rRNA sequences were considered putative R1 or R2 elements and were used as Blast queries in the nucleotide collection database (nr/nt) and VecScreen at NCBI <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. Some putative elements were identified as cloning vector sequence and disregarded. For the remaining majority of sequences, 5' extensions of the sequences were assembled by Blast search, followed by alignment and extraction of a consensus. Iterations were done until the 5' junction of the element with the 28S was reached. Full assemblies were not possible in a few instances because of low element copy number and low coverage of some genome sequences.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetics</p>
            </st>
            <p>Alignments of concatenated 18S and 28S sequences (from Stage and Eickbush <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and available at the Eickbush lab website <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>) were done in ClustalX <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> with minor manipulations done in Jalview <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Default parameters of phyML version 3.0 <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> as implemented through the LIRMM website <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp> were used to construct maximum likelihood trees and visualized using TreeDyn 198 <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Branch support values were the minimum score based on Shimodaira-Hasegawa-like and Chi2-based approximate likelihood-ratio tests <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Nucleotide variation in the 5' and 3' ends of elements and sequence divergence between species</p>
            </st>
            <p>Blast queries were 75 bp long near the 5' or 3' end of each consensus R1 or R2 sequence; default Blast parameters were used except for requesting the maximum of 20,000 target sequences and using a match/mismatch score of 1,-1. The Blast results were trimmed to include 400 bp of the element and 100 bp of flanking sequence. Retention of the flanking sequence enabled those elements present in the 28S gene to be separated from those copies inserted elsewhere in the genome. ClustalX alignment <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> of results from each Blast search was conducted and groups of like sequences were extracted and consensus sequences derived. Quality scores for reads containing differences from the main consensus were examined to verify that the variation observed was not sequencing error (scores less than 40 were considered sequencing error). The Sequence Manipulation Suite: Ident and Sim <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> aided determination of the percent identity of the variants relative to the consensus.</p>
            <p>In theory it should be possible to make copy number estimates based on the genome sequence coverage and the number of trace reads recovered. In practice, however, random variation in fold coverage for any given sequence means only an estimate of copy number for repetitive sequences can be inferred. In addition, rDNA loci in many species of <it>Drosophila </it>are located on the X and Y chromosomes, resulting in different sequencing coverage. Finally, by requiring analyzed reads to cover 400 bp to enable a significant and uniform level of information to be recovered from each read, fewer reads were obtained for each analysis. Thus, estimates of element abundance were made relative to the number of rDNA units also recovered in the trace archives (that is, fraction of rDNA units inserted). Copies with small numbers of reads (approximately 2 to 12) and identical sequence variation were interpreted as single elements.</p>
         </sec>
         <sec>
            <st>
               <p>Insertions outside the 28S genes</p>
            </st>
            <p>Using the 5' and 3' ends of R1 and R2 elements as Blast queries and analyzing the flanking sequences, we found many examples of R1 elements inserted outside the usual 28S site. Identification of the non-28S sequences was attempted by comparing them to our assembled rDNA-related sequences and by Blast search of the NCBI non-redundant nucleotide database <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. While a few target sites could be identified, the majority of sequences had not been characterized and represented repeated sequences, and thus were presumed to be heterochromatic. We then attempted to find uninserted copies of the sequences using the flanking sequences as Blast queries. In those cases where the target site was repeated, we could identify examples of uninserted target sites, and in some instances the opposite junction of the R1 insertion.</p>
         </sec>
         <sec>
            <st>
               <p>dN/dS analysis</p>
            </st>
            <p>Selection analysis was done using the HyPhy package available at Datamonkey <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. We conducted all the available tests and report here the dN/dS results of the PARRIS test <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, which were similar in all the tests.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>ISR: insertion site rearrangement; ITS: internal transcribed spacer; LTR: long terminal repeat; ORF: open reading frame; rDNA loci: loci encoding the tandem array of rRNA genes; TE: transposable element; TPRT: target primed reverse transcription; TSD: target site duplication; UTR: untranslated region.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DS participated in the design, carried out all the experiments and drafted the manuscript. TE participated in the design and helped with the manuscript. Both authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Each file contains the consensus nucleotide sequence for an R1 or R2 element: <it>Drosophila ananassae </it>R2 (Additional data file <supplr sid="S1">1</supplr>); <it>Drosophila melanogaster </it>R2 (Additional data file <supplr sid="S2">2</supplr>); <it>Drosophila persimilis </it>R2 (Additional data file <supplr sid="S3">3</supplr>); <it>Drosophila pseudoobscura </it>R2 (Additional data file <supplr sid="S4">4</supplr>); <it>Drosophila sechellia </it>R2 (Additional data file <supplr sid="S5">5</supplr>); <it>Drosophila simulans </it>R2 (Additional data file <supplr sid="S6">6</supplr>); <it>Drosophila willistoni </it>R2.1 (Additional data file <supplr sid="S7">7</supplr>); <it>Drosophila yakuba </it>R2 (Additional data file <supplr sid="S8">8</supplr>); <it>Drosophila ananassae </it>R1A (Additional data file <supplr sid="S9">9</supplr>); <it>Drosophila ananassae </it>R1B (Additional data file <supplr sid="S10">10</supplr>); <it>Drosophila erecta </it>R1 (Additional data file <supplr sid="S11">11</supplr>); <it>Drosophila grimshawi </it>R1.1 (Additional data file <supplr sid="S12">12</supplr>); <it>Drosophila mojavensis </it>R1 (Additional data file <supplr sid="S13">13</supplr>); <it>Drosophila melanogaster </it>R1 (Additional data file <supplr sid="S14">14</supplr>); <it>Drosophila persimilis </it>R1 (Additional data file <supplr sid="S15">15</supplr>); <it>Drosophila pseudoobscura </it>R1 (Additional data file <supplr sid="S16">16</supplr>); <it>Drosophila sechellia </it>R1 (Additional data file <supplr sid="S17">17</supplr>); <it>Drosophila simulans </it>R1 (Additional data file <supplr sid="S18">18</supplr>); <it>Drosophila virilis </it>R1 (Additional data file <supplr sid="S19">19</supplr>); <it>Drosophila willistoni </it>R1 (Additional data file <supplr sid="S20">20</supplr>); <it>Drosophila yakuba </it>R1 (Additional data file <supplr sid="S21">21</supplr>); <it>Drosophila grimshawi </it>R1.2 (Additional data file <supplr sid="S22">22</supplr>); <it>Drosophila willistoni </it>R2.2 (Additional data file <supplr sid="S23">23</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p><it>Drosophila ananassae </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila ananassae </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S1.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p><it>Drosophila melanogaster </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila melanogaster </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S2.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p><it>Drosophila persimilis </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila persimilis </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S3.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p><it>Drosophila pseudoobscura </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila pseudoobscura </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S4.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p><it>Drosophila sechellia </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila sechellia </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S5.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p><it>Drosophila simulans </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila simulans </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S6.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p><it>Drosophila willistoni </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila willistoni </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S7.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p><it>Drosophila yakuba </it>R2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila yakuba </it>R2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S8.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p><it>Drosophila ananassae </it>R1A complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila ananassae </it>R1A complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S9.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S10">
            <title>
               <p>Additional data file 10</p>
            </title>
            <caption>
               <p><it>Drosophila ananassae </it>R1B complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila ananassae </it>R1B complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S10.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S11">
            <title>
               <p>Additional data file 11</p>
            </title>
            <caption>
               <p><it>Drosophila erecta </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila erecta </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S11.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S12">
            <title>
               <p>Additional data file 12</p>
            </title>
            <caption>
               <p><it>Drosophila grimshawi </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila grimshawi </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S12.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S13">
            <title>
               <p>Additional data file 13</p>
            </title>
            <caption>
               <p><it>Drosophila mojavensis </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila mojavensis </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S13.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S14">
            <title>
               <p>Additional data file 14</p>
            </title>
            <caption>
               <p><it>Drosophila melanogaster </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila melanogaster </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S14.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S15">
            <title>
               <p>Additional data file 15</p>
            </title>
            <caption>
               <p><it>Drosophila persimilis </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila persimilis </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S15.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S16">
            <title>
               <p>Additional data file 16</p>
            </title>
            <caption>
               <p><it>Drosophila pseudoobscura </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila pseudoobscura </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S16.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S17">
            <title>
               <p>Additional data file 17</p>
            </title>
            <caption>
               <p><it>Drosophila sechellia </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila sechellia </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S17.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S18">
            <title>
               <p>Additional data file 18</p>
            </title>
            <caption>
               <p><it>Drosophila simulans </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila simulans </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S18.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S19">
            <title>
               <p>Additional data file 19</p>
            </title>
            <caption>
               <p><it>Drosophila virilis </it>R1 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila virilis </it>R1 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S19.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S20">
            <title>
               <p>Additional data file 20</p>
            </title>
            <caption>
               <p><it>Drosophila willistoni </it>R1 complete consensussequence</p>
            </caption>
            <text>
               <p><it>Drosophila willistoni </it>R1 complete consensussequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S20.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S21">
            <title>
               <p>Additional data file 21</p>
            </title>
            <caption>
               <p><it>Drosophila yakuba </it>R1 complete sequence</p>
            </caption>
            <text>
               <p><it>Drosophila yakuba </it>R1 complete sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S21.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S22">
            <title>
               <p>Additional data file 22</p>
            </title>
            <caption>
               <p><it>Drosophila grimshawi </it>R1.2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila grimshawi </it>R1.2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S22.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S23">
            <title>
               <p>Additional data file 23</p>
            </title>
            <caption>
               <p><it>Drosophila willistoni </it>R2.2 complete consensus sequence</p>
            </caption>
            <text>
               <p><it>Drosophila willistoni </it>R2.2 complete consensus sequence.</p>
            </text>
            <file name="gb-2009-10-5-r49-S23.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Danna Eickbush for comments on the manuscript. This research was funded by National Science Foundation grant (MCB-0544071) and National Institutes of Health grant (GM42790).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Turning junk into gold: domestication of transposable elements and the creation of new genes in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Volff</snm>
                  <fnm>JN</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2006</pubdate>
            <volume>28</volume>
            <fpage>913</fpage>
            <lpage>922</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.20452</pubid>
                  <pubid idtype="pmpid" link="fulltext">16937363</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Inviting instability: Transposable elements, double-strand breaks, and the maintenance of genome integrity.</p>
            </title>
            <aug>
               <au>
                  <snm>Hedges</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Deininger</snm>
                  <fnm>PL</fnm>
               </au>
            </aug>
            <source>Mutat Res</source>
            <pubdate>2007</pubdate>
            <volume>616</volume>
            <fpage>46</fpage>
            <lpage>59</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1850990</pubid>
                  <pubid idtype="pmpid" link="fulltext">17157332</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Repeated genes in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Long</snm>
                  <fnm>EO</fnm>
               </au>
               <au>
                  <snm>Dawid</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>1980</pubdate>
            <volume>49</volume>
            <fpage>727</fpage>
            <lpage>764</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.bi.49.070180.003455</pubid>
                  <pubid idtype="pmpid" link="fulltext">6996571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Finely orchestrated movements: evolution of the ribosomal RNA genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2007</pubdate>
            <volume>175</volume>
            <fpage>477</fpage>
            <lpage>485</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800602</pubid>
                  <pubid idtype="pmpid">17322354</pubid>
                  <pubid idtype="doi">10.1534/genetics.107.071399</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>R2 and related site-specific non-long terminal repeat retrotransposons.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mobile DNA II</source>
            <publisher>Washington DC: American Society of Microbiology</publisher>
            <editor>Craig NL, Craigie R, Gellart M, Lambowitz AM</editor>
            <pubdate>2002</pubdate>
            <fpage>813</fpage>
            <lpage>835</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Are retrotransposons long-term hitchhikers?.</p>
            </title>
            <aug>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Malik</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>WC</fnm>
                  <suf>III</suf>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>392</volume>
            <fpage>141</fpage>
            <lpage>142</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/32330</pubid>
                  <pubid idtype="pmpid" link="fulltext">9515960</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The age and evolution of non-LTR retrotransposable elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Malik</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>793</fpage>
            <lpage>805</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10368957</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Cross-genome screening of novel sequence-specific non-LTR retrotransposons: various multicopy RNA genes and satellites are selected as targets.</p>
            </title>
            <aug>
               <au>
                  <snm>Kojima</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>207</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg235</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Identification of rDNA-specific non-LTR retrotransposons in Cnidaria.</p>
            </title>
            <aug>
               <au>
                  <snm>Kojima</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Kuma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Toh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <fpage>1984</fpage>
            <lpage>1993</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msl067</pubid>
                  <pubid idtype="pmpid" link="fulltext">16870681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition.</p>
            </title>
            <aug>
               <au>
                  <snm>Luan</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Korman</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Jakubczak</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1993</pubdate>
            <volume>72</volume>
            <fpage>595</fpage>
            <lpage>605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(93)90078-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">7679954</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>RNA from the 5' end of the R2 retrotransposon controls R2 protein binding to and cleavage of its DNA target.</p>
            </title>
            <aug>
               <au>
                  <snm>Christensen</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>17602</fpage>
            <lpage>17607</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693793</pubid>
                  <pubid idtype="pmpid" link="fulltext">17105809</pubid>
                  <pubid idtype="doi">10.1073/pnas.0605476103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Retrotransposon R1Bm endonuclease cleaves the target sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Feng</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Schumann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Boeke</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>2083</fpage>
            <lpage>2088</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19257</pubid>
                  <pubid idtype="pmpid" link="fulltext">9482842</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.5.2083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Characterization of the sequence specificity of the R1Bm endonuclease domain by structural and biochemical studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Maita</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Aoyagi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Osani</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shirakawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>3918</fpage>
            <lpage>3927</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1919474</pubid>
                  <pubid idtype="pmpid" link="fulltext">17537809</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm397</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Twin priming: A proposed mechanism for the creation of inversions in L1 retrotransposition.</p>
            </title>
            <aug>
               <au>
                  <snm>Ostertag</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Kazazian</snm>
                  <fnm>HH</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>2059</fpage>
            <lpage>2065</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311219</pubid>
                  <pubid idtype="pmpid" link="fulltext">11731496</pubid>
                  <pubid idtype="doi">10.1101/gr.205701</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Molecular evidence for genetic exchange among ribosomal genes on nonhomologous chromosomes in man and apes.</p>
            </title>
            <aug>
               <au>
                  <snm>Arnheim</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Krystal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schmickel</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1980</pubdate>
            <volume>77</volume>
            <fpage>7323</fpage>
            <lpage>7327</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">350495</pubid>
                  <pubid idtype="pmpid">6261251</pubid>
                  <pubid idtype="doi">10.1073/pnas.77.12.7323</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Human rDNA: evolutionary patterns within the genes and tandem arrays derived from multiple chromosomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gonzalez</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Sylvester</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2001</pubdate>
            <volume>73</volume>
            <fpage>255</fpage>
            <lpage>263</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.2001.6540</pubid>
                  <pubid idtype="pmpid" link="fulltext">11350117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Highly efficient concerted evolution in the ribosomal DNA repeats: Total DNA repeat variation revealed by whole-genome shotgun sequence data.</p>
            </title>
            <aug>
               <au>
                  <snm>Ganley</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>184</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781350</pubid>
                  <pubid idtype="pmpid" link="fulltext">17200233</pubid>
                  <pubid idtype="doi">10.1101/gr.5457707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Sequence variation within the rRNA gene loci of 12 <it>Drosophila </it>species.</p>
            </title>
            <aug>
               <au>
                  <snm>Stage</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1888</fpage>
            <lpage>1897</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099596</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989256</pubid>
                  <pubid idtype="doi">10.1101/gr.6376807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Retrotransposable elements R1 and R2 interrupt the rRNA genes of most insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Jakubczak</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <fpage>3295</fpage>
            <lpage>3299</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">51433</pubid>
                  <pubid idtype="pmpid">1849649</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.8.3295</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Transcription of endogenous and exogenous R2 elements in the rRNA gene locus of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>23</volume>
            <fpage>3825</fpage>
            <lpage>3836</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">155226</pubid>
                  <pubid idtype="pmpid" link="fulltext">12748285</pubid>
                  <pubid idtype="doi">10.1128/MCB.23.11.3825-3836.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Evolution of R1 and R2 in the rDNA units of the genus <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>WC</fnm>
                  <suf>III</suf>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1997</pubdate>
            <volume>100</volume>
            <fpage>49</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1018396505115</pubid>
                  <pubid idtype="pmpid" link="fulltext">9440258</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Sequence relationship of retrotransposable elements R1 and R2 within and between divergent insect species.</p>
            </title>
            <aug>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jakubczak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <fpage>163</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8383793</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Evolution of genes and genomes on the <it>Drosophila </it>phylogeny.</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Markow</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Kaufman</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gelbart</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sackton</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Larracuente</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Abad</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Abt</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Adryan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Aguade</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Aquadro</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Ardell</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Arguello</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Artieri</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Barbash</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barsanti</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batterham</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Begun</snm>
                  <fnm>D</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <fpage>203</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06341</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Type I (R1) and Type II (R2) ribosomal DNA insertions of <it>Drosophila melanogaster </it>are retrotransposable elements closely related to those of <it>Bombyx mori</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Jakubczak</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>212</volume>
            <fpage>37</fpage>
            <lpage>52</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(90)90303-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">1690812</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A single lineage of R2 retrotransposable elements is an active evolutionarily stable component of the <it>Drosophila </it>rDNA locus.</p>
            </title>
            <aug>
               <au>
                  <snm>Lathe</snm>
                  <fnm>WC</fnm>
                  <suf>III</suf>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1997</pubdate>
            <volume>14</volume>
            <fpage>1232</fpage>
            <lpage>1241</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9402733</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Vertical transmission of the retrotransposable elements R1 and R2 during the evolution of the <it>Drosophila melanogaster </it>species subgroup.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1995</pubdate>
            <volume>139</volume>
            <fpage>671</fpage>
            <lpage>684</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1206373</pubid>
                  <pubid idtype="pmpid" link="fulltext">7713424</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The transposable elements of the <it>Drosophila melanogaster </it>euchromatin: a genomics perspective.</p>
            </title>
            <aug>
               <au>
                  <snm>Kaminker</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Kronmiller</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Svirskas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frise</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Celniker</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>RESEARCH0084</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151186</pubid>
                  <pubid idtype="pmpid" link="fulltext">12537573</pubid>
                  <pubid idtype="doi">10.1186/gb-2002-3-12-research0084</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Multiple lineages of R1 retrotransposable elements can coexist in the rDNA loci of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gentile</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>235</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11158382</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Identification of Waldo-A and Waldo-B, two closely related non-LTR retrotransposons in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Busseau</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Berezikov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bucheton</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>196</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11158378</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Subfamilies of CR1 non-LTR retrotransposons have different 5' UTR sequences but are otherwise conserved.</p>
            </title>
            <aug>
               <au>
                  <snm>Haas</snm>
                  <fnm>NB</fnm>
               </au>
               <au>
                  <snm>Grabowski</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>North</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Burch</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2001</pubdate>
            <volume>265</volume>
            <fpage>175</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(01)00344-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11255020</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A retrotransposable element from the mosquito <it>Anopheles gambiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1990</pubdate>
            <volume>10</volume>
            <fpage>863</fpage>
            <lpage>871</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">360921</pubid>
                  <pubid idtype="pmpid" link="fulltext">1689457</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Conserved features at the 5' end of <it>Drosophila </it>R2 retrotransposable elements: implications for transcription and translation.</p>
            </title>
            <aug>
               <au>
                  <snm>George</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>8</volume>
            <fpage>3</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2583.1999.810003.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9927169</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Epigenetic regulation of retrotransposons within the nucleolus of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2008</pubdate>
            <volume>28</volume>
            <fpage>6452</fpage>
            <lpage>6461</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2577413</pubid>
                  <pubid idtype="pmpid" link="fulltext">18678644</pubid>
                  <pubid idtype="doi">10.1128/MCB.01015-08</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Population genetics and phylogenetics of DNA sequence variation at multiple loci within the <it>Drosophila melanogaster </it>species complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kliman</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <fpage>804</fpage>
            <lpage>822</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8355601</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Long-term inheritance of the 28S rDNA-specific retrotransposon R2.</p>
            </title>
            <aug>
               <au>
                  <snm>Kojima</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>2157</fpage>
            <lpage>2165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi210</pubid>
                  <pubid idtype="pmpid" link="fulltext">16014872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Analysis of the 5' junctions of <it>R2 </it>insertions with the 28S gene: implications for non-LTR retrotransposition.</p>
            </title>
            <aug>
               <au>
                  <snm>George</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>142</volume>
            <fpage>853</fpage>
            <lpage>863</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207023</pubid>
                  <pubid idtype="pmpid" link="fulltext">8849892</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The domain structure and retrotransposition mechanism of R2 elements are conserved throughout arthropods.</p>
            </title>
            <aug>
               <au>
                  <snm>Burke</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Malik</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>502</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10331276</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>RNA template requirements for target DNA-primed reverse transcription by the R2 retrotransposable element.</p>
            </title>
            <aug>
               <au>
                  <snm>Luan</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1995</pubdate>
            <volume>15</volume>
            <fpage>3882</fpage>
            <lpage>3891</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">230628</pubid>
                  <pubid idtype="pmpid" link="fulltext">7540721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>End-to-end template jumping by the reverse transcriptase encoded by the R2 retrotransposon.</p>
            </title>
            <aug>
               <au>
                  <snm>Bibillo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <fpage>14945</fpage>
            <lpage>14953</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">14752111</pubid>
                  <pubid idtype="doi">10.1074/jbc.M310450200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Characterization of <it>Aphidius ervi </it>(Hymenoptera, Braconidae) ribosomal genes and identification of site-specific insertion elements belonging to the non-LTR retrotransposon family.</p>
            </title>
            <aug>
               <au>
                  <snm>Varricchio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gargiulo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Graziani</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Manzi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Digilio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tremblay</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Malva</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Insect Biochem Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>25</volume>
            <fpage>603</fpage>
            <lpage>612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">7787843</pubid>
                  <pubid idtype="doi">10.1016/0965-1748(94)00102-N</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Functional expression of a sequence-specific endonuclease encoded by the retrotransposon R2Bm.</p>
            </title>
            <aug>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1988</pubdate>
            <volume>55</volume>
            <fpage>235</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(88)90046-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">2844414</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Unequal crossing-over accounts for the organization of <it>Drosophila virilis </it>rDNA insertions and the integrity of flanking 28S gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Rae</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1982</pubdate>
            <volume>296</volume>
            <fpage>579</fpage>
            <lpage>581</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/296579a0</pubid>
                  <pubid idtype="pmpid">6280055</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Genome-wide screening and characterization of transposable elements and their distribution analysis in the silkworm, <it>Bombyx mori</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Osani-Futahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Suetsugu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Insect Biochem Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>38</volume>
            <fpage>1046</fpage>
            <lpage>1057</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ibmb.2008.05.012</pubid>
                  <pubid idtype="pmpid">19280695</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Dynamics of R1 and R2 elements in the rDNA locus of <it>Drosophila simulans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>P&#233;rez-Gonz&#225;lez</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>158</volume>
            <fpage>1557</fpage>
            <lpage>1567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461747</pubid>
                  <pubid idtype="pmpid">11514447</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The pattern of R2 retrotransposon activity in natural populations of <it>Drosophila simulans </it>reflects the dynamic nature of the rDNA locus.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2009</pubdate>
            <volume>5</volume>
            <fpage>e1000386</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2637433</pubid>
                  <pubid idtype="pmpid" link="fulltext">19229317</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.1000386</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Rates of R1 and R2 retrotransposition and elimination from the rDNA locus of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>P&#233;rez-Gonz&#225;lez</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2002</pubdate>
            <volume>162</volume>
            <fpage>799</fpage>
            <lpage>811</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462293</pubid>
                  <pubid idtype="pmpid">12399390</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Characterization of active R2 retrotransposition in the rDNA locus of <it>Drosophila simulans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>170</volume>
            <fpage>195</fpage>
            <lpage>200</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449725</pubid>
                  <pubid idtype="pmpid">15781697</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.038703</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Retrotransposable elements R1 and R2 in the rDNAunits of <it>Drosophila mercatorum: abnormal abdomen </it>revisited.</p>
            </title>
            <aug>
               <au>
                  <snm>Malik</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>151</volume>
            <fpage>653</fpage>
            <lpage>665</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460499</pubid>
                  <pubid idtype="pmpid">9927458</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Downstream 28S gene sequences on the RNA template affect the choice of primer and the accuracy of initiation by the reverse transcriptase.</p>
            </title>
            <aug>
               <au>
                  <snm>Luan</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1996</pubdate>
            <volume>16</volume>
            <fpage>4726</fpage>
            <lpage>4734</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">231473</pubid>
                  <pubid idtype="pmpid" link="fulltext">8756630</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>DNA-directed DNA polymerase and strand displacement activity of the reverse transcriptase encoded by the R2 retrotransposon.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurzynska-Kokorniak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jamburuthugoda</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>Bibillo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>374</volume>
            <fpage>322</fpage>
            <lpage>333</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2121658</pubid>
                  <pubid idtype="pmpid" link="fulltext">17936300</pubid>
                  <pubid idtype="doi">10.1016/j.jmb.2007.09.047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>NCBI BLAST.</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/BLAST/</url>
         </bibl>
         <bibl id="B52">
            <title>
               <p>The CLUSTAL_X windows interface: Flexible strategies for multiple sequence alignment aided by quality analysis tool.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
                  <pubid idtype="doi">10.1093/nar/25.24.4876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Eickbush Lab Website.</p>
            </title>
            <url>http://www.rochester.edu/college/BIO/labs/thelab/</url>
         </bibl>
         <bibl id="B54">
            <title>
               <p>The Jalview Java alignment editor.</p>
            </title>
            <aug>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>426</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg430</pubid>
                  <pubid idtype="pmpid" link="fulltext">14960472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150390235520</pubid>
                  <pubid idtype="pmpid">14530136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>LIRMM.</p>
            </title>
            <url>http://www.phylogeny.fr</url>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Phylogeny.fr: robust phylogenetic analysis for the non-specialist.</p>
            </title>
            <aug>
               <au>
                  <snm>Dereeper</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Guignon</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Blanc</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Audic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buffet</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chevenet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Dufayard</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lefort</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lescot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Claverie</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>W465</fpage>
            <lpage>469</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447785</pubid>
                  <pubid idtype="pmpid" link="fulltext">18424797</pubid>
                  <pubid idtype="doi">10.1093/nar/gkn180</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>TreeDyn: towards dynamic graphics and annotations for analyses of trees.</p>
            </title>
            <aug>
               <au>
                  <snm>Chevenet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brun</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ba&#241;uls</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Jacq</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Christen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>439</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1615880</pubid>
                  <pubid idtype="pmpid" link="fulltext">17032440</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-439</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative.</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2006</pubdate>
            <volume>55</volume>
            <fpage>539</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150600755453</pubid>
                  <pubid idtype="pmpid" link="fulltext">16785212</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Sequence Manipulation Suite: Ident and Sim.</p>
            </title>
            <url>http://www.ualberta.ca/~stothard/javascript/ident_sim.html</url>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Datamonkey: rapid detection of selective pressure on individual sites of codon alignments.</p>
            </title>
            <aug>
               <au>
                  <snm>Pond</snm>
                  <fnm>SLK</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SDW</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>2531</fpage>
            <lpage>2533</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti320</pubid>
                  <pubid idtype="pmpid" link="fulltext">15713735</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Robust inference of positive selection from recombining coding sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Scheffler</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Seoighe</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>2493</fpage>
            <lpage>2499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl427</pubid>
                  <pubid idtype="pmpid" link="fulltext">16895925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
