<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-2-r39</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Abundant and species-specific <it>DINE-1 </it>transposable elements in 12 <it>Drosophila </it>genomes</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Yang</snm>
               <fnm>Hsiao-Pei</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>hy31@cornell.edu</email>
            </au>
            <au id="A2">
               <snm>Barbash</snm>
               <mi>A</mi>
               <fnm>Daniel</fnm>
               <insr iid="I2"/>
               <email>dab87@cornell.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Genetics, National Yang-Ming University, Taipei 112, Taiwan</p>
            </ins>
            <ins id="I2">
               <p>Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>2</issue>
         <fpage>R39</fpage>
         <url>http://genomebiology.com/2008/9/2/R39</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18291035</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-2-r39</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>17</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>21</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>21</day>
               <month>02</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Yang and Barbash; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>MITE transposons in Drosophila</p>
      </shorttitle>
      <shortabs>
         <p>Evidence is presented that DINE-1 is a highly abundant miniature inverted-repeat transposable element (MITE) family present in all 12 Drosophila species with whole-genome sequence available.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Miniature inverted-repeat transposable elements (MITEs) are non-autonomous DNA-mediated transposable elements (TEs) derived from autonomous TEs. Unlike in many plants or animals, MITEs and other types of DNA-mediated TEs were previously thought to be either rare or absent in <it>Drosophila</it>. Most other TE families in <it>Drosophila </it>exist at low or intermediate copy number (around &lt; 100 per genome).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present evidence here that the dispersed repeat <it>Drosophila </it>interspersed element 1 (<it>DINE-1</it>; also named <it>INE-1 </it>and <it>DNAREP1</it>) is a highly abundant DNA-mediated TE containing inverted repeats found in all 12 sequenced <it>Drosophila </it>genomes. All <it>DINE-1</it>s share a similar sequence structure, but are more homogeneous within species than they are among species. The inferred phylogenetic relationship of the <it>DINE-1 </it>consensus sequence from each species is generally consistent with the known species phylogeny, suggesting vertical transmission as the major mechanism for <it>DINE-1 </it>propagation. Exceptions observed in <it>D. willistoni </it>and <it>D. ananassae </it>could be due to either horizontal transfer or reactivation of ancestral copies. Our analysis of pairwise percentage identity of <it>DINE-1 </it>copies within species suggests that the transpositional activity of <it>DINE-1 </it>is extremely dynamic, with some lineages showing evidence for recent transpositional bursts and other lineages appearing to have silenced their <it>DINE-1</it>s for long periods of time. We also find that all species have many <it>DINE-1 </it>insertions in introns and adjacent to protein-coding genes. Finally, we discuss our results in light of a recent proposal that <it>DINE-1</it>s belong to the <it>Helitron </it>family of TEs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We find that all 12 <it>Drosophila </it>species with whole-genome sequence contain the high copy element <it>DINE-1</it>. Although all <it>DINE-1</it>s share a similar structure, species-specific variation in the distribution of average pairwise divergence suggests that <it>DINE-1 </it>has gone through multiple independent cycles of activation and suppression. <it>DINE-1 </it>also has had a significant impact on gene structure evolution.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Interspersed repetitive sequences are ubiquitous to all eukaryotic organisms, and make up a significant portion of the genome <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. These sequences are mostly transposable elements (TEs) or TE-derived sequences, and they play important roles in the evolution of chromosome organization and genome complexity <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Based on their mechanism of transposition, TEs can be divided into two classes: class I comprises retrotransposons, which transpose through RNA-mediated mechanisms, and class II comprises transposons, which mobilize through DNA-mediated mechanisms <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Depending on their ability to direct their own transposition, each class of TEs can contain two types: autonomous and non-autonomous copies. Autonomous TEs code for the proteins that are required for their transposition, and are mobilized in <it>cis</it>. Non-autonomous TEs are mobilized in <it>trans </it>by enzymes produced from autonomous elements. Well-known examples include the vertebrate retroelements LINEs (long interspersed elements) and SINEs (short interspersed elements). The mobilization of non-autonomous SINEs requires retrotransposase from autonomous LINEs, and these elements co-evolve in a highly species-specific manner <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Another example is the miniature inverted-repeat transposable elements (MITEs) found in many plant genomes. MITEs are non-autonomous DNA elements (class II) that originated from a subset of autonomous DNA transposons <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. They are characterized by short sequences with no coding capacity, flanked by terminal (or occasionally subterminal) inverted repeats (TIRs) and very short direct repeats caused by target site duplication (TSD). MITEs have no internal homology to their parental autonomous transposons and often include non-homologous sequences in their internal regions. MITEs have also been found in several animal genomes, including <it>Caenorhabditis elegans</it>, mosquitoes, fish and humans (reviewed in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>). Both SINEs and MITEs are highly abundant (usually > 1,000 copies per genome) in many host species across a broad taxonomic range. Because of their high abundance and active movement, and their frequent association with genes <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, MITEs have had a significant impact on the evolution and complexity of eukaryotic genomes.</p>
         <p>TE activity and evolution have been intensively studied in <it>Drosophila </it>and many families of TEs have been described <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Most TEs are at low or intermediate copy number in <it>D. melanogaster</it>. MITEs and SINEs have been previously reported as being either rare or absent in most species of this genus that have been examined <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. <it>D. melanogaster DINE-1 </it>(<it>Drosophila </it>interspersed element 1; also named <it>INE-1</it>, <it>DNAREP1</it>) is an exception to these observations <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. <it>D. melanogaster </it>contains thousands of copies of <it>DINE-1 </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. All copies appear to be non-autonomous, and analyses of their divergence patterns suggest that <it>D. melanogaster DINE-1 </it>has been inactive for over 4 million years <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Although <it>DINE-1 </it>was originally suggested to be a SINE-like retroelement, we have suggested that it is more likely to be a MITE, based on analysis of <it>DINE-1 </it>elements in <it>D. yakuba </it>that show evidence of recent transpositional activity <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We discuss below the structural features of <it>DINE-1 </it>supporting this designation, as well as the more recent proposal <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> that <it>DINE-1</it>s are members of the <it>Helitron </it>family of TEs.</p>
         <p>Several earlier studies found high copy TEs that we here classify as <it>DINE-1</it>. Vivas <it>et al</it>. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> discovered an element in <it>D. subobscura </it>called <it>GEM </it>that is composed of repetitive modules, one of which they also found in the <it>D. melanogaster </it>and <it>D. virilis </it>genomes. Miller <it>et al</it>. <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> characterized an abundant element called <it>SGM </it>in <it>D. subobscura</it>, <it>D. guanche </it>and <it>D. madeirensis</it>, noted its similarity to <it>GEM</it>, and also described that other species, including <it>D. melanogaster </it>and <it>D. virilis</it>, have similar sequences; <it>GEM </it>and <it>SGM </it>are the same as <it>DINE-1</it>. Wilder and Hollocher <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> subsequently discovered an element in a number of <it>Drosophila </it>species that they called <it>mini-me </it>and noted its similarity to <it>D. melanogaster DINE-1</it>. However, a comprehensive assessment of the abundance and transpositional dynamics of <it>DINE-1 </it>has not been reported. Here we expand our study of the evolutionary dynamics of <it>DINE-1 </it>using the recently available genome sequences of 12 <it>Drosophila </it>species <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. We found that <it>DINE-1</it>-related sequences are not only highly abundant in all 12 species, but also share a similar sequence structure, suggesting that a common mechanism was used for their transposition. Different lineages, however, show different distributions of divergence, suggesting that <it>DINE-1 </it>has gone through multiple cycles of transposition and subsequent silencing.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Identification and common sequence structure characteristics of <it>DINE-1</it>s in 12 <it>Drosophila </it>species</p>
            </st>
            <p>Previously, we discovered that <it>DINE-1 </it>is highly abundant and appears to have experienced a recent transpositional burst in the lineage leading to <it>D. yakuba </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Using the <it>D. yakuba DINE-1 </it>consensus sequence, we searched using BLAST for related sequences in 11 other sequenced genomes of <it>Drosophila </it>(Figure <figr fid="F1">1</figr>). This initial screen suggested that all 12 <it>Drosophila </it>species contain hundreds to thousands of copies of <it>DINE-1</it>-related sequences. To infer the structure of <it>DINE-1 </it>in each species, we manually aligned 50 sequences with the highest BLAST scores from each species. Then we aligned together these sequences from all the species. This analysis revealed that <it>DINE-1</it>-related sequences from the ten newly analyzed <it>Drosophila </it>species share a number of structural similarities with <it>DINE-1 </it>from <it>D. yakuba </it>and <it>D. melanogaster </it>(Figure <figr fid="F2">2</figr>; Table <tblr tid="T1">1</tblr>; also see Additional data file 1).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Flow chart of the strategy for identifying <it>DINE-1 </it>sequences in the 12 <it>Drosophila </it>genomes</p>
               </caption>
               <text>
                  <p>Flow chart of the strategy for identifying <it>DINE-1 </it>sequences in the 12 <it>Drosophila </it>genomes.</p>
               </text>
               <graphic file="gb-2008-9-2-r39-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The generalized structure of <it>DINE-1 </it>sequences from 12 <it>Drosophila </it>genomes, and alignment of the <it>DINE-1 </it>consensus sequences from 12 species with each feature boxed</p>
               </caption>
               <text>
                  <p>The generalized structure of <it>DINE-1 </it>sequences from 12 <it>Drosophila </it>genomes, and alignment of the <it>DINE-1 </it>consensus sequences from 12 species with each feature boxed. The element contains two conserved blocks, A and B. Within block A, the sequence can be further divided into two parts, A1 and A2, separated by a region of variable length containing the tandem repeats (CCGT)<sub>n</sub>(CTGT)<sub>n</sub>. Between blocks A and B is a region of central repeats, containing species-specific repeats. These central repeat sequences do not share homology among species; the length of the repeat unit can range from approximately 50 bp to approximately 500 bp, and the number of repeats is also variable within species. Locations of the subTIRs are shown as gray arrows; see Table 1 for precise designations of subTIR sequences. The 5' end also contains a second inverted repeat (IR) sequence that is partially complementary to the 5'-end terminal repeat and is shown as a gray arrowhead. An inverted repeat near the 3' end forms a potential stem-loop structure and is indicated by white arrowheads.</p>
               </text>
               <graphic file="gb-2008-9-2-r39-2"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Abundance and sequence diversity of <it>DINE-1 </it>in 12 <it>Drosophila </it>species</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="left">
                        <p>subTIR*</p>
                     </c>
                     <c ca="center">
                        <p>Insertion preference</p>
                     </c>
                     <c ca="center">
                        <p>No. of copies<sup>&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>Average % identity (STD)<sup>&#8225;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. melanogaster</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCGTTACTC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>355</p>
                     </c>
                     <c ca="center">
                        <p>89.66 (3.57)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. simulans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCGTTACTC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>478</p>
                     </c>
                     <c ca="center">
                        <p>90.06 (3.36)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. sechellia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCGTTACTC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>502</p>
                     </c>
                     <c ca="center">
                        <p>90.16 (3.38)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. yakuba</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCGTTACTC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>5,424</p>
                     </c>
                     <c ca="center">
                        <p>96.43 (3.31)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. erecta</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCGTTACTC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>1,013</p>
                     </c>
                     <c ca="center">
                        <p>91.08 (3.41)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. ananassae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TATACCCTTGCAG</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>5,027</p>
                     </c>
                     <c ca="center">
                        <p>97.15 (3.32)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. persimilis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TATACCCGATACT</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>1,103</p>
                     </c>
                     <c ca="center">
                        <p>96.63 (3.29)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. pseudoobscura</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TATACCCGATACT</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>1,047</p>
                     </c>
                     <c ca="center">
                        <p>95.39 (4.24)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>D. willistoni </it>type 1</p>
                     </c>
                     <c ca="left">
                        <p>TATACCATACACC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>2,396</p>
                     </c>
                     <c ca="center">
                        <p>96.70 (4.13)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>D. willistoni </it>type 2</p>
                     </c>
                     <c ca="left">
                        <p>TATACCCTTGCAA</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>2,828</p>
                     </c>
                     <c ca="center">
                        <p>96.13 (3.47)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>D. willistoni </it>type 3</p>
                     </c>
                     <c ca="left">
                        <p>TATACCCTTGCAG</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>1,073</p>
                     </c>
                     <c ca="center">
                        <p>96.69 (2.93)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. mojavensis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCTGAACCC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>5,190</p>
                     </c>
                     <c ca="center">
                        <p>93.17 (4.64)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. virilis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATACCCTGAACCC</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>3,222</p>
                     </c>
                     <c ca="center">
                        <p>94.40 (3.20)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. grimshawi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TACCCTGAACCCA</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>334</p>
                     </c>
                     <c ca="center">
                        <p>87.89 (4.34)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*subTIR of <it>D. mojavensis </it>is from the 5' side; the 3' subTIR has a single mismatch (Figure 2). <sup>&#8224;</sup>Number of hits found by BLAST search using block A consensus of <it>DINE-1 </it>of each species. Only hits > 100 bp in length were included. <sup>&#8225;</sup>Means (and standard deviations (STD) in parentheses) of percentage identity between the <it>DINE-1 </it>consensus to all BLAST hits of block A.</p>
               </tblfn>
            </tbl>
            <p>We previously defined <it>D. yakuba DINE-1 </it>as beginning at its 5' inverted repeat, based on the assumption that this sequence is a TIR <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We have here placed this repeat one nucleotide internal to <it>D. yakuba DINE-1</it>. This change in based on the recent designation of <it>DINE-1 </it>as a <it>Helitron </it>element <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B28">28</abbr></abbrgrp>. Our designation of the boundaries of <it>DINE-1 </it>differs by one nucleotide from that in <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, based on our analysis presented below of polymorphic insertions in <it>D. yakuba </it>(Figure <figr fid="F3">3</figr>). The precise boundaries of <it>DINE-1 </it>in each species are difficult to determine because of their preference for inserting in T-rich regions. We have annotated the sequences in Figure <figr fid="F2">2</figr> in order to maximize similarity to <it>D. yakuba DINE-1</it>. Based on this alignment the 5' inverted repeat ranges from being terminal to 2 bp internal; in all species the corresponding 3' inverted repeat is clearly internal. We thus refer to these repeats as subterminal inverted repeats (subTIRs).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Analysis of ten sites that are polymorphic for <it>DINE-1 </it>insertions in natural populations of <it>D. yakuba</it></p>
               </caption>
               <text>
                  <p>Analysis of ten sites that are polymorphic for <it>DINE-1 </it>insertions in natural populations of <it>D. yakuba</it>. For each site, the sequence from a strain containing a <it>DINE-1 </it>insertion is shown at the top, and the sequence from a strain lacking the insertion is shown at the bottom. Only the terminal sequences of <it>DINE-1 </it>and its flanking sequences are shown. The 5' subTIR is shown in bold italics. Insertions 1-3 were previously reported in [23]. The interpretation of these data depends on the designation of the <it>DINE-1 </it>termini and whether insertion causes a TSD. <b>(a) </b>Analysis using the annotation of <it>DINE-1 </it>structure presented in this paper. This annotation places the subTIR of <it>D. yakuba </it>1 bp internal to the 5' end. It also assumes that no TSD is created, in accord with the proposed mechanism of <it>Helitron</it>-type replication [24]. Under these stipulations, all ten insertions occur between the dinucleotide TT in the consensus sequence WTT (where W = A or T), and eight of ten match a longer consensus sequence of insertion after the second nucleotide in the sequence of WTTTT. <b>(b) </b>Analysis assuming the <it>DINE-1 </it>termini of [28], and <it>Helitron</it>-type replication. Only sites 1 and 10 are shown. Under this annotation, <it>DINE-1 </it>would have an insertion preference for WT dinucleotides. <b>(c) </b>Analysis assuming that the <it>DINE-1 </it>5' end begins at its inverted repeat, inserts between the dinucleotide TT and causes a 2 bp TSD, as in MITE-like DNA transposons. The TSDs caused by <it>DINE-1 </it>are boxed.</p>
               </text>
               <graphic file="gb-2008-9-2-r39-3"/>
            </fig>
            <p>The common features of <it>DINE-1 </it>from all species include: 13 bp subTIRs (the exact location of such repeats differs by 1-2 nucleotides among species); a partial inverted repeat next to the 5' subTIR; terminal regions that are relatively well-conserved within species, called blocks A and B; a GTCY-rich microsatellite repeat of variable length within block A; a variable central repeat region, which is responsible for most of the total length variation among elements; the lack of any significant open reading frames; and a propensity to insert between TT dinucleotides (discussed further below).</p>
            <p>Our analysis revealed one novel feature not previously described for <it>DINE-1</it>, <it>SGM </it>or <it>mini-me</it>, namely a short hairpin stem-loop structure (with 7-11 nucleotide-long stems) located a few nucleotides downstream of the 3' subTIR (Figure <figr fid="F2">2</figr>). The sequence of the self-complementary stem differs among species, suggesting that compensatory mutations maintain its structure. This stem-loop may function as a terminator during rolling-circle replication (see Discussion).</p>
            <p>Several of the features we characterized refine structural features inferred previously from <it>SGM </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and <it>mini-me </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. TIRs from <it>mini-me </it>were reported to vary from 10-20 bp in length from different species, while our analysis identified 13 bp TIRs in all species. These differences likely reflect the fact that we have analyzed many more sequences. The 13 bp subTIRs from <it>D. melanogaster </it>and <it>D. virilis </it>that we describe contain the 10 and 11 bp sequences reported by Wilder and Hollocher <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Likewise, the 17 bp TIR reported previously for <it>D. subobscura mini-me </it>contains the 13 bp subTIR reported here for <it>D. pseudoobscura</it>, with one internal base-pair difference. The partial inverted repeat flanking the 5' subTIR we identified is more variable than that reported based on two copies of <it>mini-me</it>, being a partial and/or interrupted repeat of the 5' subTIR in different species.</p>
            <p>Wilder and Hollocher <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> also reported that <it>mini-me </it>elements from all species contain a highly conserved 33 bp core region. We find that this conserved core region actually extends over 90 bp, including the (TA)<sub>4 </sub>repeat and additional sequences 5' to this repeat. These sequences partially overlap with the LS module described for <it>SGM </it>elements <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The striking conservation of this core among the 12 species suggests that it is of functional significance for <it>DINE-1 </it>transposition.</p>
            <p>We previously proposed that transposition of <it>D. yakuba DINE-1 </it>creates a dinucleotide (TT) TSD upon insertion <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. However, based on analysis of the genome sequence from a single strain this conclusion was not definitive because some copies did not have a TT dinucleotide flanking both sides. This could be due either to accumulated mutations after insertion or because the TT site preference is not absolute. We performed a preliminary test of this hypothesis by comparing the sequences of three polymorphic insertion sites of <it>DINE-1 </it>in multiple strains of <it>D. yakuba </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Here we extended this analysis with seven additional <it>DINE-1 </it>insertions that are polymorphic among different <it>D. yakuba </it>strains (see Materials and methods).</p>
            <p>We found that all of these ten sites have a similar sequence structure (Figure <figr fid="F3">3</figr>). The interpretation of these data, however, depends on where precisely <it>DINE-1 </it>starts. Our previous interpretation of <it>DINE-1 </it>beginning at the 5' subTIR is consistent with insertions causing a TT TSD (Figure <figr fid="F3">3c</figr>). With the re-designation of the 5' end of <it>DINE-1</it>, these two nucleotides are instead part of the element, and <it>DINE-1 </it>would not create a TSD. The lack of a TSD is consistent with the proposal that <it>DINE-1</it>s are <it>Helitrons </it>(see below). Based on our designation of the <it>DINE-1 </it>boundaries, all <it>D. yakuba </it>insertions occur between the dinucleotide TT (Figure <figr fid="F3">3a</figr>). Using the <it>DINE-1 </it>boundaries from <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, the insertion site preference is more variable (Figure <figr fid="F3">3b</figr>). We then examined the sequences flanking the putative <it>DINE-1</it>s identified in the other <it>Drosophila </it>species. The majority (> 80%) of <it>DINE-1</it>s are flanked by TT dinucleotides. The conservation of this site preference in all 12 species, combined with the numerous other similarities described above, strongly suggests that each of these elements is in fact a species-specific <it>DINE-1 </it>and that they likely share a common mechanism of transposition.</p>
         </sec>
         <sec>
            <st>
               <p>Relationships of <it>DINE-1</it>s within and among species</p>
            </st>
            <p>Two pieces of evidence demonstrate that <it>DINE-1 </it>is highly homogeneous within 11 of the 12 species, with <it>D. willistoni </it>discussed below as being exceptional. First, we performed BLAST searches using the 90 bp sequence of the core region, which is conserved among all types of <it>DINE-1</it>s. By comparing the TIRs and block A sequences we found that each species contains only one type of <it>DINE-1</it>. Second, we searched for <it>DINE-1</it>s in one genome using the <it>DINE-1 </it>consensus sequences from other genomes as queries, and found only the same sets of sequences.</p>
            <p>Among the 11 species (again excluding <it>D. willistoni</it>), there are 5 different subTIRs (Table <tblr tid="T1">1</tblr>). All five <it>melanogaster </it>subgroup species have the same subTIR sequence. <it>D. ananassae </it>has a unique subTIR, while the closely related species <it>D. pseudoobscura </it>and <it>D. persimilis </it>share the same subTIR, as do <it>D. virilis </it>and <it>D. mojavensis</it>. The <it>DINE-1 </it>subTIR from <it>D. grimshawii </it>shares 12/13 bp with <it>D. virilis </it>and <it>D. mojavensis</it>. The central repeat is the most diverse region of <it>DINE-1 </it>among species. Even species sharing the same subTIRs, such as <it>D. virilis </it>and <it>D. mojavensis</it>, have unrelated central repeat regions.</p>
            <p>Analysis of <it>D. willistoni </it>gave uniquely different results. <it>D. willistoni </it>contains three different subtypes of <it>DINE-1</it>s, each with different subTIRs and different central repeat sequences (Figure <figr fid="F2">2</figr>). Phylogenetic evidence presented below suggests that they have at least two independent evolutionary origins.</p>
         </sec>
         <sec>
            <st>
               <p>Abundance and divergence of <it>DINE-1</it>s within species</p>
            </st>
            <p><it>DINE-1 </it>is highly abundant in all 12 <it>Drosophila </it>species. It is difficult to determine an exact number because each species contains small and fragmented copies that cannot always be unambiguously identified as <it>DINE-1</it>s. We therefore used stringent criteria to identify <it>DINE-1</it>s in order to obtain a reliable comparison among species (Table <tblr tid="T1">1</tblr>). For example, this search identified 355 copies in <it>D. melanogaster </it>compared to previous analyses that suggested that <it>D. melanogaster </it>has approximately 1,000 copies <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Using identical search criteria, we found vast differences in the copy number of <it>DINE-1</it>s among species, ranging from 334 in <it>D. grimshawii </it>to 6,297 in <it>D. willistoni</it>.</p>
            <p>We identified similar numbers of <it>DINE-1</it>s in the <it>D. melanogaster </it>sister species <it>D. simulans </it>and <it>D. sechellia </it>compared to <it>D. melanogaster</it>. In contrast, more than ten-fold more copies were identified in <it>D. yakuba</it>. This high copy number is due to the large number of closely related copies in <it>D. yakuba</it>, and is consistent with previous work that suggested that <it>DINE-1 </it>has been inactive in <it>D. melanogaster </it>but underwent a recent transpositional burst in <it>D. yakuba </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <p>We therefore sought to determine whether other species with high copy number also show evidence of recent transpositional bursts. We used BLAST percent identity scores as an approximate method to estimate divergence among individual copies within species (Table <tblr tid="T1">1</tblr>; Figure <figr fid="F4">4</figr>). This method accurately recapitulates previous analyses for <it>D. yakuba </it>and <it>D. melanogaster </it>that were based on estimates of per-site divergence <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>: <it>DINE-1</it>s from <it>D. melanogaster </it>have a broad peak of identities centered approximately around 90%, while <it>D. yakuba </it>shows a peak from approximately 96-100%, with a long tail of more diverged copies. These differences are highly significant (Mann-Whitney <it>U </it>test, two-tailed, <it>p </it>&lt; 0.001).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The frequency distribution of sequence identity of <it>DINE-1 </it>in different species</p>
               </caption>
               <text>
                  <p>The frequency distribution of sequence identity of <it>DINE-1 </it>in different species. The percentage identity was based on BLAST search, using consensus sequences of part A1 of block A from each species as the query. To exclude short and fragmented sequences from our analysis, only hits > 100 bp were used. Note that the y-axis scale differs among species.</p>
               </text>
               <graphic file="gb-2008-9-2-r39-4"/>
            </fig>
            <p><it>D. simulans </it>and <it>D. sechellia </it>show distributions similar to <it>D. melanogaster </it>(Figure <figr fid="F4">4</figr>; <it>p </it>> 0.05) and have average percent identities around 90-91% (Table <tblr tid="T1">1</tblr>). <it>D. erecta </it>has an average percent identity more similar to <it>D. melanogaster </it>than to <it>D. yakuba</it>; however, its distribution is significantly different from both species (<it>p </it>&lt; 0.001). These data suggest that <it>D. yakuba </it>is the only <it>melanogaster </it>subgroup species that experienced a recent transpositional burst. <it>D. grimshawii </it>also has a distribution with very few copies of high similarity, and a similar copy number to <it>D. melanogaster</it>, suggesting that <it>DINE-1 </it>has not been recently active in this species. In contrast, <it>DINE-1</it>s from <it>D. pseudoobscura</it>, <it>D. persimilis</it>, and <it>D. ananassae </it>have average percent identities > 95% with distributions highly skewed toward young copies, suggesting recent transpositional bursts in these species. The distributions in <it>D. pseudoobscura </it>and <it>D. persimilis </it>are significantly different (<it>p </it>&lt; 0.001; see Discussion). <it>D. ananassae </it>in particular stands out for having many copies identical to the consensus sequence (in the block A region). <it>D. virilis </it>and <it>D. mojavensis </it>also have substantial numbers of young copies but more broad distributions, suggesting the possibility that multiple rounds of transposition, silencing and reactivation may have occurred in these species.</p>
            <p><it>D. willistoni </it>has more than 1,000 copies of each of its three subtypes (Table <tblr tid="T1">1</tblr>), with subtype 1 and 3 having about twice as many copies as subtype 2. Each subtype has a peak near 100% identity, suggesting recent transpositional activity; however, their distributions are significantly different from each other (<it>p </it>&lt; 0.001). Interestingly, these subtypes also have different phylogenetic patterns (see below).</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic relationship of <it>DINE-1</it>s</p>
            </st>
            <p>These very different estimates of <it>DINE-1 </it>divergence within different species, and in particular our evidence for recent transpositional bursts, raises the question of whether <it>DINE-1 </it>may have undergone horizontal transmission into some <it>Drosophila </it>species. To understand the evolutionary dynamics of <it>DINE-1</it>s and their association with their host species, we analyzed the phylogenetic relationship of <it>DINE-1 </it>consensus sequences from the 12 <it>Drosophila </it>species (Additional data file 1) and compared it with the known phylogeny of <it>Drosophila </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Because of the rapid evolution in the central repeat region, reliable alignment for phylogenetic reconstruction could be obtained only for blocks A and B.</p>
            <p>With the exceptions of <it>D. willistoni </it>and <it>D. ananassae</it>, the phylogenetic relationships of the <it>DINE-1 </it>sequences are, in general, consistent with the host species phylogeny (Figure <figr fid="F5">5</figr>). The grouping of <it>DINE-1 </it>in a separate clade containing <it>D. ananassae </it>and two of the three types from <it>D. willistoni </it>is surprising. The fact that this clade is an outgroup suggests that this result is not due to horizontal transfer from other <it>Drosophila </it>into these species. One possibility is that <it>DINE-1 </it>was horizontally transferred from non-<it>Drosophila </it>into these species. Alternatively, <it>DINE-1 </it>sequences resembling ancestral copies may have become reactivated in <it>D. ananassae </it>and <it>D. willistoni</it>. These sequences would be related to ancestral copies that were vertically inherited in the common ancestor of <it>Drosophila</it>.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Phylogenetic relationship of <it>DINE-1 </it>consensus sequences compared to their host species</p>
               </caption>
               <text>
                  <p>Phylogenetic relationship of <it>DINE-1 </it>consensus sequences compared to their host species. <b>(a) </b>Phylogenetic tree based on pooled sequences of block A and B (Additional data file 1) and constructed using the neighbor-joining method with the Jukes-Cantor one parameter substitution model [55]. Bootstrap resampling percentages based on 500 replications are indicated. Scale bar represents the estimated number of substitutions. <b>(b) </b>The host species phylogeny is adapted from [29]. Myrs, million years.</p>
               </text>
               <graphic file="gb-2008-9-2-r39-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p><it>DINE-1 </it>insertions in or near to genes</p>
            </st>
            <p><it>In situ </it>hybridizations to polytene chromosomes from several species using species-specific <it>DINE-1 </it>probes revealed strong signals in the heterochromatic chromocenter region, with additional hybridization observed along chromosome arms (data not shown). These non-chromocenter sites of hybridization led us ask whether <it>DINE-1 </it>insertions (whole or partial) are found in or near to protein-coding genes. <it>SGM</it>-related sequences were previously found in introns or adjacent to a number of <it>D. melanogaster </it>genes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. We found hundreds of <it>DINE-1 </it>insertions in predicted introns in all species (Table <tblr tid="T2">2</tblr>; Additional data file 1). We also found many <it>DINE-1 </it>copies within 1 kb of genes, which could potentially be in either untranslated regions or in regulatory regions. These results suggest that <it>DINE-1 </it>has had a significant impact on gene structure evolution throughout the Drosophilidae. Few <it>DINE-1 </it>insertions were found in predicted coding sequences (CDSs). Strikingly, the largest number (15) was found in <it>D. ananassae</it>, a species that has a very high copy number of highly similar (young) <it>DINE-1 </it>copies (Table <tblr tid="T1">1</tblr>; Figure <figr fid="F4">4</figr>). This result suggests that recent transpositional activity of <it>DINE-1 </it>in <it>D. ananassae </it>has resulted in mildly deleterious insertions into coding regions that have not yet been removed by selection.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Numbers of <it>DINE-1</it>s within or near to predicted genes</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="center">
                        <p>CDS</p>
                     </c>
                     <c ca="center">
                        <p>Intron</p>
                     </c>
                     <c ca="center">
                        <p>Flanking*</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. melanogaster</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>669</p>
                     </c>
                     <c ca="center">
                        <p>283</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. simulans</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>537</p>
                     </c>
                     <c ca="center">
                        <p>282</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. sechellia</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>582</p>
                     </c>
                     <c ca="center">
                        <p>291</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. yakuba</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>717</p>
                     </c>
                     <c ca="center">
                        <p>287</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. erecta</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>580</p>
                     </c>
                     <c ca="center">
                        <p>280</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. ananassae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>610</p>
                     </c>
                     <c ca="center">
                        <p>276</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. persimilis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>460</p>
                     </c>
                     <c ca="center">
                        <p>284</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. pseudoobscura</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>352</p>
                     </c>
                     <c ca="center">
                        <p>277</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. virilis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1,104</p>
                     </c>
                     <c ca="center">
                        <p>278</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. grimshawi</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>262</p>
                     </c>
                     <c ca="center">
                        <p>174</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. mojavensis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>625</p>
                     </c>
                     <c ca="center">
                        <p>292</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Only block A and B sequences were used for BLAST search. *1 kb upstream and downstream of the predicted coding regions.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Analysis of <it>Helitron </it>sequences in <it>D. yakuba </it>and <it>D. virilis</it></p>
            </st>
            <p>Kapitonov and Jurka <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B28">28</abbr></abbrgrp> recently proposed that <it>DINE-1 </it>is related to <it>Helitron</it>, a family of DNA-mediated TEs. They reported consensus sequences of autonomous and non-autonomous copies of <it>Helitron </it>in <it>D. yakuba </it>and <it>D. virilis</it>. The non-autonomous consensus sequences are closely similar to our consensus sequences reported here. The consensus autonomous copies have an open reading frame (ORF) encoding the RepHel protein found in many other <it>Helitrons</it>, and sequences at each end similar to what we report here for <it>DINE-1</it>. These include block A at the 5' end and block B at the 3' end. We searched these two species using the <it>RepHel </it>portion of the autonomous consensus sequence as a query to determine whether these species contain potentially active copies. Among the top ten hits in <it>D. yakuba</it>, none have a fully intact <it>RepHel </it>ORF. Three copies have <it>DINE-1 </it>sequences flanking both sides of the <it>RepHel </it>sequences; two of these have over 500 bp of <it>DINE-1 </it>sequence at each end while the third has only approximately 50-60 bp of <it>DINE-1 </it>sequence at each end. Six of the remaining copies have <it>DINE-1 </it>sequences flanking one side of the <it>RepHel </it>sequence, and the last hit has no flanking <it>DINE-1 </it>sequences. Among the top ten hits in <it>D. virilis </it>we again found no copies with a fully intact <it>RepHel </it>ORF. One copy has <it>DINE-1 </it>sequences flanking both sides and five copies have <it>DINE-1 </it>sequences flanking one side. Among the remaining copies, one is in a highly repetitive region and could not be further analyzed, and the remaining four copies have no flanking <it>DINE-1 </it>sequences. We conclude that <it>D. yakuba </it>and <it>D. virilis </it>are unlikely to contain currently active autonomous <it>Helitrons</it>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p><it>DINE-1 </it>is the most abundant repetitive sequence in the <it>Drosophila </it>genome. <it>DINE-1 </it>was first identified on the fourth chromosome of <it>D. melanogaster </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, and was suggested to be a non-autonomous retroelement, analogous to vertebrate SINEs. This argument was based on its high abundance, composing > 1% of the total genome, its small size and its lack of significant ORFs <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B27">27</abbr><abbr bid="B30">30</abbr></abbrgrp>. However, unlike known SINEs, <it>D. melanogaster DINE-1 </it>did not appear to have polymerase III promoter consensus sequences or similarity to tRNAs or other small RNAs.</p>
         <p>Subsequently, <it>DINE-1</it>-related sequences were found in other Dipteran species and were classified as novel TE families. Miller <it>et al</it>. <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, following earlier observations by Vivas <it>et al</it>. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, identified <it>SGM </it>from several <it>obscura </it>group species as well as related sequences in GenBank from at least eight other <it>Drosophila </it>species, and noted its possible similarity to MITEs <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. They further suggested that <it>SGM </it>elements composed approximately 10% of the <it>D. guanche </it>genome. Wilder and Hollocher <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> identified '<it>mini-me</it>' and characterized its sequence structure based on approximately 80 clones isolated from 2 species of the <it>cardini </it>group, <it>D. dunni </it>and <it>D. nigrodunni</it>, and 28 sequences from 14 different species obtained from GenBank. <it>mini-me </it>was classified as a non-autonomous retroelement, although no direct relationship to previously known retroelements was observed.</p>
         <p>Previously, Yang <it>et al</it>. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> identified a recent transpositional burst of <it>DINE-1 </it>in the genome of <it>D. yakuba</it>. The analysis of highly similar, newly inserted <it>DINE-1</it>s in this species allowed for a more detailed characterization of <it>DINE-1 </it>sequence structure. We concluded that <it>DINE-1 </it>is more likely to be a non-autonomous DNA transposon, similar to MITEs first described in maize <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, rather than a SINE-like retroelement, based on the existence of perfect terminal and subterminal inverted repeats and a TSD (TT), which are typical characteristics of DNA transposons. Moreover, the lack of polymerase III binding sites or tRNA-related structures in these recently inserted copies argued against <it>DINE-1 </it>being similar to SINEs <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Bergman <it>et al</it>. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> also characterized <it>DINE-1 </it>as being a TIR transposon.</p>
         <p>In order to understand the origin and distribution of <it>DINE-1 </it>in the Drosophilidae, we expanded our search to ten additional partial or complete <it>Drosophila </it>genome databases using the consensus sequence of <it>D. yakuba DINE-1</it>. Strikingly, we found that sequences related to <it>D. yakuba DINE-1 </it>are very abundant in all these genomes (Table <tblr tid="T1">1</tblr>). BLAST searches did not find any related sequences in the mosquito, silk worm or other eukaryotic genomes, suggesting that <it>DINE-1 </it>is unique to Diptera. <it>DINE-1</it>-related sequences from all the <it>Drosophila </it>species share the same sequence structure that was defined from <it>D. yakuba DINE-1 </it>or from <it>mini-me</it>, with each containing: highly conserved blocks A and B at both ends, including a core region of approximately 90 bp in block A; a central repeat region of variable length; inverted repeats 13 nucleotides long at or near the 5' end and close to the 3' end; and insertion preference for T-rich regions (Figure <figr fid="F2">2</figr>). The sequences of the central repeat region from different species are very different, suggesting non-homologous origins of this region among species. In contrast, the within species divergence of this region is much smaller.</p>
         <p>Our comparison of <it>DINE-1 </it>from 12 species revealed a previously unobserved 3' inverted repeat structure that could potentially form a stem-loop (Figure <figr fid="F2">2</figr>). It is important to note that in the absence of any internal ORFs, the designation of 5' and 3' for <it>DINE-1 </it>is arbitrary. The presence of potential stem-loops near both ends of <it>DINE-1 </it>raises the possibility that these structures are recognized by a reverse transcriptase, which would imply that <it>DINE-1 </it>is in fact a non-autonomous retroelement. However, considering all the evidence outlined above, we suggest that <it>DINE-1 </it>transposition is DNA mediated.</p>
         <p>It was thought previously that MITE-like DNA transposons are rare in <it>Drosophila</it>, with only a few having been identified. One example of a <it>Drosophila </it>MITE is derived from <it>pogo</it>-like transposons in <it>D. melanogaster </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Other examples are <it>Vege </it>and <it>Mar</it>, derived from the autonomous TE <it>hobo </it>of the <it>hAT </it>superfamily in <it>D. willistoni </it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. However, unlike most MITEs, which are usually highly abundant in the host genome, only a few copies (&lt; 10) of <it>Vege </it>and <it>Mar </it>were found in the genomes of their <it>Drosophila </it>hosts <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         <p>Non-autonomous DNA transposons require an external source of transposase for transposition. For many TEs transposase initiates transposition by recognizing and binding to the TIR sequence, and this interaction is highly specific <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp> Recently, Feschotte <it>et al</it>. <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> have shown that autonomous <it>mariner</it>-like transposase can not only interact with its own TIR, but can also interact with the TIR of <it>Stowaway </it>MITEs in rice. This provides strong evidence that <it>Stowaway </it>MITEs may use <it>mariner</it>-like TEs as their source for transposase.</p>
         <p>Casola <it>et al</it>. <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> recently identified several <it>Drosophila </it>PIF-like transposons (DPLTs), which are found among <it>Drosophila </it>in both apparently autonomous and non-autonomous forms. Neither the TIR nor TSD sequences of these transposons match that of <it>DINE-1</it>, which suggests that they are not the autonomous parental copies of <it>DINE-1</it>. Intriguingly, however, <it>DPLT1 </it>has apparently active copies in <it>D. yakuba</it>, <it>D. pseudoobscura</it>, <it>D. persimilis </it>and <it>D. willistoni </it>and only inactive MITE-like copies in <it>D. melanogaster</it>, <it>D. simulans</it>, <it>D. sechellia</it>, <it>D. erecta </it>and <it>D. mojavensis</it>. This pattern closely resembles the division seen here for species that either do or do not show evidence for recent transpositional bursts of <it>DINE-1 </it>(Figure <figr fid="F4">4</figr>). These shared patterns suggest that species such as <it>D. yakuba </it>have experienced recent and ongoing movement of several DNA transposon families.</p>
         <sec>
            <st>
               <p><it>DINE-1</it>: MITE or <it>Helitron</it>?</p>
            </st>
            <p><it>DINE-1 </it>has many features characteristic of MITEs - small size, lack of coding potential, high copy number, and frequent association with genes. On the other hand, most MITEs have TIRs, which are presumably sites of transposase binding. A few MITE-like elements have been discovered that have subTIRs rather than TIRs but their corresponding autonomous elements have not been identified <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. <it>DINE-1 </it>has inverted repeats and their conservation in structure despite ongoing changes in primary sequence argues strongly that they are of functional importance. We have placed the 5' inverted repeat 0-2 nucleotides internal to the end of <it>DINE-1 </it>in different species. Under this annotation, <it>D. yakuba DINE-1 </it>insertions would not cause a TSD (Figure <figr fid="F3">3</figr>). If the true 5' end of <it>DINE-1 </it>instead corresponds to the 5' inverted repeat, then <it>D. yakuba DINE-1 </it>insertion would cause a 2 bp TSD, as seen in other MITEs. The 3' inverted repeat, however, is clearly subterminal, which would be unusual for a MITE element.</p>
            <p>Kapitonov and Jurka <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B28">28</abbr></abbrgrp> have recently proposed that <it>DINE-1 </it>is instead a non-autonomous <it>Helitron </it>element. They noted that <it>DINE-1 </it>has a number of features unusual for <it>Helitrons</it>. One was the absence of a short hairpin or palindrome at the 3' end, which is thought to function as a replication terminator. We have identified here a 3' hairpin structure in all 12 species that may fulfill this function. A number of unusual features remain. Foremost are the termini. <it>Helitrons </it>do not contain TIRs but instead have highly conserved 5' TC or 3' CTRR sequences. In contrast, <it>DINE-1 </it>lacks these short termini sequences but instead contains conserved subTIRs. The presence of relatively long blocks of conserved sequence between non-autonomous <it>DINE-1 </it>and the proposed autonomous copies also contrasts with other species. For example, bats contain several families of very high copy number non-autonomous <it>Helitrons</it>, which differ almost entirely from their autonomous master copies other than at their di- and tetra-nucleotide termini <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
            <p>The most decisive evidence favoring the <it>Helitron </it>hypothesis is the association of <it>DINE-1 </it>elements with non-functional but recognizable partial ORFs of the RepHel protein in <it>D. yakuba </it>and <it>D. virilis</it>, making these copies the candidate autonomous elements responsible for the recent transpositional bursts of non-autonomous <it>DINE-1</it>s in these species. Considering some of the unusual features mentioned above, it will be of great interest to investigate experimentally the mechanism of <it>DINE-1 </it>transposition.</p>
         </sec>
         <sec>
            <st>
               <p><it>DINE-1 </it>in the <it>melanogaster </it>subgroup</p>
            </st>
            <p>From our previous study <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, we found that <it>D. melanogaster </it>and <it>D. yakuba </it>contain structurally similar types of <it>DINE-1</it>s. The species differed significantly, however, in their distributions of sequence divergence and the chromosomal location of their <it>DINE-1 </it>copies. <it>D. yakuba </it>contains many similar copies, and these apparently younger copies have a higher relative frequency in euchromatic regions compared to older, more diverged copies. We hypothesized that <it>DINE-1</it>s in <it>D. melanogaster </it>and <it>D. yakuba </it>derive from a common ancestor that existed before the divergence of the <it>melanogaster </it>subgroup species. This hypothesis was tested here by our identification of <it>DINE-1 </it>from three other species of the <it>melanogaster </it>subgroup, <it>D. simulans</it>, <it>D. sechellia </it>and <it>D. erecta</it>. All five subgroup species share the same TIRs, core, central repeat unit, and 3' end stem-loop sequences. <it>DINE-1</it>s from the three newly characterized species have similar copy numbers and distributions of sequence divergence, an observation consistent with the hypothesis that <it>DINE-1 </it>was active and then silenced in the common ancestor of the <it>melanogaster </it>subgroup. <it>D. yakuba </it>is the only species showing evidence of a second, recent transpositional burst. We did find that <it>DINE-1</it>s from <it>D. erecta </it>have a different sequence in the region joining the central repeat to block B, suggesting that this is the most rapidly evolving region of <it>DINE-1</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Dynamic nature and genomic impact of <it>DINE-1</it></p>
            </st>
            <p>Our analysis reveals that several species outside the <it>melanogaster </it>subgroup have distributions of <it>DINE-1 </it>identity similar to that described above for <it>D. yakuba</it>, suggesting that <it>DINE-1 </it>has undergone multiple, independent transpositional bursts. <it>D. ananassae </it>and <it>D. willistoni </it>show the strongest evidence, with distributions skewed toward 100% (Figure <figr fid="F4">4</figr>). <it>D. virilis </it>has a somewhat broader distribution, with many similar copies suggestive of recent transpositional activity. <it>D. mojavensis </it>shows a broad distribution that is suggestive of multiple rounds of transposition and silencing at different times.</p>
            <p><it>D. pseudoobscura </it>and <it>D. persimilis </it>have distributions with peaks around 98-99% identity. These species diverged less than one million years ago <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, which might suggest that the similarly high identity in both species reflects activity of <it>DINE-1 </it>before or during their speciation. However, the distributions are significantly different, with <it>D. pseudoobscura </it>retaining proportionally more copies of high divergence. One possible explanation is that <it>DINE-1 </it>remained active more recently in <it>D. persimilis</it>. Alternatively, the strength of selection against older copies may differ between the species.</p>
            <p>The discovery of multiple and relatively distant species each showing evidence for recently active <it>DINE-1 </it>copies raises the question of whether this element has been transmitted vertically or horizontally. The phylogenetic relationship among different <it>DINE-1</it>s (Figure <figr fid="F5">5</figr>) suggests vertical inheritance, with transpositional bursts resulting from existing copies escaping from host suppression. <it>DINE-1 </it>from <it>D. ananassae </it>and two subtypes from <it>D. willistoni </it>give a pattern discordant from the accepted species phylogeny but this pattern is also not consistent with a simple model of horizontal transfer among <it>Drosophila </it>species. Instead, we suggest that the phylogenetic pattern is likely to reflect reactivation of a related ancestral element in both <it>D. ananassae </it>and <it>D. willistoni</it>. Our analysis is necessarily limited by the relatively short sequences available for analysis. We suggest that further phylogenetic analysis of the autonomous elements from each species will help to further understand the evolution of <it>DINE-1</it>. Nevertheless, the combination of our phylogenetic analysis and the divergence data indicates that the activity of <it>DINE-1 </it>is extremely dynamic. The activation and suppression of the element seems to have evolved rapidly and repeatedly in multiple lineages leading to the 12 species.</p>
            <p>Similar dynamics of transposition and suppression are found in LINEs and SINEs of mammalian genomes <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> and MITEs in plant genomes (see review in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>). Some insertions of MITEs in plants have been shown to affect gene regulation <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. We have found that <it>DINE-1 </it>insertions are frequently found in the flanking regions and introns of genes, suggesting that some copies may also influence gene regulation (Table <tblr tid="T2">2</tblr>).</p>
            <p>Highly abundant interspersed repetitive sequences can also serve as targets for ectopic recombination. Such recombination may be deleterious by promoting genome instability <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>, but may also catalyze structural evolution of existing genes and contribute to new gene formation <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. <it>DINE-1 </it>is a candidate for causing analogous phenomenon in <it>Drosophila</it>. The testis-expressed gene <it>hydra </it>is one well-characterized example <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. <it>hydra </it>exists only in the <it>melanogaster </it>subgroup, and its exon 1 has undergone multiple independent duplications. Many of these duplicated exon 1s are flanked by <it>DINE-1 </it>insertions, which suggests that <it>DINE-1 </it>may have facilitated some of these duplications by providing homologous target sequences for unequal crossing over. Given its high abundance and evidence for multiple rounds of transpositional activity, <it>DINE-1 </it>has clearly had a significant impact on <it>Drosophila </it>genome evolution, and we suggest that other examples of gene structural evolution associated with <it>DINE-1 </it>will be found among these species.</p>
            <p><it>DINE-1 </it>can also be a valuable system for studying rates and patterns of mutations. One can study <it>de novo </it>mutations in species that have had recent transpositional bursts by comparing the sequence variation among young, recently inserted <it>DINE-1 </it>copies. One can also use <it>DINE-1 </it>to examine substitution patterns between species. Previous comparative analysis of the chromosome distribution of <it>DINE-1 </it>in <it>D. yakuba </it>and <it>D. melanogaster </it>suggests that most new insertions are eliminated from the genome by negative selection <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Old copies that remain are thus likely to be evolving neutrally. One could therefore identify orthologous insertions between <it>D. melanogaster </it>and <it>D. simulans</it>, whose insertions must predate the divergence of these species, in order to infer the substitution pattern along lineages leading to both species. The ability to perform similar studies in multiple <it>Drosophila </it>species will allow unprecedented power for determining whether and how patterns of mutations vary in different lineages.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Identifying <it>DINE-1</it>-related sequences from 12 <it>Drosophila </it>genomes</p>
            </st>
            <p>Using the D. yakuba DINE-1 consensus sequence as a query, we searched for DINE-1-related sequences in all 12 Drosophila genome databases (from Comparative Assembly Freeze 1 (CAF1) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>) using BLAST with the default setting of the parameters (Figure <figr fid="F1">1</figr>). Note that D. persimilis and D. sechellia were sequenced at only approximately three- to four-fold coverage and, thus, are incomplete. We retrieved the 50 copies of DINE-1 with the lowest E-value in each species, aligned them, and derived a consensus sequence for each species (Additional data file 1).</p>
         </sec>
         <sec>
            <st>
               <p>Sequence divergence and copy number among <it>DINE-1</it>s</p>
            </st>
            <p>We then BLAST-searched each genome using part of the consensus sequences of <it>DINE-1 </it>(5' end to end of core sequence) from its own species, using the default settings of the program. All BLAST hits greater than 100 bp were retained. The frequencies of percent identity between the query sequence and all hits were plotted for each species.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence alignment and phylogenetic analysis</p>
            </st>
            <p>Sequences were aligned using ClustalW <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> with the default parameter settings. Alignments were further improved by manual adjustment. Inferred phylogenetic trees of the species consensus of <it>DINE-1 </it>were constructed using the neighbor-joining method with bootstrap resampling (500 replicates) using MEGA 3.0 <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Searching for <it>DINE-1 </it>within or near genes</p>
            </st>
            <p>The UCSC Genome Browser Gateway <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> was used to obtain locations of <it>DINE-1 </it>in the annotated genomes, with the exception of <it>D. willistoni</it>. DNA sequences were retrieved using the Genes and Gene Prediction tracks (track setting: Other RefSeq) and grouped into the following categories: category 1, 1,000 bp upstream of CDS; category 2, CDS; category 3, introns; and category 4, 1,000 bp downstream of CDS. Categories 1 and 4 were then merged into a single class of flanking sequences. We then performed BLAST search to each of these three classes of sequences, using the block A regions of the <it>DINE-1 </it>consensus of each species. Only hits longer than 40 bp with an E-value lower than 10<sup>-5 </sup>were included.</p>
         </sec>
         <sec>
            <st>
               <p>Characterizing target site duplication of <it>DINE-1</it></p>
            </st>
            <p>From the <it>D. yakuba </it>genome database, seven sites of <it>DINE-1 </it>insertion with sequence similarity > 97% to the <it>D. yakuba DINE-1 </it>consensus were chosen for analysis. PCR primers complementary to the 100 bp flanking sequence of each site were designed (Additional data file 2). A total of ten lines of <it>D. yakuba</it>, including nine Cy lines from a natural population (gift from Dr Peter Andolfatto at UCSD) and the strain Tai18E2, which was used for whole-genome sequencing, were checked for the presence of <it>DINE-1 </it>insertions at each site. Genomic DNA was phenol-chloroform extracted from 20-30 flies per line followed by ethanol precipitation. The program for PCR reaction was: 94&#176;C for 5 minutes, followed by 30 cycles of 94&#176;C (30s), 60&#176;C (30s), and 70&#176;C (1 minute), and extension at 70&#176;C for 7 minutes. For lines not containing the <it>DINE-1 </it>insertion, PCR products were directly sequenced using ABI BigDye (Applied Biosystems, Foster City, CA, USA) technologies.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>CDS, coding sequence; <it>DINE-1</it>, <it>Drosophila </it>interspersed element 1; LINE, long interspersed elements; MITE, miniature inverted-repeat transposable element; ORF, open reading frame; SINE, short interspersed elements; TE, transposable element; TIR, terminal inverted repeat; TSD, target site duplication.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>H-PY designed the research, performed the research and analyzed the data. H-PY and DAB wrote the paper.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is the alignment of <it>DINE-1 </it>consensus sequences from the 12 <it>Drosophila </it>species. Additional data file <supplr sid="S2">2</supplr> is a table listing the genome locations and sequences of primers used for the presence/absence screen of <it>DINE-1</it>s in <it>D. yakuba</it>. Additional data file <supplr sid="S3">3</supplr> is a table listing <it>DINE-1 </it>insertions in or near to genes.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Alignment of <it>DINE-1 </it>consensus sequences from the 12 <it>Drosophila </it>species</p>
            </caption>
            <text>
               <p>Alignment of <it>DINE-1 </it>consensus sequences from the 12 <it>Drosophila </it>species.</p>
            </text>
            <file name="gb-2008-9-2-r39-S1.fst">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Genome locations and sequences of primers used for the presence/absence screen of <it>DINE-1</it>s in <it>D. yakuba</it></p>
            </caption>
            <text>
               <p>Genome locations and sequences of primers used for the presence/absence screen of <it>DINE-1</it>s in <it>D. yakuba</it>.</p>
            </text>
            <file name="gb-2008-9-2-r39-S2.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p><it>DINE-1 </it>insertions in or near to genes</p>
            </caption>
            <text>
               <p><it>DINE-1 </it>insertions in or near to genes.</p>
            </text>
            <file name="gb-2008-9-2-r39-S3.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We gratefully acknowledge Dr Pierre Capy, who pointed out the possibility of species-specific <it>DINE-1</it>s in <it>Drosophila</it>, and Dr Michael Ashburner and the anonymous reviewers for helpful comments on the manuscript. We thank Hsin-Chien Cheng and Jin-Yen Huang for helping with sequence alignments, Pei-San Li for technical support, and Dr David Waxman for statistical consultation. This work was supported by a grant to H-PY from National Science Council, Taiwan (NSC 93-2311-B-010-007-).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Mobile elements in animal and plant genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Deininger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Roy-Engel</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mobile DNA II</source>
            <publisher>Washington, DC: ASM Press</publisher>
            <editor>Craig NL</editor>
            <pubdate>2002</pubdate>
            <fpage>1074</fpage>
            <lpage>1092</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>On the abundance and distribution of transposable elements in the genome of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bartolome</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maside</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>926</fpage>
            <lpage>937</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032249</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Transposable elements as sources of variation in animals and plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Kidwell</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Lisch</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>7704</fpage>
            <lpage>7711</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33680</pubid>
                  <pubid idtype="pmpid" link="fulltext">9223252</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.15.7704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Eukaryotic transposable elements and genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Finnegan</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1989</pubdate>
            <volume>5</volume>
            <fpage>103</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-9525(89)90039-5</pubid>
                  <pubid idtype="pmpid">2543105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The transposable elements of the <it>Drosophila melanogaster </it>euchromatin: a genomics perspective.</p>
            </title>
            <aug>
               <au>
                  <snm>Kaminker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kronmiller</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Svirskas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frise</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>R0084</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/gb-2002-3-12-research0084</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Recurrent insertion and duplication generate networks of transposable element sequences in the <it>Drosophila melanogaster </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Bergman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Quesneville</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Anxolabehere</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R112</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1794594</pubid>
                  <pubid idtype="pmpid" link="fulltext">17134480</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-11-r112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Evolution of genes and genomes on the <it>Drosophila </it>phylogeny.</p>
            </title>
            <aug>
               <au>
                  <cnm><it>Drosophila </it>12 Genomes Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <fpage>203</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06341</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Mobile elements: drivers of genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Kazazian</snm>
                  <fnm>HH</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <fpage>1626</fpage>
            <lpage>1632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1089670</pubid>
                  <pubid idtype="pmpid" link="fulltext">15016989</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <aug>
               <au>
                  <snm>Berg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mobile DNA</source>
            <publisher>Washington, DC: American Society of Microbiology</publisher>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B10">
            <aug>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Transposable Elements and Evolution</source>
            <publisher>Dodrect, The Netherlands: Kluver</publisher>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B11">
            <title>
               <p>LINEs mobilize SINEs in the eel through a shared 3' sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Kajikawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>433</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)01041-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">12419252</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>SINEs and LINEs share common 3' sequences: a review.</p>
            </title>
            <aug>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ogiwara</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ohshima</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1997</pubdate>
            <volume>205</volume>
            <fpage>229</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(97)00409-5</pubid>
                  <pubid idtype="pmpid">9461397</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Tourist: a large family of small inverted repeat elements frequently associated with maize genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Bureau</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1992</pubdate>
            <volume>4</volume>
            <fpage>1283</fpage>
            <lpage>1294</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">160215</pubid>
                  <pubid idtype="pmpid" link="fulltext">1332797</pubid>
                  <pubid idtype="doi">10.1105/tpc.4.10.1283</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Plant transposable elements: where genetics meets genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>329</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg793</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988759</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Mobile inverted-repeat elements of the <it>tourist </it>family are associated with the genes of many cereal grasses.</p>
            </title>
            <aug>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <fpage>1411</fpage>
            <lpage>1415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">43168</pubid>
                  <pubid idtype="pmpid" link="fulltext">8108422</pubid>
                  <pubid idtype="doi">10.1073/pnas.91.4.1411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Plant transposable elements: where genetics meets genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>329</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg793</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988759</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Combined evidence annotation of transposable elements in genome sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Quesneville</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Andrieu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Autard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nouaud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anxolabehere</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>166</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1185648</pubid>
                  <pubid idtype="pmpid" link="fulltext">16110336</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0010022</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Classification and nomenclature of retrotransposable elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Capy</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Cytogenet Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>110</volume>
            <fpage>457</fpage>
            <lpage>461</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000084978</pubid>
                  <pubid idtype="pmpid" link="fulltext">16093698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Molecular paleontology of transposable elements in the <it>Drosophila melanogaster </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>6569</fpage>
            <lpage>6574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">164487</pubid>
                  <pubid idtype="pmpid" link="fulltext">12743378</pubid>
                  <pubid idtype="doi">10.1073/pnas.0732024100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>What transposable elements tell us about genome organization and evolution: the case of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Biemont</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Vieira</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Cytogenet Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>110</volume>
            <fpage>25</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000084935</pubid>
                  <pubid idtype="pmpid" link="fulltext">16093655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The characterization of <it>DINE-1</it>, a short, interspersed repetitive element present on chromosome and in the centric heterochromatin of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Locke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Howard</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Aippersbach</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Podemski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hodgetts</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Chromosoma</source>
            <pubdate>1999</pubdate>
            <volume>108</volume>
            <fpage>356</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004120050387</pubid>
                  <pubid idtype="pmpid" link="fulltext">10591995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Rapid sequence turnover at an intergenic locus in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Singh</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Petrov</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>670</fpage>
            <lpage>680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh060</pubid>
                  <pubid idtype="pmpid" link="fulltext">14739245</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genome-wide comparative analysis of the highly abundant transposable element <it>DINE-1 </it>suggests a recent transpositional burst in <it>Drosophila yakuba</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>H-P</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>T-L</fnm>
               </au>
               <au>
                  <snm>You</snm>
                  <fnm>T-L</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>T-H</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>173</volume>
            <fpage>189</fpage>
            <lpage>196</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461449</pubid>
                  <pubid idtype="pmpid" link="fulltext">16387876</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.051714</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p><it>Helitrons </it>on a roll: eukaryotic rolling-circle transposons.</p>
            </title>
            <aug>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>521</fpage>
            <lpage>529</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2007.08.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">17850916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p><it>GEM</it>, a cluster of repetitive sequences in the <it>Drosophila subobscura </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Vivas</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Garcia-Planells</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ruiz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marfany</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Paricio</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Duarte</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>de Frutos</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1999</pubdate>
            <volume>229</volume>
            <fpage>47</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(99)00031-1</pubid>
                  <pubid idtype="pmpid">10095103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Evolutionary dynamics of the <it>SGM </it>transposon family in the <it>Drosophila obscura </it>species group.</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bachmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bachmann</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>1597</fpage>
            <lpage>1609</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11070048</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Mobile elements and the genesis of microsatellites in Dipterans.</p>
            </title>
            <aug>
               <au>
                  <snm>Wilder</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hollocher</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>384</fpage>
            <lpage>392</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11230539</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Helitrons in fruit flies.</p>
            </title>
            <aug>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Repbase Reports</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>127</fpage>
            <lpage>132</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p><it>Drosophila </it>molecular phylogenies and their uses.</p>
            </title>
            <aug>
               <au>
                  <snm>Powell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>DeSalle</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Evol Biol</source>
            <pubdate>1995</pubdate>
            <volume>28</volume>
            <fpage>87</fpage>
            <lpage>138</lpage>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Genomic heterogeneity of background substitutional patterns in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Singh</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Arndt</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Petrov</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>169</volume>
            <fpage>709</fpage>
            <lpage>722</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449091</pubid>
                  <pubid idtype="pmpid" link="fulltext">15520267</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.032250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The <it>pogo </it>transposable element family of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Tudor</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lobocka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goodell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pettitt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>O'Hare</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1992</pubdate>
            <volume>232</volume>
            <fpage>126</fpage>
            <lpage>134</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00299145</pubid>
                  <pubid idtype="pmpid">1313144</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p><it>Vege </it>and <it>Mar: </it>two novel <it>hAT MITE </it>families from <it>Drosophila willistoni</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Holyoake</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Kidwell</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <fpage>163</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg023</pubid>
                  <pubid idtype="pmpid" link="fulltext">12598681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>DNA-binding activity and subunit interaction of the mariner transposase.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dawson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Finnegan</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>3566</fpage>
            <lpage>3575</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">55874</pubid>
                  <pubid idtype="pmpid" link="fulltext">11522826</pubid>
                  <pubid idtype="doi">10.1093/nar/29.17.3566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The ITR binding domain of the <it>Mariner Mos-1 </it>transposase.</p>
            </title>
            <aug>
               <au>
                  <snm>Auge-Gouillou</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hamelin</snm>
                  <fnm>M-H</fnm>
               </au>
               <au>
                  <snm>Demattei</snm>
                  <fnm>M-V</fnm>
               </au>
               <au>
                  <snm>Periquet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bigot</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2001</pubdate>
            <volume>265</volume>
            <fpage>58</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380000386</pubid>
                  <pubid idtype="pmpid" link="fulltext">11370873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>DNA-binding specificity of rice mariner-like transposases and interactions with Stowaway MITEs.</p>
            </title>
            <aug>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Osterlund</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Peeler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>2153</fpage>
            <lpage>2165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1079968</pubid>
                  <pubid idtype="pmpid" link="fulltext">15831788</pubid>
                  <pubid idtype="doi">10.1093/nar/gki509</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>PIF-like transposons are common in <it>Drosophila </it>and have been repeatedly domesticated to generate new host genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Casola</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lawing</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Betran</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1872</fpage>
            <lpage>1888</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm116</pubid>
                  <pubid idtype="pmpid" link="fulltext">17556756</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Identification and characterization of 14 transposon-like elements in the noncoding regions of members of the X<it>a21 </it>family of disease resistance genes in rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Song</snm>
                  <fnm>WY</fnm>
               </au>
               <au>
                  <snm>Pi</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Ronald</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1998</pubdate>
            <volume>258</volume>
            <fpage>449</fpage>
            <lpage>456</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380050755</pubid>
                  <pubid idtype="pmpid">9669326</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p><it>Micron</it>, a microsatellite-targeting transposable element in the rice genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Akagi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yokozeki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Inagaki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fujimura</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2001</pubdate>
            <volume>266</volume>
            <fpage>471</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380100563</pubid>
                  <pubid idtype="pmpid" link="fulltext">11713677</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p><it>Microuli</it>, a family of miniature subterminal inverted-repeat transposable elements (MSITEs): transposition without terminal inverted repeats.</p>
            </title>
            <aug>
               <au>
                  <snm>Tu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Orphanidis</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>893</fpage>
            <lpage>895</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319273</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Massive amplification of rolling-circle transposons in the lineage of the bat <it>Myotis lucifugus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Pritham</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>1895</fpage>
            <lpage>1900</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1794268</pubid>
                  <pubid idtype="pmpid" link="fulltext">17261799</pubid>
                  <pubid idtype="doi">10.1073/pnas.0609601104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Temporal patterns of fruit fly (<it>Drosophila</it>) evolution revealed by mutation clocks.</p>
            </title>
            <aug>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>36</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg236</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949132</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>SINEs and LINEs: highly repeated short and long interspersed sequences in mammalian genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Singer</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1982</pubdate>
            <volume>28</volume>
            <fpage>433</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(82)90194-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">6280868</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Evolutionary dynamics of transposable elements in the short-tailed opossum <it>Monodelphis domestica</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gentles</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wakefield</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Pollock</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>992</fpage>
            <lpage>1004</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1899126</pubid>
                  <pubid idtype="pmpid" link="fulltext">17495012</pubid>
                  <pubid idtype="doi">10.1101/gr.6070707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Duplication, coclustering, and selection of human <it>Alu </it>retrotransposons.</p>
            </title>
            <aug>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pavlicek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>MV</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>1268</fpage>
            <lpage>1272</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">337042</pubid>
                  <pubid idtype="pmpid" link="fulltext">14736919</pubid>
                  <pubid idtype="doi">10.1073/pnas.0308084100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>LTR-retrotransposons and MITEs: important players in the evolution of plant genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1995</pubdate>
            <volume>5</volume>
            <fpage>814</fpage>
            <lpage>821</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0959-437X(95)80016-X</pubid>
                  <pubid idtype="pmpid">8745082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Structural domains and matrix attachment regions along colinear chromosomal segments of maize and sorghum.</p>
            </title>
            <aug>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>ZV</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>249</fpage>
            <lpage>264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139762</pubid>
                  <pubid idtype="pmpid" link="fulltext">10662861</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.2.249</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Human genomic deletions mediated by recombination between <it>Alu </it>elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Sen</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Callinan</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cordaux</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2006</pubdate>
            <volume>79</volume>
            <fpage>41</fpage>
            <lpage>53</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1474114</pubid>
                  <pubid idtype="pmpid" link="fulltext">16773564</pubid>
                  <pubid idtype="doi">10.1086/504600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Primate segmental duplications: crucibles of evolution, diversity and disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Eichler</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>552</fpage>
            <lpage>564</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1895</pubid>
                  <pubid idtype="pmpid" link="fulltext">16770338</pubid>
      