<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-2-r28</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A genome-wide transcriptional activity survey of rice transposable element-related genes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Jiao</snm>
               <fnm>Yuling</fnm>
               <insr iid="I1"/>
               <email>yuling.jiao@yale.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Deng</snm>
               <mnm>Wang</mnm>
               <fnm>Xing</fnm>
               <insr iid="I1"/>
               <email>xingwang.deng@yale.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular, Cellular and Developmental Biology, Yale University, 165 Prospect Street, New Haven, CT 06520, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>2</issue>
         <fpage>R28</fpage>
         <url>http://genomebiology.com/2007/8/2/R28</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17326825</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-2-r28</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>18</day>
               <month>12</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>27</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>27</day>
               <month>02</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Jiao and Deng; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Transcription analysis of transposable-element-related genes in rice</p>
      </shorttitle>
      <shortabs>
         <p>A genome-wide survey of the transcriptional activity of TE-related genes that were associated with fifteen developmental stages and stress conditions revealed clear, albeit low, general transcription of TE-related genes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Transposable element (TE)-related genes comprise a significant portion of the gene catalog of grasses, although their functions are insufficiently characterized. The recent availability of TE-related gene annotation from the complete genome sequence of rice (<it>Oryza sativa</it>) has created an opportunity to conduct a comprehensive evaluation of the transcriptional activities of these potentially mobile elements and their related genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We conducted a genome-wide survey of the transcriptional activity of TE-related genes associated with 15 developmental stages and stress conditions. This dataset was obtained using a microarray encompassing 2,191 unique TE-related rice genes, which were represented by oligonucleotide probes that were free from cross-hybridization. We found that TE-related genes exhibit much lower transcriptional activities than do non-TE-related genes, although representative transcripts were detected from all superfamilies of both type I and II TE-related genes. The strongest transcriptional activities were detected in TE-related genes from among the MULE and CACTA superfamilies. Phylogenetic analyses suggest that domesticated TE-related genes tend to form clades with active transcription. In addition, chromatin-level regulations through histone and DNA modifications, as well as enrichment of certain <it>cis </it>elements in the promoters, appear to contribute to the transcriptional activation of representative TE-related genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our findings reveal clear, albeit low, general transcription of TE-related genes. In combination with phylogenetic analysis, transcriptional analysis has the potential to lead to the identification of domesticated TEs with adapted host functions.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The completion of the rice (<it>Oryza sativa</it>) genome sequence allowed further functional classification of the coding sequences of this important crop and model of grass species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Detailed annotation of the rice genome revealed that nearly a quarter of the rice open reading frame (ORF) coding capacity has features of transposable elements (TEs) and are therefore defined as TE-related genes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Like other genes, these TE-related genes have predicted normal gene structure with protein coding capacity. However, they share significant sequence similarity with known TEs in either or both of the following ways: they have TE signature sequences in The Institute for Genomic Research (TIGR) <it>Oryza </it>Repeat Database <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> or they contain TE-related protein domains <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. By this definition, TE-related genes can include potentially active TEs (based on the existence of a functional ORF) as well as cellular genes derived from TEs. Many of these TE-related genes encode reverse transcriptases, transposases, or other related proteins <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and they can be further classified based on protein domain and other sequence features <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Those TEs overwhelming in number that lack functional ORFs are not considered to be genes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Although there are many TE-related genes, the biologic functions of these genes remain elusive <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>TEs are considered to be important for the maintenance and diversification of genomes. TEs are usually separated into two classes that differ in the mode of propagation: retrotransposons, or type I elements, which transpose by reverse transcription of an RNA intermediate; and type II elements, which only use a DNA intermediate in movement within the genome. Both classes can be further divided into several superfamilies, each with a unique evolutionary history. Representatives of virtually all superfamilies of TEs have been detected in grass genomes <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Accumulating evidence suggests that TE activities have profound impact on the genome <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, influencing genome size, genome rearrangement, chromatin transcription, and gene evolution <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>; many of these factors relying specifically on the transposition activity of TEs.</p>
         <p>Although most TEs are considered inactive <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, there have been isolated reports of TE transposition in rice and other grasses <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. A common condition promoting transposition is stress, including that which occurs in <it>in vitro </it>cell or tissue culture <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Developmental regulation of transposition has also been reported in intact plants <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>Transcription of TE-related genes is required for their own transposition and that of other related TEs, although transcription itself may not be sufficient for transposition <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Analysis of TE-related genes from certain subgroups of the type I class and the <it>Mutator</it>-like superfamily of the type II class suggests that their transcripts are widely present in grasses <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Most of these transcribed TEs have coding capacity and are therefore considered TE-related genes. A recent study of expressed sequence tags (ESTs) in sugarcane identified 267 active TE-related transcripts <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Transcription of TE-related genes was also reported in an unbiased survey of the transcriptional activity of a single rice chromosome using a tiling microarray <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
         <p>Apart from the potentially active TEs among these TE-related genes, domesticated TE-related genes, which acquire new functions for the host, have also been found to exist. Although our current classification for distinguishing TE-related genes from non-TE-related genes is not definitive <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, two recent studies in <it>Arabidopsis </it>identified domesticated TE-related genes contributing to cellular processes <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Similar examples were also found in animals <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. Such findings in part support the hypothesis that TE-related genes may influence the evolution of their host by providing a source of novel coding capacity.</p>
         <p>The potential impact of domesticated TE-related genes on the evolution of genomes requires systematic investigation. One attempt to identify further domesticated TE-related genes is sequence mining <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Because a change of position through transcription can be detrimental to the host, transposon-derived genes with known host function usually lack mobility. As a consequence, they may be devoid of transposon-specific terminal sequences <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B36">36</abbr></abbrgrp>. By employing this criterion in a search, one particular member of the MULE superfamily was identified as a domesticated gene candidate <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Transcription is an important feature of domesticated TE-related genes, because it is generally required in cellular functions of the host <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. By surveying transcriptional activity and combining other approaches, we would be able to identify domesticated TE-derived gene candidates.</p>
         <p>Another mechanism for the evolution of new genes from TEs is through their ability to acquire and fuse fragments of genes to new genomic locations, as seen in plant Pack-MULE and, more recently, in certain <it>Helitron</it>-like and CACTA elements <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. However, many of these Pack-MULEs have been suggested to possess pseudogene-like features <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Pack-MULE, as a unique group of TE-related genes, is relatively well annotated and is a current focus of interest regarding the origin of genes <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
         <p>Given the paucity of information on TE-related genes, a systematic study of their transcriptional activity in a well characterized genome is required to enhance our understanding of the activity of TE-related genes. That the sequence of the rice genome is now completely annotated makes it a good resource for such a genome-wide survey <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Recent advances in microarray technology allow us to study the transcriptional activity of genes in a high-throughput manner. It is therefore possible to conduct a genome-wide survey of the transcriptional activity of rice TE-related genes, especially those more divergent ones for which unique oligomer probes can be designed. Different from simple TEs composing mostly repetitive sequences, many TE-related genes are diverged enough to have short oligomers representing their unique sequence regions. Such an approach has recently been utilized to analyze transcription of TE-related genes in plants and animals <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B30">30</abbr><abbr bid="B40">40</abbr></abbrgrp>. In addition to TE-related genes, TEs without protein-coding capacity and other tandem repeats may also exhibit transcriptional activity <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B41">41</abbr></abbrgrp>. Transcripts derived from tandem repeats in the heterochromatin can give rise to small RNAs, which in turn direct the modification of histones and DNA in TE-related sequences and nearby regions by means of RNA interference <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Although transcripts from tandem repeats are important for the genome, their highly repetitive nature prohibits characterization of their unique identities in chromosomal organization on a genome-wide scale <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>.</p>
         <p>We conducted an expression analysis for rice TE-related genes using 70-mer oligonucleotide microarrays. Expression profiles from 4,728 oligonucleotides covering organs from rice plants were analyzed under both normal conditions at various developmental stages as well as under stress conditions. Clear but restricted transcription of TE-related genes were found for all major superfamilies of TE-related genes. Mechanisms controlling representative TE transcription were further analyzed.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Representation of TE-related genes by an oligonucleotide microarray</p>
            </st>
            <p>A 70-mer oligonucleotide set was previously developed to span the rice genome <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Many TE-related genes are included in this oligomer set design, allowing survey of a large number of rice TE-related genes. However, for the sake of simplicity, those oligonucleotide probes representing TE-related genes were removed from analysis in all prior genome profiling analyses <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Here, we collected all of our available datasets and systematically examined the transcriptional activities of TE-related genes in various tissues and growth conditions. In particular, we included datasets representing cell cultures and stress-exposed tissues.</p>
            <p>According to the rice genome annotation at TIGR <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and a literature review <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B48">48</abbr></abbrgrp>, a total of 14,404 genes were identified as TE-related genes, based on the presence of TE signature sequences in the TIGR <it>Oryza </it>Repeat Database <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> or TE-related Pfam domains. Among these TE-related genes, 9,493 were classified as type I (retrotransposons) TE-related genes and 4,159 were classified as type II (DNA transposon) TE-related genes. These TE-related genes were further classified into superfamilies according to sequence signatures (Table <tblr tid="T1">1</tblr>). The classification at TIGR was followed, modified in accordance with recently published studies <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B48">48</abbr></abbrgrp>. There were another 752 TE-related genes without further classification. A remapping of oligonucleotides in our microarray <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> to annotated genes indicated that 2,191 (15.2%) TE-related genes were represented by at least one 70-mer oligonucleotide that was free from cross-hybridization (see Materials and methods, below). Most oligomers, if not all, mapped to unique coding regions instead of repetitive sequences. In addition, 1,966 70-mer oligonucleotides mapped to more than one TE-related gene while remaining cross-hybridization free from non-TE-related genes. These oligonucleotides covered another 9,396 (65.2%) TE-related genes.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Summary of annotated TE-related genes in rice and coverage by (cross-hybridization free) microarray probes</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Number of TEs in TIGR</p>
                     </c>
                     <c ca="left">
                        <p>Number of TEs in TIGR and literature review</p>
                     </c>
                     <c ca="left">
                        <p>Covered by microarray</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4" ca="left">
                        <p>Type I</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Ty1/<it>copia</it></p>
                     </c>
                     <c ca="left">
                        <p>1,273</p>
                     </c>
                     <c ca="left">
                        <p>1,469</p>
                     </c>
                     <c ca="left">
                        <p>235</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Ty3/<it>gypsy</it></p>
                     </c>
                     <c ca="left">
                        <p>3,904</p>
                     </c>
                     <c ca="left">
                        <p>4,218</p>
                     </c>
                     <c ca="left">
                        <p>362</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>LINE</p>
                     </c>
                     <c ca="left">
                        <p>56</p>
                     </c>
                     <c ca="left">
                        <p>62</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Undetermined</p>
                     </c>
                     <c ca="left">
                        <p>4,158</p>
                     </c>
                     <c ca="left">
                        <p>3,744</p>
                     </c>
                     <c ca="left">
                        <p>691</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Subtotal</p>
                     </c>
                     <c ca="left">
                        <p>9,391</p>
                     </c>
                     <c ca="left">
                        <p>9,493</p>
                     </c>
                     <c ca="left">
                        <p>1,322</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4" ca="left">
                        <p>Type II</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>hAT</it>-like</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>184</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>CACTA</p>
                     </c>
                     <c ca="left">
                        <p>2,392</p>
                     </c>
                     <c ca="left">
                        <p>2,276</p>
                     </c>
                     <c ca="left">
                        <p>231</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MULE</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>452</p>
                     </c>
                     <c ca="left">
                        <p>607</p>
                     </c>
                     <c ca="left">
                        <p>155</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>PIF</it>/<it>Pong</it>-like</p>
                     </c>
                     <c ca="left">
                        <p>122</p>
                     </c>
                     <c ca="left">
                        <p>238</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Mariner</it>-like</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>Helitron-like</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>undetermined</p>
                     </c>
                     <c ca="left">
                        <p>999</p>
                     </c>
                     <c ca="left">
                        <p>787</p>
                     </c>
                     <c ca="left">
                        <p>128</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Subtotal</p>
                     </c>
                     <c ca="left">
                        <p>4,026</p>
                     </c>
                     <c ca="left">
                        <p>4,159</p>
                     </c>
                     <c ca="left">
                        <p>645</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Unclassified</p>
                     </c>
                     <c ca="left">
                        <p>779</p>
                     </c>
                     <c ca="left">
                        <p>752</p>
                     </c>
                     <c ca="left">
                        <p>224</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>14,196</p>
                     </c>
                     <c ca="left">
                        <p>14,404</p>
                     </c>
                     <c ca="left">
                        <p>2,191</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>The two subtotals plus Unclassified. TE, transposable element.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Transcriptional activity of TE-related genes</p>
            </st>
            <p>To obtain a comprehensive picture of the transcriptional activity of TE-related genes, we assembled their transcription profiles into a collection of 15 datasets acquired from various tissues and under various physical conditions (Table <tblr tid="T2">2</tblr>). Five tissues grown under normal conditions from different developmental stages, four cell cultures, and six tissue samples under conditions of salinity or drought were included <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Three or more independent biologic replicates for each sample were analyzed. In order to assemble a compendium of transcription profiles with minimal sample variation, quantified microarray hybridization signals from different experiments were pulled together and subjected to an automatic processing pipeline, with manual inspection to correct for slide background, normalize experimental variations, filter problem spots, and check data quality. A previously described method, which takes into account both negative and positive controls as well as data reproducibility, was applied here to determine the expression threshold <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Such an experimental expression threshold was also supported by reverse transcription (RT)-polymerase chain reaction (PCR) of randomly selected genes.</p>
            <tbl id="T2" hint_layout="single">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Summary of rice samples used in this study</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Sample</p>
                     </c>
                     <c ca="left">
                        <p>Abbreviation</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Seedling shoot</p>
                     </c>
                     <c ca="left">
                        <p>SS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tillering stage shoot</p>
                     </c>
                     <c ca="left">
                        <p>TS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tillering stage root</p>
                     </c>
                     <c ca="left">
                        <p>TR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Flag leaf</p>
                     </c>
                     <c ca="left">
                        <p>FL</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heading panicle</p>
                     </c>
                     <c ca="left">
                        <p>HP</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Filling panicle</p>
                     </c>
                     <c ca="left">
                        <p>FP</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Suspension cultured cells</p>
                     </c>
                     <c ca="left">
                        <p>SC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Somatic root in culture</p>
                     </c>
                     <c ca="left">
                        <p>CR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Somatic shoot in culture</p>
                     </c>
                     <c ca="left">
                        <p>CS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tillering stage shoot under drought stress</p>
                     </c>
                     <c ca="left">
                        <p>TSD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tillering stage shoot under salt stress</p>
                     </c>
                     <c ca="left">
                        <p>TSS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Flag leaf under drought stress</p>
                     </c>
                     <c ca="left">
                        <p>FLD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Flag leaf under salt stress</p>
                     </c>
                     <c ca="left">
                        <p>FLS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heading panicle under drought stress</p>
                     </c>
                     <c ca="left">
                        <p>HPD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heading panicle under salt stress</p>
                     </c>
                     <c ca="left">
                        <p>HPS</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Examination of the expression of TE-related genes in each sample indicates that heading stage panicle has the greatest level of detected expression at 33%, whereas expression percentage in somatic shoot culture is the lowest, at 26% (Figure <figr fid="F1">1a</figr>). We also found that DNA transposons (type II) have 11% to 18% higher expression percentage than retrotransposons (type I) in all samples analyzed (Figure <figr fid="F1">1a</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Summary of expression of TE-related genes</p>
               </caption>
               <text>
                  <p>Summary of expression of TE-related genes. <b>(a) </b>Percentage of the transcribed type I and type II TE-related genes and non-TE-related genes in different samples. Percentages of transcribed genes in each category are shown for all samples. <b>(b) </b>Levels of transcription can be inferred based on how often (in how many different samples) expression was detected for TE-related and non-TE-related genes. TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-1"/>
            </fig>
            <p>By monitoring the expression of 2,191 TE-related genes using unique oligomer probes, we identified expression of 1,084 (61.7%) TE-related genes in at least one of our 15 samples. This is in contrast to findings in non-TE-related genes, 85.8% of which are expressed in at least one sample and 22.6% in all samples, using the same selection criteria. Expressed TE-related genes tend to exhibit transcription in a relatively small number of samples. The percentages of expressed TE-related genes in a wide range of samples are markedly lower than those of non-TE-related genes (Figure <figr fid="F1">1b</figr>). For those oligonucleotide probes that match multiple TE-related genes, 73.7% and 5.1% had hybridization signals in at least one sample or in all samples, respectively. Considering that those probes match multiple repetitive genes, a smaller portion of those TE-related genes that they represent is expected to be transcribed.</p>
            <p>To probe quantitatively for the transcriptional activity of TE-related genes, the expression intensities of those 1,084 transcribed TE-related genes and an similar number of randomly selected transcribed non-TE-related genes are visually juxtaposed after clustering (Figure <figr fid="F2">2</figr>). Even though only transcribed genes are being compared here, it is clear that the transcription of TE-related genes was in general weaker than that of their non-TE-related counterparts. Furthermore, a large portion of the transcribed TE-related genes exhibited detectable transcription in fewer rice samples than was the case for non-TE-related genes. However, there are clearly a few clusters of TE-related genes with rampant transcription in most rice samples, and some of this transcription is quite marked (Figure <figr fid="F2">2</figr>). A few organ-specific clusters, such as one for cultured cells (lanes 7, 8 and 9 in Figure <figr fid="F2">2</figr>), were also found.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Global expression map showing transcriptional activity of TE-related and randomly selected non-TE-related genes</p>
               </caption>
               <text>
                  <p>Global expression map showing transcriptional activity of TE-related and randomly selected non-TE-related genes. Only 1,353 TE-related genes with transcription in at least one sample are included. Another 1,353 non-TE-related genes randomly picked from those with transcription in at least one samples are shown in parallel. Each lane represents one sample in the same order as in Table 2. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-2"/>
            </fig>
            <p>To gauge the reliability of our microarray data for TE-related genes, we first compared rice cDNA and EST collections with our data. We found 496 TE-related genes in the cDNA/EST collection in TIGR database <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. These cDNAs and ESTs were derived from six rice samples: callus, seed, shoot and stem, leaf, root, and flower (heading panicle). We have similar (although not identical) rice samples with microarray expression profiles for all of them except seed. A survey of these TE-related cDNAs/ESTs indicates that 80% of those covered by our microarray also had detectable transcription. We further used RT-PCR to verify the microarray data. An attempt to amplify a series of TE-related genes with different levels of microarray signals supported our choice of threshold used to determine expression. Of the 10 genes with expression level within 100 units above the threshold, seven were amplified by RT-PCR; in contrast, only two out of 10 with expression below the threshold were amplified. Moreover, 34 randomly selected TE-related genes identified through microarray analysis as being shoot expressed were tested with RT-PCR using seedling shoot RNA samples. Twenty-nine (85%) of them were clearly detected. An independent tiling microarray analysis of rice transcriptome also covered a significant portion of the TE-related genes <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. A preliminary survey of the transcriptional activities of TE-related genes in this dataset gives a similar portion of expression (about 30%) among tissues examined <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, although a different platform and hybridization detection procedure were used <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Transcription of type I TE-related genes</p>
            </st>
            <p>In addition to taking an inventory of transcribed TE-related genes in various tissues and under multiple growth conditions, the availability of high-quality complete genome sequence provided an opportunity to elucidate how transcriptional activities evolve following sequence divergence. To this end, phylogenic trees were generated for all major TE-related gene superfamilies and were integrated with their members' expression profiles.</p>
            <p>The type I TE-related genes can be classified into two groups according to the presence or absence of long terminal repeats (LTRs). TE-related genes without LTRs belong to the long interspersed elements (LINEs) type, which may encode retrotransposase and mobilize noncoding short interspersed elements (SINEs). Only 34 LINE-type TE-related genes were identified in rice (Table <tblr tid="T1">1</tblr>). We found a relatively small portion (usually below 20%) of this family transcribed (Figure <figr fid="F3">3</figr>). One rice LINE-type retrotransposon named <it>Karma </it>with active transposition has been reported <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>; its transcriptional activity was detected in a wide range of organs and cultured cells. A 5'-truncated version of <it>Karma </it>was also identified in the rice genome <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which lacks transcriptional activity in all samples we tested (Figure <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Degrees of lineage-specific transcription in the LINE superfamily</p>
               </caption>
               <text>
                  <p>Degrees of lineage-specific transcription in the LINE superfamily. The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with human <it>L1</it>. Bootstrap values were calculated from 1,000 replicates. Sample numbers are identical to those in Table 2. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. *Previously reported members with transcription or transposition. <sup>&#8224; </sup>Previously reported inactivate members. LINE, long interspersed element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-3"/>
            </fig>
            <p>LTR-type TE-related genes belong to two superfamilies, namely Ty1/<it>copia </it>and Ty3/<it>gypsy</it>, which are both ubiquitous throughout plants and believed to have contributed significantly to the evolution of genome structure and function <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Both families are quite diverse in rice, with Ty3/<it>gypsy </it>elements outnumbering Ty1/<it>copia </it>elements <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Our expression data indicate that both families are similarly transcribed at low levels at around 25% in most samples, but there are members in both families with strong transcription in widespread tissues. However, they are spread in different clades with only remote similarity (Additional data files 1 and 2). A few active LTR retrotransposons have been reported in rice. Among them, <it>Tos17 </it>is the best characterized and is known to exhibit active transposition in tissue culture <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. We found active transcription of <it>Tos17 </it>not only in cultured cells but also in a wide range of organs (Additional data file 1), suggesting that tissue culture may provide a way to propagate somatic transposition events to progeny. Sireviruses are a plant-specific lineage of the Ty1/<it>copia </it>retrotransposons that interact specifically with proteins related to dynein light chain 8 <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. We found one member of this lineage with ubiquitous strong transcription and several others with transcription in selected rice samples (Additional data file 1).</p>
            <p>A large number of type I TE-related genes have not yet been further classified (Table <tblr tid="T1">1</tblr>). We detected transcription of a smaller proportion of this group of genes than for Ty1/<it>copia </it>and Ty3/<it>gypsy </it>superfamilies.</p>
         </sec>
         <sec>
            <st>
               <p>Transcription of type II TE-related genes</p>
            </st>
            <p>Type II TE-related genes are in general more actively transcribed than type I TE-related genes. Different from type I, type II TE-related genes are highly variable among major superfamilies with respect to transcriptional activity. Whereas CACTA and MULE superfamilies are actively transcribed, <it>hAT</it>-like, <it>PIF</it>/<it>Pong</it>-like, <it>Mariner</it>-like, and <it>Helitron</it>-like superfamilies have transcriptional activities similar to or lower than those of type I TE-related genes.</p>
            <p><it>Mutator</it>-like superfamily (MULE) is one of the first groups of identified transposases with a few reported transcriptionally active members in rice <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. There are 607 autonomous members of this superfamily (Table <tblr tid="T1">1</tblr>), which has one of the strongest transcription levels, at 35% to 40% in each sample (Figure <figr fid="F4">4</figr>). The MULEs can be further divided into three branches: <it>MuDR</it>-like, <it>Jittery</it>-like, and <it>TRAP</it>-like <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The <it>TRAP</it>-like branch may have recently been amplified, and high similarity among family members has resulted in lack of unique oligo probes with which to examine their expression profiles. Interestingly, we have found at least three clades with clear active transcription in <it>MuDR</it>-like and <it>Jittery</it>-like branches (Figure <figr fid="F4">4</figr>). The one highly transcribed clade in the <it>MuDR</it>-like branch included <it>MUG1</it>, an evolutionarily conserved MULE sequence found in diverse angiosperms and a candidate for categorization as a domesticated transposase-related gene <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. The larger, highly transcribed clade in the <it>Jittery</it>-like branch includes homologs to <it>Arabidopsis </it>genes <it>FAR1 </it>and <it>FHY3</it>, both of which are transposon-derived genes with demonstrated host function as transcription factors downstream of phytochrome A <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. There are no reports on any members of the other highly transcribed clade in the <it>Jittery</it>-like branch, which has rampant transcription (Figure <figr fid="F4">4</figr>, middle).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Degrees of lineage-specific transcription in MULE superfamily (excluding the <it>TRAP</it>-like class)</p>
               </caption>
               <text>
                  <p>Degrees of lineage-specific transcription in MULE superfamily (excluding the <it>TRAP</it>-like class). The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with soybean <it>Soymar1</it>. Bootstrap values were calculated from 1,000 replicates. Sample numbers are identical to those in Table 2. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. Names in parenthesis indicate members not covered by microarray. Transcriptional active clades are highlighted by bars. *Previously reported members with transcription or transposition.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-4"/>
            </fig>
            <p>The CACTA superfamily is a diverse group of high-copy repetitive genes in grasses <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. CACTA transposons with active transcription or even transposition have been reported in rice and other grass genomes <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. A total of 2,276 intact CACTA transposase-coding genes are identified in rice, making it the largest superfamily in type II TE-related genes (Table <tblr tid="T1">1</tblr>). The CACTA superfamily is also highly active, with more than 40% transcribed in each sample. Several clades with active transcription were identified (Additional data file 4). Among them, two clades include over 20 members. No members within these actively transcribed CACTA transposons have previously been characterized.</p>
            <p>The <it>hAT</it>-like superfamily is another widespread superfamily in grasses <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. It is a medium-sized superfamily in rice with 184 autonomous members (Table <tblr tid="T1">1</tblr>). About 20% of this superfamily is transcribed in a single sample (Figure <figr fid="F5">5</figr>). Interestingly, we found a small clade of four genes that exhibited relatively uniform and strong transcription across a wide range of samples. A sequence comparison indicates that these genes have high similarity with a recently identified domesticated <it>Arabidopsis </it>transposase <it>DAYSLEEPER</it>, which is a pleiotropic regulator of development through its specific DNA-binding activity <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. There is one reported <it>hAT</it>-like transposon group in rice, <it>Dart</it>, which is capable of active transposition in plants <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B59">59</abbr></abbrgrp>. Sequence analysis indicates that <it>Dart </it>is a recently amplified clade with 30 almost identical members. Although no oligonucleotide probes have been developed to represent individual members, there are a few probes that can detect all or most of them. Clear hybridization signals have been found for these probes in all shoot and cell culture samples. This finding suggests that some or all members of <it>Dart </it>are highly transcribed in a large number of rice samples.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Degrees of lineage-specific transcription in <it>hAT</it>-like superfamily</p>
               </caption>
               <text>
                  <p>Degrees of lineage-specific transcription in <it>hAT</it>-like superfamily. The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with soybean <it>Soymar1</it>. Bootstrap values were calculated from 1,000 replicates. Sample numbers are identical to those given in Table 2. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. *Previously reported members with transcription or transposition.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-5"/>
            </fig>
            <p>Both <it>PIF</it>/<it>Pong</it>-like and <it>Mariner</it>-like TE-related genes are autonomous partners of nonautonomous miniature inverted repeat transposable elements (MITEs), which are ubiquitous in the rice genome <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Low proportions of both families have detectable transcription (&lt;20%) in each sample (Figure <figr fid="F6">6</figr> and Additional data file 4). Two transpositionally active <it>PIF</it>/<it>Pong</it>-like elements were recently reported: maize <it>PIF </it>and rice <it>Pong </it><abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B60">60</abbr></abbrgrp>. Interestingly, the rice homolog of <it>PIF</it>, namely <it>OsPIF1 </it><abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, was not expressed in any samples (Figure <figr fid="F6">6</figr>). There are six almost identical <it>Pong </it>elements in the rice genome, which are represented by a single probe in the microarray. This probe detected transcription activity in tillering shoot and drought-exposed panicles only (Figure <figr fid="F6">6</figr>), suggesting rigorous regulation at the transcriptional level for members of this family. We did not detect any transcriptional activity of the <it>Pong </it>element in cultured cells. The <it>Mariner</it>-like superfamily has a much smaller member size <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>; this superfamily includes a small proportion of transcribed genes, similar to that for the <it>PIF</it>/<it>Pong</it>-like superfamily (Additional data file 4).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Degrees of lineage-specific transcription in <it>PIF</it>/<it>Pong</it>-like superfamily</p>
               </caption>
               <text>
                  <p>Degrees of lineage-specific transcription in <it>PIF</it>/<it>Pong</it>-like superfamily. The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with soybean <it>Soymar1</it>. Bootstrap values were calculated from 1,000 replicates. Sample numbers are identical to those in Table 2. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. Names in parenthesis indicate members not covered by the microarray. *Previously reported members with transcriptional or transpositional activity.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-6"/>
            </fig>
            <p>A recently identified unique type II TE superfamily, <it>Helitron</it>-like, is relatively under-characterized in the rice genome <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Strikingly, <it>Helitron</it>-like transposons have the potential to move and shuffle genes or exons in maize <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. In rice, we found only one member with transcriptional activity in all the samples. There is no other <it>Helitron</it>-like transposon among the seven examined ones with transcriptional activity in any samples (Additional data file 5).</p>
            <p>We were unable to further classify another 787 type II TE-related genes into any superfamilies (Table <tblr tid="T1">1</tblr>). Interestingly, a large percentage (>40% out of 128 with unique oligomer probes) was found to be transcribed.</p>
         </sec>
         <sec>
            <st>
               <p>Transcription of Pack-MULE</p>
            </st>
            <p>Genes or exons can be transduplicated by MULEs <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B63">63</abbr></abbrgrp>, which have recently been suggested to be important facilitators of the evolution of genes in higher plants, and have therefore been termed Pack-MULE <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. However, a detailed sequence analysis suggests that the products of this process are more likely to be pseudogenes than novel functional genes <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. To gain better insight into this group, we examined their transcriptional activities using microarray analysis, because transcription is usually a prerequisite for biologic function of a protein-coding gene. By testing the transcription of recently identified 137 Pack-MULEs on chromosomes 1 and 10 that are covered by our microarray <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, we found that the transcription rates of Pack-MULEs fall between those of TE-related gene models and non-TE-related gene models (Figure <figr fid="F7">7</figr>), being slightly closer to those of TE-related gene models. On the other hand, more Pack-MULEs are transcribed in several samples than for TE-related gene models and non-TE-related gene models (Figure <figr fid="F7">7</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Summary of expression of Pack-MULEs in comparison with other TE-related and non-TE-related gene models</p>
               </caption>
               <text>
                  <p>Summary of expression of Pack-MULEs in comparison with other TE-related and non-TE-related gene models. Levels of transcription can be inferred based on how often expression was detected in the different samples for each group. TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-7"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Association of transcription with DNA and histone modification</p>
            </st>
            <p>TEs, including TE-related ORF encoding genes, are under multiple levels of epigenetic control, including DNA methylation and histone modifications <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. In Arabidopsis, DNA methylation and histone H3 lysine-9 methylation (H3K9m) correlates with the silencing of TEs, and histone H3 lysine-4 methylation (H3K4m) is associated with transcribed genes <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. However, H3K4m is also found in silenced genes and therefore may not always be a marker for active transcription <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>.</p>
            <p>To determine whether transcribed TE-related genes have different chromatin modification status, we selected nine transcribed and three silenced TE-related genes, including both autonomous TE genes and TE-derived genes, in order to assess histone and DNA methylation (Figure <figr fid="F8">8a</figr>). These are <it>Tos17</it> and <it>Tos3</it> of the Ty1/<it>copia</it> superfamily; Ty3/<it>gypsy</it> elements Os09g15460, Os03g32070 and <it>OSR30</it>; MULE superfamily DNA transposons <it>MUG1</it>, <it>FAR1</it>-like and Os11g05820; CACTA DNA transposons Os10g31320, Os09g29980 and Os04g08710; and <it>DAYSLEEPER</it>-like from the <it>hAT</it>-like superfamily. Seedling shoot samples were used for all analyses discussed here. To verify transcription independently, we used PCR to amplify reverse-transcribed cDNA (RT-PCR). Transcript accumulation assayed by RT-PCR is in general consistent with microarray results (Figure <figr fid="F8">8a</figr>). Using chromatin immunoprecipitation (ChIP) analysis, we found that only silenced genes were associated with high levels of H3K9m. H3K4m was significant for all genes examined, regardless of whether they were transcribed or silenced (Figure <figr fid="F8">8a</figr>). Similar to H3K9m, only silenced genes were heavily methylated at the DNA level (at cytosine, by McrBC digestion assay; Figure <figr fid="F8">8a</figr>). These data imply that levels of H3K9m and DNA methylation were lower in transcribed TE-related genes. Similar correlations of histone and DNA methylation with transcription were also found in non-TE-related genes (controls in Figure <figr fid="F8">8a</figr>). Furthermore, no distinction was found between autonomous TE genes and TE-derived genes from these data.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Chromatin-level modifications of TE-related genes</p>
               </caption>
               <text>
                  <p>Chromatin-level modifications of TE-related genes. Reverse-transcribed cDNA, DNA from ChIP, and McrBC-digested genomic DNA were amplified by PCR for TE-related genes <b>(a) </b>with and without transcription in seedling shoots, and <b>(b) </b>with transcription in cultured cells but not seedling shoots. Primers corresponded to transcribed ORFs. Mock RT-PCR was performed without reverse transcriptase (w/o RT). ChIP was carried out with histone H3 anti-dimethyl lysine-4 (H3K4m) or anti-dimethyl lysine-9 (H3K9m) antibodies together with total DNA input (T) and no antibody (Mock) controls. McrPCR was performed on McrBC digested (+) and untreated (-) total genomic DNA. Actin was used as a positive control and Os10g35890, a gene of unknown function without transcription in seedlings, as a negative control. The same gray scale was used to indicate magnitude of transcription signals from microarray (Array). ChIP, chromatin immunoprecipitation; ORF, open reading frame; PCR, polymerase chain reaction; TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-8"/>
            </fig>
            <p>To explore these relationships further, we selected five TE-related genes with transcription in cultured cells but not in seedling shoots: the Ty1/<it>copia</it> retroelement Os10g22210; Ty3/<it>gypsy</it> retrotransposons Os09g11940 and Os10g06250; and CACTA DNA transposons Os07g23660 and Os08g32100 (Figure <figr fid="F8">8b</figr>). Three of these five genes were associated with higher levels of H3K9m in shoots (silenced) as compared with in cultured cells (transcribed), according to ChIP-PCR analysis. Levels of H3K4m did not exhibit a clear difference between shoots and cultured cells (Figure <figr fid="F8">8b</figr>). DNA methylation was reduced in three genes in cultured cells compared with shoots (Figure <figr fid="F8">8b</figr>). Thus, lower levels of DNA methylation and H3K4m tend to accompany TE-related gene transcription under developmental regulation.</p>
            <p>It has been shown that small RNAs derived from repetitive genome sequences repress transcription by means of RNA interference in <it>Arabidopsis </it><abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Small RNAs, both microRNAs (miRNAs) and small interfering RNAs (siRNAs), have also been identified in rice, albeit at a small scale <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>. Sixteen out of a total of 44 predicted siRNAs have at least one TE-related gene as their target gene <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, whereas few miRNA have a TE-related gene target <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. For the five target TE-related genes covered by microarray, we found active transcription for only one. It is interesting to note that for siRNAs targeting multiple genes, the transcriptional profiles of these target genes may not be at all similar. For example, siRNA P96-E12 has two targets: Os07g10770 (a cellulose synthase) and Os01g05370 (a Ty1/<it>copia </it>family retrotransposon). The cellulose synthase gene has strong transcription in almost all samples we profiled. In contrast, the retrotransposon target does not exhibit transcription in any sample.</p>
         </sec>
         <sec>
            <st>
               <p>Upstream gene transcription affects TE-related gene transcription</p>
            </st>
            <p>It was recently reported in <it>Arabidopsis</it>, as well as in several other eukaryotes, that some adjacent genes tend to have co-expression patterns <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>. Readthrough of TEs derived from upstream genes is also reported in isolated studies <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>. We therefore suspected that transcription of neighboring genes might influence the transcription of a TE-related gene. To test this hypothesis, we calculated the frequency of transcribed TE-related genes relative to the transcriptional activity of neighboring genes. Two scenarios were considered: the upstream gene and the downstream TE-related gene were in the same orientation (or the same strand); and these two were in opposite orientations. In both cases, there was a clear positive association between gene transcription and the neighboring TE-related gene transcription (Figure <figr fid="F9">9</figr>). However, the effect was more significant if the non-TE-related and TE-related genes were in the same orientation. An increase of 16% of downstream transcription was found when transcribed upstream genes were in the same orientation (<it>P </it>&lt; 10<sup>-16</sup>, by Welch two-sample <it>t</it>-test). In the case of opposite orientation, an increase of 9% in transcription level was found (<it>P </it>&lt; 10<sup>-16</sup>). By comparing the effects of transcribed upstream gene orientation in these two scenarios, we found that the same orientation corresponded to 6% more expression than the other scenario (<it>P </it>&lt; 10<sup>-7</sup>). There is no clear distinction between the two scenarios for TE-related genes with untranscribed upstream genes (26% versus 27%; <it>P </it>= 0.3). The orientation of downstream non-TE-related genes did not significantly affect the transcription of upstream TE-related genes.</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Effects of relative orientation of upstream genes on transcription of downstream TE-related genes</p>
               </caption>
               <text>
                  <p>Effects of relative orientation of upstream genes on transcription of downstream TE-related genes. All TE-related genes were divided into two groups according to the relative orientations of themselves and upstream genes. Portions of transcribed TE-related genes were calculated for those with transcribed upstream genes and those with silent upstream genes in both groups. TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-9"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Functions of <it>cis</it>-elements in transcription</p>
            </st>
            <p>To explore further the possible underlying mechanisms that control the transcription of TE-related genes, we attempted to identify possible involvement of <it>cis</it> elements in transcription. To this end, we searched for enrichment of <it>cis</it> elements in the promoter regions of transcribed TE-related genes. We grouped TE-related genes based on the number of samples with transcription and searched for frequency of occurrence of all reported <it>cis</it> elements within each group. Among 439 reported elements in plants <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, nine of them exhibited marked enrichment in TE-related genes with active transcription (Figure <figr fid="F10">10</figr>), whereas no element was found with similar enrichment patterns from randomized datasets. In addition, most of these elements were found by searching for enrichment in active members in Ty1/<it>copia</it>, Ty3/<it>gypsy</it>, or the CACTA superfamily. TATA box was identified, which is usually found in the 5'-upstream region of eukaryotic genes and is critical for accurate initiation of transcription <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. The T-box is part of the scaffold/matrix attachment region, which was recently found to regulate the transcription of nearby genes in <it>Arabidopsis </it><abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. We also identified the enrichment of motifs (G-box, Myb binding site, and ATHB5-core) for the major plant transcription factor families (bHLH, Myb, and homeodomain-leucine zipper). In addition, enrichment was also detectable from the light response motifs Hex-motif, pathogen response motif GCC-core, gibberellin response motif Pyrimidine-box, and meristem specific motif site IIa.</p>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>Motifs with enrichment in transcribed TE-related gene promoters</p>
               </caption>
               <text>
                  <p>Motifs with enrichment in transcribed TE-related gene promoters. Genes were grouped according to the number of samples that show transcriptional activity. Enrichment was measured as the frequency of a motif in gene promoters of a certain group. TE, transposable element.</p>
               </text>
               <graphic file="gb-2007-8-2-r28-10"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Transcription profiles of TE-related genes in rice</p>
            </st>
            <p>TEs account for an overwhelming proportion of plant genomes. To ensure the viability of their host and hence their own survival, the transposition of TEs should be tightly controlled <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Transcribed autonomous TEs among TE-related genes have the potential to self-activate or activate transcription of related nonautonomous TEs. Transcriptional regulation is therefore one major control step used by plants, but it remains insufficiently understood. The recently available rice genome sequence has enabled us to characterize TE-related gene transcription on a genome-wide scale.</p>
            <p>Using 70-mer oligonucleotide microarrays covering more than 2,000 rice TE-related genes, we surveyed the transcription profiles under a wide range of organ samples under various conditions. Considering that TE-derived cellular genes are relatively rare, autonomous TEs probably contribute to most of these TE-related genes. Genome profiling revealed that 25% to 30% of the TE-related genes were transcribed in one sample, which was much lower than the corresponding percentage of non-TE-related genes (Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). Moreover, TE-related genes differed from their non-TE-related counterparts in two additional aspects. First, TE-related genes tended to be transcribed in only a subset of organs or developmental stages, whereas non-TE-related genes had transcription in more samples on average (Figure <figr fid="F1">1</figr> and Figure <figr fid="F2">2</figr>). Second, transcribed TE-related genes exhibited weaker transcription overall compared with non-TE-related genes in all of the samples we profiled (Figure <figr fid="F2">2</figr>). It worth noting that our estimation of TE-related gene transcription was biased toward low-copy elements, because it was difficult to distinguish transcripts among recently duplicated high-copy TE-related genes, which share high sequence similarity within clades. It has been reported in <it>Arabidopsis </it>and <it>Drosophila </it>that the activity of TE elements may reduce as the copy number increases <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp>. Therefore, we expect the transcriptional activity of those high-copy TE-related genes will be lower than for low-copy ones.</p>
            <p>Among TE-related genes, a smaller proportion of type I than type II genes were transcribed (Figure <figr fid="F1">1a</figr>), a discrepancy that resulted primarily from the strong transcription of MULE and CACTA superfamilies as well as unclassified type II members. It is interesting to note that all TE-related gene superfamilies with potential to severely expand, including all type I TE-related genes and <it>PIF</it>/<it>Pong</it>-like, <it>Mariner</it>-like and <it>Helitron</it>-like type II TE-related genes, were more tightly controlled at the transcription level. Type I TE-related genes are amplified through a copy-and-paste mechanism <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. <it>PIF</it>/<it>Pong</it>-like and <it>Mariner</it>-like superfamilies regulate the activity of MITEs, which dominate the rice genome <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Members of the <it>Helitron</it>-like superfamily go through a unique rolling cycle replication to rapidly amplify themselves <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
            <p>Many TE-related genes exhibit organ-specific, growth stage-specific, and stress-specific expression profiles in our collection of samples. These genes exist in all superfamilies, as shown in Figures <figr fid="F3">3</figr> to <figr fid="F7">7</figr>. A number of them, again from various superfamilies of both type I and type II TE-related genes, exhibit clear induction in cultured cells, in certain organs, or in certain stress challenged organs (Figure <figr fid="F2">2</figr>). The precise biologic significance for this observation remains to be elucidated.</p>
            <p>It is important to note that transcriptional activity does not necessarily correspond to transpositional activity. Transcription is just the first of several steps required for the transposition of type I and type II TEs <abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr></abbrgrp>. Active transcription and even translation of TE-related genes has been reported in several isolated cases <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, but only in a few cases was transposition actually confirmed by observed copy number change <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A two-step regulatory mechanism was therefore proposed for retrotransposons <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. In this model, some elements may have slipped the leash of transcriptional gene silencing <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Nevertheless, they can be controlled by post-transcriptional gene silencing <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. We observed transcription of all major TE-related gene superfamilies in rice, but it is probable that most of them, if not all, are not actively transpositioned. It is therefore likely that such a two-step regulation exists not only for retrotransposons but also for other classes. Post-transcriptional regulation, which is still largely unexplored, is thought to repress transposition activity further <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Transcription of domesticated TE-related genes in the rice genome</p>
            </st>
            <p>It is well accepted that some TE-related genes have actually acquired host functions and play physiologic roles in the host. They can either be derived from TEs or include hijacked TEs or TE fragments by cellular genes. Not surprisingly, we have discovered active transcription of all potential domesticated TE genes previously described in <it>Arabidopsis </it>and rice. Interestingly, domesticated TE genes tend to be within actively transcribed TE gene clades. The rice homologs of the two reported cases of domesticated transposons in <it>Arabidopsis</it>, namely <it>FAR1</it>/<it>FHY3 </it>and <it>DAYSLEEPER</it>, were located in two actively transcribed clades. <it>MUG1</it>, a putative domesticated gene revealed by cross-species sequence comparison analysis, was shown to be transcribed from our data and located within an actively transcribed clade. These examples may suggest that actively transcribed clades of TE-related genes are a rich source for domesticated TE genes. In fact, several other actively transcribed clades have been observed, especially for the MULE and CACTA superfamilies, from our analysis (Figures <figr fid="F4">4</figr> and <figr fid="F5">5</figr>). It is reasonable to suspect that those transcriptionally active clades may contain genes co-opted by hosts to serve adaptive functions. This notion will be worth testing in future research. Clearly, the combination of transcriptional analysis with phylogenetic analysis is instrumental in identifying those TE-derived genes with adapted host function.</p>
            <p>A specific mechanism for the evolution of new genes by mobile DNA elements is through their ability to acquire and fuse fragments of genes to new genomic locations, as represented by Pack-MULE <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. By exploring the transcriptional activity of a subset of Pack-MULEs, we have shown that their transcriptional activity falls in between the levels of TE-related and non-TE-related gene models (Figure <figr fid="F7">7</figr>). This result suggests that many of them might not have biologic functions, and both pseudogenes and evolving new functional genes exist among these annotated Pack-MULEs. Alternatively, functional diversification of recently evolved genes may be another explanation, because newly formed genes usually have more specific expression profiles <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Mechanisms controlling TE-related genes transcription</p>
            </st>
            <p>The presence of such a diverse array of transcribed TE-related genes raises questions regarding the mechanisms that control the transcription. At the chromatin level, we found that actively transcribed TE-related genes have reduced levels of H3K9m and DNA methylation. This finding indicates that proper chromatin modification status is usually required for transcription of TE-related genes. However, histone and DNA modifications are unlikely to be efficient markers for distinguishing between autonomous TE genes and TE-derived cellular genes.</p>
            <p>Consistent with the existence of chromatin-level control, we found that transcribed TE-related genes tend to be located near to transcribed neighboring genes. It is possible that the status of a chromatin domain is marked by histone and DNA modifications. Such chromatin status affects a few genes located in the same or neighboring chromatin domains. The orientation of upstream genes affects downstream TE-related gene transcription (Figure <figr fid="F9">9</figr>). If both genes are in the same orientation, then the downstream TE-related gene would have a greater chance of being transcribed. Readthrough of TE-related genes derived from upstream genes may account for this difference, besides possible chromatin effects.</p>
            <p>Small RNA has been suggested to be a key regulator to silence TE elements transcriptionally and post-transcriptionally <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B81">81</abbr></abbrgrp>. However, only a few examples were found in our dataset. Small RNAs are known to be highly abundant in the <it>Arabidopsis </it>genome <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>, whereas their counterparts in rice are yet to be discovered. A full catalog of small RNAs in rice will provide a better picture of their role in TE transcription.</p>
            <p>Another possible mechanism controlling TE transcription is the existence of <it>cis </it>elements in their promoter regions. Examples have been found previously for LTR retrotransposons, which employ alternating <it>cis </it>elements present in their LTRs <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr><abbr bid="B86">86</abbr></abbrgrp>. Here, we identified nine <it>cis </it>elements that were clearly enriched in the promoter regions of transcribed TE-related genes. Among them, both basic transcription-related <it>cis </it>elements and elements that respond to developmental or environmental regulation are found to be enriched in the upstream regions of those transcribed TE-related genes (Figure <figr fid="F10">10</figr>). In addition, these enriched <it>cis </it>elements are probably not limited to a certain superfamily but rather widely spread in several superfamilies. Taken together, our data show that transcription of TE-related genes, mostly autonomous TE genes, in rice is a complex process, which is controlled, at least in part, by chromatin-level regulation and <it>cis </it>elements in promoters.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Microarray analysis</p>
            </st>
            <p>The rice 70-mer oligonucleotide set was described previously <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Briefly, 70-mer oligonucleotides were designed based on a combination of FGENESH predicted genes from an improved shotgun sequence <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and the available full-length cDNAs and ESTs <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>. Designed 70-mer oligonucleotides correspond to the sequence within the coding region of genes, and the design was corrected for such factors as oligo cross-hybridization, uniform TM value, GC content, and hairpin/stem nucleotide number. All oligonucleotides were remapped to TIGR rice genome annotation version 3.1 genes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> using BLAST. We requested greater than 90% alignment of a 70-mer oligonucleotide probe to a gene during the remapping. Moreover, only those 70-mer probes without a greater than 80% second-best aligned gene were considered to be free from cross-hybridization. These criteria were selected because a mismatch of 20% removes more than 90% of the hybridization signals, whereas a 10% mismatch retains at least half of the hybridization signals <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>.</p>
            <p>TE-related genes were identified in accordance with TIGR annotation, with supplemental literature review of published TE-related genes. A total of 2,191 TE-related genes are represented by at least one oligonucleotide free from cross-hybridization. In addition, there are 1,966 70-mer oligonucleotides mapped to several but only TE-related genes. These oligonucleotides represent another 9,396 TE-related genes.</p>
            <p>Oligonucleotides were custom synthesized by Operon Biotechnologies Inc. (Huntsville, AL, USA) and printed onto poly-L-lysin coated microscope slides using a contact microarrayer. The same recommended set of 12 unique negative control 70-mer oligonucleotides based on heterologous genes <abbrgrp><abbr bid="B89">89</abbr></abbrgrp> were included in all slides. There were 240 negative control spots on each slide.</p>
         </sec>
         <sec>
            <st>
               <p>Microarray data and plant materials</p>
            </st>
            <p>Microarray experiments and detailed rice sample preparation were described previously <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Samples include organs harvested under normal growth conditions (seeding stage shoot, tillering stage shoot, tillering stage root, heading stage flag leaf, heading stage panicle, and filling stage panicle), organs under conditions of salinity or drought (tillering stage shoot, heading stage flag leaf, and heading stage panicle), and cultured cells (suspension-cultured cells, somatic root in culture, and somatic shoot in culture). A summary is provided in Table <tblr tid="T2">2</tblr>. The microarray data discussed in this publication have been deposited in NCBI Gene Expression Omnibus <abbrgrp><abbr bid="B90">90</abbr></abbrgrp> and are accessible through GEO series numbers GSE2360, GSE2691, GSE6533, and GSE6552.</p>
         </sec>
         <sec>
            <st>
               <p>Microarray data processing</p>
            </st>
            <p>Microarray spot intensity signals were acquired using Axon GenePix Pro 3.0 software package (Molecular Devices, Sunnyvale, CA, USA). To identify and remove systematic sources of variation, including dye and spatial effects, spot intensities from the GenePix Pro output files of all repeats of a given sample pair were normalized using limma, a software package for the analysis of gene expression microarray <abbrgrp><abbr bid="B91">91</abbr></abbrgrp>. This normalization process identified and ameliorated spatial, intensity-based, and dye-specific artifacts using multiple step corrections. To determine objectively whether a gene exhibited significant expression in a given sample, we followed a method that relied on negative control spots and data reproducibility <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. To estimate nonspecific hybridization, a distribution of normalized intensities was obtained from the subset of negative control spots present on each array slide. From this distribution, we chose an intensity cutoff at which less than 10% of the distribution was greater than or equal to this threshold. Expression of a gene was only considered detectable if it was above the threshold in two or more repeats out of the three. These criteria had been demonstrated suitable for oligonucleotide arrays with an error rate range of 1% to 3% false negatives <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. RT-PCR results and independent analysis using different microarrays and statistical approaches <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> further supported this threshold.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence analysis</p>
            </st>
            <p>TE family classification was according to TIGR annotation <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Hand analysis led to the identification of another 208 TE-related genes according to published sequences and BLAST search. Multiple sequence alignments were conducted using Clustal W <abbrgrp><abbr bid="B92">92</abbr></abbrgrp>. The weighing matrix used was Gonnet Pam 250 with the penalty of gap opening 10 and gap extension 0.2. Phylogenetic trees were generated based on the neighbor-joining method, using PAUP* version 4.0b10 with default parameters <abbrgrp><abbr bid="B93">93</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Cluster analysis</p>
            </st>
            <p>Cluster analysis was applied to all TE-related genes and 1,353 randomly selected non-TE-related genes showing expression in at least one sample. Average normalized log-transformed expression intensities were subjected to cluster analysis. For hierarchical clustering, Pearson correlation was used to compute similarities, and a complete linkage clustering algorithm was used. Cluster analysis was performed using the software Cluster <abbrgrp><abbr bid="B94">94</abbr></abbrgrp> and visualized using custom scripts.</p>
         </sec>
         <sec>
            <st>
               <p>RT-PCR analysis</p>
            </st>
            <p>Total RNA was extracted from independently prepared rice seedling shoots using Qiagen RNeasy kit (Qiagen, Valencia, CA, USA). After DNase I treatment, total RNA was used for cDNA synthesis using Superscript II (Invitrogen, Carlsbad, CA, USA) in accordance with the manufacturer's protocol. PCR primers were designed according to sequence using Primer3 <abbrgrp><abbr bid="B95">95</abbr></abbrgrp>. The amplification reaction was carried out for 35 cycles and at an annealing temperature of 55&#176;C. Products were separated by 1% agarose gel electrophoresis. Negative controls using mock cDNA synthesis products without reverse transcriptase were included for all genes to detect potential genomic DNA contamination.</p>
         </sec>
         <sec>
            <st>
               <p>Histone and DNA methylation</p>
            </st>
            <p>ChIP was carried out as described elsewhere <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> using seedling shoots and cultured cells. Histone H3 anti-dimethyl lysine-4 or anti-dimethyl lysine-9 antibodies (Upstate, Avon, NY, USA) were used to precipitate genomic DNA, which was resuspended in water for PCR analysis. The same PCR and gel electrophoresis conditions were used as for RT-PCR analysis.</p>
            <p>Methylation of DNA was assessed by McrBC digestion following a previously published protocol <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>. Genomic DNA was isolated from seedling shoots and cultured cells using Qiagen DNeasy plant kit and divided into two equal samples. One sample was digested with McrBC, a methylation-dependent restriction enzyme that cuts the sequence A/G 5 mC (New England Biolabs, Beverly, MA, USA). Both digested and untreated samples were subject to PCR amplification as described previously. Successful amplification after digestion indicates lack of methylation.</p>
         </sec>
         <sec>
            <st>
               <p>Motif search</p>
            </st>
            <p>The genome sequences 2 kilobases upstream of the annotated translation start site were retrieved from the TIGR database. Both DNA strands were searched for known plant motifs using the PLACE database <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>. Enrichment levels were further calculated using custom scripts <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> shows degrees of lineage-specific transcription in the Ty1/<it>copia </it>superfamily. Additional data file <supplr sid="S2">2</supplr> shows degrees of lineage-specific transcription in the Ty3/<it>gypsy </it>superfamily. Additional data file <supplr sid="S3">3</supplr> shows degrees of lineage-specific transcription in the CACTA superfamily. Additional data file <supplr sid="S4">4</supplr> shows degrees of lineage-specific transcription in the <it>Mariner </it>superfamily. Additional data file <supplr sid="S5">5</supplr> shows degrees of lineage-specific transcription in the <it>Helitron </it>superfamily.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Degrees of lineage-specific transcription in the Ty1/<it>copia </it>superfamily</p>
            </caption>
            <text>
               <p>The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using the neighbor-joining methods and rooted with human <it>L1</it>. Bootstrap values were calculated from 300 replicates. The sample numbers are identical to those in Table <tblr tid="T2">2</tblr>. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. Names in parenthesis indicate members not covered by microarray. *Previously reported members with transcription or transposition.</p>
            </text>
            <file name="gb-2007-8-2-r28-S1.xml">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Degrees of lineage-specific transcription in the Ty3/<it>gypsy </it>superfamily</p>
            </caption>
            <text>
               <p>The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with human <it>L1</it>. Bootstrap values were calculated from 300 replicates. Sample numbers are identical to those in Table <tblr tid="T2">2</tblr>. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. Names in parenthesis indicate members not covered by microarray. *Previously reported members with transcription or transposition.</p>
            </text>
            <file name="gb-2007-8-2-r28-S2.xml">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Degrees of lineage-specific transcription in the CACTA superfamily</p>
            </caption>
            <text>
               <p>The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with soybean <it>Soymar1</it>. Bootstrap values were calculated from 300 replicates. Sample numbers are identical to those in Table <tblr tid="T2">2</tblr>. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units. Names of previously reported members are shown. Names in parenthesis indicate members not covered by microarray. *Previously reported members with transcription or transposition.</p>
            </text>
            <file name="gb-2007-8-2-r28-S3.xml">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Degrees of lineage-specific transcription in the <it>Mariner </it>superfamily</p>
            </caption>
            <text>
               <p>The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with soybean <it>Soymar1</it>. Bootstrap values were calculated from 300 replicates. Sample numbers are identical to those in Table <tblr tid="T2">2</tblr>. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units.</p>
            </text>
            <file name="gb-2007-8-2-r28-S4.gif">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Degrees of lineage-specific transcription in the <it>Helitron </it>superfamily</p>
            </caption>
            <text>
               <p>The phylogenetic tree was generated from a multiple alignment of conceptually translated sequences by using neighbor-joining methods and rooted with <it>C. elegance CeHEL1</it>. Bootstrap values were calculated from 300 replicates. Sample numbers are identical to those in Table <tblr tid="T2">2</tblr>. Shades of gray indicate the magnitude of transcription signals, which are based on microarray hybridization signals without units.</p>
            </text>
            <file name="gb-2007-8-2-r28-S5.eps">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We gratefully acknowledge Junli Zhou, Ning Su and Lei Li for sharing unpublished data, Junli Zhou and Xueyong Li for technical assistance in histone and DNA methylation experiments, and Valerie J Karplus and Yeqin Ma for critical reading of this manuscript. This work was supported by National Science Foundation Plant Genome Program Grant DBI-0421675 to XWD.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The map-based sequence of the rice genome.</p>
            </title>
            <aug>
               <au>
                  <cnm>International Rice Genome Sequencing Project</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>436</volume>
            <fpage>793</fpage>
            <lpage>800</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16100779</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The Genomes of <it>Oryza sativa</it>: a history of duplications.</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e38</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">546038</pubid>
                  <pubid idtype="pmpid" link="fulltext">15685292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The institute for genomic research Osa1 rice genome annotation database.</p>
            </title>
            <aug>
               <au>
                  <snm>Yuan</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Ouyang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Maiti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hamilton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sultana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>138</volume>
            <fpage>18</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1104156</pubid>
                  <pubid idtype="pmpid" link="fulltext">15888674</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Ouyang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buell</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32 (Database isssue)</volume>
            <fpage>D360</fpage>
            <lpage>D363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308833</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681434</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Plant transposable elements: where genetics meets genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>329</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11988759</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The distribution of genes in the genomes of Gramineae.</p>
            </title>
            <aug>
               <au>
                  <snm>Barakat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carels</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bernardi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>6857</fpage>
            <lpage>6861</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21249</pubid>
                  <pubid idtype="pmpid" link="fulltext">9192656</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Rice transposable elements: a survey of 73,000 sequence-tagged-connectors.</p>
            </title>
            <aug>
               <au>
                  <snm>Mao</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Budiman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Woo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sasinowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Presting</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Frisch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Goff</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>982</fpage>
            <lpage>990</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310901</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1660</fpage>
            <lpage>1676</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311155</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591643</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Survey of transposable elements from rice genomic sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Turcotte</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2001</pubdate>
            <volume>25</volume>
            <fpage>169</fpage>
            <lpage>179</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11169193</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Rapid recent growth and divergence of rice nuclear genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>12404</fpage>
            <lpage>12410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">515075</pubid>
                  <pubid idtype="pmpid" link="fulltext">15240870</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Role of transposable elements in heterochromatin and epigenetic control.</p>
            </title>
            <aug>
               <au>
                  <snm>Lippman</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gendrel</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vaughn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Dedhia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lavine</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mittal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kasschau</snm>
                  <fnm>KD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>471</fpage>
            <lpage>476</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15269773</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Using rice to understand the origin and amplification of miniature inverted repeat transposable elements (MITEs).</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <fpage>115</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15003209</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Gene movement by <it>Helitron </it>transposons contributes to the haplotype variability of maize.</p>
            </title>
            <aug>
               <au>
                  <snm>Lai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Messing</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dooner</snm>
                  <fnm>HK</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>9068</fpage>
            <lpage>9073</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1157042</pubid>
                  <pubid idtype="pmpid" link="fulltext">15951422</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Gene duplication and exon shuffling by <it>helitron</it>-like transposons generate intraspecies diversity in maize.</p>
            </title>
            <aug>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pea</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fengler</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zuccolo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>997</fpage>
            <lpage>1002</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16056225</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>High rate of chimeric gene origination by retroposition in plant genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vang</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>1791</fpage>
            <lpage>1802</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16829590</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The role of RNA interference in heterochromatic silencing.</p>
            </title>
            <aug>
               <au>
                  <snm>Lippman</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Martienssen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>364</fpage>
            <lpage>370</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15372044</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Silencing of transposons in plant genomes: kick them when they're down.</p>
            </title>
            <aug>
               <au>
                  <snm>Zilberman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>249</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545787</pubid>
                  <pubid idtype="pmpid" link="fulltext">15575975</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Silencing of transposable elements in plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hirochika</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>2001</pubdate>
            <volume>6</volume>
            <fpage>527</fpage>
            <lpage>534</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11701381</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Retrotransposons of rice involved in mutations induced by tissue culture.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirochika</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sugimoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Otsuki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tsugawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kanda</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>7783</fpage>
            <lpage>7788</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">38825</pubid>
                  <pubid idtype="pmpid" link="fulltext">8755553</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Two-step regulation and continuous retrotransposition of the rice LINE-type retrotransposon <it>Karma</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Komatsu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shimamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kyozuka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <fpage>1934</fpage>
            <lpage>1944</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">167180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12897263</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>An active DNA transposon family in rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hirochika</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>McCouch</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>163</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12520302</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The plant MITE <it>mPing </it>is mobilized in anther culture.</p>
            </title>
            <aug>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Terauchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirano</snm>
                  <fnm>H-Y</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>167</fpage>
            <lpage>170</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12520303</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Mobilization of a transposon in the rice genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okumoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Horibata</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yamahira</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Teraishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tanisaka</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>170</fpage>
            <lpage>172</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12520304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Identification of an active transposon in intact rice plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Fujino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sekiguchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kiguchi</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2005</pubdate>
            <volume>273</volume>
            <fpage>150</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15803319</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Activation of plant retrotransposons under stress conditions.</p>
            </title>
            <aug>
               <au>
                  <snm>Grandbastien</snm>
                  <fnm>M-A</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>1998</pubdate>
            <volume>3</volume>
            <fpage>181</fpage>
            <lpage>187</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Transposons, tandem repeats, and the silencing of imprinted genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Martienssen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lippman</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ronemus</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vaughn</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cold Spring Harb Symp Quant Biol</source>
            <pubdate>2004</pubdate>
            <volume>69</volume>
            <fpage>371</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16117670</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Mutator transposase is widespread in the grasses.</p>
            </title>
            <aug>
               <au>
                  <snm>Lisch</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Freeling</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Langham</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Choy</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>125</volume>
            <fpage>1293</fpage>
            <lpage>1303</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">65609</pubid>
                  <pubid idtype="pmpid" link="fulltext">11244110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Active retrotransposons are a common feature of grass genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Vicient</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>J&#228;&#228;skel&#228;inen</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kalendar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schulman</snm>
                  <fnm>AH</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>125</volume>
            <fpage>1283</fpage>
            <lpage>1292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">65608</pubid>
                  <pubid idtype="pmpid" link="fulltext">11244109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Transcriptionally active transposable elements in recent hybrid sugarcane.</p>
            </title>
            <aug>
               <au>
                  <snm>de Araujo</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>de Jesus</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Saccaro</snm>
                  <fnm>NL</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Kajihara</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Massa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>de Felix</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Drummond</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Falco</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Chabregas</snm>
                  <fnm>SM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>44</volume>
            <fpage>707</fpage>
            <lpage>717</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16297064</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A tiling microarray expression analysis of rice chromosome 4 suggests a chromosome-level regulation of transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Jiao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <fpage>1641</fpage>
            <lpage>1657</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1143067</pubid>
                  <pubid idtype="pmpid" link="fulltext">15863518</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Consistent over-estimation of gene number in complex plant genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Coleman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ramakrishna</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <fpage>732</fpage>
            <lpage>736</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15491923</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The <it>FHY3 </it>and <it>FAR1 </it>genes encode transposase-related proteins involved in regulation of gene expression by the phytochrome A-signaling pathway.</p>
            </title>
            <aug>
               <au>
                  <snm>Hudson</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Lisch</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>453</fpage>
            <lpage>471</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12753585</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
  