<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-9-r198</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Evolutionary dynamics of eukaryotic selenoproteomes: large selenoproteomes may associate with aquatic life and small with terrestrial life</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Lobanov</snm>
               <mi>V</mi>
               <fnm>Alexey</fnm>
               <insr iid="I1"/>
               <email>lobanov@genomics.unl.edu</email>
            </au>
            <au id="A2">
               <snm>Fomenko</snm>
               <mi>E</mi>
               <fnm>Dmitri</fnm>
               <insr iid="I1"/>
               <email>dfomenko@genomics.unl.edu</email>
            </au>
            <au id="A3">
               <snm>Zhang</snm>
               <fnm>Yan</fnm>
               <insr iid="I1"/>
               <email>yzhang@genomics.unl.edu</email>
            </au>
            <au id="A4">
               <snm>Sengupta</snm>
               <fnm>Aniruddha</fnm>
               <insr iid="I2"/>
               <email>sengupta@mail.nih.gov</email>
            </au>
            <au id="A5">
               <snm>Hatfield</snm>
               <mi>L</mi>
               <fnm>Dolph</fnm>
               <insr iid="I2"/>
               <email>hatfield@dc37a.nci.nih.gov</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Gladyshev</snm>
               <mi>N</mi>
               <fnm>Vadim</fnm>
               <insr iid="I1"/>
               <email>vgladyshev1@unl.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biochemistry, University of Nebraska, Lincoln, NE 68588, USA</p>
            </ins>
            <ins id="I2">
               <p>Section on the Molecular Biology of Selenium, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>9</issue>
         <fpage>R198</fpage>
         <url>http://genomebiology.com/2007/8/9/R198</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17880704</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-9-r198</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>18</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>19</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>19</day>
               <month>09</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Lobanov et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Selenoproteome evolution</p>
      </shorttitle>
      <shortabs>
         <p>In silico and metabolic labeling studies of the selenoproteomes of several eukaryotes revealed distinct selenoprotein patterns as well as an ancient origin of selenoproteins and massive, independent losses in land plants, fungi, nematodes, insects and some protists, suggesting that the environment plays an important role in selenoproteome evolution.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Selenocysteine (Sec) is a selenium-containing amino acid that is co-translationally inserted into nascent polypeptides by recoding UGA codons. Selenoproteins occur in both eukaryotes and prokaryotes, but the selenoprotein content of organisms (selenoproteome) is highly variable and some organisms do not utilize Sec at all.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We analyzed the selenoproteomes of several model eukaryotes and detected 26 and 29 selenoprotein genes in the green algae <it>Ostreococcus tauri </it>and <it>Ostreococcus lucimarinus</it>, respectively, five in the social amoeba <it>Dictyostelium discoideum</it>, three in the fly <it>Drosophila pseudoobscura</it>, and 16 in the diatom <it>Thalassiosira pseudonana</it>, including several new selenoproteins. Distinct selenoprotein patterns were verified by metabolic labeling of <it>O. tauri </it>and <it>D. discoideum </it>with <sup>75</sup>Se. More than half of the selenoprotein families were shared by unicellular eukaryotes and mammals, consistent with their ancient origin. Further analyses identified massive, independent selenoprotein losses in land plants, fungi, nematodes, insects and some protists. Comparative analyses of selenoprotein-rich and -deficient organisms revealed that aquatic organisms generally have large selenoproteomes, whereas several groups of terrestrial organisms reduced their selenoproteomes through loss of selenoprotein genes and replacement of Sec with cysteine.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our data suggest many selenoproteins originated at the base of the eukaryotic domain and show that the environment plays an important role in selenoproteome evolution. In particular, aquatic organisms apparently retained and sometimes expanded their selenoproteomes, whereas the selenoproteomes of some terrestrial organisms were reduced or completely lost. These findings suggest a hypothesis that, with the exception of vertebrates, aquatic life supports selenium utilization, whereas terrestrial habitats lead to reduced use of this trace element due to an unknown environmental factor.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010001">Biochemistry and structural biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Selenium is an essential trace element in many, but not all, life forms. Its essentiality stems from its presence in natural proteins in the form of selenocysteine (Sec), a rare amino acid that differs chemically from serine or cysteine (Cys) by a single atom (that is, Se instead of O or S) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Sec is known as the 21st amino acid in the genetic code as it has its own biosynthetic machinery, a tRNA and an elongation factor, and is inserted into nascent polypeptides co-translationally in response to the Sec codon, UGA <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Selenoproteins often escape the attention of genome annotators, because in-frame UGA codons are interpreted as stop signals. However, several bioinformatics tools have recently been developed that help identify these genes <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. The use of these methods has begun to shed light on proteins and processes dependent on selenium, as well as on the occurrence and distribution of these processes in various life forms.</p>
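         <p>As a minimal illustration of why such genes are missed, the following sketch (Python) lists the in-frame UGA codons that lie upstream of the final codon of a coding sequence. The function name and the input format are assumptions made for this example and are not part of any tool cited here. A conventional annotation pipeline would truncate the open reading frame at the first reported position.</p>
         <p>
```python
def in_frame_uga_positions(cds: str) -> list[int]:
    """Return 0-based codon indices of in-frame UGA codons, excluding the final codon.

    Hypothetical helper for illustration: `cds` is a coding sequence (DNA or RNA)
    that starts at the first base of the start codon.
    """
    rna = cds.upper().replace("T", "U")
    # Split into complete codons; any trailing partial codon is ignored.
    usable = len(rna) - len(rna) % 3
    codons = [rna[i:i + 3] for i in range(0, usable, 3)]
    # The last codon is taken to be the true terminator, so only internal
    # UGA codons (candidate Sec codons) are reported.
    return [idx for idx, codon in enumerate(codons[:-1]) if codon == "UGA"]


if __name__ == "__main__":
    # AUG UGA CCC UAA: the second codon is an in-frame UGA.
    print(in_frame_uga_positions("ATGTGACCCTAA"))
```
         </p>
         <p>An in-frame UGA found this way is only a candidate Sec codon; in eukaryotes, a SECIS element in the same transcript is also needed to support the assignment.</p>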
         <p>Sec is typically found in active sites of redox enzymes, which are functionally similar to thiol-based oxidoreductases <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Sec-containing proteins occur in all major lines of descent (that is, eukaryota, eubacteria and archaea), but not all organisms have these proteins. Prokaryotic genomes have been extensively analyzed for the occurrence of selenoprotein genes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, but among eukaryotes, only the genomes of mammals (human, mouse) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, nematodes (<it>Caenorhabditis elegans </it>and <it>C. briggsae</it>) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, fruit fly (<it>Drosophila melanogaster</it>) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, green alga (<it>Chlamydomonas reinhardtii</it>) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Plasmodia <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> have been analyzed with regard to the entire set of selenoproteins (selenoproteomes). In addition, the genomes of the plant <it>Arabidopsis thaliana </it>and the yeast <it>Saccharomyces cerevisiae </it>have been scanned for the occurrence of selenoprotein genes and Sec biosynthetic/insertion machinery genes and found to have neither <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Selenoproteome analyses also revealed that various organisms have substantially different sets of selenoproteins. One example of uneven selenoprotein occurrence is selenoprotein U (SelU), which occurs in fish, birds and some unicellular eukaryotes, but is present in the form of a Cys-containing homolog in mammals and many other eukaryotes. An even narrower occurrence has been described for SelJ and Fep15 <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>In this study, we characterized the selenoproteomes encoded in several completely sequenced eukaryotic genomes. Detailed analyses of these selenoproteomes and comparison with those of other eukaryotic model organisms revealed an ancient origin of most eukaryotic selenoproteins and a possibility of increased Sec utilization in aquatic environments and decreased use of Sec in terrestrial habitats. These studies provide important insights into selenoprotein origin and dynamics of selenoprotein evolution.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Eukaryotic selenoproteomes</p>
            </st>
            <p>Several eukaryotes have been previously analyzed for their selenoprotein content (selenoproteomes). These studies identified 24-25 selenoproteins in mammals and 0-4 selenoproteins in other organisms. It is generally thought that many eukaryotic selenoproteins evolved in vertebrates, but evolutionary paths have not been examined for the majority of these proteins. In this work, we analyzed the selenoproteomes of several additional model eukaryotes whose genome sequences have been completed. These included marine algae (<it>Ostreococcus tauri </it>and <it>O. lucimarinus</it>), a diatom (<it>Thalassiosira pseudonana</it>), a soil amoeba (<it>Dictyostelium discoideum</it>), an insect (<it>Drosophila pseudoobscura</it>), and a red alga (<it>Cyanidioschyzon merolae</it>).</p>
            <sec>
               <st>
                  <p>Drosophila pseudoobscura</p>
               </st>
               <p>The <it>D. pseudoobscura </it>subgroup <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> is found mainly in the temperate and tropical zones of the New World <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Application of an earlier version of SECISearch to the <it>D. melanogaster </it>genome identified three selenoprotein genes (SelK/G-rich, SelH/BthD and SPS2); however, it was not known whether this set represents the entire <it>Drosophila </it>selenoproteome. We applied an advanced version of SECISearch (see Materials and methods and Additional data file 1) to analyze the <it>D. pseudoobscura </it>genome and, in addition, analyzed <it>D. pseudoobscura </it>and <it>D. melanogaster </it>genomes in parallel to identify evolutionarily conserved selenocysteine insertion sequence (SECIS) elements using relaxed SECIS criteria. These searches resulted in the same, already known set of three selenoproteins (Table <tblr tid="T1">1</tblr>), suggesting that the selenoproteome of insects of the <it>Drosophila </it>genus consists of these three proteins. By homology analyses, we then identified three selenoproteins in a mosquito, <it>Anopheles gambiae</it>, and one in a honey bee, <it>Apis mellifera</it>.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Identification of selenoprotein genes in eukaryotic model organisms</p>
                  </caption>
                  <tblbdy cols="7">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2" ca="center">
                           <p>Loose pattern</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>Default pattern</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2">
                           <hr/>
                        </c>
                        <c cspan="2">
                           <hr/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Organism name</p>
                        </c>
                        <c ca="center">
                           <p>Genome size (kb)</p>
                        </c>
                        <c ca="center">
                           <p>Primary sequence criteria</p>
                        </c>
                        <c ca="center">
                           <p>Energy criteria</p>
                        </c>
                        <c ca="center">
                           <p>Primary sequence criteria</p>
                        </c>
                        <c ca="center">
                           <p>Energy criteria</p>
                        </c>
                        <c ca="center">
                           <p>Number of selenoproteins</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="7">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>O. lucimarinus</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>13,393</p>
                        </c>
                        <c ca="center">
                           <p>31,132</p>
                        </c>
                        <c ca="center">
                           <p>7,541</p>
                        </c>
                        <c ca="center">
                           <p>2,120</p>
                        </c>
                        <c ca="center">
                           <p>464</p>
                        </c>
                        <c ca="center">
                           <p>29</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>O. tauri</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>16,414</p>
                        </c>
                        <c ca="center">
                           <p>30,381</p>
                        </c>
                        <c ca="center">
                           <p>7,379</p>
                        </c>
                        <c ca="center">
                           <p>1,934</p>
                        </c>
                        <c ca="center">
                           <p>401</p>
                        </c>
                        <c ca="center">
                           <p>26</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>T. pseudonana</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>32,577</p>
                        </c>
                        <c ca="center">
                           <p>81,040</p>
                        </c>
                        <c ca="center">
                           <p>8,977</p>
                        </c>
                        <c ca="center">
                           <p>3,129</p>
                        </c>
                        <c ca="center">
                           <p>675</p>
                        </c>
                        <c ca="center">
                           <p>16</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. discoideum</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>34,564</p>
                        </c>
                        <c ca="center">
                           <p>37,435</p>
                        </c>
                        <c ca="center">
                           <p>7,11</p>
                        </c>
                        <c ca="center">
                           <p>2,128</p>
                        </c>
                        <c ca="center">
                           <p>37</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>D. pseudoobscura</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>138,581</p>
                        </c>
                        <c ca="center">
                           <p>181,793</p>
                        </c>
                        <c ca="center">
                           <p>20,702</p>
                        </c>
                        <c ca="center">
                           <p>6,303</p>
                        </c>
                        <c ca="center">
                           <p>1,010</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. merolae</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>16,381</p>
                        </c>
                        <c ca="center">
                           <p>27,578</p>
                        </c>
                        <c ca="center">
                           <p>5,987</p>
                        </c>
                        <c ca="center">
                           <p>651</p>
                        </c>
                        <c ca="center">
                           <p>149</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Ostreococcus tauri</p>
               </st>
               <p><it>O. tauri </it>is a unicellular green alga that was discovered in the Mediterranean Thau lagoon in 1994. It belongs to the class Prasinophyceae, which is thought to be the most primitive in the green plant lineage from which all other green algae and ancestors of land plants have descended. This organism has a very small genome, 11.5 Mb <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, especially when compared to other sequenced Plantae genomes (for example, the <it>Arabidopsis </it>genome is 125 Mb <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and that of <it>Chlamydomonas </it>exceeds 100 Mb <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>). The <it>O. tauri </it>genome is densely packed and provides a useful genomic model for green plants <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Previous research revealed the lack of selenoproteins in land plants <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, whereas 10 selenoproteins were detected in the green alga <it>C. reinhardtii </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Surprisingly, we detected 26 selenoprotein genes in <it>O. tauri</it>.</p>
               <p>Among the known selenoproteins detected in <it>O. tauri</it>, 14 were homologs of human selenoproteins (thioredoxin reductase (TR), SelT, SelM, SelK, SelS, Sep15, SelO, SelH, SelW and five glutathione peroxidase (GPx) homologs), five were homologs of eukaryotic selenoproteins with restricted distribution (MsrA, SelU and three PDI homologs) and three were homologs of bacterial selenoproteins (methyltransferase, thioredoxin-fold protein and peroxiredoxin). We also identified four novel eukaryotic selenoproteins in the <it>O. tauri </it>genome. These included a predicted membrane selenoprotein (MSP) and three hypothetical proteins of unknown function. In addition, several excellent SECIS element candidates were identified during the analysis, but at present no suitable open reading frames (ORFs) could be identified upstream of these structures, in part because of the inadequate length of contigs. Therefore, the total number of <it>Ostreococcus </it>selenoproteins might be even higher than 26.</p>
               <p>Of interest was the observation that all <it>O. tauri </it>SECIS elements except one had a conserved G in the position directly preceding the quartet of non-Watson-Crick interacting nucleotides (Figure <figr fid="F1">1</figr>). Most eukaryotic SECIS elements have an A in this position, although the G was described in several zebrafish and nematode selenoprotein genes <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. In addition, almost all <it>O. tauri </it>SECIS elements had a long mini-stem in the apical portion of the structure (for example, SelT in Figure <figr fid="F1">1</figr>). This feature was also observed previously in a number of <it>Chlamydomonas </it>SECIS elements <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p><it>Ostreococcus </it>SECIS elements</p>
                  </caption>
                  <text>
                     <p><it>Ostreococcus </it>SECIS elements. <b>(a) </b>The most characteristic features of <it>O. tauri </it>and <it>O. lucimarinus </it>SECIS elements are a long mini-stem and an unpaired G preceding the SECIS quartet (core). A SelT SECIS element is shown as a typical example (left structure). Only two exceptions were found: a type I SECIS element in SelH (middle structure) and a SECIS element with an unpaired A nucleotide preceding the SECIS core (right structure). <b>(b) </b>Alignment of nucleotide sequences of all <it>O. tauri </it>SECIS elements. The location of the SECIS core is indicated. Black and grey highlighting shows sequence conservation.</p>
                  </text>
                  <graphic file="gb-2007-8-9-r198-1"/>
               </fig>
               <p>We metabolically labeled <it>O. tauri </it>cells with <sup>75</sup>Se and analyzed the selenoprotein pattern on SDS-PAGE gels using a PhosphorImager (Figure <figr fid="F2">2a</figr>). This method detects the most abundant selenoproteins. The overall pattern was similar to that of human HEK 293 and other mammalian cells. As in mammalian cells, the dominant 25 kDa band in the alga was likely a glutathione peroxidase, and one or both major selenoprotein bands in the 50-55 kDa range likely corresponded to thioredoxin reductase. Consistent with the genomics analysis, the number of selenoprotein bands in the <it>O. tauri </it>sample was higher than in mammalian cells.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Metabolic labeling of <it>O. tauri </it>and <it>D. discoideum </it>with <sup>75</sup>Se. <it>O. tauri </it>and <it>D. discoideum </it>cells were grown in the presence of <sup>75</sup>Se [selenite], cell lysates prepared, proteins resolved by SDS-PAGE and analyzed using a PhosphorImager</p>
                  </caption>
                  <text>
                     <p>Metabolic labeling of <it>O. tauri </it>and <it>D. discoideum </it>with <sup>75</sup>Se. <it>O. tauri </it>and <it>D. discoideum </it>cells were grown in the presence of <sup>75</sup>Se [selenite], cell lysates prepared, proteins resolved by SDS-PAGE and analyzed using a PhosphorImager. <b>(a) </b><it>O. tauri</it>. Three middle lanes represent the soluble fraction, homogenate and pellet fraction as shown above the gel. For comparison, HEK 293 cells were metabolically labeled with <sup>75</sup>Se, and migrations of thioredoxin reductase 1 (TR1) and glutathione peroxidase 1 (GPx1) are shown. <b>(b) </b><it>D. discoideum</it>. Two middle lanes represent two independent samples of <sup>75</sup>Se-labeled <it>D. discoideum </it>cells. The four radioactive bands correspond to the indicated selenoproteins identified <it>in silico</it>. For comparison, monkey CV-1 cells were metabolically labeled with <sup>75</sup>Se, and migrations of TR1 and GPx1 are shown on the right.</p>
                  </text>
                  <graphic file="gb-2007-8-9-r198-2"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Ostreococcus lucimarinus</p>
               </st>
               <p><it>O. lucimarinus</it>, previously known as <it>Ostreococcus </it>sp. CCE9901, is a close relative of <it>O. tauri </it>that is adapted to high light and was isolated from surface waters. Its genome size is 13.2 Mb. Homologs of all identified <it>O. tauri </it>selenoproteins were found in <it>O. lucimarinus</it>. In addition, three new sequences were identified, raising the number of selenoproteins in this organism to 29. This is the largest selenoproteome among the eukaryotes analyzed to date (although even larger selenoproteomes apparently exist; Lobanov and Gladyshev, unpublished). The three additional selenoproteins were a peroxiredoxin, a peroxiredoxin-like protein and a SelW-like protein. The SelW-like <it>O. lucimarinus </it>selenoprotein contained two predicted Sec residues.</p>
               <p>Similar to <it>O. tauri</it>, all <it>O. lucimarinus </it>SECIS elements except one had a conserved G in the position directly preceding the SECIS core (Figure <figr fid="F1">1a</figr>), and in addition a single ATGA-type SECIS element was found. Interestingly, single ATGA-type SECIS elements occur in different selenoprotein genes in the two <it>Ostreococcus </it>species. In <it>O. lucimarinus</it>, this SECIS type is within a glutathione peroxidase gene, while in <it>O. tauri </it>the ATGA-type SECIS is in the gene for a hypothetical protein. In contrast to <it>O. tauri</it>, no type I SECIS elements (Figure <figr fid="F1">1a</figr>) were found in <it>O. lucimarinus</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Cyanidioschyzon merolae</p>
               </st>
               <p><it>C. merolae </it>is an ultrasmall unicellular red alga that lives in acidic hot springs. It is thought to retain primitive features of cellular and genome organization. <it>C. merolae </it>has a simple cell architecture, containing a single nucleus, a single mitochondrion and a single chloroplast. Its genome size is 16 Mb, which is approximately one-seventh the size of the <it>A. thaliana </it>genome. Its chloroplast might be among the most ancestral <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. A BLAST search against the <it>C. merolae </it>genome revealed several known components of the Sec insertion machinery, including SBP2, EFsec, SecS and SPS2, suggesting that selenoproteins should also be present in this organism. However, a search for SECIS elements followed by ORF analyses revealed no candidate selenoproteins in the <it>C. merolae </it>genome.</p>
               <p>A BLASTN-based analysis of the <it>C. merolae </it>genome using known Sec tRNAs as query sequences did not identify Sec tRNA homologs, and the searches that utilized default versions of standard tRNA detection programs, ARAGORN and tRNAscan-SE, were also unsuccessful. We were able to identify the <it>C. merolae </it>Sec tRNA using our recently described tool for detection of unusual tRNAs <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. This tRNA (Figure <figr fid="F3">3</figr>) has all the features characteristic of Sec tRNAs, such as the UCA anticodon and a long variable stem.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Sec tRNA</p>
                  </caption>
                  <text>
                     <p>Sec tRNA. <b>(a) </b>Cloverleaf structures of Sec tRNAs from <it>C. reinhardtii, O. tauri </it>and <it>C. merolae</it>. <b>(b) </b>Nucleotide sequence alignment of <it>C. reinhardtii </it>and <it>C. merolae </it>Sec tRNAs with known Sec tRNAs. Black and grey highlighting shows sequence conservation.</p>
                  </text>
                  <graphic file="gb-2007-8-9-r198-3"/>
               </fig>
               <p>We applied additional sensitive tools for identification of selenoproteins in the red algal genome. Homologs of known selenoproteins either had Cys in place of Sec or were missing in this organism. We further carried out a search for Sec/Cys pairs in homologous sequences using the <it>C. merolae </it>genome and all protein sequences extracted from the NCBI non-redundant database. Again, no selenoproteins were detected in <it>C. merolae</it>. To test whether related organisms possess selenoproteins, all available red algal ESTs were extracted from NCBI dbEST and searched for SECIS elements using SECISearch. This analysis revealed one <it>bona fide </it>selenoprotein, SelO, in <it>Porphyra haitanensis</it>, which was also highly homologous to the <it>O. tauri </it>SelO (Additional data file 2). The red algal SECIS element was also detected in these sequences (Figure <figr fid="F4">4</figr>).</p>
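               <p>The idea behind a Sec/Cys pair search can be illustrated with a minimal sketch (Python). It assumes a pairwise protein alignment in which Sec (a translated in-frame UGA) is written as U and gaps as a hyphen. The function name and input format are hypothetical, and the sketch does not reproduce the actual search procedure used in this study.</p>
               <p>
```python
def sec_cys_pairs(aligned_query: str, aligned_subject: str) -> list[int]:
    """Return 0-based alignment columns where Sec (U) in one sequence faces Cys (C) in the other.

    Hypothetical helper for illustration: both inputs are rows of the same
    pairwise alignment and must therefore have equal length.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    columns = []
    pairs = zip(aligned_query.upper(), aligned_subject.upper())
    for col, (q, s) in enumerate(pairs):
        # A Sec/Cys pair is a column holding exactly one U and one C, in either order.
        if {q, s} == {"U", "C"}:
            columns.append(col)
    return columns


if __name__ == "__main__":
    # Column 2 pairs U in the query with C in the subject.
    print(sec_cys_pairs("MKUGA-L", "MKCGAQL"))
```
               </p>
               <p>A column reported by such a scan only flags a candidate; the flanking sequence conservation and the presence of a SECIS element would still need to be checked.</p>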
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Red algae selenoprotein O. SECIS elements in <it>O. tauri </it>(green alga) and <it>P. haitanensis </it>(red alga) SelO genes</p>
                  </caption>
                  <text>
                     <p>Red algae selenoprotein O. SECIS elements in <it>O. tauri </it>(green alga) and <it>P. haitanensis </it>(red alga) SelO genes. The <it>P. haitanensis </it>SECIS element is a type I structure, whereas the <it>O. tauri </it>SECIS element is a type II structure.</p>
                  </text>
                  <graphic file="gb-2007-8-9-r198-4"/>
               </fig>
               <p>The presence of the Sec insertion machinery in <it>C. merolae </it>and detection of a selenoprotein in a related red alga suggest that Sec-containing proteins exist in this evolutionary branch. It is possible that the difficulties in identifying selenoproteins in <it>C. merolae </it>may be due to incompleteness of the genome or presence of lineage-specific selenoprotein(s), whose homologs are not represented in sequence databases. In addition, it is possible that the small selenoproteome of <it>C. merolae </it>resulted in unusual SECIS elements, which could not be detected by SECISearch. It is clear, however, that the selenoproteome of this organism is extremely small.</p>
            </sec>
            <sec>
               <st>
                  <p>Thalassiosira pseudonana</p>
               </st>
               <p><it>T. pseudonana </it>is a marine centric diatom that serves as a model for studies on diatom physiology <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. A Sec tRNA sequence <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and one selenoprotein, Sec-containing glutathione peroxidase <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, have been identified in this organism. In this work, we isolated and directly sequenced the <it>T. pseudonana </it>Sec tRNA (see Additional data file 3 for the sequence and clover-leaf structure), which exhibited features typical of eukaryotic Sec tRNAs.</p>
               <p>By searching for SECIS elements, we detected 16 selenoprotein genes in <it>T. pseudonana </it>(Table <tblr tid="T1">1</tblr>). In addition, a partial SelO sequence was detected, but it did not include the regions corresponding to the possible Sec codon and SECIS element. The <it>T. pseudonana </it>selenoproteome includes two GPx homologs, SelT, TR, SPS2, two SelM, two SelU, MsrA, two PDI homologs, a predicted SAM-dependent methyltransferase, two peroxiredoxins and one thioredoxin-like protein. It is remarkable that in spite of large evolutionary distances, <it>Ostreococcus</it>, <it>Thalassiosira </it>and mammalian selenoprotein sets were large and showed a significant overlap, whereas many other eukaryotes, including some animals, had small selenoproteomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Dictyostelium discoideum</p>
               </st>
               <p><it>D. discoideum </it>is a slime mold that primarily inhabits soil or dung and feeds on bacteria. We previously reported the finding of Sec tRNA in this organism <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. In the present study, we analyzed its selenoproteome and found SPS2, SelK, Sep15, MSP and a homolog of thyroid hormone deiodinase (Table <tblr tid="T2">2</tblr>). The presence of the deiodinase homolog was unexpected as thyroid hormones are not known to occur in amoebae. However, this sequence assignment was unambiguous; for example, the <it>D. discoideum </it>selenoprotein exhibited 39% sequence identity to iodothyronine deiodinase type I from <it>Fundulus heteroclitus </it>(accession number AAO31952) and 37% identity to iodothyronine deiodinase type III from <it>Sus scrofa </it>(accession number NP_001001625). Among the five amoebae selenoproteins, MSP had the narrowest distribution and could only be detected in <it>Dictyostelium</it>, <it>Chlamydomonas</it>, <it>Volvox </it>and both <it>Ostreococcus </it>species. This novel selenoprotein had two Sec residues.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Selenoproteins identified in the analyzed eukaryotic genomes</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c ca="left">
                           <p>Selenoprotein family</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>O. tauri</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>O. lucimarinus</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>T. pseudonana</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>D. discoideum</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>D. pseudoobscura</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelK</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelH</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SPS2</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>DI</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Sep15</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MSP</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Gpx</p>
                        </c>
                        <c ca="center">
                           <p>+++++</p>
                        </c>
                        <c ca="center">
                           <p>+++++</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelT</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>TR</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelM</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelU</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MsrA</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PDI</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Methyltransferase</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Peroxiredoxin</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+++</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Thioredoxin-fold protein</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelO</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelW</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>++</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SelS</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Hypothetical protein 1</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Hypothetical protein 2</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Hypothetical protein 3</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c ca="center">
                           <p>+</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Total</p>
                        </c>
                        <c ca="center">
                           <p>26</p>
                        </c>
                        <c ca="center">
                           <p>29</p>
                        </c>
                        <c ca="center">
                           <p>16</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Each '+' corresponds to one selenoprotein gene.</p>
                  </tblfn>
               </tbl>
               <p>Interestingly, all identified <it>Dictyostelium </it>SECIS elements had a highly conserved UGUA sequence that preceded the SECIS core, and a U-U mismatch immediately following it (Figure <figr fid="F5">5</figr>). The SECIS element of the deiodinase-like protein had two U-U mismatches; however, they were located further from the SECIS core. All detected SECIS elements were type II structures <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The deiodinase-like SECIS element had an extremely long mini-stem. As discussed above, the latter feature was also observed in many <it>Ostreococcus </it>selenoprotein genes, whereas it rarely occurs in SECIS structures in other organisms. All <it>Dictyostelium </it>SECIS elements had an unpaired AAA in the apical bulge. The areas of strong conservation include an SBP2-binding site and nucleotides interacting with this protein <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Since the five selenoproteins have different evolutionary histories and are not homologous with each other, the conservation of primary sequences in <it>Dictyostelium </it>SECIS elements must represent convergent evolutionary events.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p><it>Dictyostelium discoideum </it>SECIS elements</p>
                  </caption>
                  <text>
                     <p><it>Dictyostelium discoideum </it>SECIS elements. <b>(a) </b>SECIS elements in <it>D. discoideum </it>selenoprotein genes. Sequences conserved in eukaryotic SECIS elements are shown in red, and <it>Dictyostelium</it>-specific conserved sequences are shown in blue. <b>(b) </b>Alignment of <it>D. discoideum </it>SECIS elements. A UGUA sequence preceding the SECIS core, and a U-U mismatch in the stem-loop structure represent additional conserved features in <it>Dictyostelium </it>SECIS elements. Black and grey highlighting shows sequence conservation.</p>
                  </text>
                  <graphic file="gb-2007-8-9-r198-5"/>
               </fig>
               <p>We used the observation of unusually high sequence conservation of <it>Dictyostelium </it>SECIS elements to develop a modified version of SECISearch, which required these conserved features and thereby allowed searches in which the other parameters were relaxed. However, application of this procedure did not detect additional selenoproteins.</p>
               <p>To further examine the <it>Dictyostelium </it>selenoproteome, we metabolically labeled the amoebae cells with <sup>75</sup>Se and analyzed the selenoprotein pattern on SDS-PAGE using a PhosphorImager (Figure <figr fid="F2">2b</figr>). Four selenoprotein bands were detected, which corresponded in size to four of the five selenoproteins identified computationally (SPS2, MSP, DI and Sep15). Apparently, Sep15 was a major selenoprotein in <it>D. discoideum</it>, whereas SelK was not detected. The latter selenoprotein might be expressed at low levels or under different growth or developmental conditions than those examined in our study.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Comparative analysis of eukaryotic selenoproteomes</p>
            </st>
            <p>Selenoproteins are found in all three domains of life, which share several protein and RNA components involved in Sec biosynthesis and insertion, suggesting an origin of the Sec machinery that predates the last universal common ancestor. Thus, Sec decoding is an ancient trait that has been maintained for hundreds of millions of years without widespread expansion or loss.</p>
            <p>We compiled newly and previously characterized selenoproteomes and analyzed the occurrence of particular selenoproteins against taxonomic distribution of species based on the tree of life <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The number of selenoproteins varied from zero (in plants, yeast and some protists) to 29 (in <it>Ostreococcus</it>) (Figure <figr fid="F6">6a</figr>). Significant differences in the composition of selenoproteomes could be seen even among related organisms. For example, among Viridiplantae, all higher plants lacked selenoproteins, whereas the green algae <it>Chlamydomonas </it>and <it>Ostreococcus </it>had 12 and 26-29 selenoproteins, respectively (Figure <figr fid="F6">6b</figr>). Three selenoproteins were found in <it>Mesostigma viride</it>, a Streptophyte that shares a common ancestor with land plants <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Eukaryotic selenoproteomes</p>
               </caption>
               <text>
                  <p>Eukaryotic selenoproteomes. <b>(a) </b>A simplified cladogram of model organisms discussed in the text that illustrates distribution of selenoproteins in eukaryotes. The number of selenoproteins in each indicated model organism is shown in red (current study) and gray (previously analyzed and other model organisms) squares, and is proportional to the size of the bars on the left. Yellow circles show possible origins of various selenoprotein families, and red crosses examples of massive selenoprotein loss. <b>(b) </b>Selenoprotein evolution in plants. The 'mountain' symbols show terrestrial organisms, and 'anchors' those that live in aquatic environments. Green checkmarks indicate the presence of an indicated selenoprotein in the corresponding genome. The presence of Cys-containing homologs is shown by blue checkmarks. Crossed red circles indicate absence of either Sec- or Cys-containing homologs. Unfilled spots correspond to lack of data due to unfinished genomes, unclear relationship between proteins and lineage specific gene duplications.</p>
               </text>
               <graphic file="gb-2007-8-9-r198-6"/>
            </fig>
            <p>Tracing individual selenoproteins, we found that some selenoprotein families were present in many organisms and others in only a few species, yet each identified family had a unique pattern of occurrence (Figure <figr fid="F6">6a</figr>). No single selenoprotein had a distribution that matched the overall occurrence of the Sec trait (that is, the occurrence of the Sec machinery). SelK was among the most widespread selenoproteins. This protein of unknown function is present in nearly all eukaryotes that utilize Sec (but is replaced with a Cys-containing homolog in nematodes and several other organisms). An additional widespread selenoprotein was SelW, which also occurs in most (but not all) selenoprotein-containing eukaryotes. Several other selenoproteins, such as glutathione peroxidase and thioredoxin reductase, also had a wide distribution.</p>
         </sec>
         <sec>
            <st>
               <p>Origin of many selenoproteins precedes animal evolution</p>
            </st>
            <p>Since mammalian selenoproteomes were large and included essentially all known eukaryotic selenoproteins, they were initially thought to represent the entire eukaryotic selenoproteome. Subsequent identification of selenoproteins with highly restricted occurrence added further complexity, but did not challenge the overall idea of recent evolution of the majority of eukaryotic selenoproteins. However, our analysis of selenoproteomes of six eukaryotic model organisms and their comparison with the previously characterized selenoproteomes revealed that 20 of the 25 human selenoproteins have Sec-containing homologs in many unicellular organisms. Similarly, taking into account protein families, at least 11 of the 16 mammalian selenoprotein families could be traced back to single-cell eukaryotes. SelU, which is not a selenoprotein in mammals, is present in some animals and protozoa and may be viewed as an additional ancient selenoprotein family. Overall, these data suggest that the origin of many selenoproteins not only precedes animal evolution, but can be dated back to the ancestral eukaryotes. Thus, many of these original selenoproteins were preserved during evolution and remain in vertebrates (including mammals), green algae and a variety of protists, whereas many other organisms manifested massive selenoprotein losses.</p>
            <p>It should be noted that Cys/Sec replacement is not always unidirectional and that prior evolutionary analyses suggest that both Sec loss and Sec gain are possible <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. However, the probability of independent parallel Sec gain, as well as of consecutive homoplastic Sec-to-Cys and Cys-to-Sec substitutions in a single protein position, is extremely low, and no selenoprotein families are known that evolved more than once. Two conditions must be met for a Cys-to-Sec change to take place. First, the Sec insertion machinery, including Sec tRNA, SECIS-binding protein SBP2, Sec-specific elongation factor and Sec synthase, must be present. This requirement is met (that is, all components of the machinery are present) if at least one other selenoprotein is present in the same organism. Second, a SECIS element must evolve in the 3'-untranslated region. While a single nucleotide change is sufficient to change the codon from Cys to Sec (that is, UGA instead of UGC or UGU), evolution of new SECIS elements is difficult. On the other hand, once Sec is replaced with Cys, the presence of the SECIS element provides no competitive advantage and this structure is quickly lost. Unless the reverse Cys-to-Sec mutation takes place before disruption of the SECIS element, the probability of restoring Sec is extremely low. Moreover, unless strong pressure exists to preserve Sec, its functional replacement with Cys may be expected. Combined, these factors allow us to assume that the character-state Sec follows Dollo's behavior.</p>
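            <p>The codon arithmetic in this argument can be checked with a short sketch. It only illustrates that each Cys codon (UGU, UGC) lies one substitution away from the Sec codon UGA; the helper names <it>hamming </it>and <it>single_step_neighbors </it>are introduced here for illustration and say nothing about the much harder requirement of evolving a SECIS element.</p>

```python
CYS_CODONS = ("UGU", "UGC")  # standard Cys codons
SEC_CODON = "UGA"            # UGA is recoded as Sec when a SECIS element is present


def hamming(a, b):
    """Number of positions at which two equal-length codons differ."""
    if len(a) != len(b):
        raise ValueError("codons must have equal length")
    return sum(x != y for x, y in zip(a, b))


def single_step_neighbors(codon):
    """All codons reachable from `codon` by exactly one nucleotide substitution."""
    bases = "ACGU"
    return sorted(
        codon[:i] + b + codon[i + 1:]
        for i in range(len(codon))
        for b in bases
        if b != codon[i]
    )


for cys in CYS_CODONS:
    # Report the distance to UGA and whether UGA is a one-step neighbor.
    print(cys, hamming(cys, SEC_CODON), SEC_CODON in single_step_neighbors(cys))
```

            <p>Both Cys codons differ from UGA only at the third position, which is why the codon change itself is the easy step and the SECIS element is the limiting one.</p>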
         </sec>
         <sec>
            <st>
               <p>Selenoproteins with restricted occurrence are common to organisms with large selenoproteomes</p>
            </st>
            <p>In addition to the many ancient eukaryotic selenoproteins, several selenoproteins have a narrower distribution. For example, SelP, SelN, MsrB and SelI appear to be specific to animals, whereas MSP, peroxiredoxin and thioredoxin-like protein could be detected only in unicellular eukaryotes. These observations suggest an emerging picture of selenoprotein evolution wherein core selenoprotein families evolved first, followed by the origin of additional selenoproteins in narrower groups of organisms. The new selenoproteins further increased the size of the selenoproteomes and remain prevalent in organisms with large selenoproteomes. In our current analysis, several <it>Ostreococcus </it>and <it>Thalassiosira </it>selenoproteins fit this pattern, in addition to the rare selenoproteins previously discovered (for example, SelU, SelJ and Fep15). However, it cannot be excluded that new selenoproteins might also occasionally evolve in organisms with small selenoproteomes (for example, red algae).</p>
         </sec>
         <sec>
            <st>
               <p>Independent events of massive selenoprotein loss in eukaryotes</p>
            </st>
            <p>We further identified and examined several groups of organisms characterized by massive selenoprotein loss. Location of these organisms on the eukaryotic tree of life suggests independent events of selenoprotein loss (Figure <figr fid="F6">6a</figr>). Five examples of selenoprotein loss are discussed below.</p>
            <sec>
               <st>
                  <p>Plants</p>
               </st>
               <p>As discussed above, <it>A. thaliana</it>, <it>O. sativa </it>and other higher plants lost both selenoproteins and Sec insertion machinery, whereas these genes were preserved in green algae, for example, <it>Chlamydomonas</it>, <it>Volvox </it>and <it>Ostreococcus</it>. An early Streptophyte, <it>M. viride</it>, has both Sec machinery and selenoproteins. Thus, there was a specific selenoprotein loss event in the Streptophyte subset of Viridiplantae, which invaded land. Analysis of selenoproteins present in green algae suggests that they were either replaced with Cys-containing homologs or entirely lost in land plants (Figure <figr fid="F6">6b</figr>). A more distantly related <it>C. merolae </it>also manifested a large-scale selenoprotein loss.</p>
            </sec>
            <sec>
               <st>
                  <p>Apicomplexan parasites</p>
               </st>
               <p>The high selenoprotein content of <it>Thalassiosira </it>(as a reference point), the reduced selenoproteome of <it>Plasmodium </it>and the lack of selenoproteins in <it>Cryptosporidium parvum </it>illustrate massive selenoprotein loss in apicomplexan parasites.</p>
            </sec>
            <sec>
               <st>
                  <p>Fungi</p>
               </st>
               <p>We screened all completely sequenced fungal genomes and could detect neither selenoproteins nor Sec insertion machinery. These data suggest that selenoprotein genes were likely lost at the base of the fungal kingdom.</p>
            </sec>
            <sec>
               <st>
                  <p>Insects</p>
               </st>
               <p>The small selenoproteomes of <it>A. gambiae</it>, <it>A. mellifera</it>, <it>D. pseudoobscura </it>and <it>D. melanogaster</it>, which consist of one to three selenoproteins, are an additional example of large-scale selenoprotein loss. On the other hand, aquatic arthropods, such as shrimp, have many selenoprotein genes (based on expressed sequence tag (EST) analyses, as the genomes are not yet available; unpublished data). Thus, it appears that selenoprotein genes were massively lost either in insects or in all terrestrial arthropods.</p>
            </sec>
            <sec>
               <st>
                  <p>Nematodes</p>
               </st>
               <p>The selenoproteomes of <it>C. elegans </it>and <it>C. briggsae </it>have only one selenoprotein, thioredoxin reductase, and, therefore, the Sec insertion system is used to decode only a single UGA codon in these nematodes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
               <p>The decreased size of selenoproteomes in these five groups of organisms appears to be not only due to the loss of entire selenoprotein genes, but also due to replacement of Sec with Cys. Thus, Cys-containing homologs, while often catalytically inefficient, may occasionally compensate for selenoprotein loss <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>A hypothesis for association of large selenoproteomes and aquatic life</p>
            </st>
            <p>The mosaic occurrence of eukaryotic selenoproteins and their consistent loss in different phyla suggest that the decreased selenoproteome size is the result of a selective force. What could be the factors responsible for or associated with selenoprotein loss? Comparative analysis of organisms with large and small selenoproteomes shows that many of the selenoprotein-rich organisms live in aquatic environments. In contrast, almost all organisms that lack or have a small number of selenoproteins are terrestrial (Figure <figr fid="F6">6</figr>). Considering independent, large-scale selenoprotein loss in these organisms, a common denominator appears to be the non-aquatic habitat. It should be noted, however, that the differences between aquatic and terrestrial selenoproteomes are ultimately influenced by specific environmental factors that differ with habitat. Therefore, the aquatic/terrestrial association should not be viewed as the basis for selenoprotein loss/gain, but rather a convenient illustration of differences between these organisms. Once environmental factors are identified, this association may be modified to reflect these factors rather than habitat.</p>
            <p>To further examine selenoprotein content of aquatic and terrestrial organisms, we analyzed organisms that are well represented by ESTs. We excluded large animals (vertebrates) from this analysis because their protective outer covering and complex morphology make their intra-organismal environment less sensitive to external conditions. With this limitation, aquatic eukaryotes had more selenoprotein genes than terrestrial organisms (Figure <figr fid="F7">7</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Aquatic invertebrates have more selenoproteins than terrestrial organisms</p>
               </caption>
               <text>
                  <p>Aquatic invertebrates have more selenoproteins than terrestrial organisms. Numbers of detected selenoproteins were plotted against the total number of available (redundant) ESTs for organisms that are represented by more than 25,000 ESTs. Vertebrate ESTs were excluded from this analysis due to large size of these organisms. Blue circles correspond to aquatic and brown squares to terrestrial organisms. The difference is statistically significant (<it>P </it>value is less than 2 &#215; 10<sup>-6</sup>).</p>
               </text>
               <graphic file="gb-2007-8-9-r198-7"/>
            </fig>
            <p>Whether <it>C. merolae </it>fits this association is not clear. This organism lives in highly acidic sulfate-rich hot springs (pH 1.5, 45&#176;C). It is possible that this extreme environment is responsible for the reduced use of Sec in red algae. The pKa of Sec is approximately 5.5. Whereas this residue would be ionized in most organisms under physiological conditions, at low pH, protonation of Sec may minimize its catalytic advantages. Abundance of sulfate in hot springs might also be of importance, as selenium and sulfur have similar chemistries.</p>
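            <p>The protonation argument above can be made concrete with the Henderson-Hasselbalch relation. The sketch below is an illustration only: it assumes that the Sec residue experiences the stated pH directly (which need not hold inside a cell), uses the approximate pKa of 5.5 quoted in the text, and takes pH 7.0 as an assumed comparison point that does not come from the text.</p>

```python
def fraction_deprotonated(pka, ph):
    """Henderson-Hasselbalch: fraction of an acid present in its ionized form."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


SEC_PKA = 5.5  # approximate value quoted in the text

# pH 1.5 is the hot-spring value from the text; pH 7.0 is an assumed comparison point.
for ph in (1.5, 7.0):
    share = fraction_deprotonated(SEC_PKA, ph)
    print(f"pH {ph}: {share:.4%} of Sec ionized")
```

            <p>Under these assumptions, about 0.01% of Sec would be ionized at pH 1.5, compared with about 97% at pH 7.0.</p>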
            <p>One possible explanation for the occurrence of large selenoproteomes in aquatic organisms is bioavailability of selenium in oceans. Dissolved organic selenides can account for approximately 80% of the dissolved selenium in ocean water <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> and represent an important source of selenium for phytoplankton. Following the food chain, this could explain a large number of selenoproteins in algae and fish. Likewise, a considerable number of selenoproteins in mammals could reflect the consequence of food sources, body size and relatively recent (in evolutionary terms) emergence of these organisms from marine environments. An additional factor may be constancy in the environmental conditions and nutrients in the aquatic environments. For aquatic organisms, environmental changes are slower and involve gradients of temperature, pH, pressure, oxygen and chemical environment. In contrast, in terrestrial environments, the changes are more frequent and they happen more suddenly. As a result, terrestrial organisms often face feast and starvation situations. An attractive factor to explain the differences between aquatic and terrestrial selenoproteomes may be oxygen content. Higher content of oxygen in air than in aquatic environments may make highly reactive selenoproteins more susceptible to oxidation in terrestrial organisms and select against the use of these proteins.</p>
            <p>Whether mammals and other vertebrates fit the hypothesis on the preferential use of selenium in aquatic environments is not clear. We note, however, that fish have larger selenoproteomes than vertebrates living in terrestrial environments, including mammals, reptiles and birds. Further genomic analyses of these organisms could clarify evolutionary changes in utilization of selenium. In future studies, it would also be important to determine which of the factors discussed above influence the preferential use of Sec in aquatic organisms or are responsible for the loss of selenoproteins in terrestrial organisms.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Until recently, the mammalian selenoproteome was thought to represent accurately eukaryotic selenoproteins and to be of recent (perhaps vertebrate) origin. However, as additional genome sequences became available, selenoproteins with restricted occurrence have been identified. In mammals, these proteins either occur in the form of Cys-containing homologs or are absent altogether; instead, these rare selenoproteins have been found in several lower eukaryotic organisms. In our work, the searches of additional eukaryotic genomes identified new selenoprotein genes, revealed examples of convergent evolution of SECIS elements, and identified many features of selenoproteome organization and evolution. Integrated analyses of eukaryotic selenoproteomes suggested that the majority of eukaryotic selenoprotein families evolved in single-celled eukaryotes. Our data show that the mosaic occurrence of selenoproteins is the consequence of selective, independent selenoprotein loss events in various eukaryotic phyla. Moreover, these analyses revealed an interesting pattern: large selenoproteomes tend to occur in aquatic life, whereas the organisms that lack selenoproteins or have small selenoproteomes are mostly terrestrial (with the notable exception of mammals, whose large bodies and intra-organismal homeostasis support an internal environment that may be less dependent on habitat). Further studies will be needed to test this hypothesis and identify environmental factors that influence selenium utilization.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Databases and programs</p>
            </st>
            <p>All genome, EST and predicted protein sequences were downloaded from NCBI <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, except for the genomes of <it>T. pseudonana</it>, <it>O. tauri</it>, and <it>O. lucimarinus</it>, which were obtained from the Joint Genome Institute <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. SECISearch <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> was used for identification of SECIS elements. The FASTA package <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and BLAST were used for similarity searches. MFOLD version 3.2 <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> was used for prediction of RNA secondary structures.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of homologs of known selenoprotein genes</p>
            </st>
            <p>Query sequences included a full set of human selenoproteins <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> as well as the following selenoproteins absent in mammals: <it>Chlamydomonas </it>MsrA <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, <it>Gallus gallus </it>SelU <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, <it>Danio rerio </it>SelJ and Fep15 <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, and <it>Emiliania huxleyi </it>protein disulfide isomerase <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. A stand-alone version of the TBLASTN program was used for detection of nucleotide sequences corresponding to known selenoprotein families. To be considered further, a candidate Sec residue had to correspond to a Sec residue in a known selenoprotein family or to a Cys residue in orthologous proteins. Downstream regions of predicted selenoprotein sequences were analyzed for the presence of candidate SECIS elements using SECISearch and for SECIS-like structures using MFOLD <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. All detected SECIS candidates were further examined for compliance with the current SECIS consensus model.</p>
         </sec>
         <sec>
            <st>
               <p>Searches for SECIS elements</p>
            </st>
            <p>Nucleotide sequences were scanned using SECISearch (Additional data file 1). In addition, the default and loose patterns of SECISearch were modified as described elsewhere <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> to accommodate organism-specific selenoprotein searches. These modifications allowed increased sensitivity of SECISearch and supported identification of unusual SECIS structures. The overall strategy of the searches was similar to that previously described <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Statistics of the searches (numbers of candidates corresponding to different steps in the search process) are shown in Table <tblr tid="T1">1</tblr>. In an additional search for <it>D. discoideum </it>SECIS elements, the following pattern was used as a query: TGTAATGATT_(10-12 nucleotides)_AAA_(24-35 nucleotides)_TGAT. This search then continued as described for other organisms.</p>
            <p>The primary sequence analysis step included searches for SECIS-like structures that satisfy NTGA__AA__GA or NTGA__CC__GA (N is any nucleotide) motifs in nucleotide sequences. Additional requirements were that the distance between the quartet (NTGA) and the unpaired AA in the apical loop is 10-13 nucleotides, and the distance between the unpaired AA and the GA that base-paired with the quartet is 15-39 nucleotides.</p>
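            <p>The primary sequence constraints described above, together with the <it>D. discoideum </it>query pattern, can be approximated with regular expressions. The sketch below is not SECISearch and covers only the primary sequence step. It assumes that each stated distance is the number of intervening nucleotides, that the input is DNA or RNA on a single strand, and it introduces the hypothetical names <it>SECIS_CORE</it>, <it>DICTY_QUERY </it>and <it>find_candidates </it>for illustration.</p>

```python
import re

NT = "[ACGT]"


def _overlapping(body):
    # A lookahead lets candidates starting at neighbouring positions all be reported.
    return re.compile(f"(?=({body}))")


# Core motifs from the text: NTGA, 10-13 nt, AA or CC, 15-39 nt, GA.
SECIS_CORE = _overlapping(f"{NT}TGA{NT}{{10,13}}(?:AA|CC){NT}{{15,39}}GA")

# D. discoideum query from the text: TGTAATGATT, 10-12 nt, AAA, 24-35 nt, TGAT.
DICTY_QUERY = _overlapping(f"TGTAATGATT{NT}{{10,12}}AAA{NT}{{24,35}}TGAT")


def find_candidates(seq, pattern=SECIS_CORE):
    """Return (start, matched_text) for each start position that fits the pattern."""
    seq = seq.upper().replace("U", "T")  # accept RNA as well as DNA input
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    # Synthetic toy sequence, not a real SECIS element.
    toy = "ATGA" + "G" * 10 + "AA" + "G" * 15 + "GA"
    for start, text in find_candidates(toy):
        print(start, len(text))
```

            <p>Each start position is reported once, so a hit here only marks a region that would still need the secondary structure and free energy checks described below.</p>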
            <p>The secondary structure analysis step examined candidates for consistency with the eukaryotic SECIS element consensus. Several additional filters were implemented to remove candidates with unsuitable secondary structures, including SECIS elements with more than two unpaired nucleotides in a row and Y-shaped SECIS elements.</p>
            <p>The free energy for each candidate structure was estimated; the free energies for the whole structure (threshold value of -12.6 kcal/mol) and the upper stem-loop (threshold value of -3.7 kcal/mol) were calculated. Only thermodynamically stable structures were considered further.</p>
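            <p>The thermodynamic filter can be sketched as a simple two-threshold test. The example below assumes that the free energies have already been computed by an external folding program, and that a candidate passes when each value is at or below its threshold (more negative meaning more stable); the text does not state whether the thresholds are inclusive. The class and candidate names are hypothetical, and the energy values are invented for illustration.</p>

```python
from dataclasses import dataclass

WHOLE_STRUCTURE_THRESHOLD = -12.6  # kcal/mol, from the text
UPPER_STEM_LOOP_THRESHOLD = -3.7   # kcal/mol, from the text


@dataclass(frozen=True)
class SecisCandidate:
    name: str
    dg_whole: float  # free energy of the whole structure, kcal/mol
    dg_upper: float  # free energy of the upper stem-loop, kcal/mol


def is_stable(candidate):
    # Assumption: both energies must be at or below their thresholds.
    return (WHOLE_STRUCTURE_THRESHOLD >= candidate.dg_whole
            and UPPER_STEM_LOOP_THRESHOLD >= candidate.dg_upper)


# Invented values, for illustration only.
candidates = [
    SecisCandidate("cand_1", -15.2, -4.1),
    SecisCandidate("cand_2", -11.0, -5.0),  # whole structure too weak
    SecisCandidate("cand_3", -14.0, -2.9),  # upper stem-loop too weak
]
stable = [c.name for c in candidates if is_stable(c)]
print(stable)
```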
            <p>Based on the location of candidate SECIS elements, candidate ORFs were predicted in upstream regions. SECIS candidates located within coding regions of known proteins were filtered out. An additional requirement was the presence of at least one homologous protein in the NCBI non-redundant database. If SECIS elements and ORFs corresponding to known protein families were on different DNA strands, the candidates were filtered out.</p>
            <p>The final step included manual sequence analyses of predicted selenoprotein ORFs located upstream of candidate SECIS elements.</p>
         </sec>
         <sec>
            <st>
               <p>Searches using the Sec/Cys homology approach</p>
            </st>
            <p>For three organisms, <it>O. tauri</it>, <it>O. lucimarinus </it>and <it>C. merolae</it>, additional procedures for selenoprotein detection included the search for Sec/Cys pairs in homologous sequences. ORFs with in-frame TGA codons were extracted that satisfied the following criteria: Sec-flanking regions for these proteins were conserved; and homologs could be detected that contained Cys in place of Sec. TBLASTX was used to examine all potential ORFs with in-frame UGA codons against the NCBI non-redundant protein database. All hits were then tested for the occurrence of SECIS elements. Orthologous proteins were defined as bidirectional best hits. PSI-BLAST was used for identification of distant homologs. Homologs were further confirmed by phylogenetic tree construction.</p>
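            <p>Two steps of this procedure lend themselves to a short illustration: extracting ORFs that contain an in-frame TGA, and defining orthologs as bidirectional best hits. The sketch below is a simplification under stated assumptions and is not the pipeline used in this study: it scans only the three forward frames, treats TAA and TAG as terminators and TGA as a potential Sec codon, ignores start codons and introns, and takes precomputed best-hit tables as input. All function names and the minimum ORF length are hypothetical.</p>

```python
HARD_STOPS = {"TAA", "TAG"}  # TGA is treated as a potential Sec codon


def orfs_with_inframe_tga(seq, min_codons=10):
    """Return (frame, start, orf) for stop-free runs that contain a TGA codon."""
    seq = seq.upper()
    found = []
    for frame in range(3):
        codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        run_start = 0
        # A sentinel stop closes the final run in each frame.
        for idx, codon in enumerate(codons + ["TAA"]):
            if codon in HARD_STOPS:
                run = codons[run_start:idx]
                if len(run) >= min_codons and "TGA" in run:
                    found.append((frame, frame + 3 * run_start, "".join(run)))
                run_start = idx + 1
    return found


def bidirectional_best_hits(best_a_to_b, best_b_to_a):
    """Pairs (a, b) where b is the best hit of a and a is the best hit of b."""
    return sorted(
        (a, b) for a, b in best_a_to_b.items() if best_b_to_a.get(b) == a
    )


if __name__ == "__main__":
    # Synthetic toy sequence with one in-frame TGA.
    toy = "ATG" + "GCT" * 5 + "TGA" + "GCT" * 5 + "TAA"
    print(orfs_with_inframe_tga(toy))
    print(bidirectional_best_hits({"a1": "b1", "a2": "b2"},
                                  {"b1": "a1", "b2": "a3"}))
```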
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analyses</p>
            </st>
            <p>Numerous attempts to derive a tree of life using various methods that were based on genes encoding ribosomal RNAs and several proteins have been published. However, the very existence of such a tree has been questioned recently because of either an insufficient number of discriminating characters or other biases such as horizontal gene transfer and chimerism. To avoid such problems, we adopted a eukaryotic branch of a phylogenetic tree recently developed by Ciccarelli <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This highly resolved tree of life utilized 31 concatenated, universally occurring genes with indisputable orthology in 191 species with completed genomes across all three domains of life. The missing organisms were filled in using a 'Tree of Life' web project <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and selected publications <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Although horizontal gene transfer is highly prevalent in prokaryotes, it is less so in eukaryotes, particularly in multicellular organisms. We also analyzed selenoprotein evolution in the eukaryotic domain. To reconstruct the phylogenies of selenoproteins, we adopted a character-based tree estimation method, a maximum parsimony approach that implies that the preferred phylogenetic tree is the tree that requires the fewest evolutionary changes.</p>
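            <p>The parsimony criterion can be illustrated with Fitch's algorithm, a standard way to count the minimum number of state changes of a single character on a fixed rooted binary tree. The text does not specify which implementation was used, so the sketch below is an illustration only. The taxon labels and character states are hypothetical, and Sec-to-Cys and Cys-to-Sec changes are treated as equally costly, whereas a Dollo-type assumption would further restrict gains.</p>

```python
def fitch(tree, states):
    """Return (possible_root_states, min_changes) for one character.

    tree   -- a leaf name (str) or a 2-tuple of subtrees
    states -- mapping from leaf name to its observed character state
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    shared = left_set.intersection(right_set)
    if shared:
        # The children agree on some state, so no extra change is needed here.
        return shared, left_cost + right_cost
    # The children disagree, so one change is counted at this node.
    return left_set.union(right_set), left_cost + right_cost + 1


if __name__ == "__main__":
    # Hypothetical taxa and states for one residue position.
    toy_tree = ((("taxon_A", "taxon_B"), "taxon_C"), "taxon_D")
    toy_states = {"taxon_A": "Sec", "taxon_B": "Sec",
                  "taxon_C": "Cys", "taxon_D": "Sec"}
    print(fitch(toy_tree, toy_states))
```

            <p>For this toy input the root state is Sec and one change is required, corresponding to a single Sec-to-Cys replacement on the branch leading to the Cys-containing taxon.</p>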
         </sec>
         <sec>
            <st>
               <p>Metabolic labeling of <it>D. discoideum </it>and <it>O. tauri </it>cells</p>
            </st>
            <p><it>D. discoideum </it>cells were grown as previously described <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>; the medium was supplemented with 100 &#956;Ci of <sup>75</sup>Se [selenite] (University of Missouri Research Reactor), and the cells were further maintained under continuous shaking for two days. A similar procedure was used for labeling <it>O. tauri </it>cells, except that they were grown in K-medium. The radioactive bands were visualized on the gel with a PhosphorImager. Samples of <sup>75</sup>Se-labeled mammalian HEK 293 and CV-1 cells, prepared as described previously <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, were included.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>Cys, cysteine; EST, expressed sequence tag; GPx, glutathione peroxidase; MSP, membrane selenoprotein; ORF, open reading frame; Sec, selenocysteine; SECIS, selenocysteine insertion sequence; TR, thioredoxin reductase.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AVL, DEF and YZ performed computational analyses. DEF and AS carried out experimental analyses. AVL, DLH and VNG wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Note added in proof</p>
         </st>
         <p>Two recent studies reported the complete genomes of <it>O. tauri </it>and <it>O. lucimarinus </it><abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. One of these articles identified 18 and 20 selenoprotein genes in <it>O. tauri </it>and <it>O. lucimarinus</it>, respectively <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. Compared to our analyses, this published study did not detect 17 selenoproteins in the two organisms, whereas the protein they designated as SelA and predicted to contain three selenocysteines appears to be a false positive. Nevertheless, the large number of detected selenoproteins in <it>Ostreococcus </it>further highlights the association with aquatic life reported in our work.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> presents a block-scheme of the searches for selenoprotein genes. Additional data file <supplr sid="S2">2</supplr> contains amino acid sequence alignments of selenoproteins identified in this study. Additional data file <supplr sid="S3">3</supplr> contains sequence and predicted clover-leaf structure of <it>T. pseudonana </it>Sec tRNA. Additional data file <supplr sid="S4">4</supplr> has representative phylogenetic trees of selenoproteins.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Block-scheme of the searches for selenoprotein genes</p>
            </caption>
            <text>
               <p>Block-scheme of the searches for selenoprotein genes.</p>
            </text>
            <file name="gb-2007-8-9-r198-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Amino acid sequence alignments of selenoproteins identified in this study</p>
            </caption>
            <text>
               <p>Amino acid sequence alignments of selenoproteins identified in this study.</p>
            </text>
            <file name="gb-2007-8-9-r198-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Sequence and predicted clover-leaf structure of <it>T. pseudonana </it>Sec tRNA</p>
            </caption>
            <text>
               <p>Sequence and predicted clover-leaf structure of <it>T. pseudonana </it>Sec tRNA.</p>
            </text>
            <file name="gb-2007-8-9-r198-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Representative phylogenetic trees of selenoproteins</p>
            </caption>
            <text>
               <p>Representative phylogenetic trees of selenoproteins.</p>
            </text>
            <file name="gb-2007-8-9-r198-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This study was supported by NIH GM061603 (to VNG). We thank Dr Catherine Chia for providing <it>Dictyostelium </it>cells, and Dr Konstantin Korotkov for labeling the <it>Dictyostelium </it>cells with <sup>75</sup>Se. This study was completed in part using the PrairieFire Beowulf cluster of the Research Computing Facility of the University of Nebraska - Lincoln.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>How selenium has altered our understanding of the genetic code.</p>
            </title>
            <aug>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2002</pubdate>
            <volume>22</volume>
            <fpage>3565</fpage>
            <lpage>3576</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">133838</pubid>
                  <pubid idtype="pmpid" link="fulltext">11997494</pubid>
                  <pubid idtype="doi">10.1128/MCB.22.11.3565-3576.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Regulation of gene expression by stop codon recoding: selenocysteine.</p>
            </title>
            <aug>
               <au>
                  <snm>Copeland</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2003</pubdate>
            <volume>312</volume>
            <fpage>17</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(03)00588-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12909337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Mechanism and regulation of selenoprotein synthesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Driscoll</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Copeland</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>Annu Rev Nutr</source>
            <pubdate>2003</pubdate>
            <volume>23</volume>
            <fpage>17</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.nutr.23.011702.073318</pubid>
                  <pubid idtype="pmpid" link="fulltext">12524431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Decoding apparatus for eukaryotic selenocysteine insertion.</p>
            </title>
            <aug>
               <au>
                  <snm>Tujebajeva</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Copeland</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Harney</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Driscoll</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <fpage>158</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1084265</pubid>
                  <pubid idtype="pmpid">11265756</pubid>
                  <pubid idtype="doi">10.1093/embo-reports/kvd033</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Computing expectation values for RNA motifs using discrete convolutions.</p>
            </title>
            <aug>
               <au>
                  <snm>Lambert</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Legendre</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fontaine</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Gautheret</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>118</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1168889</pubid>
                  <pubid idtype="pmpid" link="fulltext">15892887</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Mammalian selenoprotein gene signature: identification and functional analysis of selenoprotein genes using bioinformatics methods.</p>
            </title>
            <aug>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>2002</pubdate>
            <volume>347</volume>
            <fpage>84</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11898441</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Different catalytic mechanisms in mammalian selenocysteine- and cysteine-containing methionine-R-sulfoxide reductases.</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e375</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1278935</pubid>
                  <pubid idtype="pmpid" link="fulltext">16262444</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030375</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The prokaryotic selenoproteome.</p>
            </title>
            <aug>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>538</fpage>
            <lpage>543</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1299047</pubid>
                  <pubid idtype="pmpid" link="fulltext">15105824</pubid>
                  <pubid idtype="doi">10.1038/sj.embor.7400126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Characterization of mammalian selenoproteomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Zehtab</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <fpage>1439</fpage>
            <lpage>1443</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1083516</pubid>
                  <pubid idtype="pmpid" link="fulltext">12775843</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Nematode selenoproteome: the use of the selenocysteine insertion system to decode one codon in an animal genome?</p>
            </title>
            <aug>
               <au>
                  <snm>Taskov</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chapple</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Korotkov</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>2227</fpage>
            <lpage>2238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1083425</pubid>
                  <pubid idtype="pmpid" link="fulltext">15843685</pubid>
                  <pubid idtype="doi">10.1093/nar/gki507</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p><it>In silico </it>identification of novel selenoproteins in the <it>Drosophila melanogaster </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morozova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Morey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Serras</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Corominas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>697</fpage>
            <lpage>702</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1083988</pubid>
                  <pubid idtype="pmpid" link="fulltext">11493597</pubid>
                  <pubid idtype="doi">10.1093/embo-reports/kve151</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Selenoproteins and selenocysteine insertion system in the model plant cell system, <it>Chlamydomonas reinhardtii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Onoshko</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Zhi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Xiang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Weeks</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2002</pubdate>
            <volume>21</volume>
            <fpage>3681</fpage>
            <lpage>3693</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">126117</pubid>
                  <pubid idtype="pmpid" link="fulltext">12110581</pubid>
                  <pubid idtype="doi">10.1093/emboj/cdf372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The <it>Plasmodium </it>selenoproteome.</p>
            </title>
            <aug>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Delgado</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rahlfs</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Gromer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>496</fpage>
            <lpage>505</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1342035</pubid>
                  <pubid idtype="pmpid" link="fulltext">16428245</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj450</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A selenocysteine tRNA and SECIS element in <it>Plasmodium falciparum</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mourier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2005</pubdate>
            <volume>11</volume>
            <fpage>119</fpage>
            <lpage>122</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370700</pubid>
                  <pubid idtype="pmpid" link="fulltext">15659354</pubid>
                  <pubid idtype="doi">10.1261/rna.7185605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Diversity and functional plasticity of eukaryotic selenoproteins: identification and characterization of the SelJ family.</p>
            </title>
            <aug>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Chapple</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Albrecht</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hua</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lescure</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lengauer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Krol</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>16188</fpage>
            <lpage>16193</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283428</pubid>
                  <pubid idtype="pmpid" link="fulltext">16260744</pubid>
                  <pubid idtype="doi">10.1073/pnas.0505146102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Identification and characterization of Fep15, a new selenocysteine-containing member of the Sep15 protein family.</p>
            </title>
            <aug>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Hua</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Biochem J</source>
            <pubdate>2006</pubdate>
            <volume>394</volume>
            <fpage>575</fpage>
            <lpage>579</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1383707</pubid>
                  <pubid idtype="pmpid" link="fulltext">16236027</pubid>
                  <pubid idtype="doi">10.1042/BJ20051569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Comparative genome sequencing of <it>Drosophila pseudoobscura</it>: chromosomal, gene, and cis-element evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bettencourt</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hradecky</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Letovsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meisel</snm>
                  <fnm>RP</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540289</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632085</pubid>
                  <pubid idtype="doi">10.1101/gr.3059305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Evolution and speciation in the <it>Drosophila obscura </it>group.</p>
            </title>
            <aug>
               <au>
                  <snm>Lakovaara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saura</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>The Genetics and Biology of Drosophila</source>
            <publisher>New York: Academic Press</publisher>
            <editor>Ashburner M, Carson HL, Thompson JN Jr</editor>
            <pubdate>1982</pubdate>
            <volume>3b</volume>
            <fpage>1</fpage>
            <lpage>59</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenetic analysis and genome size of <it>Ostreococcus tauri </it>(Chlorophyta, Prasinophyceae).</p>
            </title>
            <aug>
               <au>
                  <snm>Courties</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perasso</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chr&#233;tiennot-Dinet</snm>
                  <fnm>M-J</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guillou</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Troussellier</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Phycol</source>
            <pubdate>1998</pubdate>
            <volume>34</volume>
            <fpage>844</fpage>
            <lpage>849</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1046/j.1529-8817.1998.340844.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <cnm>Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p><it>Chlamydomonas </it>as a model organism.</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>EH</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Physiol Plant Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <fpage>363</fpage>
            <lpage>406</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.arplant.52.1.363</pubid>
                  <pubid idtype="pmpid" link="fulltext">11337403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p><it>Chlamydomonas </it>and <it>Arabidopsis</it>. A dynamic duo.</p>
            </title>
            <aug>
               <au>
                  <snm>Gutman</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Niyogi</snm>
                  <fnm>KK</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>607</fpage>
            <lpage>610</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514095</pubid>
                  <pubid idtype="pmpid" link="fulltext">15208408</pubid>
                  <pubid idtype="doi">10.1104/pp.104.041491</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>DNA libraries for sequencing the genome of <it>Ostreococcus tauri </it>(Chlorophyta, Prasinophyceae): the smallest free-living eukaryotic cell.</p>
            </title>
            <aug>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ferraz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lagoda</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Eycheni&#233;</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Regad</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Sabau</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Courties</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Delseny</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Demaille</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Phycol</source>
            <pubdate>2002</pubdate>
            <volume>38</volume>
            <fpage>1150</fpage>
            <lpage>1156</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1046/j.1529-8817.2002.02021.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Two distinct SECIS structures capable of directing selenocysteine incorporation in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Grundner-Culemann</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>GW</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Harney</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>1999</pubdate>
            <volume>5</volume>
            <fpage>625</fpage>
            <lpage>635</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1369790</pubid>
                  <pubid idtype="pmpid" link="fulltext">10334333</pubid>
                  <pubid idtype="doi">10.1017/S1355838299981542</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Structural analysis of new local features in SECIS RNA hairpins.</p>
            </title>
            <aug>
               <au>
                  <snm>Fagegaltier</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lescure</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Walczak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Carbon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krol</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>2679</fpage>
            <lpage>2689</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102651</pubid>
                  <pubid idtype="pmpid" link="fulltext">10908323</pubid>
                  <pubid idtype="doi">10.1093/nar/28.14.2679</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p><it>Cyanidioschyzon merolae </it>genome. A tool for facilitating comparable studies on organelle biogenesis in photosynthetic eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Misumi</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Matsuzaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nozaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Miyagishima</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yagisawa</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kuroiwa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kuroiwa</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>137</volume>
            <fpage>567</fpage>
            <lpage>585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1065357</pubid>
                  <pubid idtype="pmpid" link="fulltext">15681662</pubid>
                  <pubid idtype="doi">10.1104/pp.104.053991</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Is there a twenty third amino acid in the genetic code?</p>
            </title>
            <aug>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>357</fpage>
            <lpage>360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.05.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">16713651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The genome of the diatom <it>Thalassiosira pseudonana</it>: ecology, evolution, and metabolism.</p>
            </title>
            <aug>
               <au>
                  <snm>Armbrust</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Berges</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bowler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Apt</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Bechner</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <fpage>79</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1101156</pubid>
                  <pubid idtype="pmpid" link="fulltext">15459382</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Selenocysteyl tRNA occurs in the diatom, <it>Thalassiosira</it>, and in the ciliate, <it>Tetrahymena</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Stadtman</snm>
                  <fnm>TC</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1991</pubdate>
            <volume>5</volume>
            <fpage>1183</fpage>
            <lpage>1186</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1991.tb01891.x</pubid>
                  <pubid idtype="pmpid">1835508</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Specific selenium-containing macromolecules in the marine diatom <it>Thalassiosira pseudonana</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Price</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1988</pubdate>
            <volume>86</volume>
            <fpage>192</fpage>
            <lpage>199</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1054453</pubid>
                  <pubid idtype="pmpid" link="fulltext">16665865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Selenocysteine tRNA identification in the model organisms <it>Dictyostelium discoideum </it>and <it>Tetrahymena thermophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Shrimali</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Lobanov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Mahadeo</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Parent</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2005</pubdate>
            <volume>329</volume>
            <fpage>147</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2005.01.120</pubid>
                  <pubid idtype="pmpid" link="fulltext">15721286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The selenocysteine incorporation machinery: interactions between the SECIS RNA and the SECIS-binding protein SBP2.</p>
            </title>
            <aug>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Copeland</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Driscoll</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Krol</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <fpage>1442</fpage>
            <lpage>1453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370188</pubid>
                  <pubid idtype="pmpid" link="fulltext">11680849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Toward automatic reconstruction of a highly resolved tree of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Ciccarelli</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Creevey</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <fpage>1283</fpage>
            <lpage>1287</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1123061</pubid>
                  <pubid idtype="pmpid" link="fulltext">16513982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The closest living relatives of land plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Karol</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Cimino</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2351</fpage>
            <lpage>2353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065156</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Dynamic evolution of selenocysteine utilization in bacteria: a balance between selenoprotein loss and evolution of selenocysteine from redox active cysteine residues.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Romero</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Salinas</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R94</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1794560</pubid>
                  <pubid idtype="pmpid" link="fulltext">17054778</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-10-r94</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Active sites of thioredoxin reductases: why selenoproteins?</p>
            </title>
            <aug>
               <au>
                  <snm>Gromer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Arscott</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Rauch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ballou</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>CH</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Schirmer</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Arner</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>12618</fpage>
            <lpage>12623</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">240667</pubid>
                  <pubid idtype="pmpid" link="fulltext">14569031</pubid>
                  <pubid idtype="doi">10.1073/pnas.2134510100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The estuarine behaviour of selenium in San Francisco Bay.</p>
            </title>
            <aug>
               <au>
                  <snm>Cutter</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Estuarine Coastal Shelf Sci</source>
            <pubdate>1989</pubdate>
            <volume>28</volume>
            <fpage>13</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0272-7714(89)90038-3</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>NCBI</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/genbank/</url>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Joint Genome Institute</p>
            </title>
            <url>http://www.jgi.doe.gov</url>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Improved tools for biological sequence comparison.</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <fpage>2444</fpage>
            <lpage>2448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">280013</pubid>
                  <pubid idtype="pmpid" link="fulltext">3162770</pubid>
                  <pubid idtype="doi">10.1073/pnas.85.8.2444</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Mfold web server for nucleic acid folding and hybridization prediction.</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3406</fpage>
            <lpage>3415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169194</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824337</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Reconsidering the evolution of eukaryotic selenoproteins: a novel nonmammalian family with scattered phylogenetic distribution.</p>
            </title>
            <aug>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Novoselov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Kryukov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Lescure</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Blanco</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Krol</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>71</fpage>
            <lpage>77</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1298953</pubid>
                  <pubid idtype="pmpid" link="fulltext">14710190</pubid>
                  <pubid idtype="doi">10.1038/sj.embor.7400036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>A novel eukaryotic selenoprotein in the haptophyte alga <it>Emiliania huxleyi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Obata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shiraiwa</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>280</volume>
            <fpage>18462</fpage>
            <lpage>18468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M501517200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15743763</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>"Tree of Life" Web Project</p>
            </title>
            <url>http://www.tolweb.org/tree/</url>
         </bibl>
         <bibl id="B45">
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Animal Evolution: Interrelationships of the Living Phyla</source>
            <publisher>Oxford: Oxford University Press</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Origin and evolution of the slime molds (Mycetozoa).</p>
            </title>
            <aug>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>12007</fpage>
            <lpage>12012</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">23686</pubid>
                  <pubid idtype="pmpid" link="fulltext">9342353</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.22.12007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>A kingdom-level phylogeny of eukaryotes based on combined protein data.</p>
            </title>
            <aug>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wenk-Siefert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>972</fpage>
            <lpage>977</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5493.972</pubid>
                  <pubid idtype="pmpid" link="fulltext">11062127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The protistan origins of animals and fungi.</p>
            </title>
            <aug>
               <au>
                  <snm>Steenkamp</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <fpage>93</fpage>
            <lpage>106</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj011</pubid>
                  <pubid idtype="pmpid" link="fulltext">16151185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Association between the 15-kDa selenoprotein and UDP-glucose:glycoprotein glucosyltransferase in the endoplasmic reticulum of mammalian cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Korotkov</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Kumaraswamy</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Gladyshev</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <fpage>15330</fpage>
            <lpage>15336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M009861200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11278576</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The tiny eukaryote <it>Ostreococcus</it> provides genomic insights into the paradox of plankton speciation.</p>
            </title>
            <aug>
               <au>
                  <snm>Palenik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Grimwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Salamov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dupont</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jorgensen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>7705</fpage>
            <lpage>7710</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1863510</pubid>
                  <pubid idtype="pmpid" link="fulltext">17460045</pubid>
                  <pubid idtype="doi">10.1073/pnas.0611046104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Genome analysis of the smallest free-living eukaryote <it>Ostreococcus tauri</it> unveils many unique features.</p>
            </title>
            <aug>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ferraz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Worden</snm>
                  <fnm>AZ</fnm>
               </au>
               <au>
                  <snm>Robbens</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Degroeve</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Echeynie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>11647</fpage>
            <lpage>11652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1544224</pubid>
                  <pubid idtype="pmpid" link="fulltext">16868079</pubid>
                  <pubid idtype="doi">10.1073/pnas.0604795103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
