<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2010-11-5-209</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Review</dochead>
      <bibl>
         <title>
            <p>The origin and early evolution of eukaryotes in the light of phylogenomics</p>
         </title>
         <aug>
            <au ca="yes" id="A1"><snm>Koonin</snm><mi>V</mi><fnm>Eugene</fnm><insr iid="I1"/><email>Koonin@ncbi.nlm.nih.gov</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA</p></ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2010</pubdate>
         <volume>11</volume>
         <issue>5</issue>
         <fpage>209</fpage>
         <url>http://genomebiology.com/2010/11/5/209</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">20441612</pubid><pubid idtype="doi">10.1186/gb-2010-11-5-209</pubid></pubidlist></xrefbib>
      </bibl>
      <history><pub><date><day>5</day><month>5</month><year>2010</year></date></pub></history>
      <cpyrt><year>2010</year><collab>BioMed Central Ltd</collab></cpyrt>
      <shorttitle>
         <p>The origin and early evolution of eukaryotes in the light of phylogenomics</p>
      </shorttitle>
      <shortabs>
         <p>Comparative genomics and new phylogenies of eukaryote groups suggest a scenario in which the mitochondrial endosymbiosis triggered the origin of eukaryotes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Phylogenomics of eukaryote supergroups suggest a highly complex last common ancestor of eukaryotes and a key role of mitochondrial endosymbiosis in the origin of eukaryotes.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="10thanniversary" subtype="theme_series_title" type="BMC">10th Anniversary Collection</classification>
         <classification id="10thanniversary" subtype="theme_series_editor" type="BMC"/>
         <classification id="30010008" subtype="man_spc_id" type="BMC">Evolution</classification>
         <classification id="30010004" subtype="man_spc_id" type="BMC">Cell biology</classification>
         <classification id="300100016" subtype="man_spc_id" type="BMC">Molecular biology</classification>
         <classification id="300100010" subtype="man_spc_id" type="BMC">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Eukaryotes</p>
         </st>
         <p>The origin of eukaryotes is a huge enigma and a major challenge for evolutionary biology <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. There is a sharp divide in the organizational complexity of the cell between eukaryotes, which have complex intracellular compartmentalization, and even the most sophisticated prokaryotes (archaea and bacteria), which do not <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. A typical eukaryotic cell is about 1,000-fold bigger by volume than a typical bacterium or archaeon, and functions under different physical principles: free diffusion has little role in eukaryotic cells, but is crucial in prokaryotes <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. The compartmentalization of eukaryotic cells is supported by an elaborate endomembrane system and by the actin-tubulin-based cytoskeleton <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. There are no direct counterparts of these organelles in archaea or bacteria. The other hallmark of the eukaryotic cell is the presence of mitochondria, which have a central role in energy transformation and perform many additional roles in eukaryotic cells, such as in signaling and cell death.</p>
         <p>The conservation of the major features of cellular organization and the existence of a large set of genes that are conserved across eukaryotes leave no doubt that all extant eukaryotic forms evolved from a last eukaryote common ancestor (LECA; see below). All eukaryotes that have been studied in sufficient detail possess either mitochondria or organelles derived from mitochondria <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, so it is thought that LECA already possessed mitochondria (see below). Plants and many unicellular eukaryotes also have another type of organelle, plastids.</p>
         <p>The organizational complexity of the eukaryotic cells is complemented by extremely sophisticated, cross-talking signaling networks <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The main signaling systems in eukaryotes are the kinase-phosphatase machinery that regulates protein function through phosphorylation and dephosphorylation <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>; the ubiquitin network that governs protein turnover and localization through reversible protein ubiquitylation <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>; regulation of translation by microRNAs <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>; and regulation of transcription at the levels of individual genes and chromatin remodeling <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Eukaryotes all share the main features of cellular architecture and the regulatory circuitry that clearly differentiate them from prokaryotes, although the ancestral forms of some signature eukaryotic systems are increasingly detected in prokaryotes, as discussed below. Phylogenomic reconstructions show that the characteristic eukaryotic complexity arose almost 'ready made', without any intermediate grades seen between the prokaryotic and eukaryotic levels of organization <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Explaining this apparent leap in complexity at the origin of eukaryotes is one of the principal challenges of evolutionary biology.</p>
         <p>The key to the origin of eukaryotes will undoubtedly be found using comparative genomics of eukaryotes, archaea and bacteria. Complete genome sequences from all three domains of cellular life are accumulating exponentially, albeit at markedly different paces. As of March 2010, the NCBI genome database contained over 1,000 bacterial genomes, about 100 archaeal genomes, and about 100 genomes of eukaryotes <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Here, I discuss some of the main insights that have come from comparative analysis of these genomes, which may help to shed light on the origin and the early stages of evolution of eukaryotes. So far, the comparative genomics era has brought fascinating clues but no decisive breakthrough.</p>
      </sec>
      <sec>
         <st>
            <p>The supergroups of eukaryotes and the root of the eukaryotic evolutionary tree</p>
         </st>
         <p>Although several eukaryotic kingdoms, such as animals, fungi, plants and ciliates, are well defined and seem to be monophyletic beyond reasonable doubt, deciphering the evolutionary relationships between these kingdoms and numerous other groups of unicellular eukaryotes (also called protists) turned out to be daunting. For many years, evolutionary biologists tended to favor the so called crown group phylogeny <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B32">32</abbr></abbrgrp>. The 'crown' of this evolutionary tree included animals (Metazoa) and plants (Viridiplantae), fungi and various assortments of protists, depending on the methods used for tree construction <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. The rest of the protists, such as microsporidia, diplomonads and parabasalia, were considered 'early branching eukaryotes'; for some of them, this conclusion was reached because they appeared to lack mitochondria and were therefore thought to have evolved before the mitochondrial symbiosis. The scenario resulting from the crown group phylogeny was called the archezoan scenario: the archaezoan was defined as a hypothetical ancestral form that lacked mitochondria but possessed the other signature features of the eukaryotic cell. However, during the past decade, the early branching groups have lost their positions at the root of the eukaryotic tree, one after another <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. The improved taxon sampling as a result of genome sequencing together with new, more robust methods for phylogenetic analysis indicate that the deep placing of these groups seen in early trees was a long-branch artifact caused by the fast evolution of the respective organisms <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. At the same time, comparative-genomic and ultrastructural studies destroyed the biological underpinning of the near-root positions of the (former) early branching groups of protists by showing that none of them ancestrally lack mitochondria, as they all have genes of apparent mitochondrial origin and mitochondria-related organelles, such as hydrogenosomes and mitosomes <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13	</abbr><abbr bid="B40">40</abbr></abbrgrp>.</p>
         <p>There are therefore no grounds to consider any group of eukaryotes primitive, a presymbiotic archezoan. Rather, taking into account the small genomes and high rate of evolution characteristic of most of the protist groups thought to be early branching, and their parasitic lifestyle, it is becoming increasingly clear that most or perhaps all of them evolved from more complex ancestral forms by reductive evolution <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B39">39</abbr></abbrgrp>. Reductive evolution refers to the evolutionary modality typical of parasites: they tend to lose genes, organelles and functions when the respective functionalities are taken over by the host. So the archezoan (crown group) phylogeny seems to have been disproved, and deep phylogeny and the theories of the origin of eukaryotes effectively had to start from scratch.</p>
         <p>This time phylogenomic approaches were mainly used, that is, phylogenetic analysis of genome-wide sets of conserved genes; this was made possible by the much larger number of genomes that had been sequenced <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. The key accomplishment at this new stage was the proposal of 'supergroups' of eukaryotes that are suggested to combine highly diverse groups of organisms in a monophyletic group <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Most of the phylogenomic analyses published so far converge on five supergroups (or six if the Amoebozoa and Opisthokonts do not form a single supergroup, the Unikonts; Figure <figr fid="F1">1</figr>). Although proving monophyly is non-trivial for these groups <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>, the general structure of the tree, with a few supergroups forming a star-like phylogeny (Figure <figr fid="F1">1</figr>), is reproduced consistently, and the latest results <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> seem to support the monophyly of the five supergroups.</p>
         <fig id="F1"><title><p>Figure 1</p></title><caption><p>Evolution of the eukaryotes</p></caption><text>
   <p><b>Evolution of the eukaryotes</b>. The relationship between the five eukaryotic supergroups - Excavates, Rhizaria, Unikonts, Chromalveolates and Plantae - are shown as a star phylogeny with LECA placed in the center. The 4,134 genes assigned to LECA are those shared by the free-living excavate amoeboflagellate <it>Naegleria gruberi </it>with representatives of at least one other supergroup <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. The numbers of these putative ancestral genes retained in selected lineages from different supergroups are also indicated. Branch lengths are arbitrary. Two putative root positions are shown: I, the Unikont-Bikont rooting <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>; II, rooting at the base of Plantae <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>.</p>
</text><graphic file="gb-2010-11-5-209-1"/></fig>
         <p>The relationship between the supergroups is a formidable problem as the internal branches are extremely short, suggesting that the radiation of the supergroups occurred rapidly (on the evolutionary scale), perhaps resembling an evolutionary 'big bang' <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>. Two recent, independent phylogenetic studies <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> each analyzed over 130 conserved proteins from several dozen eukaryotic species and, after exploring the effects of removing fast-evolving taxa, arrived at a three-megagroup structure of the eukaryotic tree. The megagroups consist of Unikonts, Excavates, and the assemblage of Plantae, Chromalveolata and Rhizaria <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>.</p>
         <p>Furthermore, there have been several attempts to infer the position of the root of the eukaryotic tree (Figure <figr fid="F1">1</figr>). The first alternative to the crown group tree was proposed by Cavalier-Smith and coworkers <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>, who used rare genomic changes (RGCs) <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, such as the fusion of two enzyme genes <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp> and the domain structure of myosins <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, to place the root between the Unikonts and the rest of eukaryotes (I (red arrow) in Figure <figr fid="F1">1</figr>). This separation seems biologically plausible because Unikont cells have a single cilium, whereas all other eukaryotic cells have two. However, this conclusion could be suspect because the use of only a few RGCs makes it difficult to rule out homoplasy (parallel emergence of the same RGC, such as gene fusion or fission, in different lineages). Rogozin and coworkers <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> used a different RGC approach based on rare replacements of highly conserved amino acid residues requiring two nucleotide substitutions and inferred the most likely position of the root to be between Plantae and the rest of eukaryotes (II (green arrow) in Figure <figr fid="F1">1</figr>). Again, this seems biologically plausible because the cyanobacterial endosymbiosis that gave rise to plastids occurred on the Plantae lineage.</p>
         <p>The controversy about the root position and the lack of consensus regarding the monophyly of at least some of the supergroups, let alone the megagroups, indicate that, despite the emerging clues, the deep phylogeny of eukaryotes currently should be considered unresolved. In a sense, given the likely 'big bang' of early eukaryote radiation, the branching order of the supergroups, in itself, might be viewed as relatively unimportant <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. However, the biological events that triggered these early radiations are of major interest, so earnest attempts to resolve the deepest branches of the eukaryotic tree will undoubtedly continue with larger and further improved datasets and methods.</p>
      </sec>
      <sec>
         <st>
            <p>The last common ancestor of eukaryotes</p>
         </st>
         <p>Comparative analysis of representative genomes from different eukaryotic supergroups enables the reconstruction of the gene complement of LECA using maximum parsimony (MP) or more sophisticated maximum likelihood (ML) methods <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. Essentially, genes that are represented in diverse extant representatives of different supergroups, even though lost in some lineages, can be mapped back to LECA. The results of all these reconstructions consistently point to a complex LECA, in terms of both the sheer number of ancestral genes and, perhaps even more importantly, the ancestral presence of the signature functional systems of the eukaryotic cell (see below). A MP reconstruction based on phyletic patterns in clusters of orthologous genes of eukaryotes mapped 4,137 genes to LECA (Figure <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp>. Remarkably, an even simpler estimation, based on the recent analysis of the genome of <it>Naegleria gruberi</it>, the first sequenced genome of a free-living excavate <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, revealed about a nearly identical number of genes, 4,134, that are shared by <it>Naegleria </it>and at least one other supergroup of eukaryotes, suggesting that these genes are part of the LECA heritage (Figure <figr fid="F1">1</figr>). Such estimates are highly conservative as they do not account for lineage-specific loss of ancestral genes, a major aspect in the evolution of eukaryotes. Indeed, even animals and plants, the eukaryotic kingdoms that seem to be the least prone to gene loss, have still lost about 20% of the putative ancestral genes identified in the unicellular <it>Naegleria </it>(Figure <figr fid="F1">1</figr>). Given that the current estimate for the gene complement of LECA must be conservative, the genome of LECA is likely to have been as complex as those of typical extant free-living unicellular eukaryotes <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>.</p>
         <p>This conclusion is supported by reconstructions from comparative genomics of the ancestral composition of the key functional systems of the LECA, such as the nuclear pore <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B69">69</abbr></abbrgrp>, the spliceosome <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, the RNA interference machinery <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, the proteasome and the ubiquitin signaling system <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, and the endomembrane apparatus <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The outcomes of these reconstructions are all straightforward and consistent, even when different topologies of the phylogenetic tree of eukaryotes were used as the scaffold for the reconstruction: LECA already possessed all these structures in its fully functional state, possibly as complex as the counterparts in modern eukaryotes.</p>
         <p>Reconstruction of other aspects of the genomic composition and architecture of LECA similarly points to a highly complex ancestral genome. Comparative-genomic analysis of intron positions in orthologous genes within and between supergroups suggests high intron densities in the ancestors of the supergroups and in LECA, at least as dense as in modern free-living unicellular eukaryotes <abbrgrp><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr></abbrgrp>. A systematic analysis of widespread gene duplications in eukaryotes indicates that hundreds of duplications predate LECA, especially duplications of genes involved in protein turnover <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp>. Taken together, these results clearly indicate that LECA was a typical, fully developed eukaryotic cell. The subsequent evolution of eukaryotes has seemingly shown no consistent trend toward increased complexity, except for lineage-specific embellishments, such as those seen in animals and plants. There was obviously an important stage of evolution on the 'stem' of eukaryotes, after they first evolved but before LECA, which included extensive duplication of numerous essential genes, so that the set of ancestral genes approximately doubled <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>The archaeal and bacterial roots of eukaryotes</p>
         </st>
         <p>Eukaryotes are hybrid organisms in terms of both their cellular organization and their gene complement. All eukaryotes seem to possess mitochondria or related organelles derived from &#945;-proteobacteria, whereas Plantae and many groups of Chromalveolata additionally have cyanobacteria-derived plastids <abbrgrp><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr></abbrgrp>. The gene complement of eukaryotes is an uneven mix of genes of apparent archaeal origin, genes of probable bacterial origin, and genes that so far seem eukaryote-specific, without convincing evidence of ancestry in either of the two prokaryote domains (Figure <figr fid="F2">2</figr>). Paradoxical as this might appear, although trees based on rRNA genes and concatenated alignments of information-processing proteins, such as polymerases or splicing proteins, both put archaea and eukaryotes together, genome-wide analyses consistently and independently show that there are three or more times more genes with bacterial homologs than with archaeal homologs <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr></abbrgrp> (Figure <figr fid="F2">2</figr>). The archaeal subset is strongly enriched in information processing functions (translation, transcription, replication, splicing), whereas the bacterial subset consists largely of metabolic enzymes <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B78">78</abbr></abbrgrp> (see below for more details).</p>
         <fig id="F2"><title><p>Figure 2</p></title><caption><p>Breakdown of the genes from two eukaryotes by the putative evolutionary affinities</p></caption><text>
   <p><b>Breakdown of the genes from two eukaryotes by the putative evolutionary affinities</b>. <b>(a) </b>Yeast and <b>(b) </b>red algae. The putative origin of genes was tentatively inferred from the best hits obtained by searching the NCBI non-redundant protein sequence database using the BLASTP program <abbrgrp><abbr bid="B125">125</abbr></abbrgrp>, with all protein sequences from the respective organisms used as queries. Although sequence similarity searches are often regarded as a very rough approximation of the phylogenetic position <abbrgrp><abbr bid="B126">126</abbr></abbrgrp>, the previous analysis of the yeast genome showed a high level of congruence between the best hits and phylogenomic results <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. Major archaeal and bacterial groups are color-coded and denoted 1 to 18; the number of proteins with the best hit to the given groups is indicated. The groups are: 1, Euryarchaeota; 2, Crenarchaeota-Thaumarchaeota-Nanoarchaeota; 3, Firmicutes; 4, &#947;-Proteobacteria; 5, &#945;-Proteobacteria; 6, &#948;- and &#949;-Proteobacteria; 7, &#946;-Proteobacteria; 8, unclassified Proteobacteria; 9, Cyanobacteria; 10, Actinobacteria; 11, Bacteroides-Chlorobi group; 12, Chloroflexi; 13, Planctomycetes; 14, Verrucomicrobia-Chlamydiae-Spirochetes; 15, <it>Deinococcus-Thermus </it>group; 16, Aquificacae and Thermotogae; 17, other bacteria; 18, no archaeal or bacterial homologs.</p>
</text><graphic file="gb-2010-11-5-209-2"/></fig>
         <p>At a coarse level, these observations are best compatible with genome fusion scenarios <abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr></abbrgrp> whereby the eukaryotic genome emerged through a fusion between two ancestral genomes, an archaeal or archaea-related one, and a bacterial, most likely &#945;-proteobacterial, one, given the well-established ancestry of the mitochondrial endosymbiont <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>. However, attempts to pinpoint the specific archaeal and bacterial 'parents' of eukaryotes reveal complicated evolutionary relationships. Although many of the bacterial-like genes in eukaryotes have &#945;-proteobacterial homologues, these are far from dominant amongst the bacterial-like genes which show apparent evolutionary affinities with a variety of bacterial groups (Figure <figr fid="F2">2</figr>). An important cause of this complicated breakdown of the bacterial-like component of the eukaryotic gene complement is the large size of the &#945;-proteobacterial pangenome, that is, of the combined genes found in all &#945;-proteobacteria, and the associated diversity of the gene sets in individual members of this group <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. Thus, without knowing the exact identity within the &#945;-proteobacteria of the bacterial endosymbiont that gave rise to the eukaryotic mitochondria, it is hard to delineate its genetic contribution. Apart from this uncertainty about the gene complement of the endosymbiont, it is impossible to rule out multiple sources of the bacterial-like genes in eukaryotes <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>, which may have origins other than the genome of the bacterial endosymbiont. In particular, whatever the actual nature of the archaeal-like ancestor, it probably lived at moderate temperatures and non-extreme conditions and was consequently in contact with a diverse bacterial community. Modern archaea with such lifestyles have numerous genes of diverse bacterial origins, indicating extensive horizontal acquisition of genes from bacteria <abbrgrp><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr></abbrgrp>. Thus, the archaeal-like host of the endosymbiont could have already had many bacterial genes, partly explaining the observed pattern.</p>
         <p>The case of the archaeal(-like) parent is far more difficult than that of the bacterial ancestor(s) as there are no data on the ancestral lineage that would parallel the unambiguous origin of mitochondria from &#945;-proteobacteria. Phylogenomic studies using different methods point to different archaeal lineages - Crenarchaeota <abbrgrp><abbr bid="B86">86</abbr><abbr bid="B87">87</abbr></abbrgrp>, Euryarchaeota <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>, or an unidentified deep branch <abbrgrp><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr></abbrgrp> - as the candidates for the eukaryote ancestor (Figure <figr fid="F3">3</figr>). Unequivocal resolution of such deep evolutionary relationships is extremely difficult. Moreover, at least one of these analyses <abbrgrp><abbr bid="B89">89</abbr></abbrgrp> explicitly suggests the possibility that the archaeal heritage of eukaryotes is genuinely mixed, with the largest contribution coming from a deep lineage, followed by the contributions from Crenarchaeota (Thaumoarchaeota) and the Euryarchaeota (Figure <figr fid="F3">3</figr>). In the next section I examine the possibility of multiple archaeal and bacterial ancestors of the eukaryotes with respect to distinct functional systems of eukaryotic cells.</p>
         <fig id="F3"><title><p>Figure 3</p></title><caption><p>Possible archaeal origins of eukaryotic genes</p></caption><text>
   <p><b>Possible archaeal origins of eukaryotic genes</b>. The archaeal tree is shown as a bifurcation of Euryarchaeota and the putative second major branch combining Crenarchaeota, Thaumarchaeota, and Korarchaeota <abbrgrp><abbr bid="B127">127</abbr></abbrgrp>; deep, possibly extinct lineages are shown as a single stem.</p>
</text><graphic file="gb-2010-11-5-209-3"/></fig>
      </sec>
      <sec>
         <st>
            <p>Mixed origins of the key functional systems of eukaryotes</p>
         </st>
         <p>Some of the most compelling indications on the course of evolution and the nature of ancestral forms come from signature genes that are uniquely shared by two or more major lineages and from detailed evolutionary analysis of well characterized functional systems, in particular the signature systems of the eukaryotic cell. Comparative genome sequence analysis has revealed that some of the key molecular machines of the eukaryotes, and not only those directly involved in information processing, can be confidently derived from archaeal ancestors (Table <tblr tid="T1">1</tblr> and Figure <figr fid="F4">4</figr>). Strikingly, this archaeal heritage seems to be patchy with respect to the specific origins, with apparent evolutionary affinities to different groups of archaea (Table <tblr tid="T1">1</tblr> and Figure <figr fid="F4">4</figr>). For instance, comparative analysis of the translation system components tends to suggest an affinity between eukaryotes and Crenarchaeota <abbrgrp><abbr bid="B91">91</abbr></abbrgrp>. Similarly, the core transcription machinery of eukaryotes shares some important proteins with Crenarchaeota, Thaumarchaeota and Korarchaeota, to the exclusion of Euryarchaeota <abbrgrp><abbr bid="B92">92</abbr><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr></abbrgrp>. By contrast, the histones, the primary components of nucleosomes, are missing in most of the Crenarchaeota but invariably conserved in Euryarchaeota (and also present in <it>Korarchaeum </it>and some Thaumarchaeota) <abbrgrp><abbr bid="B95">95</abbr></abbrgrp>.</p>
         <tbl id="T1"><title><p>Table 1</p></title><caption><p>Apparent origins of some key functional systems and molecular machines of eukaryotes</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b>System/complex/function</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Inferred origins</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>References</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>DNA replication and repair machinery</p>
         </c>
         <c ca="left">
            <p>Archaeal, with either crenarchaeotal or euryarchaeotal affinities for DNA polymerases and other central replication proteins; a mix of archaeal and bacterial for repair enzymes</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B99">99</abbr>
                  <abbr bid="B100">100</abbr>
                  <abbr bid="B128">128</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Transcription machinery</p>
         </c>
         <c ca="left">
            <p>Archaeal; at least two RNA polymerase subunits of crenarchaeotal/korarchaeotal origin</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B63">63</abbr>
                  <abbr bid="B86">86</abbr>
                  <abbr bid="B89">89</abbr>
                  <abbr bid="B93">93</abbr>
                  <abbr bid="B94">94</abbr>
                  <abbr bid="B129">129</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Translation apparatus, including ribosomes</p>
         </c>
         <c ca="left">
            <p>Mostly archaeal; some aminoacyl-tRNA synthetases displaced with bacterial homologs</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B91">91</abbr>
                  <abbr bid="B130">130</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Cell division and membrane remodeling systems; phagocytosis</p>
         </c>
         <c ca="left">
            <p>Primarily archaeal (Crenarchaeota) but some key regulators like Ras superfamily GTPases of bacterial origin</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B105">105</abbr>
                  <abbr bid="B113">113</abbr>
                  <abbr bid="B114">114</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Cytoskeleton</p>
         </c>
         <c ca="left">
            <p>Primarily archaeal; euryarchaeal affinity for tubulin, crenarchaeotal for actin</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B96">96</abbr>
                  <abbr bid="B105">105</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Proteasome: regulated proteolysis</p>
         </c>
         <c ca="left">
            <p>Archaeal</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B110">110</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ubiquitin signaling: regulated proteolysis and protein topogenesis</p>
         </c>
         <c ca="left">
            <p>Archaeal but origin of some essential components, such as E2 and E3 ubiquitin ligases, uncertain</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B115">115</abbr>
                  <abbr bid="B131">131</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Exosome: regulated RNA degradation</p>
         </c>
         <c ca="left">
            <p>Archaeal</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B132">132</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Nuclear pore complex: nucleocytosolic transport</p>
         </c>
         <c ca="left">
            <p>Bacterial; some key proteins of the nuclear pore complex repetitive and of uncertain origin</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B28">28</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Chromatin/nucleosomes</p>
         </c>
         <c ca="left">
            <p>Complex mix of archaeal and bacterial</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B66">66</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>RNA interference</p>
         </c>
         <c ca="left">
            <p>Hybrid of archaeal and bacterial</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B70">70</abbr>
                  <abbr bid="B133">133</abbr>
                  <abbr bid="B134">134</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Endomembrane system/endoplasmic reticulum</p>
         </c>
         <c ca="left">
            <p>Complex mix of archaeal and bacterial</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B9">9</abbr>
                  <abbr bid="B10">10</abbr>
                  <abbr bid="B105">105</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Mitochondrion/electron transfer chain</p>
         </c>
         <c ca="left">
            <p>Bacterial</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B81">81</abbr>
                  <abbr bid="B135">135</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
   </tblbdy></tbl>
         <fig id="F4"><title><p>Figure 4</p></title><caption><p>Apparent complex origins of some key functional systems of eukaryotes</p></caption><text>
   <p><b>Apparent complex origins of some key functional systems of eukaryotes</b>. The likely origins of proteins and domains are shown by color code for three key functional systems of the eukaryotic cell: <b>(a) </b>B-family DNA polymerases comprising the core of the replication apparatus (triangles show Zn-finger modules; crosses indicate inactivated enzymatic domains; pol, polymerase; exo, exonuclease) <abbrgrp><abbr bid="B100">100</abbr></abbrgrp>; <b>(b) </b>RNA interference (RNAi) machinery (RdRp, RNA-dependent RNA polymerase) <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>; and <b>(c) </b>cell division apparatus (the Vps4 ATPase and Snf7-like proteins comprise the ESCRT-III machinery) and cytoskeleton <abbrgrp><abbr bid="B97">97</abbr><abbr bid="B98">98</abbr><abbr bid="B105">105</abbr><abbr bid="B113">113</abbr></abbrgrp>. The domains are not drawn to scale. The light blue color of the three amino-terminal domains of Pol&#949; indicates the substantial sequence divergence from the homologous domains of other eukaryotic polymerases.</p>
</text><graphic file="gb-2010-11-5-209-4"/></fig>
         <p>Eukaryotic cell division components are also conserved in several but not all of the major archaeal lineages. For example, homologs of the ESCRT-III complex, which performs key roles in vesicle biogenesis and cytokinesis in eukaryotes, are responsible for cell division in the Crenarchaeota but are missing in most of the Euryarchaeota, which possess a bacterial-like division mechanism using the GTPase FtsZ, a distant homolog of tubulin <abbrgrp><abbr bid="B96">96</abbr><abbr bid="B97">97</abbr></abbrgrp>. However, a few members of the Euryarchaeota have both systems, with FtsZ probably responsible for division and ESCRT-III for vesicle biogenesis <abbrgrp><abbr bid="B98">98</abbr></abbrgrp>.</p>
         <p>Eukaryote B-family DNA polymerases, a group of four paralogs that are collectively responsible for genome replication, show a complex pattern of ancestry (Figure <figr fid="F4">4</figr>): one branch of the eukaryotic polymerases seems to have evolved from archaeal PolBI, which is conserved in all archaea, whereas the other branch appears to derive from the Crenarchaea-specific PolBII <abbrgrp><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr></abbrgrp>. Surprisingly, the eukaryotic polymerases additionally contain a Zn-finger domain homologous to that of PolD, which is restricted to Euryarchaeota <abbrgrp><abbr bid="B100">100</abbr></abbrgrp>; furthermore, the small subunits of eukaryotic Pol&#945; and Pol&#948; are inactivated derivatives of the exonuclease subunit of PolD <abbrgrp><abbr bid="B101">101</abbr></abbrgrp>.</p>
         <p>Another major theme emerging from these studies is the bacterial contribution and the formation of archaeao-bacterial chimeras (Table <tblr tid="T1">1</tblr> and Figure <figr fid="F4">4</figr>). A clear-cut case of a chimeric eukaryotic system is the RNA interference machinery, in which one of the key proteins, the endonuclease Dicer, consists of two bacterial RNAse III domains and a helicase domain of apparent euryarchaeal origin, and the other essential protein, Argonaute, also shows a euryarchaeal affinity (Figure <figr fid="F4">4</figr>) <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B102">102</abbr></abbrgrp>. The nuclear pore complex, a quintessential eukaryotic molecular machine, does not show any indications of archaeal ancestry but rather consists of proteins of apparent bacterial origin combined with proteins consisting of simple repeats whose provenance is difficult to ascertain <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
         <p>These observations suggest that the archaeal ancestor of eukaryotes combined a variety of features found separately in diverse extant archaea. This inference is consistent with the results of phylogenomic analysis and evolutionary reconstruction discussed above. Thus, the currently existing archaeal lineages probably evolved by differential streamlining, or reductive evolution of the complex ancestral forms, whereas eukaryotes largely retained the ancestral complexity. The diverse origins of eukaryotic functional systems has major implications for how eukaryotes originated, as explained below.</p>
      </sec>
      <sec>
         <st>
            <p>Eukaryogenesis: where did the eukaryotes come from?</p>
         </st>
         <p>The results of comparative genomics and ultrastructural studies do not yet definitively show where the eukaryotic cell came from, but they do offer important insights. Box <figr fid="F6">1</figr> lists the key observations that must be included in any evolutionary scenario for the evolution of eukaryotes (called eukaryogenesis) and summarizes the two alternative scenarios, which are depicted in Figure <figr fid="F5">5</figr>. The main issue revolves around the role of endosymbiosis <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B103">103</abbr><abbr bid="B104">104</abbr></abbrgrp>: was it the cause of the entire chain of events that led to the emergence of LECA (the stem phase of evolution), as proposed by the symbiogenesis scenario, or was it a step in the evolution of the already formed eukaryotic cell, as proposed by the archaezoan scenario? In other words, was the host of the &#945;-proteobacterial symbiont (the future mitochondrion) a prokaryote (as in the symbiogenesis scenario) or an amitochondrial eukaryote, an archaezoan?</p>
         <fig id="F5"><title><p>Figure 5</p></title><caption><p>The two alternative scenarios of eukaryogenesis</p></caption><text>
   <p><b>The two alternative scenarios of eukaryogenesis</b>. <b>(a) </b>The archaezoan scenario; <b>(b) </b>the symbiogenesis scenario. The putative archaeal or archaezoan hosts of the &#945;-proteobacterial endosymbiont are shown with elements of their cytoskeleton and cell division apparatus colored as in Figure 4.</p>
</text><graphic file="gb-2010-11-5-209-5"/></fig>
         <fig id="F6"><title><p>Box 1</p></title><caption><p>General concepts in the evolution of the eukaryotes</p></caption><text>
   <p>
      <b>General concepts in the evolution of the eukaryotes</b>
   </p>
</text><graphic file="gb-2010-11-5-209-6"/></fig>
         <p>Given that eukaryogenesis may have been a unique event and that intermediate stages in the process cannot be seen, these questions are enormously difficult, and final answers might not be attainable. But the symbiogenesis scenario seems to be more plausible than the archaezoan scenario <abbrgrp><abbr bid="B105">105</abbr></abbrgrp>, for three main reasons. First, under the archaezoan scenario, there is no plausible selective force behind the evolution of the nucleus, and in particular the elaborate nuclear pore complex. The nucleus disrupts the transcription-translation coupling that is typical of bacteria and archaea <abbrgrp><abbr bid="B106">106</abbr><abbr bid="B107">107</abbr><abbr bid="B108">108</abbr></abbrgrp> and necessitates the evolution of the time- and energy-consuming mechanism of nucleocytosolic transport of mRNA. By contrast, the symbiogenesis hypothesis offers a plausible selective factor: defense against the invasion of the host genome by Group II self-splicing introns, which are abundant in &#945;-proteobacteria and could have been unleashed as a result of exposure of the archaeal host genome to the bacterial endosymbiont DNA; these would disrupt gene expression unless transcription and translation were decoupled and compartmentalized <abbrgrp><abbr bid="B106">106</abbr></abbrgrp>. At least some additional innovations of eukaryogenesis, such as the evolution of the nonsense-mediated decay of transcripts containing premature stop codons and expansion of the ubiquitin system, can be envisaged as part of the same chain of adaptations to the intron bombardment as the origin of the nucleus <abbrgrp><abbr bid="B109">109</abbr></abbrgrp> (Figure <figr fid="F5">5</figr>).</p>
         <p>Second, functional studies in prokaryotes, particularly archaea, show that not only the molecular components of the several signature eukaryotic systems but also their actual structures and functions have evolved in archaea and thus predate eukaryogenesis. These include the archaeal proteasome <abbrgrp><abbr bid="B110">110</abbr></abbrgrp>, exosome <abbrgrp><abbr bid="B111">111</abbr></abbrgrp> and Sm-protein complex, the progenitor of the spliceosome <abbrgrp><abbr bid="B112">112</abbr></abbrgrp>, the ESCRT-III membrane remodeling system <abbrgrp><abbr bid="B113">113</abbr><abbr bid="B114">114</abbr></abbrgrp>, actin-like proteins <abbrgrp><abbr bid="B105">105</abbr></abbrgrp> and a prototype of the ubiquitin system of protein modification <abbrgrp><abbr bid="B115">115</abbr></abbrgrp>. Each of these molecular machines found in different groups of archaea has been shown or predicted to be mechanistically similar to the eukaryotic counterpart, but they all function within the prokaryotic cell. The endomembrane system and the nucleus are dramatic exceptions, and so are the mitochondria themselves. It is tempting to connect these dots by proposing that eukaryogenesis was triggered by endosymbiosis, and that the endomembrane systems including the nucleus evolved as defense against invasion of Group II introns and perhaps foreign DNA in general <abbrgrp><abbr bid="B106">106</abbr><abbr bid="B109">109</abbr></abbrgrp>. It does not seem accidental that many key components of these endomembrane systems seem to be of bacterial origin whereas others are repetitive proteins that might have evolved <it>de novo </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Under the symbiogenesis scenario, diverse pre-existing systems of the archaeal host were co-opted and expanded within the emerging eukaryotic cellular organization <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>.</p>
         <p>Several arguments can be and have been put forward against the symbiogenesis scenario. First, prokaryotic endosymbionts in prokaryotic hosts are not widespread, prompting the view that phagocytosis, which is apparently unique to eukaryotic cells, was critical for the acquisition of the mitochondrion <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. This argument is not compelling because: (1) eukaryogenesis is extremely rare, probably unique, in the history of life; (2) endosymbiotic bacteria within other bacteria are rare but known <abbrgrp><abbr bid="B116">116</abbr><abbr bid="B117">117</abbr><abbr bid="B118">118</abbr></abbrgrp>, and intracellular bacterial predation has been suggested as a potential route to endosymbiosis <abbrgrp><abbr bid="B119">119</abbr></abbrgrp>; and (3) recent observations on membrane remodeling systems and actin-like proteins in archaea suggest the possibility of still unexplored mechanisms for engulfment of other prokaryotes, perhaps resembling primitive phagocytosis <abbrgrp><abbr bid="B105">105</abbr></abbrgrp>.</p>
         <p>Second, a potentially strong argument against the symbiogenesis scenario could be the existence of a substantial number of eukaryote signature proteins (ESPs), so far found only in eukaryotes <abbrgrp><abbr bid="B120">120</abbr></abbrgrp>. The provenance of ESPs is an intriguing question. However, on many occasions, careful sequence and structure searches have revealed archaeal and/or bacterial homologs of proteins originally considered ESPs, or else the existence of such homologs became obvious with the appearance of new genomes <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. The discovery of prokaryotic homologs of tubulin, actin and ubiquitin are well known examples <abbrgrp><abbr bid="B71">71</abbr><abbr bid="B97">97</abbr></abbrgrp>, and more recent cases include the GINS proteins, which are involved in DNA replication <abbrgrp><abbr bid="B121">121</abbr></abbrgrp>, the ESCRT-III systems and the subunits of the TRAPP3 complex, which have a key role in eukaryotic vesicle trafficking <abbrgrp><abbr bid="B122">122</abbr></abbrgrp>. Under the symbiogenesis scenario, the former and remaining ESPs result primarily from acceleration of evolution of genes whose functions have substantially changed during eukaryogenesis.</p>
         <p>A third, potentially serious difficulty with the symbiogenesis scenario is that neither archaeal-like nor bacterial-like genes can be traced to a single prokaryotic lineage (although the origin of the mitochondria from &#945;-proteobacteria is well established). However, the pangenomes of prokaryotes are large whereas the gene composition of individual organisms is highly flexible <abbrgrp><abbr bid="B123">123</abbr><abbr bid="B124">124</abbr></abbrgrp>, so reconstruction of the actual partners of the endosymbiosis that led to eukaryogenesis might not be feasible from a limited set of extant genomes. Moreover, many if not most archaea and bacteria might have evolved by streamlining, so eukaryogenesis could have been triggered by symbiosis between two prokaryotes with complex genomes.</p>
         <p>In short, it is currently impossible to strictly rule out the possibility that the key eukaryotic innovations evolved independently from and prior to the mitochondrial endosymbiosis. In other words, the host of the endosymbiont might have been an archaezoan. However, the archaezoan scenario does not provide a plausible staging of events during the evolution of the complex internal organization of the eukaryotic cell, does not offer a <it>raison d'&#234;tre </it>for the nucleus, and does not account for the presence of signature functional systems of eukaryotes in different archaeal lineages. In contrast, the symbiogenesis scenario can tie all these diverse lines of evidence into a coherent, even if still woefully incomplete, narrative.</p>
         <p>Comparative genomics has so far neither solved the enigma of eukaryogenesis nor offered a definitive picture of the primary radiation of the major eukaryote lineages. However, although falling short of decisive answers, phylogenomic analysis has yielded many insights into the origin and earliest stages of evolution of eukaryotes. Recent findings indicate that several key cellular systems of eukaryotes exist in archaea. The scattering of these systems among different archaeal lineages, along with the phylogenies of conserved proteins, suggests that the archaeal ancestor of eukaryotes belonged to a deep, possibly extinct archaeal branch with a highly complex genome and diverse cellular functionalities. In contrast, the endomembrane systems of eukaryotes, and in particular the nucleus with its elaborate nuclear pore complex, are not found in archaea, and seem to be derived, at least in part, from bacterial ancestral components. These findings seem to be best compatible with a symbiogenesis scenario for the origin of eukaryotes under which eukaryogenesis was triggered by the endosymbiosis of an &#945;-proteobacterium with an ancestral archaeon, with the nucleus evolving as a defense against intron invasion.</p>
         <p>Phylogenomic analysis has clarified the evolutionary links between major groups of eukaryotes and led to the delineation of five or six supergroups. The relationships between the supergroups and the root position in the tree of eukaryotes remain extremely difficult to decipher, probably owing to a compressed cladogenesis or 'big bang' phase of evolution that followed eukaryogenesis. The expanding sampling of genomes from diverse branches of life is far from being a trivial pursuit, but has rather delivered unexpected biological insights.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>I thank Yuri Wolf for providing the data used in Figure <figr fid="F2">2</figr>, Bill Martin for helpful discussions and Tania Senkevich for critical reading of the manuscript. The author's research is supported by the DHHS (National Library of Medicine) intramural funds.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>Reconstructing/deconstructing the earliest eukaryotes: how comparative genomics can help.</p></title><aug><au><snm>Dacks</snm><fnm>JB</fnm></au><au><snm>Doolittle</snm><fnm>WF</fnm></au></aug><source>Cell</source><pubdate>2001</pubdate><volume>107</volume><fpage>419</fpage><lpage>425</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(01)00584-0</pubid><pubid idtype="pmpid" link="fulltext">11719183</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Eukaryotic evolution, changes and challenges.</p></title><aug><au><snm>Embley</snm><fnm>TM</fnm></au><au><snm>Martin</snm><fnm>W</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>440</volume><fpage>623</fpage><lpage>630</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04546</pubid><pubid idtype="pmpid" link="fulltext">16572163</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Genomics and the irreducible nature of eukaryote cells.</p></title><aug><au><snm>Kurland</snm><fnm>CG</fnm></au><au><snm>Collins</snm><fnm>LJ</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>312</volume><fpage>1011</fpage><lpage>1014</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1121674</pubid><pubid idtype="pmpid" link="fulltext">16709776</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>On the origin of intracellular compartmentation and organized metabolic systems.</p></title><aug><au><snm>Ovadi</snm><fnm>J</fnm></au><au><snm>Saks</snm><fnm>V</fnm></au></aug><source>Mol Cell Biochem</source><pubdate>2004</pubdate><volume>256-257</volume><fpage>5</fpage><lpage>12</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1023/B:MCBI.0000009855.14648.2c</pubid><pubid idtype="pmpid" link="fulltext">14977166</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Evolutionary origins of metabolic compartmentalization in eukaryotes.</p></title><aug><au><snm>Martin</snm><fnm>W</fnm></au></aug><source>Philos Trans R Soc Lond B Biol Sci</source><volume>365</volume><fpage>847</fpage><lpage>855</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20124349</pubid></xrefbib></bibl><bibl id="B6"><title><p>Origin of eukaryotic endomembranes: a critical evaluation of different model scenarios.</p></title><aug><au><snm>Jekely</snm><fnm>G</fnm></au></aug><source>Adv Exp Med Biol</source><pubdate>2007</pubdate><volume>607</volume><fpage>38</fpage><lpage>51</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmpid" link="fulltext">17977457</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Organization of mammalian cytoplasm.</p></title><aug><au><snm>Hudder</snm><fnm>A</fnm></au><au><snm>Nathanson</snm><fnm>L</fnm></au><au><snm>Deutscher</snm><fnm>MP</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2003</pubdate><volume>23</volume><fpage>9318</fpage><lpage>9326</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.23.24.9318-9326.2003</pubid><pubid idtype="pmcid">309675</pubid><pubid idtype="pmpid" link="fulltext">14645541</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The degree of macromolecular crowding in the cytoplasm and nucleoplasm of mammalian cells is conserved.</p></title><aug><au><snm>Guigas</snm><fnm>G</fnm></au><au><snm>Kalla</snm><fnm>C</fnm></au><au><snm>Weiss</snm><fnm>M</fnm></au></aug><source>FEBS Lett</source><pubdate>2007</pubdate><volume>581</volume><fpage>5094</fpage><lpage>5098</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.febslet.2007.09.054</pubid><pubid idtype="pmpid" link="fulltext">17923125</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Evolution of specificity in the eukaryotic endomembrane system.</p></title><aug><au><snm>Dacks</snm><fnm>JB</fnm></au><au><snm>Peden</snm><fnm>AA</fnm></au><au><snm>Field</snm><fnm>MC</fnm></au></aug><source>Int J Biochem Cell Biol</source><pubdate>2009</pubdate><volume>41</volume><fpage>330</fpage><lpage>340</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.biocel.2008.08.041</pubid><pubid idtype="pmpid" link="fulltext">18835459</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>First and last ancestors: reconstructing evolution of the endomembrane system with ESCRTs, vesicle coat proteins, and nuclear pore complexes.</p></title><aug><au><snm>Field</snm><fnm>MC</fnm></au><au><snm>Dacks</snm><fnm>JB</fnm></au></aug><source>Curr Opin Cell Biol</source><pubdate>2009</pubdate><volume>21</volume><fpage>4</fpage><lpage>13</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ceb.2008.12.004</pubid><pubid idtype="pmpid">19201590</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Degenerate mitochondria.</p></title><aug><au><snm>Giezen</snm><mnm>van der</mnm><fnm>M</fnm></au><au><snm>Tovar</snm><fnm>J</fnm></au></aug><source>EMBO Rep</source><pubdate>2005</pubdate><volume>6</volume><fpage>525</fpage><lpage>530</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.embor.7400440</pubid><pubid idtype="pmcid">1369098</pubid><pubid idtype="pmpid" link="fulltext">15940286</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Diversity and reductive evolution of mitochondria among microbial eukaryotes.</p></title><aug><au><snm>Hjort</snm><fnm>K</fnm></au><au><snm>Goldberg</snm><fnm>AV</fnm></au><au><snm>Tsaousis</snm><fnm>AD</fnm></au><au><snm>Hirt</snm><fnm>RP</fnm></au><au><snm>Embley</snm><fnm>TM</fnm></au></aug><source>Philos Trans R Soc Lond B Biol Sci</source><volume>365</volume><fpage>713</fpage><lpage>727</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20124340</pubid></xrefbib></bibl><bibl id="B13"><title><p>Hydrogenosomes and mitosomes: conservation and evolution of functions.</p></title><aug><au><snm>Giezen</snm><mnm>van der</mnm><fnm>M</fnm></au></aug><source>J Eukaryot Microbiol</source><pubdate>2009</pubdate><volume>56</volume><fpage>221</fpage><lpage>231</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1550-7408.2009.00407.x</pubid><pubid idtype="pmpid" link="fulltext">19527349</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>The age of crosstalk: phosphorylation, ubiquitination, and beyond.</p></title><aug><au><snm>Hunter</snm><fnm>T</fnm></au></aug><source>Mol Cell</source><pubdate>2007</pubdate><volume>28</volume><fpage>730</fpage><lpage>738</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2007.11.019</pubid><pubid idtype="pmpid" link="fulltext">18082598</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Protein tyrosine kinase structure and function.</p></title><aug><au><snm>Hubbard</snm><fnm>SR</fnm></au><au><snm>Till</snm><fnm>JH</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>2000</pubdate><volume>69</volume><fpage>373</fpage><lpage>398</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.69.1.373</pubid><pubid idtype="pmpid" link="fulltext">10966463</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Kinome signaling through regulated protein-protein interactions in normal and cancer cells.</p></title><aug><au><snm>Pawson</snm><fnm>T</fnm></au><au><snm>Kofler</snm><fnm>M</fnm></au></aug><source>Curr Opin Cell Biol</source><pubdate>2009</pubdate><volume>21</volume><fpage>147</fpage><lpage>153</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ceb.2009.02.005</pubid><pubid idtype="pmpid">19299117</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Specificity in signal transduction: from phosphotyrosine-SH2 domain interactions to complex cellular systems.</p></title><aug><au><snm>Pawson</snm><fnm>T</fnm></au></aug><source>Cell</source><pubdate>2004</pubdate><volume>116</volume><fpage>191</fpage><lpage>203</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(03)01077-8</pubid><pubid idtype="pmpid" link="fulltext">14744431</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Kinomics: methods for deciphering the kinome.</p></title><aug><au><snm>Johnson</snm><fnm>SA</fnm></au><au><snm>Hunter</snm><fnm>T</fnm></au></aug><source>Nat Methods</source><pubdate>2005</pubdate><volume>2</volume><fpage>17</fpage><lpage>25</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth731</pubid><pubid idtype="pmpid" link="fulltext">15789031</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Ubiquitin-binding domains - from structures to functions.</p></title><aug><au><snm>Dikic</snm><fnm>I</fnm></au><au><snm>Wakatsuki</snm><fnm>S</fnm></au><au><snm>Walters</snm><fnm>KJ</fnm></au></aug><source>Nat Rev Mol Cell Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>659</fpage><lpage>671</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrm2767</pubid><pubid idtype="pmpid" link="fulltext">19773779</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>The ubiquitin system.</p></title><aug><au><snm>Hershko</snm><fnm>A</fnm></au><au><snm>Ciechanover</snm><fnm>A</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>1998</pubdate><volume>67</volume><fpage>425</fpage><lpage>479</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.67.1.425</pubid><pubid idtype="pmpid" link="fulltext">9759494</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Ubiquitin-mediated proteolysis: biological regulation via destruction.</p></title><aug><au><snm>Ciechanover</snm><fnm>A</fnm></au><au><snm>Orian</snm><fnm>A</fnm></au><au><snm>Schwartz</snm><fnm>AL</fnm></au></aug><source>Bioessays</source><pubdate>2000</pubdate><volume>22</volume><fpage>442</fpage><lpage>451</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1521-1878(200005)22:5&lt;442::AID-BIES6&gt;3.0.CO;2-Q</pubid><pubid idtype="pmpid" link="fulltext">10797484</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Biochemical principles of small RNA pathways.</p></title><aug><au><snm>Liu</snm><fnm>Q</fnm></au><au><snm>Paroo</snm><fnm>Z</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>2010</pubdate><inpress/><xrefbib><pubid idtype="pmpid" link="fulltext">20205586</pubid></xrefbib></bibl><bibl id="B23"><title><p>Specialization and evolution of endogenous small RNA pathways.</p></title><aug><au><snm>Chapman</snm><fnm>EJ</fnm></au><au><snm>Carrington</snm><fnm>JC</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2007</pubdate><volume>8</volume><fpage>884</fpage><lpage>896</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2179</pubid><pubid idtype="pmpid" link="fulltext">17943195</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>The eukaryotic genome as an RNA machine.</p></title><aug><au><snm>Amaral</snm><fnm>PP</fnm></au><au><snm>Dinger</snm><fnm>ME</fnm></au><au><snm>Mercer</snm><fnm>TR</fnm></au><au><snm>Mattick</snm><fnm>JS</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>319</volume><fpage>1787</fpage><lpage>1789</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1155472</pubid><pubid idtype="pmpid" link="fulltext">18369136</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs.</p></title><aug><au><snm>Jacquier</snm><fnm>A</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2009</pubdate><volume>10</volume><fpage>833</fpage><lpage>844</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2683</pubid><pubid idtype="pmpid" link="fulltext">19920851</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Shifting players and paradigms in cell-specific transcription.</p></title><aug><au><snm>D'Alessio</snm><fnm>JA</fnm></au><au><snm>Wright</snm><fnm>KJ</fnm></au><au><snm>Tjian</snm><fnm>R</fnm></au></aug><source>Mol Cell</source><pubdate>2009</pubdate><volume>36</volume><fpage>924</fpage><lpage>931</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2009.12.011</pubid><pubid idtype="pmpid" link="fulltext">20064459</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Evolution of transcriptional control in mammals.</p></title><aug><au><snm>Wilson</snm><fnm>MD</fnm></au><au><snm>Odom</snm><fnm>DT</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>2009</pubdate><volume>19</volume><fpage>579</fpage><lpage>585</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gde.2009.10.003</pubid><pubid idtype="pmpid" link="fulltext">19913406</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Comparative genomics, evolution and origins of the nuclear envelope and nuclear pore complex.</p></title><aug><au><snm>Mans</snm><fnm>BJ</fnm></au><au><snm>Anantharaman</snm><fnm>V</fnm></au><au><snm>Aravind</snm><fnm>L</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Cell Cycle</source><pubdate>2004</pubdate><volume>3</volume><fpage>1612</fpage><lpage>1637</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15611647</pubid></xrefbib></bibl><bibl id="B29"><title><p>Complex spliceosomal organization ancestral to extant eukaryotes.</p></title><aug><au><snm>Collins</snm><fnm>L</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2005</pubdate><volume>22</volume><fpage>1053</fpage><lpage>1066</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msi091</pubid><pubid idtype="pmpid" link="fulltext">15659557</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution.</p></title><aug><au><snm>Dagan</snm><fnm>T</fnm></au><au><snm>Artzy-Randrup</snm><fnm>Y</fnm></au><au><snm>Martin</snm><fnm>W</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>10039</fpage><lpage>10044</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0800679105</pubid><pubid idtype="pmcid">2474566</pubid><pubid idtype="pmpid" link="fulltext">18632554</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Entrez Genome</p></title><url>http://www.ncbi.nlm.nih.gov/sites/genome</url></bibl><bibl id="B32"><title><p>Reconstructing early events in eukaryotic evolution.</p></title><aug><au><snm>Roger</snm><fnm>AJ</fnm></au></aug><source>Am Nat</source><pubdate>1999</pubdate><volume>154 Suppl 4</volume><fpage>S146</fpage><lpage>S163</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1086/303290</pubid><pubid idtype="pmpid" link="fulltext">10527924</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Ancestral relationships of the major eukaryotic lineages.</p></title><aug><au><snm>Sogin</snm><fnm>ML</fnm></au><au><snm>Morrison</snm><fnm>HG</fnm></au><au><snm>Hinkle</snm><fnm>G</fnm></au><au><snm>Silberman</snm><fnm>JD</fnm></au></aug><source>Microbiologia</source><pubdate>1996</pubdate><volume>12</volume><fpage>17</fpage><lpage>28</lpage><xrefbib><pubid idtype="pmpid">9019131</pubid></xrefbib></bibl><bibl id="B34"><title><p>A kingdom-level phylogeny of eukaryotes based on combined protein data.</p></title><aug><au><snm>Baldauf</snm><fnm>SL</fnm></au><au><snm>Roger</snm><fnm>AJ</fnm></au><au><snm>Wenk-Siefert</snm><fnm>I</fnm></au><au><snm>Doolittle</snm><fnm>WF</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>290</volume><fpage>972</fpage><lpage>977</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.290.5493.972</pubid><pubid idtype="pmpid" link="fulltext">11062127</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The deep roots of eukaryotes.</p></title><aug><au><snm>Baldauf</snm><fnm>SL</fnm></au></aug><source>Science</source><pubdate>2003</pubdate><volume>300</volume><fpage>1703</fpage><lpage>1706</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1085544</pubid><pubid idtype="pmpid" link="fulltext">12805537</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation.</p></title><aug><au><snm>Roger</snm><fnm>AJ</fnm></au><au><snm>Hug</snm><fnm>LA</fnm></au></aug><source>Philos Trans R Soc Lond B Biol Sci</source><pubdate>2006</pubdate><volume>361</volume><fpage>1039</fpage><lpage>1054</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rstb.2006.1845</pubid><pubid idtype="pmcid">1578731</pubid><pubid idtype="pmpid" link="fulltext">16754613</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>The diversity of eukaryotes and the root of the eukaryotic tree.</p></title><aug><au><snm>Brinkmann</snm><fnm>H</fnm></au><au><snm>Philippe</snm><fnm>H</fnm></au></aug><source>Adv Exp Med Biol</source><pubdate>2007</pubdate><volume>607</volume><fpage>20</fpage><lpage>37</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmpid" link="fulltext">17977456</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Origins of microsporidia.</p></title><aug><au><snm>Keeling</snm><fnm>PJ</fnm></au><au><snm>McFadden</snm><fnm>GI</fnm></au></aug><source>Trends Microbiol</source><pubdate>1998</pubdate><volume>6</volume><fpage>19</fpage><lpage>23</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0966-842X(97)01185-2</pubid><pubid idtype="pmpid" link="fulltext">9481819</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>The new phylogeny of eukaryotes.</p></title><aug><au><snm>Philippe</snm><fnm>H</fnm></au><au><snm>Germot</snm><fnm>A</fnm></au><au><snm>Moreira</snm><fnm>D</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>2000</pubdate><volume>10</volume><fpage>596</fpage><lpage>601</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0959-437X(00)00137-4</pubid><pubid idtype="pmpid" link="fulltext">11088007</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Evolutionary position of breviate amoebae and the primary eukaryote divergence.</p></title><aug><au><snm>Minge</snm><fnm>MA</fnm></au><au><snm>Silberman</snm><fnm>JD</fnm></au><au><snm>Orr</snm><fnm>RJ</fnm></au><au><snm>Cavalier-Smith</snm><fnm>T</fnm></au><au><snm>Shalchian-Tabrizi</snm><fnm>K</fnm></au><au><snm>Burki</snm><fnm>F</fnm></au><au><snm>Skjaeveland</snm><fnm>A</fnm></au><au><snm>Jakobsen</snm><fnm>KS</fnm></au></aug><source>Proc Biol Sci</source><pubdate>2009</pubdate><volume>276</volume><fpage>597</fpage><lpage>604</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rspb.2008.1358</pubid><pubid idtype="pmcid">2660946</pubid><pubid idtype="pmpid" link="fulltext">19004754</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Genomes as documents of evolutionary history.</p></title><aug><au><snm>Boussau</snm><fnm>B</fnm></au><au><snm>Daubin</snm><fnm>V</fnm></au></aug><source>Trends Ecol Evol</source><pubdate>2009</pubdate><volume>25</volume><fpage>224</fpage><lpage>232</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tree.2009.09.007</pubid><pubid idtype="pmpid" link="fulltext">19880211</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Phylogenomics and the reconstruction of the tree of life.</p></title><aug><au><snm>Delsuc</snm><fnm>F</fnm></au><au><snm>Brinkmann</snm><fnm>H</fnm></au><au><snm>Philippe</snm><fnm>H</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2005</pubdate><volume>6</volume><fpage>361</fpage><lpage>375</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1603</pubid><pubid idtype="pmpid" link="fulltext">15861208</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>The new higher level classification of eukaryotes with emphasis on the taxonomy of protists.</p></title><aug><au><snm>Adl</snm><fnm>SM</fnm></au><au><snm>Simpson</snm><fnm>AG</fnm></au><au><snm>Farmer</snm><fnm>MA</fnm></au><au><snm>Andersen</snm><fnm>RA</fnm></au><au><snm>Anderson</snm><fnm>OR</fnm></au><au><snm>Barta</snm><fnm>JR</fnm></au><au><snm>Bowser</snm><fnm>SS</fnm></au><au><snm>Brugerolle</snm><fnm>G</fnm></au><au><snm>Fensome</snm><fnm>RA</fnm></au><au><snm>Fredericq</snm><fnm>S</fnm></au><au><snm>James</snm><fnm>TY</fnm></au><au><snm>Karpov</snm><fnm>S</fnm></au><au><snm>Kugrens</snm><fnm>P</fnm></au><au><snm>Krug</snm><fnm>J</fnm></au><au><snm>Lane</snm><fnm>CE</fnm></au><au><snm>Lewis</snm><fnm>LA</fnm></au><au><snm>Lodge</snm><fnm>J</fnm></au><au><snm>Lynn</snm><fnm>DH</fnm></au><au><snm>Mann</snm><fnm>DG</fnm></au><au><snm>McCourt</snm><fnm>RM</fnm></au><au><snm>Mendoza</snm><fnm>L</fnm></au><au><snm>Moestrup</snm><fnm>O</fnm></au><au><snm>Mozley-Standridge</snm><fnm>SE</fnm></au><au><snm>Nerad</snm><fnm>TA</fnm></au><au><snm>Shearer</snm><fnm>CA</fnm></au><au><snm>Smirnov</snm><fnm>AV</fnm></au><au><snm>Spiegel</snm><fnm>FW</fnm></au><au><snm>Taylor</snm><fnm>MF</fnm></au></aug><source>J Eukaryot Microbiol</source><pubdate>2005</pubdate><volume>52</volume><fpage>399</fpage><lpage>451</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1550-7408.2005.00053.x</pubid><pubid idtype="pmpid" link="fulltext">16248873</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Genomics. Deep questions in the tree of life.</p></title><aug><au><snm>Keeling</snm><fnm>PJ</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>317</volume><fpage>1875</fpage><lpage>1876</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1149593</pubid><pubid idtype="pmpid" link="fulltext">17901321</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>The tree of eukaryotes.</p></title><aug><au><snm>Keeling</snm><fnm>PJ</fnm></au><au><snm>Burger</snm><fnm>G</fnm></au><au><snm>Durnford</snm><fnm>DG</fnm></au><au><snm>Lang</snm><fnm>BF</fnm></au><au><snm>Lee</snm><fnm>RW</fnm></au><au><snm>Pearlman</snm><fnm>RE</fnm></au><au><snm>Roger</snm><fnm>AJ</fnm></au><au><snm>Gray</snm><fnm>MW</fnm></au></aug><source>Trends Ecol Evol</source><pubdate>2005</pubdate><volume>20</volume><fpage>670</fpage><lpage>676</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tree.2005.09.005</pubid><pubid idtype="pmpid" link="fulltext">16701456</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Evaluating support for the current classification of eukaryotic diversity.</p></title><aug><au><snm>Parfrey</snm><fnm>LW</fnm></au><au><snm>Barbero</snm><fnm>E</fnm></au><au><snm>Lasser</snm><fnm>E</fnm></au><au><snm>Dunthorn</snm><fnm>M</fnm></au><au><snm>Bhattacharya</snm><fnm>D</fnm></au><au><snm>Patterson</snm><fnm>DJ</fnm></au><au><snm>Katz</snm><fnm>LA</fnm></au></aug><source>PLoS Genet</source><pubdate>2006</pubdate><volume>2</volume><fpage>e220</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020220</pubid><pubid idtype="pmcid">1713255</pubid><pubid idtype="pmpid" link="fulltext">17194223</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Phylogenomics reshuffles the eukaryotic supergroups.</p></title><aug><au><snm>Burki</snm><fnm>F</fnm></au><au><snm>Shalchian-Tabrizi</snm><fnm>K</fnm></au><au><snm>Minge</snm><fnm>M</fnm></au><au><snm>Skjaeveland</snm><fnm>A</fnm></au><au><snm>Nikolaev</snm><fnm>SI</fnm></au><au><snm>Jakobsen</snm><fnm>KS</fnm></au><au><snm>Pawlowski</snm><fnm>J</fnm></au></aug><source>PLoS ONE</source><pubdate>2007</pubdate><volume>2</volume><fpage>e790</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0000790</pubid><pubid idtype="pmcid">1949142</pubid><pubid idtype="pmpid" link="fulltext">17726520</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Broadly sampled multigene trees of eukaryotes.</p></title><aug><au><snm>Yoon</snm><fnm>HS</fnm></au><au><snm>Grant</snm><fnm>J</fnm></au><au><snm>Tekle</snm><fnm>YI</fnm></au><au><snm>Wu</snm><fnm>M</fnm></au><au><snm>Chaon</snm><fnm>BC</fnm></au><au><snm>Cole</snm><fnm>JC</fnm></au><au><snm>Logsdon</snm><fnm>JM</fnm><suf>Jr</suf></au><au><snm>Patterson</snm><fnm>DJ</fnm></au><au><snm>Bhattacharya</snm><fnm>D</fnm></au><au><snm>Katz</snm><fnm>LA</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2008</pubdate><volume>8</volume><fpage>14</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-8-14</pubid><pubid idtype="pmcid">2249577</pubid><pubid idtype="pmpid" link="fulltext">18205932</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Global eukaryote phylogeny: combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata.</p></title><aug><au><snm>Moreira</snm><fnm>D</fnm></au><au><snm>Heyden</snm><mnm>von der</mnm><fnm>S</fnm></au><au><snm>Bass</snm><fnm>D</fnm></au><au><snm>Lopez-Garcia</snm><fnm>P</fnm></au><au><snm>Chao</snm><fnm>E</fnm></au><au><snm>Cavalier-Smith</snm><fnm>T</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2007</pubdate><volume>44</volume><fpage>255</fpage><lpage>266</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2006.11.001</pubid><pubid idtype="pmpid" link="fulltext">17174576</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Phylogenomic analysis supports the monophyly of cryptophytes and haptophytes and the association of rhizaria with chromalveolates.</p></title><aug><au><snm>Hackett</snm><fnm>JD</fnm></au><au><snm>Yoon</snm><fnm>HS</fnm></au><au><snm>Li</snm><fnm>S</fnm></au><au><snm>Reyes-Prieto</snm><fnm>A</fnm></au><au><snm>Rummele</snm><fnm>SE</fnm></au><au><snm>Bhattacharya</snm><fnm>D</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>1702</fpage><lpage>1713</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm089</pubid><pubid idtype="pmpid" link="fulltext">17488740</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic "supergroups".</p></title><aug><au><snm>Hampl</snm><fnm>V</fnm></au><au><snm>Hug</snm><fnm>L</fnm></au><au><snm>Leigh</snm><fnm>JW</fnm></au><au><snm>Dacks</snm><fnm>JB</fnm></au><au><snm>Lang</snm><fnm>BF</fnm></au><au><snm>Simpson</snm><fnm>AG</fnm></au><au><snm>Roger</snm><fnm>AJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>3859</fpage><lpage>3864</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0807880106</pubid><pubid idtype="pmcid">2656170</pubid><pubid idtype="pmpid" link="fulltext">19237557</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Phylogenomics reveals a new 'megagroup' including most photosynthetic eukaryotes.</p></title><aug><au><snm>Burki</snm><fnm>F</fnm></au><au><snm>Shalchian-Tabrizi</snm><fnm>K</fnm></au><au><snm>Pawlowski</snm><fnm>J</fnm></au></aug><source>Biol Lett</source><pubdate>2008</pubdate><volume>4</volume><fpage>366</fpage><lpage>369</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rsbl.2008.0224</pubid><pubid idtype="pmcid">2610160</pubid><pubid idtype="pmpid" link="fulltext">18522922</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Critical analysis of eukaryotic phylogeny: a case study based on the HSP70 family.</p></title><aug><au><snm>Germot</snm><fnm>A</fnm></au><au><snm>Philippe</snm><fnm>H</fnm></au></aug><source>J Eukaryot Microbiol</source><pubdate>1999</pubdate><volume>46</volume><fpage>116</fpage><lpage>124</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1550-7408.1999.tb04594.x</pubid><pubid idtype="pmpid">10361733</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Animal evolution and the molecular signature of radiations compressed in time.</p></title><aug><au><snm>Rokas</snm><fnm>A</fnm></au><au><snm>Kruger</snm><fnm>D</fnm></au><au><snm>Carroll</snm><fnm>SB</fnm></au></aug><source>Science</source><pubdate>2005</pubdate><volume>310</volume><fpage>1933</fpage><lpage>1938</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1116759</pubid><pubid idtype="pmpid" link="fulltext">16373569</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>The Biological Big Bang model for the major transitions in evolution.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2007</pubdate><volume>2</volume><fpage>21</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-2-21</pubid><pubid idtype="pmcid">1973067</pubid><pubid idtype="pmpid" link="fulltext">17708768</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Rooting the eukaryote tree by using a derived gene fusion.</p></title><aug><au><snm>Stechmann</snm><fnm>A</fnm></au><au><snm>Cavalier-Smith</snm><fnm>T</fnm></au></aug><source>Science</source><pubdate>2002</pubdate><volume>297</volume><fpage>89</fpage><lpage>91</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1071196</pubid><pubid idtype="pmpid" link="fulltext">12098695</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>The root of the eukaryote tree pinpointed.</p></title><aug><au><snm>Stechmann</snm><fnm>A</fnm></au><au><snm>Cavalier-Smith</snm><fnm>T</fnm></au></aug><source>Curr Biol</source><pubdate>2003</pubdate><volume>13</volume><fpage>R665</fpage><lpage>R666</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0960-9822(03)00602-X</pubid><pubid idtype="pmpid" link="fulltext">12956967</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Myosin domain evolution and the primary divergence of eukaryotes.</p></title><aug><au><snm>Richards</snm><fnm>TA</fnm></au><au><snm>Cavalier-Smith</snm><fnm>T</fnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>436</volume><fpage>1113</fpage><lpage>1118</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature03949</pubid><pubid idtype="pmpid" link="fulltext">16121172</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Rare genomic changes as a tool for phylogenetics.</p></title><aug><au><snm>Rokas</snm><fnm>A</fnm></au><au><snm>Holland</snm><fnm>PW</fnm></au></aug><source>Trends Ecol Evol</source><pubdate>2000</pubdate><volume>15</volume><fpage>454</fpage><lpage>459</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0169-5347(00)01967-4</pubid><pubid idtype="pmpid" link="fulltext">11050348</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Analysis of rare genomic changes does not support the unikont-bikont phylogeny and suggests cyanobacterial symbiosis as the point of primary radiation of eukaryotes.</p></title><aug><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Basu</snm><fnm>MK</fnm></au><au><snm>Csuros</snm><fnm>M</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Genome Biol Evol</source><pubdate>2009</pubdate><volume>2009</volume><fpage>99</fpage><lpage>113</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2817406</pubid><pubid idtype="pmpid" link="fulltext">20333181</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Bushes in the tree of life.</p></title><aug><au><snm>Rokas</snm><fnm>A</fnm></au><au><snm>Carroll</snm><fnm>SB</fnm></au></aug><source>PLoS Biol</source><pubdate>2006</pubdate><volume>4</volume><fpage>e352</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0040352</pubid><pubid idtype="pmcid">1637082</pubid><pubid idtype="pmpid" link="fulltext">17105342</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Fedorova</snm><fnm>ND</fnm></au><au><snm>Jackson</snm><fnm>JD</fnm></au><au><snm>Jacobs</snm><fnm>AR</fnm></au><au><snm>Krylov</snm><fnm>DM</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Mazumder</snm><fnm>R</fnm></au><au><snm>Mekhedov</snm><fnm>SL</fnm></au><au><snm>Nikolskaya</snm><fnm>AN</fnm></au><au><snm>Rao</snm><fnm>BS</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Smirnov</snm><fnm>S</fnm></au><au><snm>Sorokin</snm><fnm>AV</fnm></au><au><snm>Sverdlov</snm><fnm>AV</fnm></au><au><snm>Vasudevan</snm><fnm>S</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Yin</snm><fnm>JJ</fnm></au><au><snm>Natale</snm><fnm>DA</fnm></au></aug><source>Genome Biol</source><pubdate>2004</pubdate><volume>5</volume><fpage>R7</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-5-2-r7</pubid><pubid idtype="pmcid">395751</pubid><pubid idtype="pmpid" link="fulltext">14759257</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Ancestral paralogs and pseudoparalogs and their role in the emergence of the eukaryotic cell.</p></title><aug><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Mekhedov</snm><fnm>SL</fnm></au><au><snm>Mirkin</snm><fnm>BG</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><volume>33</volume><fpage>4626</fpage><lpage>4638</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki775</pubid><pubid idtype="pmcid">1187821</pubid><pubid idtype="pmpid" link="fulltext">16106042</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Streamlining and large ancestral genomes in Archaea inferred with a phylogenetic birth-and-death model.</p></title><aug><au><snm>Csuros</snm><fnm>M</fnm></au><au><snm>Miklos</snm><fnm>I</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2009</pubdate><volume>26</volume><fpage>2087</fpage><lpage>2095</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msp123</pubid><pubid idtype="pmcid">2726834</pubid><pubid idtype="pmpid" link="fulltext">19570746</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Approaches to defining the ancestral eukaryotic protein complexome.</p></title><aug><au><snm>Ceulemans</snm><fnm>H</fnm></au><au><snm>Beke</snm><fnm>L</fnm></au><au><snm>Bollen</snm><fnm>M</fnm></au></aug><source>Bioessays</source><pubdate>2006</pubdate><volume>28</volume><fpage>316</fpage><lpage>324</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.20373</pubid><pubid idtype="pmpid" link="fulltext">16479579</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Comparative genomics and structural biology of the molecular innovations of eukaryotes.</p></title><aug><au><snm>Aravind</snm><fnm>L</fnm></au><au><snm>Iyer</snm><fnm>LM</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Curr Opin Struct Biol</source><pubdate>2006</pubdate><volume>16</volume><fpage>409</fpage><lpage>419</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.sbi.2006.04.006</pubid><pubid idtype="pmpid" link="fulltext">16679012</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>The Genome of <it>Naegleria gruberi </it>illuminates early eukaryotic versatility.</p></title><aug><au><snm>Fritz-Laylin</snm><fnm>LK</fnm></au><au><snm>Prochnik</snm><fnm>SE</fnm></au><au><snm>Ginger</snm><fnm>ML</fnm></au><au><snm>Dacks</snm><fnm>JB</fnm></au><au><snm>Carpenter</snm><fnm>ML</fnm></au><au><snm>Field</snm><fnm>MC</fnm></au><au><snm>Kuo</snm><fnm>A</fnm></au><au><snm>Paredez</snm><fnm>A</fnm></au><au><snm>Chapman</snm><fnm>J</fnm></au><au><snm>Pham</snm><fnm>J</fnm></au><au><snm>Shu</snm><fnm>S</fnm></au><au><snm>Neupane</snm><fnm>R</fnm></au><au><snm>Cipriano</snm><fnm>M</fnm></au><au><snm>Mancuso</snm><fnm>J</fnm></au><au><snm>Tu</snm><fnm>H</fnm></au><au><snm>Salamov</snm><fnm>A</fnm></au><au><snm>Lindquist</snm><fnm>E</fnm></au><au><snm>Shapiro</snm><fnm>H</fnm></au><au><snm>Lucas</snm><fnm>S</fnm></au><au><snm>Grigoriev</snm><fnm>IV</fnm></au><au><snm>Cande</snm><fnm>WZ</fnm></au><au><snm>Fulton</snm><fnm>C</fnm></au><au><snm>Rokhsar</snm><fnm>DS</fnm></au><au><snm>Dawson</snm><fnm>SC</fnm></au></aug><source>Cell</source><pubdate>2010</pubdate><volume>140</volume><fpage>631</fpage><lpage>642</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2010.01.032</pubid><pubid idtype="pmpid" link="fulltext">20211133</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>The incredible expanding ancestor of eukaryotes.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Cell</source><volume>140</volume><fpage>606</fpage><lpage>608</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2010.02.022</pubid><pubid idtype="pmpid" link="fulltext">20211127</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>The two tempos of nuclear pore complex evolution: highly adapting proteins in an ancient frozen structure.</p></title><aug><au><snm>Bapteste</snm><fnm>E</fnm></au><au><snm>Charlebois</snm><fnm>RL</fnm></au><au><snm>MacLeod</snm><fnm>D</fnm></au><au><snm>Brochier</snm><fnm>C</fnm></au></aug><source>Genome Biol</source><pubdate>2005</pubdate><volume>6</volume><fpage>R85</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2005-6-10-r85</pubid><pubid idtype="pmcid">1257468</pubid><pubid idtype="pmpid" link="fulltext">16207356</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Origins and evolution of eukaryotic RNA interference.</p></title><aug><au><snm>Shabalina</snm><fnm>SA</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Trends Ecol Evol</source><pubdate>2008</pubdate><volume>23</volume><fpage>578</fpage><lpage>587</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tree.2008.06.005</pubid><pubid idtype="pmcid">2695246</pubid><pubid idtype="pmpid" link="fulltext">18715673</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Origin and function of ubiquitin-like proteins.</p></title><aug><au><snm>Hochstrasser</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>458</volume><fpage>422</fpage><lpage>429</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07958</pubid><pubid idtype="pmcid">2819001</pubid><pubid idtype="pmpid" link="fulltext">19325621</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Three distinct modes of intron dynamics in the evolution of eukaryotes.</p></title><aug><au><snm>Carmel</snm><fnm>L</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><fpage>1034</fpage><lpage>1044</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.6438607</pubid><pubid idtype="pmcid">1899114</pubid><pubid idtype="pmpid" link="fulltext">17495008</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Extremely intron-rich genes in the alveolate ancestors inferred with a flexible maximum-likelihood approach.</p></title><aug><au><snm>Csuros</snm><fnm>M</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2008</pubdate><volume>25</volume><fpage>903</fpage><lpage>911</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msn039</pubid><pubid idtype="pmpid" link="fulltext">18296415</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>Intron-rich ancestors.</p></title><aug><au><snm>Roy</snm><fnm>SW</fnm></au></aug><source>Trends Genet</source><pubdate>2006</pubdate><volume>22</volume><fpage>468</fpage><lpage>471</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2006.07.002</pubid><pubid idtype="pmpid" link="fulltext">16857287</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>The evolution of spliceosomal introns: patterns, puzzles and progress.</p></title><aug><au><snm>Roy</snm><fnm>SW</fnm></au><au><snm>Gilbert</snm><fnm>W</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><fpage>211</fpage><lpage>221</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">16485020</pubid></xrefbib></bibl><bibl id="B76"><title><p>The puzzle of plastid evolution.</p></title><aug><au><snm>Archibald</snm><fnm>JM</fnm></au></aug><source>Curr Biol</source><pubdate>2009</pubdate><volume>19</volume><fpage>R81</fpage><lpage>R88</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cub.2008.11.067</pubid><pubid idtype="pmpid" link="fulltext">19174147</pubid></pubidlist></xrefbib></bibl><bibl id="B77"><title><p>Sizing up the genomic footprint of endosymbiosis.</p></title><aug><au><snm>Elias</snm><fnm>M</fnm></au><au><snm>Archibald</snm><fnm>JM</fnm></au></aug><source>Bioessays</source><pubdate>2009</pubdate><volume>31</volume><fpage>1273</fpage><lpage>1279</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.200900117</pubid><pubid idtype="pmpid" link="fulltext">19921698</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes.</p></title><aug><au><snm>Esser</snm><fnm>C</fnm></au><au><snm>Ahmadinejad</snm><fnm>N</fnm></au><au><snm>Wiegand</snm><fnm>C</fnm></au><au><snm>Rotte</snm><fnm>C</fnm></au><au><snm>Sebastiani</snm><fnm>F</fnm></au><au><snm>Gelius-Dietrich</snm><fnm>G</fnm></au><au><snm>Henze</snm><fnm>K</fnm></au><au><snm>Kretschmann</snm><fnm>E</fnm></au><au><snm>Richly</snm><fnm>E</fnm></au><au><snm>Leister</snm><fnm>D</fnm></au><au><snm>Bryant</snm><fnm>D</fnm></au><au><snm>Steel</snm><fnm>MA</fnm></au><au><snm>Lockhart</snm><fnm>PJ</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au><au><snm>Martin</snm><fnm>W</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2004</pubdate><volume>21</volume><fpage>1643</fpage><lpage>1660</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh160</pubid><pubid idtype="pmpid" link="fulltext">15155797</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>The ring of life provides evidence for a genome fusion origin of eukaryotes.</p></title><aug><au><snm>Rivera</snm><fnm>MC</fnm></au><au><snm>Lake</snm><fnm>JA</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>431</volume><fpage>152</fpage><lpage>155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02848</pubid><pubid idtype="pmpid" link="fulltext">15356622</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>Evolutionary biology: early evolution comes full circle.</p></title><aug><au><snm>Martin</snm><fnm>W</fnm></au><au><snm>Embley</snm><fnm>TM</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>431</volume><fpage>134</fpage><lpage>137</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/431134a</pubid><pubid idtype="pmpid" link="fulltext">15356611</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Mitochondrial evolution.</p></title><aug><au><snm>Gray</snm><fnm>MW</fnm></au><au><snm>Burger</snm><fnm>G</fnm></au><au><snm>Lang</snm><fnm>BF</fnm></au></aug><source>Science</source><pubdate>1999</pubdate><volume>283</volume><fpage>1476</fpage><lpage>1481</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.283.5407.1476</pubid><pubid idtype="pmpid" link="fulltext">10066161</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>The origin of mitochondria in light of a fluid prokaryotic chromosome model.</p></title><aug><au><snm>Esser</snm><fnm>C</fnm></au><au><snm>Martin</snm><fnm>W</fnm></au><au><snm>Dagan</snm><fnm>T</fnm></au></aug><source>Biol Lett</source><pubdate>2007</pubdate><volume>3</volume><fpage>180</fpage><lpage>184</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rsbl.2006.0582</pubid><pubid idtype="pmcid">2375920</pubid><pubid idtype="pmpid" link="fulltext">17251118</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>You are what you eat: a gene transfer ratchet could account for bacterial genes in eukaryotic nuclear genomes.</p></title><aug><au><snm>Doolittle</snm><fnm>WF</fnm></au></aug><source>Trends Genet</source><pubdate>1998</pubdate><volume>14</volume><fpage>307</fpage><lpage>311</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0168-9525(98)01494-2</pubid><pubid idtype="pmpid" link="fulltext">9724962</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>The genome of <it>M. acetivorans </it>reveals extensive metabolic and physiological diversity.</p></title><aug><au><snm>Galagan</snm><fnm>JE</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Roy</snm><fnm>A</fnm></au><au><snm>Endrizzi</snm><fnm>MG</fnm></au><au><snm>Macdonald</snm><fnm>P</fnm></au><au><snm>FitzHugh</snm><fnm>W</fnm></au><au><snm>Calvo</snm><fnm>S</fnm></au><au><snm>Engels</snm><fnm>R</fnm></au><au><snm>Smirnov</snm><fnm>S</fnm></au><au><snm>Atnoor</snm><fnm>D</fnm></au><au><snm>Brown</snm><fnm>A</fnm></au><au><snm>Allen</snm><fnm>N</fnm></au><au><snm>Naylor</snm><fnm>J</fnm></au><au><snm>Stange-Thomann</snm><fnm>N</fnm></au><au><snm>DeArellano</snm><fnm>K</fnm></au><au><snm>Johnson</snm><fnm>R</fnm></au><au><snm>Linton</snm><fnm>L</fnm></au><au><snm>McEwan</snm><fnm>P</fnm></au><au><snm>McKernan</snm><fnm>K</fnm></au><au><snm>Talamas</snm><fnm>J</fnm></au><au><snm>Tirrell</snm><fnm>A</fnm></au><au><snm>Ye</snm><fnm>W</fnm></au><au><snm>Zimmer</snm><fnm>A</fnm></au><au><snm>Barber</snm><fnm>RD</fnm></au><au><snm>Cann</snm><fnm>I</fnm></au><au><snm>Graham</snm><fnm>DE</fnm></au><au><snm>Grahame</snm><fnm>DA</fnm></au><au><snm>Guss</snm><fnm>AM</fnm></au><au><snm>Hedderich</snm><fnm>R</fnm></au><au><snm>Ingram-Smith</snm><fnm>C</fnm></au><etal/></aug><source>Genome Res</source><pubdate>2002</pubdate><volume>12</volume><fpage>532</fpage><lpage>542</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.223902</pubid><pubid idtype="pmcid">187521</pubid><pubid idtype="pmpid" link="fulltext">11932238</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>Horizontal gene transfer: the path to maturity.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Mol Microbiol</source><pubdate>2003</pubdate><volume>50</volume><fpage>725</fpage><lpage>727</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2958.2003.03808.x</pubid><pubid idtype="pmpid" link="fulltext">14617135</pubid></pubidlist></xrefbib></bibl><bibl id="B86"><title><p>The archaebacterial origin of eukaryotes.</p></title><aug><au><snm>Cox</snm><fnm>CJ</fnm></au><au><snm>Foster</snm><fnm>PG</fnm></au><au><snm>Hirt</snm><fnm>RP</fnm></au><au><snm>Harris</snm><fnm>SR</fnm></au><au><snm>Embley</snm><fnm>TM</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>20356</fpage><lpage>20361</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0810647105</pubid><pubid idtype="pmcid">2629343</pubid><pubid idtype="pmpid" link="fulltext">19073919</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods.</p></title><aug><au><snm>Foster</snm><fnm>PG</fnm></au><au><snm>Cox</snm><fnm>CJ</fnm></au><au><snm>Embley</snm><fnm>TM</fnm></au></aug><source>Philos Trans R Soc Lond B Biol Sci</source><pubdate>2009</pubdate><volume>364</volume><fpage>2197</fpage><lpage>2207</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rstb.2009.0034</pubid><pubid idtype="pmpid" link="fulltext">19571240</pubid></pubidlist></xrefbib></bibl><bibl id="B88"><title><p>Supertrees disentangle the chimerical origin of eukaryotic genomes.</p></title><aug><au><snm>Pisani</snm><fnm>D</fnm></au><au><snm>Cotton</snm><fnm>JA</fnm></au><au><snm>McInerney</snm><fnm>JO</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>1752</fpage><lpage>1760</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm095</pubid><pubid idtype="pmpid" link="fulltext">17504772</pubid></pubidlist></xrefbib></bibl><bibl id="B89"><title><p>The deep archaeal roots of eukaryotes.</p></title><aug><au><snm>Yutin</snm><fnm>N</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Mekhedov</snm><fnm>SL</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2008</pubdate><volume>25</volume><fpage>1619</fpage><lpage>1630</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msn108</pubid><pubid idtype="pmcid">2464739</pubid><pubid idtype="pmpid" link="fulltext">18463089</pubid></pubidlist></xrefbib></bibl><bibl id="B90"><title><p>Comprehensive analysis of the origin of eukaryotic genomes.</p></title><aug><au><snm>Saruhashi</snm><fnm>S</fnm></au><au><snm>Hamada</snm><fnm>K</fnm></au><au><snm>Miyata</snm><fnm>D</fnm></au><au><snm>Horiike</snm><fnm>T</fnm></au><au><snm>Shinozawa</snm><fnm>T</fnm></au></aug><source>Genes Genet Syst</source><pubdate>2008</pubdate><volume>83</volume><fpage>285</fpage><lpage>291</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1266/ggs.83.285</pubid><pubid idtype="pmpid" link="fulltext">18931454</pubid></pubidlist></xrefbib></bibl><bibl id="B91"><title><p>Ribosomal protein-sequence block structure suggests complex prokaryotic evolution with implications for the origin of eukaryotes.</p></title><aug><au><snm>Vishwanath</snm><fnm>P</fnm></au><au><snm>Favaretto</snm><fnm>P</fnm></au><au><snm>Hartman</snm><fnm>H</fnm></au><au><snm>Mohr</snm><fnm>SC</fnm></au><au><snm>Smith</snm><fnm>TF</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2004</pubdate><volume>33</volume><fpage>615</fpage><lpage>625</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2004.07.003</pubid><pubid idtype="pmpid" link="fulltext">15522791</pubid></pubidlist></xrefbib></bibl><bibl id="B92"><title><p>Evolution of complex RNA polymerases: the complete archaeal RNA polymerase structure.</p></title><aug><au><snm>Korkhin</snm><fnm>Y</fnm></au><au><snm>Unligil</snm><fnm>UM</fnm></au><au><snm>Littlefield</snm><fnm>O</fnm></au><au><snm>Nelson</snm><fnm>PJ</fnm></au><au><snm>Stuart</snm><fnm>DI</fnm></au><au><snm>Sigler</snm><fnm>PB</fnm></au><au><snm>Bell</snm><fnm>SD</fnm></au><au><snm>Abrescia</snm><fnm>NG</fnm></au></aug><source>PLoS Biol</source><pubdate>2009</pubdate><volume>7</volume><fpage>e102</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.1000102</pubid><pubid idtype="pmcid">2675907</pubid><pubid idtype="pmpid" link="fulltext">19419240</pubid></pubidlist></xrefbib></bibl><bibl id="B93"><title><p>Identification of an ortholog of the eukaryotic RNA polymerase III subunit RPC34 in Crenarchaeota and Thaumarchaeota suggests specialization of RNA polymerases for coding and non-coding RNAs in Archaea.</p></title><aug><au><snm>Blombach</snm><fnm>F</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Marrero</snm><fnm>J</fnm></au><au><snm>Siebers</snm><fnm>B</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Oost</snm><mnm>van der</mnm><fnm>J</fnm></au></aug><source>Biol Direct</source><pubdate>2009</pubdate><volume>4</volume><fpage>39</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-4-39</pubid><pubid idtype="pmcid">2770514</pubid><pubid idtype="pmpid" link="fulltext">19828044</pubid></pubidlist></xrefbib></bibl><bibl id="B94"><title><p>Identification of a crenarchaeal orthologue of Elf1: implications for chromatin and transcription in Archaea.</p></title><aug><au><snm>Daniels</snm><fnm>JP</fnm></au><au><snm>Kelly</snm><fnm>S</fnm></au><au><snm>Wickstead</snm><fnm>B</fnm></au><au><snm>Gull</snm><fnm>K</fnm></au></aug><source>Biol Direct</source><pubdate>2009</pubdate><volume>4</volume><fpage>24</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-4-24</pubid><pubid idtype="pmcid">2732611</pubid><pubid idtype="pmpid" link="fulltext">19640276</pubid></pubidlist></xrefbib></bibl><bibl id="B95"><title><p>Archaeal histones: structures, stability and DNA binding.</p></title><aug><au><snm>Reeve</snm><fnm>JN</fnm></au><au><snm>Bailey</snm><fnm>KA</fnm></au><au><snm>Li</snm><fnm>WT</fnm></au><au><snm>Marc</snm><fnm>F</fnm></au><au><snm>Sandman</snm><fnm>K</fnm></au><au><snm>Soares</snm><fnm>DJ</fnm></au></aug><source>Biochem Soc Trans</source><pubdate>2004</pubdate><volume>32</volume><fpage>227</fpage><lpage>230</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1042/BST0320227</pubid><pubid idtype="pmpid" link="fulltext">15046577</pubid></pubidlist></xrefbib></bibl><bibl id="B96"><title><p>Molecular evolution of FtsZ protein sequences encoded within the genomes of archaea, bacteria, and eukaryota.</p></title><aug><au><snm>Vaughan</snm><fnm>S</fnm></au><au><snm>Wickstead</snm><fnm>B</fnm></au><au><snm>Gull</snm><fnm>K</fnm></au><au><snm>Addinall</snm><fnm>SG</fnm></au></aug><source>J Mol Evol</source><pubdate>2004</pubdate><volume>58</volume><fpage>19</fpage><lpage>29</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-003-2523-5</pubid><pubid idtype="pmpid" link="fulltext">14743312</pubid></pubidlist></xrefbib></bibl><bibl id="B97"><title><p>Evolution of cytomotive filaments: the cytoskeleton from prokaryotes to eukaryotes.</p></title><aug><au><snm>Lowe</snm><fnm>J</fnm></au><au><snm>Amos</snm><fnm>LA</fnm></au></aug><source>Int J Biochem Cell Biol</source><pubdate>2009</pubdate><volume>41</volume><fpage>323</fpage><lpage>329</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.biocel.2008.08.010</pubid><pubid idtype="pmpid" link="fulltext">18768164</pubid></pubidlist></xrefbib></bibl><bibl id="B98"><title><p>A unique cell division machinery in the Archaea.</p></title><aug><au><snm>Lindas</snm><fnm>AC</fnm></au><au><snm>Karlsson</snm><fnm>EA</fnm></au><au><snm>Lindgren</snm><fnm>MT</fnm></au><au><snm>Ettema</snm><fnm>TJ</fnm></au><au><snm>Bernander</snm><fnm>R</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>18942</fpage><lpage>18946</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0809467105</pubid><pubid idtype="pmcid">2596248</pubid><pubid idtype="pmpid" link="fulltext">18987308</pubid></pubidlist></xrefbib></bibl><bibl id="B99"><title><p>Evolution of DNA polymerase families: evidences for multiple gene exchange between cellular and viral proteins.</p></title><aug><au><snm>Filee</snm><fnm>J</fnm></au><au><snm>Forterre</snm><fnm>P</fnm></au><au><snm>Sen-Lin</snm><fnm>T</fnm></au><au><snm>Laurent</snm><fnm>J</fnm></au></aug><source>J Mol Evol</source><pubdate>2002</pubdate><volume>54</volume><fpage>763</fpage><lpage>773</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-001-0078-x</pubid><pubid idtype="pmpid" link="fulltext">12029358</pubid></pubidlist></xrefbib></bibl><bibl id="B100"><title><p>Evolution of DNA polymerases: an inactivated polymerase-exonuclease module in Pol epsilon and a chimeric origin of eukaryotic polymerases from two classes of archaeal ancestors.</p></title><aug><au><snm>Tahirov</snm><fnm>TH</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Pavlov</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2009</pubdate><volume>4</volume><fpage>11</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-4-11</pubid><pubid idtype="pmcid">2669801</pubid><pubid idtype="pmpid" link="fulltext">19296856</pubid></pubidlist></xrefbib></bibl><bibl id="B101"><title><p>Phosphoesterase domains associated with DNA polymerases of diverse origins.</p></title><aug><au><snm>Aravind</snm><fnm>L</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1998</pubdate><volume>26</volume><fpage>3746</fpage><lpage>3752</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/26.16.3746</pubid><pubid idtype="pmcid">147763</pubid><pubid idtype="pmpid" link="fulltext">9685491</pubid></pubidlist></xrefbib></bibl><bibl id="B102"><title><p>Prokaryotic homologs of Argonaute proteins are predicted to function as key components of a novel system of defense against mobile genetic elements.</p></title><aug><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Oost</snm><mnm>van der</mnm><fnm>J</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2009</pubdate><volume>4</volume><fpage>29</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-4-29</pubid><pubid idtype="pmcid">2743648</pubid><pubid idtype="pmpid" link="fulltext">19706170</pubid></pubidlist></xrefbib></bibl><bibl id="B103"><title><p>The evolution of eukaryotes.</p></title><aug><au><snm>Martin</snm><fnm>W</fnm></au><au><snm>Dagan</snm><fnm>T</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Dipippo</snm><fnm>JL</fnm></au><au><snm>Gogarten</snm><fnm>JP</fnm></au><au><snm>Lake</snm><fnm>JA</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>316</volume><fpage>542</fpage><lpage>543</lpage><note>author reply 542-543</note><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.316.5824.542c</pubid><pubid idtype="pmpid" link="fulltext">17463271</pubid></pubidlist></xrefbib></bibl><bibl id="B104"><title><p>Eukaryote evolution: engulfed by speculation.</p></title><aug><au><snm>Poole</snm><fnm>A</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>447</volume><fpage>913</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/447913a</pubid><pubid idtype="pmpid" link="fulltext">17581566</pubid></pubidlist></xrefbib></bibl><bibl id="B105"><title><p>The origins of phagocytosis and eukaryogenesis.</p></title><aug><au><snm>Yutin</snm><fnm>N</fnm></au><au><snm>Wolf</snm><fnm>MY</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2009</pubdate><volume>4</volume><fpage>9</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-4-9</pubid><pubid idtype="pmcid">2651865</pubid><pubid idtype="pmpid" link="fulltext">19245710</pubid></pubidlist></xrefbib></bibl><bibl id="B106"><title><p>Introns and the origin of nucleus-cytosol compartmentation.</p></title><aug><au><snm>Martin</snm><fnm>W</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>440</volume><fpage>41</fpage><lpage>45</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04531</pubid><pubid idtype="pmpid" link="fulltext">16511485</pubid></pubidlist></xrefbib></bibl><bibl id="B107"><title><p>A positive definition of prokaryotes.</p></title><aug><au><snm>Martin</snm><fnm>W</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>442</volume><fpage>868</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/442868c</pubid><pubid idtype="pmpid" link="fulltext">16929275</pubid></pubidlist></xrefbib></bibl><bibl id="B108"><title><p>Transcription and translation are coupled in Archaea.</p></title><aug><au><snm>French</snm><fnm>SL</fnm></au><au><snm>Santangelo</snm><fnm>TJ</fnm></au><au><snm>Beyer</snm><fnm>AL</fnm></au><au><snm>Reeve</snm><fnm>JN</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>893</fpage><lpage>895</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm007</pubid><pubid idtype="pmpid" link="fulltext">17237472</pubid></pubidlist></xrefbib></bibl><bibl id="B109"><title><p>The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2006</pubdate><volume>1</volume><fpage>22</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-1-22</pubid><pubid idtype="pmcid">1570339</pubid><pubid idtype="pmpid" link="fulltext">16907971</pubid></pubidlist></xrefbib></bibl><bibl id="B110"><title><p>Proteasomes in the archaea: from structure to function.</p></title><aug><au><snm>Maupin-Furlow</snm><fnm>JA</fnm></au><au><snm>Wilson</snm><fnm>HL</fnm></au><au><snm>Kaczowka</snm><fnm>SJ</fnm></au><au><snm>Ou</snm><fnm>MS</fnm></au></aug><source>Front Biosci</source><pubdate>2000</pubdate><volume>5</volume><fpage>D837</fpage><lpage>D865</lpage><xrefbib><pubidlist><pubid idtype="doi">10.2741/furlow</pubid><pubid idtype="pmpid">10966872</pubid></pubidlist></xrefbib></bibl><bibl id="B111"><title><p>Lessons from structural and biochemical studies on the archaeal exosome.</p></title><aug><au><snm>Hartung</snm><fnm>S</fnm></au><au><snm>Hopfner</snm><fnm>KP</fnm></au></aug><source>Biochem Soc Trans</source><pubdate>2009</pubdate><volume>37</volume><fpage>83</fpage><lpage>87</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1042/BST0370083</pubid><pubid idtype="pmpid" link="fulltext">19143607</pubid></pubidlist></xrefbib></bibl><bibl id="B112"><title><p>The oligomerization and ligand-binding properties of Sm-like archaeal proteins (SmAPs).</p></title><aug><au><snm>Mura</snm><fnm>C</fnm></au><au><snm>Kozhukhovsky</snm><fnm>A</fnm></au><au><snm>Gingery</snm><fnm>M</fnm></au><au><snm>Phillips</snm><fnm>M</fnm></au><au><snm>Eisenberg</snm><fnm>D</fnm></au></aug><source>Protein Sci</source><pubdate>2003</pubdate><volume>12</volume><fpage>832</fpage><lpage>847</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1110/ps.0224703</pubid><pubid idtype="pmcid">2323858</pubid><pubid idtype="pmpid" link="fulltext">12649441</pubid></pubidlist></xrefbib></bibl><bibl id="B113"><title><p>Ancient ESCRTs and the evolution of binary fission.</p></title><aug><au><snm>Samson</snm><fnm>RY</fnm></au><au><snm>Bell</snm><fnm>SD</fnm></au></aug><source>Trends Microbiol</source><pubdate>2009</pubdate><volume>17</volume><fpage>507</fpage><lpage>513</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tim.2009.08.003</pubid><pubid idtype="pmpid" link="fulltext">19783442</pubid></pubidlist></xrefbib></bibl><bibl id="B114"><title><p>Cell division and the ESCRT complex: a surprise from the archaea.</p></title><aug><au><snm>Ettema</snm><fnm>TJ</fnm></au><au><snm>Bernander</snm><fnm>R</fnm></au></aug><source>Commun Integr Biol</source><pubdate>2009</pubdate><volume>2</volume><fpage>86</fpage><lpage>88</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2686351</pubid><pubid idtype="pmpid">19704896</pubid></pubidlist></xrefbib></bibl><bibl id="B115"><title><p>Ubiquitin-like small archaeal modifier proteins (SAMPs) in <it>Haloferax volcanii </it>.</p></title><aug><au><snm>Humbard</snm><fnm>MA</fnm></au><au><snm>Miranda</snm><fnm>HV</fnm></au><au><snm>Lim</snm><fnm>JM</fnm></au><au><snm>Krause</snm><fnm>DJ</fnm></au><au><snm>Pritz</snm><fnm>JR</fnm></au><au><snm>Zhou</snm><fnm>G</fnm></au><au><snm>Chen</snm><fnm>S</fnm></au><au><snm>Wells</snm><fnm>L</fnm></au><au><snm>Maupin-Furlow</snm><fnm>JA</fnm></au></aug><source>Nature</source><volume>463</volume><fpage>54</fpage><lpage>60</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08659</pubid><pubid idtype="pmpid" link="fulltext">20054389</pubid></pubidlist></xrefbib></bibl><bibl id="B116"><title><p>Mealybug beta-proteobacterial endosymbionts contain gamma-proteobacterial symbionts.</p></title><aug><au><snm>von Dohlen</snm><fnm>CD</fnm></au><au><snm>Kohler</snm><fnm>S</fnm></au><au><snm>Alsop</snm><fnm>ST</fnm></au><au><snm>McManus</snm><fnm>WR</fnm></au></aug><source>Nature</source><pubdate>2001</pubdate><volume>412</volume><fpage>433</fpage><lpage>436</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35086563</pubid><pubid idtype="pmpid" link="fulltext">11473316</pubid></pubidlist></xrefbib></bibl><bibl id="B117"><title><p>Secondary (gamma-Proteobacteria) endosymbionts infect the primary (beta-Proteobacteria) endosymbionts of mealybugs multiple times and coevolve with their hosts.</p></title><aug><au><snm>Thao</snm><fnm>ML</fnm></au><au><snm>Gullan</snm><fnm>PJ</fnm></au><au><snm>Baumann</snm><fnm>P</fnm></au></aug><source>Appl Environ Microbiol</source><pubdate>2002</pubdate><volume>68</volume><fpage>3190</fpage><lpage>3197</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/AEM.68.7.3190-3197.2002</pubid><pubid idtype="pmcid">126777</pubid><pubid idtype="pmpid" link="fulltext">12088994</pubid></pubidlist></xrefbib></bibl><bibl id="B118"><title><p>Interspecific evolution: microbial symbiosis, endosymbiosis and gene transfer.</p></title><aug><au><snm>Hoffmeister</snm><fnm>M</fnm></au><au><snm>Martin</snm><fnm>W</fnm></au></aug><source>Environ Microbiol</source><pubdate>2003</pubdate><volume>5</volume><fpage>641</fpage><lpage>649</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1462-2920.2003.00454.x</pubid><pubid idtype="pmpid" link="fulltext">12871231</pubid></pubidlist></xrefbib></bibl><bibl id="B119"><title><p>Predation between prokaryotes and the origin of eukaryotes.</p></title><aug><au><snm>Davidov</snm><fnm>Y</fnm></au><au><snm>Jurkevitch</snm><fnm>E</fnm></au></aug><source>Bioessays</source><pubdate>2009</pubdate><volume>31</volume><fpage>748</fpage><lpage>757</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.200900018</pubid><pubid idtype="pmpid" link="fulltext">19492355</pubid></pubidlist></xrefbib></bibl><bibl id="B120"><title><p>The origin of the eukaryotic cell: a genomic investigation.</p></title><aug><au><snm>Hartman</snm><fnm>H</fnm></au><au><snm>Fedorov</snm><fnm>A</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2002</pubdate><volume>99</volume><fpage>1420</fpage><lpage>1425</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.032658599</pubid><pubid idtype="pmcid">122206</pubid><pubid idtype="pmpid" link="fulltext">11805300</pubid></pubidlist></xrefbib></bibl><bibl id="B121"><title><p>Structure and function of the GINS complex, a key component of the eukaryotic replisome.</p></title><aug><au><snm>MacNeill</snm><fnm>SA</fnm></au></aug><source>Biochem J</source><volume>425</volume><fpage>489</fpage><lpage>500</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1042/BJ20091531</pubid><pubid idtype="pmpid" link="fulltext">20070258</pubid></pubidlist></xrefbib></bibl><bibl id="B122"><title><p>The prokaryotic V4R domain is the likely ancestor of a key component of the eukaryotic vesicle transport system.</p></title><aug><au><snm>Podar</snm><fnm>M</fnm></au><au><snm>Wall</snm><fnm>MA</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Biol Direct</source><pubdate>2008</pubdate><volume>3</volume><fpage>2</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-3-2</pubid><pubid idtype="pmcid">2253512</pubid><pubid idtype="pmpid" link="fulltext">18221539</pubid></pubidlist></xrefbib></bibl><bibl id="B123"><title><p>Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><fpage>6688</fpage><lpage>6719</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn668</pubid><pubid idtype="pmcid">2588523</pubid><pubid idtype="pmpid" link="fulltext">18948295</pubid></pubidlist></xrefbib></bibl><bibl id="B124"><title><p>Estimating the size of the bacterial pan-genome.</p></title><aug><au><snm>Lapierre</snm><fnm>P</fnm></au><au><snm>Gogarten</snm><fnm>JP</fnm></au></aug><source>Trends Genet</source><pubdate>2009</pubdate><volume>25</volume><fpage>107</fpage><lpage>110</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2008.12.004</pubid><pubid idtype="pmpid" link="fulltext">19168257</pubid></pubidlist></xrefbib></bibl><bibl id="B125"><title><p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au><au><snm>Schaffer</snm><fnm>AA</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><fpage>3389</fpage><lpage>3402</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.17.3389</pubid><pubid idtype="pmcid">146917</pubid><pubid idtype="pmpid" link="fulltext">9254694</pubid></pubidlist></xrefbib></bibl><bibl id="B126"><title><p>The closest BLAST hit is often not the nearest neighbor.</p></title><aug><au><snm>Koski</snm><fnm>LB</fnm></au><au><snm>Golding</snm><fnm>GB</fnm></au></aug><source>J Mol Evol</source><pubdate>2001</pubdate><volume>52</volume><fpage>540</fpage><lpage>542</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11443357</pubid></xrefbib></bibl><bibl id="B127"><title><p>A korarchaeal genome reveals insights into the evolution of the Archaea.</p></title><aug><au><snm>Elkins</snm><fnm>JG</fnm></au><au><snm>Podar</snm><fnm>M</fnm></au><au><snm>Graham</snm><fnm>DE</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Wolf</snm><fnm>Y</fnm></au><au><snm>Randau</snm><fnm>L</fnm></au><au><snm>Hedlund</snm><fnm>BP</fnm></au><au><snm>Brochier-Armanet</snm><fnm>C</fnm></au><au><snm>Kunin</snm><fnm>V</fnm></au><au><snm>Anderson</snm><fnm>I</fnm></au><au><snm>Lapidus</snm><fnm>A</fnm></au><au><snm>Goltsman</snm><fnm>E</fnm></au><au><snm>Barry</snm><fnm>K</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Hugenholtz</snm><fnm>P</fnm></au><au><snm>Kyrpides</snm><fnm>N</fnm></au><au><snm>Wanner</snm><fnm>G</fnm></au><au><snm>Richardson</snm><fnm>P</fnm></au><au><snm>Keller</snm><fnm>M</fnm></au><au><snm>Stetter</snm><fnm>KO</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>8102</fpage><lpage>8107</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0801980105</pubid><pubid idtype="pmcid">2430366</pubid><pubid idtype="pmpid" link="fulltext">18535141</pubid></pubidlist></xrefbib></bibl><bibl id="B128"><title><p>Conserved domains in DNA repair proteins and evolution of repair systems.</p></title><aug><au><snm>Aravind</snm><fnm>L</fnm></au><au><snm>Walker</snm><fnm>DR</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1999</pubdate><volume>27</volume><fpage>1223</fpage><lpage>1242</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/27.5.1223</pubid><pubid idtype="pmcid">148307</pubid><pubid idtype="pmpid" link="fulltext">9973609</pubid></pubidlist></xrefbib></bibl><bibl id="B129"><title><p>Orthologs of the small RPB8 subunit of the eukaryotic RNA polymerases are conserved in hyperthermophilic Crenarchaeota and "Korarchaeota".</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Makarova</snm><fnm>KS</fnm></au><au><snm>Elkins</snm><fnm>JG</fnm></au></aug><source>Biol Direct</source><pubdate>2007</pubdate><volume>2</volume><fpage>38</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1745-6150-2-38</pubid><pubid idtype="pmcid">2234397</pubid><pubid idtype="pmpid" link="fulltext">18081935</pubid></pubidlist></xrefbib></bibl><bibl id="B130"><title><p>Comparative genomics and evolution of proteins involved in RNA metabolism.</p></title><aug><au><snm>Anantharaman</snm><fnm>V</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Aravind</snm><fnm>L</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2002</pubdate><volume>30</volume><fpage>1427</fpage><lpage>1464</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/30.7.1427</pubid><pubid idtype="pmcid">101826</pubid><pubid idtype="pmpid" link="fulltext">11917006</pubid></pubidlist></xrefbib></bibl><bibl id="B131"><title><p>Reconstructing the ubiquitin network: cross-talk with other systems and identification of novel functions.</p></title><aug><au><snm>Venancio</snm><fnm>TM</fnm></au><au><snm>Balaji</snm><fnm>S</fnm></au><au><snm>Iyer</snm><fnm>LM</fnm></au><au><snm>Aravind</snm><fnm>L</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>R33</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-3-r33</pubid><pubid idtype="pmcid">2691004</pubid><pubid idtype="pmpid" link="fulltext">19331687</pubid></pubidlist></xrefbib></bibl><bibl id="B132"><title><p>Prediction of the archaeal exosome and its connections with the proteasome and the translation and transcription machineries by a comparative-genomic approach.</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Aravind</snm><fnm>L</fnm></au></aug><source>Genome Res</source><pubdate>2001</pubdate><volume>11</volume><fpage>240</fpage><lpage>252</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.162001</pubid><pubid idtype="pmcid">311015</pubid><pubid idtype="pmpid" link="fulltext">11157787</pubid></pubidlist></xrefbib></bibl><bibl id="B133"><title><p>Comparative genomics of transcription factors and chromatin proteins in parasitic protists and other eukaryotes.</p></title><aug><au><snm>Iyer</snm><fnm>LM</fnm></au><au><snm>Anantharaman</snm><fnm>V</fnm></au><au><snm>Wolf</snm><fnm>MY</fnm></au><au><snm>Aravind</snm><fnm>L</fnm></au></aug><source>Int J Parasitol</source><pubdate>2008</pubdate><volume>38</volume><fpage>1</fpage><lpage>31</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ijpara.2007.07.018</pubid><pubid idtype="pmpid" link="fulltext">17949725</pubid></pubidlist></xrefbib></bibl><bibl id="B134"><title><p>The two faces of Alba: the evolutionary connection between proteins participating in chromatin structure and RNA metabolism.</p></title><aug><au><snm>Aravind</snm><fnm>L</fnm></au><au><snm>Iyer</snm><fnm>LM</fnm></au><au><snm>Anantharaman</snm><fnm>V</fnm></au></aug><source>Genome Biol</source><pubdate>2003</pubdate><volume>4</volume><fpage>R64</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2003-4-10-r64</pubid><pubid idtype="pmcid">328453</pubid><pubid idtype="pmpid" link="fulltext">14519199</pubid></pubidlist></xrefbib></bibl><bibl id="B135"><title><p>The origin and early evolution of mitochondria.</p></title><aug><au><snm>Gray</snm><fnm>MW</fnm></au><au><snm>Burger</snm><fnm>G</fnm></au><au><snm>Lang</snm><fnm>BF</fnm></au></aug><source>Genome Biol</source><pubdate>2001</pubdate><volume>2</volume><fpage>reviews1018</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2001-2-6-reviews1018</pubid><pubid idtype="pmcid">138944</pubid><pubid idtype="pmpid" link="fulltext">11423013</pubid></pubidlist></xrefbib></bibl></refgrp>
   </bm>
</art>