<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-11-r242</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Quantification of ortholog losses in insects and vertebrates</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Wyder</snm>
               <fnm>Stefan</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>stefan.wyder@medecine.unige.ch</email>
            </au>
            <au id="A2">
               <snm>Kriventseva</snm>
               <mi>V</mi>
               <fnm>Evgenia</fnm>
               <insr iid="I3"/>
               <insr iid="I2"/>
               <email>evgenia.kriventseva@isb-sib.ch</email>
            </au>
            <au id="A3">
               <snm>Schr&#246;der</snm>
               <fnm>Reinhard</fnm>
               <insr iid="I4"/>
               <email>reinhard.schroeder@uni-tuebingen.de</email>
            </au>
            <au id="A4">
               <snm>Kadowaki</snm>
               <fnm>Tatsuhiko</fnm>
               <insr iid="I5"/>
               <email>emi@nuagr1.agr.nagoya-u.ac.jp</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Zdobnov</snm>
               <mi>M</mi>
               <fnm>Evgeny</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I6"/>
               <email>evgeny.zdobnov@medecine.unige.ch</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland</p>
            </ins>
            <ins id="I2">
               <p>Swiss Institute of Bioinformatics, rue Michel-Servet, 1211 Geneva, Switzerland</p>
            </ins>
            <ins id="I3">
               <p>Department of Structural Biology and Bioinformatics, University of Geneva Medical School, rue Michel-Servet, 1211 Geneva, Switzerland</p>
            </ins>
            <ins id="I4">
               <p>Interf. Institut f&#252;r Zellbiologie, Abt. Genetik der Tiere, Universit&#228;t T&#252;bingen, 72076 T&#252;bingen, Germany</p>
            </ins>
            <ins id="I5">
               <p>Graduate School of Bioagricultural Sciences, Nagoya University, Chikusa, Nagoya 464-8601, Japan</p>
            </ins>
            <ins id="I6">
               <p>Imperial College London, South Kensington Campus, London SW7 2AZ, UK</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>11</issue>
         <fpage>R242</fpage>
         <url>http://genomebiology.com/2007/8/11/R242</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18021399</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-11-r242</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>30</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>4</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>16</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Wyder et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Ortholog losses in vertebrates and insects</p>
      </shorttitle>
      <shortabs>
         <p>Comparison of the gene repertoires of 5 vertebrates and 5 insects showed that the rate of losses correlates well with the species' rates of molecular evolution and radiation times and suggests that the Urbilateria genome contained more than 7,000 genes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The increasing number of sequenced insect and vertebrate genomes of variable divergence enables refined comparative analyses to quantify the major modes of animal genome evolution and allows tracing of gene genealogy (orthology) and pinpointing of gene extinctions (losses), which can reveal lineage-specific traits.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>To consistently quantify losses of orthologous groups of genes, we compared the gene repertoires of five vertebrates and five insects, including honeybee and <it>Tribolium </it>beetle, that represent insect orders outside the previously sequenced Diptera. We found hundreds of lost Urbilateria genes in each of the lineages and assessed their phylogenetic origin. The rate of losses correlates well with the species' rates of molecular evolution and radiation times, without distinction between insects and vertebrates, indicating their stochastic nature. Remarkably, this extends to the universal single-copy orthologs, losses of dozens of which have been tolerated in each species. Nevertheless, the propensity for loss differs substantially among genes, where roughly 20% of the orthologs have an 8-fold higher chance of becoming extinct. Extrapolation of our data also suggests that the Urbilateria genome contained more than 7,000 genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our results indicate that the seemingly higher number of observed gene losses in insects can be explained by their two- to three-fold higher evolutionary rate. Despite the profound effect of many losses on cellular machinery, overall, they seem to be guided by neutral evolution.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The evolution of gene repertoires is mostly driven by gene duplications and gene losses. Duplications can arise by short-range copying of individual genes or of longer multigene DNA segments, or even result from whole genome duplications <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Copies of single genes are frequently associated with retrotransposition <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, whereas unequal homologous recombination copies DNA segments of varying length. Gene proliferation, on the other hand, is balanced by gene losses, either through acquiring deleterious mutations that eventually disable the genes or as a consequence of unequal homologous recombination.</p>
         <p>Massive gene losses of olfactory receptors were reported in human and ape families compared to mice <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, which have been speculated to be linked with the acquisition of full trichromatic vision, lowering the demand for olfaction <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Ohlson's 'less-is-more' hypothesis emphasizes that loss-of-function mutations may play a beneficial role in evolution <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>; an example for adaptive gene loss is the near-complete fixation of a null allele of <it>CASP12 </it>in the human lineage <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, presumably to confer protection from severe sepsis. Gene losses have also been implicated in reproductive isolation of <it>Drosophila </it>races <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>The fast growing number of available vertebrate and insect genomes allows increasingly refined comparisons and the quantification of the major modes of animal genome evolution. The recent sequencing of the honeybee <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and the <it>Tribolium </it>beetle <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> genomes extends insect genomics from only Dipteran species to the orders of Hymenoptera and Coleoptera, which radiated about 300 million years ago <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. This allowed us to quantify and date losses of orthologous groups across ten bilaterian species in the first analysis that consistently compares five insects (phylum Arthropoda) and five vertebrates (phylum Chordata). Previous studies of gene losses have been focussed on mammals <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, vertebrates <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, or included only a single insect <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, or two dipterans <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Reassuringly, our analysis recovered previously published cases of gene losses, such as the loss of DNA methylation <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, and the heavy rearrangement of the circadian clock <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> in Diptera.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Quantification of ortholog losses</p>
            </st>
            <p>To systematically identify gene losses in vertebrate and insect representatives of Bilaterian species, we delineated orthologous groups based on all-against-all Smith-Waterman comparisons using the official gene sets of five vertebrates (human, mouse, opossum, chicken and pufferfish) and five insects (fruitfly, malaria mosquito, dengue/yellow fever mosquito, honeybee and red flour beetle) (see details in Materials and methods). The species choice aimed to maximize the phylogenetic coverage with similar lineage radiation times in both deuterostomia and protostomia. The fraction of genes with recognized orthology among these species represents about 70-80% of their predicted gene pools. The comparative analysis of the shared content of gene repertoires across these species is discussed in the study presenting the analysis of the first beetle genome, that of <it>Tribolium castaneum </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, and here we focus on the analysis of losses of orthologous genes. According to their phyletic distribution and gene copy-number in each of the species, we considered the following types of orthologous groups reflecting different selection pressures: the universal single-copy orthologs (U); the universal multiple-copy orthologs (N); patchy orthologs (P) that are present in both phyla in at least three species, in single or multiple copies; and insect- or vertebrate-specific orthologs (I and V, respectively). While universal single-copy orthologs (U) are evolving under a distinct pressure for copy-number control, the number of universal multiple-copy orthologs (N) under similarly strict copy-number control is extremely low (only six groups have equal multiple-copy gene number in at least nine species). Although U, N and P orthologs must all have been present in the last common ancestor of insects and vertebrates, the Urbilateria, losses in these fractions occur at different rates.</p>
            <p>Figure <figr fid="F1">1b</figr> shows the size distribution of the orthologous fractions and the number of losses in each species and ortholog category. The tree shown in Figure <figr fid="F1">1a</figr> illustrates the species phylogeny, which allowed us to infer the number of losses on the internal branches, assuming evolutionary parsimony, which minimizes the number of events required to explain the phyletic gene distribution in each orthologous group. The phylogenetic tree was reconstructed using maximum-likelihood analysis of the concatenated alignment of 1,150 universal single-copy orthologs <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> where the lengths of the branches are proportional to the number of accumulated mutations, allowing us to compare the gene loss rates with the rates of lineage divergence (measured as the rate of protein substitutions). For the branches closest to the root, the numbers of gene losses cannot be inferred without an additional outgroup.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Quantification of orthologous gene losses in insects and vertebrates</p>
               </caption>
               <text>
                  <p>Quantification of orthologous gene losses in insects and vertebrates. <b>(a) </b>The phylogenetic relations among the organisms are illustrated by the tree, with branch length proportional to the rate of amino acid substitutions estimated using the maximum-likelihood approach. The number of orthologous groups lost on the internal phylogenetic branches were inferred using the Dollo parsimony principle and are shown on the phylogenetic tree above branches for the I/V fraction, and below branches for the P fraction. *The presence in two species was sufficient to infer losses of I/V orthologous groups. <b>(b) </b>The number of orthologous group losses in the five main categories: U, universal single-copy genes (blue, present in all species except the one in question); N, universal multiple-copy genes (orange, present in at least nine species); P, patchy orthologs (yellow, present in both phyla in at least three species, in one or multiple copies); I/V, insect- or vertebrate-specific orthologous groups (present only in insects (green) or vertebrates (violet), in at least three species. The dark parts of the bars depict the number of contemporarily present orthologous groups, and the light parts depict the number of inferred losses. AGAM, <it>Anopheles gambiae</it>; AAEG, <it>Aedes aegypti</it>; DMEL, <it>Drosophila melanogaster</it>; TCAS, <it>Tribolium castaneum</it>; AMEL, <it>Apis meliferia</it>; HSAP, <it>Homo sapiens</it>; MMUS, <it>Mus musculus</it>; MDOM, <it>Monodelphis domestica</it>; GGAL, <it>Gallus gallus</it>; TNIG, <it>Tetraodon nigroviridis</it>.</p>
               </text>
               <graphic file="gb-2007-8-11-r242-1"/>
            </fig>
            <p>The analysis identified hundreds of differentially lost Urbilaterian genes of U, N and P orthologs in each of the species (see table of Figure <figr fid="F1">1</figr>). Overall, about 40% of ancient orthologous groups have been lost in at least one (out of the ten) species, illustrating the extent of the evolutionary flexibility of gene pools. Moreover, there are dozens of genes lost in each of the species that otherwise appear as universal single-copy orthologs. Koonin <it>et al</it>. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> noted that nearly all pan-eukaryotic single-copy orthologs are subunits of known protein complexes; nevertheless, the observed losses indicate that even seemingly tightly constrained genes are, to a certain degree, dispensable in evolution.</p>
         </sec>
         <sec>
            <st>
               <p>Number of losses correlate with molecular divergence</p>
            </st>
            <p>The inferred distribution of losses over the internal branches of the species phylogeny allowed us to correlate them with the estimated geological time of species radiations and the molecular evolutionary rate in each of the lineages. The molecular rates of evolution were quantified using genome-wide maximum-likelihood analysis of amino acid substitutions in the well aligned regions of single-copy orthologs (see Materials and methods). Figure <figr fid="F2">2a</figr> displays the correlation of the number of losses of the different types of orthologs plotted versus the protein sequence divergence. Losses of U and N orthologs (Figure <figr fid="F2">2b</figr>) occur only at the terminal branches as the fraction definition requires presence of the orthologous genes in at least nine species, whereas losses of P orthologs (Figure <figr fid="F2">2c</figr>) occur at both internal and terminal branches. Correlations are statistically significant for all categories (see Figure <figr fid="F2">2</figr> legend for details), and there is no distinction between insects and vertebrates. Moreover, although the absolute numbers of insect- and vertebrate-specific losses appear different, they in fact follow the same trend when normalized to the total number of the phylum-specific orthologous groups (Figure <figr fid="F2">2d</figr>; see Additional data file 1 for a graph with absolute numbers). The different slopes of the regression lines reflect the varying stringency of evolutionary constraints that differ between the postulated types of orthologs. Despite the difference in the absolute numbers of lost U and N orthologs, their rates of loss are indistinguishable when normalized to the number of such orthologous groups, indicating the same level of purifying selection (Figure <figr fid="F2">2b</figr>). The data show that P orthologs are about 8-fold less constrained than U and N orthologs; this roughly corresponds to about 20% of the common Bilateria gene pool evolving 8 times faster than the remaining, more constrained fraction. I and V orthologs appear to be about three-fold more constrained than P orthologs, which is not surprising as they may represent a similar mixture of a slower evolving fraction of 80% and a faster evolving minority. The level of correlation between the number of losses and the protein sequence divergence rates (Figure <figr fid="F2">2</figr>) is similar to that observed between other genome-wide measures of species divergence <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Chicken was excluded from this and all following analyses as a clear outlier (see Discussion).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The number of ortholog losses correlates with the rate of amino acid substitutions</p>
               </caption>
               <text>
                  <p>The number of ortholog losses correlates with the rate of amino acid substitutions. The number of orthologous group (U, N, P, I/V) losses normalized with the total size of the fraction is plotted versus the branch length of the maximum-likelihood phylogenetic tree (Figure 1). <b>(a) </b>All ortholog types combined; <b>(b) </b>U and N orthologs; <b>(c) </b>Patchy orthologs; <b>(d) </b>Insect- and vertebrate-specific orthologs. Filled symbols denote vertebrates and open symbols denote insects. Spearman rank correlations: U orthologs, rs = 0.79, <it>p </it>= 0.015; N orthologs, rs = 0.67, <it>p </it>= 0.05; P orthologs, rs = 0.90, <it>p </it>&lt; 0.01; I/V orthologs, rs = 0.83, <it>p </it>&lt; 0.01. Regression slopes for U and N are not statistically different. Anc, ancestral.</p>
               </text>
               <graphic file="gb-2007-8-11-r242-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Insects evolve two to three times faster than vertebrates</p>
            </st>
            <p>Protein sequence divergence is significantly larger between insects than between vertebrates (see the longer branch lengths in Figure <figr fid="F1">1</figr>; Mann-Whitney U test, <it>p </it>= 0.009). Similarly, this is reflected in the observation of significantly more frequent gene losses in insects than in vertebrates (Mann-Whitney U test: N orthologs, <it>p </it>= 0.016; P orthologs, <it>p </it>= 0.04). In comparison with vertebrates, the rate of evolution in bee and beetle is about two-fold higher and up to three-fold higher in Diptera. This especially high rate of evolution in Diptera, particularly at the base of the Dipteran radiation, has been noted previously <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Lower estimate of the Urbilateria number of genes</p>
            </st>
            <p>Despite inherent dating uncertainties, the correlation between the number of lost orthologous groups and divergence times is significant for U and P orthologs (Spearman rank correlations: U orthologs, rs = 0.84, <it>p </it>= 0.007; N orthologs, rs = 0.58, <it>p </it>= 0.11; P orthologs, rs = 0.57, <it>p </it>= 0.03), indicating that losses of ancient genes occur in a roughly clock-like manner. The good correlation between the rate of losses with molecular rate and time indicates their stochastic nature. Projection of these trends as shown in Figure <figr fid="F3">3</figr> to 600 million years ago (MYA), presumably dating the radiation of insects and vertebrates, suggests that over 1,000 (95% confidence interval 799-1,456) Urbilaterian genes have been lost from insects and only half this number (95% confidence interval 404-678) from vertebrates. This leads to the lower estimate of the number of Urbilaterian genes of just over 7,000 (remarkably, we obtained 7,114 orthologous groups with at least one insect and one vertebrate member). This estimate, however, does not take into account: genes that currently appear as insect- or vertebrate-specific, many of which could be of Urbilaterian origin; closely related Urbilateria paralogs that remain unresolved and are likely grouped together in N groups; as well as fractions of fast diverging genes that escaped our orthology classification.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Extrapolation of number of ancient (U, N and P) orthologs to Urbilateria</p>
               </caption>
               <text>
                  <p>Extrapolation of number of ancient (U, N and P) orthologs to Urbilateria. The regression lines (and their 90% confidence intervals) are drawn using the number of U, N and P orthologous groups in current species, the estimates for putative ancestors, including the inferred number of losses (Figure 1), and the assumed split of insects and vertebrates about 600 MYA against the species radiation time. Remarkably, the na&#239;ve counting of orthologous groups that have at least one insect and at least one vertebrate member results in 7,114 likely Urbilateria genes.</p>
               </text>
               <graphic file="gb-2007-8-11-r242-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Functional load of losses</p>
            </st>
            <sec>
               <st>
                  <p>Recovered known facts as positive controls</p>
               </st>
               <p>Reassuringly, closer inspection of several of the predicted cases of lost genes pointed to recently published findings of lineage-specific biology.</p>
               <sec>
                  <st>
                     <p>Hedgehog signaling pathway rearrangements in <it>Drosophila</it></p>
                  </st>
                  <p>Hedgehog signaling pathway rearrangements in <it>Drosophila </it>have been reported where orthologs of human <it>Sil</it>, <it>Hip </it>and <it>Gas1 </it>are missing from <it>Drosophila </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and homologs of polaris/TG737 (nompB) and <it>Kif3a </it>(<it>Klp64D</it>) appear to have roles unrelated to hedgehog signaling <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
               </sec>
               <sec>
                  <st>
                     <p><it>Sid-1/tag-130 </it>gene loss in all Diptera</p>
                  </st>
                  <p><it>Sid-1</it>/<it>tag-130 </it>genes have been lost in all Diptera but are present in bee and <it>Tribolium </it>as reported by Weinstock <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. <it>Sid-1 </it>is implicated in the cellular import of RNA interference signal and enables passive uptake of double-stranded RNA (yet, <it>Sid-1 </it>is likely to be a <it>Caenorhabditis elegans </it>invention as its inparalog, <it>TAG130</it>, is less derived (Additional data file 2)).</p>
               </sec>
               <sec>
                  <st>
                     <p>DNA-methyltransferases DNMT1 and DNMT3B lost in the Coleoptera/Diptera ancestor</p>
                  </st>
                  <p>DNA-methyltransferases DNMT1 and DNMT3B have been lost in the Coleoptera/Diptera ancestor, consistent with their loss reported in Diptera <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and their surprising presence in honeybee <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
               </sec>
               <sec>
                  <st>
                     <p>A candidate insect telomerase reverse transcriptase</p>
                  </st>
                  <p>A candidate insect telomerase reverse transcriptase (TERT) is present in honeybee and <it>Tribolium </it>but absent in all Dipterans. The absence of TERT in Diptera seems to be correlated with the loss of telomeric TTAGG repeats <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
               </sec>
               <sec>
                  <st>
                     <p>Sterol metabolism, NAD biosynthesis, and other losses</p>
                  </st>
                  <p>Sterol metabolism, NAD biosynthesis, and other losses proposed earlier from the comparison of the fly and the mosquito genomes <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> seem to have been lost in all insects sequenced so far, that is, before the appearance of holometabolous insects. Exceptions are a dihydroxyacetone kinase 1 lost from both <it>Drosophila </it>and <it>Anopheles</it>, a C-5 sterol desaturase and a histidine ammonia-lyase present in only <it>Tribolium </it>and two genes present in only honeybee, an ornithine carbamoyltransferase and a malonyl-CoA decarboxylase.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Novel case stories</p>
               </st>
               <p>Below we describe some of the identified losses that are likely to have had an impact on the functional divergence of the lineages, exemplifying losses of different types of orthologs, from the most conserved single-copy genes to orthologs with a highly patchy phylogenetic distribution. It has been suggested that secondary gene losses can be driven by the losses of key players of particular pathways or complexes that disable their functionality <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Hence, we mapped losses to the characterized biochemical pathways annotated for human orthologs; the results for <it>Tribolium </it>and its ancestor are overviewed in Table <tblr tid="T1">1</tblr>. However, having low numbers of losses per pathway <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, we concentrated more on providing examples of losses of directly interacting genes, reported for <it>Drosophila </it>from protein-protein interaction screens <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> and literature co-citations via human orthologs <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Losses in <it>Tribolium </it>and its (Coleoptera/Diptera) ancestor mapped to pathways using human orthologs</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c ca="left">
                           <p>Pathway</p>
                        </c>
                        <c ca="left">
                           <p>Source</p>
                        </c>
                        <c ca="left">
                           <p>Genes</p>
                        </c>
                        <c ca="center">
                           <p>Total</p>
                        </c>
                        <c ca="center">
                           <p>Only in beetle</p>
                        </c>
                        <c ca="center">
                           <p>Significance</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Neuroactive ligand-receptor interaction</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>NPFFR1</it>, <it>NPFFR2</it>, <it>BZRP</it>, <it>TSPO</it>, <it>GALR1</it>, <it>GALR2</it>, <it>GALR3</it></p>
                        </c>
                        <c ca="center">
                           <p>6</p>
                        </c>
                        <c ca="center">
                           <p>6</p>
                        </c>
                        <c ca="center">
                           <p>0.099</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ABC transporters - general</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>ABCA1</it>, <it>ABCA4</it>, <it>ABCA12</it>, <it>ABCC5</it>, <it>ABCC12</it></p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>4.73E-005*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Oxidative phosphorylation</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>ATP6V0E</it>, <it>UCRC (7.2 kDa</it>), <it>NDUFA7</it>, <it>COX7C</it></p>
                        </c>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Cell cycle</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>CDC7</it>, <it>CCNE1</it>, <it>DBF4</it></p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0.12</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Folate biosynthesis/starch and sucrose metabolism</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>RAD54B</it>, <it>SETX</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Alkaloid biosynthesis II</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>DDHD1</it>, <it>SLC27A2</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>0.02<sup>&#8224;</sup></p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Regulation of actin cytoskeleton</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>FGD1</it>, <it>IQGAP1</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Cholera - infection</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>ATP6V0E1</it>, <it>TRIM23</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Purine metabolism</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>PDE1C</it>, <it>POLR2L</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.49</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Ribosome</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>RPL29</it>, <it>RPL39</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.53</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Neurodegenerative disorders</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>BCL2L1</it>, <it>NGFR</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Propanoate metabolism</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>MLYCD</it>, <it>SLC27A2</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>0.07</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Methionine metabolism</p>
                        </c>
                        <c ca="left">
                           <p>KEGG</p>
                        </c>
                        <c ca="left">
                           <p><it>DNMT1</it>, <it>DNMT3B</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Oxidative stress induced gene expression via Nrf2</p>
                        </c>
                        <c ca="left">
                           <p>Biocarta</p>
                        </c>
                        <c ca="left">
                           <p><it>HMOX1</it>, <it>NGFR</it></p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>0.02<sup>&#8224;</sup></p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The statistical significance of the coordinated losses of at least two genes per pathway was calculated using hypergeometric test (* <it>p </it>&lt; 0.01,<sup>&#8224;</sup><it>p </it>&lt; 0.05).</p>
                  </tblfn>
               </tbl>
               <sec>
                  <st>
                     <p>Losses of universal single-copy orthologs</p>
                  </st>
                  <p>An example of a universal single-copy ortholog missing in <it>Drosophila </it>is a 35 kDa protein associated with U11 snRNPs. U11 and U12 are components of the minor spliceosome responsible for the splicing of a small number of U12-type introns (&lt;1% in both humans and flies) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The minor spliceosome is widely conserved from plants to humans, including most insects but absent from <it>C. elegans</it>. Lack of a clear ortholog of U11 snRNA and the associated 35 kDa protein has been initially proposed <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, but Schneider <it>et al</it>. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> identified a highly divergent U11 snRNA. The 31 kDa and 35 kDa proteins seem to be missing from all <it>Drosophila </it>species and a 25 kDa protein is absent from Diptera <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Interestingly, the loss of U11/U12 spliceosomal proteins in <it>Drosophila </it>is accompanied by the loss of the majority of U12-type introns <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>.</p>
                  <p>Another example of a universal gene that seems to be missing from the <it>Drosophila </it>genome is sortilin-related receptor LR11 (also known as <it>SorLA</it>), a member of the low-density lipoprotein receptor family. LR11 binds low-density lipoprotein, the major cholesterol-carrying lipoprotein of plasma, and transports it into cells by endocytosis. Human LR11 also regulates trafficking of amyloid precursor protein and its expression is decreased in the brain of Alzheimer's disease patients <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
               </sec>
               <sec>
                  <st>
                     <p>Losses of universal multi-copy orthologs</p>
                  </st>
                  <p>An example is the <it>Cdc7 </it>kinase and its regulatory subunit <it>Dbf4 </it>implicated in triggering DNA replication in G1 phase through phosphorylation of Mcm proteins <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. <it>Cdc7 </it>is essential in yeast in contrast to mice where homozygous null mutants for the <it>Cdc7 </it>ortholog <it>Nr2c2 </it>show impaired spermatogenesis <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. <it>Cdc7 </it>is a universal single-copy gene with two fly paralogs whereas <it>Dbf4 </it>is present in two copies in humans and opossum. We confirmed the loss of both genes in <it>Tribolium </it>by tBlastn search and phylogenetic analysis (Additional data file 3). <it>Cdc7 </it>is also missing from the current <it>Anopheles </it>annotation but tBlastn searches identified a <it>Cdc7 </it>candidate in the genome. <it>Dbf4 </it>appears to be missing from the <it>Anopheles </it>and <it>Tetraodon </it>genomes. Interestingly, in yeast an allele of the gene <it>MCM5 </it>(<it>CDC46</it>) has been identified that bypasses the requirement for CDC7/DBF4 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Although the <it>Tribolium </it>MCM5 ortholog TC_09146 does not feature the same mutation, P86L, it is conceivable that a similar mutation has rendered CDC7/DBF4 disposable in <it>Tribolium</it>.</p>
                  <p>Another example of a loss of an otherwise universal gene is the <it>Tribolium </it>ortholog of human ATP-binding cassette transporter A1 (ABCA1). ABCA1 is a cholesterol efflux transporter and is also required for engulfment of apoptotic cells by macrophages in mice and <it>C. elegans </it><abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. In humans, the turnover of ABCA1 is regulated by Alpha1-syntrophin <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, and both genes encoding these proteins appear to be missing from the beetle genome.</p>
               </sec>
               <sec>
                  <st>
                     <p>Losses of patchy and insect-specific orthologs</p>
                  </st>
                  <p>We observed numerous losses in Diptera, many of which seem to be involved in the ubiquitin cycle, DNA repair (also reported in <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>), actin cytoskeleton and transcription control (Additional data file 4), which may point to substantial rearrangements of the ancestral pathways. An intriguing example is the BRCC complex, a complex with ubiquitin E3 ligase activity known to be involved in DNA repair, cell cycle regulation and homology-directed repair in human that has lost <it>BRCA1</it>, <it>RBBP-8 </it>and <it>BRCC3 </it>in the Dipteran lineage.</p>
                  <p>An example of insect-specific orthologous groups lost in Diptera are genes associated with oxidoreductase activity, including Aldo/keto reductases and several FAD dependent oxidoreductases; this category of genes was enriched among the 160 Diptera gene losses in a comparative Gene Ontology (GO) analysis with the <it>Tribolium </it>genome (Additional data file 5).</p>
               </sec>
               <sec>
                  <st>
                     <p>Extreme cases: exclusive insect models of human genes</p>
                  </st>
                  <p>At extremes, each novel insect genome sequence uncovers previously invisible orthologous gene relationships to human genes (see <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> for venn diagram that shows how many new orthologous relations are uncovered by the honeybee and beetle genomes). For example, we identified 45 orthologous groups shared between honeybee and at least one vertebrate but lost in the Coleoptera/Diptera ancestor, for example, an ortholog of RAD18 (GB-14468) that is an E3 ubiquitin-protein ligase involved in postreplication repair of UV-damaged DNA.</p>
                  <p>To complement the initial analysis of the <it>Tribolium </it>genome, we further identified 62 genes that are present in all vertebrates and <it>Tribolium </it>but lost from the other four insect genomes. Examples include <it>Yipf3</it>, a natural killer cell-specific antigen expressed during embryonic hematopoiesis in humans, and CENP-S, which in humans is a component of a centromeric protein complex, CENPA-CAD <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, that replaces histones in centromeres. In <it>Tribolium</it>, the orthologs of the other five complex members <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> seem to be absent from the genome, indicating a different mode of action. <it>CRLF3</it>, a cytokine receptor-like factor 3, has also been lost in all insects but <it>Tribolium</it>, as well as a regulator of the NF-&#954;B pathway, <it>Tgf</it>, which positively regulates I-kappaB kinase. Because of structural and functional similarities in the mode of activation between insect and vertebrate NF-&#954;B/Rel transcription factors, they are thought to have countered infections in Urbilateria <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Another <it>Tribolium </it>gene, TC_04309, absent in all sequenced arthropods, is an ortholog of the human F-box only gene 7 (<it>Fbxo7</it>). The tBlastn search and phylogenetic analysis of FBXO7 confirmed the lack of clear orthologs in other insects (Additional data files 6 and 7). FBXO7 is a component of modular E3 ubiquitin protein ligases called SCFs and functions in phosphorylation-dependent ubiquitination <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Human FBXO7 was also reported to positively regulate the activity of cyclin D/CDK6 in order to facilitate entry into the cell cycle <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. In <it>Drosophila</it>, the cyclin D/CDK6 complex stimulates cell growth as well as proliferation <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Amniotes have two cdk4 orthologs due to a gene duplication (cdk4 and cdk6) and two cyclin D orthologs. <it>Tribolium </it>also encodes a cdk4 ortholog and two cyclin D orthologs, in contrast to the other insects, which have only one cyclin D ortholog (Additional data file 8). It is tempting to speculate that the presence of FBXO7 and a second cyclin D in the beetle are functionally linked. TC_04309 is expressed in limbs and the hindgut of <it>Tribolium </it>embryos (Figure <figr fid="F4">4</figr>). The function of TC_04309 is unknown but, taken together, it is conceivable that it controls cell proliferation in a tissue-specific manner as in mammals.</p>
                  <fig id="F4">
                     <title>
                        <p>Figure 4</p>
                     </title>
                     <caption>
                        <p>Expression pattern of the F-box gene during embryogenesis of the beetle <it>T. castaneum</it></p>
                     </caption>
                     <text>
                        <p>Expression pattern of the F-box gene during embryogenesis of the beetle <it>T. castaneum</it>. <b>(a) </b>F-box gene TC_04309 is initially expressed in the germ rudiment at the rims of the invaginating mesoderm, a position where activated Map-kinase is also seen [76]. Expression is strongest in the posterior (white arrow) and weakens towards the anterior. Ventral view, anterior is up posterior points down. Hl, head lobes. <b>(b) </b>Expression is seen as spots in the thoracic legs (arrowhead), at the base of the labral head appendages (small arrow head) and in segmentally repeated spots in the lateral body wall. T3, thoracic segment 3. Only the anterior half of the embryo is shown. <b>(c) </b>At a similar stage as shown in (b) where all body segments are present, the F-box gene is expressed in the anlagen of the hindgut (arrow). <b>(d) </b>When the legs have grown longer, F-box gene expression is extended covering the distal end. As seen in (c, d), expression in the labrum, in the hindgut-primordium and weakly at the lateral sites of the abdominal segments persists. <b>(e) </b>The hindgut has invaginated and grows out, forming a tube where the F-box gene is expressed in its posterior, proximal end around the future posterior gut opening (arrow). <b>(f) </b>At the retracted germ band stage, F-box is expressed around the anterior gut opening (white arrow) that has formed between the head lobes. <b>(g) </b>In the same embryo shown in (f), F-box gene expression is seen in the walls of the hindgut (arrow).</p>
                     </text>
                     <graphic file="gb-2007-8-11-r242-4"/>
                  </fig>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Robustness of estimates</p>
               </st>
               <p>Several factors can lead to an overestimation of gene losses: incomplete annotations, genome sequence gaps and the limited sensitivity of protein sequence comparison methods. Reassuringly, our estimates of number of gene losses for <it>Drosophila </it>and human, the two most extensively studied and curated model organisms, are similar to that of automatically annotated species, indicating a fairly good quality of genome annotations and their relative completeness. The chicken genome, however, shows exceptionally high numbers of losses in all categories that are likely to be overestimates due to an incomplete genome sequence that was estimated to be missing 5-10% of the genes <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, and, therefore, it was not taken into account in the analysis presented above. A closer inspection of 'missing' <it>Tribolium </it>genes from the universal single-copy and insect-specific fraction allowed us to correct about 30 <it>Tribolium </it>genes overlooked by the automatic annotation and some 150 merged genes. Nevertheless, most 'missing' genes were confirmed to be absent from the genome using tBlastn searches, indicating that the <it>Tribolium </it>annotation is nearly complete for evolutionarily conserved genes. Orthology misclassifications can also lead to inflated estimates when orthologous groups are wrongly split up, or to underestimates when several orthologous groups are spuriously pooled together, 'hiding' losses. We compared the results of our analysis with an independently derived and hand curated set of about 100 gene losses in Diptera (Hugh Robertson, personal communication). Detailed phylogenetic examination revealed only two to three cases of likely errors in our high throughput orthology identification pipeline and a few complicated cases that could not be resolved even using phylogenetic methods as the proteins were too short or too divergent.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We present here the quantification of losses of orthologous groups in five vertebrate and five insect genomes. Lineage-specific gene duplications result in multi-copy orthologous groups or fine-grained gene families. Here we focused on complete losses of such gene families (requiring all orthologous genes to be lost in a lineage). Members of an orthologous group are likely to share overlapping functions, and a complete loss of all representatives is more likely to have biological consequences <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp> than a loss of a specific gene member. The parsimonious interpretation of the losses in the context of the species phylogeny suggests hundreds of gene losses on each branch of the tree. Diptera species lost the most genes, and placentals the least.</p>
         <p>We show that the higher numbers of lost genes in insects can be explained by their higher rates of evolution as the loss rate is positively correlated with the molecular rate of evolution for each ortholog category and branch of the phylogeny. Interestingly, losses normalized for evolutionary rate and total number of orthologous groups are similar between insects and vertebrates, even for I and V orthologs. Therefore, one can not exclude that gene losses are mainly driven by neutral evolution <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>, which should be taken as the null hypothesis until proven otherwise. Our data also suggest that about 20% of the gene repertoire evolves 8 times faster than the rest. The fact that the overall number of losses of orthologous groups is in agreement with the model of neutral evolution does not, of course, mean that all losses are selectively neutral. In that respect, it is noteworthy that some of the lost genes we discuss, such as <it>Cdc7/Dbf4 </it>or <it>Fboxo7</it>, seem to act as positive regulators.</p>
         <p>Several hypotheses have been put forward to explain differences in evolutionary rates across species <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. A high evolutionary rate might simply reflect differences in mutation rates. The known contributors to the rate of mutations, metabolic rate <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> and generation time <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, are clearly different between dipterans and mammals. In addition, differences in DNA methylation, fidelity of DNA-repair mechanisms or the production of DNA-damaging agents have also been suggested to explain different mutation rates in different species <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B56">56</abbr></abbrgrp>. We found a number of genes implicated in DNA repair missing from Diptera. Although the functional consequences of these losses in Diptera are unknown, they might contribute to an increased mutation rate. A second hypothesis is that the efficiency of selection against deleterious mutants varies across species, due to differences in effective population size and/or mode of reproduction. Finally, rate variation across lineages could be caused by species-specific differences in the timing and frequency of adaptive evolution. Indeed, theoretical models <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp> have proposed that evolvability is a selectable trait.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We showed that the gene loss rate correlates well with rates of molecular evolution, explaining the significantly higher number of gene losses in insects. The data also can not reject that gene losses are dominated by neutral evolution.</p>
         <p>The hundreds of lost genes we identified along the phylogenetic tree suggest common rearrangements and rewiring of ancient pathways and signaling cascades. Such global approaches are suitable for generating further experimentally testable hypotheses, and will lead to a better understanding of global evolutionary trends and detailed functional differences among lineages.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Orthology classification</p>
            </st>
            <p>Protein sets were retrieved from Ensembl for <it>Drosophila</it>, <it>Anopheles </it>and all vertebrates as of 4 August 2006. <it>Tribolium </it>and <it>Apis </it>proteins were retrieved from Baylor College of Medecine and <it>Aedes </it>proteins from VectorBase. The assignment to orthologous groups was performed as described earlier <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>. Namely, we retained the longest open reading frame per locus and performed all-against-all comparisons using the Smith-Waterman algorithm as implemented in ParAlign <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> with default parameters. The orthologous groups were then assembled from the best reciprocal hits (BRHs; reciprocally best maching genes in between-genome-comparisons) applying a COG-like <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> procedure to join BRHs across three or more species, going from the best scoring ones until an E-value cut-off of 10<sup>-6</sup>, and keeping single BRH pairs only with E-values less than 10<sup>-10</sup>. Furthermore, the orthologous groups were expanded by genes that are more similar to each other within a proteome than to any gene in any of the other species, and by very similar copies that share over 97% sequence identity, which were identified initially using CD-hit <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. All proteins in a group were required to have aligned regions overlapping by at least 20 residues to avoid the 'domain walking' effect.</p>
         </sec>
         <sec>
            <st>
               <p>Species tree</p>
            </st>
            <p>A maximum likelihood species tree was calculated using the concatenated multiple alignments of 1,150 orthologs present in exactly one copy in all the organisms studied here. Multiple protein alignments were produced using muscle <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> and confidently aligned regions were extracted using Gblocks <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> with default settings. Individual protein alignments were concatenated into a 336,069 amino acid superalignment that was then subjected to maximum-likelihood analysis using the JTT model (G4+I+F) as implemented in PhyML <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> and we used Tree-Puzzle <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> to join separate bootstrap analyses. All shown branchings have at least 99% bootstrap support estimated from 500 replicates.</p>
         </sec>
         <sec>
            <st>
               <p>Quantification of losses and correlation with other traits</p>
            </st>
            <p>Species radiation dates were taken from the literature: for insects from <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and for vertebrates from <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. We used MatLab version 7.2 (MathWorks, Natick MA, USA) for statistical analysis and data plotting. Regression lines were required to cross the origin. For each category of orthologs, the slopes of the regression lines for insects and vertebrates were compared based on a Student's <it>t</it>-distribution and were found not to be significantly different. Because traits were not normally distributed, we used non-parametric Spearman's correlation coefficients and Mann-Whitney U tests. Chicken data were excluded from graphs and statistical tests (see Discussion).</p>
         </sec>
         <sec>
            <st>
               <p>Manual analysis of case studies</p>
            </st>
            <p>Selected orthologous groups were examined manually as follows. The absence of <it>Tribolium </it>proteins was verified by screening the <it>Tribolium </it>proteome, genome (assembled and single reads) and expressed sequence tags using the Baylor College of Medicine blast server <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. All sufficiently similar sequences, including members of other orthologous groups, were aligned using muscle v3.6 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> with default settings and all positions containing gaps were trimmed from conserved blocks using Gblocks <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Phylogenetic trees were constructed using maximum likelihood as implemented in PhyML <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> using the JTT model of amino acid substitution, a gamma distribution of rates over four rate categories and 100 bootstraps.</p>
         </sec>
         <sec>
            <st>
               <p>Pathway mapping and database searches</p>
            </st>
            <p>We used pathway annotations from the KEGG database <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, mapping genes to Biocarta and co-citation analysis using Webgestalt <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> web interface. For data mining we used Ensembl <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>, Swiss-prot/UniProt <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>, Flybase <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, the Interactive Fly <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and Online Mendelian Inheritance in Man <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> annotations.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>ABCA1, ATP-binding cassette transporter A1; BRH, best reciprocal hit; CRLF3, cytokine receptor-like factor 3; GO, Gene Ontology; I, insect-specific orthologs; MYA, million years ago; N, universal multiple-copy orthologs; P, patchy orthologs; TERT, insect telomerase reverse transcriptase; U, universal single-copy orthologs; V, vertebrate-specific orthologs.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>SW and EMZ analyzed the data and wrote the manuscript. EK contributed the orthology data and the species phylogeny. RS and TK contributed experimental characterization of exclusive <it>Tribolium </it>and honeybee orthologs of human genes. EMZ supervised the project.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is a graph showing the correlation between the absolute number of lost orthologous groups and and the rate of amino acid substitutions. Additional data files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr> provide the phylogenetic analysis of SID-1 and CDC7, respectively. Additional data file <supplr sid="S4">4</supplr> is a table listing GO analysis of insect-specific orthologous groups lost in all Dipterans. Additional data file <supplr sid="S5">5</supplr> is a table listing functionally linked genes coeliminated in the Diptera and Drosophila lineages. Additional file <supplr sid="S6">6</supplr> provides the phylogenetic analysis of Fboxo7/cdk4/cyclin D. Additional data file <supplr sid="S7">7</supplr> is a figure showing the alignment of Fboxo7 proteins.</p>
         <suppl id="S1">
            <title>
               <p>Additional data File 1</p>
            </title>
            <caption>
               <p>Correlation between the absolute number of lost orthologous groups and the rate of amino acid substitutions</p>
            </caption>
            <text>
               <p>Correlation between the absolute number of lost orthologous groups and the rate of amino acid substitutions.</p>
            </text>
            <file name="gb-2007-8-11-r242-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data File 2</p>
            </title>
            <caption>
               <p>Phylogenetic analysis of SID-1 proteins</p>
            </caption>
            <text>
               <p>Phylogenetic analysis of SID-1 proteins.</p>
            </text>
            <file name="gb-2007-8-11-r242-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data File 3</p>
            </title>
            <caption>
               <p>Phylogenetic analysis of CDC7 proteins</p>
            </caption>
            <text>
               <p>Phylogenetic analysis of CDC7 proteins.</p>
            </text>
            <file name="gb-2007-8-11-r242-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data File 4</p>
            </title>
            <caption>
               <p>GO analysis of insect-specific orthologous groups lost in all Dipterans</p>
            </caption>
            <text>
               <p>GO analysis of insect-specific orthologous groups lost in all Dipterans.</p>
            </text>
            <file name="gb-2007-8-11-r242-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data File 5</p>
            </title>
            <caption>
               <p>Coeliminiation of functionally linked genes in the Diptera and <it>Drosophila </it>lineages</p>
            </caption>
            <text>
               <p>Coeliminiation of functionally linked genes in the Diptera and <it>Drosophila </it>lineages.</p>
            </text>
            <file name="gb-2007-8-11-r242-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data File 6</p>
            </title>
            <caption>
               <p>Phylogenetic analysis of Fboxo7, cdk4 and cyclin D</p>
            </caption>
            <text>
               <p>Phylogenetic analysis of Fboxo7, cdk4 and cyclin D.</p>
            </text>
            <file name="gb-2007-8-11-r242-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data File 7</p>
            </title>
            <caption>
               <p>Alignment of Fboxo7 proteins</p>
            </caption>
            <text>
               <p>Alignment of Fboxo7 proteins.</p>
            </text>
            <file name="gb-2007-8-11-r242-S7.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Hugh Robertson for sharing unpublished data, Peer Bork for stimulating discussions, and Robert M Waterhouse for help with the manuscript. Swiss National Science Foundation is acknowledged for funding (SNF 3100A0-112588 to EMZ).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Segmental duplications and the evolution of the primate genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Samonte</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Eichler</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>65</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg705</pubid>
                  <pubid idtype="pmpid" link="fulltext">11823792</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evolution by gene duplication: an update.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2003</pubdate>
            <volume>18</volume>
            <fpage>292</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0169-5347(03)00033-8</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>RNAs from all categories generate retrosequences that may be exapted as novel genes or regulatory elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1999</pubdate>
            <volume>238</volume>
            <fpage>115</fpage>
            <lpage>134</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(99)00227-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10570990</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Evolutionary dynamics of olfactory and other chemosensory receptor genes in vertebrates.</p>
            </title>
            <aug>
               <au>
                  <snm>Niimura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Human Genet</source>
            <pubdate>2006</pubdate>
            <volume>51</volume>
            <fpage>505</fpage>
            <lpage>517</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s10038-006-0391-8</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Loss of olfactory receptor genes coincides with the acquisition of full trichromatic vision in primates.</p>
            </title>
            <aug>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wiebe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lancet</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Paabo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>E5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314465</pubid>
                  <pubid idtype="pmpid" link="fulltext">14737185</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>When less is more: gene loss as an engine of evolutionary change.</p>
            </title>
            <aug>
               <au>
                  <snm>Olson</snm>
                  <fnm>MV</fnm>
               </au>
            </aug>
            <source>Am Human Genet</source>
            <pubdate>1999</pubdate>
            <volume>64</volume>
            <fpage>18</fpage>
            <lpage>23</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1086/302219</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Gene losses during human origins.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Grus</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e52</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1361800</pubid>
                  <pubid idtype="pmpid" link="fulltext">16464126</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Adaptive loss of an old duplicated gene during incipient speciation.</p>
            </title>
            <aug>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CI</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <fpage>401</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj045</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251509</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Insights into social insects from the genome of the honeybee <it>Apis mellifera</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Worley</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Maleszka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Weaver</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Beye</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>443</volume>
            <fpage>931</fpage>
            <lpage>949</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2048586</pubid>
                  <pubid idtype="pmpid">17073008</pubid>
                  <pubid idtype="doi">10.1038/nature05260</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The first genome sequence of a beetle, <it>Tribolium castaneum</it>, a model for insect development and pest biology.</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>TGS</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <inpress/>
         </bibl>
         <bibl id="B11">
            <aug>
               <au>
                  <snm>Grimaldi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Evolution of the Insects</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The evolution of Mammalian gene families.</p>
            </title>
            <aug>
               <au>
                  <snm>Demuth</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Bie</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Cristianini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2006</pubdate>
            <volume>1</volume>
            <fpage>e85</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1762380</pubid>
                  <pubid idtype="pmpid" link="fulltext">17183716</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The gain and loss of genes during 600 million years of vertebrate evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Blomme</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vandepoele</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>De Bodt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Simillion</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maere</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R43</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779523</pubid>
                  <pubid idtype="pmpid" link="fulltext">16723033</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-5-r43</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395751</pubid>
                  <pubid idtype="pmpid" link="fulltext">14759257</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-2-r7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Differential loss of ancestral gene families as a source of genomic divergence in animals.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Biol Sci</source>
            <pubdate>2004</pubdate>
            <volume>271</volume>
            <issue>Suppl 3</issue>
            <fpage>S107</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1809979</pubid>
                  <pubid idtype="pmpid" link="fulltext">15101434</pubid>
                  <pubid idtype="doi">10.1098/rsbl.2003.0124</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Functional CpG methylation system in a social insect.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jorda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Maleszka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ling</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Mizzen</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Peinado</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>314</volume>
            <fpage>645</fpage>
            <lpage>647</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1135213</pubid>
                  <pubid idtype="pmpid" link="fulltext">17068262</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Molecular and phylogenetic analyses reveal mammalian-like clockwork in the honey bee (<it>Apis mellifera</it>) and shed new light on the molecular evolution of the circadian clock.</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EB</fnm>
               </au>
               <au>
                  <snm>Shemesh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Elgavish</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1352</fpage>
            <lpage>1365</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626637</pubid>
                  <pubid idtype="pmpid" link="fulltext">17065608</pubid>
                  <pubid idtype="doi">10.1101/gr.5094806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Consistency of genome-based methods in measuring Metazoan evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Zdobnov</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2005</pubdate>
            <volume>579</volume>
            <fpage>3355</fpage>
            <lpage>3361</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.febslet.2005.04.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15943981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenomic analysis reveals bees and wasps (Hymenoptera) at the base of the radiation of Holometabolous insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Savard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tautz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Werren</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1334</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626634</pubid>
                  <pubid idtype="pmpid" link="fulltext">17065606</pubid>
                  <pubid idtype="doi">10.1101/gr.5204306</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Shifted, the <it>Drosophila </it>ortholog of Wnt inhibitory factor-1, controls the distribution and movement of Hedgehog.</p>
            </title>
            <aug>
               <au>
                  <snm>Glise</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Crozatier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Halbisen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Vincent</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Blair</snm>
                  <fnm>SS</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <fpage>255</fpage>
            <lpage>266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.devcel.2005.01.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">15691766</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Genetic dissection of mechanosensory transduction: mechanoreception-defective mutations of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kernan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cowan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Neuron</source>
            <pubdate>1994</pubdate>
            <volume>12</volume>
            <fpage>1195</fpage>
            <lpage>1206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0896-6273(94)90437-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">8011334</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The Interactive Fly</p>
            </title>
            <url>http://www.sdbonline.org/fly/segment/shifted4.htm</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Comparative genome and proteome analysis of Anopheles gambiae and <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Zdobnov</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Torrents</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Suyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Thomasova</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>GM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>149</fpage>
            <lpage>159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1077061</pubid>
                  <pubid idtype="pmpid" link="fulltext">12364792</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Canonical TTAGG-repeat telomeres and telomerase in the honey bee, <it>Apis mellifera</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Robertson</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>KHJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1345</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626636</pubid>
                  <pubid idtype="pmpid" link="fulltext">17065609</pubid>
                  <pubid idtype="doi">10.1101/gr.5085606</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>TO</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>4285</fpage>
            <lpage>4288</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">16324</pubid>
                  <pubid idtype="pmpid" link="fulltext">10200254</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.8.4285</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Lineage-specific loss and divergence of functionally linked genes in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>11319</fpage>
            <lpage>11324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17198</pubid>
                  <pubid idtype="pmpid" link="fulltext">11016957</pubid>
                  <pubid idtype="doi">10.1073/pnas.200346997</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Predicting functional gene links from phylogenetic-statistical analyses of whole genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pagel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plos Comput Biol</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>24</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pcbi.0010003</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>BioGRID: a general repository for interaction datasets.</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Reguly</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D535</fpage>
            <lpage>539</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347471</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381927</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>WebGestalt: an integrated system for exploring gene sets in various biological contexts.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kirov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Snoddy</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W741</fpage>
            <lpage>W748</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160236</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980575</pubid>
                  <pubid idtype="doi">10.1093/nar/gki475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Splicing of a rare class of introns by the U12-dependent spliceosome.</p>
            </title>
            <aug>
               <au>
                  <snm>Will</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Luhrmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>386</volume>
            <fpage>713</fpage>
            <lpage>724</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1515/BC.2005.084</pubid>
                  <pubid idtype="pmpid" link="fulltext">16201866</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The Genome Sequence of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Celniker</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanatides</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Hoskins</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Galle</snm>
                  <fnm>RF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2185</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2185</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731132</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Identification of an evolutionarily divergent U11 small nuclear ribonucleoprotein particle in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Will</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Frilander</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Luhrmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>9584</fpage>
            <lpage>9589</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">470718</pubid>
                  <pubid idtype="pmpid" link="fulltext">15210936</pubid>
                  <pubid idtype="doi">10.1073/pnas.0403400101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Spliceosomal small nuclear RNA genes in 11 insect genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Mount</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Gotea</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Hernandez</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Makalowski</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2007</pubdate>
            <volume>13</volume>
            <fpage>5</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1705759</pubid>
                  <pubid idtype="pmpid" link="fulltext">17095541</pubid>
                  <pubid idtype="doi">10.1261/rna.259207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Comprehensive splice-site analysis using comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Sheth</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Roca</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hastings</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Roeder</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Krainer</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Sachidanandam</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>3955</fpage>
            <lpage>3967</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1557818</pubid>
                  <pubid idtype="pmpid" link="fulltext">16914448</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Interaction of the cytosolic domains of sorLA/LR11 with the amyloid precursor protein (APP) and beta-secretase beta-site APP-cleaving enzyme.</p>
            </title>
            <aug>
               <au>
                  <snm>Spoelgen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>von Arnim</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Peltan</snm>
                  <fnm>ID</fnm>
               </au>
               <au>
                  <snm>Koker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>OM</fnm>
               </au>
               <au>
                  <snm>Willnow</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Hyman</snm>
                  <fnm>BT</fnm>
               </au>
            </aug>
            <source>J Neurosci</source>
            <pubdate>2006</pubdate>
            <volume>26</volume>
            <fpage>418</fpage>
            <lpage>428</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1523/JNEUROSCI.3882-05.2006</pubid>
                  <pubid idtype="pmpid" link="fulltext">16407538</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>DNA replication in eukaryotic cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Bell</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Dutta</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2002</pubdate>
            <volume>71</volume>
            <fpage>333</fpage>
            <lpage>374</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.71.110601.135425</pubid>
                  <pubid idtype="pmpid" link="fulltext">12045100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Targeted inactivation of testicular nuclear orphan receptor 4 delays and disrupts late meiotic prophase and subsequent meiotic divisions of spermatogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Mu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>YF</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YT</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shyr</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>24</volume>
            <fpage>5887</fpage>
            <lpage>5899</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/MCB.24.13.5887-5899.2004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>mcm5/cdc46-bob1 bypasses the requirement for the S phase activator Cdc7p.</p>
            </title>
            <aug>
               <au>
                  <snm>Hardy</snm>
                  <fnm>CFJ</fnm>
               </au>
               <au>
                  <snm>Dryga</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Seematter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pahl</snm>
                  <fnm>PMB</fnm>
               </au>
               <au>
                  <snm>Sclafani</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>3151</fpage>
            <lpage>3155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20337</pubid>
                  <pubid idtype="pmpid" link="fulltext">9096361</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.7.3151</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>ABC1 promotes engulfment of apoptotic cells and transbilayer redistribution of phosphatidylserine.</p>
            </title>
            <aug>
               <au>
                  <snm>Hamon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Broccardo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Chambenoit</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Luciani</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Toti</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Chaslin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Freyssinet</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Devaux</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>McNeish</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marguet</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>2</volume>
            <fpage>399</fpage>
            <lpage>406</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35017029</pubid>
                  <pubid idtype="pmpid" link="fulltext">10878804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The ABC transporter gene family of <it>Caenorhabditis elegans </it>has implications for the evolutionary dynamics of multidrug resistance in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Sheps</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Ralph</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>ZY</fnm>
               </au>
               <au>
                  <snm>Baillie</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Ling</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395765</pubid>
                  <pubid idtype="pmpid" link="fulltext">15003118</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-3-r15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>{alpha} 1-Syntrophin modulates turnover of ABCA1.</p>
            </title>
            <aug>
               <au>
                  <snm>Munehira</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ohnishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kawamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Furuya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shitara</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Imamura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yokota</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takeda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Amachi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Matsuo</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <fpage>15091</fpage>
            <lpage>15095</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1074/jbc.M313436200</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The human CENP-A centromeric nucleosome-associated complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Foltz</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Jansen</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Yates</snm>
                  <fnm>JR</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Cleveland</snm>
                  <fnm>DW</fnm>
               </au>
            </aug>
            <source>Nat Cell Biol</source>
            <pubdate>2006</pubdate>
            <volume>8</volume>
            <fpage>458</fpage>
            <lpage>469</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ncb1397</pubid>
                  <pubid idtype="pmpid" link="fulltext">16622419</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p><it>Drosophila </it>innate immunity: an evolutionary perspective.</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Reichhart</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nat Immunol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>121</fpage>
            <lpage>126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ni0202-121</pubid>
                  <pubid idtype="pmpid" link="fulltext">11812988</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Fbx7 functions in the SCF complex regulating Cdk1-cyclin B-phosphorylated hepatoma up-regulated protein (HURP) proteolysis by a proline-rich region.</p>
            </title>
            <aug>
               <au>
                  <snm>Hsu</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>CY</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <fpage>32592</fpage>
            <lpage>32602</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1074/jbc.M404950200</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Transforming activity of Fbxo7 is mediated specifically through regulation of cyclin D/cdk6.</p>
            </title>
            <aug>
               <au>
                  <snm>Laman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Funes</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Galinanes-Garcia</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hara</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Knowles</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Boshoff</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2005</pubdate>
            <volume>24</volume>
            <fpage>3104</fpage>
            <lpage>3116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1201355</pubid>
                  <pubid idtype="pmpid" link="fulltext">16096642</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600775</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The <it>Drosophila </it>cyclin D-cdk4 complex promotes cellular growth.</p>
            </title>
            <aug>
               <au>
                  <snm>Datar</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>de la Cruz</snm>
                  <fnm>AFA</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2000</pubdate>
            <volume>19</volume>
            <fpage>4543</fpage>
            <lpage>4554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">302080</pubid>
                  <pubid idtype="pmpid" link="fulltext">10970848</pubid>
                  <pubid idtype="doi">10.1093/emboj/19.17.4543</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p><it>Drosophila </it>Cdk4 is required for normal growth and is dispensable for cell cycle progression.</p>
            </title>
            <aug>
               <au>
                  <snm>Meyer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Datar</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2000</pubdate>
            <volume>19</volume>
            <fpage>4533</fpage>
            <lpage>4542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">302073</pubid>
                  <pubid idtype="pmpid" link="fulltext">10970847</pubid>
                  <pubid idtype="doi">10.1093/emboj/19.17.4533</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Burt</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Groenen</snm>
                  <fnm>MAM</fnm>
               </au>
               <au>
                  <snm>Delany</snm>
                  <fnm>ME</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>432</volume>
            <fpage>695</fpage>
            <lpage>716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03154</pubid>
                  <pubid idtype="pmpid" link="fulltext">15592404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Systematic functional analysis of the <it>Caenorhabditis elegans </it>genome using RNAi.</p>
            </title>
            <aug>
               <au>
                  <snm>Kamath</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gotta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Le Bot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sohrmann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>231</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01278</pubid>
                  <pubid idtype="pmpid" link="fulltext">12529635</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Role of duplicate genes in genetic robustness against null mutations.</p>
            </title>
            <aug>
               <au>
                  <snm>Gu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Scharfe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>63</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01198</pubid>
                  <pubid idtype="pmpid" link="fulltext">12511954</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Selective constraint in protein polymorphism: study of the effectively neutral mutation model by using an improved pseudosampling method.</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Takahata</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA