<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-3-r36</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>New genes in the evolution of the neural crest differentiation program</p>
         </title>
         <aug>
            <au ca="yes" id="A1" ce="yes">
               <snm>Martinez-Morales</snm>
               <fnm>Juan-Ramon</fnm>
               <insr iid="I1"/>
               <email>Juan.Martinez@EMBL.de</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Henrich</snm>
               <fnm>Thorsten</fnm>
               <insr iid="I1"/>
               <email>Henrich@EMBL.de</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Ramialison</snm>
               <fnm>Mirana</fnm>
               <insr iid="I1"/>
               <email>mirana.ramialison@EMBL.de</email>
            </au>
            <au ca="yes" id="A4">
               <snm>Wittbrodt</snm>
               <fnm>Joachim</fnm>
               <email>Jochen.Wittbrodt@EMBL.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Developmental Biology Unit, EMBL, Meyerhofstra&#223;e, 69117 Heidelberg, Germany</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>3</issue>
         <fpage>R36</fpage>
         <url>http://genomebiology.com/2007/8/3/R36</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17352807</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-3-r36</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>4</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>12</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>03</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Martinez-Morales et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Gene emergence in neural crest evolution</p>
      </shorttitle>
      <shortabs>
         <p>The phylogenetic classification of genes that are ontologically associated with neural crest development reveals that neural crest evolution is associated with the emergence of new signalling peptides.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Development of the vertebrate head depends on the multipotency and migratory behavior of neural crest derivatives. This cell population is considered a vertebrate innovation and, accordingly, chordate ancestors lacked neural crest counterparts. The identification of neural crest specification genes expressed in the neural plate of basal chordates, in addition to the discovery of pigmented migratory cells in ascidians, has challenged this hypothesis. These new findings revive the debate on what is new and what is ancient in the genetic program that controls neural crest formation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>To determine the origin of neural crest genes, we analyzed Phenotype Ontology annotations to select genes that control the development of this tissue. Using a sequential blast pipeline, we phylogenetically classified these genes, as well as those associated with other tissues, in order to define tissue-specific profiles of gene emergence. Of neural crest genes, 9% are vertebrate innovations. Our comparative analyses show that, among different tissues, the neural crest exhibits a particularly high rate of gene emergence during vertebrate evolution. A remarkable proportion of the new neural crest genes encode soluble ligands that control neural crest precursor specification into each cell lineage, including pigmented, neural, glial, and skeletal derivatives.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We propose that the evolution of the neural crest is linked not only to the recruitment of ancestral regulatory genes but also to the emergence of signaling peptides that control the increasingly complex lineage diversification of this plastic cell population.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010005">Development</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010017">Neurobiology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>As first proposed by Gans and Northcutt <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>, the major evolutionary innovation of the vertebrate body plan relies on elaboration of a new head at the anterior end of an ancestral chordate trunk. The three existing groups of the phylum Chordata, namely urochordates (ascidians), cephalochordates (amphioxus), and craniates (including vertebrates and agnates), share many characteristics. These include a notochord, segmented trunk muscles, and a dorsal nerve cord. Molecular data have further confirmed these anatomic descriptions, revealing a conserved patterning mechanism along the anterior-posterior and dorso-ventral axes of the neural tube <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Resting on this archetypal chordate body plan, unique populations of cells, the neural crest and the ectodermal placodes, evolved in craniates (referred to here as 'vertebrates' for simplicity). The emergence of these pluripotent cells is linked to the evolution of more sophisticated sensory and predatory organs (for instance, jaws). These new organs, in conjunction with an increasingly complex brain, allowed the shift from a filter-feeding style of life toward active predatory strategies <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>The neural crest is a transient population of embryonic cells that originate at the boundary between neural plate and dorsal ectoderm. Secreted from neighboring tissues, signaling molecules of the Wnt, Fgf, and Bmp families cooperate to activate a distinct combination of transcription factors at the neural plate border. Among those are members of the Pax, Zic, Snail, Sox, and Msx families, which constitute the neural crest specification network <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Shortly after their dorsal specification, neural crest cells undergo an epithelial-to-mesenchymal transition, migrate, and finally, upon arrival at their destination, they give rise to a variety of cell types. These include peripheral neurons, glial and Schwann cells, pigment cells, endocrine cells, cartilage, and bone <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. This large diversity of derivatives arises through a complex mechanism of lineage restriction, which operates both early, on the pluripotent precursors at the dorsal neural tube <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and later, during the migration and differentiation of precursors already committed to different degrees <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Environmental cues found throughout neural crest migratory routes play a fundamental role not only in instructing the precursor's differentiation into particular phenotypes, but also in controlling their proliferation and survival <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Among these extracellular cues, classical signaling molecules such as Fgfs, Wnts, Bmps and transforming growth factor (TGF)-&#946;s, in conjunction with locally produced cytokines such as neurotropins, endothelins, glial-derived neurotropic factor (GDNF), neuregulin and cKit, have been shown to influence precursor fate and survival <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>The neural crest has traditionally been considered the key structure acquired very early by craniate pioneers. The presence of cartilage first and biomineralized material later in the head of the earliest craniate fossils supports this view <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. Because of their particular nature, the evolution of cartilage and bone elements can easily be traced in the large collection of Cambrian fossils. Many fossil fish exhibit neural crest derived exoskeletal coverings of dermal bone that extend partially over the trunk, with no trace of mesenchymal endoskeleton <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. These paleontologic records indicate that in early vertebrates cartilage and bones arose first in the context of the cephalic neural crest, and that only later was this genetic program co-opted by the para-axial sclerotome <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         <p>The existence of an ancestral population of cells in early chordates that give rise to vertebrate neural crest on the one hand and to basal chordate dorsal derivatives on the other has been proposed several times <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. This hypothesis is supported by the conservation of many components of the neural crest specification network in chordates <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Furthermore, migratory cells that express neural crest markers and differentiate as pigmented cells have recently been identified in the urochordate <it>Ecteinascidia turbinate </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. These data reinforce the hypothesis of pan-chordate 'precursors' behaving similarly and expressing a set of genes homologous to the modern neural crest. According to this view, the innovative drive impelling neural crest evolution stems from the evolution of their <it>cis</it>-regulatory elements - a process facilitated by the ancestral duplication of the vertebrate genome. The duplication of key developmental genes would have released enough evolutionary pressure to facilitate their divergence and hence the evolution of new functions <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Although the existence of pan-chordate 'precursors' offers a satisfactory answer to the evolutionary origin of the neural crest, it fails to account for the acquisition of fundamental properties of this tissue. These include the pluripotency of the neural crest precursors that now give rise to novel cell types that are present neither in basal chordates nor in other metazoans.</p>
         <p>To gain insight into the origin and evolution of neural crest properties, we have chosen a bioinformatics approach to analyze the phylogeny of tissue-specific developmental programs in a systematic manner. Our analytical tool takes advantage of an extensive collection of mouse genes annotated through Mammalian Phenotype Ontology terms <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> (at Mouse Genome Informatics [MGI] <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>). According to their related mouse mutant phenotype annotations, we grouped genes into tissue-specific genetic programs. We then explored the phylogeny of each program using a sequential blast pipeline. We defined as 'new genes' those encoding proteins that did not exhibit any significant homology in previous phylogenetic categories, either because they are extremely divergent or because they have evolved <it>de novo</it>. For each group, the total number of new genes at each branch of the evolutionary tree was analyzed. These graphical representations (gene emergence plots) are characteristic for each tissue/organ. They show how the rate of gene innovation has changed during the evolution of a particular tissue. These data substantiate the traditional concept that neural crest is a vertebrate innovation. In addition, our systematic analysis demonstrates that neural crest evolution builds not only on the rewiring of gene networks but also on the emergence of new genes. Gene Ontology (GO) analysis of the group of new neural crest components revealed remarkable enrichment in extracellular ligands. Half of the vertebrate new genes encode secreted cytokines that are known to control the specification and survival of the different neural crest derivatives, including pigment cells, neurons, glial cells, and skeletal components. Here we propose that the emergence of these novel ligands is associated with the evolutionary transition of a relatively simple cell population, in the dorsal neural tube of ancestral chordates, toward the lineage complexity of the vertebrate neural crest.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>How animal body plans are modified in relation to the evolution of their genome is an intricate issue. Acquisition of novel properties in a particular cell type, or even innovative changes in tissues and organs, can very often be attributed to modifications in the wiring of pre-existing gene networks <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. However, a fundamental process in genome evolution is also the emergence of new genes. Several molecular mechanisms, including exon shuffling, gene duplication and fusion, transposition, fast sequence divergence, and entire <it>de novo </it>origin, have been proposed to serve as sources for gene innovation <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. In this work we explore the phylogeny of the genes that are involved in neural crest development to gain insight into the evolution of neural crest properties. We aimed to determine which components of the vertebrate neural crest gene program are ancient, and hence have been recruited to perform a function in this tissue, and which components evolved only recently.</p>
         <sec>
            <st>
               <p>Determining the origin of vertebrate proteins through a sequential blast pipeline</p>
            </st>
            <p>As a first step in determining when neural crest genes evolved, we filtered mouse proteins through a sequential blast pipeline. All 23,658 known mouse protein sequences (EnsEMBL v31) were consecutively blasted against available genomes grouped into seven different evolutionary categories (prokaryota, eukaryota, metazoa, deuterostomia, chordata, vertebrata, and mammalia) using a relaxed threshold of E = 10<sup>-4</sup>, as established in similar studies <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Proteins exhibiting homology when blasted against the prokaryotic genomes were classified as ancient. The remaining genes were subsequently blasted against eukaryotic genomes and the procedure was repeated until all genes were classified (Figure <figr fid="F1">1a</figr>). According to our definition, 'new genes' in each category are those encoding proteins that did not exhibit any significant homology in previous categories, either because they have diverged extensively from a former protein or because they have evolved <it>de novo</it>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Gene phylogeny was explored using a sequential blast pipeline</p>
               </caption>
               <text>
                  <p>Gene phylogeny was explored using a sequential blast pipeline. <b>(a) </b>All known mouse proteins were sequentially blasted (cutoff value E = 10<sup>-4</sup>) against available databases and then classified according to their appearance into seven different categories: prokaryota (pro), eukaryota (euk), metazoa (met), deuterostomia (deu), chordata (cor), vertebrata (ver), and mammalia (mam). <b>(b) </b>The table shows the number of mouse genes assigned to each category compared with their estimated age in millions of years. <b>(c) </b>Graphical representation of the global gene phylogeny.</p>
               </text>
               <graphic file="gb-2007-8-3-r36-1"/>
            </fig>
            <p>A direct comparison of the percentage of genes appearing in each category with an estimation of their respective age in millions of years <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> indicated that the frequency of gene emergence is higher for late categories (specifically, metazoans to mammals; Figure <figr fid="F1">1b,c</figr>). This higher frequency of innovation correlates with the reported observation that the rate of evolution for proteins (calculated as the ratio between nonsynonymous and synonymous amino acid substitutions) is also higher for more recent categories <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
            <p>To elucidate whether 'new proteins', because of their divergent amino acid sequences, correlate with the emergence of novel molecular functions, we performed a GO analysis <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. For each evolutionary category we identified the GO terms that are statistically over-represented compared with all of the known mouse proteins. The 10 most significantly over-represented GO terms for each of the seven different categories are listed in Table <tblr tid="T1">1</tblr> (also see Additional data file 1 for a full list of over-represented GO terms). Our analysis shows that, within a large evolutionary window, innovations are associated with the emergence of 'new genes'. Although the first category, prokaryota, is enriched in genes that are involved in general cell metabolism, GO terms of genes appearing first in eukaryotes demonstrate their function in the newly evolved subcellular organelles. In metazoans we find the GO terms 'cell communication', 'signal transduction', and 'receptor activity' to be highly over-represented, which is in accordance with a <it>de novo </it>requirement for cell-cell communication and tissue subspecialization in the context of multicellularity. Interestingly, the collection of genes appearing first in vertebrates and mammals is enriched in terms such as 'hormone activity', 'receptor binding', 'extracellular space', and 'cytokine response', suggesting that diversification of receptor ligands is linked to vertebrate evolution. In summary, our sequential blast pipeline reliably classifies genes according to their first appearance within the phylogenetic tree.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Frequency of GO terms for each group of 'new genes'</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>GO ID</p>
                     </c>
                     <c ca="left">
                        <p>GO term</p>
                     </c>
                     <c ca="left">
                        <p>Count sample</p>
                     </c>
                     <c ca="left">
                        <p>Count total</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>P</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prokaryota</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0050875</p>
                     </c>
                     <c ca="left">
                        <p>Cellular physiological process</p>
                     </c>
                     <c ca="left">
                        <p>3,219</p>
                     </c>
                     <c ca="left">
                        <p>8,198</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0008152</p>
                     </c>
                     <c ca="left">
                        <p>Metabolism</p>
                     </c>
                     <c ca="left">
                        <p>2,576</p>
                     </c>
                     <c ca="left">
                        <p>5,906</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0044237</p>
                     </c>
                     <c ca="left">
                        <p>Cellular metabolism</p>
                     </c>
                     <c ca="left">
                        <p>2,369</p>
                     </c>
                     <c ca="left">
                        <p>5,566</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0044238</p>
                     </c>
                     <c ca="left">
                        <p>Primary metabolism</p>
                     </c>
                     <c ca="left">
                        <p>2,192</p>
                     </c>
                     <c ca="left">
                        <p>5,312</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043170</p>
                     </c>
                     <c ca="left">
                        <p>Macromolecule metabolism</p>
                     </c>
                     <c ca="left">
                        <p>1,569</p>
                     </c>
                     <c ca="left">
                        <p>3,298</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0044260</p>
                     </c>
                     <c ca="left">
                        <p>Cellular macromolecule metabolism</p>
                     </c>
                     <c ca="left">
                        <p>1,158</p>
                     </c>
                     <c ca="left">
                        <p>2,500</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0019538</p>
                     </c>
                     <c ca="left">
                        <p>Protein metabolism</p>
                     </c>
                     <c ca="left">
                        <p>1,149</p>
                     </c>
                     <c ca="left">
                        <p>2,486</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0044267</p>
                     </c>
                     <c ca="left">
                        <p>Cellular protein metabolism</p>
                     </c>
                     <c ca="left">
                        <p>1,138</p>
                     </c>
                     <c ca="left">
                        <p>2,469</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0000166</p>
                     </c>
                     <c ca="left">
                        <p>Nucleotide binding</p>
                     </c>
                     <c ca="left">
                        <p>1,070</p>
                     </c>
                     <c ca="left">
                        <p>1,577</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0016787</p>
                     </c>
                     <c ca="left">
                        <p>Hydrolase activity</p>
                     </c>
                     <c ca="left">
                        <p>1,037</p>
                     </c>
                     <c ca="left">
                        <p>1,876</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Eukaryota</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005622</p>
                     </c>
                     <c ca="left">
                        <p>Intracellular</p>
                     </c>
                     <c ca="left">
                        <p>1,820</p>
                     </c>
                     <c ca="left">
                        <p>6,664</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043226</p>
                     </c>
                     <c ca="left">
                        <p>Organelle</p>
                     </c>
                     <c ca="left">
                        <p>1,587</p>
                     </c>
                     <c ca="left">
                        <p>5,789</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043229</p>
                     </c>
                     <c ca="left">
                        <p>Intracellular organelle</p>
                     </c>
                     <c ca="left">
                        <p>1,586</p>
                     </c>
                     <c ca="left">
                        <p>5,785</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043227</p>
                     </c>
                     <c ca="left">
                        <p>Membrane-bound organelle</p>
                     </c>
                     <c ca="left">
                        <p>1,419</p>
                     </c>
                     <c ca="left">
                        <p>5,097</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043231</p>
                     </c>
                     <c ca="left">
                        <p>Intracellular membrane-bound organelle</p>
                     </c>
                     <c ca="left">
                        <p>1,417</p>
                     </c>
                     <c ca="left">
                        <p>5,092</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005634</p>
                     </c>
                     <c ca="left">
                        <p>Nucleus</p>
                     </c>
                     <c ca="left">
                        <p>1,054</p>
                     </c>
                     <c ca="left">
                        <p>3,267</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0046914</p>
                     </c>
                     <c ca="left">
                        <p>Transition metal ion binding</p>
                     </c>
                     <c ca="left">
                        <p>644</p>
                     </c>
                     <c ca="left">
                        <p>1,791</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0008270</p>
                     </c>
                     <c ca="left">
                        <p>Zinc ion binding</p>
                     </c>
                     <c ca="left">
                        <p>619</p>
                     </c>
                     <c ca="left">
                        <p>1,416</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004888</p>
                     </c>
                     <c ca="left">
                        <p>Transmembrane receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>2,007</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043169</p>
                     </c>
                     <c ca="left">
                        <p>Cation binding</p>
                     </c>
                     <c ca="left">
                        <p>799</p>
                     </c>
                     <c ca="left">
                        <p>2,589</p>
                     </c>
                     <c ca="left">
                        <p>3.45 &#215; e<sup>-85</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Metazoa</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0016020</p>
                     </c>
                     <c ca="left">
                        <p>Membrane</p>
                     </c>
                     <c ca="left">
                        <p>1,768</p>
                     </c>
                     <c ca="left">
                        <p>6,163</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0031224</p>
                     </c>
                     <c ca="left">
                        <p>Intrinsic to membrane</p>
                     </c>
                     <c ca="left">
                        <p>1,524</p>
                     </c>
                     <c ca="left">
                        <p>4,932</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0016021</p>
                     </c>
                     <c ca="left">
                        <p>Integral to membrane</p>
                     </c>
                     <c ca="left">
                        <p>1,523</p>
                     </c>
                     <c ca="left">
                        <p>4,930</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0007154</p>
                     </c>
                     <c ca="left">
                        <p>Cell communication</p>
                     </c>
                     <c ca="left">
                        <p>1,234</p>
                     </c>
                     <c ca="left">
                        <p>3,201</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0007165</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>1,211</p>
                     </c>
                     <c ca="left">
                        <p>3,059</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004872</p>
                     </c>
                     <c ca="left">
                        <p>Receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>1,143</p>
                     </c>
                     <c ca="left">
                        <p>2,793</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0007166</p>
                     </c>
                     <c ca="left">
                        <p>Cell surface receptor linked signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>1,061</p>
                     </c>
                     <c ca="left">
                        <p>2,253</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004888</p>
                     </c>
                     <c ca="left">
                        <p>Transmembrane receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>926</p>
                     </c>
                     <c ca="left">
                        <p>2,007</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0007186</p>
                     </c>
                     <c ca="left">
                        <p>G-protein coupled receptor protein signaling pathway</p>
                     </c>
                     <c ca="left">
                        <p>906</p>
                     </c>
                     <c ca="left">
                        <p>1,763</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004930</p>
                     </c>
                     <c ca="left">
                        <p>G-protein coupled receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>870</p>
                     </c>
                     <c ca="left">
                        <p>1,693</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deuterostomia</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004931</p>
                     </c>
                     <c ca="left">
                        <p>ATP-gated cation channel activity</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>4.74 &#215; e<sup>-05</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0009607</p>
                     </c>
                     <c ca="left">
                        <p>Response to biotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>45</p>
                     </c>
                     <c ca="left">
                        <p>979</p>
                     </c>
                     <c ca="left">
                        <p>0.00100739</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0006952</p>
                     </c>
                     <c ca="left">
                        <p>Defense response</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>950</p>
                     </c>
                     <c ca="left">
                        <p>0.00100739</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0004800</p>
                     </c>
                     <c ca="left">
                        <p>Thyroxine 5'-deiodinase activity</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0.002093473</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0030106</p>
                     </c>
                     <c ca="left">
                        <p>MHC class I receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>0.002209497</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0006955</p>
                     </c>
                     <c ca="left">
                        <p>Immune response</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>736</p>
                     </c>
                     <c ca="left">
                        <p>0.002495027</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0030178</p>
                     </c>
                     <c ca="left">
                        <p>Negative regulation of Wnt receptor signaling pathway</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>0.003585659</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0042981</p>
                     </c>
                     <c ca="left">
                        <p>Regulation of apoptosis</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>246</p>
                     </c>
                     <c ca="left">
                        <p>0.003971402</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0008430</p>
                     </c>
                     <c ca="left">
                        <p>Selenium binding</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>0.004113225</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0008517</p>
                     </c>
                     <c ca="left">
                        <p>Folic acid transporter activity</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0.004113225</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chordata</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005911</p>
                     </c>
                     <c ca="left">
                        <p>Intercellular junction</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>131</p>
                     </c>
                     <c ca="left">
                        <p>5.96 &#215; e<sup>-33</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005921</p>
                     </c>
                     <c ca="left">
                        <p>Gap junction</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>1.97 &#215; e<sup>-29</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0030054</p>
                     </c>
                     <c ca="left">
                        <p>Cell junction</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>164</p>
                     </c>
                     <c ca="left">
                        <p>2.28 &#215; e<sup>-29</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005922</p>
                     </c>
                     <c ca="left">
                        <p>Connexon complex</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>2.57 &#215; e<sup>-27</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005243</p>
                     </c>
                     <c ca="left">
                        <p>Gap-junction forming channel activity</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>2.57 &#215; e<sup>-27</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0015285</p>
                     </c>
                     <c ca="left">
                        <p>Connexon channel activity</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>2.57 &#215; e<sup>-27</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005923</p>
                     </c>
                     <c ca="left">
                        <p>Tight junction</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>2.44 &#215; e<sup>-14</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0016327</p>
                     </c>
                     <c ca="left">
                        <p>Apicolateral plasma membrane</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>76</p>
                     </c>
                     <c ca="left">
                        <p>1.45 &#215; e<sup>-12</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0043296</p>
                     </c>
                     <c ca="left">
                        <p>Apical junction complex</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>76</p>
                     </c>
                     <c ca="left">
                        <p>1.45 &#215; e<sup>-12</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005615</p>
                     </c>
                     <c ca="left">
                        <p>Extracellular space</p>
                     </c>
                     <c ca="left">
                        <p>74</p>
                     </c>
                     <c ca="left">
                        <p>2,021</p>
                     </c>
                     <c ca="left">
                        <p>7.43 &#215; e<sup>-10</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Vertebrata</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005102</p>
                     </c>
                     <c ca="left">
                        <p>Receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>130</p>
                     </c>
                     <c ca="left">
                        <p>507</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0016503</p>
                     </c>
                     <c ca="left">
                        <p>Pheromone receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>59</p>
                     </c>
                     <c ca="left">
                        <p>111</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005179</p>
                     </c>
                     <c ca="left">
                        <p>Hormone activity</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>115</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0042221</p>
                     </c>
                     <c ca="left">
                        <p>Response to chemical stimulus</p>
                     </c>
                     <c ca="left">
                        <p>90</p>
                     </c>
                     <c ca="left">
                        <p>329</p>
                     </c>
                     <c ca="left">
                        <p>9.81 &#215; e<sup>-79</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0009628</p>
                     </c>
                     <c ca="left">
                        <p>Response to abiotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>92</p>
                     </c>
                     <c ca="left">
                        <p>414</p>
                     </c>
                     <c ca="left">
                        <p>2.94 &#215; e<sup>-59</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005615</p>
                     </c>
                     <c ca="left">
                        <p>Extracellular space</p>
                     </c>
                     <c ca="left">
                        <p>230</p>
                     </c>
                     <c ca="left">
                        <p>2,021</p>
                     </c>
                     <c ca="left">
                        <p>1.24 &#215; e<sup>-45</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005550</p>
                     </c>
                     <c ca="left">
                        <p>Pheromone binding</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="left">
                        <p>94</p>
                     </c>
                     <c ca="left">
                        <p>1.49 &#215; e<sup>-38</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005125</p>
                     </c>
                     <c ca="left">
                        <p>Cytokine activity</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                     <c ca="left">
                        <p>212</p>
                     </c>
                     <c ca="left">
                        <p>5.02 &#215; e<sup>-38</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005549</p>
                     </c>
                     <c ca="left">
                        <p>Odorant binding</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="left">
                        <p>99</p>
                     </c>
                     <c ca="left">
                        <p>3.45 &#215; e<sup>-37</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0001664</p>
                     </c>
                     <c ca="left">
                        <p>G-protein-coupled receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>47</p>
                     </c>
                     <c ca="left">
                        <p>3.23 &#215; e<sup>-36</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mammalia</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005615</p>
                     </c>
                     <c ca="left">
                        <p>Extracellular space</p>
                     </c>
                     <c ca="left">
                        <p>198</p>
                     </c>
                     <c ca="left">
                        <p>2,021</p>
                     </c>
                     <c ca="left">
                        <p>6.14 &#215; e<sup>-53</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005102</p>
                     </c>
                     <c ca="left">
                        <p>Receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>80</p>
                     </c>
                     <c ca="left">
                        <p>507</p>
                     </c>
                     <c ca="left">
                        <p>1.79 &#215; e<sup>-46</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005125</p>
                     </c>
                     <c ca="left">
                        <p>Cytokine activity</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>212</p>
                     </c>
                     <c ca="left">
                        <p>1.79 &#215; e<sup>-46</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0009607</p>
                     </c>
                     <c ca="left">
                        <p>Response to biotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>104</p>
                     </c>
                     <c ca="left">
                        <p>979</p>
                     </c>
                     <c ca="left">
                        <p>1.03 &#215; e<sup>-30</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0006952</p>
                     </c>
                     <c ca="left">
                        <p>Defense response</p>
                     </c>
                     <c ca="left">
                        <p>102</p>
                     </c>
                     <c ca="left">
                        <p>950</p>
                     </c>
                     <c ca="left">
                        <p>1.03 &#215; e<sup>-30</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0042742</p>
                     </c>
                     <c ca="left">
                        <p>Defense response to bacteria</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>70</p>
                     </c>
                     <c ca="left">
                        <p>2.51 &#215; e<sup>-28</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0009617</p>
                     </c>
                     <c ca="left">
                        <p>Response to bacteria</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>78</p>
                     </c>
                     <c ca="left">
                        <p>2.22 &#215; e<sup>-26</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0005126</p>
                     </c>
                     <c ca="left">
                        <p>Hematopoietin/interferon-class (D200-domain) cytokine receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>6.10 &#215; e<sup>-19</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0008083</p>
                     </c>
                     <c ca="left">
                        <p>Growth factor activity</p>
                     </c>
                     <c ca="left">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>141</p>
                     </c>
                     <c ca="left">
                        <p>2.98 &#215; e<sup>-18</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GO:0051707</p>
                     </c>
                     <c ca="left">
                        <p>Response to other organism</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>594</p>
                     </c>
                     <c ca="left">
                        <p>1.67 &#215; e<sup>-15</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The table summarizes the 10 most statistically overrepresented Gene Ontology (GO) annotations for genes belonging to each of the seven categories. We only considered GO terms for which <it>P </it>> 0.001 and count sample was above 15.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Assignment of neural crest genes based on phenotypic data</p>
            </st>
            <p>In order to investigate when neural crest genes arose during evolution, it was necessary to build a comprehensive list of genes involved in the development of this tissue. A large number of studies, in particular the phenotypic analysis of mutations in mice, generated by either mutagenesis or genetic engineering, have led to the identification of many genes that are involved in neural crest development <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The Mammalian Phenotype Browser, at MGI <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, provides a comprehensive resource of phenotypic information derived from mouse mutant studies <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Because phenotypic analysis annotations offer the most reliable read out of gene function, we took advantage of this large collection of mouse mutants in our study. The collection includes more than 14,000 genotype records associated with a total of 6,442 genes (27% of the total mouse transcriptome), and furthermore it includes the majority of the genes demonstrated to play a <it>bona fide </it>role in neural crest development. In the MGI database each mutation is annotated by a controlled vocabulary of phenotypic terms that describe the effect of a genetic variation on different tissues, organs, or systems. We selected the Mammalian Phenotype Ontology for terms associated with mutations affecting both neural crest precursors and its derivative cell types and tissues.</p>
            <p>At the Mammalian Phenotype Browser the ontology term 'abnormal neural crest cells' (MP:0002949:) is reserved for phenotypes that affect the early migration of neural crest cells. Because of this stringent definition, only eight genes are included in this definition. However, when we took phenotypes associated with the development of neural crest derivatives into account, we retrieved a comprehensive list of 615 genes. In our analysis we considered three main groups of neural crest derivatives: pigmented cells, skeletal components, and elements of the peripheral nervous system. The 'pigmentation derivatives phenotype' is completely covered by a single term, namely 'pigmentation phenotype' (MP:0001186). The 'bone derivatives phenotype' terms consist of 'craniofacial phenotype' (MP:0005382) and 'skeleton phenotype' (MP:0005390). At this point, it could be argued that vertebrate neural crest cells only give rise to cranial skeleton and teeth, whereas the axial skeleton has a mesodermal origin. As already mentioned, however, paleontologic records indicate that skeletal elements evolved within the context of the neural crest and only later was this genetic program co-opted by the sclerotome <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The 'peripheral nervous system derivatives phenotype' consists of 'abnormal autonomic nervous system morphology' (MP:0002751), 'abnormal peripheral nervous system glia' (MP:0001105), 'abnormal somatic sensory system morphology' (MP:0000959), and 'peripheral nervous system degeneration' (MP:0000958). We grouped these three categories under the general term 'neural crest derivatives phenotype'.</p>
         </sec>
         <sec>
            <st>
               <p>Determining the origin of the neural crest gene set: gene emergence rate plots</p>
            </st>
            <p>The sequential blast pipeline provides a list of genes that emerge along the evolutionary tree in each of the seven defined categories, whereas the phenotypic annotation provides a functional link for each of these genes. Combining both, we determined in which category each of the 615 neural crest genes emerged (see Additional data file 2 for the full dataset). Previous studies had promoted the idea that gene co-option was the driving force for neural crest invention <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Our data strongly support this view because the majority (91%) of genes involved in neural crest development was already present in basal metazoans or even before. Thus, key transcription factors acting as both 'neural plate border specifiers' (such as Pax3, Dlx5, Zic, and Msx1/2) and 'neural crest specifiers' (such as FoxD, Snail/Slug, Sox9/10, Twist, and AP-2) can be traced back to our category 'metazoans' or 'eukaryotes'. Similarly, the Fgf, Wnt, and Bmp signaling pathways involved in induction of the neural plate border are ancestral. Although their corresponding ligands can be traced back to basal metazoans, the kinase activity of their receptors was already present in prokaryotes. Altogether, these data confirm the idea that gene recruitment played an important role during neural crest evolution.</p>
            <p>However, we found that a substantial percentage of the genes (9%, listed in Table <tblr tid="T2">2</tblr>) involved in neural crest development evolved in deuterostomes during the past 550 million years. To determine, within this evolutionary window, how the rate of gene emergence in the neural crest relates to the rate of innovation in other tissues, we plotted the cumulative number of genes appearing in each category. In these graphs, the tissue-specific evolutionary profile of gene emergence is depicted (Figure <figr fid="F2">2</figr>). In order to quantify the profile of the graphs we calculated 'gene emergence rate' (ger) values, as a numeric representation of the gene innovation rate from an earlier category to a later one (see Materials and methods for a description of the formula). A ger value of 1 indicates a constant profile of gene innovation. Higher ger values indicate increased appearance of new genes in a particular tissue.</p>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Neural crest genes compiled using Phenotype Ontology annotations (phenotypic information derived from mutant mice studies)</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Group</p>
                     </c>
                     <c ca="left">
                        <p>Gene</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deuterostomia</p>
                     </c>
                     <c ca="left">
                        <p>Brain derived neurotrophic factor</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Fanconi anemia, complementation group A</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Fos-like antigen 2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Neurotropin 3</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Noggin</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Purinergic receptor P2X, ligand-gated ion channel, 7</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Rod outer segment membrane protein 1</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Vertebrata</p>
                     </c>
                     <c ca="left">
                        <p>BCL2-like 11 (apoptosis facilitator)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Calcitonin/calcitonin-related polypeptide, alpha</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cocaine and amphetamine regulated transcript</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Endothelin 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Endothelin 3</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Formin 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Glial cell line derived neurotrophic factor</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Gonadotropin releasing hormone 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Hermansky-Pudlak syndrome 6</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Integrin, alpha 10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Islet amyloid polypeptide</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Leukocyte cell derived chemotaxin 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Matrix Gla protein</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Melanoma inhibitory activity 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Myelin protein zero</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Natriuretic peptide precursor type C</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Neuregulin 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Neurturin</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Parathyroid hormone</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Parathyroid hormone-like peptide</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Phosphodiesterase 6G, cGMP-specific, rod, gamma</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Pro-opiomelanocortin-alpha</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Silver</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Tenomodulin</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Treacher Collins Franceschetti syndrome 1, homolog</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chordata</p>
                     </c>
                     <c ca="left">
                        <p>Activating transcription factor 4</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cbp/p300-interacting transactivator, with Glu/Asp-rich carboxy-terminal domain, 2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Claudin 14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Epilepsy, progressive myoclonic epilepsy, type 2 gene alpha</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Fos-like antigen 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Gap junction membrane channel protein beta 6</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Hyaluronan and proteoglycan link protein 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Transforming growth factor, beta receptor III</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mammalia</p>
                     </c>
                     <c ca="left">
                        <p>Adrenocortical dysplasia</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Ameloblastin</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Amelogenin X chromosome</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>BH3 interacting domain death agonist</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Colony stimulating factor 2 (granulocyte-macrophage)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Harakiri, BCL2 interacting protein (contains only BH3 domain)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Kit ligand</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Leptin</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Matrix extracellular phosphoglycoprotein with ASARM motif (bone)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>MyoD family inhibitor</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Nonagouti</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Oncostatin M</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Programmed cell death 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>TYRO protein tyrosine kinase binding protein</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The first appearance of neural crest genes was then determined using the sequential blast pipeline (Figure 1). The table contains the complete name of neural crest genes emerging in deuterostomia, chordata, vertebrata and mammalia.</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Tissue-specific profiles of gene emergence</p>
               </caption>
               <text>
                  <p>Tissue-specific profiles of gene emergence. The accumulative number of emerging genes (y-axis) in the deuterostomia-mammalia evolutionary window (x-axis) is represented for different tissue-specific genetic programs. We termed these representations gene emergence plots. At the chordate-vertebrate transition the rate of gene emergence (ger) was estimated for the different genetic programs. <b>(a) </b>Using mouse phenotypic annotations we calculated ger values between chordata and vertebrata for each main phenotype structure in the database. Structures are highlighted from blue to yellow, according to decreasing values of ger. Neural crest derivative structures are present within the highest ger values (red box). <b>(b) </b>Plots of representative structures of each class of ger value: class I = ger > 3; class II = 3 > ger > 1.5; and class III: ger &lt; 1.5.</p>
               </text>
               <graphic file="gb-2007-8-3-r36-2"/>
            </fig>
            <p>For each of the tissue-specific gene programs studied, we ordered the ger values at the chordate-vertebrate transition (Figure <figr fid="F2">2a</figr>). Notably, tissues/systems ontogenetically derived from ventral mesoderm, and hence considered modern vertebrate innovations <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B17">17</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, such as the hematopoietic, immune, or renal/urinary system, exhibit graphs that peak at the chordate-vertebrate transition (Figure <figr fid="F2">2b</figr>). In contrast, other tissues already present in all chordates, namely the epidermis or endodermal derivatives such as liver, respiratory, and digestive systems, have a flat profile, with lower ger values (Figure <figr fid="F2">2b</figr>). Both the profile of the neural crest gene emergence plot (Figure <figr fid="F3">3</figr>) and its ger value (3.1) indicate that the neural crest is among the most innovative vertebrate tissues (Figure <figr fid="F2">2a</figr>). This concept can be extended to each individual neural crest lineage, in particular to pigmented or bone derivatives, as deduced from their respective gene emergence plots (Figure <figr fid="F3">3</figr>). Interestingly, compared with the other crest derivatives, the ger value of the gene set associated with the peripheral nervous system derivatives is lower (1.6). This may best be explained by co-option from the ancestral program of neural development. In summary, our gene emergence plots that reliably reflect evolutionary innovation highlight the novelty of neural crest as a tissue.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Gene emergence plots of neural crest derivatives</p>
               </caption>
               <text>
                  <p>Gene emergence plots of neural crest derivatives. Graphs and gene emergence rate (ger) values associated both with <b>(a) </b>the total collection of neural crest genes and <b>(b) </b>the different bone, nervous system, and pigmentation derivatives.</p>
               </text>
               <graphic file="gb-2007-8-3-r36-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Emergence of neural crest molecules defining novel cellular functions</p>
            </st>
            <p>The notion of neural crest as a tissue with a high rate of gene innovation apparently contradicts our finding that all known neural crest specifiers can be traced back at least to metazoans. To further address this point, we focused on the collection of neural crest 'new genes' to gain insight into their molecular nature and function.</p>
            <p>Neural crest has been postulated as a fourth germ layer <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. This concept builds on neural crest pluripotency and the fact that in vertebrates it gives rise to novel cell types such as the skeletal derivatives or the specialized melanocytes <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Consistently, in the collection of vertebrate/mammalian new genes, we found molecules defining the physiology of these novel cell types. This is the case for the genes <it>Ru </it>(Hermansky-Pudlak syndrome 6) and <it>silver</it>, which encode components of the specialized melanocyte lysosomes, the melanosomes. Similarly, several new genes encode extracellular proteins that constitute part of the bone matrix (for example, bone gla protein and the phosphoglycoprotein mepe) and enamel, the outermost covering of teeth and the hardest tissue in the body (for example, ameloblastin and amelogenin).</p>
         </sec>
         <sec>
            <st>
               <p>Emergence of ligands for neural crest lineage specification</p>
            </st>
            <p>Strikingly, 50% of neural crest genes appearing first in vertebrates encode extracellular ligands. This remarkable enrichment (confirmed by exploring GO term frequency; see Additional data file 3) is in accordance with our previous whole-transcriptome GO analysis (Table <tblr tid="T1">1</tblr>). It suggests that diversification of receptor ligands played an important role during vertebrate evolution in general and neural crest evolution in particular. Individual analysis of the function of these peptides during the development of the neural crest demonstrates that they control the commitment of precursors to the different lineages.</p>
            <p>Conserved signaling pathways have an early influence on the phenotypic diversification of premigratory neural crest cells <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Bmp2/4 can directly induce autonomic neurogenesis <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, while Wnt signaling participates in melanocyte specification <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Superimposed on this, a second network of 'modern' vertebrate specific cytokines, produced locally, acts not only in neural crest cell fate specification but also in the migratory behavior and survival of all neural crest lineages <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Melanocyte specification and survival depend on soluble proteins such as steel factor (kit ligand), endothelin-3, &#945;-melanocyte stimulating hormone, and nonagouti <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>; gliogenesis in the peripheral nervous system is controlled by neuregulins and endothelin-3 <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>; the development of autonomic and sensory neurons is controlled by neurothropins (brain-derived neurotropic factor, neurothropin-3, and neurothropin-4) and GDNF family members (GDNF and neurturin) <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>; and, finally, the differentiation of the skeletal lineage is specified by endothelin-1 <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Our sequential blast pipeline analysis shows that the vast majority (9/11) of the above-mentioned cell fate specification ligands emerged in vertebrates or, to a lesser extent (steel factor and nonagouti), in mammals.</p>
            <p>Interestingly, the blast pipeline uncovered a positive hit in the echinoderm <it>Strongylocentrotus purpuratus </it>genome for the neurotropin family members brain-derived neurotropic factor and neurothropin-3. Because it has been proposed that neurotropins constitute a vertebrate innovation <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, we performed a ClustalX alignment <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> of mouse neurotropins against the echinoderm sequence (                                                                                                          Additional data file 4). This revealed that the particular array of cysteines conserved in all neurotropins, the so-called 'cysteine knot' <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, is also present in the echinoderm sequence and therefore identifies it as a putative growth factor. However, the limited amino acid identity (33%) and the lack of conservation in critical residues required for neurotropin binding to Trk receptors indicate that the echinoderm neurotropin-related protein cannot be considered a <it>bona fide </it>neurotropin. This suggests that neurotropins evolved from divergent ligands present in ancestral chordates. In fact, the example of neurotropins may be just part of a more general mechanism because other 'new cytokines' can be related to pre-existing growth factors. Supporting this view, GDNF and neurturin are divergent members of the TGF-&#946; superfamily of ligands, as indicated by their particular cysteine knot and hence folding <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Similarly, despite their limited homology, neuregulins belong to the epidermal growth factor superfamily of ligands <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <p>Taken together, our data show that the cytokine network acting in neural crest cell fate specification is mainly a vertebrate innovation (Figure <figr fid="F4">4</figr>). Furthermore, these analyses indicate that an important proportion of the 'new ligands' are derived from fast evolving growth factors.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Emerging ligands control the specification of neural crest precursors</p>
               </caption>
               <text>
                  <p>Emerging ligands control the specification of neural crest precursors. The progressive determination of neural crest (NC) precursors into different cell lineages is represented in the scheme with a code of colors. Superimposed on this, the collection of new growth factors appearing first in vertebrates is depicted. The role of each ligand in controlling the specification/survival of each particular neural crest derivative is indicated with a corresponding code of colors. alpha-MSH, alpha-melanocyte-stimulating hormone; End, endothelin; GDNF, glial-derived neurotropic factor; NT, neurotropin; Nppc, natriuretic peptide precursor.</p>
               </text>
               <graphic file="gb-2007-8-3-r36-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis of the emergence of Pfam domains</p>
            </st>
            <p>The comparative analysis of gene emergence plots highlights a high rate of gene innovation for the neural crest during vertebrate evolution. In fact, there are reasons to believe that our estimation on the rate of gene emergence may be conservative. In the sequential blast pipeline analysis, the presence of an ancestral conserved domain will shadow the appearance of evolutionarily more recent domains within the same molecule. This may be particularly relevant in the case of large multidomain proteins such as receptors.</p>
            <p>To overcome this constraint and to complement our studies, we conducted a phylogenetic analysis of the Pfam motifs (defined by multiple alignment of proteins <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>) occurring in the collection of 615 neural crest genes. From a total of 8,183 Pfam domains annotated in EnsEMBL, 499 are present in the set of 615 neural crest genes. We screened for these motifs in the seven different categories, detecting homology through two different approaches: blasting Pfam consensus sequences (threshold of E = 10<sup>-4</sup>) and searching for hidden Markov models (HMMs) using HMMER software with standard parameters <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. We compiled a table including all neural crest genes with their Pfam domains and when they occur first in the defined seven temporal classes, as detected using either of the methods (Additional data file 5). A list including only those genes that contain a Pfam domain emerging in vertebrates is compiled in Table <tblr tid="T3">3</tblr>. Pfam domain detection supports and refines our sequential blast pipeline results. Thus, GDNF and neurturin were identified as divergent members of the TGF-&#946; superfamily, and the kit-ligand and nonagouti domains were detected as vertebrate novelties (previously detected as mammalian innovations; Table <tblr tid="T2">2</tblr>). Furthermore, the analysis also confirmed the ClustalX alignments demonstrating that the neurotropin domain (nerve growth factor; Table <tblr tid="T3">3</tblr>) is indeed a vertebrate innovation. In summary, our domain-based approach (more sensitive and accurate, but limited to annotated Pfam domains) complements the sequential blast analysis (Table <tblr tid="T2">2</tblr>), providing independent confirmation of the emergence in vertebrates of growth factors that are involved in the specification/survival of the neural crest cells (Table <tblr tid="T3">3</tblr>).</p>
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Neural crest associated Pfam domains emerging in vertebrates</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7" ca="center">
                        <p>Group</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Gene</p>
                     </c>
                     <c ca="left">
                        <p>blast</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>euk</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>deu</p>
                     </c>
                     <c ca="left">
                        <p>chr</p>
                     </c>
                     <c ca="left">
                        <p>ver</p>
                     </c>
                     <c ca="left">
                        <p>mam</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Slc12a6</p>
                     </c>
                     <c ca="left">
                        <p>Solute carrier family 12, member 6</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>AA_permease</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>KCl_Cotrans_1</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Apc</p>
                     </c>
                     <c ca="left">
                        <p>Adenomatosis polyposis coli</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Arm</p>
                     </c>
                     <c ca="left">
                        <p>APC_crr APC_15aa</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>EB1_binding APC_basic SAMP</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Asph</p>
                     </c>
                     <c ca="left">
                        <p>Aspartate-beta-hydroxylase</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>Asp_Arg_Hydrox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Asp-B-Hydro_N</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Top2b</p>
                     </c>
                     <c ca="left">
                        <p>Topoisomerase (DNA) II beta</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>DNA_topoisoIV DNA_gyraseB HATPase_c</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>DTHCT</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nef3</p>
                     </c>
                     <c ca="left">
                        <p>Neurofilament 3, medium</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament_head</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nefl</p>
                     </c>
                     <c ca="left">
                        <p>Neurofilament, light polypeptide</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament_head</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cryab</p>
                     </c>
                     <c ca="left">
                        <p>Crystallin, alpha B</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>HSP20</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Crystallin</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rabggta</p>
                     </c>
                     <c ca="left">
                        <p>Rab geranylgeranyl transferase, a subunit</p>
                     </c>
                     <c ca="left">
                        <p>pro</p>
                     </c>
                     <c ca="left">
                        <p>LRR_1</p>
                     </c>
                     <c ca="left">
                        <p>LRR_2 PPTA</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>RabGGT_insert</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Otx1</p>
                     </c>
                     <c ca="left">
                        <p>Orthodenticle homolog 1</p>
                     </c>
                     <c ca="left">
                        <p>euk</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>TF_Otx</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Otx2</p>
                     </c>
                     <c ca="left">
                        <p>Orthodenticle homolog 2</p>
                     </c>
                     <c ca="left">
                        <p>euk</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>TF_Otx</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zfp98</p>
                     </c>
                     <c ca="left">
                        <p>Zinc finger protein 98</p>
                     </c>
                     <c ca="left">
                        <p>euk</p>
                     </c>
                     <c ca="left">
                        <p>zf-C2H2</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>SCAN</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prph1</p>
                     </c>
                     <c ca="left">
                        <p>Peripherin 1</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Filament_head</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gfra1</p>
                     </c>
                     <c ca="left">
                        <p>Glial cell line derived neurotrophic factor family receptor alpha 1</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>GDNF</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cdx1</p>
                     </c>
                     <c ca="left">
                        <p>Caudal type homeo box 1</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Caudal_act</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cdx2</p>
                     </c>
                     <c ca="left">
                        <p>Caudal type homeo box 2</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Caudal_act</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hoxb9</p>
                     </c>
                     <c ca="left">
                        <p>Homeo box B9</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Hox9_act</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hoxa9</p>
                     </c>
                     <c ca="left">
                        <p>Homeo box A9</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Hox9_act</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nr3c1</p>
                     </c>
                     <c ca="left">
                        <p>Nuclear receptor subfamily 3, group C, member 1</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Hormone_recep zf-C4</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>GCR</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pdgfa</p>
                     </c>
                     <c ca="left">
                        <p>Platelet derived growth factor, alpha</p>
                     </c>
                     <c ca="left">
                        <p>met</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>PDGF</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>PDGF_N</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bdnf</p>
                     </c>
                     <c ca="left">
                        <p>Brain derived neurotrophic factor</p>
                     </c>
                     <c ca="left">
                        <p>deu</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>NGF</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ntf3</p>
                     </c>
                     <c ca="left">
                        <p>Neurotropin 3</p>
                     </c>
                     <c ca="left">
                        <p>deu</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>NGF</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>P2rx7</p>
                     </c>
                     <c ca="left">
                        <p>Purinergic receptor P2X, ligand-gated ion channel, 7</p>
                     </c>
                     <c ca="left">
                        <p>deu</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>P2X_receptor</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hapln1</p>
                     </c>
                     <c ca="left">
                        <p>Hyaluronan and proteoglycan link protein 1</p>
                     </c>
                     <c ca="left">
                        <p>cor</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>V-set</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Xlink</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nppc</p>
                     </c>
                     <c ca="left">
                        <p>Natriuretic peptide precursor type C</p>
                     </c>
                     <c ca="left">
                        <p>ver</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>ANP</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Calca</p>
                     </c>
                     <c ca="left">
                        <p>Calcitonin/calcitonin-related polypeptide, alpha</p>
                     </c>
                     <c ca="left">
                        <p>ver</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
         