<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2005-6-13-r110</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A compendium of <it>Caenorhabditis elegans </it>regulatory transcription factors: a resource for mapping transcription regulatory networks</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Reece-Hoyes</snm>
               <mi>S</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <email>bgyjsr@leeds.ac.uk</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Deplancke</snm>
               <fnm>Bart</fnm>
               <insr iid="I2"/>
               <email>bart.deplancke@umassmed.edu</email>
            </au>
            <au id="A3">
               <snm>Shingles</snm>
               <fnm>Jane</fnm>
               <insr iid="I1"/>
               <email>bgyjh@leeds.ac.uk</email>
            </au>
            <au id="A4">
               <snm>Grove</snm>
               <mi>A</mi>
               <fnm>Christian</fnm>
               <insr iid="I2"/>
               <email>christian.grove@umassmed.edu</email>
            </au>
            <au id="A5">
               <snm>Hope</snm>
               <mi>A</mi>
               <fnm>Ian</fnm>
               <insr iid="I1"/>
               <email>i.a.hope@leeds.ac.uk</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Walhout</snm>
               <mi>JM</mi>
               <fnm>Albertha</fnm>
               <insr iid="I2"/>
               <email>marian.walhout@umassmed.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Integrative and Comparative Biology, Faculty of Biological Sciences, School of Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, UK</p>
            </ins>
            <ins id="I2">
               <p>Program in Gene Function and Expression and Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, 364 Plantation Street, Lazare Research Building, Room 605, MA 01605, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>13</issue>
         <fpage>R110</fpage>
         <url>http://genomebiology.com/2005/6/13/R110</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16420670</pubid>
               <pubid idtype="doi">10.1186/gb-2005-6-13-r110</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>7</day>
               <month>11</month>
               <year>2005</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>28</day>
               <month>11</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>12</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Reece-Hoyes et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>C. elegans transcription factors</p>
      </shorttitle>
      <shortabs>
         <p>A compendium of 934 transcription factor genes in C. elegans and identified by computational searches and extensive manual</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Transcription regulatory networks are composed of interactions between transcription factors and their target genes. Whereas unicellular networks have been studied extensively, metazoan transcription regulatory networks remain largely unexplored. <it>Caenorhabditis elegans </it>provides a powerful model to study such metazoan networks because its genome is completely sequenced and many functional genomic tools are available. While <it>C. elegans </it>gene predictions have undergone continuous refinement, this is not true for the annotation of functional transcription factors. The comprehensive identification of transcription factors is essential for the systematic mapping of transcription regulatory networks because it enables the creation of physical transcription factor resources that can be used in assays to map interactions between transcription factors and their target genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>By computational searches and extensive manual curation, we have identified a compendium of 934 transcription factor genes (referred to as wTF2.0). We find that manual curation drastically reduces the number of both false positive and false negative transcription factor predictions. We discuss how transcription factor splice variants and dimer formation may affect the total number of functional transcription factors. In contrast to mouse transcription factor genes, we find that <it>C. elegans </it>transcription factor genes do not undergo significantly more splicing than other genes. This difference may contribute to differences in organism complexity. We identify candidate redundant worm transcription factor genes and orthologous worm and human transcription factor pairs. Finally, we discuss how wTF2.0 can be used together with physical transcription factor clone resources to facilitate the systematic mapping of <it>C. elegans </it>transcription regulatory networks.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>wTF2.0 provides a starting point to decipher the transcription regulatory networks that control metazoan development and function.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Metazoan genomes contain thousands of predicted protein-coding genes. During development, pathology, and in response to environmental changes, each of these genes is expressed in different cells, at different times and at different levels. Spatial and temporal gene expression is controlled transcriptionally through the action of regulatory transcription factors (TFs) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Transcription of each gene can be up- or down-regulated by TFs that bind to <it>cis</it>-regulatory DNA elements. These elements include upstream elements located in the proximal promoter, and enhancers or silencers that can be located at a greater distance from the transcription start site. Frequently, the expression level of a gene is the result of a balance between transcription activation and repression governed by multiple <it>cis</it>-regulatory elements and, hence, multiple TFs. The combinatorial nature of gene transcription provides an exquisite level of flexibility to regulate genome expression.</p>
         <p>The understanding of differential gene expression at a genome-wide, or systems, level has been greatly facilitated by the mapping and analysis of transcription regulatory networks (Figure <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Such networks are composed of two types of components, or nodes: the gene targets that are subject to transcriptional control and the TF proteins that execute transcriptional control. Whereas transcription regulatory networks have been extensively studied in relatively simple unicellular systems, they remain largely unexplored in complex, metazoan systems.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Transcription regulatory networks provide models to understand differential gene expression at a systems level</p>
            </caption>
            <text>
               <p>Transcription regulatory networks provide models to understand differential gene expression at a systems level. Transcription regulatory networks are composed of two types of components, or nodes: the genes involved in the system and the TFs that regulate their expression. Protein-protein interactions between TFs and protein-DNA interactions between TFs and their target genes can be visualized in transcription regulatory networks. The dashed line represents TF-TF protein-protein interaction (heterodimer). Arrows represent protein DNA interactions that result in transcription activation; the blunt 'arrow' represents protein-DNA interaction that results in repression of transcription.</p>
            </text>
            <graphic file="gb-2005-6-13-r110-1" hint_layout="double"/>
         </fig>
         <p>The nematode <it>Caenorhabditis elegans </it>is a powerful model to decipher metazoan transcription regulatory networks. The <it>C. elegans </it>genome has been completely sequenced and is predicted to contain 19,735 protein-coding genes (WormBase WS140)<abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Several functional genomic resources enable the systematic dissection of differential gene expression at a systems level and in a high-throughput manner. For instance, microarrays are available to investigate temporal and, to a certain extent, spatial gene expression levels <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. In addition, <it>C. elegans </it>'ORFeome' <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and 'Promoterome' <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> resources provide open reading frame (ORF) and promoter clones, respectively. These clones can be used for a wide variety of experiments that aim to dissect transcription regulatory networks (see below).</p>
         <p>With seven years of progressive refinement of the genome annotation since the publication of the genome sequence <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, comprehensive predictions of protein-coding genes are available. However, there is no up-to-date compendium of predicted <it>C. elegans </it>TFs. Several lists of putative <it>C. elegans </it>TFs have been generated previously (Table <tblr tid="T1">1</tblr>; we refer to this combined set as wTF1.0 (worm transcription factors version 1.0)), but none of these are comprehensive or readily accessible. The earliest lists were created by scanning sequences of <it>C. elegans </it>proteins, as predicted from the genome annotation, for well-defined DNA binding domains: Hobert and Ruvkun <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> focused on homeodomain, paired domain, T-box, basic helix-loop-helix (bHLH), basic region leucine zipper (bZIP), Fork Head, erythroblast transformation specific (ETS) and nuclear hormone receptor (NHR) proteins; and Clarke and Berg <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> focused on various zinc-finger proteins. For comparative genomic studies primarily focusing on <it>Drosophila melanogaster </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp> or <it>Arabidopsis thaliana </it><abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, a list of predicted <it>C. elegans </it>proteins containing various DNA binding domains was compiled. While several (sub)-families of <it>C. elegans </it>TFs have been studied in greater detail (for example, bHLH <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, CUT homeodomain <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and DM zinc-finger <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>), the most recent list of "all <it>C. elegans </it>TFs" was compiled five years ago when Riechmann and colleagues <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> scanned WormPep 20 (19,101 proteins). During the past few years, the creation of improved computational tools <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and the completion of the <it>C. briggsae </it>genome sequence <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> have enabled a great improvement in the annotation of the <it>C. elegans </it>genome. Here, we used a combination of bioinformatics and extensive manual curation to generate wTF2.0, a comprehensive compendium of predicted <it>C. elegans </it>TF genes. We discuss how wTF2.0 can be used together with physical ORFeome and Promoterome clone resources to decipher transcription regulatory networks that control metazoan differential gene expression at a systems level.</p>
         <tbl id="T1" hint_layout="double">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Comparison of wTF2.0 versus wTF1.0</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c ca="left">
                     <p>DNA binding domain</p>
                  </c>
                  <c ca="left">
                     <p>Description</p>
                  </c>
                  <c cspan="3" ca="center">
                     <p>wTF2.0</p>
                  </c>
                  <c cspan="4" ca="center">
                     <p>wTF1.0</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>WP140 22,420</p>
                  </c>
                  <c ca="center">
                     <p>Family members in humans</p>
                  </c>
                  <c ca="center">
                     <p>Ortholog pairs</p>
                  </c>
                  <c ca="center">
                     <p>Clarke </p>
                     <p>1998 </p>
                     <p>WP14 </p>
                     <p>14,655 [16]</p>
                  </c>
                  <c ca="center">
                     <p>Ruvkun 1998 WP15 15,558 [15]</p>
                  </c>
                  <c ca="center">
                     <p>Rubin 2000 WP18 18,576</p>
                  </c>
                  <c ca="center">
                     <p>Reichaman 2000 WP20  19,101 [18]</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>AP-2</p>
                  </c>
                  <c ca="left">
                     <p>Activator protein-2</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ARID/BRIGHT</p>
                  </c>
                  <c ca="left">
                     <p>AT-rich interaction domain</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>AT HOOK</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>31</p>
                  </c>
                  <c ca="center">
                     <p>28</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>BHLH</p>
                  </c>
                  <c ca="left">
                     <p>basic region helix loop helix</p>
                  </c>
                  <c ca="center">
                     <p>42</p>
                  </c>
                  <c ca="center">
                     <p>103</p>
                  </c>
                  <c ca="center">
                     <p>22</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>24</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>25</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>BZIP</p>
                  </c>
                  <c ca="left">
                     <p>basic region leucine zipper</p>
                  </c>
                  <c ca="center">
                     <p>32</p>
                  </c>
                  <c ca="center">
                     <p>57</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>25</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>CBF</p>
                  </c>
                  <c ca="left">
                     <p>CCAAT-binding factor</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COLD BOX</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>CP2</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>HD</p>
                  </c>
                  <c ca="left">
                     <p>Homeodomain</p>
                  </c>
                  <c ca="center">
                     <p>99</p>
                  </c>
                  <c ca="center">
                     <p>167</p>
                  </c>
                  <c ca="center">
                     <p>36</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>83</p>
                  </c>
                  <c ca="center">
                     <p>88</p>
                  </c>
                  <c ca="center">
                     <p>84</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>HMG</p>
                  </c>
                  <c ca="left">
                     <p>High mobility group</p>
                  </c>
                  <c ca="center">
                     <p>16</p>
                  </c>
                  <c ca="center">
                     <p>59</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>HTH</p>
                  </c>
                  <c ca="left">
                     <p>Helix turn helix</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>IPT/TIG</p>
                  </c>
                  <c ca="left">
                     <p>Ig-like, plexins, TFs</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MADF</p>
                  </c>
                  <c ca="left">
                     <p>Mothers against Dpp factor</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MADS box</p>
                  </c>
                  <c ca="left">
                     <p>MCM1/AG/DEF/SRF</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MH1</p>
                  </c>
                  <c ca="left">
                     <p>MAD homology 1</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MYB</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>19</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>16</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>p53</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>PD-FULL</p>
                  </c>
                  <c ca="left">
                     <p>Paired domain</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>1*</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>PD-NPAX</p>
                  </c>
                  <c ca="left">
                     <p>Paired domain</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>PD-CPAX</p>
                  </c>
                  <c ca="left">
                     <p>Paired domain</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>PD-UNDEFINED</p>
                  </c>
                  <c ca="left">
                     <p>Paired domain</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>11<sup>&#8224;</sup></p>
                  </c>
                  <c ca="center">
                     <p>10<sup>&#8224;</sup></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RPEL</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RUNT</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SAND</p>
                  </c>
                  <c ca="left">
                     <p>Sp100, AIRE-1, NucP41/75, DEAF-1</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>1*</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>STAT</p>
                  </c>
                  <c ca="left">
                     <p>Signal transducers and activators of transcription</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>T-BOX</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>17</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>17</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>TEA/ATTS</p>
                  </c>
                  <c ca="left">
                     <p>Transcriptional enhancer activator</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>TSC-22/DIP/BUN</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-DAC</p>
                  </c>
                  <c ca="left">
                     <p>Dachshund</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-ETS</p>
                  </c>
                  <c ca="left">
                     <p>Erythroblast transformation specific</p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-FH</p>
                  </c>
                  <c ca="left">
                     <p>Fork head</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>41</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
                  <c ca="center">
                     <p>19</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-HSF</p>
                  </c>
                  <c ca="left">
                     <p>Heat shock factor</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-RFX</p>
                  </c>
                  <c ca="left">
                     <p>X-box binding regulatory factor</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-TDP</p>
                  </c>
                  <c ca="left">
                     <p>TF E2F dimerisation partner</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WH-UNDEFINED</p>
                  </c>
                  <c ca="left">
                     <p>Winged helix</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>WT1</p>
                  </c>
                  <c ca="left">
                     <p>Wilms tumor1</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>17</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>YL1</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-A20</p>
                  </c>
                  <c ca="left">
                     <p>Zinc finger, A20-type</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-BED</p>
                  </c>
                  <c ca="left">
                     <p>BEAF/DREF-like ZF</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-C2H2</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>211</p>
                  </c>
                  <c ca="center">
                     <p>391</p>
                  </c>
                  <c ca="center">
                     <p>35</p>
                  </c>
                  <c ca="center">
                     <p>117</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>138</p>
                  </c>
                  <c ca="center">
                     <p>139</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-C2HC</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-CCCH</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>32</p>
                  </c>
                  <c ca="center">
                     <p>50</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>20</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-DHHC</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>13</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-DM</p>
                  </c>
                  <c ca="left">
                     <p>Dsx and Mab-3-like ZF</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>8</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-FLYWCH</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-GATA</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-MIZ</p>
                  </c>
                  <c ca="left">
                     <p>Msx interacting ZF</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-NF-X1</p>
                  </c>
                  <c ca="left">
                     <p>Nuclear factor</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-NHR/C4</p>
                  </c>
                  <c ca="left">
                     <p>Nuclear hormone receptor</p>
                  </c>
                  <c ca="center">
                     <p>274</p>
                  </c>
                  <c ca="center">
                     <p>43</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>233</p>
                  </c>
                  <c ca="center">
                     <p>235</p>
                  </c>
                  <c ca="center">
                     <p>224</p>
                  </c>
                  <c ca="center">
                     <p>252</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZF-THAP</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>UNKNOWN</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>TOTAL</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>957<sup>&#8225;</sup></p>
                  </c>
                  <c ca="center">
                     <p>1,231</p>
                  </c>
                  <c ca="center">
                     <p>203</p>
                  </c>
                  <c ca="center">
                     <p>387</p>
                  </c>
                  <c ca="center">
                     <p>407</p>
                  </c>
                  <c ca="center">
                     <p>527</p>
                  </c>
                  <c ca="center">
                     <p>652</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>This table shows the number of genes encoding each type of domain. Genes encoding multiple domains of the same type are counted only once. Dashes indicate the domain was not investigated. *These genes encode two distinct domains: PD and HD; SAND and AT hook. <sup>&#8224;</sup>Without access to the complete Rubin and Reichmann lists, we are unable to classify their PD family members. <sup>&#8225;</sup>Twenty-three genes in wTF2.0 encode two different types of domain.</p>
            </tblfn>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>TF predictions: Gene Ontology term-based searches</p>
            </st>
            <p>To identify a comprehensive compendium of predicted worm TFs, we first interrogated WormBase version 140 (WS140)<abbrgrp><abbr bid="B6">6</abbr></abbrgrp> for proteins that possess domains annotated with one of the following Gene Ontology (GO) terms: 'regulation of transcription, DNA-dependent', 'transcription factor activity', and 'DNA binding'. (WS140 is the most recent reference release of WormBase that is permanently accessible.) We identified a total of 930 proteins (Figure <figr fid="F2">2</figr>, Additional data file 1). Of these, 232 were identified by all three GO terms, 368 by two GO terms and 330 by only one GO term (Figure <figr fid="F3">3</figr>). We observed that this collection of proteins not only contains predicted regulatory TFs, but also proteins that function in other nuclear processes (for example, DNA replication and repair). Moreover, it contains numerous false positive predictions (for example, small GTP-binding proteins). We removed both types of false positives (Figure <figr fid="F2">2</figr>, Additional data file 2). Next, we examined each of the remaining proteins for the presence of a predicted DNA binding domain either by visual inspection of the protein sequence (AT-hooks and C2H2 zinc-fingers), or using InterPro v10.0 (2005)<abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, SMART <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and Pfam <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> databases (Additional data file 2). We found several proteins that, upon closer inspection, do not possess a DNA binding domain despite their WormBase protein domain annotation (Additional data file 2). For instance, several proteins were found that are annotated to be a NHR TF, but that only contain a predicted ligand binding domain. However, we retained proteins that do not have a clear DNA binding domain but for which experimental evidence is available that supports their function as a TF. For example, we included SKN-1, a bZIP protein known to bind DNA in a sequence-specific manner <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. In total, 369 proteins (40%) were removed (Figure <figr fid="F2">2a</figr>, Additional data file 2). As expected, combining all three GO terms was the most robust method for identifying predicted TFs, as 96% of these proteins were retained. The GO term DNA binding by itself was least robust as only 16% of these were retained. However, this can readily be explained by the retrieval of proteins that do bind DNA but that are not involved in transcriptional regulation.</p>
            <suppl id="S1">
               <title>
                  <p>Additional data file 1</p>
               </title>
               <caption>
                  <p>wTF2.0: a collection of predicted <it>C. elegans </it>transcription factors</p>
               </caption>
               <text>
                  <p>wTF2.0: a collection of predicted <it>C. elegans </it>transcription factors</p>
               </text>
               <file name="gb-2005-6-13-r110-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional data file 2</p>
               </title>
               <caption>
                  <p>Overview of manually curated genes that were left out of wTF2.0</p>
               </caption>
               <text>
                  <p>Overview of manually curated genes that were left out of wTF2.0</p>
               </text>
               <file name="gb-2005-6-13-r110-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Generation of wTF2.0, a comprehensive compendium of <it>C. elegans </it>TFs</p>
               </caption>
               <text>
                  <p>Generation of wTF2.0, a comprehensive compendium of <it>C. elegans </it>TFs. Schematic overview of the wTF2.0 generation pipeline. See main text for details.</p>
               </text>
               <graphic file="gb-2005-6-13-r110-2" hint_layout="double"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Venn diagram presenting the results of the GO term-based bioinformatic identification of putative TFs in WormBase 140</p>
               </caption>
               <text>
                  <p>Venn diagram presenting the results of the GO term-based bioinformatic identification of putative TFs in WormBase 140. GO terms are indicated in each Venn diagram set. Numbers between parentheses represent the number of putative TFs retained in wTF2.0 after manual curation or DNA binding domain identification using InterPro v. 10.0.</p>
               </text>
               <graphic file="gb-2005-6-13-r110-3" hint_layout="double"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>TF predictions: DNA binding domains</p>
            </st>
            <p>Upon examination of the remaining 561 proteins, we noticed that several well known TF families were underrepresented compared to wTF1.0, or even absent (for example, bHLH, C2H2 zinc-fingers and MADF (Mothers Against Dpp Factor)). This suggests that the predictions based on GO annotations alone suffer from a high false negative rate. To address this issue, we searched WormBase for each protein domain known to be involved in sequence specific DNA binding (Table <tblr tid="T1">1</tblr>). In addition, we added several TFs found by yeast one-hybrid assays (for example, TFs containing RPEL and FLYWCH domains <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> (data not shown)). We used visual inspection (C2H2 zinc-fingers and AT-hooks), InterPro, SMART and Pfam to verify these predictions and, in total, added 369 additional, putative TFs to the compendium. Finally, we added 4 proteins: 3 of which are homologs of known mammalian TFs (BAR-1, HMP-2 and WRM-1, homologs of mammalian &#946;-catenin) and one that has been described in the literature (SDC-2 <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>). In total, amongst the 19,735 predicted protein-coding genes, we identified 934 predicted <it>C. elegans </it>TF genes (Additional data file 1). Taken together, the combination of computational queries and manual curation results in a comprehensive compendium of <it>C. elegans </it>TF-encoding genes. We refer to this compendium as wTF2.0.</p>
         </sec>
         <sec>
            <st>
               <p>TF families</p>
            </st>
            <p>Table <tblr tid="T1">1</tblr> presents wTF2.0 grouped into TF families. Interestingly, 23 TFs contain DNA binding domains from different families (Additional data file 3). Future studies will determine if and how these domains function together in DNA binding specificity and, consequently, target gene selection. Interestingly, most human orthologs of these TFs also contain multiple, distinct DNA binding domains (Additional data file 3), indicating that the occurrence of multiple DNA binding domains in a TF is not worm specific. Comparison of wTF2.0 to wTF1.0 revealed that, of 48 TF families, 23 are unique to wTF2.0 (Table <tblr tid="T1">1</tblr>). This is likely because for the collective predictions in wTF1.0, only the major DNA binding domains were included. In addition, some protein domains have only recently been annotated to function in DNA-binding, including SAND <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and THAP zinc-finger <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> domains. For several of the domains unique to wTF2.0 (CP2, WH-DAC, IPT/TIG, TEA/ATTS, WT1, YL1), the genes encoding them were not actually annotated until after WormPep 20 and could, therefore, not have been included in the wTF1.0 collections.</p>
            <suppl id="S3">
               <title>
                  <p>Additional data file 3</p>
               </title>
               <caption>
                  <p>wTF2.0 TFs that contain two distinct DNA binding domains</p>
               </caption>
               <text>
                  <p>wTF2.0 TFs that contain two distinct DNA binding domains</p>
               </text>
               <file name="gb-2005-6-13-r110-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>wTF2.0 is a dynamic resource</p>
               </st>
               <p>wTF2.0 is the most comprehensive compendium of predicted worm TFs to date. However, the set of predicted <it>C. elegans </it>TFs will still be dynamic due to regular updating of the <it>C. elegans </it>genome annotation, for example in response to genome sequence data from related nematode species <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and improvements in gene-prediction software. We have noted several changes in gene annotations in WormBase releases subsequent to WS140 that affect wTF2.0 (Additional data file 4). There have been additions, such as Y55F3AM.7 (created in WS146 and encoding a C2H2 zinc-finger protein), and eliminations, such as Y60A9.2 (encoding a CCCH zinc-finger protein in WS140, but designated a pseudogene since WS141). We have also noted more subtle adjustments in gene structure based on TWINSCAN <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> suggestions of different splicing patterns. For example, a modified gene structure for F22A3.5 that meant the gene product would then include a complete homeodomain was adopted in WS143, with support from <it>C. briggsae </it>genome sequence. Similar gene structure changes that would lead to intact homeodomains for F34D6.2, R04A9.5 and <it>ceh-31</it>, and an intact bHLH domain for <it>hlh-19 </it>(see comments in Additional data file 1) may yet be incorporated into WormBase. Taken together, we expect that wTF2.0 will be a dynamic resource but that a relatively small number of TFs will be removed and added over time.</p>
               <suppl id="S4">
                  <title>
                     <p>Additional data file 4</p>
                  </title>
                  <caption>
                     <p>Possible additions to wTF2.0</p>
                  </caption>
                  <text>
                     <p>Possible additions to wTF2.0</p>
                  </text>
                  <file name="gb-2005-6-13-r110-S4.xls">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Functional TFs: splice variants</p>
            </st>
            <p>wTF2.0 is a starting point to predict the actual number of functional TF complexes. For instance, the number of active TFs is likely greater than 934 because many TF genes encode multiple proteins as a result of alternative transcripts. In addition, many TFs function as heterodimers, with subunits associating in different combinations. To date, 144 of the 934 predicted TF genes (15.4%) are known to undergo alternative splicing (Additional data file 5). On average, each spliced TF gene results in 3 different transcripts and the number of splice variants per gene ranges from 2 to 13. Some alternative transcripts do not result in the expression of a different TF protein variant. In total, 379 alternative TF protein variants are expressed from 144 genes, and the number of variant proteins per TF gene is between 2 and 10. Interestingly, 30 TF variants, corresponding to 25 TF genes, no longer contain a DNA binding domain. Rather than binding DNA and regulating target gene expression directly, these proteins may have regulatory functions to control TF activity. Taken together, alternative splicing yields 205 additional putative DNA binding TFs, bringing the total number of predicted <it>C. elegans </it>TFs to 1,139. Interestingly, Taneri and colleagues <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> observed that mouse TF genes are more likely to undergo alternative splicing than other mouse genes (62% compared to 29%). These alternatively spliced TF genes may yield functionally different TFs that may bind DNA with different specificities and affinities and, as a consequence, regulate different sets of target genes. In contrast, the percentage of <it>C. elegans </it>TF genes that undergo alternative splicing is only slightly higher than the percentage of all protein-coding genes that are alternatively spliced (15% versus 10%) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> (this study). This observation suggests that higher percentages of TF gene splicing may contribute to increased organism complexity. Finally, it is important to note that several <it>C. elegans </it>TFs can be expressed from multiple alternative promoters (Additional data file 5). Alternative promoters are likely to drive different patterns and levels of TF production, which may contribute to the complexity of combinatorial gene expression.</p>
            <suppl id="S5">
               <title>
                  <p>Additional data file 5</p>
               </title>
               <caption>
                  <p>Alternative splice forms and promoters</p>
               </caption>
               <text>
                  <p>Alternative splice forms and promoters</p>
               </text>
               <file name="gb-2005-6-13-r110-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Functional TFs: dimers</p>
            </st>
            <p>Several TFs, including bHLH, NHR and bZIP proteins, are known to bind DNA as either homo- or heterodimers, and the different dimer combinations that occur <it>in vivo </it>determine the actual number of TF complexes. For instance, the minimum number of TF complexes would be half the total number of TFs predicted if each TF were to exclusively dimerize with one other TF. Alternatively, the total number of functional TFs could be much larger than the number of predicted TF genes if each TF dimerizes with multiple other TFs. To start addressing this issue, we retrieved a network of TF-TF interactions that were identified in a large-scale yeast two-hybrid-based protein-protein interaction mapping study <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> (Figure <figr fid="F4">4</figr>, Additional data file 6). We found 68 putative TF-TF dimers involving 71 TFs: 35 between members of different TF families and 33 between members of the same TF family. Of these 33, 7 are putative homodimers and the remaining 26 are putative heterodimers. Interestingly, the TF dimerization network suggests that certain TFs, such as NHR-49, can function as dimerization hubs. NHR-49 is involved in the regulation of fat storage and life span <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, but it is not known if NHR-49 functions in these processes as a homodimer or in concert with other NHR TFs. It is noted that the current TF dimerization network is only a small representation of all TF dimers. This is because some TFs may only form dimers on their cognate DNA and may, therefore, not be detected by yeast two-hybrid assays; and because the current worm 'interactome' (WI5) only contains approximately 5% of all protein-protein interactions that can be detected by yeast two-hybrid assays <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Future systematic TF-TF protein-protein interaction mapping projects are required to determine the total complement of TF dimers. Although it is difficult to interpret interactions between TFs from different families, they could point to putative combinatorial regulation of target genes. Taken together, assuming that many TFs can function both as monomers and dimers, the number of functional TFs will likely exceed the number of predicted individual TF proteins.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Protein-protein interaction network of worm TFs</p>
               </caption>
               <text>
                  <p>Protein-protein interaction network of worm TFs. Blue rectangles indicate homodimers. Different colors identify different TF families as indicated. Interactions were obtained from Worm Interactome version 5 (WI5) [32] and visualized using Cytoscape [59].</p>
               </text>
               <graphic file="gb-2005-6-13-r110-4" hint_layout="double"/>
            </fig>
            <suppl id="S6">
               <title>
                  <p>Additional data file 6</p>
               </title>
               <caption>
                  <p>Overview of protein interactions involving wTF2.0 TFs</p>
               </caption>
               <text>
                  <p>Overview of protein interactions involving wTF2.0 TFs</p>
               </text>
               <file name="gb-2005-6-13-r110-S6.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>TF families: redundancy</p>
               </st>
               <p>For the systematic mapping of transcription regulatory networks, it is important to identify redundancy between closely related TF genes. This is because redundant genes have similar, overlapping or identical biological functions and, thus, results obtained with an individual TF may be difficult to interpret. In addition, one would like to identify paralogous TFs that share extensive similarity in their DNA binding domain, because such TFs may bind similar DNA sequences and, therefore, overlapping sets of target genes. There are several well characterized examples of redundant <it>C. elegans </it>TFs, including the Fork Head genes <it>pes-1 </it>and <it>fkh-2 </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, the GATA factors <it>med-1 </it>and <it>med-2</it>, and <it>end-1 </it>and <it>end-3 </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and the T-box genes <it>tbx-8 </it>and <it>tbx-9 </it><abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, and <it>tbx-37 </it>and <it>tbx-38 </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. To identify additional putative redundant or highly similar TF genes, we used ClustalX analysis to generate trees that display the level of sequence similarity within each TF family (Additional data file 7). As expected, the known redundant TFs indicated above are found on adjacent branches in these trees. Table <tblr tid="T2">2</tblr> provides additional TF pairs that share extensive homology and that, therefore, may be (partially) redundant.</p>
               <suppl id="S7">
                  <title>
                     <p>Additional data file 7</p>
                  </title>
                  <caption>
                     <p>Phylogenetic trees of worm TF families</p>
                  </caption>
                  <text>
                     <p>Phylogenetic trees of worm TF families</p>
                  </text>
                  <file name="gb-2005-6-13-r110-S7.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Candidate redundant worm TF pairs</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>DNA binding domain</p>
                     </c>
                     <c ca="left">
                        <p>TF 1</p>
                     </c>
                     <c ca="left">
                        <p>TF 2</p>
                     </c>
                     <c ca="left">
                        <p>E-value</p>
                     </c>
                     <c ca="center">
                        <p>% Identity</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T-BOX</p>
                     </c>
                     <c ca="left">
                        <p>Y59E9AR.5</p>
                     </c>
                     <c ca="left">
                        <p>TBX-30</p>
                     </c>
                     <c ca="left">
                        <p>E-168</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T-BOX</p>
                     </c>
                     <c ca="left">
                        <p>TBX-39</p>
                     </c>
                     <c ca="left">
                        <p>TBX-40</p>
                     </c>
                     <c ca="left">
                        <p>E-170</p>
                     </c>
                     <c ca="center">
                        <p>98.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF CCCH</p>
                     </c>
                     <c ca="left">
                        <p>C35D6.4</p>
                     </c>
                     <c ca="left">
                        <p>F38C2.7</p>
                     </c>
                     <c ca="left">
                        <p>E-98</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>COLD BOX</p>
                     </c>
                     <c ca="left">
                        <p>CEY-2</p>
                     </c>
                     <c ca="left">
                        <p>CEY-3</p>
                     </c>
                     <c ca="left">
                        <p>E-96</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF C2H2</p>
                     </c>
                     <c ca="left">
                        <p>F47H4.1</p>
                     </c>
                     <c ca="left">
                        <p>Y6G8.3</p>
                     </c>
                     <c ca="left">
                        <p>E-109</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF C2H2</p>
                     </c>
                     <c ca="left">
                        <p>H16D19.3</p>
                     </c>
                     <c ca="left">
                        <p>T07D10.3</p>
                     </c>
                     <c ca="left">
                        <p>E-112</p>
                     </c>
                     <c ca="center">
                        <p>58.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF C2H2</p>
                     </c>
                     <c ca="left">
                        <p>LSY-2</p>
                     </c>
                     <c ca="left">
                        <p>LSL-1</p>
                     </c>
                     <c ca="left">
                        <p>E-74</p>
                     </c>
                     <c ca="center">
                        <p>81.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF C2H2</p>
                     </c>
                     <c ca="left">
                        <p>T07G12.10</p>
                     </c>
                     <c ca="left">
                        <p>T07G12.11</p>
                     </c>
                     <c ca="left">
                        <p>E-194</p>
                     </c>
                     <c ca="center">
                        <p>98.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>bHLH</p>
                     </c>
                     <c ca="left">
                        <p>HLH-25</p>
                     </c>
                     <c ca="left">
                        <p>HLH-27</p>
                     </c>
                     <c ca="left">
                        <p>E-139</p>
                     </c>
                     <c ca="center">
                        <p>95.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>bHLH</p>
                     </c>
                     <c ca="left">
                        <p>HLH-28</p>
                     </c>
                     <c ca="left">
                        <p>HLH-29</p>
                     </c>
                     <c ca="left">
                        <p>E-124</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HMG</p>
                     </c>
                     <c ca="left">
                        <p>HMG-3</p>
                     </c>
                     <c ca="left">
                        <p>HMG-4</p>
                     </c>
                     <c ca="left">
                        <p>E-297</p>
                     </c>
                     <c ca="center">
                        <p>99.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MYB</p>
                     </c>
                     <c ca="left">
                        <p>T10E7.11</p>
                     </c>
                     <c ca="left">
                        <p>T07F8.4</p>
                     </c>
                     <c ca="left">
                        <p>E-132</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PAIRED DOMAIN</p>
                     </c>
                     <c ca="left">
                        <p>EGL-38</p>
                     </c>
                     <c ca="left">
                        <p>PAX-2</p>
                     </c>
                     <c ca="left">
                        <p>E-132</p>
                     </c>
                     <c ca="center">
                        <p>92.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF - PHD</p>
                     </c>
                     <c ca="left">
                        <p>F15E6.1</p>
                     </c>
                     <c ca="left">
                        <p>Y51H4A.12</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF - PHD</p>
                     </c>
                     <c ca="left">
                        <p>LET-418</p>
                     </c>
                     <c ca="left">
                        <p>CHD-3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>99.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fork Head</p>
                     </c>
                     <c ca="left">
                        <p>FKH-3</p>
                     </c>
                     <c ca="left">
                        <p>FKH-4</p>
                     </c>
                     <c ca="left">
                        <p>E-199</p>
                     </c>
                     <c ca="center">
                        <p>99.7</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>E-values and % identity values were obtained via pairwise blastp BLAST. See Table 1 for DNA binding domain abbreviations.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Human TF orthologs</p>
            </st>
            <p>Next, we identified putative human orthologs for each worm TF, based on reciprocal best BLAST hits <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> (Additional data file 8). First, we identified members of each TF family in humans. Subsequently, we determined the number of ortholog pairs per TF family (Table <tblr tid="T1">1</tblr>). We found that some TF families are expanded in humans. For example, the CP2, C2HC and helix-turn-helix (HTH) TF families are represented by only one or two proteins in the worm each having a human ortholog. However, the human families are expanded six-fold. Conversely, other TF families are expanded in the worm, compared to human. As reported previously <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, the NHR family is expanded in worms and contains 274 predicted members (versus 43 in humans). Interestingly, the MADF family, which is composed of nine proteins in the worm, is not found in humans.</p>
            <suppl id="S8">
               <title>
                  <p>Additional data file 8</p>
               </title>
               <caption>
                  <p>Putative human wTF2.0 TF homologs and orthologs</p>
               </caption>
               <text>
                  <p>Putative human wTF2.0 TF homologs and orthologs</p>
               </text>
               <file name="gb-2005-6-13-r110-S8.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Putative TF orthologs may comprise a valuable tool to annotate TF function in either worm or human systems. For instance, although for most worm TFs the binding site is completely unknown <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, consensus DNA binding sequences are available for many human TFs and are collected in the Transfac database <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. DNA binding domains evolve slower than other protein sequences <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and, as a consequence, orthologous TFs recognize similar DNA sequences <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Therefore, DNA binding specificities of human TFs may be helpful to predict the DNA binding specificities of orthologous <it>C. elegans </it>TFs and <it>vice versa</it>. In the future, orthology of TFs will be invaluable in the study of the evolution of transcription regulatory networks.</p>
         </sec>
         <sec>
            <st>
               <p>wTF2.0: a tool for the creation of TF-ORF resources</p>
            </st>
            <p>wTF2.0 provides a starting point for the creation of physical clone resources that can be used to systematically map transcription regulatory networks. TF-ORFs can be obtained from the ORFeome resource and efficiently subcloned by a Gateway cloning reaction into various different Destination vectors <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp> (Figure <figr fid="F5">5</figr>). To date, the <it>C. elegans </it>ORFeome consists of approximately 13,000 full-length ORFs, which is approximately 66% of all predicted ORFs. We searched worfdb, the ORFeome database <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, and found 652 predicted TF-encoding ORFs (70%). These TF-ORFs can be used to map transcription regulatory networks in different ways. First, they can be cloned into yeast one-hybrid prey vectors to detect physical interactions with their target genes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. For instance, TF-ORFs have been pooled to create a TF mini-library that can be used in high-throughput yeast one-hybrid assays <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Second, they can be transferred to yeast two-hybrid or 'TAG' vectors for the identification of protein-protein interactions <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. This will be important to further identify functional TF complexes and to understand how TF function is regulated. In addition, TAG vectors may be useful to create transgenic worm strains that can be used in chromatin-immunoprecipitation experiments to identify TF target genes <it>in vivo</it>. Finally, TF-ORFs can be subcloned into an RNA interference (RNAi) vector for the analysis of loss-of-function phenotypes or for the identification of genetic interactions <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. Phenotypic analyses of TFs will be important for the analysis and interpretation of transcription regulatory networks.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>wTF2.0 can be used to create clone resources that can be used to study the transcription regulatory networks controlling metazoan gene expression</p>
               </caption>
               <text>
                  <p>wTF2.0 can be used to create clone resources that can be used to study the transcription regulatory networks controlling metazoan gene expression. TAG, epitope or purification tag; Y1H, yeast one-hybrid; Y2H, yeast two-hybrid.</p>
               </text>
               <graphic file="gb-2005-6-13-r110-5" hint_layout="double"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>wTF2.0: a tool for the creation of TF gene promoter resources</p>
            </st>
            <p>To date, the Promoterome <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> contains approximately 6,500 promoters (33%), including 279 (30%) TF-promoters. TF-promoters can be fused to green fluorescent protein (GFP) in two configurations. TF-promoters can be fused directly to GFP in what are referred to as 'transcriptional fusions' and the resulting promoter::GFP constructs can be used to create transgenic <it>C. elegans </it>strains in which promoter activity can be examined by light microscopy <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Such lines can also be used to examine the effects on GFP expression as the result of a knockdown in regulatory TF levels by RNAi <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Alternatively, TF-promoters and corresponding TF-ORFs can be cloned together with GFP by multisite Gateway cloning <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> to create 'translational fusions' with GFP. The resulting promoter::ORF::GFP constructs are used to create transgenic lines in which both TF-promoter activity and TF subcellular localization can be examined <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Finally, TF-promoters can be cloned into yeast one-reporter vectors to identify other TFs that can physically associate with these promoters and that may contribute to TF promoter activity <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Such interactions are important to delineate regulatory cascades, important building blocks in transcription regulatory networks <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>We have compiled wTF2.0, a comprehensive compendium of putative <it>C. elegans </it>TFs, using both computational queries and manual curation. Combining wTF2.0 with different physical TF clone resources provides the first step toward the systematic dissection of <it>C. elegans </it>transcription regulatory networks.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Prediction of C. elegans TF-encoding genes</p>
            </st>
            <p>WormPep 140 (WS140) (22,420 proteins, 19,735 genes) was searched. Proteins that have no apparent function in transcription regulation (for example, proteins involved in DNA repair and replication, chromatin remodeling, kinases) were removed. In addition, we removed general TFs (Additional data file 2). To identify TFs missed by the GO search, we searched WormPep 140 using individual DNA binding domains (Table <tblr tid="T1">1</tblr>). Next, we computationally (using SMART, Pfam or InterPro) or manually inspected each protein sequence for the presence of a DNA binding domain (Additional data file 2). For C2H2 zinc-fingers, we only considered proteins that contain fingers with the following configuration: C-X<sub>2-5</sub>-C-X<sub>9</sub>-H-X<sub>3-5</sub>-H <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. However, we did include two proteins (LIR-1 and TLP-1) that do not have a canonical C2H2 zinc-finger, because they were found multiple times in high-throughput yeast one-hybrid assays (data not shown). For AT-hook predictions, we used the definition as described <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. The numbers of human genes encoding each TF DNA binding domain were found by searching for the appropriate InterPro domain accession number using the Ensembl Human database.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of candidate redundant TFs</p>
            </st>
            <p>For each TF family with at least five members, an alignment was created using the multiple alignment mode of ClustalX v. 1.83 <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> under default settings, including the Gonnet series of protein matrices. For TF genes encoding multiple isoforms, if the DNA binding domain was identical in all isoforms, the largest isoform was used. Isoforms containing different DNA binding domains were included separately. Unrooted trees were then generated from these alignments using ClustalX v. 1.83 using the Neighbor-Joining method, and visualized using the phylogram output of TREEVIEW PPC v1.6.6 <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> (Additional data file 7). Although the analysis was not of sufficient depth for these trees to represent real evolutionary relationships amongst the deeper branches, these trees do accurately reflect close relationships between <it>C. elegans </it>TFs, with candidate redundant genes occurring on adjacent short branches.</p>
         </sec>
         <sec>
            <st>
               <p>TF splice variants and alternative TF promoters</p>
            </st>
            <p>To identify TF splice variants, the coding sequences of each TF were retrieved from WS140 using the batch gene tool. Each splice variant was then manually examined for the presence of a DNA-binding domain and/or an alternative promoter.</p>
         </sec>
         <sec>
            <st>
               <p>TF dimers</p>
            </st>
            <p>TF dimers were obtained from worm interactome version 5 (WI5) <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. TF-TF interactions were modeled into a protein interaction network using the Cytoscape software package <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. In this network, nodes correspond to interactors and edges (that is, links between nodes) represent protein-protein interactions.</p>
         </sec>
         <sec>
            <st>
               <p>TF orthologs</p>
            </st>
            <p>For each TF, the best human blastp hit was retrieved from WormBase. The Ensembl ID of each retrieved human protein was then used to extract the best <it>C. elegans </it>blastp hit and the corresponding percentage protein sequence identity using the data-mining tool BioMart. For the 44 TFs that did not yield a hit individual blast searches were performed. For 26 <it>C. elegans </it>TFs still no human homolog could be identified. Reciprocal best blast hits were considered putative orthologs and are highlighted in bold in Additional data file 8.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional file <supplr sid="S1">1</supplr> is a table listing the collection of predicted <it>C. elegans </it>transcription factors referred to as wTF2.0. Additional data file <supplr sid="S2">2</supplr> is a table providing an overview of manually curated genes that were left out of wTF2.0. Additional data file <supplr sid="S3">3</supplr> is a table listing wTF2.0 TFs that contain two distinct DNA binding domains. Additional data file <supplr sid="S4">4</supplr> is a table showing possible additions to wTF2.0. Additional data file <supplr sid="S5">5</supplr> is a table showing alternative splice forms and promoters. Additional data file <supplr sid="S6">6</supplr> is a table that provides an overview of protein interactions involving wTF2.0 TFs. Additional data file <supplr sid="S7">7</supplr> is a figure showing phylogenetic trees of worm TF families. Additional data file <supplr sid="S8">8</supplr> is a table listing putative human wTF2.0 TF homologs and orthologs.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Vanessa Vermeirssen, Inmaculada Barrasa and Job Dekker for reading the manuscript and members of the Hope and Walhout laboratories for suggestions. This work was supported by grants R33-CA097516 (NCI) to IAH, and R33-CA097516 (NCI) and R01-DK 068429 (NIDDK) to AJMW.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Transcription regulation and animal diversity.</p>
            </title>
            <aug>
               <au>
                  <snm>Levine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tjian</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>424</volume>
            <fpage>147</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01763</pubid>
                  <pubid idtype="pmpid" link="fulltext">12853946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Transcription of eukaryotic protein-coding genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2000</pubdate>
            <volume>34</volume>
            <fpage>77</fpage>
            <lpage>137</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.34.1.77</pubid>
                  <pubid idtype="pmpid" link="fulltext">11092823</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A genomic regulatory network for development.</p>
            </title>
            <aug>
               <au>
                  <snm>Davidson</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Rast</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Oliveri</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ransick</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Calestani</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yuh</snm>
                  <fnm>C-H</fnm>
               </au>
               <au>
                  <snm>Minokawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Amore</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hinman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Arenas-Mena</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>295</volume>
            <fpage>1669</fpage>
            <lpage>1678</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1069883</pubid>
                  <pubid idtype="pmpid" link="fulltext">11872831</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genomic analysis of regulatory network dynamics reveals large topological changes.</p>
            </title>
            <aug>
               <au>
                  <snm>Luscombe</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Madan Babu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>308</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02782</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372033</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>WormBase: a comprehensive data resource for <it>Caenorhabditis </it>biology and genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Atoshechkin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bastiani</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bieri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bradnam</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Canaran</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>CK</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D383</fpage>
            <lpage>9</lpage>
            <url>http://ws140.wormbase.org/</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540020</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608221</pubid>
                  <pubid idtype="doi">10.1093/nar/gki066</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Genomic analysis of gene expression in <it>C. elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hill</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Hunter</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Tsung</snm>
                  <fnm>BT</fnm>
               </au>
               <au>
                  <snm>Tucker-Kellogg</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>809</fpage>
            <lpage>812</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5492.809</pubid>
                  <pubid idtype="pmpid" link="fulltext">11052945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A gene expression map for <it>Caenorhabditis elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Eizinger</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wylie</snm>
                  <fnm>BN</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <fpage>2087</fpage>
            <lpage>2092</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1061603</pubid>
                  <pubid idtype="pmpid" link="fulltext">11557892</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Chromosomal clustering of muscle-expressed genes in <it>Caenorhabditis elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <fpage>975</fpage>
            <lpage>979</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12214599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p><it>C. elegans </it>ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Reboul</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vaglio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rual</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jacotot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bertin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Janky</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>35</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1140</pubid>
                  <pubid idtype="pmpid" link="fulltext">12679813</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A first version of the <it>Caenorhabditis elegans </it>promoterome.</p>
            </title>
            <aug>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Deplancke</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Boxem</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sequerra</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bosak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doucette-Stam</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hope</snm>
                  <fnm>IA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>2169</fpage>
            <lpage>2175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">528934</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489340</pubid>
                  <pubid idtype="doi">10.1101/gr.2497604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genome sequence of the nematode <it>C. elegans </it>: a platform for investigating biology.</p>
            </title>
            <aug>
               <au>
                  <cnm>The <it>C.elegans</it> Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>2012</fpage>
            <lpage>2018</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.282.5396.2012</pubid>
                  <pubid idtype="pmpid" link="fulltext">9851916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Closing in on the <it>C. elegans </it>ORFeome by cloning TWINSCAN predictions.</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Arumugam</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>577</fpage>
            <lpage>582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1074372</pubid>
                  <pubid idtype="pmpid" link="fulltext">15805498</pubid>
                  <pubid idtype="doi">10.1101/gr.3329005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The genome sequence of <it>Caenorhabditis briggsae </it>: A platform for comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Coghlan</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <fpage>E45</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">261899</pubid>
                  <pubid idtype="pmpid" link="fulltext">14624247</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0000045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The taxonomy of developmental control in <it>Caenorhabditis elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ruvkun</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hobert</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>2033</fpage>
            <lpage>2041</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.282.5396.2033</pubid>
                  <pubid idtype="pmpid" link="fulltext">9851920</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Zinc fingers in <it>Caenorhabditis elegans </it>: finding families and probing pathways.</p>
            </title>
            <aug>
               <au>
                  <snm>Clarke</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>2018</fpage>
            <lpage>2022</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.282.5396.2018</pubid>
                  <pubid idtype="pmpid" link="fulltext">9851917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Comparative genomics of the eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Yandeu</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gabor Miklas</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Hariharan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Fortini</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2204</fpage>
            <lpage>2215</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2204</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731134</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p><it>Arabidopsis</it> transcription factors: genome-wide comparative analysis among eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Riechmann</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Heard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Reuber</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Keddie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Adam</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pineda</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Ratcliffe</snm>
                  <fnm>OJ</fnm>
               </au>
               <au>
                  <snm>Samaha</snm>
                  <fnm>RR</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2105</fpage>
            <lpage>2110</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5499.2105</pubid>
                  <pubid idtype="pmpid" link="fulltext">11118137</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenetic analysis of the human basic helix-loop-helix proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Ledent</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Paquet</snm>
       