<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-2-r38</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Computational identification of the normal and perturbed genetic networks involved in myeloid differentiation and acute promyelocytic leukemia</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Chang</snm>
               <fnm>Li Wei</fnm>
               <insr iid="I1"/>
               <email>lwc1@ural.wustl.edu</email>
            </au>
            <au id="A2">
               <snm>Payton</snm>
               <mi>E</mi>
               <fnm>Jacqueline</fnm>
               <insr iid="I2"/>
               <email>jpayton@im.wustl.edu</email>
            </au>
            <au id="A3">
               <snm>Yuan</snm>
               <fnm>Wenlin</fnm>
               <insr iid="I3"/>
               <email>wyuan@im.wustl.edu</email>
            </au>
            <au id="A4">
               <snm>Ley</snm>
               <mi>J</mi>
               <fnm>Timothy</fnm>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>tley@im.wustl.edu</email>
            </au>
            <au id="A5">
               <snm>Nagarajan</snm>
               <fnm>Rakesh</fnm>
               <insr iid="I2"/>
               <email>rakesh@wustl.edu</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Stormo</snm>
               <mi>D</mi>
               <fnm>Gary</fnm>
               <insr iid="I4"/>
               <email>stormo@genetics.wustl.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biomedical Engineering, Washington University, St Louis, MO 63130, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Pathology and Immunology, Division of Laboratory Medicine, Washington University School of Medicine, St Louis, MO 63110, USA</p>
            </ins>
            <ins id="I3">
               <p>Division of Oncology, Department of Medicine, Washington University School of Medicine, St Louis, MO 63110, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Genetics, Washington University School of Medicine, St Louis, MO 63110, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>2</issue>
         <fpage>R38</fpage>
         <url>http://genomebiology.com/2008/9/2/R38</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18291030</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-2-r38</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>2</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>1</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>21</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>21</day>
               <month>02</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Chang et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Genetic networks in leukaemia</p>
      </shorttitle>
      <shortabs>
         <p>A dissection of the genetic networks and circuitries is described for two form of leukaemia. Integrating transcription factor binding and gene expression profiling, networks are revealed that underly this important human disease.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Acute myeloid leukemia (AML) comprises a group of diseases characterized by the abnormal development of malignant myeloid cells. Recent studies have demonstrated an important role for aberrant transcriptional regulation in AML pathophysiology. Although several transcription factors (TFs) involved in myeloid development and leukemia have been studied extensively and independently, how these TFs coordinate with others and how their dysregulation perturbs the genetic circuitry underlying myeloid differentiation is not yet known. We propose an integrated approach for mammalian genetic network construction by combining the analysis of gene expression profiling data and the identification of TF binding sites.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We utilized our approach to construct the genetic circuitries operating in normal myeloid differentiation versus acute promyelocytic leukemia (APL), a subtype of AML. In the normal and disease networks, we found that multiple transcriptional regulatory cascades converge on the TFs Rora and Rxra, respectively. Furthermore, the TFs dysregulated in APL participate in a common regulatory pathway and may perturb the normal network through Fos. Finally, a model of APL pathogenesis is proposed in which the chimeric TF PML-RAR&#945; activates the dysregulation in APL through six mediator TFs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This report demonstrates the utility of our approach to construct mammalian genetic networks, and to obtain new insights regarding regulatory circuitries operating in complex diseases in humans.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010004">Cell biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010012">Medicine</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Acute myeloid leukemia (AML) comprises a group of diseases characterized by abnormal myeloid differentiation and an accumulation of abnormal myeloid cells in the bone marrow and peripheral blood. Like other complex diseases in humans, AML is likely to be caused by disruption or dysregulation of multiple regulatory pathways. Recent studies have demonstrated a key role for aberrant transcriptional regulation in AML pathophysiology. Namely, many lineage-specific transcription factors (TFs), which coordinate normal myeloid development, are often mutated or altered in genetic fusions produced by chromosomal translocations <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Moreover, participants of many of these chimeric proteins are themselves TFs <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. These TFs may in turn interact with the normal genetic circuitry involved in myeloid differentiation and induce downstream events in AML pathogenesis. Although several chromosomal fusion proteins and myeloid TFs involved in leukemia have been identified and studied independently, how each individual TF interacts with others, and how each regulatory pathway correlates with others, remains unclear. Such comprehensive delineation of the genetic networks underlying both normal myeloid differentiation and leukemia is crucial to better understand AML pathophysiology and to develop improved therapeutic strategies.</p>
         <p>Uncovering genetic networks has been a great challenge in the post-genomic era. Breakthroughs in experimental methods such as chromatin immunoprecipitation followed by promoter arrays <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> have vastly improved the efficiency of TF target identification <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, but these methods may be applied to only one TF under one condition in one experiment and, therefore, are laborious and time consuming. Alternatively, computational methods seek to solve this problem using a systems biology approach. A majority of these methods have utilized analysis of gene expression profiling experiment data to construct a coexpression network. These approaches usually apply computational algorithms or machine learning techniques such as analytical methods <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, statistical regression <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, Bayesian networks <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, support vector machine <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, data processing inequality <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and minimum description length principle <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. However, due to the complexity of expression data (that is, the expression of many genes are measured only at a few data points), it is generally difficult to identify the dependencies and interactions between TFs and their target genes accurately. One common challenge of expression profiling based methods is to distinguish coregulation from coexpression. Namely, genes that are coherently expressed with a TF are not necessarily directly regulated by that TF. Therefore, most of these methods have focused on simpler organisms, such as bacteria or yeast, in which the number of TF genes is small and the structure of the regulatory network is simpler.</p>
         <p>Another approach to constructing genetic networks is based on identification of TF binding sites. This approach either predicts the transcriptional regulators of a set of coexpressed genes <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> or predicts the regulatory targets of TFs using their binding sites <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. In these methods, a model of TF binding elements is first built by experimental or computational methods. This model is then used to search for genes that have matching sites in their non-coding sequence. A network of transcriptional regulation may be constructed by identifying targets for each individual TF. The advantage of this approach over expression based methods is that it identifies direct regulatory targets of a TF. However, its performance is strongly based on the accuracy of the TF binding site identification. Due to the high false discovery rate of TF binding sites, this approach has primarily been successful in simpler organisms <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B21">21</abbr></abbrgrp>, and applying this approach to mammals is still difficult and challenging.</p>
         <p>Because each of these approaches has its own advantages and limitations, recent studies have taken an integrated approach to combine multiple types of information in order to make better predictions on regulatory networks. These methods include combining gene expression data with TF binding site analysis <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>, combining chromatin immunoprecipitation with gene expression data <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>, and combining chromatin immunoprecipitation data with regulatory motif discovery <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Although the performance of these integrated approaches is superior compared to the individual methods, most of them have been designed and tested only in lower eukaryotes. Therefore, the accurate identification of genetic networks in mammals remains a challenging problem.</p>
         <p>In this report, we present a novel approach to inferring genetic networks in mammals by combining gene expression profiling data and TF binding site analysis. We utilize this approach to study the genetic networks operating in myeloid differentiation and to elucidate how this circuitry goes awry in acute promyelocytic leukemia (APL), a subtype of AML. APL was chosen because its pathogenesis is likely based on a common mechanism involving transcriptional dysregulation. Namely, APL is characterized by the presence of a chromosomal fusion protein, PML-RAR&#945; <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. One participant of this chimeric protein, RAR&#945;, is a TF. Therefore, it is feasible that disruption of RAR&#945; function initiates the dysregulatory events in APL and is thus a good model for predicting the perturbation of genetic networks. Using our analytical approaches, we first constructed the genetic network underlying normal myeloid differentiation. In this network, multiple transcriptional regulatory cascades converge on Rora, indicating a novel function in modulating myeloid development. Next, using expression data in APL, we identified a set of dysregulated TFs and predicted their aberrantly expressed targets. These dysregulated TFs formed a genetic pathway distinct from the normal network that converged on Rxra and interacted with the normal network through Fos. Finally, we identified a set of direct targets for PML-RAR&#945; and proposed a role for this set in APL pathogenesis. Together, these results provide novel insights regarding the genetic circuitry underlying myeloid differentiation and APL pathophysiology, and our analytical approach demonstrates the utility of an integrated strategy for genetic network construction that may be applied to study other complex diseases in humans.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Construction of transcriptional regulatory networks</p>
            </st>
            <p>Our strategy to identify transcriptional regulatory networks combines two independent, but complementary methods: TF binding site identification and analysis of gene expression profiling data (Figure <figr fid="F1">1</figr>). TF binding site analysis is used to identify genes containing overrepresented binding sites of a TF, whereas analysis of gene expression profiling data results in one or more genes that are coherently expressed with a TF. Our hypothesis is that genes identified using both methods are more likely to be regulated by a TF than those genes identified by either of the methods alone. Thus, to predict regulatory targets of a TF by sequence analysis, all the TF binding sites that are conserved in human and mouse were identified, and binding probability scores of each TF binding each gene were calculated using a regulatory sequence analysis pipeline <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> (Figure <figr fid="F1">1a</figr>). To identify genes with a statistically significant binding score, the <it>P </it>value for observing a binding score by chance was calculated by randomizing all the identified TF binding sites in the genome (Figure <figr fid="F1">1b</figr>). In parallel, gene expression profiles were obtained from cultured bone marrow cells that are stimulated with the granulocyte colony-stimulating factor (G-CSF) growth factor to simulate the <it>in vivo </it>myeloid differentiation program <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. This system allowed us to identify important genes that are upregulated during myeloid differentiation. Template matching was used to identify genes whose expression is similar to a specific expression pattern or 'template'. This step identified six coexpressed gene clusters (Figure <figr fid="F1">1c</figr>). The regulatory interactions identified by sequence analysis were used to construct regulatory networks for each cluster, which were then consolidated to establish the regulatory network underlying myeloid differentiation (Figure <figr fid="F1">1d</figr>). Each of these steps is described in greater detail in the following sections.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Workflow of genetic networks construction</p>
               </caption>
               <text>
                  <p>Workflow of genetic networks construction. This workflow contains four major stages. <b>(a) </b>TF binding site identification. Genomic sequences of annotated genes are retrieved and aligned, and conserved TF binding sites in genomic sequences are identified. Binding probability scores are calculated using the identified binding sites. <b>(b) </b>TF target identification. The <it>P </it>value for observing a given binding probability score or higher by chance is calculated using permutation of TF binding sites. Using a <it>P </it>value cutoff, regulatory targets of each TF are identified. <b>(c) </b>Coexpressed gene cluster identification. Gene expression profiles are collected from experiments. Coherently expressed genes are identified and clustered. <b>(d) </b>Network construction. Genetic networks are identified for each coexpressed gene cluster using the target genes predicted for each TF within each gene cluster. The complete regulatory network is then constructed by consolidating individual networks. Hs, <it>Homo sapiens</it>; Mm, <it>Mus musculus</it>.</p>
               </text>
               <graphic file="gb-2008-9-2-r38-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Transcription factor target identification by regulatory sequence analysis</p>
            </st>
            <p>To identify regulatory targets of TFs using genomic sequence information, all of the evolutionarily conserved TF binding sites in the human genome were identified using 596 known TF binding profiles curated in the TRANSFAC <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> and JASPAR <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> databases. Using TF binding sites found in the non-coding sequence of a gene, binding probability scores <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, which assess the likelihood of a TF regulating a gene, were calculated for each TF-gene pair (Figure <figr fid="F1">1a</figr>). The <it>P </it>value for observing a binding score for a TF-gene pair by chance was then calculated by permutation of all the binding sites in the genome (Figure <figr fid="F1">1b</figr>). By applying a <it>P </it>value cutoff, genes that have statistically significant binding scores for a TF were identified as putative targets of that TF. The appropriate <it>P </it>value cutoff was determined empirically to be 0.005 by using the total number of transcriptional regulatory interactions estimated in a previous study <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. As a result, 106,997 TF-target gene pairs (that is, a TF regulating a target gene), including 6,474 TF-target TF pairs (that is, a TF regulating another TF gene) were identified. Using human-mouse ortholog gene pairs calculated using the HomoloGene database (see Materials and methods), these transcriptional regulatory interactions predicted in human were mapped to orthologs in mouse and thus generated 102,346 TF-target gene pairs. To determine if these regulatory relationships were supported by other computational prediction methods, these results were compared to the data curated in PReMod, a database of genome-wide <it>cis</it>-regulatory module predictions <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. As a result, 40.3% of these TF-target pairs were also predicted in PReMod.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of upregulated gene clusters during myeloid differentiation</p>
            </st>
            <p>To elucidate the transcriptional regulatory networks underlying myeloid development, expression profiling data were utilized from a previous study that employed a well characterized model of <it>in vitro </it>myeloid differentiation <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. In this model, G-CSF is used to stimulate the maturation of enriched myeloid progenitors. During the seven-day time course, the predominant cells in culture at days 2 and 3, at days 4 and 5, and at days 6 and 7 are promyelocytes, mid-myeloid cells, and terminally differentiated myeloid cells, respectively (Figure <figr fid="F2">2a</figr>). Using these data, we identified coherently expressed genes during myeloid differentiation. Because myeloid development is a unidirectional, progressive event, it was hypothesized that genes regulated during this process have relatively simple expression patterns (that is, up-regulated or down-regulated at one or more points during myeloid development). In fact, comparing gene expression profiles during the <it>in vitro </it>system to an exhaustive list of temporal patterns revealed that the majority of genes that were triggered at some point in the seven-day time course were up-regulated either on just one day or over two consecutive days (Additional data file 1). Therefore, we focused on these two types of expression patterns. Expression patterns that were upregulated on just one day (day 0 to day 7) or over two consecutive days (days 0 and 1 to days 6 and 7) during the myeloid development were defined as 'templates'. The Pearson's correlation coefficient was calculated for each gene expression profile and template. To focus on the transcriptional regulation of the most coherently expressed genes, a correlation coefficient cutoff of 0.9 was used to identify genes whose expression profiles match each template. Using this method, six coexpressed gene clusters were identified, including genes upregulated on day 0, 1, 2, or 7, and genes upregulated on days 0 and 1 or days 6 and 7 (Figure <figr fid="F2">2b</figr>; Additional data file 2). The clusters that were upregulated on day 0, on days 0 and 1, and on days 6 and 7 include the most genes (267, 138, and 118 genes, respectively). These clusters contain many well known genes that are associated with myeloid differentiation, including those encoding myeloid differentiation antigens (for example, <it>Cd2</it>, <it>Cd3d</it>, <it>Cd5</it>), and terminal myeloid differentiation genes (for example, <it>Mmp9</it>, <it>Fpr1</it>, and <it>Itgam</it>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Coexpressed gene clusters identified during myeloid development</p>
               </caption>
               <text>
                  <p>Coexpressed gene clusters identified during myeloid development. <b>(a) </b>The predominant cells in culture during the seven-day myeloid differentiation time course are promyelocytes, mid-myeloid cells, and terminally differentiated myeloid cells cultured at days 2 and 3, days 4 and 5, and days 6 and 7, respectively. <b>(b) </b>Coherently expressed gene clusters were identified for genes upregulated on just one day (UP0, UP1, UP2 and UP7) or over two consecutive days (UP01 and UP67) during <it>in vitro </it>myeloid differentiation.</p>
               </text>
               <graphic file="gb-2008-9-2-r38-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Integration of sequence analysis and expression profiling to construct genetic networks</p>
            </st>
            <p>The results of genomic sequence analysis and expression profiling analysis were integrated to construct the genetic network associated with each coexpressed gene cluster. The regulatory targets of myeloid TFs were identified by the intersection of genes found in the same cluster with a TF (that is, those having a similar expression pattern to that of the TF) and genes having statistically significant binding scores with that TF. Using this approach, 96 and 25 TF-target gene pairs were identified for the gene clusters upregulated on day 0 and days 0 and 1, respectively (Additional data file 3). Note that there were also genes that were down-regulated on days 0 and 1 and days 6 and 7 (Additional data file 4), but none of the TFs that were up-regulated at those time points and that had a known binding profile were predicted as a regulator of these genes. Thus, these genes may be regulated by other myeloid TFs whose binding profiles are not yet available. Using the entire set of predicted TF-target gene pairs, a genetic network was constructed for each gene cluster (Additional data file 5). In these networks, TFs and their target genes are represented by nodes, and a directed edge is drawn from a TF to a gene if the TF regulates the gene. The identified genetic networks allowed for the identification of previously unknown TFs that regulate myeloid differentiation as well as regulatory target genes of known myeloid regulators. For example, Egr1 was shown to be a candidate myeloid regulator by previous studies <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. However, what myeloid genes are directly regulated by Egr1 is unclear. Using our results, seven genes were found to be potentially regulated by Egr1, including three genes encoding TFs (Figure <figr fid="F3">3a</figr>). Among these genes, <it>Dusp5 </it>and <it>Egr1 </it>are both strongly upregulated after interleukin-5 treatment in eosinophils <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. <it>Lmna </it>modulates cellular responses to the transforming growth factor-beta 1 (<it>Tgfb1</it>) signaling pathway <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, and <it>Tgfb1 </it>is regulated by Egr1 <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Further computational analysis showed that all of these genes have evolutionarily conserved Egr1 binding sites in their proximal promoter region (Figure <figr fid="F3">3b</figr>). These results suggest that the regulatory networks constructed by our method identified potential target genes of Egr1.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Predicted regulatory targets of Egr1 in myeloid differentiation</p>
               </caption>
               <text>
                  <p>Predicted regulatory targets of Egr1 in myeloid differentiation. <b>(a) </b>Seven genes were identified as direct regulatory targets of Egr1. Three of these genes encode TFs (circle nodes). <b>(b) </b>Evolutionarily conserved Egr1 binding sites (red bars) were identified in the &#177; 2 kb proximal promoter region of the predicted target genes. All the Egr1 binding sites were conserved in human, mouse and rat except for PRDM16, whose rat ortholog was not available. Gene annotation information is color coded: blue, repetitive elements; yellow, conserved sequence; dark green, coding region; light green, untranslated region.</p>
               </text>
               <graphic file="gb-2008-9-2-r38-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Expanding genetic networks by additional myeloid transcription factors</p>
            </st>
            <p>The previous analysis identified myeloid TFs that are coherently expressed with their target genes during myeloid differentiation. However, there may be additional myeloid TFs that may not share a similar expression profile with their target genes (for example, myeloid TFs that are constantly expressed). The TF binding site analysis described above identifies TFs that regulate individual genes in the coexpressed gene clusters. Thus, PAP <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> was utilized to identify additional myeloid TFs that regulate a set of genes in myeloid gene clusters. PAP scores each TF and predicts TFs that regulate a set of coexpressed genes using a statistical model that is based on TF binding sites and that is used to calculate a <it>P </it>value to assess the statistical significance of this binding (see Materials and methods). Using a <it>P </it>value cutoff of 0.05, up to five additional TFs were identified for each of the coherently expressed gene clusters (Table <tblr tid="T1">1</tblr>). A majority of these additional TFs are known myeloid regulators or are involved in leukemia pathophysiology, including AML1, PU.1, and C/EBP&#945;. These TFs were added into the genetic network for each gene cluster as new nodes (Additional data file 6), and connections from these TFs to other genes in each individual network were made based on the TF-target gene pairs predicted by genomic sequence analysis (Additional data file 7).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Additional myeloid transcription factors identified by PAP</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Cluster</p>
                     </c>
                     <c ca="left">
                        <p>Accession</p>
                     </c>
                     <c ca="left">
                        <p>TF</p>
                     </c>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Association with myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP0</p>
                     </c>
                     <c ca="left">
                        <p>MA0081</p>
                     </c>
                     <c ca="left">
                        <p>SPI-B</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Spib</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Can functionally replace PU.1 in myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[85]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00961</p>
                     </c>
                     <c ca="left">
                        <p>VDR</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Vdr</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Involved in monocytic differentiation in human leukemia cells</p>
                     </c>
                     <c ca="center">
                        <p>[86]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00777</p>
                     </c>
                     <c ca="left">
                        <p>STAT</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Stat4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Expressed in early myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[79]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>MA0103</p>
                     </c>
                     <c ca="left">
                        <p>deltaEF1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Zfhz1a</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00655</p>
                     </c>
                     <c ca="left">
                        <p>PEA3</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Etv4</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP1</p>
                     </c>
                     <c ca="left">
                        <p>M00161</p>
                     </c>
                     <c ca="left">
                        <p>Oct-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pou2f1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Regulates PU.1</p>
                     </c>
                     <c ca="center">
                        <p>[87]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP7</p>
                     </c>
                     <c ca="left">
                        <p>M00658</p>
                     </c>
                     <c ca="left">
                        <p>PU.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Sfpi1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Known regulator of hematopoiesis</p>
                     </c>
                     <c ca="center">
                        <p>[88]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00329</p>
                     </c>
                     <c ca="left">
                        <p>Pax-9</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pax9</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00925</p>
                     </c>
                     <c ca="left">
                        <p>AP-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Jun</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Known regulator of myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[89]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M01031</p>
                     </c>
                     <c ca="left">
                        <p>HNF4</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Hnf4a</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>MA0081</p>
                     </c>
                     <c ca="left">
                        <p>SPI-B</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Spib</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Can functionally replace PU.1 in myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[85]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP01</p>
                     </c>
                     <c ca="left">
                        <p>M00217</p>
                     </c>
                     <c ca="left">
                        <p>USF</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Usf1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Regulates HOXB4 in normal and leukemia stem cells</p>
                     </c>
                     <c ca="center">
                        <p>[90]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00792</p>
                     </c>
                     <c ca="left">
                        <p>SMAD</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Smad1</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00805</p>
                     </c>
                     <c ca="left">
                        <p>LEF1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Lef1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Expression altered in acute leukemia</p>
                     </c>
                     <c ca="center">
                        <p>[77]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00799</p>
                     </c>
                     <c ca="left">
                        <p>Myc</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Myc</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Upregulated in AML and induces AML</p>
                     </c>
                     <c ca="center">
                        <p>[91]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>MA0002</p>
                     </c>
                     <c ca="left">
                        <p>AML-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Runx1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Known regulator of hematopoiesis</p>
                     </c>
                     <c ca="center">
                        <p>[88]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP67</p>
                     </c>
                     <c ca="left">
                        <p>M00133</p>
                     </c>
                     <c ca="left">
                        <p>Tst-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pou3f1</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00188</p>
                     </c>
                     <c ca="left">
                        <p>AP-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Jun</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Known regulator of myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[89]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00729</p>
                     </c>
                     <c ca="left">
                        <p>Cdx-2</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Cdx2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Involved in the ETV6-CDX2 fusion protein</p>
                     </c>
                     <c ca="center">
                        <p>[92]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00912</p>
                     </c>
                     <c ca="left">
                        <p>C/EBP</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Cebpa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Known regulator of hematopoiesis</p>
                     </c>
                     <c ca="center">
                        <p>[88]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>M00162</p>
                     </c>
                     <c ca="left">
                        <p>Oct-1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pou2f1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Regulates PU.1</p>
                     </c>
                     <c ca="center">
                        <p>[87]</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>PAP was used to identify overrepresented TF binding sites in each coexpressed gene clusrer. Almost all of these TFs are associated with myeloid development or myeloid leukemia.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Myeloid genetic networks among transcription factor genes</p>
            </st>
            <p>To systematically study the transcriptional regulatory mechanisms underlying myeloid differentiation, focus was given to TF genes in each coexpressed gene cluster. These TFs presumably modulate myeloid development by regulating the genes in the same gene cluster (that is, genes that were coexpressed with the TFs). Indeed, 12 of the 17 TFs found in these gene clusters were previously associated with myeloid differentiation or myeloid disorders (Table <tblr tid="T2">2</tblr>). Therefore, the regulatory networks of TFs were extracted from each genetic network identified for each gene cluster (Additional data file 8). Because genes in each coexpressed cluster are upregulated at different time points during myeloid development, each individual network represents a 'sub-network' of the entire transcriptional regulatory network for myeloid differentiation. Therefore, a comprehensive transcriptional regulatory network was constructed by combining each individual network identified in each coexpressed gene cluster. Namely, each individual network was joined by the common TFs to build a combined network (Figure <figr fid="F4">4a</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Genetic networks operating in myeloid development and APL</p>
               </caption>
               <text>
                  <p>Genetic networks operating in myeloid development and APL. In these networks, circle nodes represent TF genes. Genes that do not encode TFs are shown in rectangles. An arrow is drawn from TF-A to gene-B if TF-A regulates gene-B. <b>(a) </b>The predicted genetic network operating in myeloid differentiation. Multiple regulatory pathways in this network converge on one single TF, Rora. The expression profiles of the TF genes are color coded: blue, upregulated at day 0; yellow, upregulated at day 0 and day 1; purple, upregulated at day 7. <b>(b) </b>The seven TFs that are dysregulated in APL may be connected to form a common regulatory pathway. Aberrant expressions of these TFs are color coded: red, overexpression; green, underexpression. <b>(c) </b>The perturbation of the normal network by dysregulated TFs in APL. The normal and disease regulatory pathways converge on Rora and Rxra, respectively. The dysregulated pathway in APL may perturb the normal genetic network through Fos. Furthermore, many TFs in the normal network (shown in orange nodes) are predicted as direct targets of at least one TF dysregulated in APL (Table 4).</p>
               </text>
               <graphic file="gb-2008-9-2-r38-4"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Transcription factor genes identified in coexpressed gene clusters</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Cluster</p>
                     </c>
                     <c ca="left">
                        <p>Gene ID</p>
                     </c>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Expression correlation coefficient</p>
                     </c>
                     <c ca="left">
                        <p>Association with myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP0</p>
                     </c>
                     <c ca="left">
                        <p>13653</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Egr1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9928</p>
                     </c>
                     <c ca="left">
                        <p>Stimulates development of hematopoietic progenitor cells</p>
                     </c>
                     <c ca="center">
                        <p>[35]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>13654</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Egr2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9459</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>14013</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Evi1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9373</p>
                     </c>
                     <c ca="left">
                        <p>Involved in many myeloid disorders</p>
                     </c>
                     <c ca="center">
                        <p>[76]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>14281</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Fos</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9488</p>
                     </c>
                     <c ca="left">
                        <p>Modulates myeloid cell survival and differentiation</p>
                     </c>
                     <c ca="center">
                        <p>[42]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>16842</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Lef1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9325</p>
                     </c>
                     <c ca="left">
                        <p>Expression altered in acute leukemia</p>
                     </c>
                     <c ca="center">
                        <p>[77]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>17131</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Smad7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9041</p>
                     </c>
                     <c ca="left">
                        <p>Alters cell fate decisions of human hematopoietic repopulating cells</p>
                     </c>
                     <c ca="center">
                        <p>[78]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>18109</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycn</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9094</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>19883</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rora</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9335</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>20849</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Stat4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9562</p>
                     </c>
                     <c ca="left">
                        <p>Expressed in early myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[79]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>21414</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tcf7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9614</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>56458</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Foxo1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9575</p>
                     </c>
                     <c ca="left">
                        <p>Activates the mixed lineage leukemia gene</p>
                     </c>
                     <c ca="center">
                        <p>[80]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP01</p>
                     </c>
                     <c ca="left">
                        <p>14013</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Evi1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9042</p>
                     </c>
                     <c ca="left">
                        <p>Involved in many myeloid disorders</p>
                     </c>
                     <c ca="center">
                        <p>[76]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>14391</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gabpb1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9377</p>
                     </c>
                     <c ca="left">
                        <p>Activates the neutrophil elastase promoter</p>
                     </c>
                     <c ca="center">
                        <p>[81]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>16842</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Lef1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9593</p>
                     </c>
                     <c ca="left">
                        <p>Expression altered in acute leukemia</p>
                     </c>
                     <c ca="center">
                        <p>[77]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>18044</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Nfya</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9036</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>18109</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycn</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9509</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>19883</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rora</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9525</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>217082</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Hlf</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9537</p>
                     </c>
                     <c ca="left">
                        <p>E2A-HLF fusion abrogates apoptosis in leukemia cells</p>
                     </c>
                     <c ca="center">
                        <p>[82]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UP67</p>
                     </c>
                     <c ca="left">
                        <p>12013</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bach1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9274</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>17119</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mxd1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9168</p>
                     </c>
                     <c ca="left">
                        <p>Expression induced during myeloid development</p>
                     </c>
                     <c ca="center">
                        <p>[83]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>328572</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ep300</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.9097</p>
                     </c>
                     <c ca="left">
                        <p>Regulates Runx1 through acetylation of lysine residues</p>
                     </c>
                     <c ca="center">
                        <p>[84]</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>These TFs were all upregulated on just one day or over two consecutive days during the seven-day <it>in vitro </it>myeloid differentiation. Twelve of these seventeen TF genes aare known to regulate myeloid differentiation or be involved in myeloid leukemia.</p>
               </tblfn>
            </tbl>
            <p>The identified genetic network for myeloid differentiation has several interesting features. First, the integration of individual networks into the complete myeloid development network was consistent with the time at which each individual gene cluster was upregulated (that is, genes upregulated earlier occupied the upper part of the network and genes upregulated later occupied the lower part). Second, multiple regulatory pathways converge on a single TF, Rora. These characteristics highlight the genetic circuitry that may be operating in the myeloid differentiation.</p>
         </sec>
         <sec>
            <st>
               <p>Genetic networks in acute promyelocytic leukemia</p>
            </st>
            <p>To identify transcriptional regulatory mechanisms that are dysregulated in acute promyelocytic leukemia, TFs that are differentially expressed in APL compared to normal cells were identified as follows. Gene expression profiles in APL were collected from a mCG-PML-RAR&#945; knock-in mouse model <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. PML-RAR&#945; is a fusion protein observed in most APL patients, and the majority of PML-RAR&#945; knock-in mice eventually develop APL <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Because APL is characterized by an arrest of the normal myeloid differentiation program in the promyelocytic stage, and promyelocytes are the predominant cells at day 2 and day 3 of the <it>in vitro </it>myeloid maturation program, gene expression data measured at these two days were compared to those measured in APL mice. The software program SAM (Significance Analysis of Microarrays) <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was used to identify 602 differentially expressed genes in APL. Among these genes, 472 were overexpressed in APL, and 130 were underexpressed. These differentially expressed genes include seven encoding TFs (Stat5b, Fos, Atf1, Arid5b, Rxra, Mybl2, Nfyc) that have characterized binding profiles, termed the APL dysregulome (Table <tblr tid="T3">3</tblr>). Four of the seven dysregulated TFs, Atf1, Fos, Rxra and Stat5b, have been previously associated with acute myeloid leukemia <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>, and ninety-six genes that were differentially expressed in APL were identified as targets of these TFs (Additional data file 9).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Dysregulated transcription factors in APL</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Gene ID</p>
                     </c>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Gene name</p>
                     </c>
                     <c ca="center">
                        <p>APL expression</p>
                     </c>
                     <c ca="center">
                        <p>SAM score</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20851</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Stat5b</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>signal transducer and activator of transcription 5B</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>12.05</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>14281</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Fos</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>FBJ osteosarcoma oncogene</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>9.71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>11908</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Atf1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>activating transcription factor 1</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>9.68</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>71371</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Arid5b</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AT rich interactive domain 5B (Mrf1 like)</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>9.63</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20181</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rxra</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>retinoid X receptor alpha</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>9.48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>17865</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mybl2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>myeloblastosis oncogene-like 2</p>
                     </c>
                     <c ca="center">
                        <p>Down</p>
                     </c>
                     <c ca="center">
                        <p>-9.47</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>18046</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Nfyc</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>nuclear transcription factor-Y gamma</p>
                     </c>
                     <c ca="center">
                        <p>Down</p>
                     </c>
                     <c ca="center">
                        <p>-10.21</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>SAM was used to identify seven TFs that are differentially expressed in APL and in normal promyelocytes. Five of these TFs are upregulated in APL, while two of them are downregulated.</p>
               </tblfn>
            </tbl>
            <p>To test if these seven TFs in the APL dysregulome participate in a common regulatory pathway (that is, their abnormal expression is the cause or result of a single regulatory cascade), transcriptional regulatory interactions between these TFs were identified using regulatory sequence analysis (see above and Materials and methods). Namely, TF-A and TF-B are connected to form a regulatory pathway if TF-B has over-represented binding sites of TF-A. As a result, six of these seven TFs were shown to form a common regulatory pathway (Figure <figr fid="F4">4b</figr>). The last TF, Nfyc, neither regulates nor is regulated by any of the other six TFs, but it regulates a gene (<it>Actl6a</it>) in common with Atf1. Two of these seven TFs, Nfyc and Mybl2, are expressed at lower levels in APL than in normal promyelocytes. It is interesting to note that while Atf1 regulates Fos and Actl6a, Fos is up-regulated and Actl6a is down-regulated in APL. This suggests that Atf1 may act as both a transcriptional activator and a repressor, possibly depending on different cooperative factors. This hypothesis is supported by a previous study of Atf1 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Thus, to identify cooperative TFs of Atf1, TFs that regulate Fos or Actl6a and have similar expression profiles to Atf1 in APL were identified. This analysis identified Egr2 and Nfyb as cofactors of Atf1 in the regulation of Fos and Actl6a, respectively.</p>
            <p>To study how the APL dysregulome perturbs the genetic network of normal myeloid differentiation, the regulatory cascade of the seven dysregulated TFs were joined with the normal myeloid genetic network (Figure <figr fid="F4">4c</figr>). All the predicted regulatory interactions between any two TFs within the normal genetic network were maintained, and the TFs that were predicted as targets of at least one dysregulated TF were identified (Table <tblr tid="T4">4</tblr>, Figure <figr fid="F4">4c</figr>). Interestingly, these two networks could be simply combined through a common TF, Fos. This result suggests that the genetic network of normal myeloid differentiation is perturbed, and this dysregulation is mediated through Fos. Furthermore, these results predict a change in the genetic circuitry wherein the normal cascade is regulated by Rora while the pathophysiology observed in APL is mediated by Rxra. <it>Bona fide </it>downstream targets of Rora and Rxra need to be identified, and their functions in normal myeloid development or APL need to be elucidated to further validate the role of Rora and Rxra in normal or leukemic biology.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Transcription factors in the normal myeloid genetic network regulated by dysregulated transcription factors in APL</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Differentially expressed TFs in APL</p>
                     </c>
                     <c ca="left">
                        <p>Regulated TFs in normal myeloid development</p>
                     </c>
                     <c ca="center">
                        <p><it>P </it>value</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arid5b</p>
                     </c>
                     <c ca="left">
                        <p>Rora</p>
                     </c>
                     <c ca="center">
                        <p>~0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arid5b</p>
                     </c>
                     <c ca="left">
                        <p>Tcf7</p>
                     </c>
                     <c ca="center">
                        <p>0.0019</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Atf1</p>
                     </c>
                     <c ca="left">
                        <p>Egr1</p>
                     </c>
                     <c ca="center">
                        <p>~0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Atf1</p>
                     </c>
                     <c ca="left">
                        <p>Hlf</p>
                     </c>
                     <c ca="center">
                        <p>0.0041</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Atf1</p>
                     </c>
                     <c ca="left">
                        <p>Rora</p>
                     </c>
                     <c ca="center">
                        <p>0.0015</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Atf1</p>
                     </c>
                     <c ca="left">
                        <p>Smad7</p>
                     </c>
                     <c ca="center">
                        <p>0.0015</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fos</p>
                     </c>
                     <c ca="left">
                        <p>Egr2</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fos</p>
                     </c>
                     <c ca="left">
                        <p>Rora</p>
                     </c>
                     <c ca="center">
                        <p>~0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fos</p>
                     </c>
                     <c ca="left">
                        <p>Tcf7</p>
                     </c>
                     <c ca="center">
                        <p>0.0036</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fos</p>
                     </c>
                     <c ca="left">
                        <p>Zfhx1a</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mybl2</p>
                     </c>
                     <c ca="left">
                        <p>Rora</p>
                     </c>
                     <c ca="center">
                        <p>~0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rxra</p>
                     </c>
                     <c ca="left">
                        <p>Egr1</p>
                     </c>
                     <c ca="center">
                        <p>0.0003</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rxra</p>
                     </c>
                     <c ca="left">
                        <p>Etv4</p>
                     </c>
                     <c ca="center">
                        <p>0.0014</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Stat5b</p>
                     </c>
                     <c ca="left">
                        <p>Foxo1</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>These regulatory targets were identified using permutation of TF binding sites.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>PML-RAR&#945; and APL pathogenesis</p>
            </st>
            <p>While the proposed genetic network predicted that the APL pathway converged on Rxra, the relationship between PML-RAR&#945; and the APL dysregulome was not uncovered. Thus, to test whether the APL dysregulome (Figure <figr fid="F4">4b</figr>) is caused either directly or indirectly by PML-RAR&#945;, PML-RAR&#945; TF targets were identified using RAR&#945; binding profiles. Because it has been shown that PML-RAR&#945; binds to a much broader range of binding site architectures than the normal RAR&#945;, eight binding profiles of PML-RAR&#945; with various orientations and spacings (DR2, DR3, DR4, DR5, DR6, DR12, IR0, and ER8) were created based on a previous experimental study <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> in addition to the RAR&#945; binding profiles in TRANSFAC. Using the same binding site permutation algorithm, none of the TFs in the APL dysregulome were predicted as direct targets of PML-RAR&#945;, suggesting that dysregulation of these TFs was mediated by other TFs.</p>
            <p>Thus, to identify direct targets of PML-RAR&#945;, gene expression profiles collected from cultured bone marrow cells derived from young mCG-PML-RAR&#945; knock-in mice were used (see Materials and methods). First, to identify genes dysregulated in preleukemic promyelocytes, expression data from days 0, 2, and 7 in normal and preleukemic promyelocytes were analyzed using SAM. No genes were differentially expressed at day 0, whereas 73 and 1,028 genes were differentially expressed at day 2 and day 7, respectively. Using the eight binding profiles of PML-RAR&#945; and the RAR&#945; binding profiles in TRANSFAC, six TFs differentially expressed at day 7 were predicted as direct PML-RAR&#945; targets (Table <tblr tid="T5">5</tblr>). Interestingly, one of the TFs in the normal network, Egr1, is also predicted as a PML-RAR&#945; target; however, Egr1 is expressed at a normal level in young, preleukemic mice. Therefore, it is possible that Egr1 may not be a direct mediator of PML-RAR&#945; in leukemogenesis. Collectively, these results suggest a model of APL pathogenesis in which PML-RAR&#945; regulates the APL dysregulome through six mediator TFs. This circuitry ultimately converges to create the APL dysregulome, hallmarked by activation of Rxra, which then triggers downstream events (Figure <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The proposed model of APL pathogenesis induced by PML-RAR&#945;</p>
               </caption>
               <text>
                  <p>The proposed model of APL pathogenesis induced by PML-RAR&#945;. PML-RAR&#945; may activate the dysregulation of several TFs in the disease regulatory pathway in APL through six mediator TFs (dashed blue arrow). This regulatory circuitry ultimately converges on the overexpression of Rxra. Red circle, overexpressed TFs; green circle, underexpressed TFs; green box, underexpressed genes; orange circle, TFs in the normal network that are predicted as targets of dysregulated TFs in APL; gray circle, other TFs in the normal network; purple circle, PML-RAR&#945;.</p>
               </text>
               <graphic file="gb-2008-9-2-r38-5"/>
            </fig>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Predicted mediator transcription factors of PML-RAR&#945;</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Gene ID</p>
                     </c>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Gene name</p>
                     </c>
                     <c ca="center">
                        <p>APL expression</p>
                     </c>
                     <c ca="center">
                        <p>SAM score</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>12053</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bcl6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>B-cell leukemia/lymphoma 6</p>
                     </c>
                     <c ca="center">
                        <p>Down</p>
                     </c>
                     <c ca="center">
                        <p>-7.39</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>21429</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ubtf</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>upstream binding transcription factor, RNA polymerase I</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>4.49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>56070</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tcerg1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>transcription elongation regulator 1 (CA150)</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>6.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>56463</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Snd1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>expressed sequence AL033314</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>4.44</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>71458</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bcor</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Bcl6 interacting corepressor</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>4.90</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>109151</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Chd9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chromodomain helicase DNA binding protein 9</p>
                     </c>
                     <c ca="center">
                        <p>Up</p>
                     </c>
                     <c ca="center">
                        <p>5.64</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>SAM and the binding profiles of Rara and PML-RAR&#945; were used to identify direct regulatory targets of PML-RAR&#945;. These six TFs may activate the dysregulation of other aberrantly expressed TFs in APL.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>A novel approach to genetic network identification</p>
            </st>
            <p>In this report, we propose a novel approach to genetic network identification that combines two independent types of information, gene expression profiling data and computational identification of TF binding sites. Using gene expression data, genes coherently expressed with TFs were first identified. In parallel, direct regulatory targets of TFs were predicted by a computational model that calculates binding scores for each coexpressed gene and assesses statistical significance using binding site permutation. These two types of information were then integrated to construct the genetic network for each coexpressed gene cluster, which were subsequently consolidated into a comprehensive network. We used this approach to identify the genetic network in normal myeloid differentiation and to determine how this network is perturbed in APL. This approach is general and may be applied to delineate genetic networks operating in other complex human diseases.</p>
         </sec>
         <sec>
            <st>
               <p>Prediction of direct regulatory targets using TF binding site identification</p>
            </st>
            <p>A major challenge in expression data-based genetic network construction is the ability to distinguish direct regulatory targets of TFs from indirectly regulated downstream genes. To predict direct targets accurately, coexpressed genes were scored using the identified TF binding sites, and the statistical significance of each score was determined. This method is different from other existing approaches in several ways. First, it does not compare the frequency of TF binding sites to a specific set of reference sequences. Instead, the binding scores of genes in the entire genome are calculated, and the <it>P </it>value for observing a score is determined by permutation of all the binding sites. Secondly, unlike most previous methods where TF binding sites are only identified in the proximal promoter region, our model considers binding sites located in evolutionarily conserved sequences in the entire gene locus. This includes a significant number of additional, highly conserved sites found in introns and distant genomic regions. These modifications and improvements make our predictions more accurate due to a better scoring model and more comprehensive due to a more complete set of evolutionarily conserved TF binding sites <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Using the <it>in vitro </it>cell culture system to model human myeloid differentiation</p>
            </st>
            <p>In this study, we used an <it>in vitro</it>, G-SCF driven myeloid differentiation system to model normal myeloid maturation and compared its gene expression profile to that of an <it>in vivo </it>APL mouse model. Although the <it>in vitro </it>GCS-F driven myeloid differentiation is not equivalent to <it>in vivo </it>differentiation, it is a validated surrogate that mimics human myeloid differentiation <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Moreover, this system is currently the only practical and technically feasible platform for the study of normal murine myeloid development. Therefore, we believe the data provided by this system can be used to infer hematopoietic gene regulation.</p>
         </sec>
         <sec>
            <st>
               <p>Comparing the identified genetic networks with previous experimental data</p>
            </st>
            <p>The genetic network of TFs identified by our computational method provides several new insights into the normal and aberrant regulatory pathways that may drive myeloid differentiation and in acute promyelocytic leukemia, respectively. These predictions are compared to several previous findings.</p>
            <p>First, we observed that the normal and disease regulatory cascades converge on Rora and Rxra, respectively. Rora and Rxra belong to the same family of nuclear receptors and are related to Rara, and Rxra forms a heterodimer with Rara. Although the specific functions of Rora in myeloid differentiation are still unknown, all three nuclear receptors bind to very similar DNA sequences ((A/G)GGTCA) <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B48">48</abbr></abbrgrp>, implying that they may regulate the same genes. In addition, the human <it>RORA </it>gene is located within a highly conserved region on chromosome 15 near the human <it>PML </it>gene <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Therefore, our results suggest a novel role for Rora in myeloid development. Furthermore, the dysregulated pathway in APL leads to an overexpression of Rxra, and the absence of Rxra in the normal network implies that Rxra is a potential effector protein in APL but not in normal myeloid development. In fact, recent studies have shown that although Rxra is not required in normal myelopoiesis <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, it is an essential component for the PML-RAR&#945; complex to initiate APL in mice <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. Therefore, the prediction of Rxra as a potential key participant in APL pathogenesis is supported by <it>in vivo </it>studies.</p>
            <p>Second, the normal and the disease pathways did not share many common TFs. Instead, only one TF in the normal network, Fos, which modulates myeloid cell survival and differentiation <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, was dysregulated in APL. Our analysis identified that the TFs dysregulated in APL may mediate the perturbation of the normal myeloid genetic network through Fos. Indeed, previous studies have demonstrated that PML-RAR&#945; promotes cell growth by activating Fos <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, and that PML-RAR&#945; directly functions as a co-repressor of Fos in the absence of retinoic acid <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Our data suggest that over-expression of Fos in APL may have an aberrant activation function as well. This over-expression of Fos may be induced by PML-RAR&#945; and may, in turn, cause the dysregulation of other TFs involved in APL.</p>
            <p>Third, we identified a set of six TFs as direct targets of PML-RAR&#945;. Among these six TFs, five of them were overexpressed in APL, and one TF was underexpressed, suggesting PML-RAR&#945; may act as both a repressor and an activator. In agreement with this observation, PML-RAR&#945; was first demonstrated as an enhanced repressor of retinoic acid target genes by its stronger binding to corepressors than wild-type RAR&#945; <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>, but it is becoming clear that PML-RAR&#945; may also function as an activator <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B57">57</abbr></abbrgrp>. While a version of PML-RAR&#945; that can only repress gene expression is still leukemogenic <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, whether PML-RAR&#945; can trigger APL pathogenesis by gene activation is unknown. Our analysis suggests that PML-RAR&#945; may indeed be an activator and provides a potential mechanism through which this may occur.</p>
            <p>Finally, our data are consistent with a previous study that showed that relative levels of Sfpi1 regulate macrophage versus neutrophil differentiation <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Importantly, at low Sfpi1 levels (neutrophil differentiation), we did not find that <it>Egr1</it>/<it>2 </it>are targets of Sfpi1 whereas Laslo <it>et al</it>. <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> found that <it>Egr1</it>/<it>2 </it>are targets of Sfpi1 during macrophage differentiation where Sfpi1 is expressed at high levels. Therefore, there may be weaker binding sites for Sfpi1 in <it>Egr1 </it>or <it>Egr2 </it>promoters, and thus the activation of <it>Egr1 </it>and <it>Egr2 </it>may require a higher expression level of Sfpi1.</p>
         </sec>
         <sec>
            <st>
               <p>Genetic networks may be expanded using additional information on binding profiles</p>
            </st>
            <p>The computational approach used in this study is based on the binding profiles of TFs in TRANSFAC and JASPAR. Therefore, the quality of our prediction and the accuracy of our conclusions may be dependent upon the quality of the binding models in these databases. Some TFs require other co-factors to accomplish their regulatory functions, and so their binding specificities or preferences may also be dependent on their binding partners. Such information may not have been included in the TRANSFAC or JASPAR models. For example, the TF RXRA studied in this work typically forms a heterodimer with various co-factors, including RAR, VDR, TR, or PPAR at various spacings. The four binding profiles we used were created using different binding partners of RXRA, including PPAR (M00518), RAR and TR (M00963), and VDR (M00966). Thus, our prediction did consider the binding site of RXRA with different binding partners. However, our prediction may not include all the allowable spacings between the two sites. Therefore, our results may be further improved when more precise and complete binding profiles are available.</p>
            <p>While the direct targets of PML-RAR&#945; in APL were computationally identified, the direct targets of these mediator TFs could not be determined. Therefore, the genetic network in APL was not fully elucidated (Figure <figr fid="F5">5</figr>). In order to identify targets of these mediator TFs, knowledge of their binding profiles is required, information that is currently not available. In fact, while there are more than 2,000 TFs predicted in the human genome <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, only a quarter of them have known binding profiles. Thus, the predicted genetic network may be further improved and expanded when more TF binding profiles become available. Additional TF binding profiles may be generated by traditional protein-DNA binding assays <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> or by computational approaches that utilize evolutionary conservation of functional sequences. This latter approach is used to predict DNA binding profiles on a genome-wide scale. For example, DNA binding patterns may be identified by calculating the conservation rate of a given oligonucleotide across the genome <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, or by clustering genes that share common conserved sequences <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Thus, the employment of such methods is a rational next step toward the refinement of genetic networks. Integration of this computational component would not only increase our understanding of the molecular mechanisms underlying APL but would also facilitate the construction of more comprehensive regulatory networks driving other complex diseases.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have developed an integrated approach to mammalian genetic network construction by combining gene expression profiling data and TF binding site identification. Using this technique, we have predicted Rxra as a key regulator in APL and Fos as one of the key mediators of PML-RAR&#945;. These results provide new insights about the pathophysiology of APL. Our approach may be applied to study the genetic circuitry operating in other complex diseases in humans.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Genomic sequence collection and ortholog identification</p>
            </st>
            <p>The genomic sequences of human, mouse, and rat were acquired from the NCBI's Genome Assembly Project <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Genome build 35 was used for human, build 33 for mouse and build 3 for rat. The genomic sequence of a gene locus was defined as the sequence between the end of the upstream gene and the end of the gene itself. Within this range, protein coding sequences were masked and excluded from the search of TF binding sites. Repetitive elements were also masked by the program RepeatMasker <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> using slow and sensitive mode (the -s flag). Human, mouse and rat ortholog gene groups (13,194 in total) were identified using the annotation of NCBI's HomoloGene database as previously described <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Genomic sequences of the genes in the same ortholog group were then aligned using the program TBA <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>TF binding site identification and binding probability score calculation</p>
            </st>
            <p>To identify TF binding sites, 596 vertebrate TF binding profiles were collected from the TRANSFAC (version 9.1) and JASPAR databases. The program PATSER <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> was used to search for matches of these profiles in the genome using default cutoff scores (the -li option). This cutoff score is calculated as follows: for each position, PATSER scores the subsequence and calculatesthe <it>P </it>value for observing the same score or higher at thatposition <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. A <it>P </it>value cutoff is calculated for each binding profile using its information content. The score corresponding to that <it>P </it>value cutoff is then chosen to be the cutoff score. After all the TF binding sites were identified, binding probability scores <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> for each TF-gene pair were then calculated using evolutionarily conserved TF binding sites found in the non-coding sequence of a gene. For mammals, functional regulatory elements have been found in distant upstream regions <abbrgrp><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr></abbrgrp> as well as intronic sequences <abbrgrp><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr></abbrgrp>. However, searching for TF binding sites in the entire intergenic sequence and in the entire gene locus frequently results in a high false discovery rate. Therefore, to overcome this problem, we considered only evolutionarily conserved TF binding sites in the 'proximal promoter region' of a gene and in the most conserved sequence regions within a gene locus defined by multi-species conserved sequences (MCSs) <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>.</p>
            <p>The proximal promoter region of a gene was defined as the 10 kb upstream sequence and the 5 kb downstream sequence from the transcription start site, regardless of the presence of an upstream gene or a downstream gene. MCSs are defined as the top 5% conserved sequences in the human genome when compared to another 11 vertebrate genomes <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. The human MCSs were first downloaded, and the MCSs in mouse and rat were defined by mapping the human MCSs to the mouse or rat genomes using multiple sequence alignments generated by TBA. These multiple sequence alignments were also used to identify evolutionarily conserved TF binding sites, which were defined as sites located within conserved sequence regions, present in all the species and aligned in the multiple sequence alignment <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Calculating statistical significance for a binding probability score</p>
            </st>
            <p>To predict regulatory targets of TFs, genes that have statistically significant binding scores <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> for a TF were identified. The statistical significance of a given binding score was evaluated by the <it>P </it>value for observing an equal or higher score by chance. This <it>P </it>value was calculated by permutation of all the TF binding sites in the genome. In this algorithm, each individual binding site of a TF was randomly assigned to genes in the genome based on a precalculated probability distribution calculated as follows: the probability for a gene to acquire a given TF site is the length of the TF binding site search range of that gene (that is, the proximal promoter region and the MCSs; see above) divided by the sum of the TF binding site search ranges of all the genes in the genome. This permutation of TF binding sites was performed for 10,000 iterations. After each iteration, a new binding probability score was calculated for each gene using the TF sites randomly assigned to that gene. The <it>P </it>value for observing a score for a gene was then calculated by the number of iterations where a binding score equal to or higher than the true score was obtained for that gene, divided by the total number of iterations.</p>
         </sec>
         <sec>
            <st>
               <p>Gene expression profiling data</p>
            </st>
            <p>Gene expression profiles in normal myeloid differentiation were obtained from a G-CSF stimulated <it>in vitro </it>myeloid differentiation model <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Expression data were collected from cultured bone marrow cells in two independent experiments during a seven-day time course. Gene expression data in APL were collected from bone marrow cells of six adult PML-RAR&#945; knock-in mice based on a previously developed murine APL model <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Expression profiles of the cultured bone marrow cells were also collected from two sets of young PML-RAR&#945; knock-in mice. For the expression data collected from cultured cells, only probesets that were present on at least one day in at least one experiment were considered.</p>
         </sec>
         <sec>
            <st>
               <p>Coexpressed gene cluster identification</p>
            </st>
            <p>The coexpressed gene clusters during myeloid differentiation were identified using the template matching clustering tool in the software suite FunctionExpress <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. The templates for each upregulated expression pattern were created manually, including genes that were upregulated on just one day or over two consecutive days during the seven day time course. The Pearson's correlation coefficient was used to quantify the similarity between the template and the expression profile of each probeset. A cutoff of 0.9 was applied to the correlation coefficient to identify the coexpressed gene clusters.</p>
         </sec>
         <sec>
            <st>
               <p>Additional myeloid TF identification</p>
            </st>
            <p>PAP <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> was used to identify additional TFs that may regulate genes in each coexpressed gene cluster. For each gene cluster, PAP ranked all the TFs by their R-scores, which were calculated based on overrepresentation of their binding sites in the coexpressed genes. To find TFs that have a statistically significant R-score, a <it>P </it>value for each R-score was calculated using 10,000 randomly selected gene clusters of the same size. For each random set, the R-scores for each TF were calculated, and the <it>P </it>value for a TF was calculated as the number of gene clusters that had an equal or higher score than that of the original gene cluster, divided by 10,000, the total number of random sets.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>AML, acute myeloid leukemia; APL, acute promyelocytic leukemia; G-CSF, granulocyte colony-stimulating factor; MCSs, multi-species conserved sequences; SAM, Significance Analysis of Microarrays; TF, transcription factor.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LWC, RN and GDS developed the approach and wrote the manuscript. LWC performed the analysis. JEP, WY and TJL provided the experimental data. JEP and TJL participated in the discussion of the results and edited the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following Additional data files are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is a spreadsheet showing the number of genes that were up-regulated at any number of consecutive days during the <it>in vitro </it>myeloid development system. Additional data file <supplr sid="S2">2</supplr> is a spreadsheet containing a list of genes identified in each coexpressed gene cluster. Additional data file <supplr sid="S3">3</supplr> is a spreadsheet containing a list of predicted regulatory targets of TFs identified in each coexpressed gene cluster. Additional data file <supplr sid="S4">4</supplr> is a spreadsheet containing a list of down-regulated genes in the <it>in vitro </it>myeloid differentiation system. Additional data file <supplr sid="S5">5</supplr> is a PDF file showing the complete genetic networks (including genes that do not encode TFs) for genes upregulated at day 0, and at day 0 and day 1. Additional data file <supplr sid="S6">6</supplr> is a PDF file showing the expanded genetic networks (including additional TFs identified by PAP) for genes upregulated at day 0, and at day 0 and day 1. Additional data file <supplr sid="S7">7</supplr> is a spreadsheet containing a list of myeloid TFs identified by PAP and their regulatory relationships to other genes in the myeloid development networks. Additional data file <supplr sid="S8">8</supplr> is a PDF file showing regulatory networks for TF genes identified in each coexpressed gene cluster. Additional data file <supplr sid="S9">9</supplr> is a spreadsheet containing a list of TFs dysregulated in APL and their predicted regulatory targets.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Number of genes up-regulated at any number of consecutive days during the <it>in vitro </it>myeloid development system</p>
            </caption>
            <text>
               <p>These genes were identified by calculating the Pearson's correlation coefficients of their expression profiles to each individual expression pattern.</p>
            </text>
            <file name="gb-2008-9-2-r38-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Genes identified in each coexpressed gene cluster</p>
            </caption>
            <text>
               <p>Genes identified in each coexpressed gene cluster.</p>
            </text>
            <file name="gb-2008-9-2-r38-S2.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Predicted regulatory targets of TFs identified in each coexpressed gene cluster</p>
            </caption>
            <text>
               <p>Predicted regulatory targets of TFs identified in each coexpressed gene cluster.</p>
            </text>
            <file name="gb-2008-9-2-r38-S3.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Down-regulated genes in the <it>in vitro </it>myeloid differentiation system</p>
            </caption>
            <text>
               <p>Down-regulated genes in the <it>in vitro </it>myeloid differentiation system.</p>
            </text>
            <file name="gb-2008-9-2-r38-S4.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Complete genetic networks (including genes that do not encode TFs) for genes upregulated at day 0, and at day 0 and day 1.</p>
            </caption>
            <text>
               <p><b>(a) </b>Genes upregulated at day 0; <b>(b) </b>genes upregulated at day 0 and day 1.</p>
            </text>
            <file name="gb-2008-9-2-r38-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>The expanded genetic networks (including additional TFs identified by PAP) for genes upregulated at day 0, and at day 0 and day 1</p>
            </caption>
            <text>
               <p><b>(a) </b>Genes upregulated at day 0; <b>(b) </b>genes upregulated at day 0 and day 1. Additional TFs identified by PAP and their regulatory relationships to other genes in the myeloid networks are colored red.</p>
            </text>
            <file name="gb-2008-9-2-r38-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>Myeloid TFs identified by PAP and their regulatory relationships to other genes in the myeloid development networks</p>
            </caption>
            <text>
               <p>Myeloid TFs identified by PAP and their regulatory relationships to other genes in the myeloid development networks.</p>
            </text>
            <file name="gb-2008-9-2-r38-S7.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>Regulatory networks for TF genes identified in each coexpressed gene cluster</p>
            </caption>
            <text>
               <p><b>(a) </b>TF genes upregulated at day 0. <b>(b) </b>TF genes upregulated at day 0 and day 1. <b>(c) </b>TF genes upregulated at day 6 and day 7. <b>(d) </b>TF genes upregulated at day 7. Color coding in these networks denotes how these TF genes were identified: blue, TFs identified by coexpression; green, additional TFs identified by PAP; yellow, TFs identified by both coexpression and PAP.</p>
            </text>
            <file name="gb-2008-9-2-r38-S8.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>TFs dysregulated in APL and their predicted regulatory targets</p>
            </caption>
            <text>
               <p>TFs dysregulated in APL and their predicted regulatory targets.</p>
            </text>
            <file name="gb-2008-9-2-r38-S9.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Mark Watson for useful discussions and suggestions. We thank Nicole Grieselhuber for reviewing the manuscript. This work was supported by NIH grants HG00249 (GDS, LWC), GM63340 (GDS, LWC), K22LM008290 (LWC, RN), CA08962 (JEP, WY, TJL) and CA101937 (JEP, WY, TJL).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The leukemic fusion gene AML1-MDS1-EVI1 suppresses CEBPA in acute myeloid leukemia by activation of Calreticulin.</p>
            </title>
            <aug>
               <au>
                  <snm>Helbling</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>BU</fnm>
               </au>
               <au>
                  <snm>Timchenko</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Hagemeijer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jotterand</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meyer-Monard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lister</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rowley</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Huegli</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fey</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Pabst</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>13312</fpage>
            <lpage>13317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">516564</pubid>
                  <pubid idtype="pmpid" link="fulltext">15326310</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404731101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The myeloid master regulator transcription factor PU.1 is inactivated by AML1-ETO in t(8;21) myeloid leukemia.</p>
            </title>
            <aug>
               <au>
                  <snm>Vangala</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Heiss-Neumann</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Rangatia</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Schoch</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tenen</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Hiddemann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Behre</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Blood</source>
            <pubdate>2003</pubdate>
            <volume>101</volume>
            <fpage>270</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1182/blood-2002-04-1288</pubid>
                  <pubid idtype="pmpid" link="fulltext">12393465</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Identification of breakpoints in t(8;21) acute myelogenous leukemia and isolation of a fusion transcript, AML1/ETO, with similarity to <it>Drosophila </it>segmentation gene, runt.</p>
            </title>
            <aug>
               <au>
                  <snm>Erickson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Look</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Whisenant</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Raimondi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lasher</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Trujillo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rowley</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Drabkin</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Blood</source>
            <pubdate>1992</pubdate>
            <volume>80</volume>
            <fpage>1825</fpage>
            <lpage>1831</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1391946</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The PML-RAR alpha fusion mRNA generated by the t(15;17) translocation in acute promyelocytic leukemia encodes a functionally altered RAR.</p>
            </title>
            <aug>
               <au>
                  <snm>de The</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lavau</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marchio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chomienne</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Degos</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dejean</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1991</pubdate>
            <volume>66</volume>
            <fpage>675</fpage>
            <lpage>684</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(91)90113-D</pubid>
                  <pubid idtype="pmpid" link="fulltext">1652369</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wyrick</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Tagne</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Volkert</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Control of pancreas and liver gene expression by HNF transcription factors.</p>
            </title>
            <aug>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Zizlsperger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Volkert</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rolfe</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>GI</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</