<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>gb-2011-12-4-r34</ui>
<ji>GBJ</ji>
<fm>
<dochead>Research</dochead>
<bibl>
<title><p>The role of chromatin accessibility in directing the widespread, overlapping patterns of <it>Drosophila </it>transcription factor binding</p></title>
<aug><au id="A1" ce="yes"><snm>Li</snm><fnm>Xiao-Yong</fnm><insr iid="I1"/><insr iid="I2"/><email>XYLi@lbl.gov</email></au>
<au id="A2" ce="yes"><snm>Thomas</snm><fnm>Sean</fnm><insr iid="I3"/><email>sthomas@stamlab.org</email></au>
<au id="A3"><snm>Sabo</snm><mi>J</mi><fnm>Peter</fnm><insr iid="I3"/><email>psabo@stamlab.org</email></au>
<au id="A4"><snm>Eisen</snm><mi>B</mi><fnm>Michael</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I4"/><email>mbeisen@gmail.com</email></au>
<au ca="yes" id="A5"><snm>Stamatoyannopoulos</snm><mi>A</mi><fnm>John</fnm><insr iid="I3"/><email>jstam@STAMLAB.ORG</email></au>
<au ca="yes" id="A6"><snm>Biggin</snm><mi>D</mi><fnm>Mark</fnm><insr iid="I1"/><email>mdbiggin@lbl.gov</email></au></aug>
<insg>
<ins id="I1"><p>Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road MS 84-171, Berkeley, CA 94720, USA</p></ins>
<ins id="I2"><p>Howard Hughes Medical Institute, University of California Berkeley, 176 Stanley Hall #3220, Berkeley, CA 94720, USA</p></ins>
<ins id="I3"><p>Department of Genome Sciences, University of Washington, Foege S310A, 1705 NE Pacific Street, Box 355065, Seattle, WA 98195, USA</p></ins>
<ins id="I4"><p>Department of Molecular and Cell Biology, University of California Berkeley, 176 Stanley Hall #3220, Berkeley, CA 94720, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>4</issue>
<fpage>R34</fpage>
<url>http://genomebiology.com/2011/12/4/R34</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2011-12-4-r34</pubid><pubid idtype="pmpid">21473766</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>19</day><month>3</month><year>2011</year></date></rec><acc><date><day>7</day><month>4</month><year>2011</year></date></acc><pub><date><day>7</day><month>4</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Li et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>In <it>Drosophila </it>embryos, many biochemically and functionally unrelated transcription factors bind quantitatively to highly overlapping sets of genomic regions, with much of the lowest levels of binding being incidental, non-functional interactions on DNA. The primary biochemical mechanisms that drive these genome-wide occupancy patterns have yet to be established.</p>
</sec>
<sec><st><p>Results</p></st>
<p>Here we use data resulting from the DNaseI digestion of isolated embryo nuclei to provide a biophysical measure of the degree to which proteins can access different regions of the genome. We show that the <it>in vivo </it>binding patterns of 21 developmental regulators are quantitatively correlated with DNA accessibility in chromatin. Furthermore, we find that levels of factor occupancy <it>in vivo </it>correlate much more with the degree of chromatin accessibility than with occupancy predicted from <it>in vitro </it>affinity measurements using purified protein and naked DNA. Within accessible regions, however, the intrinsic affinity of the factor for DNA does play a role in determining net occupancy, with even weak affinity recognition sites contributing. Finally, we show that programmed changes in chromatin accessibility between different developmental stages correlate with quantitative alterations in factor binding.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Based on these and other results, we propose a general mechanism to explain the widespread, overlapping DNA binding by animal transcription factors. In this view, transcription factors are expressed at sufficiently high concentrations in cells such that they can occupy their recognition sequences in highly accessible chromatin without the aid of physical cooperative interactions with other proteins, leading to highly overlapping, graded binding of unrelated factors.</p>
</sec>
</sec>
</abs>
</fm>
<meta>
<classifications>
<classification id="30010005" subtype="man_spc_id" type="BMC">Development</classification>
<classification id="300100010" subtype="man_spc_id" type="BMC">Genome studies</classification>
<classification id="300100015" subtype="man_spc_id" type="BMC">Model organisms</classification>
<classification id="300100016" subtype="man_spc_id" type="BMC">Molecular biology</classification>
</classifications>
</meta>
<bdy>
<sec><st><p>Background</p></st>
<p><it>In vivo </it>crosslinking studies show that a wide range of animal transcription factors each bind to many thousands of DNA regions throughout the genome and that not all of this binding is necessarily functional (for example, <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>). For example, our studies of over 20 transcriptional regulators in the <it>Drosophila </it>blastoderm embryo show that the few hundred most highly bound DNA regions include all of these proteins' known target <it>cis</it>-regulatory modules (CRMs) and are preferentially associated with developmental control genes and genes whose expression is strongly patterned in the blastoderm <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr></abbrgrp>. In contrast, the thousands of more poorly bound regions are preferentially associated with genes not transcribed in the early embryo and/or housekeeping genes, and are frequently present in poorly conserved non-coding DNA or in protein coding sequences. In addition, there is a surprisingly high overlap in the genomic regions bound by biochemically and functionally unrelated animal transcription factors <it>in vivo </it><abbrgrp><abbr bid="B3">3</abbr><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr></abbrgrp>, with the distinct biological specificities of factors being determined by quantitative differences in their occupancy on these shared regions <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B17">17</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
<p>What biochemical mechanisms could be responsible for these widespread, overlapping patterns of animal factor binding? Most animal transcriptional regulators recognize short degenerate DNA sequences that occur frequently near most genes <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Only a subset of these sites, however, are highly occupied <it>in vivo </it>in a given cellular or developmental context, and the level of occupancy at each site correlates only poorly with a given factor's intrinsic DNA recognition properties <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B6">6</abbr><abbr bid="B14">14</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Thus, as long recognized, one or more mechanisms must differentially alter the relative occupancy of factors across the genome.</p>
<p>Two such mechanisms have been characterized. The first is direct heteromeric cooperative interactions between pairs of factors bound to adjacent sites in the genome that selectively increase occupancy only to regions where appropriately spaced sites for both factors occur <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. The second is competition for DNA binding with other sequence-specific factors, nucleosomes or other chromatin-associated proteins that selectively reduces binding at a subset of sites <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. While there is evidence that both have some influence on DNA binding <it>in vivo </it><abbrgrp><abbr bid="B12">12</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>, there has been no systematic effort to quantify the relative contributions of these positive and negative effects on the overall pattern of factor binding.</p>
<p>One common set of models invokes a prominent role for direct cooperative interactions, suggesting that transcription factors cannot significantly occupy their functional target sites without such interactions between factors <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. These 'direct cooperativity' models have been used to predict that transcription factors will bind highly selectively in non-overlapping patterns, each factor binding to relatively few genes <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>, and that factors with similar intrinsic DNA recognition properties, such as the HOX proteins, may be targeted to different genes through differential interactions with cooperativity partners <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B30">30</abbr></abbrgrp>. These predictions, however, are difficult to reconcile with the measured patterns of DNA binding <it>in vivo </it>and, in the case of HOX factors, with their ultimate regulation of a very large pool of common genes <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B46">46</abbr></abbrgrp>.</p>
<p>Instead, to explain the widespread, overlapping patterns of factor binding in animals, we have previously suggested that transcription factors are expressed at sufficiently high cellular concentrations that they detectably occupy most high and moderate affinity recognition sequences that are physically accessible in the context of chromatin, without the aid of heteromeric cooperative interactions with other factors <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B41">41</abbr><abbr bid="B46">46</abbr></abbrgrp>. In this 'widespread binding' model, nucleosomes and other chromatin proteins would block access to much of the genome <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B25">25</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. At the same time, accessible, nucleosome-depleted regions, such as active CRMs, would be bound at high levels by factors exerting an essential function, but would also be bound at lower levels by other factors interacting opportunistically with fortuitously occurring cognate recognition sequences.</p>
<p>Here we seek to quantify the relative contributions of the direct cooperativity and widespread DNA binding models in the context of the quantitative genome-wide <it>in vivo </it>binding patterns of <it>Drosophila </it>developmental regulators. Genome wide DNaseI digestion data are used to provide a biophysical measurement of the access an exogenous protein has to DNA in nuclei. Since the access a protein has to DNA must affect its level of occupancy on DNA, the DNaseI data measure the contribution to the final pattern of factor binding due to competitive inhibition of binding. In contrast, local genome accessibility is not altered, <it>per se</it>, by direct heteromeric cooperative interactions. Thus, by establishing the quantitative correlation between accessibility and levels of factor binding, we can both determine accessibility's contribution to DNA binding and set an upper limit, by the extent of non-correlation, for the contribution that direct heteromeric cooperativity makes.</p>
<p>It is important to note that indirect cooperativity, a mechanism by which binding of two or more factors mutually increase each others ability to competitively displace a nucleosome without making direct physical contacts with each other <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>, is quite distinct from direct cooperativity. Indirect cooperativity is fully consistent with the widespread binding model. It assumes that at least some factors are expressed at sufficiently high concentrations that they can bind their sites without direct interactions with other factors. It also provides a ready explanation for the high overlap in factor binding because it naturally leads to increased binding of any factors whose recognition sites lie within the DNA region from which a nucleosome has been displaced. Here, however, we make no attempt to distinguish whether this or other mechanisms are the chief cause of the differential accessibility of the genome. By using direct independent measurements of accessibility and then by considering the effect this has on each factor separately, we unlink targeting of individual factors from the challenging question of how the hundreds of transcription factors expressed in each cell, together with the chromatin remodeling/modification enzymes that they recruit, alter chromatin structure <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Results</p></st>
<sec><st><p>Factor binding is concentrated in highly accessible chromatin</p></st>
<p>The accessibility of genomic DNA sequences in the context of chromatin <it>in vivo </it>has classically been studied using digestion of DNA in isolated nuclei by the non-specific endonuclease DNaseI <abbrgrp><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. Using a high-throughput version of this assay (DNase-seq) <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr></abbrgrp>, we have previously profiled DNA accessibility genome-wide in native chromatin at high resolution across stages 5, 9, 10, 11 and 14 of <it>Drosophila </it>embryogenesis, spanning the first 11 hours of development (S Thomas <it>et al</it>., submitted). Even though data for independent replicas from collections of embryos at the same stage of development were highly reproducible (r &#8805; 0.91; S Thomas <it>et al</it>., submitted; Additional files <supplr sid="S1">1</supplr> and <supplr sid="S2">2</supplr>), to derive a conservative picture of chromatin accessibility, and to minimize the effect of experimental variability, we reanalyzed these data to identify genomic regions with increased DNaseI sensitivity at a 5% false discovery rate (FDR) that were concordant between pairs of replicas. We identified between 16,217 and 24,373 such DNaseI accessible regions per stage, collectively spanning 9 to 13% of the euchromatic genome (Additional files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>). Consistent with our original results (S Thomas <it>et al</it>., submitted), approximately half of the accessible regions present at a particular stage show little change in accessibility over time, whereas the remaining regions display marked increases or decreases in DNaseI sensitivity during embryogenesis.</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Replica DNase-seq data closely agree</b>.</p></text>
<file name="gb-2011-12-4-r34-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S2">
<title><p>Additional file 2</p></title>
<text><p><b>Summary of 5% FDR accessible regions in euchromatic DNA for stage 5, 9, 10, 11 and 14 embryos</b>.</p></text>
<file name="gb-2011-12-4-r34-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S3">
<title><p>Additional file 3</p></title>
<text><p><b>5% FDR accessible regions in the euchromatic genome for stage 5, 9, 10, 11 and 14 embryos</b>.</p></text>
<file name="gb-2011-12-4-r34-S3.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>We next compared the DNase-seq data for stage 5 embryos to <it>in vivo </it>DNA binding data at the same stage. At this point in development, the embryo is a single layer of approximately 6,000 undifferentiated cells, which are each likely to have similar patterns of chromatin structure, providing a relatively simple system for our analysis <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. We used DNA binding data for 21 sequence-specific transcription factors, TFIIB, and the transcriptionally active form of RNA polymerase II that had been quantified by genome-wide <it>in vivo </it>formaldehyde crosslinking (ChIP-chip) <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr></abbrgrp>. Only high-confidence bound regions above the 1% FDR threshold were examined, giving a conservative picture of the total amount of factor binding.</p>
<p>An extensive set of controls indicate that our ChIP-chip data provide an accurate measure of the relative levels of factor directly contacting the different genomic DNA regions to which they are crosslinked <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B41">41</abbr><abbr bid="B65">65</abbr></abbrgrp>. For example, <it>in vitro </it>controls show that formaldehyde crosslinking of purified transcription factors to naked DNA is proportional to factor occupancy on the DNA; quantitative PCR and bacterial artificial chromosome 'spike-in' experiments show that the whole genome amplification used in our ChIP-chip experiments preserves the relative differences in enrichment of various genomic regions; and <it>in vivo </it>UV crosslinking results show that similar data are obtained when protein-protein crosslinking is absent. In light of a recent paper showing that sonication of intact nuclei can lead to the preferential release of short (&lt;350 bp) DNA fragments from accessible genomic regions <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, we also note that the crosslinked DNA used in our ChIP-chip experiments is sonicated only after it has been purified away from non-covalently attached proteins and that the resulting DNA fragments are mostly longer than 350 bp (mean size approximately 600 bp). As a result, our crosslinked input DNA samples show no evidence of bias towards genomic regions that are either highly accessible to DNaseI digestion or highly bound by factors (Additional file <supplr sid="S4">4</supplr>). Further, the quantification of ChIP-chip data (ChIP-chip scores) used throughout this and our previous work, with the exception of that in Additional file <supplr sid="S4">4</supplr>, were calculated by dividing the array hybridization signal from a factor immunoprecipitation by the array signal from the exactly matched, 'input' crosslinked DNA sample <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, which would correct for any DNA extraction bias that had occurred.</p>
<suppl id="S4">
<title><p>Additional file 4</p></title>
<text><p><b>ChIP-chip input crosslinked DNA is not appreciably enriched in either highly bound or highly accessible genomic regions</b>.</p></text>
<file name="gb-2011-12-4-r34-S4.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>Figure <figr fid="F1">1</figr> compares DNase-seq and the ChIP-chip data for the <it>even-skipped </it>(<it>eve</it>) locus at stage 5. This well characterized target gene contains five CRMs that molecular genetics indicate are each bound and regulated by combinations of the 21 regulatory factors at this stage of embryo development <abbrgrp><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. These proteins are expressed in different spatial patterns and either activate or repress transcription such that, while the <it>eve </it>gene is only expressed in a subset of cells, each CRM is expected to be accessible and bound by at least some of these factors in all cells <abbrgrp><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. Consistent with this, all five CRMs show peaks of DNA binding for many of the 21 factors (Figure <figr fid="F1">1</figr>). Local peaks of DNaseI accessibility align very well with both the CRMs and peaks of factor binding, with the DNase-seq peaks varying in intensity (reflected in the density of mapped DNA sequence tags) over approximately a ten-fold range (Figure <figr fid="F1">1</figr>). While this variation in peak intensity is higher than that expected and may reflect differences in experimental bias in each assay, analyses presented later in the paper indicate that, when averaged over multiple regions, DNase-seq signals do correlate with levels of factor occupancy. A high overlap between genomic regions identified by DNase-seq and ChIP-chip is also apparent across much longer regions of the genome (Figure <figr fid="F2">2</figr>), wherein the strongest peaks of factor binding almost uniformly align with major peaks of DNaseI accessibility in stage 5 chromatin.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>DNaseI accessibility and <it>in vivo </it>DNA binding by transcription factors across the <it>eve </it>locus</p></caption><text>
   <p><b>DNaseI accessibility and <it>in vivo </it>DNA binding by transcription factors across the <it>eve </it>locus</b>. DNA binding in stage 5 embryos is shown as ChIP-chip scores (blue) for 675-bp windows that fall above a 1% FDR threshold for 21 sequence-specific transcription factors, TFIIB and the transcriptionally active form of RNA polymerase II (POLII). The sequence-specific factors are grouped into three major regulatory classes that regulate patterning along the Dorsal-Ventral axis of the embryo (D-V), initiate patterning along the Anterior-Posterior axis (Early A-P), or establish later pair rule patterns along the Anterior-Posterior axis (Pair rule A-P). DNaseI accessibility at stage 5 is shown for 75-bp windows of sequence tag density (red) along with the locations of accessible regions above the 5% FDR threshold (black bars). At the bottom, the locations of major RNA transcripts are shown (grey) as well as the autoregulatory CRM (Auto) and the four stripe initiation CRMs (S3/7, S2, S4/6 and S1/5) (green). Nucleotide coordinates in the genome are given in base pairs.</p>
</text><graphic file="gb-2011-12-4-r34-1" hint_layout="single"/></fig>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>DNaseI accessibility and <it>in vivo </it>DNA binding by transcription factors across a 200-kb genomic region</p></caption><text>
   <p><b>DNaseI accessibility and <it>in vivo </it>DNA binding by transcription factors across a 200-kb genomic region</b>. The figure is labeled using the same conventions in Figure <figr fid="F1">1</figr> except that the RNA transcript locations are shown in light blue at the bottom.</p>
</text><graphic file="gb-2011-12-4-r34-2" hint_layout="double"/></fig>
<p>To quantify the global correlation between factor binding and DNaseI accessibility, we first determined the proportion of ChIP-chip peak regions that overlapped 5% FDR accessible regions at stage 5 (see Materials and methods; Additional file <supplr sid="S5">5</supplr>). Combining data from all 21 factors, RNA polymerase II and TFIIB, we observed a strikingly high overlap (mean 87%, range 71 to 99%, probability of observing a higher overlap randomly &lt;1 &#215; 10<sup>-16</sup>). We also determined the proportion of accessible regions that coincided with genomic regions bound by one or more of the 21 sequence-specific factors. Although stage 5 DNaseI accessible regions encompass only approximately 12% of the euchromatic genome, 61% of these regions coincide with binding for at least one of the 21 factors (probability of a greater overlap occurring by chance &lt;1 &#215; 10<sup>-16</sup>), or 65% if RNA polymerase and TFIIB binding are included. By contrast, only 7% of the genome that is at least 500 bp away from accessible chromatin is covered by 1% FDR ChIP-chip regions (probability of getting less overlap at random &lt;1 &#215; 10<sup>-16</sup>). Moreover, the most accessible regions displayed even higher levels of overlap with regulatory factor binding sites. Of the 5,000 most accessible regions, 95% are occupied by at least one of the 21 factors above the 1% FDR threshold, with nearly monotonically decreasing overlap with decreasing chromatin accessibility (Additional file <supplr sid="S6">6</supplr>).</p>
<suppl id="S5">
<title><p>Additional file 5</p></title>
<text><p><b>The overlap between 1% FDR ChIP-chip peaks versus 5% FDR accessible regions</b>.</p></text>
<file name="gb-2011-12-4-r34-S5.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S6">
<title><p>Additional file 6</p></title>
<text><p><b>Most highly accessible regions are bound by regulatory factors</b>.</p></text>
<file name="gb-2011-12-4-r34-S6.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Quantitative relationship between genome accessibility and factor occupancy</p></st>
<p>Because our previous studies establish that it is the level of regulatory factor occupancy on a given genomic region that is an important determinant of function, rather than if a region is detectably bound or not <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr></abbrgrp>, we next performed a quantitative comparison of factor binding and accessibility. We calculated median DNaseI scores for cohorts of 200 ChIP-chip peaks, grouped and ranked according to their ChIP-chip scores in stage 5 embryos (see Materials and methods). This analysis revealed that, for each factor, the regions that are most highly bound are significantly more accessible than regions bound at lower levels (Figure <figr fid="F3">3</figr>; Additional file <supplr sid="S7">7</supplr>). This result is most compelling for those factors with the most regions identified above the 1% FDR ChIP-chip threshold, since in these cases false positives should not contribute significantly to the median DNaseI score above this threshold; notably, however, all factors show this trend.</p>
<suppl id="S7">
<title><p>Additional file 7</p></title>
<text><p><b>The level of transcription factor occupancy correlates with the degree of DNaseI accessibility</b>.</p></text>
<file name="gb-2011-12-4-r34-S7.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Levels of factor occupancy and genome accessibility correlate</p></caption><text>
   <p><b>Levels of factor occupancy and genome accessibility correlate</b>. The median DNase-seq tag density in non-overlapping cohorts of 200 1-kb ChIP-chip peaks is shown down the ChIP-chip rank list (continuous lines). The ChIP-chip data are from stage 5 embryos and the DNaseI accessibility data are from stages 5 (green) and 14 (purple). The 95% confidence limit for median DNaseI accessibility of each cohort is indicated. Shown also is the percent of ChIP-chip peaks that are overlapped by 5% FDR accessible regions in stage 5 embryos (dashed green line). The regions most highly bound by transcription factors are to the left along the x-axis and results are plotted as far as the 25% FDR cutoff. The location of the ChIP-chip 1% FDR threshold is indicated by the vertical black dotted line. Results for the regulatory transcription factors <b>(a) </b>Dichaete (D) and <b>(b) </b>Twist (TWI) are shown. Additional file <supplr sid="S7">7</supplr> shows plots for all 21 regulators.</p>
</text><graphic file="gb-2011-12-4-r34-3" hint_layout="double"/></fig>
<p>We confirmed that the aforementioned relationship is quantitative - that is, that the lower median accessibility of cohorts of poorly bound regions largely derives from reduced accessibility of each region rather than a reduced number of accessible regions versus highly bound cohorts. This is illustrated clearly by the fact that the proportion of ChIP-chip peaks that overlap accessible regions reduces more gradually down the rank list than do DNaseI scores (Figure <figr fid="F3">3</figr>; Additional file <supplr sid="S7">7</supplr>). For example, for the sequence-specific factor Dichaete (D) at ChIP-chip rank 2,000 when accessibility is reduced by two-fold, the percent overlap drops only marginally.</p>
<p>The plots in Figure <figr fid="F3">3</figr> also show that regions bound highly by factors in stage 5 are much less accessible at stage 14 than at stage 5, even though we have previously shown that both stages contain a similar number and length of accessible regions, and the median accessibility of accessible regions at stage 14 is fully 78% of that at stage 5 (S Thomas <it>et al</it>., submitted; Additional file <supplr sid="S2">2</supplr>). Thus, most genomic regions bound at high levels by regulatory factors at stage 5 have their accessibility specifically reduced at later stages of development, consistent with the known inactivation of many early active CRMs.</p>
</sec>
<sec><st><p>Genome accessibility and intrinsic factor specificity determine occupancy <it>in vivo</it></p></st>
<p>The above analyses establish a close quantitative relationship between genome accessibility and local levels of factor binding. They do not, however, establish whether the pattern of binding is determined principally by genome accessibility <it>per se</it>, or whether it is the binding of regulatory factors that potentiates chromatin accessibility. As described in the Introduction, ultimately, it is the combined action of all of the hundreds of sequence-specific factors in a given cell, together with the chromatin remodeling proteins that they recruit, that is likely to determine the pattern of chromatin accessibility <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. We therefore focused our attention on the more immediately tractable question of whether, for each single factor in turn, observed chromatin accessibility (however originated mechanistically) has a major effect on determining that factor's binding pattern.</p>
<p>To address this question, we first compared the influence on levels of <it>in vivo </it>factor occupancy of both genome accessibility and the intrinsic specificity of factors for naked DNA as determined <it>in vitro </it>using purified protein. All of the 16 factors for which there are sufficiently accurate position weight matrices (PWMs) of intrinsic specificity <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B70">70</abbr></abbrgrp> (Berkeley <it>Drosophila </it>Transcription Network Project (BDTNP), unpublished data) were examined. We segmented the genome into accessible and closed chromatin compartments based on the 5% FDR accessible regions. We then scanned each compartment and annotated all significant matches to each of the 16 factor PWMs, and then classified these into several affinity cohorts. To provide a negative control, we also separately identified for each factor equivalent cohorts of matches to sets of PWMs for which the order of nucleotide positions had been randomly permutated. At the location of each match to the genuine or scrambled PWMs, the median ChIP-chip score of the region &#177;250 bp around the match was calculated. The highest affinity cohorts typically contained 1,000 recognition site occurrences in accessible chromatin and 12,000 in closed regions, whereas the lowest affinity cohorts contained 0.8 and 6.6 million in these regions (Table <tblr tid="T1">1</tblr>).</p>
<tbl hint_layout="double" id="T1"><title><p>Table 1</p></title><caption><p>Frequency of DNA affinity cohort recognition sequences in accessible and closed genome regions</p></caption><tblbdy cols="4">
      <r>
         <c ca="center">
            <p>
               <b>Affinity cohort</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b><it>P</it>-values included</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Mean number of PWM matches for factors in 5% FDR accessible regions</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Mean number of PWM matches for factors in closed genomic regions</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>-5</p>
         </c>
         <c ca="center">
            <p><it>P </it>&lt;1e-4.5</p>
         </c>
         <c ca="center">
            <p>1,145</p>
         </c>
         <c ca="center">
            <p>12,344</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>-4</p>
         </c>
         <c ca="center">
            <p>1e-3.5 > <it>P </it>> 1e-4.5</p>
         </c>
         <c ca="center">
            <p>9,938</p>
         </c>
         <c ca="center">
            <p>96,853</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>-3</p>
         </c>
         <c ca="center">
            <p>1e-2.5 > <it>P </it>> 1e-3.5</p>
         </c>
         <c ca="center">
            <p>94,126</p>
         </c>
         <c ca="center">
            <p>825,406</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>-2</p>
         </c>
         <c ca="center">
            <p>1e-1.5 > <it>P </it>> 1e-2.5</p>
         </c>
         <c ca="center">
            <p>811,773</p>
         </c>
         <c ca="center">
            <p>6,596,274</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>This analysis revealed that, among genomic regions that contain genuine factor recognition sequences of similar affinity, those in the accessible chromatin (dark red lines in Figure <figr fid="F4">4</figr> and Additional file <supplr sid="S8">8</supplr>) are clearly bound at significantly higher levels <it>in vivo </it>than those in inaccessible chromatin regions (dark blue lines in Figure <figr fid="F4">4</figr> and Additional file <supplr sid="S8">8</supplr>). The fact that the same pattern is evident for 16 factors with widely varying DNA binding specificities (Additional file <supplr sid="S8">8</supplr>) strongly suggests that the observed correlation is not the result of any sequence bias in regions detected by the DNase-seq assay, but instead reflects genuinely different properties of accessible and closed chromatin regions. Additionally, the fact that such large effects are seen when averaged over thousands to millions of genomic regions strongly suggests that accessibility has a major influence on <it>in vivo </it>occupancy genome-wide.</p>
<suppl id="S8">
<title><p>Additional file 8</p></title>
<text><p><b>Comparison of ChIP-chip scores for occurrences of DNA recognition sequences in accessible versus closed chromatin regions</b>.</p></text>
<file name="gb-2011-12-4-r34-S8.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Factor recognition sites in DNaseI accessible regions are more highly bound <it>in vivo </it>than sites in closed chromatin</p></caption><text>
   <p><b>Factor recognition sites in DNaseI accessible regions are more highly bound <it>in vivo </it>than sites in closed chromatin</b>. Separately for each transcription factor, all significant recognition sequences in the euchromatic genome for four affinity cohorts were identified using PWMs derived from <it>in vitro </it>DNA binding data (Table 1) <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B70">70</abbr></abbrgrp>. In addition, matches to ten PWMs derived by random permutation of nucleotide position order were derived for each factor. Sites in each affinity cohort for both the genuine and scrambled PWMs present were each classified as either accessible or inaccessible, using the 5% FDR DNaseI accessible regions to define accessible regions (Table 1). The median ChIP-chip scores (y-axis) for the 500-bp regions &#177;250 bp around recognition sites in each affinity cohort were plotted separately for accessible (red lines) and inaccessible (blue lines) genomic regions. Dark red and blue lines show results for the genuine factor PWMs, light red and blue lines the median result for the scrambled PWMs. The highest affinity cohort is to the left (x-axis). Web logo representations of the PWM representing the highest and lowest affinity cohorts of genuine recognition sites are shown at the bottom. The 95% confidence limits for the median ChIP-chip scores are indicated. Plots for <b>(a) </b>CAD, <b>(b) </b>GT, <b>(c) </b>KNI, and <b>(d) </b>HRY are shown. Additional file <supplr sid="S8">8</supplr> provides similar plots for all 16 factors for which sufficiently accurate PWMs are available.</p>
</text><graphic file="gb-2011-12-4-r34-4" hint_layout="double"/></fig>
<p>Further, in 13 out of 16 cases (excepting KNI, PRD, and FTZ), genomic regions with higher intrinsic affinity recognition sequences have higher ChIP-chip scores. Even moderate affinity sites, though, appear to mediate DNA binding <it>in vivo</it>, albeit at a lower level, as these are occupied at higher levels than matches to scrambled PWMs of equivalent affinity for all 16 factors (compare the dark red and light red lines in Figure <figr fid="F4">4</figr> and Additional file <supplr sid="S8">8</supplr>). Thus, both the intrinsic affinity of a factor for a given DNA sequence and the accessibility of the site contribute to the pattern of genome binding <it>in vivo</it>.</p>
<p>We next focused exclusively on accessible genomic regions, and asked which component - measured factor occupancy <it>in vivo </it>or the intrinsic affinity of factors for DNA - was more closely correlated with chromatin accessibility. To address this, we grouped accessible regions into ranked cohorts of 200 based on the peak density of mapped DNaseI cleavages within each region, and plotted the median ChIP-chip scores and the number of recognition sequences (at the <it>P </it>&lt; 0.003 matching level) in each cohort (Figure <figr fid="F5">5</figr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Accessibility better explains <it>in vivo </it>occupancy than does intrinsic affinity</p></caption><text>
   <p><b>Accessibility better explains <it>in vivo </it>occupancy than does intrinsic affinity</b>. We identified and grouped 150-bp local peaks of accessibility within DNaseI accessible regions into non-overlapping cohorts of 200 peaks down the DNase-seq rank list. <b>(a) </b>The median ChIP-chip score in each cohort for each factor. <b>(b) </b>The sum of occurrences of recognition sequences that match the factor's PWM (<it>P </it>&lt; 0.003) in each cohort for each factor. The bottom row in each panel shows the relative DNase-seq scores for each cohort. Data for each factor were normalized by scaling the median value for each row and plotted as a heat map. The correlation coefficients of the data for each factor with the DNase-seq scores are shown on the right. The correlations are calculated using data for each accessible region, not the cohort average values.</p>
</text><graphic file="gb-2011-12-4-r34-5" hint_layout="double"/></fig>
<p>For all 16 factors, we found that observed levels of <it>in vivo </it>occupancy decline sharply in parallel with accessibility, most strikingly across the few thousand most accessible regions, and more gradually after that over the remaining regions. The fact that a wide array of regulatory factors with markedly different intrinsic DNA binding and biological specificities all show a similar correlation in their levels of occupancy across a diverse array of genomic elements alone implies that some common principle is directing the pattern of binding. The strong correlation of binding with accessibility suggests that the degree of access that factors have to DNA is the common force driving the otherwise surprisingly similar behavior of factors. This view is further supported by the fact that the intrinsic DNA recognition properties of factors correlate much more poorly with accessibility than does <it>in vivo </it>occupancy, suggesting that access to DNA plays a larger role in determining occupancy <it>in vivo </it>than does intrinsic specificity (r = 0.03 to 0.12 versus r = 0.32 to 0.6; Figure <figr fid="F5">5</figr>). For each factor, the density of recognition sequences drops more gradually down the rank list of accessible genomic regions than do either levels of <it>in vivo </it>occupancy or DNase-seq scores (Figure <figr fid="F5">5</figr>). Indeed, for many factors the most accessible cohorts have fewer recognition sites than regions 2,000 to 6,000 down the rank list. There is higher correlation between site density and accessibility for a few factors (especially HRY, RUNT and SNA), which could suggest that these proteins play a pioneering role in determining the pattern of genome accessibility, similar to transcription factors such as the glucocorticoid receptor <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B49">49</abbr></abbrgrp>. This correlation, however, is still low (&lt;0.13), suggesting that accessibility is affecting their binding more than any of them are affecting it.</p>
</sec>
<sec><st><p>Developmental alterations in genome accessibility direct changes in factor binding</p></st>
<p>The above analyses strongly support the 'widespread binding' model in that they suggest that the accessibility of DNA in chromatin plays a major role in determining the pattern of <it>in vivo </it>DNA binding for each transcription factor. These analyses, however, are largely of events at a single stage (stage 5). As described above, we have shown that many regions bound by developmental regulators at this stage become inaccessible in later embryogenesis (Figure <figr fid="F3">3</figr>; Additional file <supplr sid="S7">7</supplr>) and regions bound by factors in later stages are inaccessible at stage 5 (S Thomas <it>et al</it>., submitted). Such perturbations of the chromatin landscape during development provide a unique and rigorous opportunity to assess the extent to which the patterns of regulatory factor DNA binding are caused by accessibility, as follows. Since changes in factor binding between stages are necessarily measured on the same genomic regions, any alteration in occupancy cannot be due to differences in DNA sequence, but must instead derive from temporal changes in the influence of other proteins on binding, including occlusion by nucleosomes. While direct positive cooperative interactions with other sequence-specific factors could, in principle, be responsible for most of the temporal alterations in DNA binding, this cannot be the case if these alterations in DNA binding are highly correlated with changed DNA accessibility. In such cases, since changed accessibility must affect factor DNA binding and do so in proportion to the degree of that change, any additional influences on DNA binding due to heteromeric cooperative interactions and other effects must be limited, at most, to the residual extent that altered DNA binding and accessibility do not correlate. In other words, a temporal analysis sets an upper bound on all other influences on factor binding, beyond chromatin accessibility and the intrinsic affinity of factors for DNA.</p>
<p>To examine factor DNA binding in the context of developmentally programmed changes in chromatin accessibility, we analyzed <it>in vivo </it>occupancy data for two regulatory factors: hunchback (HB) at stage 9, at which time this factor is expressed in neuroblasts <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, and Medea (MED) at stages 10 and 14, which is expressed in all cells during embryogenesis, but is activated only in changing subsets of cells in response to transforming growth factor-&#946; signaling <abbrgrp><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr></abbrgrp>.</p>
<p>Both MED and HB exhibit temporal changes in occupancy, which visualization at individual gene loci suggests accompany programmed changes in chromatin accessibility (Figure <figr fid="F6">6</figr>; Additional file <supplr sid="S9">9</supplr>). A larger scale quantification of the change in factor binding shows that, between stage 5 and stages 9, 10 or 14, the correlation between binding levels for a given factor genome-wide range between r = 0.33 and r = 0.83, whereas the correlation between biological replicates at the same stage is r = 0.93 (Additional file <supplr sid="S10">10</supplr>). At most regions, therefore, the changes in levels of binding between stages for a protein are moderate, but are clearly distinguished from experimental variability between biological replicates.</p>
<suppl id="S9">
<title><p>Additional file 9</p></title>
<text><p><b>Levels of MED factor occupancy and DNaseI accessibility change between developmental stages</b>.</p></text>
<file name="gb-2011-12-4-r34-S9.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S10">
<title><p>Additional file 10</p></title>
<text><p><b>Change in DNA binding levels <it>in vivo </it>between developmental stages</b>.</p></text>
<file name="gb-2011-12-4-r34-S10.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Levels of HB factor occupancy and DNaseI accessibility change between developmental stages</p></caption><text>
   <p><b>Levels of HB factor occupancy and DNaseI accessibility change between developmental stages</b>. The level of hunchback (HB) binding and DNaseI accessibility to the <it>Caudal </it>(<it>cad</it>; left) and <it>hb</it>; right) genes are shown at stages 5 and 9. The figure is labeled using the same conventions in Figure 1 except that the locations of the regions above the ChIP-chip 1% FDR threshold are indicated by black horizontal lines beneath the continuous traces of ChIP-chip scores. Additional file <supplr sid="S9">9</supplr> shows similar results for Medea (MED).</p>
</text><graphic file="gb-2011-12-4-r34-6" hint_layout="double"/></fig>
<p>To quantify the relationship between these temporal changes in factor occupancy and alterations in genome accessibility, we focused on the 400 most highly bound genomic regions at each stage. We then calculated for each highly bound region the ratio of ChIP-chip scores between pairs of stages for a factor and separately the ratio of the density of DNaseI cleavage between the same stages and then took the correlation between these two ratios (Figure <figr fid="F7">7</figr>). An advantage of this analysis strategy is that taking ratios within each data class first will greatly reduce any systematic bias introduced by either experimental protocol. Thus, analyzing the ratios will allow a more accurate comparison between two data types. Representative results for HB are shown in Figure <figr fid="F7">7</figr>, which reveals a clear correlation between temporal changes in binding and temporal changes in accessibility. Significant correlations (r = 0.49 to 0.8, <it>P</it>-values all &lt;0.001) were likewise observed for all six pairwise comparisons between factors and stages (Figure <figr fid="F7">7</figr>; Additional file <supplr sid="S11">11</supplr>). Although strong, these correlations should be regarded as minimum estimates of the degree to which accessibility influences binding as remaining experimental biases in the data not removed by taking ratios will prevent a complete correlation.</p>
<suppl id="S11">
<title><p>Additional file 11</p></title>
<text><p><b>Temporal changes in levels of MED occupancy correlate with changes in DNaseI accessibility</b>.</p></text>
<file name="gb-2011-12-4-r34-S11.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Temporal changes in levels of HB occupancy correlate with changes in DNaseI accessibility</p></caption><text>
   <p><b>Temporal changes in levels of HB occupancy correlate with changes in DNaseI accessibility</b>. We identified the 1-kb regions &#177;500 bp of the peak nucleotide of binding for each of the 400 regions most highly bound by HB at <b>(a) </b>stage 5 and <b>(b) </b>stage 9. (a) Scatter plot of the ratio of ChIP-chip scores at stage 5 over those at stage 9 (x-axis) versus the ratio of DNase-seq scores at stage 5 over those at stage 9 (y-axis). (b) Scatter plot of the ratio of ChIP-chip scores at stage 9 over those at stage 5 (x-axis) versus the ratio of DNase-seq scores at stage 9 over those at stage 5 (y-axis). The Pearson correlation coefficients (<it>r</it>) for each comparison are shown in the top right of each panel. See Additional file <supplr sid="S11">11</supplr> for similar plots for MED.</p>
</text><graphic file="gb-2011-12-4-r34-7" hint_layout="double"/></fig>
</sec>
</sec>
<sec><st><p>Discussion</p></st>
<p>We have shown that the phenomenon of widespread, overlapping patterns of DNA binding by different sequence-specific transcription factors in <it>Drosophila </it>embryos is tightly linked in a quantitative manner to DNA accessibility in chromatin. First, averaged across the entire euchromatic genome, the level of DNA binding <it>in vivo </it>at recognition sequences with similar intrinsic affinity for a given factor is much higher in accessible versus inaccessible chromatin for all 16 factors for which all corresponding data are available (Figure <figr fid="F4">4</figr>). Within highly accessible regions, the thousands of higher affinity recognition sequences for a single factor are generally the most highly occupied <it>in vivo</it>, but even the hundreds of thousands of moderate affinity sites are generally bound at higher levels than similar sites in less accessible regions. Second, the degree of chromatin accessibility is much more highly correlated with <it>in vivo </it>occupancy than with occupancy predicted from <it>in vitro </it>affinity measurements using purified protein and naked DNA (Figure <figr fid="F5">5</figr>). Third, there is a high quantitative correlation between programmed changes in accessibility during embryogenesis and changes in the level of factor DNA binding (Figure <figr fid="F7">7</figr>). Since the accessibility experienced by transcription factors must approximate that experienced by DNaseI, the high correlation between the experimentally measured alterations in factor DNA binding and DNaseI digestion suggests that altered chromatin accessibility is the dominant determinant of the change in binding, as opposed to other potential influences such as direct heteromeric cooperative interactions.</p>
<p>All of these results support a previously proposed 'widespread binding' model, which was initially based on comparisons between <it>in vivo </it>UV crosslinking data for different classes of homeoproteins and <it>in vitro </it>DNA binding, genetic, restriction enzyme accessibility, and target gene expression data <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B41">41</abbr><abbr bid="B46">46</abbr></abbrgrp>. In this model, regulatory factors are expressed at sufficiently high concentrations in cells that they can detectably occupy their recognition sequences in highly accessible chromatin without the aid of physical cooperative interactions with other proteins. Given the broad DNA recognition properties of animal transcription factors <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, this would inevitably lead to highly overlapping, graded binding of unrelated factors, with the lowest levels of binding being non-functional <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B41">41</abbr><abbr bid="B46">46</abbr></abbrgrp>.</p>
<p>Computational modeling conducted in parallel to the studies presented here lends further credence to this model <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Using a generalized hidden Markov model, quite accurate quantitative predictions of the patterns of ChIP-seq <it>in vivo </it>DNA binding for five of the early <it>Drosophila </it>regulators can be made using only <it>in vitro </it>DNA binding and DNaseI accessibility data as input. No potential heteromeric interactions could be found in the model that would improve the prediction of DNA binding by these proteins, which are known to function in concert on a common pool of CRMs. Analysis of chromatin accessibility before and after induction of DNA binding of glucocorticoid receptor (GR) in different cell types also supports the widespread binding model. Not withstanding the fact that up to 12 to 15% of the regions bound by this pioneering transcription factor are inaccessible prior to induction, the remaining GR recognition sites in the genome that become bound are accessible prior to induction, with the different locations of GR binding between cell types largely correlating with the altered locations of accessible DNA <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>.</p>
<p>The widespread binding model incorporates long-standing predictions that, given the relatively high concentrations of transcription factors and DNA in cells, the majority of factor molecules not bound at high levels to functional targets should be bound instead at lower densities to any accessible parts of the genome <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp>. These thermodynamic arguments are supported by various lines of evidence suggesting that the concentration of free, unbound factor molecules in nuclei is indeed much lower than suggested by the number of molecules present <abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr><abbr bid="B82">82</abbr></abbrgrp>. Such predictions were originally made for the Lac repressor in <it>Escherichia coli </it>and assumed that genome-wide, low occupancy binding would result from the sequence-independent, electrostatic affinity of transcription factors for DNA (K<sub>D </sub>approximately 10<sup>-6 </sup>M). Given the broad sequence-specific recognition properties of most animal transcription factors, however, it is likely that most accessible genomic regions will contain moderate or high affinity (K<sub>D </sub>&lt; 10<sup>-8 </sup>M) recognition sites for many of these proteins <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B83">83</abbr></abbrgrp>. The factors whose <it>in vivo </it>binding we have examined are typically expressed at tens of thousands of molecules per cell <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B84">84</abbr></abbrgrp> (BDTNP, unpublished data). Thus, thermodynamically, most of these molecules are likely to significantly occupy accessible moderate or high affinity recognition sequences, rather than being bound via an electrostatic, sequence-independent interaction. Indeed, even genomic regions bound at low levels <it>in vivo </it>are enriched for specific recognition sequences of a range of affinities (<abbrgrp><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr></abbrgrp> and this paper).</p>
<p>DNA recognition sites for factors that would interfere with the proper regulation of a nearby gene will be actively selected against <abbrgrp><abbr bid="B85">85</abbr></abbrgrp>. Low level binding at fortuitously occurring sites that does not lead to biologically significant transcriptional effects, in contrast, would not be subject to negative selection, and is consistent with the high amount of apparently incidental binding of factors detected <it>in vivo </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
<p>Our analysis does not rule out an important role for direct heteromeric cooperative interactions between transcription factors quantitatively modifying binding of these proteins at a subset of recognition sequences. Our results, however, set limits on the extent to which direct positive heteromeric cooperative interactions are likely to determine the overall distribution of factor binding in cells. Because accessibility must affect binding, the high quantitative correlation we have measured between accessibility and <it>in vivo </it>binding leaves only a modest role for direct cooperative interactions to further modify binding.</p>
<p>A much larger role for direct heteromeric interactions in targeting transcription factor binding has been invoked where it is assumed that the concentrations at which factors are expressed in cells are too low to allow significant occupation of functional target sites without such interactions <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. This 'direct cooperativity model' is associated with the idea that factors each bind and regulate a limited number of largely different genes, even in the same cell type (for example, <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>), and that even factors with similar intrinsic DNA recognition properties are targeted to different genes (for example, <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B30">30</abbr></abbrgrp>). Based on the evidence presented here and the growing recognition that transcription factors bind a wide array of genomic regions in many animals and cell types <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, the direct cooperativity model may apply to a relatively limited set of factors and circumstances.</p>
<p>The occurrence of statistically significant local clusters of recognition sites for multiple transcription factors in a subset of CRMs modules (for example, <abbrgrp><abbr bid="B86">86</abbr><abbr bid="B87">87</abbr><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr><abbr bid="B91">91</abbr><abbr bid="B92">92</abbr></abbrgrp>) could be taken as evidence for the direct cooperativity model. Such preferential clustering, however, could also result because of post-DNA-binding synergistic cooperativity between factors that does not significantly influence their targeting to DNA but instead influences members of the general transcriptional machinery <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B86">86</abbr><abbr bid="B93">93</abbr></abbrgrp>. Thus, the arrangement of recognition sites in the genome, while highly informative in detecting putative regulatory elements, cannot itself distinguish between different factor targeting mechanisms.</p>
<p>In addition to the long-standing evidence that nucleosomes inhibit the binding of transcription factors at some DNA regions <it>vivo </it>(reviewed by <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>), genome-wide studies have increasingly shown an association between regions bound by factors <it>in vivo </it>and features of chromatin structure, such as histone modifications, nucleosome content or accessibility <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B25">25</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B94">94</abbr><abbr bid="B95">95</abbr><abbr bid="B96">96</abbr><abbr bid="B97">97</abbr><abbr bid="B98">98</abbr><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr><abbr bid="B101">101</abbr></abbrgrp>. These studies, however, have not shown that functionally distinct factors show a quantitative continuum of function and binding at common regions; nor observed a high quantitative correlation between DNA accessibility and factor binding; nor considered the classic thermodynamic predictions of Lin and Riggs <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> and Peter von Hippel <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>; nor sought to distinguish between the 'widespread binding' and the 'direct cooperativity' models for transcription factor targeting. Most of these studies have generally looked at the association qualitatively. In addition, the studies in yeast have not measured accessibility directly, but have attempted to infer it from ChIP-chip studies of nucleosome occupancy or nucleosome position sequence data <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, which will likely lead to some inaccuracy as genome accessibility is the product of all proteins bound to DNA and also high order chromatin structures. Our results thus highlight the importance of both measuring and considering the quantitative nature of factor binding and genome accessibility and of attempting to distinguish between alternative targeting models.</p>
<p>Finally, while our analysis does not address how the distribution of accessible regions in the genome is itself established, it is consistent with the indirect cooperativity model proposed by others in which different transcription factors mutually aid each other's binding to DNA by displacing a nucleosome without physically interacting with each other <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. Indirect cooperativity, we suggest, implies that factors are expressed at a sufficiently high concentration in cells that they can occupy their recognition sites without the aid of direct protein-protein interactions with other proteins. It also predicts a high overlap in the genomic regions bound by transcription factors once the broad intrinsic DNA recognition properties of these proteins are taken into account. Most factors would be expected to contribute only a small part to determining the overall pattern of chromatin accessibility in this model, whereas chromatin accessibility would be expected to play a large role in determining the pattern of binding of each factor, when each is considered individually. The emerging picture is of a dynamic interplay between nucleosomes and sequence-specific DNA binding proteins (along with the remodeling/modification enzymes that they recruit) that mutually determine each other's binding patterns <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Using the <it>Drosophila </it>embryo as a model system, we have provided a uniquely detailed, quantitative comparison between DNA accessibility and regulatory transcription factor occupancy <it>in vivo</it>. These analyses support a long-standing 'widespread binding' model <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B41">41</abbr><abbr bid="B46">46</abbr><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr><abbr bid="B102">102</abbr></abbrgrp>, which suggests that animal regulatory factors are generally expressed at sufficiently high concentrations in cells that they can detectably occupy their recognition sequences in highly accessible chromatin without the aid of physical cooperative interactions with other proteins. Given the broad DNA recognition properties of animal transcription factors <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, this should inevitably lead to highly overlapping, graded binding of unrelated factors, with the lowest levels of binding being non-functional, consistent with extensive <it>in vivo </it>DNA binding and regulatory data in <it>Drosophila </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B46">46</abbr></abbrgrp>. This simple thermodynamic model predicts that similar widespread, overlapping DNA binding by many different regulatory transcription factors will be found in all animal cells.</p>
</sec>
<sec><st><p>Materials and methods</p></st>
<sec><st><p>ChIP-chip of HB and MED in late stage embryos</p></st>
<p>Embryos were collected in population cages for 1 hour, and then allowed to develop to the required stage before being harvested and fixed with formaldehyde <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B65">65</abbr></abbrgrp>. Chromatin was purified and ChIP-chip experiments were performed using affinity purified antibodies against HB and MED as described previously <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr></abbrgrp>. The data were processed as before to determine 1% FDR and 25% FDR bound regions and peaks using the symmetric null test <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> (Figure <figr fid="F2">2</figr>). All raw microarray data (CEL files) have been deposited at ArrayExpress [ArrayExpress: E-TABM-1021], and details of the locations of the 1% and 25% FDR bound regions are provided as Additional file <supplr sid="S12">12</supplr>. In addition, these and more processed forms of the data are available from the BDTNP's public web site <abbrgrp><abbr bid="B103">103</abbr></abbrgrp>.</p>
<suppl id="S12">
<title><p>Additional file 12</p></title>
<text><p><b>1% and 25% FDR ChIP-chip bound regions for HB at stage 9 and MED at stages 10 and 14</b>.</p></text>
<file name="gb-2011-12-4-r34-S12.ZIP">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Determining the intersection of 5% FDR accessible regions and peaks</p></st>
<p>The raw DNase-seq DNA sequence tag data are from Thomas <it>et al</it>. ('Dynamic reprogramming of chromatin accessibility during <it>Drosophila </it>embryo development', submitted), which used methods described in <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B62">62</abbr><abbr bid="B104">104</abbr></abbrgrp> to generate the data. For convenience, the NCBI Sequence Read Archive accession numbers for these data are also provided here: [NCBI SRA: STUDY SRP002474, NCBI SRA EXPERIMENTS SRX020691 to SRX020700] for stage 5 rep 1 to stage 14 rep 2, respectively). As described (S Thomas <it>et al</it>., submitted), DNaseI accessible regions were defined using a scan statistic that identified regions with DNaseI cleavage densities that were significantly above the local 50 kb background. Regions at 5% FDR were identified (Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>). Peaks in accessibility were identified from local maxima in tag density within 75 bp of a given 20-bp sliding window across each accessible region (Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>). The conservatively defined set of accessible regions and peaks in accessibility that were found in both replicates at each stage were used for subsequent analysis (for example, Additional files <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr> and <supplr sid="S7">7</supplr>).</p>
</sec>
<sec><st><p>Correlating factor binding and genome accessibility</p></st>
<p>The locations of 1% FDR ChIP-chip peaks for 21 factors at stage 5 were obtained from previously published data <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B103">103</abbr></abbrgrp> [Array Express; E-TABM-736]. The percentage of ChIP-chip peaks overlapped by accessible chromatin for each factor at stage 5 (Additional file <supplr sid="S5">5</supplr>) was calculated by adding the number of instances either where the 1-kb ChIP-chip peak was overlapped by an accessible region by at least 200 bp or where a ChIP-chip peak entirely encompassed a 5% FDR DNaseI accessible region, and dividing by the total number of 1% FDR ChIP-chip peaks. The significance of this coverage was assessed using two separate methods, a simple hypergeometric model and the Genome Structure Correction (GSC) statistic <abbrgrp><abbr bid="B105">105</abbr></abbrgrp>. The hypergeometric model assessed the likelihood of set A to include 'q' base pairs of overlap with set 'B', assuming n draws without replacement from the genome where n is the base-pair coverage of set A. GSC is a more complex bootstrapping method specifically designed to calculate probabilities of overlap for sets of genomics features. For both tests it was impossible to determine with any further accuracy the probabilities of overlaps for each factor with greater significance than the <it>de minimus </it>probability of 1 &#215; 10<sup>-16</sup>.</p>
<p>To determine what fraction of the accessible regions was covered by one or more factors (Additional file <supplr sid="S5">5</supplr>), all of the single-nucleotide locations of 1% FDR ChIP-chip peaks <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr></abbrgrp> for all factors were merged and padded on either end by 500 bp to account for imprecision in the location of each peak. Peaks in DNaseI accessibility in stage 5 embryos were ranked from largest to smallest and divided into cohorts of 1,000 peaks. If any of the merged ChIP regions fell within 75 bp of a peak in accessibility, then that DNaseI peak was said to be 'covered' by a ChIP factor. The fraction of peaks that were bound by any of the factors was calculated as the number of 'covered' peaks divided by the number of peaks per cohort.</p>
<p>The 25% FDR ChIP-chip peaks for each factor were ranked from largest to smallest and divided into cohorts of 200 peaks (Figure <figr fid="F3">3</figr>; Additional file <supplr sid="S7">7</supplr>). The maximum DNaseI density for stage 5 and 14 embryos within 500 bp of each ChIP-chip peak was recorded as was whether or not that peak overlapped a stage 5 DNaseI accessible region. The number of ChIP-chip peaks in each cohort that overlapped a stage 5 accessible region divided by the number of peaks in each cohort was calculated to determine the percent of ChIP-chip peaks in each cohort that were in accessible regions. The median and 95% confidence intervals of maximum DNaseI densities for the ChIP-chip peak cohorts were calculated with R's box plot function <abbrgrp><abbr bid="B106">106</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Measuring the effect of accessibility and intrinsic factor specificity on <it>in vivo </it>occupancy</p></st>
<p>PWMs for 16 transcription factors have previously been collated <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> from various <it>in vitro </it>SELEX and DNaseI footprinting experiments that used purified transcription factor protein and naked DNA <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> (BDTNP, unpublished data). For convenience these are provided in Additional file <supplr sid="S13">13</supplr>. These PWMs were used to identify all DNA sequences that match them genome-wide at <it>P</it>-values &lt;0.04 using Fimo <abbrgrp><abbr bid="B107">107</abbr></abbrgrp>. For each factor, these recognition site occurrences were then divided into two groups depending on whether the matches were located within 5% FDR DNaseI accessible regions or whether they were in inaccessible chromatin. The recognition sites were then further broken down into cohorts in R based on <it>P</it>-values as follows:</p>
<suppl id="S13">
<title><p>Additional file 13</p></title>
<text><p><b>Position weight matrices of factors' intrinsic DNA recognition properties used</b>.</p></text>
<file name="gb-2011-12-4-r34-S13.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p><display-formula><graphic file="gb-2011-12-4-r34-i1.gif"/></display-formula></p>
<p>For each cohort, the maximum ChIP-chip signal from the relevant factor within 250 bp of each sequence match was determined using input DNA normalized ChIP-chip scores calculated as Array hybridization signal for factor immunoprecipitation/Array hybridization signal for input crosslinked DNA (see Figure <figr fid="F2">2</figr> in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>) except that natural numbers, not log2, were used here. The 95% confidence interval about the median of these scores was calculated using R's box plot function (Figure <figr fid="F4">4</figr>; Additional file <supplr sid="S8">8</supplr>).</p>
<p>In addition, ten permutations of each original PWM were generated by shuffling the order of positions in the weight matrices for each permutation. If any permutation that matched any other of the randomly generated permutations for that factor or the normal PWM of one of the other 15 factors (<it>P </it>&lt; 0.05 defined using Tomtom <abbrgrp><abbr bid="B108">108</abbr></abbrgrp>) it was discarded and a new permutation was generated. The set of sequence matches to these scrambled PWMs were then identified throughout the genome, separated into those in open or closed chromatin and binned into groups based on affinity in the same manner as for the genuine motifs. The maximum ChIP-chip scores within 250 bp of each scrambled recognition site occurrence was determined and the median of this peak score was determined over the entire set of ten scrambled PWMs for each factor and the 95% confidence limits calculated as for the matches to the genuine PWMs (Figure <figr fid="F4">4</figr>; Additional file <supplr sid="S8">8</supplr>).</p>
<p>To correlate accessibility with ChIP-chip scores (Figure <figr fid="F5">5a</figr>), peaks in accessibility at stage 5 were annotated with maximum input DNA normalized ChIP-chip scores within 75 bp of each peak for the 16 factors with well-characterized <it>in vitro </it>binding specificities (Figure <figr fid="F4">4</figr>; Additional file <supplr sid="S8">8</supplr>). The peaks were ranked by accessibility and the correlation between level of accessibility and ChIP-chip score was calculated using R's Pearson correlation function. The DNaseI peaks were then ranked, separated into cohorts of 200 similarly accessible peaks and the median peak in ChIP-chip signal for each cohort was determined and plotted using R's heat map function scaling rows to account for inherent differences in ChIP-chip signal between factors. A similar process was used to correlate accessibility with the presence of recognition sites for each of the 16 factors (Figure <figr fid="F5">5b</figr>). The same PWMs for the factors derived from <it>in vitro </it>DNA binding data, described above, were employed to identify all sequence matches to these matrices within 75 bp of peaks of accessibility with <it>P </it>&lt; 0.003 using Fimo <abbrgrp><abbr bid="B107">107</abbr></abbrgrp> (that is, matches that fell into at least the -3 cohort from Figure <figr fid="F4">4</figr>). The correlation between the level of accessibility and the number of PWM matches was calculated using R's Pearson correlation function. For each factor, the peaks in accessibility were ranked and divided into cohorts of 200 and the sum of all recognition sites was added over each cohort and plotted in R using the heat map function, while scaling rows to one another in order to account for differences in information content between PWMs.</p>
</sec>
<sec><st><p>Correlating temporal changes in factor occupancy and DNA accessibility</p></st>
<p>Scatter plots and Pearson correlations were generated using R (Figure <figr fid="F7">7</figr>; Additional files <supplr sid="S10">10</supplr> and <supplr sid="S11">11</supplr>). Peaks in ChIP-chip data for HB2 antibody above the 25% FDR threshold were annotated by the maximum ChIP-chip signal for HB 1 and HB 2 within 500 bp of each peak <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, and these two replicate input DNA normalized ChIP-chip scores were plotted against each other and a correlation coefficient calculated (Additional file <supplr sid="S10">10</supplr>). This same process was used to assess the correlation between maximum HB 2 ChIP-chip signal from stage 5 embryos compared to HB 2 ChIP-chip signal from stage 9 embryos, as well as to compare MED ChIP-chip signals from stage 5, 10 and 11 embryos. This process was also used to determine if the changes in ChIP-chip signal were correlated with changes in chromatin accessibility at the same genomic regions (Figure <figr fid="F7">7</figr>; Additional file <supplr sid="S11">11</supplr>). For these plots, the ratio between input DNA normalized ChIP-chip scores for stage X and scores for stage Y was plotted against the ratio between DNAse-seq density for stage X and density for stage Y for the following six pairwise comparisons: HB 2 stage 5/HB 2 stage 9; HB 2 stage 9/HB 2 stage 5; MED stage 5/MED stage 10; MED stage 5/MED stage 14; MED stage 10/MED stage 5; and MED stage 14/MED stage 5.</p>
</sec>
</sec>
<sec><st><p>Abbreviations</p></st>
<p>BDTNP: Berkeley <it>Drosophila </it>Transcription Network Project; bp: base pair; cad: caudal; ChIP-chip: chromatin immunoprecipitation followed by microarray analysis; CRM: <it>cis</it>-regulatory module; D: Dichaete; DNase-seq: DNaseI digestion of nuclei followed by high throughput DNA sequencing; eve: even-skipped; FDR: false discovery rate; GR: glucocorticoid receptor; HB: hunchback; MED, Medea; PWM: position weight matrix; TWI: Twist.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>XL, ST, MBE, JAS and MDB conceived and designed the experiments and analyses and wrote the paper. XL and PJS performed the wet laboratory experiments. XL, ST, JAS and MDB analyzed the data. All authors read and approved the final manuscript.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>This work is part of a collaboration between the BDTNP and John Stamatoyannopoulos' group. We are very grateful for the frequent advice, support, criticism, and enthusiasm of members of both groups. The <it>in vivo </it>DNA binding data were funded by the US National Institutes of Health (NIH) under grants GM704403 (to MDB and MBE). Computational analyses were funded by NIH grant R01GM71923 (to JAS) and T90 HG 004007-04 (to ST). Work at Lawrence Berkeley National Laboratory was conducted under Department of Energy contract DE-AC02-05CH11231.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Two homeo domain proteins bind with similar specificity to a wide range of DNA sites in <it>Drosophila </it>embryos.</p></title><aug><au><snm>Walter</snm><fnm>J</fnm></au><au><snm>Dever</snm><fnm>CA</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>Genes Dev</source><pubdate>1994</pubdate><volume>8</volume><fpage>1678</fpage><lpage>1692</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.8.14.1678</pubid><pubid idtype="pmpid" link="fulltext">7958848</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Eve and ftz regulate a wide array of genes in blastoderm embryos: the selector homeoproteins directly or indirectly regulate most genes in <it>Drosophila</it>.</p></title><aug><au><snm>Liang</snm><fnm>Z</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>Development</source><pubdate>1998</pubdate><volume>125</volume><fpage>4471</fpage><lpage>4482</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9778506</pubid></xrefbib></bibl><bibl id="B3"><title><p>A comparison of <it>in vivo </it>and <it>in vitro </it>DNA-binding specificities suggests a new model for homeoprotein DNA binding in <it>Drosophila </it>embryos.</p></title><aug><au><snm>Carr</snm><fnm>A</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>EMBO J</source><pubdate>1999</pubdate><volume>18</volume><fpage>1598</fpage><lpage>1608</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/emboj/18.6.1598</pubid><pubid idtype="pmcid">1171247</pubid><pubid idtype="pmpid">10075930</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Core transcriptional regulatory circuitry in human embryonic stem cells.</p></title><aug><au><snm>Boyer</snm><fnm>LA</fnm></au><au><snm>Lee</snm><fnm>TI</fnm></au><au><snm>Cole</snm><fnm>MF</fnm></au><au><snm>Johnstone</snm><fnm>SE</fnm></au><au><snm>Levine</snm><fnm>SS</fnm></au><au><snm>Zucker</snm><fnm>JP</fnm></au><au><snm>Guenther</snm><fnm>MG</fnm></au><au><snm>Kumar</snm><fnm>RM</fnm></au><au><snm>Murray</snm><fnm>HL</fnm></au><au><snm>Jenner</snm><fnm>RG</fnm></au><au><snm>Gifford</snm><fnm>DK</fnm></au><au><snm>Melton</snm><fnm>DA</fnm></au><au><snm>Jaenisch</snm><fnm>R</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au></aug><source>Cell</source><pubdate>2005</pubdate><volume>122</volume><fpage>947</fpage><lpage>956</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2005.08.020</pubid><pubid idtype="pmcid">3006442</pubid><pubid idtype="pmpid">16153702</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome.</p></title><aug><au><snm>Bieda</snm><fnm>M</fnm></au><au><snm>Xu</snm><fnm>X</fnm></au><au><snm>Singer</snm><fnm>MA</fnm></au><au><snm>Green</snm><fnm>R</fnm></au><au><snm>Farnham</snm><fnm>PJ</fnm></au></aug><source>Genome Res</source><pubdate>2006</pubdate><volume>16</volume><fpage>595</fpage><lpage>605</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.4887606</pubid><pubid idtype="pmcid">1457046</pubid><pubid idtype="pmpid">16606705</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Relationships between p63 binding, DNA sequence, transcription activity, and biological function in human cells.</p></title><aug><au><snm>Yang</snm><fnm>A</fnm></au><au><snm>Zhu</snm><fnm>Z</fnm></au><au><snm>Kapranov</snm><fnm>P</fnm></au><au><snm>McKeon</snm><fnm>F</fnm></au><au><snm>Church</snm><fnm>GM</fnm></au><au><snm>Gingeras</snm><fnm>TR</fnm></au><au><snm>Struhl</snm><fnm>K</fnm></au></aug><source>Mol Cell</source><pubdate>2006</pubdate><volume>24</volume><fpage>593</fpage><lpage>602</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2006.10.018</pubid><pubid idtype="pmpid" link="fulltext">17188034</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>A core transcriptional network for early mesoderm development in <it>Drosophila melanogaster</it>.</p></title><aug><au><snm>Sandmann</snm><fnm>T</fnm></au><au><snm>Girardot</snm><fnm>C</fnm></au><au><snm>Brehme</snm><fnm>M</fnm></au><au><snm>Tongprasit</snm><fnm>W</fnm></au><au><snm>Stolc</snm><fnm>V</fnm></au><au><snm>Furlong</snm><fnm>EEM</fnm></au></aug><source>Genes Dev</source><pubdate>2007</pubdate><volume>21</volume><fpage>436</fpage><lpage>449</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1509007</pubid><pubid idtype="pmcid">1804332</pubid><pubid idtype="pmpid">17322403</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Whole-genome ChIP-chip analysis of Dorsal, Twist, and Snail suggests integration of diverse patterning processes in the <it>Drosophila </it>embryo.</p></title><aug><au><snm>Zeitlinger</snm><fnm>J</fnm></au><au><snm>Zinzen</snm><fnm>RP</fnm></au><au><snm>Stark</snm><fnm>A</fnm></au><au><snm>Kellis</snm><fnm>M</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au><au><snm>Levine</snm><fnm>M</fnm></au></aug><source>Genes Dev</source><pubdate>2007</pubdate><volume>21</volume><fpage>385</fpage><lpage>390</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1509607</pubid><pubid idtype="pmcid">1804326</pubid><pubid idtype="pmpid">17322397</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Genome-wide mapping of <it>in vivo </it>protein-DNA interactions.</p></title><aug><au><snm>Johnson</snm><fnm>DS</fnm></au><au><snm>Mortazavi</snm><fnm>A</fnm></au><au><snm>Myers</snm><fnm>RM</fnm></au><au><snm>Wold</snm><fnm>B</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>316</volume><fpage>1497</fpage><lpage>1502</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1141319</pubid><pubid idtype="pmpid" link="fulltext">17540862</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing.</p></title><aug><au><snm>Robertson</snm><fnm>G</fnm></au><au><snm>Hirst</snm><fnm>M</fnm></au><au><snm>Bainbridge</snm><fnm>M</fnm></au><au><snm>Bilenky</snm><fnm>M</fnm></au><au><snm>Zhao</snm><fnm>Y</fnm></au><au><snm>Zeng</snm><fnm>T</fnm></au><au><snm>Euskirchen</snm><fnm>G</fnm></au><au><snm>Bernier</snm><fnm>B</fnm></au><au><snm>Varhol</snm><fnm>R</fnm></au><au><snm>Delaney</snm><fnm>A</fnm></au><au><snm>Thiessen</snm><fnm>N</fnm></au><au><snm>Griffith</snm><fnm>OL</fnm></au><au><snm>He</snm><fnm>A</fnm></au><au><snm>Marra</snm><fnm>M</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Jones</snm><fnm>S</fnm></au></aug><source>Nat Methods</source><pubdate>2007</pubdate><volume>4</volume><fpage>651</fpage><lpage>657</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth1068</pubid><pubid idtype="pmpid" link="fulltext">17558387</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Integration of external signaling pathways with the core transcriptional network in embryonic stem cells.</p></title><aug><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Xu</snm><fnm>H</fnm></au><au><snm>Yuan</snm><fnm>P</fnm></au><au><snm>Fang</snm><fnm>F</fnm></au><au><snm>Huss</snm><fnm>M</fnm></au><au><snm>Vega</snm><fnm>VB</fnm></au><au><snm>Wong</snm><fnm>E</fnm></au><au><snm>Orlov</snm><fnm>YL</fnm></au><au><snm>Zhang</snm><fnm>W</fnm></au><au><snm>Jiang</snm><fnm>J</fnm></au><au><snm>Loh</snm><fnm>YH</fnm></au><au><snm>Yeo</snm><fnm>HC</fnm></au><au><snm>Yeo</snm><fnm>ZX</fnm></au><au><snm>Narang</snm><fnm>V</fnm></au><au><snm>Govindarajan</snm><fnm>KR</fnm></au><au><snm>Leong</snm><fnm>B</fnm></au><au><snm>Shahab</snm><fnm>A</fnm></au><au><snm>Ruan</snm><fnm>Y</fnm></au><au><snm>Bourque</snm><fnm>G</fnm></au><au><snm>Sung</snm><fnm>WK</fnm></au><au><snm>Clarke</snm><fnm>ND</fnm></au><au><snm>Wei</snm><fnm>CL</fnm></au><au><snm>Ng</snm><fnm>HH</fnm></au></aug><source>Cell</source><pubdate>2008</pubdate><volume>133</volume><fpage>1106</fpage><lpage>1117</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2008.04.043</pubid><pubid idtype="pmpid" link="fulltext">18555785</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.</p></title><aug><au><snm>Consortium</snm><fnm>TEP</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>447</volume><fpage>799</fpage><lpage>816</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05874</pubid><pubid idtype="pmcid">2212820</pubid><pubid idtype="pmpid">17571346</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Genomic profiling and expression studies reveal both positive and negative activities for the <it>Drosophila </it>Myb MuvB/dREAM complex in proliferating cells.</p></title><aug><au><snm>Georlette</snm><fnm>D</fnm></au><au><snm>Ahn</snm><fnm>S</fnm></au><au><snm>MacAlpine</snm><fnm>DM</fnm></au><au><snm>Cheung</snm><fnm>E</fnm></au><au><snm>Lewis</snm><fnm>PW</fnm></au><au><snm>Beall</snm><fnm>EL</fnm></au><au><snm>Bell</snm><fnm>SP</fnm></au><au><snm>Speed</snm><fnm>T</fnm></au><au><snm>Manak</snm><fnm>JR</fnm></au><au><snm>Botchan</snm><fnm>MR</fnm></au></aug><source>Genes Dev</source><pubdate>2007</pubdate><volume>21</volume><fpage>2880</fpage><lpage>2896</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1600107</pubid><pubid idtype="pmcid">2049191</pubid><pubid idtype="pmpid">17978103</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Transcription factors bind thousands of active and inactive regions in the <it>Drosophila </it>blastoderm.</p></title><aug><au><snm>Li</snm><fnm>XY</fnm></au><au><snm>MacArthur</snm><fnm>S</fnm></au><au><snm>Bourgon</snm><fnm>R</fnm></au><au><snm>Nix</snm><fnm>D</fnm></au><au><snm>Pollard</snm><fnm>DA</fnm></au><au><snm>Iyer</snm><fnm>VN</fnm></au><au><snm>Hechmer</snm><fnm>A</fnm></au><au><snm>Simirenko</snm><fnm>L</fnm></au><au><snm>Stapleton</snm><fnm>M</fnm></au><au><snm>Luengo Hendriks</snm><fnm>CL</fnm></au><au><snm>Chu</snm><fnm>HC</fnm></au><au><snm>Ogawa</snm><fnm>N</fnm></au><au><snm>Inwood</snm><fnm>W</fnm></au><au><snm>Sementchenko</snm><fnm>V</fnm></au><au><snm>Beaton</snm><fnm>A</fnm></au><au><snm>Weiszmann</snm><fnm>R</fnm></au><au><snm>Celniker</snm><fnm>SE</fnm></au><au><snm>Knowles</snm><fnm>DW</fnm></au><au><snm>Gingeras</snm><fnm>T</fnm></au><au><snm>Speed</snm><fnm>TP</fnm></au><au><snm>Eisen</snm><fnm>MB</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>PLoS Biol</source><pubdate>2008</pubdate><volume>6</volume><fpage>e27</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0060027</pubid><pubid idtype="pmcid">2235902</pubid><pubid idtype="pmpid">18271625</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>An oestrogen-receptor-alpha-bound human chromatin interactome.</p></title><aug><au><snm>Fullwood</snm><fnm>MJ</fnm></au><au><snm>Liu</snm><fnm>MH</fnm></au><au><snm>Pan</snm><fnm>YF</fnm></au><au><snm>Liu</snm><fnm>J</fnm></au><au><snm>Xu</snm><fnm>H</fnm></au><au><snm>Mohamed</snm><fnm>YB</fnm></au><au><snm>Orlov</snm><fnm>YL</fnm></au><au><snm>Velkov</snm><fnm>S</fnm></au><au><snm>Ho</snm><fnm>A</fnm></au><au><snm>Mei</snm><fnm>PH</fnm></au><au><snm>Chew</snm><fnm>EG</fnm></au><au><snm>Huang</snm><fnm>PY</fnm></au><au><snm>Welboren</snm><fnm>WJ</fnm></au><au><snm>Han</snm><fnm>Y</fnm></au><au><snm>Ooi</snm><fnm>HS</fnm></au><au><snm>Ariyaratne</snm><fnm>PN</fnm></au><au><snm>Vega</snm><fnm>VB</fnm></au><au><snm>Luo</snm><fnm>Y</fnm></au><au><snm>Tan</snm><fnm>PY</fnm></au><au><snm>Choy</snm><fnm>PY</fnm></au><au><snm>Wansa</snm><fnm>KD</fnm></au><au><snm>Zhao</snm><fnm>B</fnm></au><au><snm>Lim</snm><fnm>KS</fnm></au><au><snm>Leow</snm><fnm>SC</fnm></au><au><snm>Yow</snm><fnm>JS</fnm></au><au><snm>Joseph</snm><fnm>R</fnm></au><au><snm>Li</snm><fnm>H</fnm></au><au><snm>Desai</snm><fnm>KV</fnm></au><au><snm>Thomsen</snm><fnm>JS</fnm></au><au><snm>Lee</snm><fnm>YK</fnm></au><etal/></aug><source>Nature</source><pubdate>2009</pubdate><volume>462</volume><fpage>58</fpage><lpage>64</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08497</pubid><pubid idtype="pmcid">2774924</pubid><pubid idtype="pmpid">19890323</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Functional targets of the monogenic diabetes transcription factors HNF-1alpha and HNF-4alpha are highly conserved between mice and humans.</p></title><aug><au><snm>Boj</snm><fnm>SF</fnm></au><au><snm>Servitja</snm><fnm>JM</fnm></au><au><snm>Martin</snm><fnm>D</fnm></au><au><snm>Rios</snm><fnm>M</fnm></au><au><snm>Talianidis</snm><fnm>I</fnm></au><au><snm>Guigo</snm><fnm>R</fnm></au><au><snm>Ferrer</snm><fnm>J</fnm></au></aug><source>Diabetes</source><pubdate>2009</pubdate><volume>58</volume><fpage>1245</fpage><lpage>1253</lpage><xrefbib><pubidlist><pubid idtype="doi">10.2337/db08-0812</pubid><pubid idtype="pmcid">2671044</pubid><pubid idtype="pmpid">19188435</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Developmental roles of 21 <it>Drosophila </it>transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions.</p></title><aug><au><snm>MacArthur</snm><fnm>S</fnm></au><au><snm>Li</snm><fnm>XY</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Brown</snm><fnm>JB</fnm></au><au><snm>Chu</snm><fnm>HC</fnm></au><au><snm>Zeng</snm><fnm>L</fnm></au><au><snm>Grondona</snm><fnm>BP</fnm></au><au><snm>Hechmer</snm><fnm>A</fnm></au><au><snm>Simirenko</snm><fnm>L</fnm></au><au><snm>Keranen</snm><fnm>SV</fnm></au><au><snm>Knowles</snm><fnm>DW</fnm></au><au><snm>Stapleton</snm><fnm>M</fnm></au><au><snm>Bickel</snm><fnm>P</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au><au><snm>Eisen</snm><fnm>MB</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>R80</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-7-r80</pubid><pubid idtype="pmcid">2728534</pubid><pubid idtype="pmpid">19627575</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Genome-wide MyoD binding in skeletal muscle cells: a potential for broad cellular reprogramming.</p></title><aug><au><snm>Cao</snm><fnm>Y</fnm></au><au><snm>Yao</snm><fnm>Z</fnm></au><au><snm>Sarkar</snm><fnm>D</fnm></au><au><snm>Lawrence</snm><fnm>M</fnm></au><au><snm>Sanchez</snm><fnm>GJ</fnm></au><au><snm>Parker</snm><fnm>MH</fnm></au><au><snm>MacQuarrie</snm><fnm>KL</fnm></au><au><snm>Davison</snm><fnm>J</fnm></au><au><snm>Morgan</snm><fnm>MT</fnm></au><au><snm>Ruzzo</snm><fnm>WL</fnm></au><au><snm>Gentleman</snm><fnm>RC</fnm></au><au><snm>Tapscott</snm><fnm>SJ</fnm></au></aug><source>Dev Cell</source><pubdate>2010</pubdate><volume>18</volume><fpage>662</fpage><lpage>674</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.devcel.2010.02.014</pubid><pubid idtype="pmpid" link="fulltext">20412780</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Binding site turnover produces pervasive quantitative changes in transcription factor binding between closely related <it>Drosophila </it>species.</p></title><aug><au><snm>Bradley</snm><fnm>RK</fnm></au><au><snm>Li</snm><fnm>XY</fnm></au><au><snm>Trapnell</snm><fnm>C</fnm></au><au><snm>Davidson</snm><fnm>S</fnm></au><au><snm>Pachter</snm><fnm>L</fnm></au><au><snm>Chu</snm><fnm>HC</fnm></au><au><snm>Tonkin</snm><fnm>LA</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au><au><snm>Eisen</snm><fnm>MB</fnm></au></aug><source>PLoS Biol</source><pubdate>2010</pubdate><volume>8</volume><fpage>e1000343</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.1000343</pubid><pubid idtype="pmcid">2843597</pubid><pubid idtype="pmpid">20351773</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Hotspots of transcription factor colocalization in the genome of <it>Drosophila melanogaster</it>.</p></title><aug><au><snm>Moorman</snm><fnm>C</fnm></au><au><snm>Sun</snm><fnm>LV</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>de Wit</snm><fnm>E</fnm></au><au><snm>Talhout</snm><fnm>W</fnm></au><au><snm>Ward</snm><fnm>LD</fnm></au><au><snm>Greil</snm><fnm>F</fnm></au><au><snm>Lu</snm><fnm>XJ</fnm></au><au><snm>White</snm><fnm>KP</fnm></au><au><snm>Bussemaker</snm><fnm>HJ</fnm></au><au><snm>van Steensel</snm><fnm>B</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>12027</fpage><lpage>12032</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0605003103</pubid><pubid idtype="pmcid">1567692</pubid><pubid idtype="pmpid">16880385</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Combinatorial binding predicts spatio-temporal cis-regulatory activity.</p></title><aug><au><snm>Zinzen</snm><fnm>RP</fnm></au><au><snm>Girardot</snm><fnm>C</fnm></au><au><snm>Gagneur</snm><fnm>J</fnm></au><au><snm>Braun</snm><fnm>M</fnm></au><au><snm>Furlong</snm><fnm>EE</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>462</volume><fpage>65</fpage><lpage>70</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08531</pubid><pubid idtype="pmpid" link="fulltext">19890324</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>ChIP-Seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells.</p></title><aug><au><snm>Ouyang</snm><fnm>Z</fnm></au><au><snm>Zhou</snm><fnm>Q</fnm></au><au><snm>Wong</snm><fnm>WH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>21521</fpage><lpage>21526</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0904863106</pubid><pubid idtype="pmcid">2789751</pubid><pubid idtype="pmpid">19995984</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Different gene regulation strategies revealed by analysis of binding motifs.</p></title><aug><au><snm>Wunderlich</snm><fnm>Z</fnm></au><au><snm>Mirny</snm><fnm>LA</fnm></au></aug><source>Trends Genet</source><pubdate>2009</pubdate><volume>25</volume><fpage>434</fpage><lpage>440</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2009.08.003</pubid><pubid idtype="pmpid" link="fulltext">19815308</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF.</p></title><aug><au><snm>Iyer</snm><fnm>VR</fnm></au><au><snm>Horak</snm><fnm>CE</fnm></au><au><snm>Scafe</snm><fnm>CS</fnm></au><au><snm>Botstein</snm><fnm>D</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Brown</snm><fnm>PO</fnm></au></aug><source>Nature</source><pubdate>2001</pubdate><volume>409</volume><fpage>533</fpage><lpage>538</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35054095</pubid><pubid idtype="pmpid" link="fulltext">11206552</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Whole-genome comparison of Leu3 binding <it>in vitro </it>and <it>in vivo </it>reveals the importance of nucleosome occupancy in target site selection.</p></title><aug><au><snm>Liu</snm><fnm>X</fnm></au><au><snm>Lee</snm><fnm>CK</fnm></au><au><snm>Granek</snm><fnm>JA</fnm></au><au><snm>Clarke</snm><fnm>ND</fnm></au><au><snm>Lieb</snm><fnm>JD</fnm></au></aug><source>Genome Res</source><pubdate>2006</pubdate><volume>16</volume><fpage>1517</fpage><lpage>1528</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.5655606</pubid><pubid idtype="pmcid">1665635</pubid><pubid idtype="pmpid">17053089</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Molecular mechanisms of cell-type determination in budding yeast.</p></title><aug><au><snm>Johnson</snm><fnm>AD</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>1995</pubdate><volume>5</volume><fpage>552</fpage><lpage>558</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0959-437X(95)80022-0</pubid><pubid idtype="pmpid" link="fulltext">8664541</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Virus induction of human IFN beta gene expression requires the assembly of an enhanceosome.</p></title><aug><au><snm>Thanos</snm><fnm>D</fnm></au><au><snm>Maniatis</snm><fnm>T</fnm></au></aug><source>Cell</source><pubdate>1995</pubdate><volume>83</volume><fpage>1091</fpage><lpage>1100</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(95)90136-1</pubid><pubid idtype="pmpid" link="fulltext">8548797</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Transcriptional regulatory cascades in development: initial rates, not steady state, determine network kinetics.</p></title><aug><au><snm>Bolouri</snm><fnm>H</fnm></au><au><snm>Davidson</snm><fnm>EH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2003</pubdate><volume>100</volume><fpage>9371</fpage><lpage>9376</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.1533293100</pubid><pubid idtype="pmcid">170925</pubid><pubid idtype="pmpid">12883007</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Global regulatory logic for specification of an embryonic cell lineage.</p></title><aug><au><snm>Oliveri</snm><fnm>P</fnm></au><au><snm>Tu</snm><fnm>Q</fnm></au><au><snm>Davidson</snm><fnm>EH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>5955</fpage><lpage>5962</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0711220105</pubid><pubid idtype="pmcid">2329687</pubid><pubid idtype="pmpid">18413610</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Hox specificity unique roles for cofactors and collaborators.</p></title><aug><au><snm>Mann</snm><fnm>RS</fnm></au><au><snm>Lelli</snm><fnm>KM</fnm></au><au><snm>Joshi</snm><fnm>R</fnm></au></aug><source>Curr Top Dev Biol</source><pubdate>2009</pubdate><volume>88</volume><fpage>63</fpage><lpage>101</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmcid">2810641</pubid><pubid idtype="pmpid">19651302</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Two protein-binding sites in chromatin implicated in the activation of heat shock genes.</p></title><aug><au><snm>Wu</snm><fnm>C</fnm></au></aug><source>Nature</source><pubdate>1984</pubdate><volume>309</volume><fpage>229</fpage><lpage>234</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/309229a0</pubid><pubid idtype="pmpid">6325944</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Architectural variations of inducible eukaryotic promoters: preset and remodeling chromatin structures.</p></title><aug><au><snm>Wallrath</snm><fnm>LL</fnm></au><au><snm>Lu</snm><fnm>Q</fnm></au><au><snm>Granok</snm><fnm>H</fnm></au><au><snm>Elgin</snm><fnm>SC</fnm></au></aug><source>Bioessays</source><pubdate>1994</pubdate><volume>16</volume><fpage>165</fpage><lpage>170</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.950160306</pubid><pubid idtype="pmpid">8166669</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Controlling the double helix.</p></title><aug><au><snm>Felsenfeld</snm><fnm>G</fnm></au><au><snm>Groudine</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2003</pubdate><volume>421</volume><fpage>448</fpage><lpage>453</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature01411</pubid><pubid idtype="pmpid" link="fulltext">12540921</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>The role of chromatin during transcription.</p></title><aug><au><snm>Li</snm><fnm>B</fnm></au><au><snm>Carey</snm><fnm>M</fnm></au><au><snm>Workman</snm><fnm>JL</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>128</volume><fpage>707</fpage><lpage>719</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.01.015</pubid><pubid idtype="pmpid" link="fulltext">17320508</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The complex language of chromatin regulation during transcription.</p></title><aug><au><snm>Berger</snm><fnm>SL</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>447</volume><fpage>407</fpage><lpage>412</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05915</pubid><pubid idtype="pmpid" link="fulltext">17522673</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Nucleosome retention and the stochastic nature of promoter chromatin remodeling for transcription.</p></title><aug><au><snm>Boeger</snm><fnm>H</fnm></au><au><snm>Griesenbeck</snm><fnm>J</fnm></au><au><snm>Kornberg</snm><fnm>RD</fnm></au></aug><source>Cell</source><pubdate>2008</pubdate><volume>133</volume><fpage>716</fpage><lpage>726</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2008.02.051</pubid><pubid idtype="pmcid">2409070</pubid><pubid idtype="pmpid">18485878</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Nucleosome destabilization in the epigenetic regulation of gene expression.</p></title><aug><au><snm>Henikoff</snm><fnm>S</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2008</pubdate><volume>9</volume><fpage>15</fpage><lpage>26</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2206</pubid><pubid idtype="pmpid" link="fulltext">18059368</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>The logic of chromatin architecture and remodelling at promoters.</p></title><aug><au><snm>Cairns</snm><fnm>BR</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>461</volume><fpage>193</fpage><lpage>198</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08450</pubid><pubid idtype="pmpid" link="fulltext">19741699</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>A RSC/nucleosome complex determines chromatin architecture and facilitates activator binding.</p></title><aug><au><snm>Floer</snm><fnm>M</fnm></au><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Prabhu</snm><fnm>V</fnm></au><au><snm>Berrozpe</snm><fnm>G</fnm></au><au><snm>Narayan</snm><fnm>S</fnm></au><au><snm>Spagna</snm><fnm>D</fnm></au><au><snm>Alvarez</snm><fnm>D</fnm></au><au><snm>Kendall</snm><fnm>J</fnm></au><au><snm>Krasnitz</snm><fnm>A</fnm></au><au><snm>Stepansky</snm><fnm>A</fnm></au><au><snm>Hicks</snm><fnm>J</fnm></au><au><snm>Bryant</snm><fnm>GO</fnm></au><au><snm>Ptashne</snm><fnm>M</fnm></au></aug><source>Cell</source><pubdate>2010</pubdate><volume>141</volume><fpage>407</fpage><lpage>418</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2010.03.048</pubid><pubid idtype="pmpid" link="fulltext">20434983</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Chromatin unfolds.</p></title><aug><au><snm>Felsenfeld</snm><fnm>G</fnm></au></aug><source>Cell</source><pubdate>1996</pubdate><volume>86</volume><fpage>13</fpage><lpage>19</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(00)80073-2</pubid><pubid idtype="pmpid" link="fulltext">8689680</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Accessibility of transcriptionally inactive genes in specifically reduced at homeoprotein-DNA binding sites in <it>Drosophila</it>.</p></title><aug><au><snm>Carr</snm><fnm>A</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><fpage>2839</fpage><lpage>2846</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.14.2839</pubid><pubid idtype="pmcid">102649</pubid><pubid idtype="pmpid">10908343</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Transcription factor access to promoter elements.</p></title><aug><au><snm>Morse</snm><fnm>RH</fnm></au></aug><source>J Cell Biochem</source><pubdate>2007</pubdate><volume>102</volume><fpage>560</fpage><lpage>570</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcb.21493</pubid><pubid idtype="pmpid" link="fulltext">17668451</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>High-resolution mapping and characterization of open chromatin across the genome.</p></title><aug><au><snm>Boyle</snm><fnm>AP</fnm></au><au><snm>Davis</snm><fnm>S</fnm></au><au><snm>Shulha</snm><fnm>HP</fnm></au><au><snm>Meltzer</snm><fnm>P</fnm></au><au><snm>Margulies</snm><fnm>EH</fnm></au><au><snm>Weng</snm><fnm>Z</fnm></au><au><snm>Furey</snm><fnm>TS</fnm></au><au><snm>Crawford</snm><fnm>GE</fnm></au></aug><source>Cell</source><pubdate>2008</pubdate><volume>132</volume><fpage>311</fpage><lpage>322</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.12.014</pubid><pubid idtype="pmcid">2669738</pubid><pubid idtype="pmpid">18243105</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Interaction of the glucocorticoid receptor with the chromatin landscape.</p></title><aug><au><snm>John</snm><fnm>S</fnm></au><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Johnson</snm><fnm>TA</fnm></au><au><snm>Sung</snm><fnm>MH</fnm></au><au><snm>Biddie</snm><fnm>SC</fnm></au><au><snm>Lightman</snm><fnm>SL</fnm></au><au><snm>Voss</snm><fnm>TC</fnm></au><au><snm>Davis</snm><fnm>SR</fnm></au><au><snm>Meltzer</snm><fnm>PS</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Hager</snm><fnm>GL</fnm></au></aug><source>Mol Cell</source><pubdate>2008</pubdate><volume>29</volume><fpage>611</fpage><lpage>624</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2008.02.010</pubid><pubid idtype="pmpid" link="fulltext">18342607</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Histone modifications at human enhancers reflect global cell-type-specific gene expression.</p></title><aug><au><snm>Heintzman</snm><fnm>ND</fnm></au><au><snm>Hon</snm><fnm>GC</fnm></au><au><snm>Hawkins</snm><fnm>RD</fnm></au><au><snm>Kheradpour</snm><fnm>P</fnm></au><au><snm>Stark</snm><fnm>A</fnm></au><au><snm>Harp</snm><fnm>LF</fnm></au><au><snm>Ye</snm><fnm>Z</fnm></au><au><snm>Lee</snm><fnm>LK</fnm></au><au><snm>Stuart</snm><fnm>RK</fnm></au><au><snm>Ching</snm><fnm>CW</fnm></au><au><snm>Ching</snm><fnm>KA</fnm></au><au><snm>Antosiewicz-Bourget</snm><fnm>JE</fnm></au><au><snm>Liu</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>X</fnm></au><au><snm>Green</snm><fnm>RD</fnm></au><au><snm>Lobanenkov</snm><fnm>VV</fnm></au><au><snm>Stewart</snm><fnm>R</fnm></au><au><snm>Thomson</snm><fnm>JA</fnm></au><au><snm>Crawford</snm><fnm>GE</fnm></au><au><snm>Kellis</snm><fnm>M</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>459</volume><fpage>108</fpage><lpage>112</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07829</pubid><pubid idtype="pmcid">2910248</pubid><pubid idtype="pmpid">19295514</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Regulation of segmentation and segmental identity by <it>Drosophila </it>homeoproteins: the role of DNA binding in functional activity and specificity.</p></title><aug><au><snm>Biggin</snm><fnm>MD</fnm></au><au><snm>McGinnis</snm><fnm>W</fnm></au></aug><source>Development</source><pubdate>1997</pubdate><volume>124</volume><fpage>4425</fpage><lpage>4433</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9409661</pubid></xrefbib></bibl><bibl id="B47"><title><p>Role of trans-activating proteins in the generation of active chromatin at the PHO5 promoter in <it>S. cerevisiae</it>.</p></title><aug><au><snm>Fascher</snm><fnm>KD</fnm></au><au><snm>Schmitz</snm><fnm>J</fnm></au><au><snm>Horz</snm><fnm>W</fnm></au></aug><source>EMBO J</source><pubdate>1990</pubdate><volume>9</volume><fpage>2523</fpage><lpage>2528</lpage><xrefbib><pubidlist><pubid idtype="pmcid">552282</pubid><pubid idtype="pmpid">2196175</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Facilitated binding of GAL4 and heat shock factor to nucleosomal templates: differential function of DNA-binding domains.</p></title><aug><au><snm>Taylor</snm><fnm>IC</fnm></au><au><snm>Workman</snm><fnm>JL</fnm></au><au><snm>Schuetz</snm><fnm>TJ</fnm></au><au><snm>Kingston</snm><fnm>RE</fnm></au></aug><source>Genes Dev</source><pubdate>1991</pubdate><volume>5</volume><fpage>1285</fpage><lpage>1298</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.5.7.1285</pubid><pubid idtype="pmpid" link="fulltext">2065977</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Transcription factor loading on the MMTV promoter: a bimodal mechanism for promoter activation.</p></title><aug><au><snm>Archer</snm><fnm>TK</fnm></au><au><snm>Lefebvre</snm><fnm>P</fnm></au><au><snm>Wolford</snm><fnm>RG</fnm></au><au><snm>Hager</snm><fnm>GL</fnm></au></aug><source>Science</source><pubdate>1992</pubdate><volume>255</volume><fpage>1573</fpage><lpage>1576</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1347958</pubid><pubid idtype="pmpid" link="fulltext">1347958</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Binding of disparate transcriptional activators to nucleosomal DNA is inherently cooperative.</p></title><aug><au><snm>Adams</snm><fnm>CC</fnm></au><au><snm>Workman</snm><fnm>JL</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1995</pubdate><volume>15</volume><fpage>1405</fpage><lpage>1421</lpage><xrefbib><pubidlist><pubid idtype="pmcid">230365</pubid><pubid idtype="pmpid">7862134</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>A model for the cooperative binding of eukaryotic regulatory proteins to nucleosomal target sites.</p></title><aug><au><snm>Polach</snm><fnm>KJ</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au></aug><source>J Mol Biol</source><pubdate>1996</pubdate><volume>258</volume><fpage>800</fpage><lpage>812</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1996.0288</pubid><pubid idtype="pmpid" link="fulltext">8637011</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Evidence for two modes of cooperative DNA binding <it>in vivo </it>that do not involve direct protein-protein interactions.</p></title><aug><au><snm>Vashee</snm><fnm>S</fnm></au><au><snm>Melcher</snm><fnm>K</fnm></au><au><snm>Ding</snm><fnm>WV</fnm></au><au><snm>Johnston</snm><fnm>SA</fnm></au><au><snm>Kodadek</snm><fnm>T</fnm></au></aug><source>Curr Biol</source><pubdate>1998</pubdate><volume>8</volume><fpage>452</fpage><lpage>458</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0960-9822(98)70179-4</pubid><pubid idtype="pmpid" link="fulltext">9550700</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Collaborative competition mechanism for gene activation <it>in vivo</it>.</p></title><aug><au><snm>Miller</snm><fnm>JA</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2003</pubdate><volume>23</volume><fpage>1623</fpage><lpage>1632</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.23.5.1623-1632.2003</pubid><pubid idtype="pmcid">151720</pubid><pubid idtype="pmpid">12588982</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Chromatin-dependent cooperativity between site-specific transcription factors <it>in vivo</it>.</p></title><aug><au><snm>Hebbar</snm><fnm>PB</fnm></au><au><snm>Archer</snm><fnm>TK</fnm></au></aug><source>J Biol Chem</source><pubdate>2007</pubdate><volume>282</volume><fpage>8284</fpage><lpage>8291</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M610554200</pubid><pubid idtype="pmcid">2528297</pubid><pubid idtype="pmpid">17186943</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>An ensemble model of competitive multi-factor binding of the genome.</p></title><aug><au><snm>Wasson</snm><fnm>T</fnm></au><au><snm>Hartemink</snm><fnm>AJ</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>2101</fpage><lpage>2112</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.093450.109</pubid><pubid idtype="pmcid">2775586</pubid><pubid idtype="pmpid">19720867</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Nucleosome-mediated cooperativity between transcription factors.</p></title><aug><au><snm>Mirny</snm><fnm>L</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2010</pubdate><volume>107</volume><fpage>22534</fpage><lpage>22539</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0913805107</pubid><pubid idtype="pmpid" link="fulltext">21149679</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>The SWI-SNF complex: a chromatin remodeling machine?</p></title><aug><au><snm>Peterson</snm><fnm>CL</fnm></au><au><snm>Tamkun</snm><fnm>JW</fnm></au></aug><source>Trends Biochem Sci</source><pubdate>1995</pubdate><volume>20</volume><fpage>143</fpage><lpage>146</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0968-0004(00)88990-2</pubid><pubid idtype="pmpid" link="fulltext">7770913</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Eukaryotic transcription: an interlaced network of transcription factors and chromatin-modifying machines.</p></title><aug><au><snm>Kadonaga</snm><fnm>JT</fnm></au></aug><source>Cell</source><pubdate>1998</pubdate><volume>92</volume><fpage>307</fpage><lpage>313</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(00)80924-1</pubid><pubid idtype="pmpid" link="fulltext">9476891</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>The 5' end of <it>Drosophila </it>heat shock genes in chromatin are hypersensitive to DNAse I.</p></title><aug><au><snm>Wu</snm><fnm>C</fnm></au></aug><source>Nature</source><pubdate>1980</pubdate><volume>286</volume><fpage>854</fpage><lpage>860</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/286854a0</pubid><pubid idtype="pmpid">6774262</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Anatomy of hypersensitive sites.</p></title><aug><au><snm>Elgin</snm><fnm>SC</fnm></au></aug><source>Nature</source><pubdate>1984</pubdate><volume>309</volume><fpage>213</fpage><lpage>214</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/309213a0</pubid><pubid idtype="pmpid">6325942</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Nuclease hypersensitive sites in chromatin.</p></title><aug><au><snm>Gross</snm><fnm>DS</fnm></au><au><snm>Garrard</snm><fnm>WT</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>1988</pubdate><volume>57</volume><fpage>159</fpage><lpage>197</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.bi.57.070188.001111</pubid><pubid idtype="pmpid" link="fulltext">3052270</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Global mapping of protein-DNA interactions <it>in vivo </it>by digital genomic footprinting.</p></title><aug><au><snm>Hesselberth</snm><fnm>JR</fnm></au><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Sandstrom</snm><fnm>R</fnm></au><au><snm>Reynolds</snm><fnm>AP</fnm></au><au><snm>Thurman</snm><fnm>RE</fnm></au><au><snm>Neph</snm><fnm>S</fnm></au><au><snm>Kuehn</snm><fnm>MS</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au><au><snm>Fields</snm><fnm>S</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>283</fpage><lpage>289</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.1313</pubid><pubid idtype="pmcid">2668528</pubid><pubid idtype="pmpid">19305407</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>CCCTC-binding factor and the transcription factor T-bet orchestrate T helper 1 cell-specific structure and function at the interferon-gamma locus.</p></title><aug><au><snm>Sekimata</snm><fnm>M</fnm></au><au><snm>Perez-Melgosa</snm><fnm>M</fnm></au><au><snm>Miller</snm><fnm>SA</fnm></au><au><snm>Weinmann</snm><fnm>AS</fnm></au><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Sandstrom</snm><fnm>R</fnm></au><au><snm>Dorschner</snm><fnm>MO</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Wilson</snm><fnm>CB</fnm></au></aug><source>Immunity</source><pubdate>2009</pubdate><volume>31</volume><fpage>551</fpage><lpage>564</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.immuni.2009.08.021</pubid><pubid idtype="pmcid">2810421</pubid><pubid idtype="pmpid">19818655</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><aug><au><snm>Campos-Ortega</snm><fnm>JA</fnm></au><au><snm>Hartenstein</snm><fnm>V</fnm></au></aug><source>The Embryonic Development of Drosophila melanogaster</source><publisher>Berlin: Springer-Verlag</publisher><edition>2</edition><pubdate>1997</pubdate></bibl><bibl id="B65"><title><p>The specificity of protein-DNA crosslinking by formaldehyde: <it>in vitro </it>and in <it>Drosophila </it>embryos.</p></title><aug><au><snm>Toth</snm><fnm>J</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><fpage>e4</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.2.e4</pubid><pubid idtype="pmcid">102536</pubid><pubid idtype="pmpid">10606672</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Mapping accessible chromatin regions using Sono-Seq.</p></title><aug><au><snm>Auerbach</snm><fnm>RK</fnm></au><au><snm>Euskirchen</snm><fnm>G</fnm></au><au><snm>Rozowsky</snm><fnm>J</fnm></au><au><snm>Lamarre-Vincent</snm><fnm>N</fnm></au><au><snm>Moqtaderi</snm><fnm>Z</fnm></au><au><snm>Lefrancois</snm><fnm>P</fnm></au><au><snm>Struhl</snm><fnm>K</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>14926</fpage><lpage>14931</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0905443106</pubid><pubid idtype="pmcid">2736440</pubid><pubid idtype="pmpid">19706456</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>The eve stripe 2 enhancer employs multiple modes of transcriptional synergy.</p></title><aug><au><snm>Arnosti</snm><fnm>DN</fnm></au><au><snm>Barolo</snm><fnm>S</fnm></au><au><snm>Levine</snm><fnm>M</fnm></au><au><snm>Small</snm><fnm>S</fnm></au></aug><source>Development</source><pubdate>1996</pubdate><volume>122</volume><fpage>205</fpage><lpage>214</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">8565831</pubid></xrefbib></bibl><bibl id="B68"><title><p>Analysis of an even-skipped rescue transgene reveals both composite and discrete neuronal and early blastoderm enhancers, and multi-stripe positioning by gap gene repressor gradients.</p></title><aug><au><snm>Fujioka</snm><fnm>M</fnm></au><au><snm>Emi-Sarker</snm><fnm>Y</fnm></au><au><snm>Yusibova</snm><fnm>GL</fnm></au><au><snm>Goto</snm><fnm>T</fnm></au><au><snm>Jaynes</snm><fnm>JB</fnm></au></aug><source>Development</source><pubdate>1999</pubdate><volume>126</volume><fpage>2527</fpage><lpage>2538</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2778309</pubid><pubid idtype="pmpid">10226011</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>A self-organizing system of repressor gradients establishes segmental complexity in <it>Drosophila</it>.</p></title><aug><au><snm>Clyde</snm><fnm>DE</fnm></au><au><snm>Corado</snm><fnm>MS</fnm></au><au><snm>Wu</snm><fnm>X</fnm></au><au><snm>Pare</snm><fnm>A</fnm></au><au><snm>Papatsenko</snm><fnm>D</fnm></au><au><snm>Small</snm><fnm>S</fnm></au></aug><source>Nature</source><pubdate>2003</pubdate><volume>426</volume><fpage>849</fpage><lpage>853</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02189</pubid><pubid idtype="pmpid" link="fulltext">14685241</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in <it>Drosophila</it>.</p></title><aug><au><snm>Halfon</snm><fnm>MS</fnm></au><au><snm>Gallo</snm><fnm>SM</fnm></au><au><snm>Bergman</snm><fnm>CM</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><fpage>D594</fpage><lpage>598</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkm876</pubid><pubid idtype="pmcid">2238825</pubid><pubid idtype="pmpid">18039705</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Regulation of POU genes by castor and hunchback establishes layered compartments in the <it>Drosophila </it>CNS.</p></title><aug><au><snm>Kambadur</snm><fnm>R</fnm></au><au><snm>Koizumi</snm><fnm>K</fnm></au><au><snm>Stivers</snm><fnm>C</fnm></au><au><snm>Nagle</snm><fnm>J</fnm></au><au><snm>Poole</snm><fnm>SJ</fnm></au><au><snm>Odenwald</snm><fnm>WF</fnm></au></aug><source>Genes Dev</source><pubdate>1998</pubdate><volume>12</volume><fpage>246</fpage><lpage>260</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.12.2.246</pubid><pubid idtype="pmcid">316437</pubid><pubid idtype="pmpid">9436984</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Molecular integration of inductive and mesoderm-intrinsic inputs governs even-skipped enhancer activity in a subset of pericardial and dorsal muscle progenitors.</p></title><aug><au><snm>Knirr</snm><fnm>S</fnm></au><au><snm>Frasch</snm><fnm>M</fnm></au></aug><source>Dev Biol</source><pubdate>2001</pubdate><volume>238</volume><fpage>13</fpage><lpage>26</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/dbio.2001.0397</pubid><pubid idtype="pmpid" link="fulltext">11783990</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>TGF-beta family signal transduction in <it>Drosophila </it>development: from Mad to Smads.</p></title><aug><au><snm>Raftery</snm><fnm>LA</fnm></au><au><snm>Sutherland</snm><fnm>DJ</fnm></au></aug><source>Dev Biol</source><pubdate>1999</pubdate><volume>210</volume><fpage>251</fpage><lpage>268</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/dbio.1999.9282</pubid><pubid idtype="pmpid" link="fulltext">10357889</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>Stepwise formation of a SMAD activity gradient during dorsal-ventral patterning of the <it>Drosophila </it>embryo.</p></title><aug><au><snm>Sutherland</snm><fnm>DJ</fnm></au><au><snm>Li</snm><fnm>M</fnm></au><au><snm>Liu</snm><fnm>XQ</fnm></au><au><snm>Stefancsik</snm><fnm>R</fnm></au><au><snm>Raftery</snm><fnm>LA</fnm></au></aug><source>Development</source><pubdate>2003</pubdate><volume>130</volume><fpage>5705</fpage><lpage>5716</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1242/dev.00801</pubid><pubid idtype="pmpid" link="fulltext">14534137</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Quantitative models of the mechanisms that control genome-wide patterns of transcription factor binding during early <it>Drosophila </it>development.</p></title><aug><au><snm>Kaplan</snm><fnm>T</fnm></au><au><snm>Li</snm><fnm>XY</fnm></au><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Thomas</snm><fnm>S</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au><au><snm>Eisen</snm><fnm>MB</fnm></au></aug><source>PLoS Genet</source><pubdate>2011</pubdate><volume>7</volume><fpage>e1001290</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1001290</pubid><pubid idtype="pmcid">3033374</pubid><pubid idtype="pmpid">21304941</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Chromatin accessibility pre-determines glucocorticoid receptor binding patterns.</p></title><aug><au><snm>John</snm><fnm>S</fnm></au><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Thurman</snm><fnm>RE</fnm></au><au><snm>Sung</snm><fnm>MH</fnm></au><au><snm>Biddie</snm><fnm>SC</fnm></au><au><snm>Johnson</snm><fnm>TA</fnm></au><au><snm>Hager</snm><fnm>GL</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au></aug><source>Nat Genet</source><pubdate>2011</pubdate><volume>43</volume><fpage>264</fpage><lpage>268</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.759</pubid><pubid idtype="pmpid" link="fulltext">21258342</pubid></pubidlist></xrefbib></bibl><bibl id="B77"><title><p>The general affinity of lac repressor for <it>E. coli </it>DNA: Implications for gene regulation in procaryotes and eukaryotes.</p></title><aug><au><snm>Lin</snm><fnm>S</fnm></au><au><snm>Riggs</snm><fnm>AD</fnm></au></aug><source>Cell</source><pubdate>1975</pubdate><volume>4</volume><fpage>107</fpage><lpage>111</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(75)90116-6</pubid><pubid idtype="pmpid" link="fulltext">1092468</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>Nonspecific DNA binding of genome regulating proteins as a biological control mechanism: 1. The lac operon: Equilibrium aspects.</p></title><aug><au><snm>von Hippel</snm><fnm>PH</fnm></au><au><snm>Revzin</snm><fnm>A</fnm></au><au><snm>Gross</snm><fnm>CA</fnm></au><au><snm>Wang</snm><fnm>AC</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1974</pubdate><volume>71</volume><fpage>4808</fpage><lpage>4812</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.71.12.4808</pubid><pubid idtype="pmcid">433986</pubid><pubid idtype="pmpid">4612528</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>Comparison of protein binding to DNA <it>in vivo </it>and <it>in vitro</it>: defining an effective intracellular target.</p></title><aug><au><snm>Yang</snm><fnm>Sw</fnm></au><au><snm>Nash</snm><fnm>HA</fnm></au></aug><source>EMBO J</source><pubdate>1995</pubdate><volume>14</volume><fpage>6292</fpage><lpage>6300</lpage><xrefbib><pubidlist><pubid idtype="pmcid">394753</pubid><pubid idtype="pmpid">8557048</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>Specific gain- and loss-of-function phenotypes induced by satellite-specific DNA-binding drugs fed to <it>Drosophila melanogaster</it>.</p></title><aug><au><snm>Janssen</snm><fnm>S</fnm></au><au><snm>Cuvier</snm><fnm>O</fnm></au><au><snm>Muller</snm><fnm>M</fnm></au><au><snm>Laemmli</snm><fnm>UK</fnm></au></aug><source>Mol Cell</source><pubdate>2000</pubdate><volume>6</volume><fpage>1013</fpage><lpage>1024</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(00)00100-3</pubid><pubid idtype="pmpid" link="fulltext">11106741</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Global nature of dynamic protein-chromatin interactions <it>in vivo</it>: three-dimensional genome scanning and dynamic interaction networks of chromatin proteins.</p></title><aug><au><snm>Phair</snm><fnm>RD</fnm></au><au><snm>Scaffidi</snm><fnm>P</fnm></au><au><snm>Elbi</snm><fnm>C</fnm></au><au><snm>Vecerova</snm><fnm>J</fnm></au><au><snm>Dey</snm><fnm>A</fnm></au><au><snm>Ozato</snm><fnm>K</fnm></au><au><snm>Brown</snm><fnm>DT</fnm></au><au><snm>Hager</snm><fnm>G</fnm></au><au><snm>Bustin</snm><fnm>M</fnm></au><au><snm>Misteli</snm><fnm>T</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2004</pubdate><volume>24</volume><fpage>6393</fpage><lpage>6402</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.24.14.6393-6402.2004</pubid><pubid idtype="pmcid">434243</pubid><pubid idtype="pmpid">15226439</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>Functional sequestration of transcription factor activity by repetitive DNA.</p></title><aug><au><snm>Liu</snm><fnm>X</fnm></au><au><snm>Wu</snm><fnm>B</fnm></au><au><snm>Szary</snm><fnm>J</fnm></au><au><snm>Kofoed</snm><fnm>EM</fnm></au><au><snm>Schaufele</snm><fnm>F</fnm></au></aug><source>J Biol Chem</source><pubdate>2007</pubdate><volume>282</volume><fpage>20868</fpage><lpage>20876</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M702547200</pubid><pubid idtype="pmpid" link="fulltext">17526489</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>Recognition of specific DNA sequences.</p></title><aug><au><snm>Garvie</snm><fnm>CW</fnm></au><au><snm>Wolberger</snm><fnm>C</fnm></au></aug><source>Mol Cell</source><pubdate>2001</pubdate><volume>8</volume><fpage>937</fpage><lpage>946</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(01)00392-6</pubid><pubid idtype="pmpid" link="fulltext">11741530</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>Expression, modification, and localization of the fushi tarazu protein in <it>Drosophila </it>embryos.</p></title><aug><au><snm>Krause</snm><fnm>HM</fnm></au><au><snm>Klemenz</snm><fnm>R</fnm></au><au><snm>Gehring</snm><fnm>WJ</fnm></au></aug><source>Genes Dev</source><pubdate>1988</pubdate><volume>2</volume><fpage>1021</fpage><lpage>1036</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.2.8.1021</pubid><pubid idtype="pmpid" link="fulltext">3049237</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>The effects of selection against spurious transcription factor binding sites.</p></title><aug><au><snm>Hahn</snm><fnm>MW</fnm></au><au><snm>Stajich</snm><fnm>JE</fnm></au><au><snm>Wray</snm><fnm>GA</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2003</pubdate><volume>20</volume><fpage>901</fpage><lpage>906</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msg096</pubid><pubid idtype="pmpid" link="fulltext">12716998</pubid></pubidlist></xrefbib></bibl><bibl id="B86"><title><p>A computational genomics approach to the identification of gene networks.</p></title><aug><au><snm>Wagner</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><fpage>3594</fpage><lpage>3604</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.18.3594</pubid><pubid idtype="pmcid">146952</pubid><pubid idtype="pmpid">9278479</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>Identification of regulatory regions which confer muscle-specific gene expression.</p></title><aug><au><snm>Wasserman</snm><fnm>WW</fnm></au><au><snm>Fickett</snm><fnm>JW</fnm></au></aug><source>J Mol Biol</source><pubdate>1998</pubdate><volume>278</volume><fpage>167</fpage><lpage>181</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1998.1700</pubid><pubid idtype="pmpid" link="fulltext">9571041</pubid></pubidlist></xrefbib></bibl><bibl id="B88"><title><p>Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factors.</p></title><aug><au><snm>Kel</snm><fnm>AE</fnm></au><au><snm>Kel-Margoulis</snm><fnm>OV</fnm></au><au><snm>Farnham</snm><fnm>PJ</fnm></au><au><snm>Bartley</snm><fnm>SM</fnm></au><au><snm>Wingender</snm><fnm>E</fnm></au><au><snm>Zhang</snm><fnm>MQ</fnm></au></aug><source>J Mol Biol</source><pubdate>2001</pubdate><volume>309</volume><fpage>99</fpage><lpage>120</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.2001.4650</pubid><pubid idtype="pmpid" link="fulltext">11491305</pubid></pubidlist></xrefbib></bibl><bibl id="B89"><title><p>Transcriptional control in the segmentation gene network of <it>Drosophila</it>.</p></title><aug><au><snm>Schroeder</snm><fnm>MD</fnm></au><au><snm>Pearce</snm><fnm>M</fnm></au><au><snm>Fak</snm><fnm>J</fnm></au><au><snm>Fan</snm><fnm>H</fnm></au><au><snm>Unnerstall</snm><fnm>U</fnm></au><au><snm>Emberly</snm><fnm>E</fnm></au><au><snm>Rajewsky</snm><fnm>N</fnm></au><au><snm>Siggia</snm><fnm>ED</fnm></au><au><snm>Gaul</snm><fnm>U</fnm></au></aug><source>PLoS Biol</source><pubdate>2004</pubdate><volume>2</volume><fpage>E271</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0020271</pubid><pubid idtype="pmcid">514885</pubid><pubid idtype="pmpid">15340490</pubid></pubidlist></xrefbib></bibl><bibl id="B90"><title><p>Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in <it>Drosophila melanogaster </it>and <it>Drosophila pseudoobscura</it>.</p></title><aug><au><snm>Berman</snm><fnm>BP</fnm></au><au><snm>Pfeiffer</snm><fnm>BD</fnm></au><au><snm>Laverty</snm><fnm>TR</fnm></au><au><snm>Salzberg</snm><fnm>SL</fnm></au><au><snm>Rubin</snm><fnm>GM</fnm></au><au><snm>Eisen</snm><fnm>MB</fnm></au><au><snm>Celniker</snm><fnm>SE</fnm></au></aug><source>Genome Biol</source><pubdate>2004</pubdate><volume>5</volume><fpage>R61</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-5-9-r61</pubid><pubid idtype="pmcid">522868</pubid><pubid idtype="pmpid">15345045</pubid></pubidlist></xrefbib></bibl><bibl id="B91"><title><p>Motif-blind, genome-wide discovery of cis-regulatory modules in <it>Drosophila </it>and mouse.</p></title><aug><au><snm>Kantorovitz</snm><fnm>MR</fnm></au><au><snm>Kazemian</snm><fnm>M</fnm></au><au><snm>Kinston</snm><fnm>S</fnm></au><au><snm>Miranda-Saavedra</snm><fnm>D</fnm></au><au><snm>Zhu</snm><fnm>Q</fnm></au><au><snm>Robinson</snm><fnm>GE</fnm></au><au><snm>Gottgens</snm><fnm>B</fnm></au><au><snm>Halfon</snm><fnm>MS</fnm></au><au><snm>Sinha</snm><fnm>S</fnm></au></aug><source>Dev Cell</source><pubdate>2009</pubdate><volume>17</volume><fpage>568</fpage><lpage>579</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.devcel.2009.09.002</pubid><pubid idtype="pmcid">2768654</pubid><pubid idtype="pmpid">19853570</pubid></pubidlist></xrefbib></bibl><bibl id="B92"><title><p>Homotypic clusters of transcription factor binding sites are a key component of human promoters and enhancers.</p></title><aug><au><snm>Gotea</snm><fnm>V</fnm></au><au><snm>Visel</snm><fnm>A</fnm></au><au><snm>Westlund</snm><fnm>JM</fnm></au><au><snm>Nobrega</snm><fnm>MA</fnm></au><au><snm>Pennacchio</snm><fnm>LA</fnm></au><au><snm>Ovcharenko</snm><fnm>I</fnm></au></aug><source>Genome Res</source><pubdate>2010</pubdate><volume>20</volume><fpage>565</fpage><lpage>577</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.104471.109</pubid><pubid idtype="pmcid">2860159</pubid><pubid idtype="pmpid">20363979</pubid></pubidlist></xrefbib></bibl><bibl id="B93"><title><p>Transcriptional activation by recruitment.</p></title><aug><au><snm>Ptashne</snm><fnm>M</fnm></au><au><snm>Gann</snm><fnm>A</fnm></au></aug><source>Nature</source><pubdate>1997</pubdate><volume>386</volume><fpage>569</fpage><lpage>577</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/386569a0</pubid><pubid idtype="pmpid" link="fulltext">9121580</pubid></pubidlist></xrefbib></bibl><bibl id="B94"><title><p>A nucleosome-guided map of transcription factor binding sites in yeast.</p></title><aug><au><snm>Narlikar</snm><fnm>L</fnm></au><au><snm>Gordan</snm><fnm>R</fnm></au><au><snm>Hartemink</snm><fnm>AJ</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2007</pubdate><volume>3</volume><fpage>e215</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0030215</pubid><pubid idtype="pmcid">2065891</pubid><pubid idtype="pmpid">17997593</pubid></pubidlist></xrefbib></bibl><bibl id="B95"><title><p>Probabilistic inference of transcription factor binding from multiple data sources.</p></title><aug><au><snm>Lahdesmaki</snm><fnm>H</fnm></au><au><snm>Rust</snm><fnm>AG</fnm></au><au><snm>Shmulevich</snm><fnm>I</fnm></au></aug><source>PLoS One</source><pubdate>2008</pubdate><volume>3</volume><fpage>e1820</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0001820</pubid><pubid idtype="pmcid">2268002</pubid><pubid idtype="pmpid">18364997</pubid></pubidlist></xrefbib></bibl><bibl id="B96"><title><p>Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences.</p></title><aug><au><snm>Ward</snm><fnm>LD</fnm></au><au><snm>Bussemaker</snm><fnm>HJ</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><fpage>i165</fpage><lpage>171</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn154</pubid><pubid idtype="pmcid">2718632</pubid><pubid idtype="pmpid">18586710</pubid></pubidlist></xrefbib></bibl><bibl id="B97"><title><p>Nucleosomal context of binding sites influences transcription factor binding affinity and gene regulation.</p></title><aug><au><snm>Dai</snm><fnm>Z</fnm></au><au><snm>Dai</snm><fnm>X</fnm></au><au><snm>Xiang</snm><fnm>Q</fnm></au><au><snm>Feng</snm><fnm>J</fnm></au></aug><source>Genomics Proteomics Bioinformatics</source><pubdate>2009</pubdate><volume>7</volume><fpage>155</fpage><lpage>162</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1672-0229(08)60045-5</pubid><pubid idtype="pmpid" link="fulltext">20172488</pubid></pubidlist></xrefbib></bibl><bibl id="B98"><title><p>High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites.</p></title><aug><au><snm>Whitington</snm><fnm>T</fnm></au><au><snm>Perkins</snm><fnm>AC</fnm></au><au><snm>Bailey</snm><fnm>TL</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>14</fpage><lpage>25</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn866</pubid><pubid idtype="pmcid">2662491</pubid><pubid idtype="pmpid">18988630</pubid></pubidlist></xrefbib></bibl><bibl id="B99"><title><p>Integrating multiple evidence sources to predict transcription factor binding in the human genome.</p></title><aug><au><snm>Ernst</snm><fnm>J</fnm></au><au><snm>Plasterer</snm><fnm>HL</fnm></au><au><snm>Simon</snm><fnm>I</fnm></au><au><snm>Bar-Joseph</snm><fnm>Z</fnm></au></aug><source>Genome Res</source><pubdate>2010</pubdate><volume>20</volume><fpage>526</fpage><lpage>536</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.096305.109</pubid><pubid idtype="pmcid">2847756</pubid><pubid idtype="pmpid">20219943</pubid></pubidlist></xrefbib></bibl><bibl id="B100"><title><p>Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites.</p></title><aug><au><snm>Ramsey</snm><fnm>SA</fnm></au><au><snm>Knijnenburg</snm><fnm>TA</fnm></au><au><snm>Kennedy</snm><fnm>KA</fnm></au><au><snm>Zak</snm><fnm>DE</fnm></au><au><snm>Gilchrist</snm><fnm>M</fnm></au><au><snm>Gold</snm><fnm>ES</fnm></au><au><snm>Johnson</snm><fnm>CD</fnm></au><au><snm>Lampano</snm><fnm>AE</fnm></au><au><snm>Litvak</snm><fnm>V</fnm></au><au><snm>Navarro</snm><fnm>G</fnm></au><au><snm>Stolyar</snm><fnm>T</fnm></au><au><snm>Aderem</snm><fnm>A</fnm></au><au><snm>Shmulevich</snm><fnm>I</fnm></au></aug><source>Bioinformatics</source><pubdate>2010</pubdate><volume>26</volume><fpage>2071</fpage><lpage>2075</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btq405</pubid><pubid idtype="pmcid">2922897</pubid><pubid idtype="pmpid">20663846</pubid></pubidlist></xrefbib></bibl><bibl id="B101"><title><p>Genome-wide prediction of transcription factor binding sites using an integrated model.</p></title><aug><au><snm>Won</snm><fnm>KJ</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au><au><snm>Wang</snm><fnm>W</fnm></au></aug><source>Genome Biol</source><pubdate>2010</pubdate><volume>11</volume><fpage>R7</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-1-r7</pubid><pubid idtype="pmcid">2847719</pubid><pubid idtype="pmpid">20096096</pubid></pubidlist></xrefbib></bibl><bibl id="B102"><title><p>DNA binding specificity of two homeodomain proteins <it>in vitro </it>and in <it>Drosophila </it>embryos.</p></title><aug><au><snm>Walter</snm><fnm>J</fnm></au><au><snm>Biggin</snm><fnm>MD</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1996</pubdate><volume>93</volume><fpage>2680</fpage><lpage>2685</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.93.7.2680</pubid><pubid idtype="pmcid">39690</pubid><pubid idtype="pmpid">8610101</pubid></pubidlist></xrefbib></bibl><bibl id="B103"><title><p>BDTNP ChIP/chip Database.</p></title><url>http://bdtnp.lbl.gov/Fly-Net/chipchip.jsp?w=summary</url></bibl><bibl id="B104"><title><p>Genome-scale mapping of DNase I sensitivity <it>in vivo </it>using tiling DNA microarrays.</p></title><aug><au><snm>Sabo</snm><fnm>PJ</fnm></au><au><snm>Kuehn</snm><fnm>MS</fnm></au><au><snm>Thurman</snm><fnm>R</fnm></au><au><snm>Johnson</snm><fnm>BE</fnm></au><au><snm>Johnson</snm><fnm>EM</fnm></au><au><snm>Cao</snm><fnm>H</fnm></au><au><snm>Yu</snm><fnm>M</fnm></au><au><snm>Rosenzweig</snm><fnm>E</fnm></au><au><snm>Goldy</snm><fnm>J</fnm></au><au><snm>Haydock</snm><fnm>A</fnm></au><au><snm>Weaver</snm><fnm>M</fnm></au><au><snm>Shafer</snm><fnm>A</fnm></au><au><snm>Lee</snm><fnm>K</fnm></au><au><snm>Neri</snm><fnm>F</fnm></au><au><snm>Humbert</snm><fnm>R</fnm></au><au><snm>Singer</snm><fnm>MA</fnm></au><au><snm>Richmond</snm><fnm>TA</fnm></au><au><snm>Dorschner</snm><fnm>MO</fnm></au><au><snm>McArthur</snm><fnm>M</fnm></au><au><snm>Hawrylycz</snm><fnm>M</fnm></au><au><snm>Green</snm><fnm>RD</fnm></au><au><snm>Navas</snm><fnm>PA</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au></aug><source>Nat Methods</source><pubdate>2006</pubdate><volume>3</volume><fpage>511</fpage><lpage>518</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth890</pubid><pubid idtype="pmpid" link="fulltext">16791208</pubid></pubidlist></xrefbib></bibl><bibl id="B105"><title><p>Subsampling methods for genomic inference.</p></title><aug><au><snm>Bickel</snm><fnm>PJ</fnm></au><au><snm>Boley</snm><fnm>N</fnm></au><au><snm>Brown</snm><fnm>JB</fnm></au><au><snm>Huang</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>NR</fnm></au></aug><source>Ann Appl Stat</source><pubdate>2010</pubdate><inpress/></bibl><bibl id="B106"><aug><au><cnm>R Development Core Team</cnm></au></aug><source>R: A Language and Environment for Statistical Computing</source><publisher>Vienna, Austria: R Foundation for Statistical Computing</publisher><pubdate>2009</pubdate></bibl><bibl id="B107"><title><p>MEME SUITE: tools for motif discovery and searching.</p></title><aug><au><snm>Bailey</snm><fnm>TL</fnm></au><au><snm>Boden</snm><fnm>M</fnm></au><au><snm>Buske</snm><fnm>FA</fnm></au><au><snm>Frith</snm><fnm>M</fnm></au><au><snm>Grant</snm><fnm>CE</fnm></au><au><snm>Clementi</snm><fnm>L</fnm></au><au><snm>Ren</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>WW</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>W202</fpage><lpage>208</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp335</pubid><pubid idtype="pmcid">2703892</pubid><pubid idtype="pmpid">19458158</pubid></pubidlist></xrefbib></bibl><bibl id="B108"><title><p>Quantifying similarity between motifs.</p></title><aug><au><snm>Gupta</snm><fnm>S</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Bailey</snm><fnm>TL</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><fpage>R24</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2007-8-2-r24</pubid><pubid idtype="pmcid">1852410</pubid><pubid idtype="pmpid">17324271</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm>
</art>