<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2006-7-9-r82</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Multiplatform genome-wide identification and modeling of functional human estrogen receptor binding sites</p>
         </title>
         <aug>
            <au ca="yes" id="A1" ce="yes">
               <snm>Vega</snm>
               <mi>B</mi>
               <fnm>Vinsensius</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>vegav@gis.a-star.edu.sg</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Lin</snm>
               <fnm>Chin-Yo</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>chinyolin@byu.edu</email>
            </au>
            <au id="A3">
               <snm>Lai</snm>
               <mnm>Siew</mnm>
               <fnm>Koon</fnm>
               <insr iid="I1"/>
               <email>kslai01@yahoo.com</email>
            </au>
            <au id="A4">
               <snm>Li Kong</snm>
               <fnm>Say</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>kongsl@gis.a-star.edu.sg</email>
            </au>
            <au id="A5">
               <snm>Xie</snm>
               <fnm>Min</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>xiem@gis.a-star.edu.sg</email>
            </au>
            <au id="A6">
               <snm>Su</snm>
               <fnm>Xiaodi</fnm>
               <insr iid="I5"/>
               <email>xd-su@imre.a-star.edu.sg</email>
            </au>
            <au id="A7">
               <snm>Teh</snm>
               <mnm>Fang</mnm>
               <fnm>Huey</fnm>
               <insr iid="I5"/>
               <email>hf-teh@imre.a-star.edu.sg</email>
            </au>
            <au id="A8">
               <snm>Thomsen</snm>
               <mi>S</mi>
               <fnm>Jane</fnm>
               <insr iid="I1"/>
               <email>thomsenj@gis.a-star.edu.sg</email>
            </au>
            <au id="A9">
               <snm>Li Yeo</snm>
               <fnm>Ai</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>bluz_ocean@yahoo.com.sg</email>
            </au>
            <au id="A10">
               <snm>Sung</snm>
               <mnm>Kin</mnm>
               <fnm>Wing</fnm>
               <insr iid="I2"/>
               <email>sungk@gis.a-star.edu.sg</email>
            </au>
            <au id="A11">
               <snm>Bourque</snm>
               <fnm>Guillaume</fnm>
               <insr iid="I2"/>
               <email>bourque@gis.a-star.edu.sg</email>
            </au>
            <au id="A12" ca="yes">
               <snm>Liu</snm>
               <mi>T</mi>
               <fnm>Edison</fnm>
               <insr iid="I1"/>
               <email>liue@gis.a-star.edu.sg</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Estrogen Receptor Biology Program, Genome Institute of Singapore, 60 Biopolis Street, Republic of Singapore 138672</p>
            </ins>
            <ins id="I2">
               <p>Information and Mathematical Sciences Group, Genome Institute of Singapore, 60 Biopolis Street, Republic of Singapore 138672</p>
            </ins>
            <ins id="I3">
               <p>Microarray and Expression Genomics Laboratory, Genome Institute of Singapore, 60 Biopolis Street, Republic of Singapore 138672</p>
            </ins>
            <ins id="I4">
               <p>Department of Microbiology and Molecular Biology, Brigham Young University, 753 WIDB, Provo, UT 84602, USA</p>
            </ins>
            <ins id="I5">
               <p>Institute of Materials Research and Engineering, 3, Research Link, Republic of Singapore 117602</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>9</issue>
         <fpage>R82</fpage>
         <url>http://genomebiology.com/2006/7/9/R82</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16961928</pubid>
               <pubid idtype="doi">10.1186/gb-2006-7-9-r82</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>11</day>
               <month>5</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>9</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>9</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Vega et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Human estrogen receptor binding sites</p>
      </shorttitle>
      <shortabs>
         <p>Refinement of the functional human estrogen receptor binding site model using a multi-platform genome-wide approach reveals extended binding specificity signal.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Transcription factor binding sites (TFBS) impart specificity to cellular transcriptional responses and have largely been defined by consensus motifs derived from a handful of validated sites. The low specificity of the computational predictions of TFBSs has been attributed to ubiquity of the motifs and the relaxed sequence requirements for binding. We posited that the inadequacy is due to limited input of empirically verified sites, and demonstrated a multiplatform approach to constructing a robust model.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using the TFBS for the estrogen receptor (ER)&#945; (estrogen response element [ERE]) as a model system, we extracted EREs from multiple molecular and genomic platforms whose binding to ER&#945; has been experimentally confirmed or rejected. <it>In silico </it>analyses revealed significant sequence information flanking the standard binding consensus, discriminating ERE-like sequences that bind ER&#945; from those that are nonbinders. We extended the ERE consensus by three bases, bearing a terminal G at the third position 3' and an initiator C at the third position 5', which were further validated using surface plasmon resonance spectroscopy. Our functional human ERE prediction algorithm (h-ERE) outperformed existing predictive algorithms and produced fewer than 5% false negatives upon experimental validation.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Building upon a larger experimentally validated ERE set, the h-ERE algorithm is able to demarcate better the universe of ERE-like sequences that are potential ER binders. Only 14% of the predicted optimal binding sites were utilized under the experimental conditions employed, pointing to other selective criteria not related to EREs. Other factors, in addition to primary nucleotide sequence, will ultimately determine binding site selection.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Estrogen receptors (ERs) are members of the nuclear receptor superfamily of transcription factors, which plays key roles in human development, physiology, and endocrine-related diseases <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Two ER subtypes, namely ER&#945; (<it>ESR1</it>) and ER&#946; (<it>ESR2</it>), mediate cellular responses to hormone exposure in target tissues, and receptors are directed at <it>cis</it>-regulatory sites of target genes via interactions between the zinc finger motifs in their DNA-binding domains and specific nucleotide sequence motifs termed estrogen response elements (EREs). Specificity protein (Sp)-1 and activator protein (AP)-1 transcription factors are also known to tether with ER and regulate a smaller subset of target genes through Sp1 and AP1 binding sites. The importance of these sites to the overall ER biologic response remains unclear.</p>
         <p>The consensus ERE sequence (5'-GGTCAnnnTGACC-3') was derived from conserved regulatory elements found in <it>Xenopus </it>and chicken vitellogenin genes and consists of palindromic repeats separated by a three-base spacer to accommodate interactions with receptor dimers <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Subsequent characterizations of EREs in additional target genes, however, indicate that the majority of response elements deviate from the described consensus sequence <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Furthermore, ERE-like sequences are ubiquitous in the human genome, and evidence for ER binding among the majority of ERE-like sites in estrogen response gene expression studies is apparently absent; these factors suggest that additional sequence motifs and/or chromatin features may contribute to the specificity of ER binding and transcriptional response. Recent efforts to model better the ERE by using position weight matrices (PWMs <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>) in order to describe all previously published EREs have resulted in more complete models but with a limited ability to predict <it>bona fide </it>ER binding <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. We posited that the current major challenge with construction of ERE models is the limited datasets available, both for experimentally determined ER-bound sites and for ERE-like sites that do not bind ER.</p>
         <p>In addition to compiling the known sites reported in the literature, we pursued a combined experimental and informatics approach to identify additional ER binding sites and their associated direct target genes. This information was analyzed to develop a more faithful model of the ER binding site motifs. To accomplish this, we applied three experimental strategies for ER-binding sites discovery. First, we predicted putative EREs in the promoter regions of direct target genes discovered by microarray analysis <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and then tested for ER binding at predicted sites of responsive genes by chromatin immunoprecipitation (ChIP) assays <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Second, we surveyed ER-binding sites in promoter regions of the human genome by hybridizing fluorescently-labeled ChIP DNA fragments to high-density oligonucleotide arrays ('ChIP-on-chip') with probes against about 30,000 proximal promoters (-1 kilobase [kb] to +0.2 kb relative to the transcription start sites [TSSs]). Third, we detected ER-binding sites across the genome by ChIP, followed by cloning and sequencing of bound fragments ('ChIP-and-clone'). ERE-like sites that have been validated, for binding and nonbinding, by conventional ChIP followed by quantitative polymerase chain reaction (qPCR) using site-specific primers were then used to train and test a model for functional EREs (summarized in Figure <figr fid="F1">1</figr>). In the present study, we focused on functional human EREs to minimize potential noise introduced by species-specific variation, which we have previously observed <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematics of ERE discovery and validation for model training and testing</p>
            </caption>
            <text>
               <p>Schematics of ERE discovery and validation for model training and testing. ERE, estrogen response element; ChIP, chromatin immunoprecipitation; qPCR, quantitative polymerase chain reaction.</p>
            </text>
            <graphic file="gb-2006-7-9-r82-1"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Functional estrogen receptor binding sites</p>
            </st>
            <p>We used a combination of literature search and direct experimentation to generate a list of qualified ER-binding sites. In this study we constrained ourselves to using only sites that have been validated for the modeling of functional EREs. We first extracted human ERE sequences that have been experimentally validated in the literature to either bind or not to bind ER. Klinge <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and Bourdeau and coworkers <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> each described EREs that have been validated by electrophoretic mobility shift assays, transient transfection with reporter gene constructs, or ChIP assays.</p>
            <p>Supplementing the list of confirmed EREs gleaned from the literature, we experimentally identified functional ER-binding sites using two whole-genome experimental strategies. The first strategy was to extract candidate ER-binding sites computationally from a list of putative direct ER target genes. Eighty-nine putative direct target genes were identified as genes expressed in MCF-7 cells that were responsive to estradiol treatment, sensitive to inhibition by Faslodex (ICI 182,780), and insensitive to cycloheximide <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. We then computationally surveyed 3.5 kb regions flanking the TSSs (-3 kb to +0.5 kb) of these 89 genes to identify proximate consensus EREs (allowing for deviations in up to two conserved positions of the consensus motif). Each site was then tested by ChIP assays and qPCR with site-specific primers to determine the true nature of ER binding. Eight EREs were found to be bound by ER, whereas 41 others were not found to be bound by ER.</p>
            <p>In our second approach, we performed ChIP assays on estradiol-treated breast tumor cells and detected ER-binding sites using high-density oligonucleotide microarrays (NimbleGen, Madison, WI, USA) containing probes against proximal promoter regions (-1 kb to +0.2 kb from TSS; 12 probes per promoter) of over 30,000 human known gene and RefSeq transcripts annotated in the human genome sequence hg16 (July 2003), NCBI build 34 annotation of the UCSC genome browser. The ChIP-on-chip studies were performed using duplicate array experiments on the ChIP samples and on input control DNA. The promoters that appeared among the top 5% of the binding ratio range (ER antibody versus control) for both replicates, that had at least a 15% increase, and that were supported by consistent binding ratio enrichment across more than four probes or additional evidence of ER regulation from the microarray data were selected. Putative EREs (allowing for up to two mismatches from the consensus) were then identified in the selected promoters, and some were further validated by additional ChIP and qPCR (see Materials and methods, below, for more detail). Out of the total 28 sites tested, 13 were found to bind ER whereas 15 were not. From the literature sources and experiments described above, a total of 45 validated ER-binding sites and 58 validated non-ER-binding were identified, all of which bore close resemblance to the consensus ERE (Table <tblr tid="T1">1</tblr>). Each of the 45 binders and 58 non-binders was associated with a gene and most were located in the genes' upstream regulatory regions. This list of 103 genes were used as the training set to assess the significance of ancillary sequence signals beyond the core ERE that might better predict ER binding.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Genomic coordinates of ERE-like sequences that have been experimentally validated or rejected as ER-binding</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Name</p>
                     </c>
                     <c ca="left">
                        <p>Genomic location</p>
                     </c>
                     <c ca="left">
                        <p>Pattern</p>
                     </c>
                     <c ca="left">
                        <p>Validation</p>
                     </c>
                     <c ca="left">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PDZK1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:143,215,756-143,215,768</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAccc<b><ul>A</ul></b>G<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ADORA1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:199,790,269-199,790,281</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>T</ul></b>AgggTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10] and this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ADORA1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:199,790,414-199,790,426</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>GT</ul></b>cttTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>AGT</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:227,156,613-227,156,625</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>G</ul></b>CAtcgTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,603,634-11,603,646</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAaaaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,615,324-11,615,336</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAtcaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,621,861-11,621,873</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAgtgT<b><ul>C</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,623,258-11,623,270</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAttcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[8,10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CYP1B1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:38,214,993-38,215,005</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>G</ul></b>cgcTG<b><ul>C</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CYP1B1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:38,215,049-38,215,061</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAaag<b><ul>C</ul></b>G<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LTF</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:46,481,739-46,481,751</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAagg<b><ul>C</ul></b>GA<b><ul>T</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>AREG</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr4:75,676,340-75,676,352</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAaggTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ELOVL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:11,154,748-11,154,760</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAtctTGA<b><ul>TG</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>VEGF</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:43,844,381-43,844,393</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>AA</ul></b>TCAgacTGAC<b><ul>T</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LY6E</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr8:144,170,802-144,170,814</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAagaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PTGES</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr9:129,597,654-129,597,666</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAgccTG<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,398-115,428,410</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,492-115,428,504</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>G</ul></b>gggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,572-115,428,584</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,612-115,428,624</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,652-115,428,664</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,689-115,428,701</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CASP7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:115,428,743-115,428,755</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgggTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CTSD</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:1,741,924-1,741,936</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>C<b><ul>G</ul></b>ggcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PGR</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:100,504,595-100,504,607</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAcca<b><ul>GCT</ul></b>C<b><ul>T</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PGR</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:100,505,180-100,505,192</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>CAGG</ul></b>agcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SCNN1A</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr12:6,355,536-6,355,548</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgccT<b><ul>C</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GAPDH</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr12:6,513,208-6,513,220</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAtcgTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ESR2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr14:63,879,248-63,879,260</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAggcTG<b><ul>GT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ30973</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:55,670,850-55,670,862</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>G</ul></b>CAgtgTG<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ30973</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:55,671,545-55,671,557</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAcccTG<b><ul>CT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ABCA3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:2,319,793-2,319,805</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAcggTG<b><ul>TT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[8]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IGFBP4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:35,849,113-35,849,125</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAttgTGAC<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TRIM25</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:52,323,321-52,323,333</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAtggTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4], [10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BCL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr18:59,136,673-59,136,685</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>G</ul></b>cca<b><ul>G</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MGC26694</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:19,035,118-19,035,130</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>T</ul></b>TCAgagTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GRAMD1A</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:40,182,519-40,182,531</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>C<b><ul>T</ul></b>ggcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ACTN4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:43,897,093-43,897,105</p>
                     </c>
                     <c ca="left">
                        <p>GGTCActgTGAC<b><ul>T</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GPR77</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:52,532,131-52,532,143</p>
                     </c>
                     <c ca="left">
                        <p>GGTCActcTGAC<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:6,671,884-6,671,902</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>GG</ul></b>cccTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>NRIP1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr21:15,359,833-15,359,845</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAaagTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[8]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TFF1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr21:42,659,626-42,659,638</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>C</ul></b>tggTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TFF1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr21:42,659,906-42,659,918</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>G<b><ul>C</ul></b>CAagaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TFF1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr21:42,660,106-42,660,118</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAcggTG<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>[4]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CRKL</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr22:19,595,695-19,595,707</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAatcT<b><ul>A</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TSHB</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:115,283,928-115,283,940</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgctTGAC<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TXNIP</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:142,927,222-142,927,234</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgtg<b><ul>G</ul></b>GA<b><ul>T</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LOR</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:150,045,850-150,045,862</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>C</ul></b>aaa<b><ul>G</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,622,443-11,622,455</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>G<b><ul>C</ul></b>CAccaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GREB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:11,625,143-11,625,155</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAatcTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>EN1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:119,322,563-119,322,575</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>T</ul></b>AcccTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>UGCGL1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:128,563,200-128,563,212</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAaaaTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>UGCGL1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:128,565,292-128,565,304</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAcatTGA<b><ul>G</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PLGLB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:87,884,778-87,884,790</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAgtgTG<b><ul>C</ul></b>C<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SIAH2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:151,966,545-151,966,557</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>C</ul></b>TCAtagTG<b><ul>C</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ATP13A3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:195,656,453-195,656,465</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAtta<b><ul>A</ul></b>TACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CISH</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:50,626,609-50,626,621</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>CAgag<b><ul>G</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LMCD1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:8,517,591-8,517,603</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>CCT</ul></b>gcaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ22269</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr4:673,249-673,261</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>G</ul></b>CAgagTGAC<b><ul>T</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CCNG2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr4:78,433,176-78,433,188</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAactTGA<b><ul>T</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>STC2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:172,689,912-172,689,924</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>G</ul></b>CAatgTGA<b><ul>A</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IL6ST</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:55,327,909-55,327,921</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>G</ul></b>AgcaTGA<b><ul>T</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PLK2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:57,792,972-57,792,984</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>T</ul></b>Acag<b><ul>C</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>OLIG3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:137,857,308-137,857,320</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>C</ul></b>GTCAtccT<b><ul>A</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FKBPL</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:32,206,228-32,206,240</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>CAgcc<b><ul>C</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FKBPL</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:32,206,311-32,206,323</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>C</ul></b>G<b><ul>C</ul></b>CAccaTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SERPINE1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:100,361,980-100,361,992</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>AC</ul></b>CAgccTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SERPINE1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:100,362,938-100,362,950</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>A</ul></b>CAagcTG<b><ul>C</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SERPINE1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:100,363,852-100,363,864</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAaga<b><ul>A</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TSPAN13</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:16,566,080-16,566,092</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>A</ul></b>T<b><ul>A</ul></b>AgtcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BLVRA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:43,570,289-43,570,301</p>
                     </c>
                     <c ca="left">
                        <p>GGTCActcTG<b><ul>G</ul></b>C<b><ul>T</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BLVRA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr7:43,570,774-43,570,786</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAaccT<b><ul>T</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>B4GALT1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr9:33,157,593-33,157,605</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>C</ul></b>TCAacg<b><ul>C</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>B4GALT1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr9:33,158,622-33,158,634</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>A</ul></b>TCAgaa<b><ul>G</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>DNAJC1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:22,333,030-22,333,042</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>T</ul></b>TCAactTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GAD2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:26,545,037-26,545,049</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>G</ul></b>cagTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>[10]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CXCL12</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:44,202,437-44,202,449</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>C</ul></b>agcTG<b><ul>C</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CXCL12</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:44,203,283-44,203,295</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAaaaTG<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PGR</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:100,509,203-100,509,215</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAtgtTGAC<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>DGKZ</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:46,321,832-46,321,844</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>CAtgcTG<b><ul>G</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CTSW</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr11:65,403,499-65,403,511</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>AC</ul></b>CAgccTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C14orf131</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr14:101,872,078-101,872,090</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>C</ul></b>CAacaTGAC<b><ul>A</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>DLG7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr14:54,727,987-54,727,999</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>G</ul></b>tcc<b><ul>A</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ESR2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr14:63,876,354-63,876,366</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>AC</ul></b>CAgccTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>THBS1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:37,657,943-37,657,955</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAatc<b><ul>CC</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ13710</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:69,737,514-69,737,526</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAttgT<b><ul>T</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ13710</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:69,738,257-69,738,269</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAatgTG<b><ul>CG</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FLJ13710</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:69,738,459-69,738,471</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>C</ul></b>TCActtTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SH3GL3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:82,077,053-82,077,065</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>A</ul></b>TC<b><ul>T</ul></b>tgcTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SMAP-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr15:89,278,745-89,278,757</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>A</ul></b>GTCAatcTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ABCA3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:2,321,166-2,321,178</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>T</ul></b>tttT<b><ul>T</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>HCFC1R1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:3,015,149-3,015,161</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>AC</ul></b>CAgccTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ADCY9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:4,107,737-4,107,749</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAggcTG<b><ul>GT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ADCY9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:4,108,935-4,108,947</p>
                     </c>
                     <c ca="left">
                        <p>GGT<b><ul>G</ul></b>AaaaTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CAPNS2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:54,100,244-54,100,256</p>
                     </c>
                     <c ca="left">
                        <p>GGTC<b><ul>C</ul></b>gtc<b><ul>C</ul></b>GACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PAFAH1B1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:2,441,502-2,441,514</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>C</ul></b>G<b><ul>C</ul></b>CAtgtTGACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IGFBP4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:35,851,519-35,851,531</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>A</ul></b>TCActgT<b><ul>A</ul></b>ACC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IGFBP4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:35,853,510-35,853,522</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAtgcTG<b><ul>C</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RBBP8</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr18:18,766,140-18,766,152</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAttcTG<b><ul>CT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MKNK2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:2,382,491-2,382,503</p>
                     </c>
                     <c ca="left">
                        <p>GG<b><ul>G</ul></b>CAgagTGA<b><ul>G</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BBC3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:52,426,840-52,426,852</p>
                     </c>
                     <c ca="left">
                        <p><b><ul>T</ul></b>GTCAttgTG<b><ul>T</ul></b>CC</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BBC3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:52,427,249-52,427,261</p>
                     </c>
                     <c ca="left">
                        <p>GGTCAggcTG<b><ul>GT</ul></b>C</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GPBP6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chrY:169,893-169,905</p>
                     </c>
                     <c ca="left">
                        <p>G<b><ul>C</ul></b>TCAcgaTGAC<b><ul>G</ul></b></p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                     <c ca="left">
                        <p>This study</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Shown in bold and underlined are nucleotides that deviate from the consensus core ERE. ER, estrogen receptor; ERE, estrogen response element.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Ancillary signals for ER binding around the core ERE</p>
            </st>
            <p>ER is known to interact with the 10 base pair (bp) long consensus ERE (hereafter referred to as the 'core ERE'). Presence of the consensus site (or its acceptable variants) is required for the direct binding of the ER dimer to the DNA. However, it is still unclear whether the core site alone is sufficient to signal activated ER for such binding or whether additional ER-binding signals in the sequences flanking the core can be used to distinguish binders from nonbinders. An <it>in silico </it>supervised learning experiment was devised to explore these possibilities.</p>
            <p>We modeled the problem of finding additional signals for ER binding among the sequences surrounding the core ERE as a binary classification problem (binders versus nonbinders). The features were position-specific motifs surrounding the core ERE. In other words, we asked whether there is any motif (<it>m</it>) within a definitive distance (<it>p</it>) to the core ERE that could help distinguish the binders from nonbinders. The robust and versatile na&#239;ve Bayesian classification approach was employed, with binary tuple &lt;<it>m</it>,<it>p</it>> as features, where <it>m </it>is a <it>k</it>-bp long motif and <it>p </it>is the distance between motif <it>m </it>and the core ERE. Two sets of experiments were set up. The first consisted of the core plus its flanking regions, whereas the second considered only the flanking regions of core ERE. The motif length <it>k </it>and the size of flanking regions were similarly varied in both setups. The goal was to learn whether motifs of certain length at particular distances from the core could contribute to the discrimination of binders from nonbinders. Although the results indicated that window size (<it>k</it>) of 1 bp generally outperformed the rest (Additional data file 1), the span of flanking regions did not appear to affect significantly the outcome of the two experiments.</p>
            <p>These observations suggested that additional signal for ER-binding might lie in the distribution of single nucleotides adjacent to the core ERE. This hypothesis was initially investigated by visually inspecting the sequence logo <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> constructed from the binders, including their flanking sequences. Shown in Figures <figr fid="F2">2a</figr> (for ER binders) and <figr fid="F2">2b</figr> (for nonbinders) are the logos for up to 10 flanking nucleotides. Comparison between the binders and nonbinders revealed that additional binding signals potentially came from adjacent nucleotides, specifically those up to 3 bp flanking the core ERE, which extended the consensus palindrome. A series of Monte Carlo runs, performed to estimate the probability that observing such additional signals could happen by chance alone, showed that the signals are statistically significant at 3 bp away from the core motif (Monte Carlo <it>P </it>value = 0.002 and <it>P </it>value &lt; 0.001; see Materials and methods and Additional data file 3).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Sequence logos</p>
               </caption>
               <text>
                  <p>Sequence logos. Shown are sequence logos for <b>(a) </b>the 45 ER-binding loci with 10 bp flanking sequences and <b>(b) </b>58 ER nonbinding loci with 10 bp flanking sequences. The logo for the binders exhibited additional signal at the third bases upstream and downstream of the core palindromic ERE. bp, base pairs; ER, estrogen receptor; ERE, estrogen response element.</p>
               </text>
               <graphic file="gb-2006-7-9-r82-2"/>
            </fig>
            <p>To determine the functionality for the conserved cytosine and guanine three bases upstream of the first ERE half-site and downstream of the second ERE half-site, respectively, we examined the interactions between ER and wild-type and mutant binding sites using surface plasmon resonance (SPR) spectroscopy. Purified ER was incubated with either the previously validated ERE (wild-type) adjacent to the <it>GREB1 </it>gene or mutants containing substitutions in the conserved guanine (mutant 1), the canonical half-sites (mutant 2), in the conserved guanine and the cytosine in the symmetrical position upstream of the first ERE half-site (mutant 3; see Figure <figr fid="F3">3a</figr>), and at the sixth bases upstream of the core ERE (mutant 4; see Figure <figr fid="F3">3a</figr>) as the negative control. Substitution of the conserved guanine (mutant 1) disrupted ER binding by about 40%, and, as expected, mutations in the consensus half-sites reduced binding significantly (see Figure <figr fid="F3">3b</figr>). Interestingly, substitution of the cytosine three bases upstream of the first half-site with an adenine (Figure <figr fid="F3">3b</figr>, mutant 3), in addition to the substitution in the conserved guanine adjacent to the second half-site, further diminished binding. As was also expected, the substitution outside the three bases flanking the ERE did not perturb the binding significantly. These results indicate that the conserved guanine outside of the canonical ERE, discovered by modeling novel ER binding site, is involved in mediating ER binding to the ERE.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Substitution of the conserved guanine outside of the canonical ERE disrupts ER binding</p>
               </caption>
               <text>
                  <p>Substitution of the conserved guanine outside of the canonical ERE disrupts ER binding. <b>(a) </b>Interactions between ER and wild-type and mutant EREs were measured by SPR. The canonical ERE is underlined, and the conserved guanine is indicated by an arrow. Base substitutions are indicated in bold. <b>(b) </b>Binding of ER to ERE is indicated as a percentage of binding relative to the wild-type sequence. ER, estrogen receptor; ERE, estrogen response element; SPR, surface plasmon resonance.</p>
               </text>
               <graphic file="gb-2006-7-9-r82-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Modeling functional EREs</p>
            </st>
            <p>The model we propose, h-ERE, exploits the above observation and consists of two PWMs representing the models for binders and nonbinders. The model relies on a decision tree for classifying sites into binders or nonbinders, based on the scores obtained from the individual PWMs. Two sets of 19 bp sequences, one for binders and the other for nonbinders, were formed from the core sites plus three adjacent nucleotides. We further optimized the binding EREs by minimizing the total entropy of the aligned sites (see Materials and methods), while augmenting the nonbinding EREs by taking both strands of the validated nonbinding loci when constructing the weight matrix.</p>
            <p>With this information we constructed a decision tree for the selection of high-likelihood binding EREs versus nonbinding EREs. Each matrix was used to calculate the log-likelihood of a given 19 bp site to be a binder or a non-binder. For each site two scores can be calculated, the binding score (S<sub>B</sub>) and nonbinding score (S<sub>NB</sub>). Complementing the matrices, a decision tree for distinguishing binders and nonbinders based on S<sub>B </sub>and S<sub>NB </sub>was constructed from all of the training dataset using the CART algorithm <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> implemented in R, with 100 cross-validation runs. Figure <figr fid="F4">4</figr> depicts the resultant tree. Putative binders are further subcategorized into three groups, from weak binding (group 1) to strong binding (group 3). Apart from these groupings, sites whose raw log-likelihood binding score (S<sub>B</sub>) is greater than its nonbinding (S<sub>NB</sub>) scores are potentially functional sites. Additionally, to reflect the nature of the validated sites, the model considers sequences whose core EREs have more than 4 bp mismatches with the consensus ERE, GGTCAnnnTGACC, to be non-binding.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Decision tree for ERE prediction</p>
               </caption>
               <text>
                  <p>Decision tree for ERE prediction. Group 3 EREs would be predicted to be the highest likelihood binders of ER. ER, estrogen receptor; ERE, estrogen response element; S<sub>B</sub>, binding score; S<sub>NB</sub>, nonbinding score.</p>
               </text>
               <graphic file="gb-2006-7-9-r82-4"/>
            </fig>
            <p>In all, given a 19 bp sequence, the proposed h-ERE first checks whether the core 13 bp nucleotides contains at most four mismatches to the consensus ERE. Next, based on the computed PWM scores, predictions can be made based on four stringency levels: stringent (considers only sites in group 3 to be binders), medium (predicts sites in group 3 and group 2 to be as binders), relaxed (considers sites of groups 1-3 to be binders), and loose (defines sites whose S<sub>B </sub>> S<sub>NB </sub>as binders).</p>
         </sec>
         <sec>
            <st>
               <p>Unbiased mapping of EREs</p>
            </st>
            <p>In previously described studies conducted to identify EREs, the analyses have largely focused on the 5' <it>cis</it>-regulatory regions of direct target genes. However, ChIP analysis of predicted EREs in the extended promoters of 89 putative direct target genes defined by hormone and inhibitor treatments and microarray expression data <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> indicated ER binding in only 9% of the promoter regions from genes apparently directly regulated by ER. These results suggest that ER may target binding sites outside of the canonical 5' promoter regions. Therefore, to discover additional EREs in an unbiased manner and to generate a dataset for testing model performance, we employed the 'ChIP-and-clone' strategy of cloning precipitated DNA fragments into a bacterial plasmid vector, followed by direct sequencing of the inserts to identify ER binding sites. This approach has the potential to sample any region of the genome, as opposed to PCR-based or microarray-based directed strategies, which target specific sites or functional regions, respectively. Anti-ER ChIP was performed on nuclear lysates from estradiol-treated MCF-7 cells, followed by cloning of precipitated binding sites into the pCR-Blunt (Invitrogen, Carlsbard, CA, USA) vector. From the ChIP library, a total of 1006 clones were successfully sequenced and specifically mapped to the human genome. Based on the presence of ERE-like sequences or supporting microarray expression data for ER regulation of the adjacent transcript, 33 clones were selected for subsequent validation by ChIP and site-specific qPCR. An additional 75 clones were randomly selected from those that have neither EREs nor adjacent transcript expression data for further validation (data not shown). Thus, a total of 108 clones were validated (five contained EREs and are supported by microarray expression data, 23 with only EREs and no supporting expression data, five supported by microarray but no EREs, and 75 with neither EREs nor expression data).</p>
            <p>The validation results indicate that ERE-like sequences remain the predominant feature of functional ER-binding sites. In the five clones with EREs and supporting microarray expression data for ER regulation, the validation rate was 100%; for the 23 clones that encode EREs but lack supporting expression data, the validation rate was 57% (13/23). In contrast, clones for which no ERE-like sequences were detected, the validation rates were 40% (2/5) and 9% (7/75), respectively, for those with and without supporting expression data for the adjacent gene. A total of 19 EREs were found in the 18 empirically verified ER-bound clones. Interestingly, the five validated clones that contain EREs and are adjacent to genes that were shown to be hormone regulated map to intronic regions of the target genes. This is consistent with our hypothesis that ER may bind outside of the 5' <it>cis</it>-regulatory regions of target genes. Moreover, when we tested ERE-like sequences in the promoter region of one of the target genes, <it>SIAH2</it>, we did not detect ER binding, suggesting that the intronic ERE is the functional ER binding site (data not shown) for this particular target gene. From this analysis, all EREs that bind ER and did not bind ER in the validation experiments were then used to test model performance (Table <tblr tid="T2">2</tblr>).</p>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Validation results on genomic loci containing ERE-like sequences identified by sequencing random ChIP fragment from an ER ChIP library</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Nearest gene</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Genomic location</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Pattern</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Validation</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>NBPF4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:108,492,542-108,492,560</p>
                     </c>
                     <c ca="left">
                        <p>ttaGGTCAgctTG<b><ul>T</ul></b>CCcag</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C1orf21</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr1:181,327,606-181,327,624</p>
                     </c>
                     <c ca="left">
                        <p>ctgGGTCAgcaTGACCttc</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr11:64,942,548-64,942,566</p>
                     </c>
                     <c ca="left">
                        <p>ctgGG<b><ul>G</ul></b>CAtgcT<b><ul>C</ul></b>ACCtca</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SEC15L2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr2:72,713,948-72,713,966</p>
                     </c>
                     <c ca="left">
                        <p>ggaGGTC<b><ul>T</ul></b>aggTGACCtcg</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr3:132,571,914-132,571,932</p>
                     </c>
                     <c ca="left">
                        <p>aggGGTCAtggTGAC<b><ul>A</ul></b>tta</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SLC6A6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:14,429,604-14,429,622</p>
                     </c>
                     <c ca="left">
                        <p>ctgGGTCActgTG<b><ul>T</ul></b>CCgga</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SIAH2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr3:151,957,126-151,957,144</p>
                     </c>
                     <c ca="left">
                        <p>acaGGTCAccaTGACCtgg</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SNX24</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:122,216,372-122,216,390</p>
                     </c>
                     <c ca="left">
                        <p>cagGGT<b><ul>T</ul></b>AtctT<b><ul>A</ul></b>ACCaac</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PKIB</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:122,985,938-122,985,956</p>
                     </c>
                     <c ca="left">
                        <p>tttGGTCAtgt<b><ul>GGG</ul></b>CCtga</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr6:23,720,183-23,720,201</p>
                     </c>
                     <c ca="left">
                        <p>tcgGGTCAtgcTG<b><ul>C</ul></b>C<b><ul>T</ul></b>ggg</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BTBD9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr6:38,337,561-38,337,579</p>
                     </c>
                     <c ca="left">
                        <p>tggGGTCAtggTGAC<b><ul>T</ul></b>cct</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SHB</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr9:37,943,504-37,943,522</p>
                     </c>
                     <c ca="left">
                        <p>gcaGGT<b><ul>GG</ul></b>ggcTG<b><ul>C</ul></b>C<b><ul>T</ul></b>cca</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SLC38A1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr12:44,881,783-44,881,801</p>
                     </c>
                     <c ca="left">
                        <p>cag<b><ul>A</ul></b>GT<b><ul>G</ul></b>AactTGACCtga</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SLC38A1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr12:44,881,800-44,881,818</p>
                     </c>
                     <c ca="left">
                        <p>gagGGTCAtcc<b><ul>CA</ul></b>ACCcca</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr16:2,781,142-2,781,160</p>
                     </c>
                     <c ca="left">
                        <p>ccaGGTC<b><ul>G</ul></b>gctTG<b><ul>C</ul></b>CCtta</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr16:743,678-743,696</p>
                     </c>
                     <c ca="left">
                        <p>atgGGTCActgTGACCcag</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr17:46,382,536-46,382,554</p>
                     </c>
                     <c ca="left">
                        <p>cccGG<b><ul>A</ul></b>CAcgaTG<b><ul>T</ul></b>CCccc</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TEX14</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr17:54,072,183-54,072,201</p>
                     </c>
                     <c ca="left">
                        <p>cacGGTCAtggTGACCtga</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr20:54,945,262-54,945,280</p>
                     </c>
                     <c ca="left">
                        <p>ggg<b><ul>A</ul></b>G<b><ul>A</ul></b>CAcccTGACCtaa</p>
                     </c>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr2:222,089,422-222,089,440</p>
                     </c>
                     <c ca="left">
                        <p>cagG<b><ul>T</ul></b>TCAaaaTGACGggt</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>STK10</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:171,535,283-171,535,301</p>
                     </c>
                     <c ca="left">
                        <p>tgtGGTC<b><ul>T</ul></b>ctgTG<b><ul>C</ul></b>CCagg</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>KIAA1191</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:175,712,328-175,712,346</p>
                     </c>
                     <c ca="left">
                        <p>agaGG<b><ul>C</ul></b>CAgtcT<b><ul>AC</ul></b>CCtcc</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RASGEF1C</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr5:179,478,929-179,478,947</p>
                     </c>
                     <c ca="left">
                        <p>gtgGG<b><ul>C</ul></b>C<b><ul>G</ul></b>gccTG<b><ul>G</ul></b>CCtgt</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SORCS1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr10:108,692,194-108,692,212</p>
                     </c>
                     <c ca="left">
                        <p>cac<b><ul>A</ul></b>GTCAtgcTGACCcca</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr14:38,648,346-38,648,364</p>
                     </c>
                     <c ca="left">
                        <p>attGGTCAgagTGAC<b><ul>A</ul></b>gaa</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr14:79,636,926-79,636,944</p>
                     </c>
                     <c ca="left">
                        <p>acc<b><ul>T</ul></b>G<b><ul>G</ul></b>CAcgcTGACCcat</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LOC57149</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr16:20,819,825-20,819,843</p>
                     </c>
                     <c ca="left">
                        <p>tggGGTCAcac<b><ul>A</ul></b>G<b><ul>G</ul></b>CCcgt</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chr16:25,535,373-25,535,391</p>
                     </c>
                     <c ca="left">
                        <p>ttaG<b><ul>T</ul></b>TCAcctT<b><ul>A</ul></b>ACCcct</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CEACAM6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>chr19:46,954,305-46,954,323</p>
                     </c>
                     <c ca="left">
                        <p>cagG<b><ul>AC</ul></b>CAggg<b><ul>A</ul></b>GACCtga</p>
                     </c>
                     <c ca="left">
                        <p>Nonbinding</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Shown in bold and underlined are nucleotides that deviate from the consensus core ERE. ChIP, chromatin immunoprecipitation; ER, estrogen receptor; ERE, estrogen response element.</p>
               </tblfn>
            </tbl>
            <p>Currently, three other models have been widely used to predict functional EREs: consensus sequence search (allowing for certain mismatches), TRANSFAC matrices using MATCH <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> search algorithm, and Dragon ERE finder <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The performance of these models (under different settings) is compared with h-ERE in Table <tblr tid="T3">3</tblr>. Although h-ERE was not the most sensitive or the most specific, it offered the best balance between the two criteria. With the interest of having a single performance measure that captures the balance between sensitivity and specificity, harmonic means of the two were computed (see van Rijsbergen <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and Materials and methods). By this measure, h-ERE offers the best balance in performance, even under different stringency settings.</p>
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Performance comparison of various prediction algorithms for ER binding using the independent dataset shown in Table 2</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Prediction algorithm</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Sensitivity</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Specificity</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Harmonic mean</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Fisher's exact test <it>P </it>value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Consensus ERE with &#8804;2 mismatches</p>
                     </c>
                     <c ca="left">
                        <p>94.74%</p>
                     </c>
                     <c ca="left">
                        <p>30%</p>
                     </c>
                     <c ca="left">
                        <p>45.57%</p>
                     </c>
                     <c ca="left">
                        <p>0.104838</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Consensus ERE with &#8804;3 mismatches</p>
                     </c>
                     <c ca="left">
                        <p>94.74%</p>
                     </c>
                     <c ca="left">
                        <p>0%</p>
                     </c>
                     <c ca="left">
                        <p>0.00%</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dragon ERE finder v2.0</p>
                     </c>
                     <c ca="left">
                        <p>68.42%</p>
                     </c>
                     <c ca="left">
                        <p>40%</p>
                     </c>
                     <c ca="left">
                        <p>50.49%</p>
                     </c>
                     <c ca="left">
                        <p>0.477589</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TFAC 8.1 (min FP)</p>
                     </c>
                     <c ca="left">
                        <p>31.57%</p>
                     </c>
                     <c ca="left">
                        <p>100%</p>
                     </c>
                     <c ca="left">
                        <p>47.99%</p>
                     </c>
                     <c ca="left">
                        <p>0.057117</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TFAC 8.1 (min FN)</p>
                     </c>
                     <c ca="left">
                        <p>78.94%</p>
                     </c>
                     <c ca="left">
                        <p>40%</p>
                     </c>
                     <c ca="left">
                        <p>53.10%</p>
                     </c>
                     <c ca="left">
                        <p>0.255439</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>h-ERE (stringent)</p>
                     </c>
                     <c ca="left">
                        <p>42.10%</p>
                     </c>
                     <c ca="left">
                        <p>90%</p>
                     </c>
                     <c ca="left">
                        <p>57.37%</p>
                     </c>
                     <c ca="left">
                        <p>0.084693</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>h-ERE (medium)</p>
                     </c>
                     <c ca="left">
                        <p>68.42%</p>
                     </c>
                     <c ca="left">
                        <p>70%</p>
                     </c>
                     <c ca="left">
                        <p>69.20%</p>
                     </c>
                     <c ca="left">
                        <p>0.056272</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>h-ERE (relaxed)</p>
                     </c>
                     <c ca="left">
                        <p>73.68%</p>
                     </c>
                     <c ca="left">
                        <p>70%</p>
                     </c>
                     <c ca="left">
                        <p>71.79%</p>
                     </c>
                     <c ca="left">
                        <p>0.03043</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>h-ERE (loose)</p>
                     </c>
                     <c ca="left">
                        <p>84.21%</p>
                     </c>
                     <c ca="left">
                        <p>70%</p>
                     </c>
                     <c ca="left">
                        <p>76.45%</p>
                     </c>
                     <c ca="left">
                        <p>0.006199</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>h-ERE outperformed the other algorithms. ERE, estrogen response element.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Whole-genome predictions of ER-binding sites</p>
            </st>
            <p>In order to assign specific ERE predictions, we constructed a decision tree using binding and nonbinding scores from the PWMs (see Materials and methods). The parameters were selected to minimize error on the classification of the training set. We scanned the human genome (UCSC hg17) using the h-ERE decision tree and detected 38,024 putative sites under the 'stringent' criteria, including 3607 EREs encoded by Alu repeats. To assess further the performance of our predictive algorithm, we randomly selected 60 sites predicted to be ER binders by h-ERE (group 3 sites) and 60 nonbinders (group 0 sites) for further experimental validation by ChIP and qPCR. Of the 120 sites, specific primers for qPCR could be designed for only 64 sites, 44 of which are binders whereas 20 are nonbinders. Fourteen per cent (6/44) of the predicted binding sites were shown to bind ER (more than twofold enrichment over control) whereas no binding was detected in any of the sites classified as nonbinders (0/20), suggesting that the false-negative rate is less than 5%. The low rate of false negatives allows us to demarcate in the human genome the global set of EREs that contain the universe of putative true binding motifs. This suggests that, taking into account the 14% validation rate, there would be 5363 validated ER-binding sites within the global optimized ERE set for the MCF-7 cells, under conditions similar to our experimental setup.</p>
            <p>We then considered how much of the predictions could be attributed to random occurrences simply by chance alone. A series of Monte Carlo simulations were carried out to estimate the false positive rate of h-ERE. One thousand nucleotide sequences 1 megabase (Mbp) long were generated randomly, governed by the empirical single nucleotide distribution of the human genome (UCSC hg17), and were run through h-ERE. The numbers of predicted binders divided by 1 Mbp was reported as the h-ERE false discovery rate per base pair. Taking a conservative estimate of the noise and extrapolating it, for the human genome (about 3 gigabases [Gbp]) about 33,000 (approximately 86%) were estimated to be false positives, and hence approximately 5000 ER-binding sites are present in the human genome.</p>
            <p>Taken together, the convergence of these two analyses suggest that binding site motifs will be subject to statistical noise from random motif generation, but that a consistent number of <it>bona fide </it>binding sites, for the MCF-7 cells and under similar conditions as our experimentations, is likely to exist (about 5000).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this report we describe a combinatorial experimental approach for transcription factor binding site discovery and demonstrate superior performance of the resultant computational model. The experimental strategies presented here address the major problem in binding site modeling, namely the small size of experimental datasets for model training and testing. The unique use of validated nonbinding EREs and examining flanking sequences allowed us to identify a novel feature of the ERE.</p>
         <p>Previous efforts to characterize the ERE have included mutagenesis studies and electrophoretic mobility shift assays or DNase footprinting experiments. For example, Driscoll and colleagues <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> demonstrated that single mutations in the core ERE can greatly disrupt ER binding. Furthermore, they found that changes in the flanking sequences can also either enhance or disrupt binding, depending on corresponding changes in the core ERE. Their experiments examined up to two bases flanking the core ERE, and they found that an A or T in the position immediately flanking the core ERE is important for optimal ER binding. Their observation is supported by the model we present here (Figure <figr fid="F2">2</figr>). In our study we found additional single nucleotide features flanking the consensus ERE that are associated with binding site functionality. In particular, there is a prevalence of guanines in the third position downstream 