<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>gb-2012-13-11-r110</ui>
<ji>1465-6906</ji>
<fm>
<dochead>Research</dochead>
<bibl>
<title><p>Human genes with CpG island promoters have a distinct transcription-associated chromatin organization</p></title>
<aug>
<au id="A1" ca="yes"><snm>Vavouri</snm><fnm>Tanya</fnm><insr iid="I1"/><email>tvavouri@imppc.org</email></au>
<au id="A2" ca="yes"><snm>Lehner</snm><fnm>Ben</fnm><insr iid="I2"/><email>ben.lehner@crg.eu</email></au>
</aug>
<insg>
<ins id="I1"><p>Institute of Predictive and Personalized Medicine of Cancer (IMPPC), Badalona, Barcelona 08916, Spain</p></ins>
<ins id="I2"><p>EMBL-CRG Systems Biology Unit and ICREA, Centre for Genomic Regulation and UPF, Barcelona 08003, Spain</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>11</issue>
<fpage>R110</fpage>
<url>http://genomebiology.com/2012/13/11/R110</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2012-13-11-r110</pubid><pubid idtype="pmpid">23186133</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>5</day><month>7</month><year>2012</year></date></rec><revrec><date><day>15</day><month>10</month><year>2012</year></date></revrec><acc><date><day>27</day><month>11</month><year>2012</year></date></acc><pub><date><day>27</day><month>11</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>Vavouri and Lehner; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>More than 50% of human genes initiate transcription from CpG dinucleotide-rich regions referred to as CpG islands. These genes show differences in their patterns of transcription initiation, and have been reported to have higher levels of some activation-associated chromatin modifications.</p>
</sec>
<sec><st><p>Results</p></st>
<p>Here we report that genes with CpG island promoters have a characteristic transcription-associated chromatin organization. This signature includes high levels of the transcription elongation-associated histone modifications H4K20me1, H2BK5me1 and H3K79me1/2/3 in the 5' end of the gene, depletion of the activation marks H2AK5ac, H3K14ac and H3K23ac immediately downstream of the transcription start site (TSS), and characteristic epigenetic asymmetries around the TSS. The chromosome organization factor CTCF may be bound upstream of RNA polymerase in most active CpG island promoters, and an unstable nucleosome at the TSS may be specifically marked by H4K20me3, the first example of such a modification. H3K36 monomethylation is only detected as enriched in the bodies of active genes that have CpG island promoters. Finally, as expression levels increase, peak modification levels of the histone methylations H3K9me1, H3K4me1, H3K4me2 and H3K27me1 shift further away from the TSS into the gene body.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>These results suggest that active genes with CpG island promoters have a distinct step-like series of modified nucleosomes after the TSS. The identity, positioning, shape and relative ordering of transcription-associated histone modifications differ between genes with and without CpG island promoters. This supports a model where chromatin organization reflects not only transcription activity but also the type of promoter in which transcription initiates.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>More than half of human genes initiate transcription from regions of the genome with an elevated content of CpG dinucleotides and G+C base pairs referred to as 'CpG islands' <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. In contrast to the rest of the genome, where CpG dinucleotides are heavily methylated and so rapidly lost through deamination, CpG sites within promoter CpG islands are normally free from DNA methylation and do not have an elevated mutation rate <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Genes with promoters containing CpG islands (henceforth CpG promoter genes) encode housekeeping genes expressed in all cell types <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> but also include a substantial number of master developmental regulators such as <it>HOX </it>genes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr></abbrgrp>. In contrast, non-CpG promoter genes tend to have more restricted expression patterns and to be expressed later in development during tissue differentiation.</p>
<p>Several lines of evidence suggest that the process of transcription initiation differs in CpG and non-CpG promoters. Systematic identification of the 5' ends of mammalian transcripts revealed that transcription tends to initiate from a broad region in CpG promoters but in a sharp peak in non-CpG promoters <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. CpG promoters also more frequently initiate transcription in both the sense and antisense direction, and produce unstable non-coding RNAs even in the absence of full-length mRNA production <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Further, RNA polymerase II may be constitutively recruited to CpG promoters <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>, with polymerase release being an important point of regulation <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B19">19</abbr></abbrgrp>. CpG promoters are less likely to contain a TATA-box <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and contain fewer specifically located transcription factor binding sites <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
<p>In addition to transcription, chromatin organization has also been reported to differ between CpG and non-CpG promoters. CpG and GC-rich DNA is preferentially bound by CXXC domain proteins that can recruit chromatin-modifying activities, including Cfp1 <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, a subunit of an H3K4me3 methyltransferase complex <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, and KDM2A, a H3K36me2 demethylase <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Consistent with this, unmethylated CpG promoters have higher levels of H3K4me3, a histone modification associated with transcription initiation <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. However, CpG promoters also have higher levels of other modifications associated with transcription activation, such as the histone H3 lysine 4 methylations H3K4me1 and H3K4me2, and the histone variant H2A.Z <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Moreover, it has been reported that GC-rich sequences can recruit the polycomb repressive complex 2 <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. CpG promoters have also been reported to contain a more pronounced nucleosome-depleted region upstream of the start site, despite the fact that nucleosomes have a high intrinsic affinity for G+C and CpG rich DNA <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This distinction between nucleosome-depleted CpG promoters and nucleosome occupied non-CpG promoters is reminiscent of the distinction between two major classes of promoter in budding yeast <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Finally, in efforts to use chromatin modifications to predict the locations of core promoters or gene expression levels, different modifications have sometimes been reported as most useful for genes with and without CpG islands <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. For example, in the models developed by Karlic <it>et al. </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, H4K20me1 and H3K27ac were most frequently employed to predict the expression levels of genes with CpG island promoters, whereas H3K4me3 and H3K79me1 were the modifications most frequently used in models to predict the expression levels of non-CpG island genes.</p>
<p>Chromatin-modifying enzymes can be recruited by elongating polymerase complexes, by sequence-specific DNA-binding proteins, and by non-coding RNAs <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. We hypothesized, therefore, that, beyond the distinctions described above, promoter type could be quite a general influence on the chromatin organization of a gene, including distally, away from the start site. We show here that this is indeed the case, and that genes with CpG island promoters show characteristic transcription-coupled changes in chromatin organization not seen in other genes. In particular, CpG promoter genes show a distinct set of transcription-linked epigenetic transitions within the 5' end of their gene bodies. They also have a different chromatin organization within the promoter region, including a histone modification specifically detected at the initiation site. Our analyses highlight complex differences in the chromatin of human genes with and without CpG islands in their promoters, and are consistent with a model in which there are at least two characteristic ways in which the chromatin of a human gene changes from repression to activation, depending upon the type of promoter in which transcription initiates.</p>
</sec>
<sec><st><p>Results</p></st>
<sec><st><p>Chromatin profile comparison of expression-matched genes with and without CpG islands in their promoters</p></st>
<p>To test whether human genes with CpG island promoters have a distinct chromatin organization, we analyzed the levels of histone modifications and other epigenetic modifications in 1,500 CpG and 1,500 non-CpG promoter genes with precisely matched expression levels (Materials and methods; Figure <figr fid="F1">1a</figr>; Additional files <supplr sid="S1">1</supplr> and <supplr sid="S2">2</supplr>). Transcription is a strong influence on chromatin organization <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B33">33</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp> and so it is necessary to control for expression level when examining additional potential influences. We first analyzed histone modifications previously associated with transcription elongation, that is, enriched within the bodies of expressed genes.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Chromatin profile comparison of expression-matched genes with and without CpG islands in their promoters</p></caption><text>
   <p><b>Chromatin profile comparison of expression-matched genes with and without CpG islands in their promoters</b>. <b>(a) </b>Expression levels of 1,500 expression-matched CpG and non-CpG promoter genes classified as having low, medium and high expression levels (500 genes of each type in each category). <b>(b) </b>Average H3K36me3 profile between 500 bp upstream and 3 kb downstream of the transcription start site (TSS) of genes with or without a CpG promoter and with low, medium or high expression levels (color coded as in panel (a). Lines represent the loess-smoothed average read density shifted by 75 bp to indicate the position of the nucleosome dyad and the grey shading indicates the area within two standard errors of the predicted loess line.</p>
</text><graphic file="gb-2012-13-11-r110-1"/></fig>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Table with gene groups classified according to promoter type and gene expression level</b>. Excel file containing human genes classified according to their promoter type and their expression level in CD4+ T cells and IMR90 cells.</p></text>
<file name="gb-2012-13-11-r110-S1.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S2">
<title><p>Additional file 2</p></title>
<text><p><b>Table with the expression-matched groups of CpG and non-CpG promoter genes</b>. Excel file containing the Ensembl identifiers of genes and the corresponding expression values used to generate all the average chromatin profile plots in the main figures.</p></text>
<file name="gb-2012-13-11-r110-S2.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
<p>Trimethylation of the lysine 3 residue of histone H3 (H3K36me3) is catalyzed by Set2-type methyltransferases and is linked to transcription elongation <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. As shown in Figure <figr fid="F1">1</figr>, CpG and non-CpG promoter genes with matched expression levels have very similar distributions of this modification, with a linear accumulation towards the 3' end of expressed genes, and levels that correlate with expression (Figure <figr fid="F1">1b</figr>). Only at the first nucleosome downstream of the site of initiation do highly expressed non-CpG promoters show a slightly higher level of this histone modification. The similar accumulation of H3K36me3 towards the end of the gene suggests a similar transcription elongation-coupled mechanism of H3K36me3 deposition in both gene classes.</p>
</sec>
<sec><st><p>Transcription elongation-associated histone modifications peak in the 5' end of genes with CpG island promoters</p></st>
<p>In contrast to H3K36me3, however, our analysis revealed that other histone modifications enriched within the bodies of active genes have different distributions in CpG and non-CpG promoter genes. The modification H4K20me1 is catalyzed by SETD8/Pr-Set7 methyltransferases and has been variously linked to gene silencing, gene activation, transcription elongation, and to the early exons of highly expressed genes <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. In expressed genes with CpG promoters, H4K20me1 levels increase rapidly within gene bodies, reaching a peak on nucleosomes approximately 1 to 1.5 kb after the start site (Figure <figr fid="F2">2a</figr>). In contrast, in non-CpG promoter genes, H4K20me1 levels are low on promoter-proximal nucleosomes, increasing only gradually to a plateau approximately 2 to 2.5 kb after the start site (Figure <figr fid="F2">2a</figr>). A distinct peak is not observed, and non-CpG promoter genes with medium expression only have very low levels of H4K20me1 (Figure <figr fid="F2">2a</figr>).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Sharp increases in H4K20me1, H2BK5me1, and H3K79me1 modification levels in the bodies of CpG promoter genes</p></caption><text>
   <p><b>Sharp increases in H4K20me1, H2BK5me1, and H3K79me1 modification levels in the bodies of CpG promoter genes</b>. <b>(a-c) </b>Average profiles of H4K20me1 (a), H2BK5me1 (b) and H3K79me1 (c) for expression-matched groups of CpG and non-CpG promoter genes (color-coded as in Figure 1a). Arrows indicate modification peaks unique to CpG promoter genes. TSS, transcription start site.</p>
</text><graphic file="gb-2012-13-11-r110-2"/></fig>
<p>Levels of the transcription elongation-linked modifications H2BK5me1 and H3K79me1/2/3 also differ between the two gene classes, and in a manner that is similar to H4K20me1. Modification levels peak on nucleosomes 750 bp to 1 kb downstream of CpG promoter genes (Figure <figr fid="F2">2b, c</figr>). In contrast, they show only weak monotonic enrichment in the bodies of genes without CpG promoters (Figure <figr fid="F2">2b, c</figr>; Additional file <supplr sid="S3">3</supplr>).</p>
<suppl id="S3">
<title><p>Additional file 3</p></title>
<text><p><b>Average background subtracted chromatin profiles of all histone modifications and H2A.Z at expression-matched CpG and non-CpG promoter genes</b>. Grey shading indicates the 95% confidence interval of the calculated mean.</p></text>
<file name="gb-2012-13-11-r110-S3.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>The exon density (Additional file <supplr sid="S4">4</supplr>), nucleosome density (see below) and DNA methylation levels (Additional file <supplr sid="S5">5</supplr>) downstream of highly expressed CpG and non-CpG promoters are very similar, and so cannot account for these differences in the deposition of transcription elongation marks (Additional file <supplr sid="S3">3</supplr>). Also, the differences in the chromatin profiles remain largely unchanged when a different CpG island definition is used (Additional file <supplr sid="S6">6</supplr> using CpG islands defined by Hackenberg <it>et al. </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and Additional file <supplr sid="S7">7</supplr> using the promoter classification of Weber <it>et al. </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>), when we remove all genes with more than one transcription start site (Additional file <supplr sid="S8">8</supplr>), or even when we use an independent set of chromatin data from IMR90 cells (Additional file <supplr sid="S9">9</supplr>). Rather, genome-wide analysis suggests differences in the recruitment or activation of histone modifying enzymes during the early stages of transcription elongation in these two gene classes.</p>
<suppl id="S4">
<title><p>Additional file 4</p></title>
<text><p><b>Exon density profiles of the expression-matched CpG and non-CpG promoter genes</b>.</p></text>
<file name="gb-2012-13-11-r110-S4.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S5">
<title><p>Additional file 5</p></title>
<text><p><b>DNA methylation profiles at expression-matched CpG and non-CpG promoter genes</b>.</p></text>
<file name="gb-2012-13-11-r110-S5.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S6">
<title><p>Additional file 6</p></title>
<text><p><b>Chromatin profiles of genes annotated according to whether they overlap CpG islands defined by the CpGcluster program </b><abbrgrp><abbr bid="B48"><b>48</b></abbr></abbrgrp>.</p></text>
<file name="gb-2012-13-11-r110-S6.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S7">
<title><p>Additional file 7</p></title>
<text><p><b>Chromatin profiles of genes grouped into three promoter types: high CpG promoters, intermediate CpG promoters and low CpG promoters</b>. For this supplementary figure, we annotated genes based on their promoter type using the definition of Weber <it>et al. </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We sampled 300 genes from each expression level and each promoter type to generate the average chromatin profile plots.</p></text>
<file name="gb-2012-13-11-r110-S7.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S8">
<title><p>Additional file 8</p></title>
<text><p><b>Chromatin profiles of genes that contain a single transcription start site</b>. For this supplementary figure, we removed all genes with more than one transcription start site. We then split the genes into the three expression levels and sampled 300 genes from each expression level and each promoter type to generate the average chromatin profile plots. Promoters are annotated according to UCSC downloaded CpG islands <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p></text>
<file name="gb-2012-13-11-r110-S8.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S9">
<title><p>Additional file 9</p></title>
<text><p><b>Chromatin profiles of expression-matched CpG and non-CpG promoter genes in IMR90 cells</b>. Note that the CD4+ T-cell chromatin profiles shown in the main paper and the IMR90 chromatin datasets shown here were generated with different chromatin immunoprecipitation protocols. In the case of CD4+ T cells, nucleosomes were isolated by micrococcal nuclease digestion before immunoprecipitation. In the case of IMR90 cells, chromatin was sonicated before immunoprecipitation. These experimental differences may account for some of the differences in the CD4+ T-cell chromatin profiles and the IMR90 chromatin profiles. The GEO file accession numbers for the datasets used here are GSE2672 (a), GSM521890 (b), GSM521915 (c), GSM521904 (d), GSM752986 (e), GSM521899 (f), GSM521895 (g), GSM521866 (h), GSM521881 (i), GSM521885 (j), GSM469975 (k). Promoters are annotated according to UCSC downloaded CpG islands <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p></text>
<file name="gb-2012-13-11-r110-S9.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Transcription-coupled shifts of histone modification peaks in the 5' end of CpG promoter genes</p></st>
<p>A second class of histone modifications revealed by our analysis is a set of transcription-associated histone methylations that are enriched in genes with CpG island promoters, but for which the location of the most modified nucleosomes differs depending upon the expression level (Figure <figr fid="F3">3</figr>). As for the previously described modifications, the levels of the methylations H3K9me1, H3K4me2, H3K4me1 and H3K27me1 increase with expression. However, in genes with higher expression, maximum modification levels are found on nucleosomes located further into the body of a gene (Figure <figr fid="F3">3a-h</figr>). Indeed, the +1 nucleosomes of highly expressed genes have lower levels of these histone methylations than the +1 nucleosomes of lowly expressed genes (Figure <figr fid="F3">3i-l</figr>). To our knowledge, this shifting of histone methylation peaks into gene bodies has not been previously noted.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Transcription-coupled shifts of histone modification peaks enriched in the 5' end of CpG promoter genes</p></caption><text>
   <p><b>Transcription-coupled shifts of histone modification peaks enriched in the 5' end of CpG promoter genes</b>. <b>(a-h) </b>Average profiles of expression-matched CpG and non-CpG genes (a, c, e, g) and heat maps of all genes (b, d, f, h) for H3K9me1 (a, b), H3K4me2 (c, d), H3K4me1 (e,f) and H3K27me1 (g, h). In the heatmaps, each row represents one gene and each column a single nucleotide position. <b>(i-l) </b>Plots of the correlation between expression level and peak modification position for H3K9me1 (i), H3K4me2 (j), H3K4me1 (k), H3K27me1 (l) at CpG and non-CpG promoter genes. The x-axis indicates the first position with the highest histone modification level within 2 kb of the gene transcription start site (TSS). The y-axis indicates the expression level ranking of the gene. For a direct comparison, only the expression-matched subsets of genes with low, medium and high expression are plotted here. The spearman rank correlation values are shown below the scatter plots (all <it>P</it>-values &lt; 10<sup>-8</sup>).</p>
</text><graphic file="gb-2012-13-11-r110-3"/></fig>
</sec>
<sec><st><p>Monomethylation of H3K36 is enriched in the bodies of genes with CpG island promoters</p></st>
<p>In contrast to H3K36me3, which is located towards the 3' end of active gene bodies, monomethylation of histone H3 lysine 36 (H3K36me1) is detected in the 5' end of highly expressed genes <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Surprisingly, our analysis suggests that H3K36 monomethylation is only enriched in the bodies of active genes that have CpG islands in their promoters (Figure <figr fid="F4">4a</figr>). Enrichment is detected on the fourth to eighth nucleosomes downstream of the start site, and then quickly returns to background levels within 2 kb (Figure <figr fid="F4">4a</figr>). In contrast, no enrichment is detected in highly expressed non-CpG promoter genes (Figure <figr fid="F4">4a</figr>). This suggests that the deposition of this modification may depend on the presence of a CpG island.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Some transcription-associated histone modifications are depleted in the 5' ends of CpG promoter genes</p></caption><text>
   <p><b>Some transcription-associated histone modifications are depleted in the 5' ends of CpG promoter genes</b>. <b>(a-d) </b>Average profiles of expression-matched CpG and non-CpG genes for H3K36me1 (a), H2AK5ac (b), H3K14ac (c), and H3K23ac (d). Arrows indicate modification peaks unique to CpG promoter genes. TSS, transcription start site.</p>
</text><graphic file="gb-2012-13-11-r110-4"/></fig>
</sec>
<sec><st><p>Some transcription-associated histone modifications are depleted in the 5' ends of CpG promoter genes</p></st>
<p>In contrast to the histone modifications described above that are shifted closer to the transcription start site in CpG promoter genes (H4K20me1, H2BK5me1, H3K79me1/2/3), show stronger peaks that shift in location in CpG promoter genes (H3K9me1, H3K4me1, H3K4me2, H3K27me1), or are only enriched in CpG promoter genes (H3K36me1), a fourth set of modifications consists of transcription-associated modifications that are actually depleted in the 5' end of CpG island promoter genes. In highly expressed genes with CpG promoters, the modifications H2AK5ac, H3K14ac and H3K23ac peak approximately 750 bp after the initiation site (Figure <figr fid="F4">4b-d</figr>). In contrast, in non-CpG genes, these modifications either peak immediately after the start site (Figure <figr fid="F4">4c, d</figr>) or do not show a clear peak (Figure <figr fid="F4">4b</figr>). Thus, some transcription activation-associated histone modifications are depleted proximal to the start site of CpG promoter genes.</p>
</sec>
<sec><st><p>A characteristic series of histone modification peaks means that the relative positions of histone modifications differ between genes with and without CpG island promoters</p></st>
<p>In expressed genes with CpG islands in their promoters, there is therefore a series of characteristic transitions in transcription-associated histone modifications in the region between approximately 0.5 and 2 kb after the initiation site. In contrast, in non-CpG genes, the same modifications gradually increase more distally within the gene body (H4K20me1, H2BK5me1, H3K79me1/2/3), show weaker or no enrichment within the gene body (H3K9me1, H3K4me1, H3K4me2, H3K27me1, H3K36me1), or are enriched from immediately after the initiation site (H2AK5ac, H3K14ac, H3K23ac). Thus, the identity, positioning, shape and relative ordering of transcription-associated histone modifications all differ between genes with and without CpG island promoters (Figures <figr fid="F2">2</figr> to <figr fid="F4">4</figr>).</p>
</sec>
<sec><st><p>Intrinsic binding preferences only predict nucleosome occupancy in non-expressed, non-CpG island promoters</p></st>
<p>We next compared overall levels of nucleosome occupancy within CpG and non-CpG promoter genes, as revealed by micrococcal nuclease digestion <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> (Figure <figr fid="F5">5</figr>). For highly expressed genes, patterns of occupancy are similar in the two gene classes: a characteristic nucleosome-depleted region is observed upstream of the transcription start site, and an array of well-positioned nucleosomes within the 5' end of the gene (Figure <figr fid="F5">5a</figr>). From an analysis of all genes it was previously proposed that nucleosome depletion upstream of transcription start sites is independent of gene expression level <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Our analysis showed that this is indeed the case for CpG promoters (Figure <figr fid="F5">5a</figr>), but for non-CpG promoters there is a clear association between a nucleosome-depleted region and mRNA expression levels (Figure <figr fid="F5">5a</figr>). That is, CpG promoters have a constitutive nucleosome-depleted region, but non-CpG promoters do not.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>CpG promoters have a constitutive nucleosome-depleted region and, when active, an unstable H4K20me3-modified nucleosome at the start site</p></caption><text>
   <p><b>CpG promoters have a constitutive nucleosome-depleted region and, when active, an unstable H4K20me3-modified nucleosome at the start site</b>. <b>(a, b) </b>Highly expressed CpG promoter and non-CpG promoter genes have an upstream nucleosome-depleted region and well-positioned nucleosomes within the gene body (a, top). However, CpG promoters also contain a nucleosome-depleted region when repressed (a, bottom left) whereas, in inactive non-CpG promoters (a, bottom right), nucleosome occupancy more closely resembles that predicted by the intrinsic binding preferences of nucleosomes for DNA (b). <b>(c, d) </b>In expressed CpG promoter genes the transcription initiation site within the nucleosome-depleted region (a) is precisely marked by H4K20me3.</p>
</text><graphic file="gb-2012-13-11-r110-5"/></fig>
<p>Moreover, comparing the nucleosome occupancy detected in human promoters to that predicted by the intrinsic binding preferences of nucleosomes for DNA <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> revealed that only in non-expressed, non-CpG promoters is nucleosome occupancy actually reflecting the intrinsic binding preferences of nucleosomes for DNA (Figure <figr fid="F5">5a, b</figr>). Thus, in contrast to the situation in transcriptionally quiescent sperm <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, in somatic cells influences beyond the affinity of the DNA for nucleosomes must be important for determining nucleosome occupancy in most active and repressed human promoters.</p>
</sec>
<sec><st><p>Evidence for an unstable H4K20me3-modified nucleosome at the start site in active CpG island promoters</p></st>
<p>Consistent with the overall nucleosome depletion, most histone modifications are also depleted at the transcription start sites of CpG promoter genes and at the start sites of highly expressed non-CpG promoter genes (Figures <figr fid="F3">3</figr>, <figr fid="F4">4</figr> and <figr fid="F6">6</figr>). However, our analysis revealed one notable exception to this: the modification H4K20me3 is detected precisely at the transcription start site in active CpG promoters (Figure <figr fid="F5">5c</figr>). This signal is weak but detected across many initiation sites (Figure <figr fid="F5">5d</figr>). With the current data, we cannot exclude the possibility that the signal is coming from a subset of cells within the population. Nevertheless, a similar peak is not seen for other modifications (Additional file <supplr sid="S3">3</supplr>). Previously, H4K20me3 has been linked to transcriptional pausing <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, to heterochromatin <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, and to the body of KRAB and zinc finger genes <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B46">46</abbr></abbrgrp>. Our analysis suggests that the modification is also specifically enriched on an unstable nucleosome precisely positioned at the transcription start site in active CpG island promoters. To our knowledge, this is the first evidence for a specific histone post-translational modification precisely marking transcription start sites.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Asymmetries in promoter proximal histone modifications around the transcription start site</p></caption><text>
   <p><b>Asymmetries in promoter proximal histone modifications around the transcription start site</b>. <b>(a,b) </b>H2A.Z <b>(a) and H4K5ac (b</b>) have higher modification levels upstream of the start site of CpG promoter genes. <b>(c) </b>The asymmetry of each promoter proximal modification is quantified for all highly expressed CpG (red) and non-CpG (blue) promoter genes. Here the number of modified histone reads between the transcription start site (TSS) and 500 bp upstream of the start site is divided by the number of reads between the start site and 500 bp downstream. Box plots indicate the range and 25, 50 and 75 percentile of the asymmetry score for the 500 highly expressed genes. Significant differences in the distributions between CpG and non-CpG promoter genes are indicated by an asterisk (Wilcoxon signed rank test, <it>P </it>&lt; 0.05).</p>
</text><graphic file="gb-2012-13-11-r110-6"/></fig>
</sec>
<sec><st><p>Transcription-associated epigenetic asymmetries around start sites</p></st>
<p>Many epigenetic modifications, particularly histone acetylations, are enriched around the start sites of actively expressed genes (Additional file <supplr sid="S3">3</supplr>). One of these start site-proximal modifications, H3K4me3, was previously reported to show two distinct peaks in the forward and reverse directions, interpreted as corresponding to two divergent sites of RNA polymerase II initiation <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Our expression-matched gene sets show that these dual peaks are observed for many promoter-associated modifications, and for both CpG and non-CpG promoter genes (Figure <figr fid="F6">6a, b</figr>). Interestingly, however, the symmetry of these dual peaks about the start site can differ between CpG and non-CpG promoters. For example, the modifications H2A.Z and H4K5ac (and to a lesser extent H4K8ac, H3K36ac, H3K27ac, H2BK5ac, H3K18ac, H3K9ac and H3K4me3) actually have higher levels upstream of the start site (in the antisense orientation) than downstream (in the sense orientation) specifically in promoters with CpG islands (Figure <figr fid="F6">6</figr>). This difference is characteristic comparing across promoters (Figure <figr fid="F6">6</figr>) and again points to a different transcription-associated chromatin organization in genes with CpG island promoters.</p>
</sec>
<sec><st><p>The chromosome organization factor CTCF may be bound immediately upstream of RNA polymerase II in most active CpG island promoters</p></st>
<p>Finally, we examined the association of the chromosome organization CCTC-binding factor (CTCF) with CpG and non-CpG promoter genes, and how this association relates to gene expression. We chose to analyze CTCF because up to a quarter of all binding sites for CTCF have been reported to occur close to gene promoters <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B54">54</abbr></abbrgrp>. Surprisingly, CTCF is detected at high levels immediately upstream of the start site in very many CpG island promoters (Figure <figr fid="F7">7a, b</figr>; Additional file <supplr sid="S10">10</supplr>), and at similar levels in genes with both medium and high expression (and medium and high levels of RNA polymerase II; Figure <figr fid="F7">7c, d</figr>). This suggests the intriguing possibility that CTCF binding might be part of the general architecture of an active CpG island promoter (see Discussion).</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>CTCF is bound immediately upstream of RNA polymerase in active CpG island promoters</p></caption><text>
   <p><b>CTCF is bound immediately upstream of RNA polymerase in active CpG island promoters</b>. <b>(a-d) </b>CTCF is bound immediately upstream of the initiation site, with similar recruitment in CpG genes with high and medium expression (a, b) and with high and low RNA polymerase II recruitment (c, d). TSS, transcription start site.</p>
</text><graphic file="gb-2012-13-11-r110-7"/></fig>
<suppl id="S10">
<title><p>Additional file 10</p></title>
<text><p><b>CTCF binds CpG promoters more frequently than non-CpG promoters</b>. <b>(a-f) </b>CpG promoters are more frequently bound by CTCF than non-CpG promoters. The enrichment of CTCF at CpG promoters is independent of the definition of overlap (a, c, e). All comparisons have a chi-squared test <it>P</it>-value &lt; 2.2e<sup>-16</sup>. To control for the correlation between CTCF binding and expression level, we split the two groups of genes into ten equally sized expression bins and counted the fraction of promoters overlapping CTCF binding sites in each expression bin (b, d, f). Error bars represent 95% confidence intervals of binomial proportions. <b>(g-i) </b>CTCF binding sites that overlap CpG promoters have similar levels of CTCF as intergenic CTCF binding sites. Here we show CTCF levels at all CpG promoters (g), CTCF levels at an equal number of distal CTCF binding sites (h) and CTCF levels at all intergenic CTCF binding sites (i). The CTCF profiles shown in (g-i) are centered on the CTCF peak mid-point.</p></text>
<file name="gb-2012-13-11-r110-S10.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
</sec>
<sec><st><p>Discussion</p></st>
<p>We have presented here evidence that, depending upon the type of promoter in which transcription initiates, human genes show two distinct patterns of transcription-coupled changes in chromatin organization. Transcription from both CpG and non-CpG promoters is associated with a set of histone-modification transitions around the start site and into the gene body, but the identity, shape and ordering of these modifications differs between the two gene classes. These differences are summarized in Figure <figr fid="F8">8</figr>.</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Human genes with CpG promoters show distinct transcription-coupled changes in chromatin</p></caption><text>
   <p><b>Human genes with CpG promoters show distinct transcription-coupled changes in chromatin</b>. <b>(a) </b>Multiple transcription elongation-associated histone modifications are detected on nucleosomes closer to the transcription start site (TSS) in genes with CpG islands in their promoters (orange) compared to other genes (green). <b>(b) </b>In contrast, other transcription-associated histone modifications are excluded from a promoter proximal region. <b>(c) </b>H3K36me1 is only detected in the bodies of genes with CpG promoters. <b>(d) </b>Additional transcription-associated modifications are enriched in CpG promoter genes and shift in location away from the start as expression levels increase. <b>(e) </b>Taken together, this means that genes with CpG promoters show a characteristic series of modified nucleosomes upstream, downstream, and at the transcription start site.</p>
</text><graphic file="gb-2012-13-11-r110-8"/></fig>
<p>In the repressed state, CpG promoters are distinguished by a nucleosome-depleted region. Upon activation, this nucleosome-depleted region shows evidence of containing a nucleosome specifically modified by H4K20me3 at the initiation site, and CTCF binds immediately upstream of RNA polymerase. This suggests that CTCF could be part of the basic architecture of CpG island promoters, perhaps contributing to their organization into 'active chromatin hubs' or 'transcription factories' <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>.</p>
<p>Multiple transcription elongation-coupled modifications (H4K20me1, H2BK5me1, H3K79me1/2/3) occur on nucleosomes closer to the transcription start site in genes with CpG promoters, suggesting an earlier and sharper transition in transcription elongation complexes. In contrast, a later elongation complex transition, as reflected in the deposition of H3K36me3, appears to occur independently of promoter type. H3K36me1 appears specific to CpG promoter genes, and H3K9me1, H3K4me1, H3K4me2 and H3K27me1 show stronger transcription-associated peaks in the 5' ends of these genes. Uniquely, the same modifications also shift into the bodies of genes as expression levels increase: the most modified nucleosomes are different in genes with high and low expression levels. This might, perhaps, reflect incompatibility with other modifications detected in higher levels close to the start site in highly active genes. Finally, other transcription-coupled modifications (H2AK5ac, H3K14ac, H3K23ac) are depleted in the very 5' end of CpG promoter genes. It is possible that enzymes removing these modifications are directly or indirectly recruited to CpG islands, as has been shown for the CXXC domain-containing H3K36me2 demethylase KDM2A <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Taken together, therefore, we propose that active CpG promoter genes have a characteristic 'step-like' series of transitions in the modifications that nucleosomes carry upstream, downstream, and at the transcription initiation site, extending about 2 kb into a gene (Figure <figr fid="F8">8</figr>). Several potential molecular mechanisms could contribute to this characteristic chromatin signature. Possibilities include the direct recruitment of histone-modifying enzymes to CpG islands, a difference in the composition of RNA polymerase complexes loading in CpG island promoters, or altered dynamics of polymerase, such as delayed release from the promoter and different elongation speeds. CpG promoters often transcribe non-coding RNAs, and it is possible that some chromatin differences relate to the recruitment of chromatin-modifying enzymes by these RNAs <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. The interplay between different modifications is also likely to be important: both the sequential recruitment of 'reader' and 'writer' proteins away from the start site and the incompatibility between different modifications could contribute to the establishment of the characteristic nucleosome series. For example, increased acetylase recruitment at the start site may contribute to the 3' shift in histone methylations in highly expressed CpG promoter genes. In future work, biochemical studies will be required to investigate these possible molecular mechanisms. Together with previous work, however, our results suggest that the chromatin organization of a human gene reflects not just the level of expression, but also the type of promoter in which transcription initiates.</p>
</sec>
<sec><st><p>Materials and methods</p></st>
<sec><st><p>Gene start site and CpG island annotations</p></st>
<p>Human protein-coding genes were retrieved from Ensembl release 54 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. For each gene we considered only the most 5' transcription start site and we removed genes less than 3 kb long to avoid the inclusion of non-genic regions downstream of a gene. To avoid analyzing upstream regions that overlap another gene, we also removed all genes that have a promoter within 500 bp of another gene. We retrieved CpG islands from the UCSC genome browser <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B60">60</abbr></abbrgrp>. We considered a gene as having a CpG-island promoter when its first transcription start site overlaps a UCSC CpG island. Removing genes with more than one transcription start site did not change any of the observations (see Additional file <supplr sid="S8">8</supplr>). All chromatin profiles were repeated using the alternative CpG island definition from <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> (shown in Additional file <supplr sid="S6">6</supplr>). Further, all chromatin profiles were repeated using the promoter definition by <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> (shown in Additional file <supplr sid="S7">7</supplr>). Following the original definition, to annotate promoters as 'high CpG promoters' (HCPs) we scanned the region from -1,200 bp to +300 bp of the transcription start site for a 500-bp window with CpG observed over expected ratio of &gt; 0.75 and GC content &gt; 55. Promoters with all windows with CpG observed over expected ratio &#8804; 0.48 were annotated as 'low CpG promoters' (LCPs). The rest of the promoters were annotated as 'intermediate CpG promoters' (ICPs).</p>
</sec>
<sec><st><p>Gene expression data</p></st>
<p>We retrieved MAS5 normalized mRNA expression data for CD4+ T cells from Schones <it>et al. </it><abbrgrp><abbr bid="B49">49</abbr></abbrgrp> (Gene Expression Omnibus (GEO) accession GSE10437), mapping U133-PLUS-2 probes to genes using Ensembl. Probes matching multiple genes were discarded. In total, 16,781 protein-coding genes had annotated expression levels. Genes were ranked according to their expression level, using the (replicate-averaged) value of the most sensitive probe. We then split all genes into three equally sized groups of genes according to their expression level (low, middle and high expression). From these we randomly sampled 500 low, 500 intermediate and 500 high expression non-CpG promoter genes. For each non-CpG promoter gene we then identified a CpG promoter gene with a very similar expression level (absolute difference between log<sub>e </sub>expression of non-CpG promoter gene and log<sub>e </sub>expression of CpG promoter gene &#8804; 0.1). These three expression-matched sets of CpG and non-CpG promoter genes were used to generate average chromatin profile plots. The entire sets of high, intermediate and low expression genes were used to generate chromatin profile heatmaps (the number of genes in each class are included in Additional file <supplr sid="S1">1</supplr>). The expression-matched sets of genes and their annotations are included in Additional file <supplr sid="S2">2</supplr>. A small number of genes with extreme expression levels for which we could not find a similarly expressed gene with a different promoter type were removed. We repeated the analysis in exactly the same way for IMR90 cells using microarray gene expression data from Kim <it>et al. </it><abbrgrp><abbr bid="B36">36</abbr></abbrgrp> (GEO accession GSE2672).</p>
</sec>
<sec><st><p>Chromatin profiles</p></st>
<p>We retrieved the mapped sequenced reads of nucleosome fragments <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> and ChIP-Seq mapped read data for H2A.Z, 20 histone methylations <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and 18 histone acetylations <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Similarly, we analyzed DNA methylation data from the same cell-type <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. These datasets contained all reads that match the genome in a unique position with up to two mismatches. To minimize sequence amplification bias, we removed identical reads. We shifted the start position of the reads by 75 bp in the direction of sequencing (75 bp is approximately half of the length of the isolated DNA fragments), this way transforming the read start positions to nucleosome dyad positions. All datasets were rescaled to 10 million uniquely mapped nucleosome fragments. To generate the average chromatin profiles shown in the figures, we counted the number of dyads that fall at each position along the region surrounding the gene start site. Smoothed lines were generated based on the per-base-pair averaged position-shifted read count using the loess regression function in R (with 180 bp span) <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. The predict.loess R function was used for the calculation of 95% confidence intervals. For the background subtracted chromatin profiles included in Additional file <supplr sid="S3">3</supplr> we used a 75-bp window sliding by 1 bp and calculated the difference between the number of shifted reads from the histone modification (or H2A.Z) and the number of shifted reads from the nucleosome occupancy. At each position with respect to the transcription start site we then calculated the mean and the standard error of the background-subtracted values assuming a Normal distribution. Heatmaps were generated using Java Tree View 1.1.5r2 <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. We repeated all chromatin profiles using data from a fetal lung fibroblast cell line (IMR90) generated by the NIH Roadmap Epigenomics Project <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>. We downloaded the mapped reads provided as BED files. Because these reads were mapped to human genome version hg19, we converted all gene promoters from hg18 to hg19 using the LiftOver tool. These profiles are shown in Additional file <supplr sid="S9">9</supplr>. The accession identifiers of the samples used for these profiles are included in the figure legend. Regions of statistically significant CTCF binding in CD4+ T cells (used in Additional file <supplr sid="S10">10</supplr>), based on the data from Barski <it>et al. </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, were retrieved from Ensembl (regulatory build of Ensembl release 68). We defined distal CTCF binding sites as those not overlapping any annotated Ensembl gene. The coordinates of CTCF peaks were converted from human genome assembly hg19 to hg18 using the liftOver tool <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Sequence properties</p></st>
<p>Intrinsic nucleosome binding preference calculations along the human genome were predicted by Kaplan <it>et al. </it>and downloaded from the authors' website <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. The average nucleosome affinity model score around the transcription start site of each gene set is shown in Figure <figr fid="F5">5b</figr>. We also carried out the same analysis using the predicted probability that a nucleosome will be formed at each genomic position <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, which gave very similar results (data not shown). Similarly, as a control, we also calculated the G+C content profile at &#177; 3 kb around the gene start of each group (Additional file <supplr sid="S11">11</supplr>).</p>
<suppl id="S11">
<title><p>Additional file 11</p></title>
<text><p><b>GC content around CpG and non-CpG promoter genes</b>.</p></text>
<file name="gb-2012-13-11-r110-S11.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Histone modification peak positions and asymmetry around the gene start</p></st>
<p>To compare the asymmetry of a histone modification around the start of CpG promoter and non-CpG promoter genes, for each highly expressed gene we calculated the number of position-shifted reads that map to the 500 bp upstream of the gene start and divided by the total number of position-shifted reads that map within 500 bp both upstream and downstream of the gene start. We then compared the distributions of these 'asymmetry scores' at highly expressed CpG and non-CpG promoter genes using the Wilcoxon rank sum test.</p>
</sec>
</sec>
<sec><st><p>Abbreviations</p></st>
<p>bp: base pair; GEO: Gene Expression Omnibus.</p>
</sec>
<sec><st><p>Competing interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>TV performed all analyses. TV and BL designed analyses and wrote the manuscript. Both authors read and approved the manuscript for publication.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>TV is funded by MICINN grant BFU2011-30246, Ramon y Cajal grant RYC-2010-07114, European Commission Framework 7 European Re-integration grant PERG08-GA-2010-276741, and by the Institute of Predictive and Personalized Medicine of Cancer. BL is funded by an ERC Starting Grant, ERASysBio+ ERANET, MICINN grant BFU2008-00365, AGAUR, the EMBO Young Investigator Program, European Commission Framework 7 integrated project 4DCellFate, and by the EMBL-CRG Systems Biology Program. We thank Salvador Aznar Benitah and Luciano di Croce for comments on the manuscript.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>CpG islands - 'A rough guide'.</p></title><aug><au><snm>Illingworth</snm><fnm>RS</fnm></au><au><snm>Bird</snm><fnm>AP</fnm></au></aug><source>FEBS Lett</source><pubdate>2009</pubdate><volume>583</volume><fpage>1713</fpage><lpage>1720</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.febslet.2009.04.012</pubid><pubid idtype="pmpid" link="fulltext">19376112</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Genetics and epigenetics: stability and plasticity during cellular differentiation.</p></title><aug><au><snm>Mohn</snm><fnm>F</fnm></au><au><snm>Schubeler</snm><fnm>D</fnm></au></aug><source>Trends Genet</source><pubdate>2009</pubdate><volume>25</volume><fpage>129</fpage><lpage>136</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2008.12.005</pubid><pubid idtype="pmpid" link="fulltext">19185382</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>DNA methylation and the frequency of CpG in animal DNA.</p></title><aug><au><snm>Bird</snm><fnm>AP</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1980</pubdate><volume>8</volume><fpage>1499</fpage><lpage>1504</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/8.7.1499</pubid><pubid idtype="pmcid">324012</pubid><pubid idtype="pmpid" link="fulltext">6253938</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>A fraction of the mouse genome that is derived from islands of nonmethylated, CpG-rich DNA.</p></title><aug><au><snm>Bird</snm><fnm>A</fnm></au><au><snm>Taggart</snm><fnm>M</fnm></au><au><snm>Frommer</snm><fnm>M</fnm></au><au><snm>Miller</snm><fnm>OJ</fnm></au><au><snm>Macleod</snm><fnm>D</fnm></au></aug><source>Cell</source><pubdate>1985</pubdate><volume>40</volume><fpage>91</fpage><lpage>99</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(85)90312-5</pubid><pubid idtype="pmpid" link="fulltext">2981636</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>CpG islands in vertebrate genomes.</p></title><aug><au><snm>Gardiner-Garden</snm><fnm>M</fnm></au><au><snm>Frommer</snm><fnm>M</fnm></au></aug><source>J Mol Biol</source><pubdate>1987</pubdate><volume>196</volume><fpage>261</fpage><lpage>282</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0022-2836(87)90689-9</pubid><pubid idtype="pmpid" link="fulltext">3656447</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters.</p></title><aug><au><snm>Saxonov</snm><fnm>S</fnm></au><au><snm>Berg</snm><fnm>P</fnm></au><au><snm>Brutlag</snm><fnm>DL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>1412</fpage><lpage>1417</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0510310103</pubid><pubid idtype="pmcid">1345710</pubid><pubid idtype="pmpid" link="fulltext">16432200</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Primate CpG islands are maintained by heterogeneous evolutionary regimes involving minimal selection.</p></title><aug><au><snm>Cohen</snm><fnm>NM</fnm></au><au><snm>Kenigsberg</snm><fnm>E</fnm></au><au><snm>Tanay</snm><fnm>A</fnm></au></aug><source>Cell</source><pubdate>2011</pubdate><volume>145</volume><fpage>773</fpage><lpage>786</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2011.04.024</pubid><pubid idtype="pmpid" link="fulltext">21620139</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>CpG islands as gene markers in the human genome.</p></title><aug><au><snm>Larsen</snm><fnm>F</fnm></au><au><snm>Gundersen</snm><fnm>G</fnm></au><au><snm>Lopez</snm><fnm>R</fnm></au><au><snm>Prydz</snm><fnm>H</fnm></au></aug><source>Genomics</source><pubdate>1992</pubdate><volume>13</volume><fpage>1095</fpage><lpage>1107</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0888-7543(92)90024-M</pubid><pubid idtype="pmpid">1505946</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Determinants of CpG islands: expression in early embryo and isochore structure.</p></title><aug><au><snm>Ponger</snm><fnm>L</fnm></au><au><snm>Duret</snm><fnm>L</fnm></au><au><snm>Mouchiroud</snm><fnm>D</fnm></au></aug><source>Genome Res</source><pubdate>2001</pubdate><volume>11</volume><fpage>1854</fpage><lpage>1860</lpage><xrefbib><pubidlist><pubid idtype="pmcid">311164</pubid><pubid idtype="pmpid" link="fulltext">11691850</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>On the nature of human housekeeping genes.</p></title><aug><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>He</snm><fnm>F</fnm></au><au><snm>Hu</snm><fnm>S</fnm></au><au><snm>Yu</snm><fnm>J</fnm></au></aug><source>Trends Genet</source><pubdate>2008</pubdate><volume>24</volume><fpage>481</fpage><lpage>484</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2008.08.004</pubid><pubid idtype="pmpid" link="fulltext">18786740</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.</p></title><aug><au><snm>Ramskold</snm><fnm>D</fnm></au><au><snm>Wang</snm><fnm>ET</fnm></au><au><snm>Burge</snm><fnm>CB</fnm></au><au><snm>Sandberg</snm><fnm>R</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000598</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000598</pubid><pubid idtype="pmcid">2781110</pubid><pubid idtype="pmpid" link="fulltext">20011106</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Hyperconserved CpG domains underlie Polycomb-binding sites.</p></title><aug><au><snm>Tanay</snm><fnm>A</fnm></au><au><snm>O&apos;Donnell</snm><fnm>AH</fnm></au><au><snm>Damelin</snm><fnm>M</fnm></au><au><snm>Bestor</snm><fnm>TH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>5521</fpage><lpage>5526</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0609746104</pubid><pubid idtype="pmcid">1838490</pubid><pubid idtype="pmpid" link="fulltext">17376869</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Genome-wide analysis of mammalian promoter architecture and evolution.</p></title><aug><au><snm>Carninci</snm><fnm>P</fnm></au><au><snm>Sandelin</snm><fnm>A</fnm></au><au><snm>Lenhard</snm><fnm>B</fnm></au><au><snm>Katayama</snm><fnm>S</fnm></au><au><snm>Shimokawa</snm><fnm>K</fnm></au><au><snm>Ponjavic</snm><fnm>J</fnm></au><au><snm>Semple</snm><fnm>CAM</fnm></au><au><snm>Taylor</snm><fnm>MS</fnm></au><au><snm>Engstr&#246;m</snm><fnm>PG</fnm></au><au><snm>Frith</snm><fnm>MC</fnm></au><au><snm>Forrest</snm><fnm>ARR</fnm></au><au><snm>Alkema</snm><fnm>WB</fnm></au><au><snm>Tan</snm><fnm>SL</fnm></au><au><snm>Plessy</snm><fnm>C</fnm></au><au><snm>Kodzius</snm><fnm>R</fnm></au><au><snm>Ravasi</snm><fnm>T</fnm></au><au><snm>Kasukawa</snm><fnm>T</fnm></au><au><snm>Fukuda</snm><fnm>S</fnm></au><au><snm>Kanamori-Katayama</snm><fnm>M</fnm></au><au><snm>Kitazume</snm><fnm>Y</fnm></au><au><snm>Kawaji</snm><fnm>H</fnm></au><au><snm>Kai</snm><fnm>C</fnm></au><au><snm>Nakamura</snm><fnm>M</fnm></au><au><snm>Konno</snm><fnm>H</fnm></au><au><snm>Nakano</snm><fnm>K</fnm></au><au><snm>Mottagui-Tabar</snm><fnm>S</fnm></au><au><snm>Arner</snm><fnm>P</fnm></au><au><snm>Chesi</snm><fnm>A</fnm></au><au><snm>Gustincich</snm><fnm>S</fnm></au><au><snm>Persichetti</snm><fnm>F</fnm></au><etal/></aug><source>Nat Genet</source><pubdate>2006</pubdate><volume>38</volume><fpage>626</fpage><lpage>635</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1789</pubid><pubid idtype="pmpid" link="fulltext">16645617</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.</p></title><aug><au><snm>Core</snm><fnm>LJ</fnm></au><au><snm>Waterfall</snm><fnm>JJ</fnm></au><au><snm>Lis</snm><fnm>JT</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>322</volume><fpage>1845</fpage><lpage>1848</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1162228</pubid><pubid idtype="pmcid">2833333</pubid><pubid idtype="pmpid" link="fulltext">19056941</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>RNA exosome depletion reveals transcription upstream of active human promoters.</p></title><aug><au><snm>Preker</snm><fnm>P</fnm></au><au><snm>Nielsen</snm><fnm>J</fnm></au><au><snm>Kammler</snm><fnm>S</fnm></au><au><snm>Lykke-Andersen</snm><fnm>S</fnm></au><au><snm>Christensen</snm><fnm>MS</fnm></au><au><snm>Mapendano</snm><fnm>CK</fnm></au><au><snm>Schierup</snm><fnm>MH</fnm></au><au><snm>Jensen</snm><fnm>TH</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>322</volume><fpage>1851</fpage><lpage>1854</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1164096</pubid><pubid idtype="pmpid" link="fulltext">19056938</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Divergent transcription from active promoters.</p></title><aug><au><snm>Seila</snm><fnm>AC</fnm></au><au><snm>Calabrese</snm><fnm>JM</fnm></au><au><snm>Levine</snm><fnm>SS</fnm></au><au><snm>Yeo</snm><fnm>GW</fnm></au><au><snm>Rahl</snm><fnm>PB</fnm></au><au><snm>Flynn</snm><fnm>RA</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au><au><snm>Sharp</snm><fnm>PA</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>322</volume><fpage>1849</fpage><lpage>1851</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1162253</pubid><pubid idtype="pmcid">2692996</pubid><pubid idtype="pmpid" link="fulltext">19056940</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>A chromatin landmark and transcription initiation at most promoters in human cells.</p></title><aug><au><snm>Guenther</snm><fnm>MG</fnm></au><au><snm>Levine</snm><fnm>SS</fnm></au><au><snm>Boyer</snm><fnm>LA</fnm></au><au><snm>Jaenisch</snm><fnm>R</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>130</volume><fpage>77</fpage><lpage>88</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.05.042</pubid><pubid idtype="pmcid">3200295</pubid><pubid idtype="pmpid" link="fulltext">17632057</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues.</p></title><aug><au><snm>Rozenberg</snm><fnm>JM</fnm></au><au><snm>Shlyakhtenko</snm><fnm>A</fnm></au><au><snm>Glass</snm><fnm>K</fnm></au><au><snm>Rishi</snm><fnm>V</fnm></au><au><snm>Myakishev</snm><fnm>MV</fnm></au><au><snm>Fitzgerald</snm><fnm>PC</fnm></au><au><snm>Vinson</snm><fnm>C</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>67</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-67</pubid><pubid idtype="pmcid">2267717</pubid><pubid idtype="pmpid" link="fulltext">18252004</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>c-Myc regulates transcriptional pause release.</p></title><aug><au><snm>Rahl</snm><fnm>PB</fnm></au><au><snm>Lin</snm><fnm>CY</fnm></au><au><snm>Seila</snm><fnm>AC</fnm></au><au><snm>Flynn</snm><fnm>RA</fnm></au><au><snm>Mccuine</snm><fnm>S</fnm></au><au><snm>Burge</snm><fnm>CB</fnm></au><au><snm>Sharp</snm><fnm>PA</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au></aug><source>Cell</source><pubdate>2010</pubdate><volume>141</volume><fpage>432</fpage><lpage>445</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2010.03.030</pubid><pubid idtype="pmcid">2864022</pubid><pubid idtype="pmpid" link="fulltext">20434984</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>A transcription factor affinity-based code for mammalian transcription initiation.</p></title><aug><au><snm>Megraw</snm><fnm>M</fnm></au><au><snm>Pereira</snm><fnm>F</fnm></au><au><snm>Jensen</snm><fnm>ST</fnm></au><au><snm>Ohler</snm><fnm>U</fnm></au><au><snm>Hatzigeorgiou</snm><fnm>AG</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>644</fpage><lpage>656</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.085449.108</pubid><pubid idtype="pmcid">2665783</pubid><pubid idtype="pmpid" link="fulltext">19141595</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Cloning of a mammalian transcriptional activator that binds unmethylated CpG motifs and shares a CXXC domain with DNA methyltransferase, human trithorax, and methyl-CpG binding domain protein 1.</p></title><aug><au><snm>Voo</snm><fnm>KS</fnm></au><au><snm>Carlone</snm><fnm>DL</fnm></au><au><snm>Jacobsen</snm><fnm>BM</fnm></au><au><snm>Flodin</snm><fnm>A</fnm></au><au><snm>Skalnik</snm><fnm>DG</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2000</pubdate><volume>20</volume><fpage>2108</fpage><lpage>2121</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.20.6.2108-2121.2000</pubid><pubid idtype="pmcid">110827</pubid><pubid idtype="pmpid" link="fulltext">10688657</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>CpG islands influence chromatin structure via the CpG-binding protein Cfp1.</p></title><aug><au><snm>Thomson</snm><fnm>JP</fnm></au><au><snm>Skene</snm><fnm>PJ</fnm></au><au><snm>Selfridge</snm><fnm>J</fnm></au><au><snm>Clouaire</snm><fnm>T</fnm></au><au><snm>Guy</snm><fnm>J</fnm></au><au><snm>Webb</snm><fnm>S</fnm></au><au><snm>Kerr</snm><fnm>AR</fnm></au><au><snm>Deaton</snm><fnm>A</fnm></au><au><snm>Andrews</snm><fnm>R</fnm></au><au><snm>James</snm><fnm>KD</fnm></au><au><snm>Turner</snm><fnm>DJ</fnm></au><au><snm>Illingworth</snm><fnm>R</fnm></au><au><snm>Bird</snm><fnm>A</fnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>464</volume><fpage>1082</fpage><lpage>1086</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08924</pubid><pubid idtype="pmpid" link="fulltext">20393567</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>CpG islands recruit a histone H3 lysine 36 demethylase.</p></title><aug><au><snm>Blackledge</snm><fnm>NP</fnm></au><au><snm>Zhou</snm><fnm>JC</fnm></au><au><snm>Tolstorukov</snm><fnm>MY</fnm></au><au><snm>Farcas</snm><fnm>AM</fnm></au><au><snm>Park</snm><fnm>PJ</fnm></au><au><snm>Klose</snm><fnm>RJ</fnm></au></aug><source>Mol Cell</source><pubdate>2010</pubdate><volume>38</volume><fpage>179</fpage><lpage>190</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2010.04.009</pubid><pubid idtype="pmcid">3098377</pubid><pubid idtype="pmpid" link="fulltext">20417597</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Genome-wide maps of chromatin state in pluripotent and lineage-committed cells.</p></title><aug><au><snm>Mikkelsen</snm><fnm>TS</fnm></au><au><snm>Ku</snm><fnm>M</fnm></au><au><snm>Jaffe</snm><fnm>DB</fnm></au><au><snm>Issac</snm><fnm>B</fnm></au><au><snm>Lieberman</snm><fnm>E</fnm></au><au><snm>Giannoukos</snm><fnm>G</fnm></au><au><snm>Alvarez</snm><fnm>P</fnm></au><au><snm>Brockman</snm><fnm>W</fnm></au><au><snm>Kim</snm><fnm>TK</fnm></au><au><snm>Koche</snm><fnm>RP</fnm></au><au><snm>Lee</snm><fnm>W</fnm></au><au><snm>Mendenhall</snm><fnm>E</fnm></au><au><snm>O&apos;Donovan</snm><fnm>A</fnm></au><au><snm>Presser</snm><fnm>A</fnm></au><au><snm>Russ</snm><fnm>C</fnm></au><au><snm>Xie</snm><fnm>X</fnm></au><au><snm>Meissner</snm><fnm>A</fnm></au><au><snm>Wernig</snm><fnm>M</fnm></au><au><snm>Jaenisch</snm><fnm>R</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Lander</snm><fnm>ES</fnm></au><au><snm>Bernstein</snm><fnm>BE</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>448</volume><fpage>553</fpage><lpage>560</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06008</pubid><pubid idtype="pmcid">2921165</pubid><pubid idtype="pmpid" link="fulltext">17603471</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome.</p></title><aug><au><snm>Weber</snm><fnm>M</fnm></au><au><snm>Hellmann</snm><fnm>I</fnm></au><au><snm>Stadler</snm><fnm>MB</fnm></au><au><snm>Ramos</snm><fnm>L</fnm></au><au><snm>Paabo</snm><fnm>S</fnm></au><au><snm>Rebhan</snm><fnm>M</fnm></au><au><snm>Schubeler</snm><fnm>D</fnm></au></aug><source>Nat Genet</source><pubdate>2007</pubdate><volume>39</volume><fpage>457</fpage><lpage>466</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1990</pubid><pubid idtype="pmpid" link="fulltext">17334365</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.</p></title><aug><au><snm>Birney</snm><fnm>E</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Dutta</snm><fnm>A</fnm></au><au><snm>Guig&#243;</snm><fnm>R</fnm></au><au><snm>Gingeras</snm><fnm>TR</fnm></au><au><snm>Margulies</snm><fnm>EH</fnm></au><au><snm>Weng</snm><fnm>Z</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Dermitzakis</snm><fnm>ET</fnm></au><au><snm>Thurman</snm><fnm>RE</fnm></au><au><snm>Kuehn</snm><fnm>MS</fnm></au><au><snm>Taylor</snm><fnm>CM</fnm></au><au><snm>Neph</snm><fnm>S</fnm></au><au><snm>Koch</snm><fnm>CM</fnm></au><au><snm>Asthana</snm><fnm>S</fnm></au><au><snm>Malhotra</snm><fnm>A</fnm></au><au><snm>Adzhubei</snm><fnm>I</fnm></au><au><snm>Greenbaum</snm><fnm>JA</fnm></au><au><snm>Andrews</snm><fnm>RM</fnm></au><au><snm>Flicek</snm><fnm>P</fnm></au><au><snm>Boyle</snm><fnm>PJ</fnm></au><au><snm>Cao</snm><fnm>H</fnm></au><au><snm>Carter</snm><fnm>NP</fnm></au><au><snm>Clelland</snm><fnm>GK</fnm></au><au><snm>Davis</snm><fnm>S</fnm></au><au><snm>Day</snm><fnm>N</fnm></au><au><snm>Dhami</snm><fnm>P</fnm></au><au><snm>Dillon</snm><fnm>SC</fnm></au><au><snm>Dorschner</snm><fnm>MO</fnm></au><au><snm>Fiegler</snm><fnm>H</fnm></au><etal/></aug><source>Nature</source><pubdate>2007</pubdate><volume>447</volume><fpage>799</fpage><lpage>816</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05874</pubid><pubid idtype="pmcid">2212820</pubid><pubid idtype="pmpid" link="fulltext">17571346</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Genome-wide analysis of histone modifications in human pancreatic islets.</p></title><aug><au><snm>Bhandare</snm><fnm>R</fnm></au><au><snm>Schug</snm><fnm>J</fnm></au><au><snm>Le Lay</snm><fnm>J</fnm></au><au><snm>Fox</snm><fnm>A</fnm></au><au><snm>Smirnova</snm><fnm>O</fnm></au><au><snm>Liu</snm><fnm>C</fnm></au><au><snm>Naji</snm><fnm>A</fnm></au><au><snm>Kaestner</snm><fnm>KH</fnm></au></aug><source>Genome Res</source><pubdate>2010</pubdate><volume>20</volume><fpage>428</fpage><lpage>433</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.102038.109</pubid><pubid idtype="pmcid">2847745</pubid><pubid idtype="pmpid" link="fulltext">20181961</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>GC-rich sequence elements recruit PRC2 in mammalian ES cells.</p></title><aug><au><snm>Mendenhall</snm><fnm>EM</fnm></au><au><snm>Koche</snm><fnm>RP</fnm></au><au><snm>Truong</snm><fnm>T</fnm></au><au><snm>Zhou</snm><fnm>VW</fnm></au><au><snm>Issac</snm><fnm>B</fnm></au><au><snm>Chi</snm><fnm>AS</fnm></au><au><snm>Ku</snm><fnm>M</fnm></au><au><snm>Bernstein</snm><fnm>BE</fnm></au></aug><source>PLoS Genet</source><pubdate>2010</pubdate><volume>6</volume><fpage>e1001244</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1001244</pubid><pubid idtype="pmcid">3000368</pubid><pubid idtype="pmpid" link="fulltext">21170310</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>High nucleosome occupancy is encoded at human regulatory sequences.</p></title><aug><au><snm>Tillo</snm><fnm>D</fnm></au><au><snm>Kaplan</snm><fnm>N</fnm></au><au><snm>Moore</snm><fnm>IK</fnm></au><au><snm>Fondufe-Mittendorf</snm><fnm>Y</fnm></au><au><snm>Gossett</snm><fnm>AJ</fnm></au><au><snm>Field</snm><fnm>Y</fnm></au><au><snm>Lieb</snm><fnm>JD</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au><au><snm>Segal</snm><fnm>E</fnm></au><au><snm>Hughes</snm><fnm>TR</fnm></au></aug><source>PLoS One</source><pubdate>2010</pubdate><volume>5</volume><fpage>e9129</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0009129</pubid><pubid idtype="pmcid">2817738</pubid><pubid idtype="pmpid" link="fulltext">20161746</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Two strategies for gene regulation by promoter nucleosomes.</p></title><aug><au><snm>Tirosh</snm><fnm>I</fnm></au><au><snm>Barkai</snm><fnm>N</fnm></au></aug><source>Genome Res</source><pubdate>2008</pubdate><volume>18</volume><fpage>1084</fpage><lpage>1091</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.076059.108</pubid><pubid idtype="pmcid">2493397</pubid><pubid idtype="pmpid" link="fulltext">18448704</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Distinct modes of regulation by chromatin encoded through nucleosome positioning signals.</p></title><aug><au><snm>Field</snm><fnm>Y</fnm></au><au><snm>Kaplan</snm><fnm>N</fnm></au><au><snm>Fondufe-Mittendorf</snm><fnm>Y</fnm></au><au><snm>Moore</snm><fnm>IK</fnm></au><au><snm>Sharon</snm><fnm>E</fnm></au><au><snm>Lubling</snm><fnm>Y</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au><au><snm>Segal</snm><fnm>E</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2008</pubdate><volume>4</volume><fpage>e1000216</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000216</pubid><pubid idtype="pmcid">2570626</pubid><pubid idtype="pmpid" link="fulltext">18989395</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>High-resolution human core-promoter prediction with CoreBoost_HM.</p></title><aug><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Xuan</snm><fnm>Z</fnm></au><au><snm>Zhao</snm><fnm>X</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Zhang</snm><fnm>MQ</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>266</fpage><lpage>275</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2652208</pubid><pubid idtype="pmpid" link="fulltext">18997002</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Histone modification levels are predictive for gene expression.</p></title><aug><au><snm>Karlic</snm><fnm>R</fnm></au><au><snm>Chung</snm><fnm>HR</fnm></au><au><snm>Lasserre</snm><fnm>J</fnm></au><au><snm>Vlahovicek</snm><fnm>K</fnm></au><au><snm>Vingron</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2010</pubdate><volume>107</volume><fpage>2926</fpage><lpage>2931</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0909344107</pubid><pubid idtype="pmcid">2814872</pubid><pubid idtype="pmpid" link="fulltext">20133639</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Regulation of chromatin by histone modifications.</p></title><aug><au><snm>Bannister</snm><fnm>AJ</fnm></au><au><snm>Kouzarides</snm><fnm>T</fnm></au></aug><source>Cell Res</source><pubdate>2011</pubdate><volume>21</volume><fpage>381</fpage><lpage>395</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/cr.2011.22</pubid><pubid idtype="pmcid">3193420</pubid><pubid idtype="pmpid" link="fulltext">21321607</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Methylation of histone H3 Lys 4 in coding regions of active genes.</p></title><aug><au><snm>Bernstein</snm><fnm>BE</fnm></au><au><snm>Humphrey</snm><fnm>EL</fnm></au><au><snm>Erlich</snm><fnm>RL</fnm></au><au><snm>Schneider</snm><fnm>R</fnm></au><au><snm>Bouman</snm><fnm>P</fnm></au><au><snm>Liu</snm><fnm>JS</fnm></au><au><snm>Kouzarides</snm><fnm>T</fnm></au><au><snm>Schreiber</snm><fnm>SL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2002</pubdate><volume>99</volume><fpage>8695</fpage><lpage>8700</lpage><xrefbib><pubidlist><pubid idtype="pmcid">124361</pubid><pubid idtype="pmpid" link="fulltext">12060701</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>A high-resolution map of active promoters in the human genome.</p></title><aug><au><snm>Kim</snm><fnm>TH</fnm></au><au><snm>Barrera</snm><fnm>LO</fnm></au><au><snm>Zheng</snm><fnm>M</fnm></au><au><snm>Qu</snm><fnm>C</fnm></au><au><snm>Singer</snm><fnm>MA</fnm></au><au><snm>Richmond</snm><fnm>TA</fnm></au><au><snm>Wu</snm><fnm>Y</fnm></au><au><snm>Green</snm><fnm>RD</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>436</volume><fpage>876</fpage><lpage>880</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature03877</pubid><pubid idtype="pmcid">1895599</pubid><pubid idtype="pmpid" link="fulltext">15988478</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Genome-wide map of nucleosome acetylation and methylation in yeast.</p></title><aug><au><snm>Pokholok</snm><fnm>DK</fnm></au><au><snm>Harbison</snm><fnm>CT</fnm></au><au><snm>Levine</snm><fnm>S</fnm></au><au><snm>Cole</snm><fnm>M</fnm></au><au><snm>Hannett</snm><fnm>NM</fnm></au><au><snm>Lee</snm><fnm>TI</fnm></au><au><snm>Bell</snm><fnm>GW</fnm></au><au><snm>Walker</snm><fnm>K</fnm></au><au><snm>Rolfe</snm><fnm>PA</fnm></au><au><snm>Herbolsheimer</snm><fnm>E</fnm></au><au><snm>Zeitlinger</snm><fnm>J</fnm></au><au><snm>Lewitter</snm><fnm>F</fnm></au><au><snm>Gifford</snm><fnm>DK</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au></aug><source>Cell</source><pubdate>2005</pubdate><volume>122</volume><fpage>517</fpage><lpage>527</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2005.06.026</pubid><pubid idtype="pmpid" link="fulltext">16122420</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Combinatorial patterns of histone acetylations and methylations in the human genome.</p></title><aug><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Zang</snm><fnm>C</fnm></au><au><snm>Rosenfeld</snm><fnm>JA</fnm></au><au><snm>Schones</snm><fnm>DE</fnm></au><au><snm>Barski</snm><fnm>A</fnm></au><au><snm>Cuddapah</snm><fnm>S</fnm></au><au><snm>Cui</snm><fnm>K</fnm></au><au><snm>Roh</snm><fnm>TY</fnm></au><au><snm>Peng</snm><fnm>W</fnm></au><au><snm>Zhang</snm><fnm>MQ</fnm></au><au><snm>Zhao</snm><fnm>K</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>897</fpage><lpage>903</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.154</pubid><pubid idtype="pmcid">2769248</pubid><pubid idtype="pmpid" link="fulltext">18552846</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Discovery and characterization of chromatin states for systematic annotation of the human genome.</p></title><aug><au><snm>Ernst</snm><fnm>J</fnm></au><au><snm>Kellis</snm><fnm>M</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2010</pubdate><volume>28</volume><fpage>817</fpage><lpage>825</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1662</pubid><pubid idtype="pmcid">2919626</pubid><pubid idtype="pmpid" link="fulltext">20657582</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>The histone 3 lysine 36 methyltransferase, SET2, is involved in transcriptional elongation.</p></title><aug><au><snm>Schaft</snm><fnm>D</fnm></au><au><snm>Roguev</snm><fnm>A</fnm></au><au><snm>Kotovic</snm><fnm>KM</fnm></au><au><snm>Shevchenko</snm><fnm>A</fnm></au><au><snm>Sarov</snm><fnm>M</fnm></au><au><snm>Shevchenko</snm><fnm>A</fnm></au><au><snm>Neugebauer</snm><fnm>KM</fnm></au><au><snm>Stewart</snm><fnm>AF</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>2475</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg372</pubid><pubid idtype="pmcid">156053</pubid><pubid idtype="pmpid" link="fulltext">12736296</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Methylation of histone H3 by Set2 in Saccharomyces cerevisiae is linked to transcriptional elongation by RNA polymerase II.</p></title><aug><au><snm>Krogan</snm><fnm>NJ</fnm></au><au><snm>Kim</snm><fnm>M</fnm></au><au><snm>Tong</snm><fnm>A</fnm></au><au><snm>Golshani</snm><fnm>A</fnm></au><au><snm>Cagney</snm><fnm>G</fnm></au><au><snm>Canadien</snm><fnm>V</fnm></au><au><snm>Richards</snm><fnm>DP</fnm></au><au><snm>Beattie</snm><fnm>BK</fnm></au><au><snm>Emili</snm><fnm>A</fnm></au><au><snm>Boone</snm><fnm>C</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2003</pubdate><volume>23</volume><fpage>4207</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.23.12.4207-4218.2003</pubid><pubid idtype="pmcid">427527</pubid><pubid idtype="pmpid" link="fulltext">12773564</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Association of the histone methyltransferase Set2 with RNA polymerase II plays a role in transcription elongation.</p></title><aug><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Moazed</snm><fnm>D</fnm></au><au><snm>Gygi</snm><fnm>SP</fnm></au></aug><source>J Biol Chem</source><pubdate>2002</pubdate><volume>277</volume><fpage>49383</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M209294200</pubid><pubid idtype="pmpid" link="fulltext">12381723</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Discovery and annotation of functional chromatin signatures in the human genome.</p></title><aug><au><snm>Hon</snm><fnm>G</fnm></au><au><snm>Wang</snm><fnm>W</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000566</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000566</pubid><pubid idtype="pmcid">2775352</pubid><pubid idtype="pmpid" link="fulltext">19918365</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Histone H4 Lys 20 monomethylation by histone methylase SET8 mediates Wnt target gene activation.</p></title><aug><au><snm>Li</snm><fnm>Z</fnm></au><au><snm>Nie</snm><fnm>F</fnm></au><au><snm>Wang</snm><fnm>S</fnm></au><au><snm>Li</snm><fnm>L</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2011</pubdate><volume>108</volume><fpage>3116</fpage><lpage>3123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.1009353108</pubid><pubid idtype="pmcid">3044395</pubid><pubid idtype="pmpid" link="fulltext">21282610</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>PR-Set7-mediated monomethylation of histone H4 lysine 20 at specific genomic regions induces transcriptional repression.</p></title><aug><au><snm>Congdon</snm><fnm>LM</fnm></au><au><snm>Houston</snm><fnm>SI</fnm></au><au><snm>Veerappan</snm><fnm>CS</fnm></au><au><snm>Spektor</snm><fnm>TM</fnm></au><au><snm>Rice</snm><fnm>JC</fnm></au></aug><source>J Cell Biochem</source><pubdate>2010</pubdate><volume>110</volume><fpage>609</fpage><lpage>619</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcb.22570</pubid><pubid idtype="pmpid" link="fulltext">20512922</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>High-resolution profiling of histone methylations in the human genome.</p></title><aug><au><snm>Barski</snm><fnm>A</fnm></au><au><snm>Cuddapah</snm><fnm>S</fnm></au><au><snm>Cui</snm><fnm>K</fnm></au><au><snm>Roh</snm><fnm>TY</fnm></au><au><snm>Schones</snm><fnm>DE</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Wei</snm><fnm>G</fnm></au><au><snm>Chepelev</snm><fnm>I</fnm></au><au><snm>Zhao</snm><fnm>K</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>129</volume><fpage>823</fpage><lpage>837</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.05.009</pubid><pubid idtype="pmpid" link="fulltext">17512414</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Profile of histone lysine methylation across transcribed mammalian chromatin.</p></title><aug><au><snm>Vakoc</snm><fnm>CR</fnm></au><au><snm>Sachdeva</snm><fnm>MM</fnm></au><au><snm>Wang</snm><fnm>H</fnm></au><au><snm>Blobel</snm><fnm>GA</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2006</pubdate><volume>26</volume><fpage>9185</fpage><lpage>9195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.01529-06</pubid><pubid idtype="pmcid">1698537</pubid><pubid idtype="pmpid" link="fulltext">17030614</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>CpGcluster: a distance-based algorithm for CpG-island detection.</p></title><aug><au><snm>Hackenberg</snm><fnm>M</fnm></au><au><snm>Previti</snm><fnm>C</fnm></au><au><snm>Luque-Escamilla</snm><fnm>PL</fnm></au><au><snm>Carpena</snm><fnm>P</fnm></au><au><snm>Martinez-Aroza</snm><fnm>J</fnm></au><au><snm>Oliver</snm><fnm>JL</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2006</pubdate><volume>7</volume><fpage>446</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-7-446</pubid><pubid idtype="pmcid">1617122</pubid><pubid idtype="pmpid" link="fulltext">17038168</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Dynamic regulation of nucleosome positioning in the human genome.</p></title><aug><au><snm>Schones</snm><fnm>DE</fnm></au><au><snm>Cui</snm><fnm>K</fnm></au><au><snm>Cuddapah</snm><fnm>S</fnm></au><au><snm>Roh</snm><fnm>TY</fnm></au><au><snm>Barski</snm><fnm>A</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Wei</snm><fnm>G</fnm></au><au><snm>Zhao</snm><fnm>K</fnm></au></aug><source>Cell</source><pubdate>2008</pubdate><volume>132</volume><fpage>887</fpage><lpage>898</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2008.02.022</pubid><pubid idtype="pmpid" link="fulltext">18329373</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>The DNA-encoded nucleosome organization of a eukaryotic genome.</p></title><aug><au><snm>Kaplan</snm><fnm>N</fnm></au><au><snm>Moore</snm><fnm>IK</fnm></au><au><snm>Fondufe-Mittendorf</snm><fnm>Y</fnm></au><au><snm>Gossett</snm><fnm>AJ</fnm></au><au><snm>Tillo</snm><fnm>D</fnm></au><au><snm>Field</snm><fnm>Y</fnm></au><au><snm>LeProust</snm><fnm>EM</fnm></au><au><snm>Hughes</snm><fnm>TR</fnm></au><au><snm>Lieb</snm><fnm>JD</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au><au><snm>Segal</snm><fnm>E</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>458</volume><fpage>362</fpage><lpage>366</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07667</pubid><pubid idtype="pmcid">2658732</pubid><pubid idtype="pmpid" link="fulltext">19092803</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Chromatin organization in sperm may be the major functional consequence of base composition variation in the human genome.</p></title><aug><au><snm>Vavouri</snm><fnm>T</fnm></au><au><snm>Lehner</snm><fnm>B</fnm></au></aug><source>PLoS Genet</source><pubdate>2011</pubdate><volume>7</volume><fpage>e1002036</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1002036</pubid><pubid idtype="pmcid">3072381</pubid><pubid idtype="pmpid" link="fulltext">21490963</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>SUV420H2-mediated H4K20 trimethylation enforces RNA polymerase II promoter-proximal pausing by blocking hMOF-dependent H4K16 acetylation.</p></title><aug><au><snm>Kapoor-Vazirani</snm><fnm>P</fnm></au><au><snm>Kagey</snm><fnm>JD</fnm></au><au><snm>Vertino</snm><fnm>PM</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2011</pubdate><volume>31</volume><fpage>1594</fpage><lpage>1609</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.00524-10</pubid><pubid idtype="pmcid">3126334</pubid><pubid idtype="pmpid" link="fulltext">21321083</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>A silencing pathway to induce H3-K9 and H4-K20 trimethylation at constitutive heterochromatin.</p></title><aug><au><snm>Schotta</snm><fnm>G</fnm></au><au><snm>Lachner</snm><fnm>M</fnm></au><au><snm>Sarma</snm><fnm>K</fnm></au><au><snm>Ebert</snm><fnm>A</fnm></au><au><snm>Sengupta</snm><fnm>R</fnm></au><au><snm>Reuter</snm><fnm>G</fnm></au><au><snm>Reinberg</snm><fnm>D</fnm></au><au><snm>Jenuwein</snm><fnm>T</fnm></au></aug><source>Genes Dev</source><pubdate>2004</pubdate><volume>18</volume><fpage>1251</fpage><lpage>1262</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.300704</pubid><pubid idtype="pmcid">420351</pubid><pubid idtype="pmpid" link="fulltext">15145825</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Analysis of the vertebrate insulator protein CTCF-binding sites in the human genome.</p></title><aug><au><snm>Kim</snm><fnm>TH</fnm></au><au><snm>Abdullaev</snm><fnm>ZK</fnm></au><au><snm>Smith</snm><fnm>AD</fnm></au><au><snm>Ching</snm><fnm>KA</fnm></au><au><snm>Loukinov</snm><fnm>DI</fnm></au><au><snm>Green</snm><fnm>RD</fnm></au><au><snm>Zhang</snm><fnm>MQ</fnm></au><au><snm>Lobanenkov</snm><fnm>VV</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>128</volume><fpage>1231</fpage><lpage>1245</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2006.12.048</pubid><pubid idtype="pmcid">2572726</pubid><pubid idtype="pmpid" link="fulltext">17382889</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Spatial organization of gene expression: the active chromatin hub.</p></title><aug><au><snm>de Laat</snm><fnm>W</fnm></au><au><snm>Grosveld</snm><fnm>F</fnm></au></aug><source>Chromosome Res</source><pubdate>2003</pubdate><volume>11</volume><fpage>447</fpage><lpage>459</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1023/A:1024922626726</pubid><pubid idtype="pmpid">12971721</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Transcription factories: genetic programming in three dimensions.</p></title><aug><au><snm>Edelman</snm><fnm>LB</fnm></au><au><snm>Fraser</snm><fnm>P</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>2012</pubdate><volume>22</volume><fpage>110</fpage><lpage>114</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gde.2012.01.010</pubid><pubid idtype="pmpid" link="fulltext">22365496</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Histone demethylation by a family of JmjC domain-containing proteins.</p></title><aug><au><snm>Tsukada</snm><fnm>Y</fnm></au><au><snm>Fang</snm><fnm>J</fnm></au><au><snm>Erdjument-Bromage</snm><fnm>H</fnm></au><au><snm>Warren</snm><fnm>ME</fnm></au><au><snm>Borchers</snm><fnm>CH</fnm></au><au><snm>Tempst</snm><fnm>P</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>439</volume><fpage>811</fpage><lpage>816</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04433</pubid><pubid idtype="pmpid" link="fulltext">16362057</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Short RNAs are transcribed from repressed polycomb target genes and interact with polycomb repressive complex-2.</p></title><aug><au><snm>Kanhere</snm><fnm>A</fnm></au><au><snm>Viiri</snm><fnm>K</fnm></au><au><snm>Araujo</snm><fnm>CC</fnm></au><au><snm>Rasaiyaah</snm><fnm>J</fnm></au><au><snm>Bouwman</snm><fnm>RD</fnm></au><au><snm>Whyte</snm><fnm>WA</fnm></au><au><snm>Pereira</snm><fnm>CF</fnm></au><au><snm>Brookes</snm><fnm>E</fnm></au><au><snm>Walker</snm><fnm>K</fnm></au><au><snm>Bell</snm><fnm>GW</fnm></au><au><snm>Pombo</snm><fnm>A</fnm></au><au><snm>Fisher</snm><fnm>AG</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au><au><snm>Jenner</snm><fnm>RG</fnm></au></aug><source>Mol Cell</source><pubdate>2010</pubdate><volume>38</volume><fpage>675</fpage><lpage>688</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2010.03.019</pubid><pubid idtype="pmcid">2886029</pubid><pubid idtype="pmpid" link="fulltext">20542000</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Ensembl 2011.</p></title><aug><au><snm>Flicek</snm><fnm>P</fnm></au><au><snm>Amode</snm><fnm>MR</fnm></au><au><snm>Barrell</snm><fnm>D</fnm></au><au><snm>Beal</snm><fnm>K</fnm></au><au><snm>Brent</snm><fnm>S</fnm></au><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Clapham</snm><fnm>P</fnm></au><au><snm>Coates</snm><fnm>G</fnm></au><au><snm>Fairley</snm><fnm>S</fnm></au><au><snm>Fitzgerald</snm><fnm>S</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><fpage>D800</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1064</pubid><pubid idtype="pmcid">3013672</pubid><pubid idtype="pmpid" link="fulltext">21045057</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>The UCSC Genome Browser database: update 2010.</p></title><aug><au><snm>Rhead</snm><fnm>B</fnm></au><au><snm>Karolchik</snm><fnm>D</fnm></au><au><snm>Kuhn</snm><fnm>RM</fnm></au><au><snm>Hinrichs</snm><fnm>AS</fnm></au><au><snm>Zweig</snm><fnm>AS</fnm></au><au><snm>Fujita</snm><fnm>PA</fnm></au><au><snm>Diekhans</snm><fnm>M</fnm></au><au><snm>Smith</snm><fnm>KE</fnm></au><au><snm>Rosenbloom</snm><fnm>KR</fnm></au><au><snm>Raney</snm><fnm>BJ</fnm></au><au><snm>Pohl</snm><fnm>A</fnm></au><au><snm>Pheasant</snm><fnm>M</fnm></au><au><snm>Meyer</snm><fnm>LR</fnm></au><au><snm>Learned</snm><fnm>K</fnm></au><au><snm>Hsu</snm><fnm>F</fnm></au><au><snm>Hillman-Jackson</snm><fnm>J</fnm></au><au><snm>Harte</snm><fnm>RA</fnm></au><au><snm>Giardine</snm><fnm>B</fnm></au><au><snm>Dreszer</snm><fnm>TR</fnm></au><au><snm>Clawson</snm><fnm>H</fnm></au><au><snm>Barber</snm><fnm>GP</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au><au><snm>Kent</snm><fnm>WJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>D613</fpage><lpage>619</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp939</pubid><pubid idtype="pmcid">2808870</pubid><pubid idtype="pmpid" link="fulltext">19906737</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Nucleosome deposition and DNA methylation at coding region boundaries.</p></title><aug><au><snm>Choi</snm><fnm>JK</fnm></au><au><snm>Bae</snm><fnm>JB</fnm></au><au><snm>Lyu</snm><fnm>J</fnm></au><au><snm>Kim</snm><fnm>TY</fnm></au><au><snm>Kim</snm><fnm>YJ</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>R89</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-9-r89</pubid><pubid idtype="pmcid">2768978</pubid><pubid idtype="pmpid" link="fulltext">19723310</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>R: A language and environment for statistical computing.</p></title><aug><au><cnm>R Development Core Team</cnm></au></aug><publisher>R Foundation for Statistical Computing, Vienna, Austria</publisher><url>http://www.r-project.org/</url></bibl><bibl id="B63"><title><p>Java Treeview--extensible visualization of microarray data.</p></title><aug><au><snm>Saldanha</snm><fnm>AJ</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><fpage>3246</fpage><lpage>3248</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth349</pubid><pubid idtype="pmpid" link="fulltext">15180930</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Distinct epigenomic landscapes of pluripotent and lineage-committed human cells.</p></title><aug><au><snm>Hawkins</snm><fnm>RD</fnm></au><au><snm>Hon</snm><fnm>GC</fnm></au><au><snm>Lee</snm><fnm>LK</fnm></au><au><snm>Ngo</snm><fnm>Q</fnm></au><au><snm>Lister</snm><fnm>R</fnm></au><au><snm>Pelizzola</snm><fnm>M</fnm></au><au><snm>Edsall</snm><fnm>LE</fnm></au><au><snm>Kuan</snm><fnm>S</fnm></au><au><snm>Luu</snm><fnm>Y</fnm></au><au><snm>Klugman</snm><fnm>S</fnm></au><au><snm>Antosiewicz-Bourget</snm><fnm>J</fnm></au><au><snm>Ye</snm><fnm>Z</fnm></au><au><snm>Espinoza</snm><fnm>C</fnm></au><au><snm>Agarwahl</snm><fnm>S</fnm></au><au><snm>Shen</snm><fnm>L</fnm></au><au><snm>Ruotti</snm><fnm>V</fnm></au><au><snm>Wang</snm><fnm>W</fnm></au><au><snm>Stewart</snm><fnm>R</fnm></au><au><snm>Thomson</snm><fnm>JA</fnm></au><au><snm>Ecker</snm><fnm>JR</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au></aug><source>Cell Stem Cell</source><pubdate>2010</pubdate><volume>6</volume><fpage>479</fpage><lpage>491</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.stem.2010.03.018</pubid><pubid idtype="pmcid">2867844</pubid><pubid idtype="pmpid" link="fulltext">20452322</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>The NIH Roadmap Epigenomics Mapping Consortium.</p></title><aug><au><snm>Bernstein</snm><fnm>BE</fnm></au><au><snm>Stamatoyannopoulos</snm><fnm>JA</fnm></au><au><snm>Costello</snm><fnm>JF</fnm></au><au><snm>Ren</snm><fnm>B</fnm></au><au><snm>Milosavljevic</snm><fnm>A</fnm></au><au><snm>Meissner</snm><fnm>A</fnm></au><au><snm>Kellis</snm><fnm>M</fnm></au><au><snm>Marra</snm><fnm>MA</fnm></au><au><snm>Beaudet</snm><fnm>AL</fnm></au><au><snm>Ecker</snm><fnm>JR</fnm></au><au><snm>Farnham</snm><fnm>PJ</fnm></au><au><snm>Hirst</snm><fnm>M</fnm></au><au><snm>Lander</snm><fnm>ES</fnm></au><au><snm>Mikkelsen</snm><fnm>TS</fnm></au><au><snm>Thomson</snm><fnm>JA</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2010</pubdate><volume>28</volume><fpage>1045</fpage><lpage>1048</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt1010-1045</pubid><pubid idtype="pmpid" link="fulltext">20944595</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm>
</art>