<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-9-r89</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Nucleosome deposition and DNA methylation at coding region boundaries</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Choi</snm>
               <mnm>Kyoon</mnm>
               <fnm>Jung</fnm>
               <insr iid="I1"/>
               <email>jungkyoon@gmail.com</email>
            </au>
            <au id="A2">
               <snm>Bae</snm>
               <fnm>Jae-Bum</fnm>
               <insr iid="I1"/>
               <email>jbbae@yonsei.ac.kr</email>
            </au>
            <au id="A3">
               <snm>Lyu</snm>
               <fnm>Jaemyun</fnm>
               <insr iid="I1"/>
               <email>jmlyu@yonsei.ac.kr</email>
            </au>
            <au id="A4">
               <snm>Kim</snm>
               <fnm>Tae-Yoon</fnm>
               <insr iid="I2"/>
               <email>tykim@cath.ac.kr</email>
            </au>
            <au ca="yes" id="A5">
               <snm>Kim</snm>
               <fnm>Young-Joon</fnm>
               <insr iid="I1"/>
               <email>yjkim@yonsei.ac.kr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biochemistry, College of Life Science and Technology, Yonsei University, 134 Sinchon-dong, Seodaemun-gu, Seoul, Korea</p>
            </ins>
            <ins id="I2">
               <p>Laboratory of Dermato-Immunology, The Catholic University of Korea, 505 Banpo-dong, Seocho-gu, Seoul, Korea</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>9</issue>
         <fpage>R89</fpage>
         <url>http://genomebiology.com/2009/10/9/R89</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19723310</pubid>
               <pubid idtype="doi">10.1186/gb-2009-10-9-r89</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>10</day>
               <month>8</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>1</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>1</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Choi et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Epigenetics at ends of coding regions</p>
      </shorttitle>
      <shortabs>
         <p>Nucleosomes and methylation have been observed to peak at both ends of protein coding units in a genome-wide survey.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Nucleosome deposition downstream of transcription initiation and DNA methylation in the gene body suggest that control of transcription elongation is a key aspect of epigenetic regulation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we report a genome-wide observation of distinct peaks of nucleosomes and methylation at both ends of a protein coding unit. Elongating polymerases tend to pause near both coding ends immediately upstream of the epigenetic peaks, causing a significant reduction in elongation efficiency. Conserved features in underlying protein coding sequences seem to dictate their evolutionary conservation across multiple species. The nucleosomal and methylation marks are commonly associated with high sequence-encoded DNA-bending propensity but differentially with CpG density. As the gene grows longer, the epigenetic codes seem to be shifted from variable inner sequences toward boundary regions, rendering the peaks more prominent in higher organisms.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Recent studies suggest that epigenetic inhibition of transcription elongation facilitates the inclusion of constitutive exons during RNA splicing. The epigenetic marks we identified here seem to secure the first and last coding exons from exon skipping as they are indispensable for accurate translation.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Recent epigenomic studies point out that epigenetic control of transcription elongation is a widespread regulatory mechanism. Intragenic DNA methylation occurs at higher density <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> and has a larger effect on expression level than promoter methylation <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, inhibiting transcription elongation in filamentous fungi <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>, plant protoplasts <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, <it>Arabidopsis </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and mammalian cells <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Remarkably, methylation of protein coding regions alone can inhibit gene expression and its inhibitory effects are larger when it occurs nearer the start codon <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Intriguingly, the methylation map of rice chromosomes exhibits single peaks near start codons <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. It is probable that the methylation peak near the start codon exerts major inhibitory effects on transcription elongation.</p>
         <p>It may be that specific organization of nucleosomes surrounding the coding start region is also important in regulating transcription elongation. The maps of H2A.Z-containing nucleosomes in yeast and fly reveal conserved positioning of a nucleosome downstream of the transcription start site (TSS; the +1 nucleosome) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Notably, another yeast study provides a hint that the +1 nucleosome peaks very close to the start codon <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. In addition, there seems to be a nucleosomal peak prior to the 3' end of the yeast open reading frame (ORF) as well <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The +1 nucleosome of fly is located further from the TSS than that of yeast <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. However, fly nucleosome patterns surrounding the start and stop codons have never been examined.</p>
         <p>Taken together, anecdotal findings in different species suggest a role for coding region boundaries in maintaining nucleosome deposition and DNA methylation. From a mechanistic perspective, the epigenetic marks can act as 'roadblocks' to RNA polymerase II (Pol II) progression. However, previous studies have focused on transcription initiation or termination, but not elongation. In this study, we attempt to provide insight into elongation inhibition surrounding translation start and end sites, which has never been observed. Given the impact of DNA methylation on nucleosome formation <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, correlations or differences between the two epigenetic marks also deserve systematic investigation.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>We first surveyed published nucleosome positions in yeast and fly <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. When aligned at the TSS, there is a significant difference in +1 nucleosome position between the two species <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> (Additional data file 1a). However, aligning by the coding region places the first coding nucleosome in similar positions in the two species - that is, just downstream of the start codon (Additional data file 1b). We also identified a highly conserved nucleosome immediately upstream of the 3' coding end in both species (Additional data file 1c).</p>
         <p>Analyzing the H2A.Z map for human T cells <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> also revealed nucleosomal peaks just downstream of start codons and just upstream of stop codons, marking both ends of the coding sequences (Figure <figr fid="F1">1a</figr>). Meanwhile, boundaries at both ends of transcripts are tightly coupled with nucleosome-free regions, potentially allowing access of the initiation and termination complex (Figure <figr fid="F1">1b</figr>). The nucleosome-free region at the TSS was followed by the +1 nucleosome. However, the association of the +1 nucleosome and the TSS appears to be weaker than that of the first coding nucleosome and the start codon. The patterns of nucleosome positioning for some individual genes are shown in Additional data file 2.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Epigenetic peaks near coding region boundaries</p>
            </caption>
            <text>
               <p>Epigenetic peaks near coding region boundaries. <b>(a, b) </b>Genome-wide average of nucleosome occupancy in T cells for genes aligned at the (a) coding ends or (b) transcript ends. The inner coding region is outlined in yellow. <b>(c) </b>Genome-wide average of methylation level in T cells for genes aligned at the coding ends or transcript ends. The inner coding and transcript regions are outlined in yellow.</p>
            </text>
            <graphic file="gb-2009-10-9-r89-1"/>
         </fig>
         <p>We carried out Solexa sequencing of methylated DNA from human T cells and found methylation peaks at the exact same positions (Figure <figr fid="F1">1c</figr>). We also profiled the mouse liver and found the same patterns (Additional data file 3). Together, the nucleosomal peaks are observed in human, fly, and yeast, and the methylation peaks in human, mouse, and plants.</p>
         <p>To examine their role in regulating transcription, we first related the level of the epigenetic peaks to expression level. We found that highly expressed genes are depleted of the epigenetic peaks (Figure <figr fid="F2">2a</figr>), consistent with findings of a nucleosomal barrier against high transcription rate <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. However, the overall correlation was not strong. We then estimated elongation efficiency as mRNA production per unit density of elongating Pol II. Upon initiation, Pol II is phosphorylated at Ser5 in its carboxy-terminal domain, switching to an elongation-competent form. Thus, we calculated the ratio of expression level to the density of Ser5-phosphorylated Pol II within the transcript body. Genes with high elongation efficiency will show high expression levels even with a low density of elongating Pol II across the transcribed region, and the opposite for low elongation efficiency. A strong association was found between the level of the epigenetic peaks and elongation efficiency (Figure <figr fid="F2">2b</figr>; Additional data file 4).</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Correlation of elongation inhibition with epigenetic peaks</p>
            </caption>
            <text>
               <p>Correlation of elongation inhibition with epigenetic peaks. <b>(a, b) </b>The average of nucleosome level (left panel) and methylation level (right panel) were plotted (a) within each expression percentile and (b) within each bin of elongation efficiency. The epigenetic marks near the start codon are scaled on the left side (black curve) and those near the stop codon on the right side (gray curve). <b>(c-e) </b>Density of Ser5-phosphorylated Pol II (black trace) and nucleosome level (gray) surrounding the start codon (left panel) or stop codon (right panel) for (c) all genes, (d) genes with a long (> 5 kb) 5' UTR (left panel) or 3' UTR (right panel), and (e) genes with high- or low-occupancy boundary nucleosomes (top 10% or bottom 10%). The Pol II scale is on the left side and the nucleosome scale is on the right side.</p>
            </text>
            <graphic file="gb-2009-10-9-r89-2"/>
         </fig>
         <p>Without any interference, elongating Pol II should be distributed evenly across the transcript body except at the initiating and terminating sites. Faced with roadblocks, however, Pol II pauses and a pileup of Pol II forms, which can be observed as a peak of Pol II density. Thus, to demonstrate Pol II pausing at epigenetic marks, three criteria should be met: the presence of a Pol II peak; the presence of an epigenetic peak; and a correspondence between the positions of the two peaks.</p>
         <p>Elongating Pol II appears to pile up immediately upstream of the nucleosomal peaks at both ends of protein coding units (Figure <figr fid="F2">2c</figr>), satisfying the three criteria. However, there seem to be confounding effects from Pol II enriched at nearby transcription initiation and termination sites. For example, the Pol II tail downstream of the stop codon (arrow above the right panel of Figure <figr fid="F2">2c</figr>) might reflect Pol II awaiting to be released from the transcription termination site. We thus selected genes with a long (> 5 kb) 5' untranslated region (UTR) and examined Pol II density around the TSS and start codon separately. Both unphosphorylated and elongating Pol II were enriched at the TSS, but only elongating Pol II showed high downstream density (Additional data file 5) with a pileup upstream of the start codon (left panel of Figure <figr fid="F2">2d</figr>). Another Pol II peak upstream of the start codon (arrow above the left panel of Figure <figr fid="F2">2d</figr>) seems to reflect Pol II at the TSS. A pileup of Pol II was also found before a long (> 5 kb) 3' UTR (right panel of Figure <figr fid="F2">2d</figr>), indicative of Pol II blockage that occurs independently of the transcription termination site.</p>
         <p>Pol II pausing was not observed with low nucleosome occupancy (Figure <figr fid="F2">2e</figr>), indicating that elongating Pol II is indeed impeded by the boundary nucleosome. To observe the specific effect of the boundary nucleosome, we computed relative occupancy at the coding end compared to the surrounding region. Higher or lower nucleosome occupancy near the coding end directly led to higher or lower Pol II density in the immediate upstream region (Additional data file 6).</p>
         <p>We roughly estimated the percentage of genes that are affected by Pol II pausing by comparing the average Pol II density around boundaries and that across surrounding regions. We found that 54% of genes exhibit higher Pol II density near the start codon than in the flanking region and 41% of genes have a Pol II peak near the stop codon.</p>
         <p>Nucleosome positioning is governed by DNA sequences <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B16">16</abbr></abbrgrp>. Methylation level is dependent on the CpG content of the target sequence <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Given the distinctive patterns of nucleosome positioning and methylation maintained in specific regions among different species, there should be strong constraints on the underlying DNA sequences. Being under strong natural selection, protein coding sequences could be better candidates than UTRs for conserved epigenetic targets downstream of transcription initiation and upstream of termination. Coding region boundaries might be subject to considerable negative selection that purifies sequence changes that are detrimental to nucleosome deposition or DNA methylation.</p>
         <p>We examined two sequence characteristics deemed to be involved in epigenetic programming: DNA-bending propensity and CpG density. DNA-bending propensity, the ability of nucleotide sequences to wrap around a histone complex, is an important determinant of nucleosome formation <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. DNase I digestion experiments indicate that bending parameters for the start codon and three stop codons are among the 8 highest out of those for the 32 trinucleotides <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Therefore, they can significantly contribute to the high bendability of coding boundaries (Additional data file 7). The boundary sequences with higher bendability tend to be more enriched for nucleosomes (upper panel in Figure <figr fid="F3">3a</figr>). Unexpectedly, DNA methylation level was also proportional to bending propensity.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Correlation of genetic and epigenetic characteristics at coding region boundaries</p>
            </caption>
            <text>
               <p>Correlation of genetic and epigenetic characteristics at coding region boundaries. <b>(a) </b>Bending propensity and CpG density were calculated for flanking sequences downstream of the start codon or upstream of the stop codon. The number of genes (gray curve measured on the right scale), methylation level (red curve), and nucleosome level (blue curve) were obtained according to the bendability (upper panels) and CpG density (lower panels) near the start codon (left panels) and stop codon (right panels). <b>(b) </b>A total of 25,883 genes were clustered by the strength of the marks near their coding ends. Marks with strengths higher than the median level of all genes were considered significant.</p>
            </text>
            <graphic file="gb-2009-10-9-r89-3"/>
         </fig>
         <p>CpG density should be a determinant of methylation level. The boundary sequences with intermediate CpG density were densely methylated (lower panel in Figure <figr fid="F3">3a</figr>). In contrast, nucleosome occupancy was dominant among genes with lower CpG density. While the two marks commonly have affinity for base compositions with high bending propensity, DNA methylation at CpG sites might affect structural DNA bending and nucleosome formation. The proportions of genes marked by both nucleosomes and methylation or by just one of these are shown in Figure <figr fid="F3">3b</figr>. More genes are specifically marked by nucleosomes than methylation, possibly because many boundary regions have relatively low CpG density (gray curve in lower panels of Figure <figr fid="F3">3a</figr>).</p>
         <p>A group of genes had highest CpG density at the 5' end (gray curve at CpG density > 0.8 in bottom left panel of Figure <figr fid="F3">3a</figr>). These genes showed a markedly reduced level of DNA methylation, reflecting the fact that CpG islands are typically unmethylated. Indeed, 97.2% of these genes contained a CpG island within their promoter (-1,000 bp to 500 bp from the TSS) and 92.8% had a short (&lt; 500 bp) UTR (<it>P </it>&lt; 10<sup>-100</sup>), an indication that promoter CpG islands are overlapping or located very close to the start codon. These genes exhibited high expression compared to the rest of the genes (<it>P </it>&lt; 10<sup>-80</sup>), even higher than the genes with a promoter CpG island (<it>P </it>= 1.4 &#215; 10<sup>-10</sup>) (Additional data file 8), indicating additional effects of elongation control.</p>
         <p>Next, we explored the intragenic distribution of the marks. Although significantly higher than its flanking region, the 5' peak is generally lower than the 3' peak (Additional data file 9a). Meanwhile, k-means clustering shows that most genes have higher peaks at both ends compared to the central region (Additional data file 9b). We then examined these patterns according to the size of the coding region (Figure <figr fid="F4">4a</figr>). We found that genes with a short coding sequence (&lt; 1 kb) have nucleosomes and methylation in their inner region over a large portion of the gene body. In particular, their 3' ends lack both marks, in sharp contrast to most other genes, in which the marks are shifted toward both ends with a bias toward the 3' end. Unlike at the 5' end, both marks commonly peaked at the 3' end, especially in many genes of intermediate size (Figure <figr fid="F4">4b</figr>).</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Epigenetic characteristics within coding regions of varying size</p>
            </caption>
            <text>
               <p>Epigenetic characteristics within coding regions of varying size. <b>(a) </b>Heatmaps showing the coding-region profiles of nucleosomes (left side) or methylation (right side) averaged over 100 neighboring genes ordered by size. A total of 25,883 genes were used. Each row represents a mean-centered profile for each gene. The average profile of genes of similar size is given on the left side (for nucleosome level) and the right side (for methylation level) of the heatmap. <b>(b) </b>A combined profile of the heatmap signatures of the two marks at each end (marks > 0.5 at the 5' end and > 1 at the 3' end were considered significant). <b>(c) </b>Genome-wide average of nucleosome occupancy in yeast and fly (upper panels) and methylation level in <it>Arabidopsis </it>and rice (lower panels) according to relative positions within the coding region. The median length of coding regions is shown. <b>(d) </b>Size of coding regions from each species. The box width is proportional to the square root of the number of measured genes.</p>
            </text>
            <graphic file="gb-2009-10-9-r89-4"/>
         </fig>
         <p>Notably, nucleosome composition within the coding region of yeast or fly genes is not sharply shifted to the coding boundaries - the 3' peak is especially not very prominent (upper panel in Figure <figr fid="F4">4c</figr>) - a similar pattern to that seen for human genes of similar size (1 to 2 kb). Intragenic DNA methylation in plant genomes is also concentrated in the central region of protein coding sequences (lower panel in Figure <figr fid="F4">4c</figr>). <it>Arabidopsis </it>genes share a similar pattern with human genes of similar size (1 to 3 kb). Rice genes are longer than <it>Arabidopsis </it>genes (Figure <figr fid="F4">4d</figr>) and have detectable, if not complete, peak patterns, which can explain the observed 5'-end peak <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Evolutionary processes seem to have maintained the boundaries of protein coding sequences as targets for nucleosome binding or DNA methylation. Incorporating novel protein domains or introns in inner sequences might have concentrated the epigenetic marks in the 5'- and 3'-end segments. It could be a good strategy to diversify DNA sequences in the middle of coding regions to encode various protein functions while constraining the boundary sequences, including the start and stop codons, to create an epigenetically favorable environment.</p>
         <p>What is the physiological role of the conserved 5'- and 3'-end peaks? A few studies have hinted at their role in RNA splicing. First, a crosstalk between nucleosomes and RNA splicing has been suggested <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. The chromatin remodeling complex SWI/SNF was shown to create roadblocks of nucleosomes that slow Pol II elongation, which facilitates the inclusion of the exon during RNA splicing. Another study showed that H3K46me3, which impedes Pol II elongation, was specifically enriched in constitutive exons compared to alternative exons, thus suggesting it has a role in controlling RNA splicing <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>These findings suggest a novel role for nucleosome deposition and DNA methylation at coding boundaries in controlling RNA splicing. The first and last coding exons should be constitutively included in a mature transcript since skipping these exons causes translation failure. Slowing elongation by these epigenetic roadblocks might facilitate the inclusion of these indispensable boundary exons. DNA sequences surrounding the start and stop codons are highly conserved to ensure a favorable epigenetic environment, explaining why the epigenetic peaks are found in the specific loci of the first and last exons. This mechanism seems to be more common among higher organisms with longer genes.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Previous epigenetic studies have focused on promoter regions in an attempt to associate epigenetic patterns with transcription initiation activity. However, recent genome-wide epigenetic studies challenge the traditional viewpoint and have shed new light on the epigenetic control of transcription elongation. Notably, epigenetic inhibition of Pol II elongation has been proposed to facilitate the inclusion of constitutive exons during RNA splicing. The evolutionarily conserved epigenetic patterns we identified here seem to ensure the inclusion of the first and last coding exons as they are indispensable for accurate translation. Further mechanistic studies are required to support this hypothesis.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Identifying gene structure</p>
            </st>
            <p>The chromosomal positions of transcription initiation and termination sites, and coding start and end sites were obtained from the RefSeq Genes track of the UCSC Genome Browser <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> for the human, mouse, and fly genomes. Each entry was treated as an individual gene. The start and end positions of yeast ORFs were obtained from the <it>Saccharomyces </it>Genome Database <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The positions of TSSs relative to the start codons of yeast ORFs were obtained from a genome-wide full-length cDNA analysis <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Nucleosomal data for yeast, fly and human</p>
            </st>
            <p>H2A.Z-containing nucleosomes were mapped to the yeast genome <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and the fly genome <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Predictions of nucleosomal locations were downloaded from the authors' website for yeast <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> and fly <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The average occupancy of predicted nucleosomes was given as a function of distance from the TSS, start codon, or stop codon across multiple aligned genes. H2A.Z-containing nucleosomes in human T cells were mapped to the human genome by means of Solexa sequencing technology <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The tag coordinate files in the browser extensible data (BED) format for resting nucleosomes were downloaded from the supplementary website <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The sequencing reads were extended to 150 bp in length according to their orientation <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and the number of overlapping sequence reads was obtained at 150-bp intervals across the human genome (UCSC hg18 assembly based on NCBI build 36.1). The read counts served as estimates of nucleosome level at the corresponding genomic intervals.</p>
         </sec>
         <sec>
            <st>
               <p>Affinity purification of methylated genomic DNA</p>
            </st>
            <p>We purified genomic DNA with the aid of the DNeasyR Blood and Tissue Kit (Qiagen, Valencia, CA, USA). The methylated CpG island recovery assay (MIRA) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> was carried out as follows. Preparation of the glutathione S-transferase (GST)-tagged MBD2b and His-tagged MBD3L1 proteins was done as described <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The purified genomic DNA was sonicated to 100 to 500 bp in size and incubated with the prepared GST-MBD2b protein, His-MBD3L1 protein, and the JM110 bacterial DNA. MagneGST beads (Promega, Madison, WI, USA) were pre-blocked with the JM110 bacterial DNA and incubated at 4&#176;C in the MIRA binding buffer (10 mM Tris-HCl, pH 7.5, 50 mM NaCl, 1 mM EDTA, 1 mM dithiothreitol, 3 mM MgCl<sub>2</sub>, 0.1% Triton X-100, 5% glycerol, 25 &#956;g/ml bovine serum albumin) and then washed in washing buffer (10 mM Tris-HCl, pH 7.5, 300 mM NaCl, 1 mM EDTA, 3 mM MgCl<sub>2</sub>, 0.1% Triton X-100). DNA elution was done by incubation at room temperature for 5 minutes and 56&#176;C for 30 minutes with RNase A and Proteinase K. Additional purification of the eluted DNA fragments was done with the aid of the QIAquick PCR Purification Kit (Qiagen).</p>
         </sec>
         <sec>
            <st>
               <p>Solexa sequencing of affinity-purified methylated DNA</p>
            </st>
            <p>The eluted DNA fragments were ligated to a pair of Solexa adaptors for Illumina Genome Analyzer sequencing. The ligation products were size-fractionated to obtain 175 to 225-bp fragments on an agarose gel and subjected to PCR amplification. Cluster generation and 36 cycles of sequencing were done according to the manufacturer's instructions. The sequence tags were mapped to the human genome (UCSC hg18 assembly based on NCBI build 36.1) or the mouse genome (UCSC mm9 assembly based on NCBI build 37) by means of the Solexa Analysis Pipeline (version 0.3.0). We obtained 34-bp sequenced reads excluding the first and last nucleotide. The raw sequence tags have been deposited in NCBI's Short Read Archive (SRA) under accession number GSE17554.</p>
         </sec>
         <sec>
            <st>
               <p>Profiling DNA methylation in human T cells and mouse liver</p>
            </st>
            <p>Affinity purification and Solexa sequencing of methylated DNA fragments were carried out for human na&#239;ve T cells purified from blood samples from healthy males and females and for liver tissue from normal mice (C57BL/6). To identify the actual positions of methylated DNA fragments, we extended the 34-bp reads toward the 3' end according to the size fractionation of the ligation products (200 bp on average). The output was converted to BED files for visualization in the UCSC genome browser. We counted overlapping sequence tags at a 150-bp resolution for consistency with the nucleosomal data. The read counts served as estimates of methylation level at their respective genomic intervals.</p>
         </sec>
         <sec>
            <st>
               <p>Obtaining Pol II density data and gene expression data for human T cells, and estimating elongation efficiency</p>
            </st>
            <p>The tag coordinate BED files for unphosphorylated Pol II and Ser5-phosphorylated Pol II in resting T cells were downloaded from <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. We counted overlapping sequence tags at a 150-bp resolution for consistency with the other data. The read counts served as estimates of Pol II density at their respective genomic intervals. Microarray data for gene expression level in resting T cells are available at the Gene Expression Omnibus (GEO) under accession number GSE10437 <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Efficiency of transcriptional elongation was estimated as mRNA production per unit density of elongating Pol II. The expression level of a gene was divided by the average density of Ser5-phosphorylated Pol II over its transcribed region (from the TSS to the termination site). An arbitrary value (0.01) was added to the denominator to avoid illegal division by zero. The ratio was log transformed (base <it>e</it>).</p>
         </sec>
         <sec>
            <st>
               <p>Genetic and epigenetic codes near the start codon and stop codon</p>
            </st>
            <p>For quantitative analysis of genetic or epigenetic codes near coding start and end sites, we used the DNA sequences flanking the coding ends or the 150-bp genomic intervals closest to the coding ends. Nucleosome level, methylation level, and CpG density were obtained from the two to four genomic intervals closest to the start codon or stop codon. CpG density within a genomic interval was obtained as described below. Bending propensity was calculated for the flanking sequences as described below.</p>
         </sec>
         <sec>
            <st>
               <p>Calculation of DNA bendability and CpG density</p>
            </st>
            <p>DNA-bending propensity was calculated based on trinucleotide parameters <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A bending parameter was assigned for each single-nucleotide position according to base composition and then averaged in a sliding window of 100 bp or within a 150-bp segment downstream of the start codon or upstream of the stop codon. The percentage of G and C nucleotides was obtained for the 150-bp intervals. The number of G and C nucleotides was divided by the total number of nucleotides in the segment (that is, 150 nucleotides). CpG density was calculated for the same 150-bp intervals that were used for nucleosomal level estimation and methylation level estimation by the ratio of observed to expected CpG frequencies according to the formula cited in Gardiner-Garden <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The genomic coordinates of CpG islands were downloaded from the CpG islands track at the UCSC genome browser, as predicted by the following criteria: GC content of 50% or greater, length greater than 200 bp, and a ratio greater than 0.6 of observed number of CpG dinucleotides to the expected number. A gene was deemed to contain a CpG island if the region -1,000 bp to 500 bp from the TSS contained one of the defined CpG islands.</p>
         </sec>
         <sec>
            <st>
               <p>Methylation data for <it>Arabidopsis thaliana </it>and <it>Oryza sativa</it></p>
            </st>
            <p>DNA methylation in the <it>Arabidopsis </it>genome was mapped for wild-type roots using tiling microarrays <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The data are available at the GEO under accession number GSE12212 (WT root-1 and WT root-2). DNA methylation of rice chromosomes 4 and 10 was mapped using tiling microarrays for cultured cells and light-grown shoots <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The data are available at the GEO under accession number GSE9925 (CC replicates 1 to 2 and LS replicates 1 to 2).</p>
         </sec>
         <sec>
            <st>
               <p>Statistical tests</p>
            </st>
            <p>The length of the 5' UTRs of selected genes was compared against all 5' UTRs in the genome by means of the Wilcoxon rank sum test. Gene expression level was tested by means of the two-sample <it>t</it>-test of log ratios or the Wilcoxon rank sum test. Genes with high or low nucleosome occupancy and high or low methylation level at coding boundaries are defined as the top or bottom 10% in each category.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GEO: Gene Expression Omnibus; GST: glutathione S-transferase; MIRA: methylated CpG island recovery assay; ORF: open reading frame; Pol II: RNA polymerase II; TSS: transcription start site; UTR: untranslated region.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JKC conceived of the study, carried out the analysis, and wrote the manuscript. JBB carried out the purification and sequencing of methylated DNA. JL processed the raw sequencing data and participated in data analysis. TYK participated in sample preparation. YJK participated in study design and coordination, and finalized the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: a figure showing nucleosome patterns surrounding the TSS, start codon, and stop codon in yeast and fly (Additional data file <supplr sid="S1">1</supplr>); a figure showing illustrative genes with nucleosomal peaks at coding boundaries (Additional data file <supplr sid="S2">2</supplr>); a figure showing DNA methylation level surrounding the transcript and coding region boundaries in the mouse liver (Additional data file <supplr sid="S3">3</supplr>); a figure showing nucleosome occupancy according to differential Pol II elongation efficiency (Additional data file <supplr sid="S4">4</supplr>); a figure comparing densities of Ser5-phosphorylated and unphosphorylated Pol II (Additional data file <supplr sid="S5">5</supplr>); a figure showing Pol II density with higher and lower nucleosome occupancy (Additional data file <supplr sid="S6">6</supplr>); a figure showing DNA bending propensity at the start and stop codons (Additional data file <supplr sid="S7">7</supplr>); a figure demonstrating the length of the 5' UTR and gene expression level for genes with high CpG density around the start codon (Additional data file <supplr sid="S8">8</supplr>); a figure showing the overall patterns of nucleosome occupancy and DNA methylation level inside the protein coding region (Additional data file <supplr sid="S9">9</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Nucleosome patterns surrounding the TSS, start codon, and stop codon in yeast and fly</p>
            </caption>
            <text>
               <p>Nucleosome patterns surrounding the TSS, start codon, and stop codon in yeast and fly.</p>
            </text>
            <file name="gb-2009-10-9-r89-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Illustrative genes with nucleosomal peaks at coding boundaries</p>
            </caption>
            <text>
               <p>Illustrative genes with nucleosomal peaks at coding boundaries.</p>
            </text>
            <file name="gb-2009-10-9-r89-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>DNA methylation level surrounding the transcript and coding region boundaries in the mouse liver</p>
            </caption>
            <text>
               <p>DNA methylation level surrounding the transcript and coding region boundaries in the mouse liver.</p>
            </text>
            <file name="gb-2009-10-9-r89-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Nucleosome occupancy according to differential Pol II elongation efficiency</p>
            </caption>
            <text>
               <p>Nucleosome occupancy according to differential Pol II elongation efficiency.</p>
            </text>
            <file name="gb-2009-10-9-r89-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Densities of Ser5-phosphorylated and unphosphorylated Pol II</p>
            </caption>
            <text>
               <p>Densities of Ser5-phosphorylated and unphosphorylated Pol II.</p>
            </text>
            <file name="gb-2009-10-9-r89-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>Pol II density with higher and lower nucleosome occupancy</p>
            </caption>
            <text>
               <p>Pol II density with higher and lower nucleosome occupancy.</p>
            </text>
            <file name="gb-2009-10-9-r89-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>DNA bending propensity at the start and stop codons</p>
            </caption>
            <text>
               <p>DNA bending propensity at the start and stop codons.</p>
            </text>
            <file name="gb-2009-10-9-r89-S7.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>Length of the 5' UTR and gene expression level for genes with high CpG density around the start codon</p>
            </caption>
            <text>
               <p>Length of the 5' UTR and gene expression level for genes with high CpG density around the start codon.</p>
            </text>
            <file name="gb-2009-10-9-r89-S8.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>Overall patterns of nucleosome occupancy and DNA methylation level inside the protein coding region</p>
            </caption>
            <text>
               <p>Overall patterns of nucleosome occupancy and DNA methylation level inside the protein coding region.</p>
            </text>
            <file name="gb-2009-10-9-r89-S9.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the Korea Science and Engineering Foundation (KOSEF) and the Korea Foundation for International Cooperation of Science and Technology (KICOS) through grants provided by the Korean Ministry of Education, Science and Technology (MEST) (M10750030001-08N5003-0011 and K20704000006-08A0500-00610).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Genome-wide high-resolution mapping and functional analysis of DNA methylation in <it>Arabidopsis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yazaki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sundaresan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cokus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>SW-L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Shinn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Ecker</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>126</volume>
            <fpage>1025</fpage>
            <lpage>1028</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.09.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">16990127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Genome-wide analysis of <it>Arabidopsis </it>thaliana DNA methylation uncovers an interdependence between methylation and transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Zilberman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gehring</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Ballinger</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>61</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1929</pubid>
                  <pubid idtype="pmpid" link="fulltext">17128275</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Shotgun bisulphite sequencing of the <it>Arabidopsis </it>genome reveals DNA methylation patterning.</p>
            </title>
            <aug>
               <au>
                  <snm>Cokus</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Merriman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Haudenschild</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Pradhan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>452</volume>
            <fpage>215</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06745</pubid>
                  <pubid idtype="pmcid">2377394</pubid>
                  <pubid idtype="pmpid" link="fulltext">18278030</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>High-resolution mapping of epigenetic modifications of the rice genome uncovers interplay between DNA methylation, histone methylation, and gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Terzaghi</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2008</pubdate>
            <volume>20</volume>
            <fpage>259</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1105/tpc.107.056879</pubid>
                  <pubid idtype="pmcid">2276441</pubid>
                  <pubid idtype="pmpid" link="fulltext">18263775</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>DNA methylation inhibits elongation but not initiation of transcription in <it>Neurospora crassa</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Rountree</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>1997</pubdate>
            <volume>11</volume>
            <fpage>2383</fpage>
            <lpage>2395</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.11.18.2383</pubid>
                  <pubid idtype="pmcid">316521</pubid>
                  <pubid idtype="pmpid" link="fulltext">9308966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Methylation induced premeiotically in <it>Ascobolus</it>: coextension with DNA repeat lengths and effect on transcript elongation.</p>
            </title>
            <aug>
               <au>
                  <snm>Barry</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Faugeron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rossignol</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1993</pubdate>
            <volume>90</volume>
            <fpage>4557</fpage>
            <lpage>4561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.90.10.4557</pubid>
                  <pubid idtype="pmcid">46551</pubid>
                  <pubid idtype="pmpid">8506299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Methylation of coding region alone inhibits gene expression in plant protoplasts.</p>
            </title>
            <aug>
               <au>
                  <snm>Hohn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Corsten</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rieke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rothnie</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>8334</fpage>
            <lpage>8339</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.93.16.8334</pubid>
                  <pubid idtype="pmcid">38671</pubid>
                  <pubid idtype="pmpid">8710871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Intragenic DNA methylation alters chromatin structure and elongation efficiency in mammalian cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Lorincz</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Dickerson</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Groudine</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <fpage>1068</fpage>
            <lpage>1075</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsmb840</pubid>
                  <pubid idtype="pmpid" link="fulltext">15467727</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Translational and rotational settings of H2A.Z nucleosomes across the <it>Saccharomyces cerevisiae </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Albert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mavrich</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Tomsho</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zanton</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Pugh</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>446</volume>
            <fpage>572</fpage>
            <lpage>576</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05632</pubid>
                  <pubid idtype="pmpid" link="fulltext">17392789</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Nucleosome organization in the <it>Drosophila </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Mavrich</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ioshikhes</snm>
                  <fnm>IP</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Venters</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Zanton</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Tomsho</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Glaser</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Gilmour</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pugh</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>453</volume>
            <fpage>358</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06929</pubid>
                  <pubid idtype="pmcid">2735122</pubid>
                  <pubid idtype="pmpid" link="fulltext">18408708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Nucleosome positions predicted through comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Ioshikhes</snm>
                  <fnm>IP</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zanton</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Pugh</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>1210</fpage>
            <lpage>1215</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1878</pubid>
                  <pubid idtype="pmpid" link="fulltext">16964265</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A barrier nucleosome model for statistical positioning of nucleosomes throughout the yeast genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Mavrich</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Ioshikhes</snm>
                  <fnm>IP</fnm>
               </au>
               <au>
                  <snm>Venters</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tomsho</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pugh</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>1073</fpage>
            <lpage>1083</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.078261.108</pubid>
                  <pubid idtype="pmcid">2493396</pubid>
                  <pubid idtype="pmpid" link="fulltext">18550805</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>DNA methylation, nucleosome formation and positioning.</p>
            </title>
            <aug>
               <au>
                  <snm>Pennings</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Allan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Davey</snm>
                  <fnm>CS</fnm>
               </au>
            </aug>
            <source>Brief Funct Genomic Proteomic</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>351</fpage>
            <lpage>361</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bfgp/3.4.351</pubid>
                  <pubid idtype="pmpid" link="fulltext">15814025</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Dynamic regulation of nucleosome positioning in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cuddapah</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Roh</snm>
                  <fnm>T-Y</fnm>
               </au>
               <au>
                  <snm>Barski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2008</pubdate>
            <volume>132</volume>
            <fpage>887</fpage>
            <lpage>898</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2008.02.022</pubid>
                  <pubid idtype="pmpid" link="fulltext">18329373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Transcription through chromatin by RNA polymerase II: histone displacement and exchange.</p>
            </title>
            <aug>
               <au>
                  <snm>Kulaeva</snm>
                  <fnm>OI</fnm>
               </au>
               <au>
                  <snm>Gaykalova</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Studitsky</snm>
                  <fnm>VM</fnm>
               </au>
            </aug>
            <source>Mutat Res</source>
            <pubdate>2007</pubdate>
            <volume>618</volume>
            <fpage>116</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1924643</pubid>
                  <pubid idtype="pmpid" link="fulltext">17313961</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A genomic code for nucleosome positioning.</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Fondufe-Mittendorf</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Thastrom</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Field</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JZ</fnm>
               </au>
               <au>
                  <snm>Widom</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>442</volume>
            <fpage>772</fpage>
            <lpage>778</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04979</pubid>
                  <pubid idtype="pmcid">2623244</pubid>
                  <pubid idtype="pmpid" link="fulltext">16862119</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Weber</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hellmann</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Ramos</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>P&#228;&#228;bo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rebhan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sch&#252;beler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>457</fpage>
            <lpage>466</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1990</pubid>
                  <pubid idtype="pmpid" link="fulltext">17334365</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>DNA structure in human RNA polymerase II promoters.</p>
            </title>
            <aug>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Baldi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chauvin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>281</volume>
            <fpage>663</fpage>
            <lpage>673</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1998.1972</pubid>
                  <pubid idtype="pmpid" link="fulltext">9710538</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The pattern and evolution of yeast promoter bendability.</p>
            </title>
            <aug>
               <au>
                  <snm>Tirosh</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Berman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barkai</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>318</fpage>
            <lpage>321</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2007.03.015</pubid>
                  <pubid idtype="pmpid" link="fulltext">17418911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Sequence-dependent bending propensity of DNA as revealed by DNase I: parameters for trinucleotides.</p>
            </title>
            <aug>
               <au>
                  <snm>Brukner</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>S&#225;nchez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Suck</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pongor</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1995</pubdate>
            <volume>14</volume>
            <fpage>1812</fpage>
            <lpage>1818</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">398274</pubid>
                  <pubid idtype="pmpid">7737131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The human SWI/SNF subunit Brm is a regulator of alternative splicing.</p>
            </title>
            <aug>
               <au>
                  <snm>Batsche</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yaniv</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muchardt</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>13</volume>
            <fpage>22</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsmb1030</pubid>
                  <pubid idtype="pmpid" link="fulltext">16341228</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Chromatin, transcript elongation and alternative splicing.</p>
            </title>
            <aug>
               <au>
                  <snm>Kornblihtt</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>13</volume>
            <fpage>5</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsmb0106-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">16395314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Differential chromatin marking of introns and expressed exons by H3K36me3.</p>
            </title>
            <aug>
               <au>
                  <snm>Kolasinska-Zwierz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Latorre</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Ahringer1</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2009</pubdate>
            <volume>41</volume>
            <fpage>376</fpage>
            <lpage>381</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.322</pubid>
                  <pubid idtype="pmcid">2648722</pubid>
                  <pubid idtype="pmpid" link="fulltext">19182803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Processing the H3K36me3 signature.</p>
            </title>
            <aug>
               <au>
                  <snm>Sims</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2009</pubdate>
            <volume>41</volume>
            <fpage>270</fpage>
            <lpage>271</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0309-270</pubid>
                  <pubid idtype="pmpid" link="fulltext">19240748</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>UCSC Genome Browser</p>
            </title>
            <url>http://genome.ucsc.edu/</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p><it>Saccharomyces </it>Genome Database</p>
            </title>
            <url>http://www.yeastgenome.org/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A large-scale full-length cDNA analysis to explore the budding yeast transcriptome.</p>
            </title>
            <aug>
               <au>
                  <snm>Miura</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kawaguchi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sese</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Toyoda</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Morishita</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>17846</fpage>
            <lpage>17851</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0605645103</pubid>
                  <pubid idtype="pmcid">1693835</pubid>
                  <pubid idtype="pmpid" link="fulltext">17101987</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Nucleosome Maps of the <it>Saccharomyces </it>Genome</p>
            </title>
            <url>http://atlas.bx.psu.edu/yeast-maps/yeast-index.html</url>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Nucleosome Maps of the <it>Drosophila </it>Genome</p>
            </title>
            <url>http://atlas.bx.psu.edu/dmel-maps/dmel-index.html</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Nucleosome Data for the Human Genome</p>
            </title>
            <url>http://dir.nhlbi.nih.gov/papers/lmi/epigenomes/hgtcellnucleosomes.html</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>MIRA-assisted microarray analysis, a new technology for the determination of DNA methylation patterns, identifies frequent methylation of homeodomain-containing genes in lung cancer cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Rauch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Pfeifer</snm>
                  <fnm>GP</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>2006</pubdate>
            <volume>66</volume>
            <fpage>7939</fpage>
            <lpage>7947</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1158/0008-5472.CAN-06-1888</pubid>
                  <pubid idtype="pmpid" link="fulltext">16912168</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Homeobox gene methylation in lung cancer studied by genome-wide analysis with a microarray-based methylated CpG island recovery assay.</p>
            </title>
            <aug>
               <au>
                  <snm>Rauch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhong</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Kernstine</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Riggs</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Pfeifer</snm>
                  <fnm>GP</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>5527</fpage>
            <lpage>5532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0701059104</pubid>
                  <pubid idtype="pmcid">1838508</pubid>
                  <pubid idtype="pmpid" link="fulltext">17369352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>CpG islands in vertebrate genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gardiner-Garden</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Frommer</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1987</pubdate>
            <volume>196</volume>
            <fpage>261</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(87)90689-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">3656447</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Histone H2A.Z and DNA methylation are mutually antagonistic chromatin marks.</p>
            </title>
            <aug>
               <au>
                  <snm>Zilberman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Coleman-Derr</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ballinger</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>456</volume>
            <fpage>125</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07324</pubid>
                  <pubid idtype="pmpid" link="fulltext">18815594</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
