<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2010-11-4-r42</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Monovalent and unpoised status of most genes in undifferentiated cell-enriched <it>Drosophila </it>testis</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Gan</snm>
               <fnm>Qiang</fnm>
               <insr iid="I1"/>
               <email>qgan2@jhu.edu</email>
            </au>
            <au id="A2">
               <snm>Schones</snm>
               <mi>E</mi>
               <fnm>Dustin</fnm>
               <insr iid="I2"/>
               <email>schonesde@nhlbi.nih.gov</email>
            </au>
            <au id="A3">
               <snm>Ho Eun</snm>
               <fnm>Suk</fnm>
               <insr iid="I1"/>
               <email>eun7615@jhu.edu</email>
            </au>
            <au id="A4">
               <snm>Wei</snm>
               <fnm>Gang</fnm>
               <insr iid="I2"/>
               <email>weig2@mail.nih.gov</email>
            </au>
            <au id="A5">
               <snm>Cui</snm>
               <fnm>Kairong</fnm>
               <insr iid="I2"/>
               <email>cuik@mail.nih.gov</email>
            </au>
            <au id="A6">
               <snm>Zhao</snm>
               <fnm>Keji</fnm>
               <insr iid="I2"/>
               <email>zhaok@nhlbi.nih.gov</email>
            </au>
            <au ca="yes" id="A7">
               <snm>Chen</snm>
               <fnm>Xin</fnm>
               <insr iid="I1"/>
               <email>xchen32@jhu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biology, The Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA</p>
            </ins>
            <ins id="I2">
               <p>Laboratory of Molecular Immunology, National Heart, Lung and Blood Institute, National Institutes of Health, 10 Center Drive, Building 10, Bethesda, MD 20892, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2010</pubdate>
         <volume>11</volume>
         <issue>4</issue>
         <fpage>R42</fpage>
         <url>http://genomebiology.com/2010/11/4/R42</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">20398323</pubid>
               <pubid idtype="doi">10.1186/gb-2010-11-4-r42</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>18</day>
               <month>11</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>26</day>
               <month>2</month>
               <year>2010</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>15</day>
               <month>4</month>
               <year>2010</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>4</month>
               <year>2010</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2010</year>
         <collab>Gan et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Chromatin in undifferentiated cells</p>
      </shorttitle>
      <shortabs>
         <p>In undifferentiated Drosophila cells, differentiation-associated genes have monovalent, not bivalent histone modifications, in contrast to differentiation-associated genes in stem cells.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Increasing evidence demonstrates that stem cells maintain their identities by a unique transcription network and chromatin structure. Opposing epigenetic modifications H3K27me3 and H3K4me3 have been proposed to label differentiation-associated genes in stem cells, progenitor and precursor cells. In addition, many differentiation-associated genes are maintained at a poised status by recruitment of the initiative RNA Polymerase II (Pol II) at their promoter regions, in preparation for lineage-specific expression upon differentiation. Previous studies have been performed using cultured mammalian embryonic stem cells. To a lesser extent, chromatin structure has been delineated in other model organisms, such as <it>Drosophila</it>, to open new avenues for genetic analyses.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we use testes isolated from a <it>Drosophila bag of marbles </it>mutant strain, from which germ cells are in their undifferentiated status. We use these testes to study the endogenous chromatin structure of undifferentiated cells using ChIP-seq. We integrate the ChIP-seq with RNA-seq data, which measures the digital transcriptome. Our genome-wide analyses indicate that most differentiation-associated genes in undifferentiated cells lack an active chromatin mark and initiative Pol II; instead, they are associated with either the repressive H3K27me3 mark or no detectable mark.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Our results reveal that most of the differentiation-associated genes in undifferentiated-cell-enriched <it>Drosophila </it>testes are associated with monovalent but not bivalent modifications, a chromatin signature that is distinct from the data reported in mammalian stem or precursor cells, which may reflect cell type specificity, species specificity, or both.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="30010005" subtype="man_spc_id" type="BMC">Development</classification>
         <classification id="300100010" subtype="man_spc_id" type="BMC">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Extensive studies indicate that embryonic stem cells (ESCs), lineage-committed adult stem cells and early progenitor cells maintain their identities by a unique transcriptional network and chromatin structure (reviewed by <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>). In particular, the bivalent domains harboring both the active H3K4me3 and repressive H3K27me3 marks label developmental regulators <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The H3K4me3 and H3K27me3 marks are placed by the Trithorax group (TrxG) complex <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> and the Polycomb group (PcG) complex <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, respectively. Increasing evidence indicates that the PcG and the TrxG complexes play critical roles in the choice between the proliferating progenitor cell state and terminal differentiation program <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B9">9</abbr></abbrgrp>. It has been reported that bivalent genes in ESCs or early progenitor cells are bound by PcG proteins and are maintained at a 'poised' status by recruitment of RNA Polymerase II (Pol II), in preparation for lineage-specific expression upon differentiation <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. In various stem cell lineages, reversal of repression by the PcG silencing machinery may act as the first step toward robust activation of terminal differentiation genes <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>The <it>Drosophila </it>male germline stem cell (GSC) lineage is a naturally existing adult stem cell system and has emerged as an excellent system for studying the molecular mechanisms that control stem cell maintenance versus differentiation <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Each GSC divides asymmetrically to self-renew and give rise to a gonialblast, the daughter cell that first undergoes a transit-amplifying spermatogonial stage before converting to differentiating spermatocytes <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The maintenance of GSCs and spermatogonia in an undifferentiated and proliferative state, as well as the subsequent reversal of these controls to allow terminal differentiation, are both critical to continuous production of gametes throughout lifetime. Despite extensive genetic studies on maintenance of GSCs, it is unclear how epigenetic mechanisms may establish and maintain a unique chromatin structure for their undifferentiated status; and how mis-regulation of such a structure may lead to mis-determination of their fate <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>Previous studies in this system have shown that PcG transcriptional silencing proteins repress the genes required for terminal differentiation in undifferentiated germ cells. Developmental programs reverse Polycomb silencing and activate the expression of differentiation genes in spermatocytes <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. This work uncovered an intriguing parallel between <it>Drosophila </it>GSC and ESC lineages, because PcG proteins play an extensive role in keeping developmental regulators at a silent status in murine and human ESCs <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. To investigate whether other features in mammalian ESCs apply to <it>Drosophila</it>, we studied the chromatin structure in the undifferentiated-cell-enriched <it>Drosophila </it>testis. Our results revealed two distinct features in this tissue: a monovalent chromatin signature and lack of paused RNA Pol II at the differentiation genes. Both features are different from what have been reported for ESCs and other mammalian adult stem cells, suggesting a potential novel mechanism of regulating the germ cell differentiation program in <it>Drosophila </it>testis.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Summary of the ChIP-seq results in undifferentiated-cell-enriched <it>Drosophila </it>testis</p>
            </st>
            <p>Since it is unfeasible to obtain a sufficient number of naturally existing GSCs for epigenomic mapping using current chromatin immunoprecipitation (ChIP) techniques, we took advantage of the <it>bag of marbles </it>(<it>bam</it>) mutant strain. Mutations in the <it>bam </it>gene inhibit the transition from spermatogonial progenitor stage to differentiating spermatocytes, which results in an accumulation of undifferentiated cells, including GSCs, transit-amplifying spermatogonial cells, as well as somatic cells <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Although the <it>bam </it>testes are not a pure source of GSCs, they are enriched with undifferentiated germ cells and have been used for transcription profiling by microarray <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> or RNA-seq <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> to search for undifferentiated-cell-enriched genes. These analyses further confirmed the undifferentiated status of cells in <it>bam </it>testes, in which the expression of meiotic and terminal differentiation genes was extremely low or undetectable.</p>
            <p>To investigate the epigenetic mechanisms that regulate the male germline cellular differentiation program, we used <it>bam </it>testes for chromatin landscape mapping using ChIP followed by high-throughput sequencing (ChIP-seq). The ChIP-seq technique has been demonstrated to be a highly sensitive method to detect binding sites of chromatin-associated proteins at a genome-wide coverage (reviewed by <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>). More importantly, this technique is compatible with small amounts of starting material, such as the hand-dissected fly testes in our experiments. To compare our data with the previous results using mammalian cells, we used the same set of antibodies, including antibodies against RNA Pol II, H3K4me3, H3K36me3, H3K27me3 and unmodified histone H3, to perform ChIP-seq experiments using <it>bam </it>testes (Materials and methods; Additional file <supplr sid="S1">1</supplr>). The anti-Pol II antibody (4H8) used in our assays recognizes both the initiating and elongating forms of Pol II according to published work <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B24">24</abbr></abbrgrp>. Sequencing depth analysis indicated that we have reached or almost reached the plateau of peak detection for each of these histone modifications as well as Pol II (Materials and methods; Additional file <supplr sid="S2">2</supplr>a-d).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>
                     <b>Summary of unique and non-redundant reads in each sample.</b>
                  </p>
               </text>
               <file name="gb-2010-11-4-r42-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Sequencing depth of ChIP-seq using various antibodies in both <it>bam </it>testis and S2 cell samples</b>. Sequencing depth is analyzed by calculating the enriched regions for each subsample of the corresponding ChIP-seq experiment. The blue line represents the trend line. The x-axis of the plot indicates the percentage of subsample reads compared to the total unique reads, whereas the y-axis indicates the identified islands.</p>
               </text>
               <file name="gb-2010-11-4-r42-S2.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To validate our ChIP-seq data, we compared the relationship between gene expression level and enrichment of RNA Pol II as well as distinct histone modifications. The <it>Drosophila </it>genome is highly compact, including many overlapping genes and genes with transcriptional start sites (TSSs) within a short distance of one another, complicating interpretations of ChIP-seq results. To avoid this, we classified 9,459 annotated genes that are applicable for ChIP-seq analysis into four groups (Materials and methods) according to their expression levels determined by RNA-seq in RPKM values <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp> (Figure <figr fid="F1">1a</figr>; RPKM is sequencing reads per kilobase of exon per million mapped reads <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>). Our data demonstrated that both RNA Pol II and H3K4me3 were highly elevated near the TSSs of annotated genes. They also positively correlated with gene expression level (Figure <figr fid="F1">1b, c</figr>). And H3K36me3 was enriched downstream of the TSSs and also positively correlated with expression level (Figure <figr fid="F1">1d</figr>). In contrast, H3K27me3 was negatively correlated with gene expression, as expected for a repressive mark (Figure <figr fid="F1">1e</figr>). These results are consistent with observations in human T cells <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and mammalian ESCs <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Summary of the ChIP-seq results using <it>bam </it>testis</p>
               </caption>
               <text>
                  <p><b>Summary of the ChIP-seq results using <it>bam </it>testis</b>. <b>(a) </b>The four groups of genes were classified according to their RPKM value based on the RNA-seq results <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The numbers in brackets denote genes used for H3K36me3 (K36) analysis. *See Materials and methods for gene selection criteria. Antibodies used for ChIP-seq were: <b>(b) </b>anti-RNA Pol II (Pol II); <b>(c) </b>anti-H3K4me3 (K4); <b>(d) </b>anti-H3K36me3 (K36); and <b>(e) </b>anti-H3K27me3 (K27). Enrichment of each histone modification and RNA Pol II is plotted over a -3-kb to +3-kb region with respect (w.r.t.) to genes' TSSs, except for K36, for which a -5-kb to +5-kb region is used.</p>
               </text>
               <graphic file="gb-2010-11-4-r42-1" hint_layout="double"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Monovalent chromatin signature is prevalent in undifferentiated-cell-enriched <it>bam </it>testis</p>
            </st>
            <p>The coexistence of active H3K4me3 and repressive H3K27me3 marks is detected in a large number of developmental regulator genes in mammalian ESCs <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp> and adult stem/progenitor cells <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. These 'bivalent' modifications are proposed to poise genes for rapid induction during cellular differentiation to become different cell lineages. However, the mechanisms that contribute to the bivalency of genes may be diverse under different circumstances, and still remain unsolved at the structural level <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <p>To test whether bivalency also applies to the undifferentiated cells of <it>Drosophila </it>testis, we first examined several critical genes that regulate cellular differentiation in the GSC lineage. The <it>Enhancer of Zeste </it>(<it>E(z)</it>) gene encodes a PcG complex component that is highly expressed in undifferentiated cells, but abruptly down-regulated upon differentiation initiation in spermatocytes. The co-existence of H3K27me3 and H3K4me3 was not observed at the <it>E(z) </it>gene locus near the TSS. Instead, we found only the active H3K4me3 mark at the <it>E(z) </it>gene locus, enriched near the TSS (Materials and methods; Figure <figr fid="F2">2a, b</figr>). The <it>spermatocyte arrest </it>(<it>sa</it>) gene, which encodes a testis-specific homolog of TBP associated factor (tTAF), is turned on in spermatocytes. The <it>tTAF </it>gene acts to antagonize PcG repression and regulate a cell-type-specific transcription program for terminal differentiation <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B34">34</abbr></abbrgrp> (Figure <figr fid="F2">2b</figr>). In contrast to the <it>E(z) </it>gene, we found that the chromatin of the <it>sa </it>gene locus was only associated with the repressive H3K27me3 mark near its TSS in <it>bam </it>testis (Figure <figr fid="F2">2c</figr>). More examples of the histone modification patterns of differentially expressed genes are shown in Additional file <supplr sid="S3">3</supplr>. These data suggest that developmental regulator genes in the germline lineage may be associated with a 'monovalent' modification (either H3K4me3 or H3K27me3 mark) in undifferentiated cells of <it>bam </it>testis.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Monovalent chromatin signature is prevalent in undifferentiated cells of <it>bam </it>testis</p>
               </caption>
               <text>
                  <p><b>Monovalent chromatin signature is prevalent in undifferentiated cells of <it>bam </it>testis</b>. <b>(a) </b>UCSC genome browser screenshot showing the H3K4me3 monovalency at the <it>E(z) </it>gene locus in undifferentiated cells. The read counts are labeled on the y-axis and the genomic region used to calculate the enrichment of modified histones (0 to +500 bp with respect to the TSS) is shaded in grey. <b>(b) </b>Transcription level of two representative monovalent genes, <it>E(z) </it>and <it>sa</it>, in <it>bam </it>and <it>wild-type </it>(<it>wt</it>) testis, respectively. According to the RNA-seq data, the RPKM for <it>E(z) </it>mRNA is four-fold higher in <it>bam </it>testis, and that for <it>sa </it>mRNA is 113-fold higher in wild-typetestis. <b>(c) </b>UCSC genome browser snapshot showing the H3K27me3 monovalency at the <it>sa </it>gene locus in undifferentiated cells. The read counts are labeled on the y-axis and the genomic region used to calculate the enrichment of modified histones (0 to +500 bp with respect to the TSS) is shaded in grey. <b>(d) </b>Scatter plot for H3K4me3 and H3K27me3 enrichment of all annotated genes. A common window (0 to +500 bp with respect to the TSS) was used to calculate H3K4me3 and H3K27me3 reads. The blue dashed lines indicate the statistical cutoff line - <it>P </it>&lt; 0.05 in a 1-kb window per 1 million total reads. Gene numbers in each quadrant are labeled in black. Differentiation genes are defined as those with RPKM in <it>bam </it>testis &lt; 0.5 and RPKM in wild-type testis &#8805; 1 and are labeled as red dots. Among the 1,894 differentiation genes, 1,304 genes are applicable for ChIP-seq analysis (Materials and methods). Up-regulated genes are defined as those with RPKM in wild-typetestis/<it>bam </it>testis &#8805; 2 (if the RPKM in <it>bam </it>testis &lt; 0.5, the value is raised to 0.5) and are labeled as green inverted triangles. Among the 3,377 up-regulated genes, 2,188 genes are applicable for ChIP-seq analysis (Materials and methods).</p>
               </text>
               <graphic file="gb-2010-11-4-r42-2" hint_layout="double"/>
            </fig>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>More examples of the monovalent genes in <it>bam </it>testis</b>. <b>(a) </b>Transcription levels of two representative H3K4me3-monovalent genes, <it>Decondensation factor 31 </it>(<it>Df31</it>) and <it>Rpd3</it>, and two representative H3K27me3-monovalent genes, <it>don juan </it>(<it>dj</it>) and <it>fuzzy onion </it>(<it>fzo</it>), in <it>bam </it>and wild-typetestis, respectively. <b>(b-e) </b>UCSC genome browser screenshot showing the active H3K4me3 monovalency at (b) <it>Df31 </it>and (c) <it>Rpd3 </it>genes, as well as the repressive H3K27me3 monovalency at (d) <it>dj </it>and (e) <it>fzo </it>genes, respectively. These examples are chosen based on their expression profiles in <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
               </text>
               <file name="gb-2010-11-4-r42-S3.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To test the prevalence of monovalency at a genome-wide scale, we analyzed all 9,459 annotated genes that are applicable for ChIP-seq analysis (Materials and methods). We identified 3,360 genes that were enriched with a significant level of active H3K4me3 and 1,631 genes enriched significantly with repressive H3K27me3 (<it>P </it>&lt; 0.05; Materials and methods). Surprisingly, only 91 genes were associated with both H3K4me3 and H3K27me3 marks (Figure <figr fid="F2">2d</figr>).</p>
            <p>Although we could validate several bivalent genes using independent ChIP followed by real-time PCR analysis (Additional file <supplr sid="S4">4</supplr>), we found that, different from the silent status of bivalent genes in ESCs <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, 56% of the bivalent genes in <it>bam </it>testis were actively expressed (Additional file <supplr sid="S5">5</supplr>; the RPKM cutoff for expressed genes is based on Additional file <supplr sid="S6">6</supplr>), indicating that the bivalency may be a result of temporally and/or spatially regulated gene expression, as has been reported in <it>Xenopus </it>embryos <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. One particular example is the <it>benign gonial cell neoplasm </it>(<it>bgcn</it>) gene (Additional file <supplr sid="S4">4</supplr>b), whose product acts with the Bam protein in the transition of germ cells from the proliferative to the differentiating stage <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. We reasoned that the bivalency at the <it>bgcn </it>gene may have resulted from mixed cell types in <it>bam </it>testes. The active status of <it>bgcn </it>in spermatogonial cells is likely associated with the enriched active H3K4me3 mark, whereas its repressive status in GSCs and somatic cells may contribute to the detection of the repressive H3K27me3 mark. Consistent with this, we found that none of the 91 putative bivalent genes we identified in <it>bam </it>testis retained their bivalency in the cultured <it>Drosophila </it>cells, including S2, BG3 and D23 cell lines <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> (Additional files <supplr sid="S5">5</supplr> and <supplr sid="S7">7</supplr>; Materials and methods). Although the culture cells were not synchronized, they were more pure than tissues with mixed types and staged cells. In addition, we performed Gene Ontology analysis to search for significantly enriched gene categories in both molecular function and biological process terms (Materials and methods). Surprisingly the only category that is significantly enriched (<it>P </it>&lt; 0.01) for these 91 putative bivalent genes is the 'multicellular organismal process' category, further suggesting that the apparent bivalency may come from different cells and is not specifically involved in cellular differentiation during spermatogenesis.</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Validation of bivalent genes using ChIP followed by real-time PCR analyses on five gene loci</b>. <b>(a) </b><it>Rpl7A </it>gene is used as a positive control for the active H3K4me3 enrichment and <it>don juan </it>(<it>dj</it>) is used as a positive control for the repressive H3K27me3 mark. All five bivalent genes have H3K4me3 and H3K27me3 at levels between the two controls. The y-axis denotes the ChIP-ed DNA/input DNA percentage. Primer pairs used for PCR validation are shown as black boxes in each panel of (b-f). Primer sequence information is in Materials and methods. <b>(b-f) </b>UCSC genome browser screenshots showing H3K4me3 and H3K27me3 enrichment at each of the following genes' loci: (b) <it>bgcn</it>; (c) <it>Neurochondrin</it>; (d) <it>CG14834</it>; (e) <it>NPC2</it>; and (f) <it>retn</it>.</p>
               </text>
               <file name="gb-2010-11-4-r42-S4.jpeg">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Bivalent genes in undifferentiated cells in <it>bam </it>testis</b>. In total, 91 bivalent genes were identified in undifferentiated cells in <it>bam </it>testis, including 18 differentiation genes (RPKM in <it>bam </it>&lt; 0.5, in wild-type &#8805; 1), which are highlighted in red.</p>
               </text>
               <file name="gb-2010-11-4-r42-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>Determination of the cutoff RPKM value for expressed genes by comparing microarray and RNA-seq data</b>. The histogram shows the distribution of RPKMs for genes that are unambiguously called 'Present' in all three replicates from the gene expression microarray data. The RPKM values are adjusted by adding a pseudo-count of 1 prior to the logarithm. Threshold RPKM values of 0.5 and 1 are shown as dashed lines in blue and green, respectively. Approximately 99% of genes that are considered unambiguously 'Present' (3P calls) in the microarray data have an RPKM &#8805; 1. Therefore, we assigned RPKM &#8805; 1 as the cutoff for expressed genes. Conversely, only 0.4% of these genes have an RPKM &lt; 0.5, and we assigned RPKM &lt; 0.5 as the cutoff for absent or silent genes.</p>
               </text>
               <file name="gb-2010-11-4-r42-S6.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p><b>Bivalent genes in <it>Drosophila </it>S2 cells</b>. In total, 27 bivalent genes were identified in <it>Drosophila </it>S2 cells.</p>
               </text>
               <file name="gb-2010-11-4-r42-S7.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To test the prevalence of monovalency for all differentiation genes in testis, we compared the expression profile of fully differentiated wild-type testis with undifferentiated <it>bam </it>testis using the RNA-seq data <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. We found 1,894 genes that were silent in the undifferentiated-cell-enriched <it>bam </it>testis (RPKM &lt; 0.5) but were turned on in the differentiated cells from wild-type testis (RPKM &#8805; 1). We defined these genes as differentiation genes (the RPKM cutoffs are based on Additional file <supplr sid="S6">6</supplr>). Among these 1,894 differentiation genes, 1,304 are applicable for ChIP-seq analysis (Materials and methods). Examination of the histone modifications on these 1,304 genes revealed that only 18 genes were associated with both active H3K4me3 and repressive H3K27me3 in <it>bam </it>testis (Figure <figr fid="F2">2d</figr>; Additional file <supplr sid="S5">5</supplr>), whereas 448 were associated with only H3K27me3, 48 were associated with only H4K4me3 and 790 were not associated with either of these modifications. We also checked all genes that are up-regulated at least two-fold in wild-type testis, regardless of their expression level in <it>bam </it>testis (RPKM in wild-type testis/<it>bam </it>testis &#8805; 2). We found 3,377 genes that fall into this category, including all differentiation genes. Among these 3,377 up-regulated genes, 2,188 are applicable for ChIP-seq analysis. We analyzed the H3K27me3 and H3K4me3 enrichment of these genes and found only 25 genes enriched with both histone marks (Figure <figr fid="F2">2d</figr>). These results indicate that most differentiation genes in undifferentiated-cell-enriched testis are marked by either a monovalent chromatin signature or no modification (H3K4me3 or H3K27me3), which is very different from the modification patterns in mammalian ESCs <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> or progenitor cells <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Consistent with our findings, previous studies demonstrated the paucity of bivalent domains in fly embryos, which contain progenitor cells mainly for somatic tissues <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. In summary, our results reveal that monovalent modification is a prevalent chromatin signature of differentiation genes in undifferentiated-cell-enriched <it>Drosophila </it>testis.</p>
         </sec>
         <sec>
            <st>
               <p>Most genes are unpoised in the undifferentiated-cell-enriched <it>bam </it>testis</p>
            </st>
            <p>For the purpose of discussion here, we use 'poised' to describe the prepared status of a gene for transcription <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B24">24</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>, which is associated with the promoter proximal binding of RNA Pol II at a 'paused' status <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp> and/or with active histone modification marks <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B32">32</abbr></abbrgrp>. Previous studies in mammalian ESCs <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and <it>Drosophila </it>embryos <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> have suggested that many differentiation genes are unexpressed yet have paused RNA Pol II associated with their promoters, in order to stay at a poised state ready for robust transcription upon developmental stimuli. For example, more than 13% of the ectodermal differentiation genes were repressed but have paused RNA Pol II in the <it>Toll</it><sup><it>10b </it></sup>mutant <it>Drosophila </it>embryos, in which all cells are transformed to the mesodermal fate <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Here we examined whether paused Pol II is also prevalent in the undifferentiated-cell-enriched male gonads. Surprisingly, we found only 63 (4.8%) of the 1,304 differentiation genes were bound by Pol II (<it>P </it>&lt; 0.05) in undifferentiated-cell-enriched <it>bam </it>testis (Figure <figr fid="F3">3a</figr>). These data indicate that most genes that are silent in undifferentiated cells (RPKM in <it>bam </it>testis &lt;0.5) but turned on upon differentiation (RPKM in wild-type testis &#8805; 1) remained at an unpoised status in undifferentiated cells.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Most genes are unpoised in undifferentiated cells of <it>bam </it>testis</p>
               </caption>
               <text>
                  <p><b>Most genes are unpoised in undifferentiated cells of <it>bam </it>testis</b>. <b>(a) </b>Scatter plot for H3K4me3 and RNA Pol II enrichment of all annotated genes. All labels and the definition of up-regulated genes are the same as in Figure 2d, except that a -250 to +250-bp window with respect to the TSS was used to calculate reads for RNA Pol II. <b>(b) </b>Scatter plot for RNA Pol II enrichment and RPKM values for all annotated genes. All labels are the same as in Figure 2d, except that RPKM &#8805; 1 is used as a cutoff for expressed genes. <b>(c) </b>RPKM values for genes that have three absent calls (3A) in the microarray. <b>(d) </b>Analysis of the RPKM values of transcripts demonstrates that genes with stalled Pol II are weakly transcribed genes. Genes that show an active Pol II profile are expressed at significantly higher levels. Genes with no Pol II profile are expressed at significantly lower levels. The box represents the 25th and 75th percentiles, with the 50th percentile as a black bar. The whiskers refer to outliers that are at least 1.5&#215; the interquartile range from the box. The y-axis represents the RPKM value.</p>
               </text>
               <graphic file="gb-2010-11-4-r42-3" hint_layout="double"/>
            </fig>
            <p>On the other hand, analyses of the expression levels of all genes with significant Pol II binding (<it>P </it>&lt; 0.05) demonstrated that approximately 95% of genes (3,796 genes) that are enriched with Pol II (4,008 genes) were actively expressed (RPKM in <it>bam </it>testis &#8805; 1; Figure <figr fid="F3">3b</figr>). Only 5% of Pol II-enriched genes (212 genes) were expressed at very low to undetectable levels (RPKM &lt;1), which may be candidate poised genes in this cell lineage but only comprised 2% of all genes analyzed (9,459 genes). Interestingly, we found that a subset of these potentially poised genes (43%; 92 genes in Additional file <supplr sid="S4">4</supplr>b) are up-regulated upon differentiation (RPKM in wild-type testis/<it>bam </it>testis &#8805; 2), with the significantly (<it>P </it>&lt; 0.01) enriched ontology category 'transcription regulator activity'. In contrast, 75 genes (35%) remain silent throughout spermatogenesis (RPKM &lt;0.5 in both wild-type and <it>bam </it>testis). Consistent with their low or undetectable expression (RPKM &lt;1) in <it>bam </it>testis, we found that only 28 out of the 212 (13.2%) potentially poised genes are enriched with H3K36me3, a transcription elongation epigenetic mark <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Furthermore, we compared these 92 putative poised genes with the 91 potential bivalent genes in <it>bam </it>testis. Surprisingly, only two genes (<it>CG8517 </it>and <it>CG13611</it>) overlapped. These data demonstrate that different from what has been reported in mammalian ESC lineages <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, most differentiation genes stay at an unpoised status before exposure to the developmental signals in the undifferentiated-cell-enriched <it>Drosophila bam </it>testis.</p>
            <p>In addition to RNA Pol II, H3K4me3 is another well-established mark for active chromatin status, and is associated with many unexpressed but poised genes in ESCs <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. In our analysis, we found 95.4% of all H3K4me3-enriched genes (3,360 genes) are actively expressed (3,204 genes in Additional file <supplr sid="S8">8</supplr>). This result further confirmed that most genes in undifferentiated-cell-enriched <it>Drosophila </it>testis are either actively expressed or silent, without an intermediate poised status.</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p><b>Most genes enriched with H3K4me3 are actively expressed in <it>bam </it>testis</b>. Scatter plot for H3K4me3 enrichment and RPKM values for all annotated genes. All labels are the same as in Figure <figr fid="F2">2d</figr>, except RPKM = 1 is used as a cutoff for expressed genes.</p>
               </text>
               <file name="gb-2010-11-4-r42-S8.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Direct comparison of RNA-seq and microarray data reveals genes with low expression levels may contribute to the paused Pol II phenomenon</p>
            </st>
            <p>One explanation for the difference in our results from previous publications is that previous analysis of gene expression mainly relied on microarray studies, which may not be sensitive enough to distinguish genes with low expression levels from genes that are absolutely silent <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Indeed, when we compared the gene expression profiles in <it>bam </it>testis determined by microarray versus RNA-seq techniques, we found that 670 genes considered as silent based on microarray analysis (three 'absent' calls in all three biological replicates) were actually expressed according to their RPKM values in the RNA-seq data set (RPKM &#8805; 1; Figure <figr fid="F3">3c</figr>). Approximately 24.8% of these genes (107 of the 432 genes that are applicable for ChIP-seq analysis among all 670 genes) had significant enrichment of Pol II (<it>P </it>&lt; 0.05) at their promoter regions. Therefore, our results suggest that some of the 'poised' genes may actually be expressed but below the detection threshold in microarray analysis.</p>
            <p>Since the Pol II antibody we used in our ChIP-seq experiment recognizes both the initiating and elongating (Ser5 phosphorylated) forms <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B24">24</abbr></abbrgrp>, we used the Pol II ChIP-seq data to calculate the stalling index for each gene (<abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and Materials and methods). Using this assay, we identified 695 genes that have paused Pol II at their promoter regions in <it>bam </it>testis (Additional file <supplr sid="S9">9</supplr>a; Materials and methods). Ontology analysis revealed significant functional categories (<it>P </it>&lt; 0.01) for these genes (Additional file <supplr sid="S9">9</supplr>b). However, approximately 93.0% (647 out of 695) of these genes were expressed (RPKM &#8805; 1), albeit at relatively low levels (Figure <figr fid="F3">3d</figr>).</p>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p><b>Comparison of our ChIP-seq data with published ChIP-chip data. (a) </b>Flow chart to compare our ChIP-seq data in <it>bam </it>testis and S2 cells with published ChIP-chip data in S2 cells <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and embryos <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. <b>(b-d) </b>Ontology analysis of (b) the 695 genes with stalled Pol II in <it>bam </it>testis, (c) the 1,821 genes with stalled Pol II in S2 cells, and (d) the 1,014 genes with promoter-proximal enrichment of Pol II <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
               </text>
               <file name="gb-2010-11-4-r42-S9.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Do different techniques contribute to the different features of the chromatin landscape?</p>
            </st>
            <p>There are two major differences between our work and published studies in <it>Drosophila </it>or mammalian cells: first, we used direct sequencing-based techniques (ChIP-seq and RNA-seq) instead of hybridization-based techniques (ChIP-chip and microarray); and second, we used a tissue that is enriched with undifferentiated germ cells instead of embryos <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, embryo-derived cell lines <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, or mammalian cells <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, which mainly comprise cells with somatic fate. In order to make a more direct comparison to address whether the difference is due to distinct techniques or different cell types, we performed ChIP-seq and RNA-seq analyses using the same cell type - <it>Drosophila </it>S2 cells - that was used in a previous study <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
            <p>We analyzed the chromatin status in <it>Drosophila </it>S2 cells using antibodies against RNA Pol II, H3K4me3, H3K36me3 and H3K27me3 (Additional file <supplr sid="S10">10</supplr>; Materials and methods). Sequencing depth analysis indicated that most of these assays almost reached (Additional file <supplr sid="S2">2</supplr>e,g) or reached (Additional file <supplr sid="S2">2</supplr>f,h) the detection saturation plateau. For libraries that are not completely saturated, the strongest peaks enriched with the corresponding modified histones or Pol II should be detected first and are therefore identified by our assays. Through analyzing the ChIP-seq data, we identified 3,742 genes with enriched active H3K4me3 and 2,095 genes with enriched repressive H3K27me3 in S2 cells (<it>P </it>&lt; 0.05; Materials and methods). Among them, we found only 27 genes enriched in both H3K4me3 and H3K27me3 (Figure <figr fid="F4">4a</figr>; Additional file <supplr sid="S7">7</supplr>), indicating that bivalency is also not a prevalent chromatin signature for genes in S2 cells. To compare the ChIP-seq data with the gene expression level, we profiled the S2 cell transcriptome using the RNA-seq method (Materials and methods). We found that 25 out of the 27 (93%) bivalent genes in non-synchronized S2 cells are actively expressed (RPKM &#8805; 1), indicating that the bivalency may be a result of temporally regulated gene expression during cell cycle. Our S2 cell data are consistent with a previous report using fly embryos <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, where the authors identified 4,893 H3K4me3- and 2,480 H3K27me3-enriched regions, with only 161 overlapping regions. The even fewer bivalent genes in S2 cells (27 genes) than those in <it>bam </it>testis (91 genes) may reflect a higher homogeneity of cultured cells than of dissected tissues. We also identified 3,956 genes that have significant binding of RNA Pol II at their promoter region (<it>P </it>&lt; 0.05), and 5,281 genes that are unbound by Pol II (Figure <figr fid="F4">4b</figr>). We found that among all the Pol II-enriched genes, 93% are expressed genes (RPKM &#8805; 1; Figure <figr fid="F4">4b</figr>). In contrast, only 260 genes that were expressed at very low to undetectable levels (RPKM &lt;1) have enriched Pol II binding at their promoter region; these genes comprise approximately 6.6% of the entire Pol II-enriched genes and 2.7% of the total genes analyzed (9,459 genes). Consistently, more than 99.8% of H3K4me3-enriched genes are actively expressed (3,720 genes out of 3,728 genes with significant enrichment of H3K4me3 have RPKM values &#8805; 1; Additional file <supplr sid="S11">11</supplr>). When we compared the gene expression profiles in S2 cells determined by microarray versus RNA-seq techniques, we found that 770 genes considered as silent based on microarray analysis (two 'absent' calls in both biological replicates) were actually expressed according to their RPKM values (RPKM &#8805; 1; Figure <figr fid="F4">4c</figr>) in the RNA-seq data set.</p>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p><b>Summary of the ChIP-seq results using <it>Drosophila </it>S2 cells</b>. <b>(a) </b>The four groups of genes were classified according to their RPKM value based on the RNA-seq results. The numbers in brackets denote genes used for H3K36me3 (K36) analysis. *See Materials and methods for gene selection criteria. <b>(b-e) </b>Antibodies used for ChIP-seq are: (b) anti-RNA Pol II (Pol II); (c) anti-H3K4me3 (K4); (d) anti-H3K36me3 (K36); and (e) anti-H3K27me3 (K27). Enrichment of each histone modification and RNA Pol II is plotted over a -5 to +5-kb region with respect to the TSSs of the genes.</p>
               </text>
               <file name="gb-2010-11-4-r42-S10.jpeg">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Comparison of RNA-seq and ChIP-seq data using S2cells</p>
               </caption>
               <text>
                  <p><b>Comparison of RNA-seq and ChIP-seq data using S2cells</b>. <b>(a) </b>Scatter plot for H3K4me3 and H3K27me3 enrichment of all annotated genes. A common 0 to +500-bp window with respect to the TSS was used to calculate both H3K4me3 and H3K27me3 reads. The blue dashed lines indicate the statistical cutoff line - <it>P </it>&lt; 0.05 in a 1-kb window per 1 million total reads. Gene numbers in each quadrant are labeled in black. <b>(b) </b>Scatter plot for RNA Pol II enrichment and RPKM values for all annotated genes. All labels are the same as in Figure 2d, except that RPKM = 1 is used as a cutoff for expressed genes. <b>(c) </b>The RPKM value for genes that have two absent calls (2A) in the microarray. <b>(d) </b>Analysis of the RPKM value of transcripts demonstrates that genes with stalled Pol II are weakly transcribed genes. Genes that show an active Pol II profile are expressed at significantly higher levels. Genes with no Pol II profile are expressed at significantly lower levels. The box represents the 25th and 75th percentiles, with the 50th percentile as a black bar. The whiskers refer to outliers that are at least 1.5&#215; the interquartile range from the box. The y-axis represents the RPKM value.</p>
               </text>
               <graphic file="gb-2010-11-4-r42-4" hint_layout="double"/>
            </fig>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p><b>Most genes enriched with H3K4me3 are actively expressed in S2 cells</b>. Scatter plot of H3K4me3 enrichment and RPKM values for all annotated genes. All labels are the same as in Figure <figr fid="F2">2d</figr>, except RPKM = 1 was used as a cutoff for expressed genes.</p>
               </text>
               <file name="gb-2010-11-4-r42-S11.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We next applied the ChIP-seq data using antibodies against RNA Pol II to compute the stalling index of individual genes. Through this assay, we identified 1,821 genes with stalled Pol II in S2 cells (Additional file <supplr sid="S9">9</supplr>a; Materials and methods and <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>). We compared these 1,821 genes with the genes with promoter-proximal enrichment of polymerase, which were identified in the previous study using S2 cell <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. In fact, approximately 86.2% genes with promoter-proximal enrichment of polymerase overlapped with the genes with stalled Pol II identified in our analysis (Additional file <supplr sid="S9">9</supplr>a), indicating that the criteria we used to define genes with stalled Pol II is consistent with previous studies. Ontology assays further confirmed that the significant categories (<it>P </it>&lt; 0.01) of genes identified using either method are similar (Additional file <supplr sid="S9">9</supplr>c,d). Noticeably, compared to genes with active Pol II, the genes with stalled Pol II have lower expression levels (Figure <figr fid="F4">4d</figr>), although most (88.5%) of them are actively expressed (RPKM &#8805; 1). These results suggest that Pol II binding correlates well with active gene expression status. More sensitive techniques, such as RNA-seq, can accurately detect transcript level for comparison with chromatin structure, in order to understand how epigenetic mechanisms regulate gene expression. To further confirm that our results are not algorithm-specific, we re-analyzed the data with an independent method, the SICER software <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> (Materials and methods). This software uses different algorithms to identify Pol II or modified histone-enriched genomic regions, regardless of their associations with genes. The SICER software has been used to identify bivalent domains in multipotent hematopoietic stem cell lineages in mammals <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Using SICER, we confirmed that most H3K4me3- and H3K27me3-enriched regions are not overlapping (data not shown) and therefore are monovalent in <it>Drosophila bam </it>testis and cultured S2 cells. Using SICER, we also confirmed that most genes associated with Pol II-enriched regions in these two samples are actively expressed (data not shown).</p>
         </sec>
         <sec>
            <st>
               <p>Does cell type specificity dictate chromatin architecture?</p>
            </st>
            <p>One possible reason for the different chromatin features identified in our data versus previously published mammalian studies is cell type differences. The <it>bam </it>testis we used for ChIP-seq analysis is mainly enriched with transit-amplifying spermatogonial cells, and the chromatin differences we found could come from the distinction between transit-amplifying cells and <it>bona fide </it>stem cells. Another possibility for such differences between undifferentiated germ cell-enriched testis samples and ESCs is that germ cells are part of a unilineage stem cell system, but ESCs are pluripotent. Thus, in the germline lineage, differentiation genes have a unidirectional switch from silent to activated status during spermatogenesis, whereas developmental genes in ESC lineages have multi-directional switches during lineage-specific differentiation. The bivalent signature of ESC genes may enable more dynamic and refined activation upon perceiving additional signals (Figure <figr fid="F5">5a</figr>). Indeed, a permanganate footprint assay using early staged <it>Drosophila </it>embryos, which are pluripotent, showed evidence of paused Pol II at several genes. In contrast, there was no such evidence when using S2 cells derived from older embryos, at which stage the pluripotency may be lost <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Alternatively, these differences may simply reflect distinctive features between somatic cells and germ cells. Germ cells give rise to gametes, which are the most 'immortal' cell type due to their ability to produce the next generation of an entire organism upon fertilization <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. This requirement to reset totipotency may distinguish germ cells from somatic cells with regard to their chromatin features (Figure <figr fid="F5">5a</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Cartoons comparing ESCs and undifferentiated cells of <it>bam </it>testis</p>
               </caption>
               <text>
                  <p><b>Cartoons comparing ESCs and undifferentiated cells of <it>bam </it>testis</b>. <b>(a) </b>During ESC differentiation, bivalent genes resolve into monovalent genes according to cell type specificity in somatic lineages. In the <it>Drosophila </it>male germline lineage, monovalent genes in undifferentiated cells may either retain their chromatin signature or switch to another pattern. Differentiation genes that are required for spermatogenesis require additional activation mechanisms to turn on their expression robustly, in addition to the removal of the repressive H3K27me3 mark. <b>(b) </b>A potential molecular mechanism that renders the bivalency and poised status dispensable in the <it>Drosophila </it>germline stem cell lineage, due to the lack of endogenous DNA methylase activity. DNMT, DNA methyltransferase.</p>
               </text>
               <graphic file="gb-2010-11-4-r42-5" hint_layout="double"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Does species specificity dictate chromatin architecture?</p>
            </st>
            <p>The chromatin structure we observed in undifferentiated germ cell-enriched <it>bam </it>testis may also apply to <it>Drosophila </it>S2 cells, raising another possibility that species difference should be considered. It has been demonstrated that bivalent genes in mammalian ESCs are also enriched with PcG activities <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. These PcG components have the capability to recruit DNA methyltransferases to methylate DNA and silence gene expression in a more permanent manner <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. In contrast, the active H3K4me3 mark placed by the TrxG complex <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> may keep chromatin in a relatively open state by recruiting RNA Pol II and initiating low level transcription to prevent DNA methylation (Figure <figr fid="F5">5b</figr>). Indeed, many bivalent domains in ESCs reside at the highly conserved non-coding elements, which are enriched with CpG islands and potentiated for DNA methylation <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In contrast to mammalian systems, the DNA methylase activity is almost negligible in adult flies <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>. Therefore, differentiation genes in <it>Drosophila </it>are not subject to a 'stable' repression caused by DNA methylation as in the mammalian systems, which may promote more dynamic gene regulation while rendering bivalency and paused RNA Pol II status dispensable in <it>Drosophila</it>. Noticeably, recent work in <it>Xenopus </it>embryos also demonstrated that active H3K4me3 and repressive H3K27me3 regulate distinct groups of genes in a spatially controlled manner. It will be interesting to study monovalency versus bivalency during animal development, as well as evolution.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Recent data have suggested two non-mutually exclusive mechanisms of gene expression regulation, paused Pol II and bivalent modifications, in mammalian ESCs and other stem or progenitor cells. In these primitive cell types, developmental regulators are characterized by bivalent domains harboring both the H3K4me3 and H3K27me3 marks <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. These genes may also be bound by paused Pol II, which may poise them for rapid induction during differentiation or in response to developmental and environmental stimuli <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B12">12</abbr></abbrgrp>. Here our analysis of undifferentiated-cell-enriched <it>Drosophila bam </it>testis, by combining highly sensitive ChIP-seq and RNA-seq methods, indicates that most differentiation genes are not enriched with the bivalent signature. In addition, these genes are unpoised with no significant binding of paused Pol II. These data suggest that transcription of differentiation genes in this system is mainly controlled at the Pol II recruitment step but not at the elongation step. These chromatin signatures we identified could reflect cell type specificity, species specificity, or both.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Fly strains and husbandry</p>
            </st>
            <p>Flies were raised using standard medium at 25&#176;C unless stated otherwise. The <it>bam</it><sup>1</sup>/TM3 stock was obtained from Bloomington <it>Drosophila </it>Stock Center. The <it>bam</it><sup><it>114-97</it></sup>/TM6B stock was a gift from Dr Margaret Fuller. The <it>bam</it><sup><it>1</it></sup><it>/bam</it><sup><it>114-97 </it></sup>testis was used for immunostaining with antibodies against a germ cell-specific mark Vasa <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> and a somatic cell-specific mark Traffic jam (Tj) <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> to demonstrate that most of the cells in this tissue are germ cells (Additional file <supplr sid="S12">12</supplr>).</p>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p>
                     <b>Immunostaining of the <it>bam </it>testis with antibodies against the germ cell marker Vasa and the somatic marker Traffic jam (Tj).</b>
                  </p>
               </text>
               <file name="gb-2010-11-4-r42-S12.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Culturing S2 cells</p>
            </st>
            <p>The <it>Drosophila </it>S2 cells (ATCC, CRL-1963, Lot#5054622, Manassas, VA, USA) were cultured following the manufacturer's protocol. Briefly, one vial of S2 cell stock was taken from liquid nitrogen and thawed quickly at room temperature. Once the cells were completely thawed, they were transferred to a 25 cm<sup>2 </sup>flask (Corning, CLS430639-20EA, Lowell, MA, USA) containing 5 ml of room temperature complete Schneider's <it>Drosophila </it>medium (GIBCO, #11720-034, Carlsbad, CA, USA; contains 10% heat-inactivated fetal bovine serum (GIBCO #16140-071)). After incubating at 25&#176;C for 30 minutes, the S2 cells were centrifuged at 1,000 rpm, the medium was removed and the cells were transferred into a 25 cm<sup>2 </sup>flask (Corning, CLS430639-20EA) containing 5 ml of room temperature complete Schneider's <it>Drosophila </it>medium. The S2 cells were harvested at the exponentially growing stage after incubating at 25&#176;C for about 90 hours.</p>
         </sec>
         <sec>
            <st>
               <p>Microarray experiments and data analysis</p>
            </st>
            <p>Total RNA from approximately 200 pairs of <it>bam </it>(<it>bam</it><sup><it>1</it></sup>/<it>bam</it><sup><it>114-97</it></sup>) fly testes was extracted using TRIzol (Invitrogen, #15596-018, Carlsbad, CA, USA) following the manufacturer's instructions and the genomic DNA was degraded using 2 Units of DNase I (Fermentas, #EN0521, Glen Burnie, MD, USA) at 37&#176;C for 20 minutes. Total RNA from <it>Drosophila </it>S2 cells was extracted using the Qiagen RNeasy Mini Kit (catalogue number 74014, Valencia, CA, USA), and the genomic DNA was degraded with RNase-Free DNase (catalogue number 79254). RNA integrity was checked by gel electrophoresis (1% agarose). Approximately 4 &#956;g of total RNA from each biological replicate were used to generate labeling probes to hybridize with the Affymetix GeneChip <it>Drosophila </it>Genome 2.0 Array according to the Affymetrix protocol. Three biological replicates were performed for <it>bam </it>testes and two biological replicates for S2 cells.</p>
            <p>Microarray hybridization was processed at the Genomics Core Facility at the National Heart, Lung and Blood Institute and the raw data were exported from the Affymetrix Microarray Suite (MAS). The CEL files were used for signal normalization with RMA as part of the limma package from the Bioconductor R packages <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. The 'Present (P)', 'Absent (A)' and 'Marginal (M)' calls were retrieved with the code of 'eset' in the Affy package.</p>
         </sec>
         <sec>
            <st>
               <p>RNA-seq</p>
            </st>
            <p>Extraction of RNA using <it>bam </it>testes or S2 cells was performed using similar methods as described for microarray experiments. For the total RNA from <it>bam </it>testes (8.5 &#956;g) and S2 cells (approximately 20 &#956;g), we performed two rounds of mRNA isolation using Dynabeads mRNA purification kit (Invitrogen, #610-06), according to the manufacturer's instructions. The final mRNAs were eluted in 13.5 &#956;l 10 mM Tris-HCl (pH 7.5) and immediately used to generate the first strand cDNA, using 4 &#956;l random hexamers (ABI, #N8080127, Foster City, CA, USA) and SuperScript II Reverse Transcription Kit (Invitrogen, #18064-014) in a 30 &#956;l final volume, following the manufacturer's instructions. The second strand cDNA was generated with the following recipe: 10 &#956;l 5&#215; second strand buffer (500 mM Tris-HCl pH7.8, 50 mM MgCl<sub>2</sub>, 10 mM DTT), 30 nmol dNTPs (Invitrogen, #18427-013), 2 Units of RNase H (Invitrogen, #18021-014) and 50 Units of DNA Pol I (Invitrogen, #18010-025). The entire reaction mix was incubated at 16&#176;C for 2.5 hours. The double-stranded DNA (dsDNA) was purified with a QIAquick PCR purification kit (Qiagen, #28106) and the concentration was quantified using a Qubit fluorometer (Invitrogen).</p>
            <p>To generate sequencing libraries, about 300 ng dsDNA from each sample was fragmented by sonication using Bioruptor (Diagenode, UCD-200-TM-EX, Sparta, NJ, USA) using medium power output for 30 minutes in ice water. The resulting DNA fragments were analyzed by agarose gel to verify they were within the approximately 100 to 300 bp size range. Sequencing libraries were prepared as follows: end-repair (DNA end-repair kit from Epicenter, #ER0720, Madison, WI, USA); A-tailing (300 ng dsDNA, 5 &#956;l Thermo buffer, 10 nmol dATP, 15 Units of Taq polymerase, at 70&#176;C for 30 minutes); Solexa adaptor ligation (300 ng dsDNA, 4 &#956;l DNA Ligase buffer, 1 &#956;l Solexa adaptor mix, 3 ul DNA Ligase, at 70&#176;C overnight); PCR (98&#176;C 10 s, 65&#176;C 30 s, 72&#176;C 30 s for 16 cycles; then additional 72&#176;C for 5 minutes) amplification with adaptor primers and size selection (200 to 400 bp). Then the library dsDNA for S2 cells was used on an Illumina Genome Analyzer II at a concentration of 10 ng per lane.</p>
            <p>We obtained 20,041,035 and 9,780,523 total reads from an Illumina Genome Analyzer II for <it>bam </it>testis and S2 cell samples, respectively. And 10,163,916 (<it>bam </it>testis) and 6,263,318 (S2 cell) unique and non-redundant reads were used for downstream data analysis.</p>
            <p>The Gene Expression Omnibus accession number for the raw and analyzed RNA-seq data is [GEO:GSE19325].</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of microarray results with RNA-seq data</p>
            </st>
            <p>To compare the RPKM value from RNA-seq data with microarray results, we first retrieved 12,728 <it>Drosophila </it>genes from the Ensembl database, which also have probe(s) in the Affymetix GeneChip <it>Drosophila </it>Genome 2.0 Array. Genes with multiple probes were filtered out if different probes gave inconsistent 'Present (P)' or 'Absent (A)' calls. We then analyzed the RPKM distribution for genes with all 'Absent' calls or all 'Present' calls in microarray datasets, for both <it>bam </it>testis and S2 cell samples.</p>
            <p>The histograms were generated using the 'hist' function in the R programming environment (R version 2.5.0 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>). To calculate the log<sub>2 </sub>RPKM values of individual genes, all their original RPKM values were added a pseudo-count of 1.</p>
         </sec>
         <sec>
            <st>
               <p>Chromatin immunoprecipitation</p>
            </st>
            <p>For each modified histone and Pol II ChIP experiment, we dissected approximately 200 pairs of <it>bam </it>testes in cold phosphate-buffered saline (PBS) and grouped them in 200 &#956;l PBS that contained protease inhibitor (Roche complete mini, #11836153001, Nutley, NJ, USA) and 0.5 mM phenylmethanesulfonyl fluoride (PMSF; MP Biomedicals, #195381, Solon, OH, USA). Approximately 1,000 cells could be extracted from one <it>bam </it>testis. We then added 5.5 &#956;l 37% fresh formaldehyde (Supelco, #47083-U, Bellefonte, PA, USA) and incubated at 37&#176;C for 15 minutes. The testes were washed twice with 450 &#956;l cold 1&#215; PBS (with inhibitors and PMSF). Then 200 &#956;l lysis buffer (50 mM Tris-HCl, pH7.6, 1 mM CaCl<sub>2</sub>, 0.2% Triton X-100, 5 mM butyrate, 1&#215; protease inhibitor cocktail, and 0.5 mM fresh PMSF) was added and the tissues were homogenized thoroughly followed by incubation at room temperature for 10 minutes. Chromatin was sheared into approximately 200-bp fragments by sonication using Microtip (Misonix, Inc., Microson XL-2000, Farmingdale, NY, USA) with the following procedure: 4 s at power 20, rest for 50 s, 4 to 5 times, followed by spinning at 14k rpm for 10 minutes at 4&#176;C. The chromatin was diluted 10&#215; with RIPA buffer (10 mM Tris, pH7.6, 1 mM EDTA, 0.1% SDS, 0.1% Na-Deoxycholate, 1% Triton X-100, with protease inhibitors and PMSF) and 50 &#956;l of this dilution was reverse cross-linked with 0.25 M NaCl for 2 hours at 65&#176;C and used as input for real-time PCR analysis.</p>
            <p>We washed 40 &#956;l of Dynabeads Protein A (Invitrogen, #100.01D) with 600 &#956;l 1&#215; PBS. We then added 100 &#956;l 1&#215; PBS with 4 &#956;g antibody and incubated the antibody-Protein A beads mixture at room temperature for 40 minutes with occasional tapping. After the unbound antibody was removed using the manipulator (Invitrogen, DYNAL MPC-S), 1 ml of the chromatin extract was added to the beads and the mixture was rotated at 4&#176;C overnight. The beads were then washed twice with 1 ml RIPA buffer, twice with 1 ml RIPA buffer containing 0.3 M NaCl, once with LiCl wash buffer (0.25 M LiCl, 0.5% NP40, and 0.5% sodium deoxycholate), once with 1 ml TE (10 mM Tris-HCl, pH 8.0 and 1 mM EDTA) containing 0.2% Triton X-100, and once with 1 ml TE. The beads were then suspended in 100 &#956;l 1&#215; TE containing 3 &#956;l 10% SDS and 5 &#956;l 20 mg/ml proteinase K, followed by incubation at 65&#176;C overnight. After the supernatant was collected, the beads were washed once more with 100 &#956;l TE with 0.5 M NaCl. The supernatant from this wash was combined with the previous supernatant. The combined samples were treated by Phenol/Chloroform extraction, salt/EtOH precipitation, and dissolved in 50 &#956;l 1&#215; TE. The products were either used for real-time PCR analyses or processed for Solexa sequencing according to the established protocol. Antibodies used include those against H3K4me3 (Abcam, #ab8580, Cambridge, MA, USA), H3K27me3 (Millipore, #07-449, Billerica, MA, USA), H3K36me3 (Abcam, #ab9050), H3 (Abcam, #ab1791) and RNA Pol II (Abcam, ab5408).</p>
         </sec>
         <sec>
            <st>
               <p>ChIP experiment using S2 cells</p>
            </st>
            <p>Exponentially growing S2 cells were harvested and dissolved in digestion buffer (50 mM Tris-HCl, pH7.6, 1 mM CCl<sub>2</sub>, 0.2% Triton X-100, 5 mM butyrate, 1&#215; protease inhibitor cocktail and 0.5 mM PMSF). Chromatin was prepared and ChIP-seq experiments were performed as described previously <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> with antibodies against Pol II (Abcam, #ab5408), H3K4me3 (Abcam, #ab8580), H3K27me3 (Millipore/Upstate, #07-449), and H3K36me3 (Abcam, #ab9050).</p>
         </sec>
         <sec>
            <st>
               <p>Solexa pipeline analysis</p>
            </st>
            <p>The 25-bp sequencing reads were obtained from the Illumina Genome Analyzer pipeline. All reads were aligned to the <it>Drosophila </it>genome (dm3) using the ELAND (Efficient Local Alignment of Nucleotide Data) software, allowing up to two mismatches with the reference sequence. Only uniquely mapped reads were retained. For multiple identical reads, at most three copies were retained to reduce the possibility of biases from PCR amplification. The output of the Genome Analyzer pipeline was converted to browser extensible data (BED) files. The wig files used for visualization on the UCSC browser were generated from the uniquely mapped reads using a 4-bp window and 160 bp as the DNA fragment size, as previously described <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The size of the DNA fragment was determined by the distance from the 5' to the 3' peak of the mapped reads, as shown in Additional file <supplr sid="S13">13</supplr>.</p>
            <suppl id="S13">
               <title>
                  <p>Additional file 13</p>
               </title>
               <text>
                  <p><b>Determination of the fragment size for ChIP-seq analysis, based on the distance between the 5' and 3' sequencing read peaks</b>. All expressed genes (RPKM &#8805; 1) were used for the modification level plots over a -2 to +2-kb window with respect to the TSS. <b>(a-e) </b>Antibodies used for ChIP-seq in <it>bam </it>testis were: (a) anti-RNA Pol II; (b) anti-H3K4me3; (c) anti-H3K36me3; (d) anti-H3K27me3; and (e) anti-H3.</p>
               </text>
               <file name="gb-2010-11-4-r42-S13.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The Gene Expression Omnibus accession number for the raw and analyzed ChIP-seq data is [GEO:GSE19325].</p>
         </sec>
         <sec>
            <st>
               <p>Defining genomic regions for analyzing modified histone and Pol II enrichment</p>
            </st>
            <p>To determine the regions used for modified histone and Pol II occupancy of the annotated transcripts in Figure <figr fid="F1">1</figr>, we first plotted each histone modification and Pol II using the entire annotated transcripts that are applicable for ChIP-seq analysis. Based on the overall enrichment plot, we used the region from -250 to +250 bp (the TSS was defined as 0) to calculate the Pol II enrichment of individual transcripts. A 0 to +500-bp window was used for H3K4me3 and H3K27me3, and a +500 to 1,500-bp window was used for H3K36me3 enrichment calculations. For genes with a transcript size longer than 1 kb, a +500 to +1,000-bp window was defined as the gene body region to calculate the stalling index.</p>
         </sec>
         <sec>
            <st>
               <p><it>Drosophila </it>genes used for ChIP-seq analysis</p>
            </st>
            <p><it>Drosophila </it>genes used for ChIP-seq analysis were derived from the UCSC database (April 2006/BDGP R5/dm3), which contained 14,058 genes and 21,243 transcripts. The coordinates for these transcripts were downloaded (August 2009) from the UCSC table browser <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Compared to human and mouse genomes, the <it>Drosophila </it>genome is more gene-dense, including many overlapping genes and short distances between TSSs, which may lead to incorrect conclusions from ChIP-seq analysis. To avoid this, we chose transcripts for analysis using the following three steps. First, exclude transcripts shorter than 500 bp (transcript size is defined as the distance between the annotated transcription start and end sites) because these short transcripts will affect the promoter analysis for H3K4me3 and H3K27me3. A total of 740 transcripts (736 genes) were excluded using this criterion. Second, exclude overlapping transcripts from different genes (Additional file <supplr sid="S14">14</supplr>) and transcripts that have short distances between their TSSs; 6,497 transcripts (4,535 genes) overlap with at least one other transcript, but 3,832 transcripts (2,555 genes) among them were useful for promoter analysis (-250 to +250 bp, or 0 to +500 bp), because the overlapping regions do not affect promoter analysis of these genes. A total of 3,558 non-overlapping transcripts (2,729 genes with opposite transcription direction) have a distance between TSSs shorter than 400 bp, which will affect the Pol II promoter analysis (-250 to +250 bp), and were thus excluded. Third, after removal of certain transcripts using the above two criteria, 14,849 transcripts (9,459 genes) were retained for ChIP-seq promoter analysis. In total, 7,799 genes were applicable for stalling index analysis, and 6,400 genes for H3K36me3 enrichment analysis. Among these 9,459 genes, 2,612 have multiple transcripts, but only one transcript for each was used for ChIP-seq analysis.</p>
            <suppl id="S14">
               <title>
                  <p>Additional file 14</p>
               </title>
               <text>
                  <p><b>Cartoons showing overlapping transcripts from different genes that were retained for data analysis</b>. The transcripts labeled by blue letters were retained for data analysis, in each situation shown for overlapping genes.</p>
               </text>
               <file name="gb-2010-11-4-r42-S14.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Choosing the dominant transcript for multi-isoform genes</p>
            </st>
            <p>For genes with multiple isoforms, we used the most abundant one for the comparison between ChIP-seq and RNA-seq data.</p>
            <sec>
               <st>
                  <p><it>bam </it>Pol II versus testis RPKM, and S2 Pol II versus cell RPKM</p>
               </st>
               <p>1, The dominant transcript was determined as that with the highest number of Pol II ChIP-seq reads in the -250 to +250-bp window with respect to the TSS region. If multiple transcripts have the same number of reads in this region, criterion 2 is used. 2, For transcripts that are longer than 1 kb, the dominant transcript was determined as that with the highest number of ChIP-seq reads in the gene body (+500 to +1,000-bp with respect to the TSS). For those transcripts that are shorter than 1 kb, the dominant transcript was determined as that with the highest number of ChIP-seq reads for H3K4me3 in the region 0 to +500 bp with respect to the TSS. If multiple transcripts have the same numbers of reads, criterion 3 is used. 3, The dominant transcript was determined as that with the longest transcript. If all isoforms have the same length, one was chosen randomly.</p>
            </sec>
            <sec>
               <st>
                  <p><it>bam </it>H3K4me3 versus H3K27me3, bam H3K4me3 versus testis RPKM, S2 cell H3K4me3 versus H3K27me3, and S2 H3K4me3 versus RPKM</p>
               </st>
               <p>1, The dominant transcript was determined as that with the highest number of H3K4me3 ChIP-seq reads in the region 0 to +500 bp with respect to the TSS. If multiple transcripts have the same number of reads, criterion 2 is used. 2, For transcripts that are longer than 1.5 kb, the dominant transcript was determined as that with the highest number of ChIP-seq reads for H3K36me3. For transcripts that are shorter than 1.5 kb, and thus are not applicable to calculate H3K36me3 reads, the dominant transcript was determined as that with the highest number of ChIP-seq reads for Pol II in the region -250 to +250 bp with respect to the TSS. If multiple transcripts have the same number of reads, criterion 3 is used. 3, The dominant transcript was determined as that with the longest transcript. If all isoforms have the same length, one was chosen randomly.</p>
            </sec>
            <sec>
               <st>
                  <p><it>bam </it>H3K36me3 versus testis RPKM, and S2 cell H3K36me3 versus RPKM</p>
               </st>
               <p>1, For the 6,400 genes that have at least one transcript longer than 1.5 kb, the dominant transcript was determined as that with the highest number of H3K36me3 ChIP-seq reads in the region +500 to +1,500 bp with respect to the TSS. If multiple transcripts have the same number of reads, criterion 2 is used. 2, The dominant transcript was determined as that with the highest number of ChIP-seq reads for H3K4me3 in the region 0 to +500 bp with respect to the TSS. If multiple transcripts have the same number of reads, criterion 3 is used. 3, The dominant transcript was determined as that with the longest transcript. If all isoforms have the same length, one was chosen randomly.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Comparison of ChIP-seq results with the RNA-seq data</p>
            </st>
            <p>To generate the plots in Figure <figr fid="F1">1</figr>, we retrieved annotated <it>Drosophila </it>genes from the Ensembl database for ChIP-seq analysis. We classified them into silent and expressed genes according to RPKM value (genes with RPKM = 0 were classified as the silent group, and genes with RPKM &#8805; 1 were classified as the expressed group). The expressed group was further classified into low, moderate and high groups based on RPKM values. The coordinates of these transcripts were downloaded from the UCSC table browser <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. The read density was calculated in a 5-bp window across the genome.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of our ChIP-seq data with the published ChIP-chip data</p>
            </st>
            <p>We compared our 91 bivalent genes in <it>bam </it>testis with the ChIP-chip data using BG3 and D23 cells <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. We first downloaded both the H3K4me3 and H3K27me3 sgr files from the NCBI website. We retained the probes whose ChIP/input hybridization intensity ratios are &#8805; 2, to be consistent with the authors' definition of enriched regions. We then mapped the probes to the promoter regions (0 to +500 bp) of our 91 bivalent genes. For BG3 cells, we found 6 out of the 91 <it>bam </it>testis bivalent genes contained an enriched H3K4me3 signal; and 4 of these 91 bivalent genes contained an enriched H3K27me3 signal. However, none of these 91 bivalent genes is enriched with both H3K4me3 and H3K27me3 in BG3 cells. Similarly, for D23 cells, 5 and 11 out of these 91 <it>bam </it>testis bivalent genes contained enriched H3K4me3 and H3K27me3 signals, respectively. Again, none of these 91 bivalent genes is enriched with both H3K4me3 and H3K27me3 in D23 cells.</p>
            <p>We also checked the ChIP-on-chip data using <it>Drosophila </it>embryos <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> and found 4,893 H3K4me3 and 2,480 H3K27me3 enriched regions. However, only 161 of them overlap with each other. We then compared the 91 bivalent genes we identified in <it>bam </it>testis with these 161 bivalent regions in embryos. From this comparison, we found only two genes (<it>CG4637 </it>and <it>CG9610</it>) for which the promoter region (0 to +500 bp) overlaps two bivalent regions in embryos (chr3R: 18965718-18967662 and chr3R: 4158335-4159501).</p>
         </sec>
         <sec>
            <st>
               <p>Determination of the <it>P</it>-value of enrichment of ChIP-seq reads within a 500-bp window</p>
            </st>
            <p>Since we used a 500-bp window to detect enrichment of Pol II and histone modifications (for example, active H3K4me3 and repressive H3K27me3), a sequencing read count threshold was chosen according to the Poisson distribution, which distributes the total and unique reads randomly across the <it>Drosophila </it>genome that can be mapped. For every 500-bp window with read number ranging from 1 to 99, the <it>P</it>-value was calculated. The threshold was chosen as the minimal number of reads that reached a significant enrichment compared to the random distribution (that is, <it>P </it>&lt; 0.05). For example, for the total 1,342,075 reads of anti-Pol II ChIP-seq in <it>bam </it>testis, a 500-bp window containing 11 read counts has a <it>P</it>-value of 0.03, which passes the threshold <it>P</it>-value of &#8804; 0.05. We then set the threshold of significant enrichment within a 500-bp window in this data set to be 11 read counts.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of the RNA Pol II stalling index</p>
            </st>
            <p>To analyze the RNA Pol II stalling index, we modified a method that was adapted from published work <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B66">66</abbr></abbrgrp>. Basically, the stalling index reflects differential Pol II binding at the promoter region versus the gene body region. Therefore, we defined the -250 to +250-bp region around the TSS as the promoter region (with respect to the TSS), and the +500 to +1,000-bp region (with respect to the TSS) as the gene body region. To calculate the stalling index, we first counted the total Pol II reads at the promoter region and at the gene body region. A stalling index was defined as the ratio of total reads in the promoter region divided by the total reads in the gene body region. Based on the stalling index, we classified genes into active, stalled or no Pol II categories based on the following criteria: active Pol II genes, stalling index &#8804; 3 and significant Pol II enrichment (<it>P </it>&lt; 0.05); stalled Pol II genes, stalling index &#8805; 5 and significant Pol II enrichment (<it>P </it>&lt; 0.05); no Pol II genes, no significant Pol II enrichment (<it>P </it>> 0.05).</p>
         </sec>
         <sec>
            <st>
               <p>Scatter plot analysis</p>
            </st>
            <p>The scatter plots delineate comparisons of different chromatin modifications in <it>bam</it>. All plots were generated in the R programming environment (R version 2.5.0 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>). Transformation to the log value was used to compare chromatin modifications with RPKM values.</p>
         </sec>
         <sec>
            <st>
               <p>Box plot analysis</p>
            </st>
            <p>The distribution of gene expression level was analyzed using box plots in the R programming environment (R version 2.5.0 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>). The box represents the 25th and 75th percentiles, with the 50th percentile as a black bar. The whiskers refer to outliers that are at least 1.5&#215; the interquartile range from the box. The y-axis represents the RPKM value.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of Pol II or modified histone-enriched regions using SICER</p>
            </st>
            <p>To identify the significantly enriched regions, we used SICER software <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> with the following parameters: window size = 200 bp, gap size = 0 bp and E-value = 100 for Pol II and H3K4me3; window size = 400 bp, gap size = 0 bp and E-value = 100 for H3K27me3 and H3K36me3. The reason we do not allow any gap is due to the density of genes in the <it>Drosophila </it>genome.</p>
         </sec>
         <sec>
            <st>
               <p>Sequencing depth analysis</p>
            </st>
            <p>To analyze the sequencing depth of ChIP-seq, we first shuffled the reads and their corresponding genomic loci, then extracted subsamples (2.5%, 5%, 7.5%, and so on until 100% of the total unique reads), and then identified the enriched regions in each subsample using SICER software as described previously. The E-value was increased from the first subsample (E-value = 3) to the last subsample (E-value = 120) by an increment of 3. We then plotted the correlation between subsamples and the enriched Pol II or modified histones in the corresponding regions, as shown in Additional file <supplr sid="S2">2</supplr>.</p>
         </sec>
         <sec>
            <st>
               <p>Gene Ontology assay</p>
            </st>
            <p>The gene function ontology analyses were performed using the DAVID 2008 informatics tools <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, based on the Gene Ontology Consortium <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. All Ensembl annotated genes were used as a background comparison. Two particular Gene Ontology annotations (molecular function and biological process) were analyzed with a cutoff P value of &lt; 0.01.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p><it>bam</it>: <it>bag-of-marbles</it>; <it>bgcn</it>: <it>benign gonial cell neoplasm</it>; ChIP: chromatin immunoprecipitation; ChIP-seq: chromatin immunoprecipitation followed by high-throughput sequencing; dsDNA: double-stranded DNA; ESC: embryonic stem cell; <it>E(z)</it>: <it>Enhancer of Zeste</it>; GSC: germline stem cell; PBS: phosphate-buffered saline; PcG: Polycomb group; PMSF: phenylmethanesulfonyl fluoride; Pol II: RNA Polymerase II; RPKM: sequencing reads per kilobase of exon per million mapped reads; <it>sa</it>: <it>spermatocyte arrest</it>; TrxG: Trithorax group; TSS: transcriptional start site.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>QG, SE, GW, KC and XC performed the experiments; QG and DS analyzed the data; QG, DS, KZ and XC wrote the paper.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Caitlin Choi and Ankit Vartak for technical assistance with testis dissection. We also thank Chen lab members for their critical reading and insightful suggestions. The gene expression profiling using microarrays was performed by the Genomics Core Facility of the National Heart, Lung and Blood Institute. This work is supported in part by research grant no. 05-FY09-88 from the March of Dimes Foundation, the R00HD055052 NIH Pathway to Independence Award from NICHD, the 49th Mallinckrodt Scholar Award from the Edward Mallinckrodt, Jr Foundation and the Johns Hopkins University start-up funding (XC), and support of Division of Intramural Research, the National Heart, Lung and Blood Institute, NIH (KZ).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular control of pluripotency.</p>
            </title>
            <aug>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mathur</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>455</fpage>
            <lpage>462</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gde.2006.08.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">16920351</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Stem cells, the molecular circuitry of pluripotency and nuclear reprogramming.</p>
            </title>
            <aug>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2008</pubdate>
            <volume>132</volume>
            <fpage>567</fpage>
            <lpage>582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2008.01.015</pubid>
                  <pubid idtype="pmpid" link="fulltext">18295576</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A bivalent chromatin structure marks key developmental genes in embryonic stem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huebert</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Meissner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wernig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Plath</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wagschal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Feil</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>315</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.02.041</pubid>
                  <pubid idtype="pmpid" link="fulltext">16630819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <fpage>413</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.38.072902.091907</pubid>
                  <pubid idtype="pmpid" link="fulltext">15568982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>ASH1, a <it>Drosophila </it>trithorax group protein, is required for methylation of lysine 4 residues on histone H3.</p>
            </title>
            <aug>
               <au>
                  <snm>Byrd</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Shearn</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>11535</fpage>
            <lpage>11540</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.1933593100</pubid>
                  <pubid idtype="pmcid">208793</pubid>
                  <pubid idtype="pmpid" link="fulltext">13679578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The histone methyltransferases Trithorax and Ash1 prevent transcriptional silencing by Polycomb group proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Klymenko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>373</fpage>
            <lpage>377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.embor.7400111</pubid>
                  <pubid idtype="pmcid">1299022</pubid>
                  <pubid idtype="pmpid" link="fulltext">15031712</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Histone methyltransferase activity of a <it>Drosophila </it>Polycomb group repressor complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Muller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Francis</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Vargas</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Sengupta</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wild</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>O'Connor</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Kingston</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>197</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00976-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">12408864</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Role of histone H3 lysine 27 methylation in Polycomb-group silencing.</p>
            </title>
            <aug>
               <au>
                  <snm>Cao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Erdjument-Bromage</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tempst</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>1039</fpage>
            <lpage>1043</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1076997</pubid>
                  <pubid idtype="pmpid" link="fulltext">12351676</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Polycomb/Trithorax response elements and epigenetic memory of cell identity.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2007</pubdate>
            <volume>134</volume>
            <fpage>223</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/dev.02723</pubid>
                  <pubid idtype="pmpid" link="fulltext">17185323</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Polycomb complexes repress developmental regulators in murine embryonic stem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Plath</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brambrink</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Medeiros</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Wernig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tajonar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ray</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Otte</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>441</volume>
            <fpage>349</fpage>
            <lpage>353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04733</pubid>
                  <pubid idtype="pmpid" link="fulltext">16625203</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Control of developmental regulators by Polycomb in human embryonic stem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Jenner</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Guenther</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Chevalier</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Isono</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Koseki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fuchikami</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Abe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Zucker</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Herbolsheimer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Otte</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Volkert</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Melton</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>301</fpage>
            <lpage>313</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.02.043</pubid>
                  <pubid idtype="pmpid" link="fulltext">16630818</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A chromatin landmark and transcription initiation at most promoters in human cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Guenther</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>130</volume>
            <fpage>77</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2007.05.042</pubid>
                  <pubid idtype="pmpid" link="fulltext">17632057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Searching chromatin for stem cell identity.</p>
            </title>
            <aug>
               <au>
                  <snm>Buszczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spradling</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>233</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.04.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">16630812</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Stem cells: What can we learn from flies?</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Fly (Austin)</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18820440</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Male and female <it>Drosophila </it>germline stem cells: two versions of immortality.</p>
            </title>
            <aug>
               <au>
                  <snm>Fuller</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Spradling</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>316</volume>
            <fpage>402</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1140861</pubid>
                  <pubid idtype="pmpid" link="fulltext">17446390</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Genetic control of cell proliferation and differentiation in <it>Drosophila </it>spermatogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Fuller</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>Semin Cell Dev Biol</source>
            <pubdate>1998</pubdate>
            <volume>9</volume>
            <fpage>433</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/scdb.1998.0227</pubid>
                  <pubid idtype="pmpid" link="fulltext">9813190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Tissue-specific TAFs counteract Polycomb to turn on terminal differentiation.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hiller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sancak</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fuller</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <fpage>869</fpage>
            <lpage>872</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1118101</pubid>
                  <pubid idtype="pmpid" link="fulltext">16272126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>bag-of-marbles and benign gonial cell neoplasm act in the germline to restrict proliferation during <it>Drosophila </it>spermatogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Gonczy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Matunis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>DiNardo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>1997</pubdate>
            <volume>124</volume>
            <fpage>4361</fpage>
            <lpage>4371</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9334284</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>bag-of-marbles: a <it>Drosophila </it>gene required to initiate both male and female gametogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>McKearin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Spradling</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>1990</pubdate>
            <volume>4</volume>
            <fpage>2242</fpage>
            <lpage>2251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.4.12b.2242</pubid>
                  <pubid idtype="pmpid" link="fulltext">2279698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Novel regulators revealed by profiling <it>Drosophila </it>testis stem cells within their niche.</p>
            </title>
            <aug>
               <au>
                  <snm>Terry</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Tulina</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Matunis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>DiNardo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2006</pubdate>
            <volume>294</volume>
            <fpage>246</fpage>
            <lpage>257</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2006.02.048</pubid>
                  <pubid idtype="pmpid" link="fulltext">16616121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Dynamic regulation of alternative splicing and chromatin structure in <it>Drosophila </it>gonads revealed by RNA-seq.</p>
            </title>
            <aug>
               <au>
                  <snm>Gan</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Chepelev</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tarayrah</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Cell Res</source>
            <pubdate>2010</pubdate>
            <inpress/>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">20440302</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genome-wide approaches to studying chromatin modifications.</p>
            </title>
            <aug>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>179</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg2270</pubid>
                  <pubid idtype="pmpid" link="fulltext">18250624</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genomic location analysis by ChIP-Seq.</p>
            </title>
            <aug>
               <au>
                  <snm>Barski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>J Cell Biochem</source>
            <pubdate>2009</pubdate>
            <volume>107</volume>
            <fpage>11</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jcb.22077</pubid>
                  <pubid idtype="pmpid" link="fulltext">19173299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Ring1-mediated ubiquitination of H2A restrains poised RNA polymerase II at bivalent genes in mouse ES cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Stock</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Giadrossi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Casanova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brookes</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Koseki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Brockdorff</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fisher</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Pombo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Cell Biol</source>
            <pubdate>2007</pubdate>
            <volume>9</volume>
            <fpage>1428</fpage>
            <lpage>1435</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ncb1663</pubid>
                  <pubid idtype="pmpid" link="fulltext">18037880</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>RNA-Seq: a revolutionary tool for transcriptomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2009</pubdate>
            <volume>10</volume>
            <fpage>57</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg2484</pubid>
                  <pubid idtype="pmpid" link="fulltext">19015660</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Mapping and quantifying mammalian transcriptomes by RNA-Seq.</p>
            </title>
            <aug>
               <au>
                  <snm>Mortazavi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>McCue</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schaeffer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wold</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2008</pubdate>
            <volume>5</volume>
            <fpage>621</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth.1226</pubid>
                  <pubid idtype="pmpid" link="fulltext">18516045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>High-resolution profiling of histone methylations in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Barski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cuddapah</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Roh</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chepelev</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>129</volume>
            <fpage>823</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2007.05.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">17512414</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Genome-wide maps of chromatin state in pluripotent and lineage-committed cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Ku</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Issac</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lieberman</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Giannoukos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Alvarez</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brockman</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Koche</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Mendenhall</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Presser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Russ</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Meissner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wernig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>448</volume>
            <fpage>553</fpage>
            <lpage>560</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06008</pubid>
                  <pubid idtype="pmpid" link="fulltext">17603471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Whole-genome analysis of histone H3 lysine 4 and lysine 27 methylation in human embryonic stem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ruotti</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jonsdottir</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thomson</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Cell Stem Cell</source>
            <pubdate>2007</pubdate>
            <volume>1</volume>
            <fpage>299</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.stem.2007.08.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">18371364</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Whole-genome mapping of histone H3 Lys4 and 27 trimethylations reveals distinct genomic compartments in human embryonic stem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>XD</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chew</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Choo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Orlov</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Sung</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Shahab</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kuznetsov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Bourque</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ruan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>CL</fnm>
               </au>
            </aug>
            <source>Cell Stem Cell</source>
            <pubdate>2007</pubdate>
            <volume>1</volume>
            <fpage>286</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.stem.2007.08.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">18371363</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Global mapping of H3K4me3 and H3K27me3 reveals specificity and plasticity in lineage fate determination of differentiating CD4+ T cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hu-Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kanno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Roh</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Watford</snm>
                  <fnm>WT</fnm>
               </au>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Paul</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>O'Shea</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Immunity</source>
            <pubdate>2009</pubdate>
            <volume>30</volume>
            <fpage>155</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.immuni.2008.12.009</pubid>
                  <pubid idtype="pmcid">2722509</pubid>
                  <pubid idtype="pmpid" link="fulltext">19144320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Chromatin signatures in multipotent human hematopoietic stem cells indicate the fate of bivalent genes during differentiation.</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roh</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Childs</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell Stem Cell</source>
            <pubdate>2009</pubdate>
            <volume>4</volume>
            <fpage>80</fpage>
            <lpage>93</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.stem.2008.11.011</pubid>
                  <pubid idtype="pmcid">2785912</pubid>
                  <pubid idtype="pmpid" link="fulltext">19128795</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Characterization of human epigenomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2009</pubdate>
            <volume>19</volume>
            <fpage>127</fpage>
            <lpage>134</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gde.2009.02.001</pubid>
                  <pubid idtype="pmcid">2699568</pubid>
                  <pubid idtype="pmpid" link="fulltext">19299119</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Testis-specific TAF homologs collaborate to control a tissue-specific transcription program.</p>
            </title>
            <aug>
               <au>
                  <snm>Hiller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Pringle</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Suchorolski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sancak</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Viswanathan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bolival</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Marino</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fuller</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2004</pubdate>
            <volume>131</volume>
            <fpage>5297</fpage>
            <lpage>5308</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/dev.01314</pubid>
                  <pubid idtype="pmpid" link="fulltext">15456720</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>A hierarchy of H3K4me3 and H3K27me3 acquisition in spatial gene regulation in <it>Xenopus </it>embryos.</p>
            </title>
            <aug>
               <au>
                  <snm>Akkers</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>van Heeringen</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Jacobi</snm>
                  <fnm>UG</fnm>
               </au>
               <au>
                  <snm>Janssen-Megens</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Francoijs</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Stunnenberg</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Veenstra</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2009</pubdate>
            <volume>17</volume>
            <fpage>425</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.devcel.2009.08.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">19758566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The curious case of bivalent marks.</p>
            </title>
            <aug>
               <au>
                  <snm>Herz</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Nakanishi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shilatifard</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2009</pubdate>
            <volume>17</volume>
            <fpage>301</fpage>
            <lpage>303</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.devcel.2009.08.014</pubid>
                  <pubid idtype="pmpid" link="fulltext">19758552</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Alternative epigenetic chromatin states of polycomb target genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>YB</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Stenberg</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ohno</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bourgon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pirrotta</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <volume>6</volume>
            <fpage>e1000805</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.1000805</pubid>
                  <pubid idtype="pmcid">2799325</pubid>
                  <pubid idtype="pmpid" link="fulltext">20062800</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Functional anatomy of polycomb and trithorax chromatin landscapes in <it>Drosophila </it>embryos.</p>
            </title>
            <aug>
               <au>
                  <snm>Schuettengruber</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ganapathi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Leblanc</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Portoso</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jaschek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tolhuis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van Lohuizen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cavalli</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2009</pubdate>
            <volume>7</volume>
            <fpage>e13</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.1000013</pubid>
                  <pubid idtype="pmcid">2621266</pubid>
                  <pubid idtype="pmpid" link="fulltext">19143474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>RNA polymerase stalling at developmental control genes in the <it>Drosophila melanogaster </it>embryo.</p>
            </title>
            <aug>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Nechaev</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Adelman</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>1512</fpage>
            <lpage>1516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.2007.26</pubid>
                  <pubid idtype="pmcid">2824921</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Long-range chromosomal interactions regulate the timing of the transition between poised and active gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Vernimmen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>De Gobbi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sloane-Stanley</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Higgs</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2007</pubdate>
            <volume>26</volume>
            <fpage>2041</fpage>
            <lpage>2051</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.emboj.7601654</pubid>
                  <pubid idtype="pmcid">1852780</pubid>
                  <pubid idtype="pmpid" link="fulltext">17380126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Quiet and poised: "silent" genes accumulate transcription machinery.</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2010</pubdate>
            <volume>8</volume>
            <fpage>e1000269</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.1000269</pubid>
                  <pubid idtype="pmcid">2795803</pubid>
                  <pubid idtype="pmpid" link="fulltext">20076756</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Differential H3K4 methylation identifies developmentally poised hematopoietic genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Orford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kharchenko</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Dao</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Worhunsky</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Janzen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Scadden</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2008</pubdate>
            <volume>14</volume>
            <fpage>798</fpage>
            <lpage>809</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.devcel.2008.04.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">18477461</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>DNA sequence requirements for generating paused polymerase at the start of hsp70.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kraus</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Wolfner</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Lis</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>1992</pubdate>
            <volume>6</volume>
            <fpage>284</fpage>
            <lpage>295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.6.2.284</pubid>
                  <pubid idtype="pmpid" link="fulltext">1737619</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Paused Pol II captures enhancer activity and acts as a potent insulator.</p>
            </title>
            <aug>
               <au>
                  <snm>Core</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Lis</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2009</pubdate>
            <volume>23</volume>
            <fpage>1606</fpage>
            <lpage>1612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1827709</pubid>
                  <pubid idtype="pmpid" link="fulltext">19605681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Immediate mediators of the inflammatory response are poised for gene activation through RNA polymerase II stalling.</p>
            </title>
            <aug>
               <au>
                  <snm>Adelman</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Nechaev</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gilchrist</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Chinenov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rogatsky</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2009</pubdate>
            <volume>106</volume>
            <fpage>18207</fpage>
            <lpage>18212</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0910177106</pubid>
                  <pubid idtype="pmcid">2775335</pubid>
                  <pubid idtype="pmpid" link="fulltext">19820169</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Sequence-specific termination by T7 RNA polymerase requires formation of paused conformation prior to the point of RNA release.</p>
            </title>
            <aug>
               <au>
                  <snm>Song</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Genes Cells</source>
            <pubdate>2001</pubdate>
            <volume>6</volume>
            <fpage>291</fpage>
            <lpage>301</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2443.2001.00420.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11318872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Downstream DNA selectively affects a paused conformation of human RNA polymerase II.</p>
            </title>
            <aug>
               <au>
                  <snm>Palangat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hittinger</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Landick</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>341</volume>
            <fpage>429</fpage>
            <lpage>442</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.06.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15276834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Structural basis of transcription: mismatch-specific fidelity mechanisms and paused RNA polymerase II with frayed RNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Sydow</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Brueckner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Damsma</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Dengl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Vassylyev</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cramer</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2009</pubdate>
            <volume>34</volume>
            <fpage>710</fpage>
            <lpage>721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2009.06.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">19560423</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>RNA polymerase II elongation complexes paused after the synthesis of 15- or 35-base transcripts have different structures.</p>
            </title>
            <aug>
               <au>
                  <snm>Linn</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Luse</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1991</pubdate>
            <volume>11</volume>
            <fpage>1508</fpage>
            <lpage>1522</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">369435</pubid>
                  <pubid idtype="pmpid" link="fulltext">1705007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome.</p>
            </title>
            <aug>
               <au>
                  <snm>Sultan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Richard</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Magen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klingenhoff</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scherf</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seifert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Borodina</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Soldatov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Parkhomchuk</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>O'Keeffe</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yaspo</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>321</volume>
            <fpage>956</fpage>
            <lpage>960</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1160342</pubid>
                  <pubid idtype="pmpid" link="fulltext">18599741</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>RNA polymerase is poised for activation across the genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Muse</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Gilchrist</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Nechaev</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Grissom</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Adelman</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>1507</fpage>
            <lpage>1511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.2007.21</pubid>
                  <pubid idtype="pmcid">2365887</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>A clustering approach for identification of enriched domains from histone modification ChIP-Seq data.</p>
            </title>
            <aug>
               <au>
                  <snm>Zang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2009</pubdate>
            <volume>25</volume>
            <fpage>1952</fpage>
            <lpage>1958</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btp340</pubid>
                  <pubid idtype="pmpid" link="fulltext">19505939</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Germ cells are forever.</p>
            </title>
            <aug>
               <au>
                  <snm>Cinalli</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Rangan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2008</pubdate>
            <volume>132</volume>
            <fpage>559</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2008.02.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">18295574</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>The Polycomb group protein EZH2 directly controls DNA methylation.</p>
            </title>
            <aug>
               <au>
                  <snm>Vire</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Deplus</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Blanchon</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fraga</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Didelot</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Morey</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Van Eynde</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bernard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vanderwinden</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Bollen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Esteller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Di Croce</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>de Launoit</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fuks</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>439</volume>
            <fpage>871</fpage>
            <lpage>874</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04431</pubid>
                  <pubid idtype="pmpid" link="fulltext">16357870</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>PcG proteins, DNA methylation, and gene repression by chromatin looping.</p>
            </title>
            <aug>
               <au>
                  <snm>Tiwari</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>McGarvey</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Licchesi</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Ohm</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Herman</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Schubeler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baylin</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2008</pubdate>
            <volume>6</volume>
            <fpage>2911</fpage>
            <lpage>2927</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0060306</pubid>
                  <pubid idtype="pmcid">2592355</pubid>
                  <pubid idtype="pmpid" link="fulltext">19053175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Bmi1 cooperates with Dnmt1-associated protein 1 in gene silencing.</p>
            </title>
            <aug>
               <au>
                  <snm>Negishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saraya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Miyagi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nagao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Inagaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nishikawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tajima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Koseki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tsuda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takasaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nakauchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Iwama</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2007</pubdate>
            <volume>353</volume>
            <fpage>992</fpage>
            <lpage>998</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2006.12.166</pubid>
                  <pubid idtype="pmpid" link="fulltext">17214966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Epigenetic codes for heterochromatin formation and silencing: rounding up the usual suspects.</p>
            </title>
            <aug>
               <au>
                  <snm>Richards</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Elgin</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>108</volume>
            <fpage>489</fpage>
            <lpage>500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00644-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11909520</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>DNA methylation in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lyko</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ramsahoye</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>538</fpage>
            <lpage>540</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35046205</pubid>
                  <pubid idtype="pmpid" link="fulltext">11117732</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p><it>Drosophila </it>proteins related to vertebrate DNA (5-cytosine) methyltransferases.</p>
            </title>
            <aug>
               <au>
                  <snm>Hung</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Karthikeyan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koo</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Kiger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>11940</fpage>
            <lpage>11945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.96.21.11940</pubid>
                  <pubid idtype="pmcid">18391</pubid>
                  <pubid idtype="pmpid" link="fulltext">10518555</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The putative <it>Drosophila </it>methyltransferase gene dDnmt2 is contained in a transposon-like element and is expressed specifically in ovaries.</p>
            </title>
            <aug>
               <au>
                  <snm>Lyko</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Whittaker</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Orr-Weaver</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mech Dev</source>
            <pubdate>2000</pubdate>
            <volume>95</volume>
            <fpage>215</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0925-4773(00)00325-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10906465</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>The active migration of <it>Drosophila </it>primordial germ cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Jaglarz</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Howard</snm>
                  <fnm>KR</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>1995</pubdate>
            <volume>121</volume>
            <fpage>3495</fpage>
            <lpage>3503</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8582264</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Zfh-1 controls somatic stem cell self-renewal in the <it>Drosophila </it>testis and nonautonomously influences germline stem cell self-renewal.</p>
            </title>
            <aug>
               <au>
                  <snm>Leatherman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Dinardo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Cell Stem Cell</source>
            <pubdate>2008</pubdate>
            <volume>3</volume>
            <fpage>44</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.stem.2008.05.001</pubid>
                  <pubid idtype="pmcid">2601693</pubid>
                  <pubid idtype="pmpid" link="fulltext">18593558</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Bioconductor R packages</p>
            </title>
            <url>http://www.bioconductor.org</url>
         </bibl>
         <bibl id="B64">
            <title>
               <p>R programming environment</p>
            </title>
            <url>http://www.r-project.org</url>
         </bibl>
         <bibl id="B65">
            <title>
               <p>UCSC Table Browser</p>
            </title>
            <url>http://genome.ucsc.edu/cgi-bin/hgTables</url>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Dynamic regulation of nucleosome positioning in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Schones</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cuddapah</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Roh</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Barski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2008</pubdate>
            <volume>132</volume>
            <fpage>887</fpage>
            <lpage>898</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2008.02.022</pubid>
                  <pubid idtype="pmpid" link="fulltext">18329373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>DAVID 2008 informatics tools</p>
            </title>
            <url>http://david.abcc.ncifcrf.gov</url>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Gene Ontology Consortium</p>
            </title>
            <url>http://www.geneontology.org</url>
         </bibl>
      </refgrp>
   </bm>
</art>
