<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>gb-2012-13-4-r27</ui><ji>1465-6906</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>Genome-wide identification and characterization of replication origins by deep sequencing</p>
</title>
<aug>
<au id="A1"><snm>Xu</snm><fnm>Jia</fnm><insr iid="I1"/><insr iid="I2"/><email>jia.xu@umassmed.edu</email></au>
<au id="A2"><snm>Yanagisawa</snm><fnm>Yoshimi</fnm><insr iid="I3"/><email>yoshimi.yanagisawa@umassmed.edu</email></au>
<au id="A3"><snm>Tsankov</snm><mi>M</mi><fnm>Alexander</fnm><insr iid="I4"/><email>atsankov@mit.edu</email></au>
<au id="A4"><snm>Hart</snm><fnm>Christopher</fnm><insr iid="I5"/><insr iid="I8"/><email>chart@ncf.edu</email></au>
<au id="A5"><snm>Aoki</snm><fnm>Keita</fnm><insr iid="I6"/><email>keaoki@lab.nig.ac.jp</email></au>
<au id="A6"><snm>Kommajosyula</snm><fnm>Naveen</fnm><insr iid="I3"/><email>naveen_kommajosyula@dfci.harvard.edu</email></au>
<au id="A7"><snm>Steinmann</snm><mi>E</mi><fnm>Kathleen</fnm><insr iid="I5"/><insr iid="I9"/><email>ksteinmann@helicosbio.com</email></au>
<au id="A8"><snm>Bochicchio</snm><fnm>James</fnm><insr iid="I4"/><email>jboch@broadinstitute.org</email></au>
<au id="A9"><snm>Russ</snm><fnm>Carsten</fnm><insr iid="I4"/><email>carsten@broadinstitute.org</email></au>
<au id="A10"><snm>Regev</snm><fnm>Aviv</fnm><insr iid="I4"/><insr iid="I7"/><email>aregev@broadinstitute.org</email></au>
<au id="A11"><snm>Rando</snm><mi>J</mi><fnm>Oliver</fnm><insr iid="I3"/><email>oliver.rando@umassmed.edu</email></au>
<au id="A12"><snm>Nusbaum</snm><fnm>Chad</fnm><insr iid="I4"/><email>chad@broadinstitute.org</email></au>
<au id="A13"><snm>Niki</snm><fnm>Hironori</fnm><insr iid="I6"/><email>hniki@lab.nig.ac.jp</email></au>
<au id="A14" ca="yes"><snm>Milos</snm><fnm>Patrice</fnm><insr iid="I5"/><email>pmilos@helicosbio.com</email></au>
<au id="A15" ca="yes"><snm>Weng</snm><fnm>Zhiping</fnm><insr iid="I1"/><email>zhiping.weng@umassmed.edu</email></au>
<au id="A16" ca="yes"><snm>Rhind</snm><fnm>Nicholas</fnm><insr iid="I3"/><email>nick.rhind@umassmed.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA</p></ins>
<ins id="I2"><p>Bioinformatics Core Facility, University of Massachusetts Medical School, Worcester, MA 01605, USA</p></ins>
<ins id="I3"><p>Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA</p></ins>
<ins id="I4"><p>Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA</p></ins>
<ins id="I5"><p>Helicos BioSciences Corporation, Cambridge, MA 02139, USA</p></ins>
<ins id="I6"><p>Microbial Genetics Laboratory, Genetic Strains Research Center, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan</p></ins>
<ins id="I7"><p>Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, MD 20815-6789, USA</p></ins>
<ins id="I8"><p>Division of Natural Sciences, New College of Florida, Sarasota, FL 34243, USA</p></ins>
<ins id="I9"><p>Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>4</issue>
<fpage>R27</fpage>
<url>http://genomebiology.com/2012/13/4/R27</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2012-13-4-r27</pubid><pubid idtype="pmpid">22531001</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>30</day><month>11</month><year>2011</year></date></rec><revrec><date><day>27</day><month>3</month><year>2012</year></date></revrec><acc><date><day>24</day><month>4</month><year>2012</year></date></acc><pub><date><day>24</day><month>4</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>Xu et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>DNA replication initiates at distinct origins in eukaryotic genomes, but the genomic features that define these sites are not well understood.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We have taken a combined experimental and bioinformatic approach to identify and characterize origins of replication in three distantly related fission yeasts: <it>Schizosaccharomyces pombe</it>, <it>Schizosaccharomyces octosporus </it>and <it>Schizosaccharomyces japonicus</it>. Using single-molecule deep sequencing to construct amplification-free high-resolution replication profiles, we located origins and identified sequence motifs that predict origin function. We then mapped nucleosome occupancy by deep sequencing of mononucleosomal DNA from the corresponding species, finding that origins tend to occupy nucleosome-depleted regions.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The sequences that specify origins are evolutionarily plastic, with low complexity nucleosome-excluding sequences functioning in <it>S. pombe </it>and <it>S. octosporus</it>, and binding sites for trans-acting nucleosome-excluding proteins functioning in <it>S. japonicus</it>. Furthermore, chromosome-scale variation in replication timing is conserved independently of origin location and via a mechanism distinct from known heterochromatic effects on origin function. These results are consistent with a model in which origins are simply the nucleosome-depleted regions of the genome with the highest affinity for the origin recognition complex. This approach provides a general strategy for understanding the mechanisms that define DNA replication origins in eukaryotes.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>
<it>Cis</it>-acting sequences determine the location of replication origins in eukaryotic genomes <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. However, the nature of such <it>cis-</it>acting sequences is not well understood. In the budding yeast <it>Saccharomyces cerevisiae</it>, the eukaryote with the best-studied origins, two sequence characteristics are important for origin function. The first is the 17-bp autonomously replicating sequence (ARS) consensus sequence (ACS) bound by the origin recognition complex (ORC) <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. The second is a broader sequence context encompassing 200 to 300 bp that appears to be important for depleting nucleosomes from the origin <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. The ACS is not sufficient for origin function; it is present in greater than ten-fold excess over functional origins <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Moreover, the ACS does not appear to be essential for ORC function <it>in </it>
<it>vitro </it>
<abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. Nonetheless, it is required for origin function <it>in vivo </it>
<abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. Thus, it is thought that the ACS serves to increase the affinity of a subset of nucleosome-free regions (NFRs) for ORC, thereby conferring origin activity on these loci <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. In other eukaryotes, the same characteristics - nucleosome depletion and ORC affinity - are likely to be important <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. However, neither the mechanism by which nucleosomes are excluded nor the sequences that may bind ORC are known. In fact, the lack of obvious origin-specific motifs in other species and the lack of sequence-specific DNA binding of ORC <it>in vitro </it>has led to the speculation that any region of open chromatin in the genome permissive for ORC binding may function as an origin <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>.</p>
<p>In the fission yeast <it>Schizosaccharomyces pombe</it>, origins have been mapped genome-wide by a number of approaches <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>. These maps confirm earlier conclusions that origin function in <it>S. pombe </it>is conferred by high AT content <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>. <it>S. pombe </it>ORC directly binds AT-rich sequences through nine AT-hook motifs at the amino terminus of its Orc4 subunit <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. In addition, AT-rich sequences serve to exclude nucleosomes because the intrinsic stiffness of polyA energetically disfavors nucleosome formation <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp>. NFRs at <it>S. pombe </it>origins have been both predicted and observed, although due to the low resolution of existing origin maps and consequent heterogeneity in origin alignments, the averaged NFRs are broader and shallower than those in <it>S. cerevisiae </it>
<abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<p>The characterization of origin sequences in other organisms has been complicated by the difficulty of identifying origins at high resolution. ARS assays, which were essential in identifying yeast origins, have failed to identify origins in human cells <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. Human origins that have been individually mapped at high resolution have not yielded predictive sequence motifs <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>. Recent identification of <it>Drosophila melanogaster </it>origins by ORC ChIP-chip showed that the sequences that determine origin function tend to be those that exclude nucleosomes; however, no origin-specific sequences were identified <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>.</p>
<p>We have developed a generally applicable deep-sequencing-based approach to map origins at high resolution and characterize their sequence features. We applied our approach to three distantly related fission yeasts and found that although the sequences and genomic locations of origins are not conserved among the three species, sequence features that define origin function can be readily identified in each of the genomes. Although the sequence character of origins is conserved between <it>S. pombe </it>and <it>Schizosaccharomyces octosporus</it>, it has dramatically diverged in the evolution of <it>Schizosaccharomyces japonicus</it>.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Identification of origins by deep sequencing</p>
</st>
<p>To identify origins in <it>S. pombe</it>, <it>S. octosporus</it>, and <it>S. japonicus</it>, we mapped all sites with increased DNA copy number in early S phase <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B24">24</abbr>
</abbrgrp>. We synchronized cells in G2 by elutriation, allowed them to enter S phase in the presence of the ribonucleotide reductase inhibitor hydroxyurea (HU), harvested genomic DNA from the S-phase arrested cells, sheared the DNA to about 200-bp fragments, and used deep sequencing to count the frequency of each region of the genome (Figure <figr fid="F1">1</figr>). As a control for systematic biases in DNA preparation and sequencing, we also sequenced DNA from G2 cells, which have uniform DNA copy number.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Identification of replication origins by deep sequencing of hydroxyurea-arrested cells</p></caption><text>
   <p><b>Identification of replication origins by deep sequencing of hydroxyurea-arrested cells</b>. <b>(a) </b>The replication profile of a region of <it>S. pombe </it>dataset Sp1 containing three validated origins and several AT islands. The red curve shows the normalized replication fraction in HU-arrested cells. The blue dots show identified replication peaks. On the x-axis, black represents genes, orange represents AT-rich intergenes <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and green represents validated origins (collated in <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). <b>(b) </b>The difference between the G2 datasets form Sp1 and Sp2, demonstrating the magnitude of the noise in the datasets. <b>(c) </b>The replication profile of a region of <it>S. octosporus </it>dataset So1, as in (a). <b>(d) </b>The replication profile of a region of <it>S. japonicus </it>dataset Sj2 containing two cloned ARSs <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, as in (a). ORF, open reading frame.</p>
</text><graphic file="gb-2012-13-4-r27-1"/></fig>
<p>We employed Helicos single-molecule sequencing technology because it economically provides large numbers of reads without amplification and with significantly lower GC-bias than alternative sequencing technologies <abbrgrp>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp>. For <it>S. pombe</it>, we collected three pairs of datasets (each pair corresponding to S and G2 cells): two from wild-type cells with different synchronization protocols and one from a different strain background (see below). For <it>S. octosporus</it>, we collected one pair of wild-type datasets. For <it>S. japonicus</it>, we collected two wild-type S-phase datasets and one wild-type G2 dataset. For each sample, we collected an average of 18 million mappable sequence reads, which represents &gt;200-fold sampling of copy number at a 200-bp resolution (Table <tblr tid="T1">1</tblr>).</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Dataset characterization</p></caption><tblbdy cols="10">
      <r>
         <c ca="left">
            <p>
               <b>Name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Species</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Genotype</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Synchronization</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>S reads (M)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>G2 reads (M)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Number of peaks</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Median inter-peak distance</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SVM auROC</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Number of features used</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="10">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Sp1</p>
         </c>
         <c ca="left">
            <p>
               <it>S. pombe</it>
            </p>
         </c>
         <c ca="left">
            <p>Wild type</p>
         </c>
         <c ca="left">
            <p>Elutriation-HU</p>
         </c>
         <c ca="center">
            <p>9.4</p>
         </c>
         <c ca="center">
            <p>19.4</p>
         </c>
         <c ca="center">
            <p>143</p>
         </c>
         <c ca="center">
            <p>37.9</p>
         </c>
         <c ca="center">
            <p>0.81</p>
         </c>
         <c ca="center">
            <p>693</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Sp2</p>
         </c>
         <c ca="left">
            <p>
               <it>S. pombe</it>
            </p>
         </c>
         <c ca="left">
            <p>
               <it>dpf1-3A</it>
            </p>
         </c>
         <c ca="left">
            <p>Elutriation-HU</p>
         </c>
         <c ca="center">
            <p>12.0</p>
         </c>
         <c ca="center">
            <p>20.3</p>
         </c>
         <c ca="center">
            <p>160</p>
         </c>
         <c ca="center">
            <p>33.7</p>
         </c>
         <c ca="center">
            <p>0.85</p>
         </c>
         <c ca="center">
            <p>346</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Sp3</p>
         </c>
         <c ca="left">
            <p>
               <it>S. pombe</it>
            </p>
         </c>
         <c ca="left">
            <p>
               <it>cdc25-ts</it>
            </p>
         </c>
         <c ca="left">
            <p><it>cdc25 </it>release-HU</p>
         </c>
         <c ca="center">
            <p>9.7</p>
         </c>
         <c ca="center">
            <p>7.7</p>
         </c>
         <c ca="center">
            <p>387</p>
         </c>
         <c ca="center">
            <p>18.2</p>
         </c>
         <c ca="center">
            <p>0.76</p>
         </c>
         <c ca="center">
            <p>1,386</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>So1</p>
         </c>
         <c ca="left">
            <p>
               <it>S. octosporus</it>
            </p>
         </c>
         <c ca="left">
            <p>Wild type</p>
         </c>
         <c ca="left">
            <p>Elutriation-HU</p>
         </c>
         <c ca="center">
            <p>37.5</p>
         </c>
         <c ca="center">
            <p>31.3</p>
         </c>
         <c ca="center">
            <p>208</p>
         </c>
         <c ca="center">
            <p>22.1</p>
         </c>
         <c ca="center">
            <p>0.77</p>
         </c>
         <c ca="center">
            <p>346</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Sj1</p>
         </c>
         <c ca="left">
            <p>
               <it>S. japonicus</it>
            </p>
         </c>
         <c ca="left">
            <p>Wild type</p>
         </c>
         <c ca="left">
            <p>Elutriation-HU</p>
         </c>
         <c ca="center">
            <p>10.3</p>
         </c>
         <c ca="center">
            <p>24.5</p>
         </c>
         <c ca="center">
            <p>526</p>
         </c>
         <c ca="center">
            <p>14.2</p>
         </c>
         <c ca="center">
            <p>0.86</p>
         </c>
         <c ca="center">
            <p>1,386</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Sj2</p>
         </c>
         <c ca="left">
            <p>
               <it>S. japonicus</it>
            </p>
         </c>
         <c ca="left">
            <p>Wild type</p>
         </c>
         <c ca="left">
            <p>Elutriation-HU</p>
         </c>
         <c ca="center">
            <p>9.6</p>
         </c>
         <c ca="center">
            <p>24.5</p>
         </c>
         <c ca="center">
            <p>542</p>
         </c>
         <c ca="center">
            <p>13.8</p>
         </c>
         <c ca="center">
            <p>0.89</p>
         </c>
         <c ca="center">
            <p>693</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>To produce replication profiles for each species, we normalized the read frequency in the S and G2 datasets, subtracted the G2 background from the S-phase signal, and smoothed the resulting curves (see Materials and methods). The replication peaks of <it>S. pombe </it>and <it>S. japonicus </it>are about 10 kb wide, consistent with previous estimates of fork progression in HU arrested cells (Figure <figr fid="F1">1</figr>) <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>. The replication peaks of <it>S. octosporus </it>are more than twice as wide, perhaps reflecting a more heterogeneous arrest in HU.</p>
<p>We found that many of the peaks in the <it>S. pombe </it>Sp1 and Sp2 datasets were notched, presumably due to loss of bubble-shaped replication intermediates during our sample preparation. To compensate for the notched peaks, we identified peaks in all of our replication profiles using a template fitting approach that predicts where the peaks would be if they were not notched (Figure <figr fid="F1">1</figr>; Figure S1 in Additional file <supplr sid="S1">1</supplr>). To estimate the sensitivity of our peak detection, we compared two independent <it>S. pombe </it>G2 samples from the datasets Sp1 and Sp2 (Figure <figr fid="F1">1b</figr>). The difference between the two datasets, which reflects the experimental noise in the data, has a distribution of 1.00 &#177; 0.145. We therefore restricted our analysis to peaks of heights greater than 1.30, or about 2 standard deviations above the baseline. To confirm the reproducibility of our peaks, we compared independent datasets from both <it>S. pombe </it>and <it>S. japonicus </it>and found a strong correlation between peak locations in the datasets (<it>P </it>&lt; 10<sup>-10</sup>; Figure S2 in Additional file <supplr sid="S1">1</supplr>).</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Supplemental Figures S1 to S6</b>. Figure S1: Peak detection by template fitting. Examples of the template-fitting approach to peak detection. The blue curve is the normalized, smoothed data. The blue dot is the peak call before template fitting. The red curve is the template. The red dot is the peak call after template fitting. The numbers are serial numbers assigned to the peaks during the iterative peak-calling process. Peak 106 can be seen at about 4.27 Mb in Figure <figr fid="F1">1</figr> and Figure S2 in Additional file <supplr sid="S1">1</supplr>. Figure S2: Comparison of independent replication profiles. Replication profiles from <b>(a) </b>Sp1, Sp2 and Sp3 and <b>(b) </b>Sj1 and Sj2 are compared as in Figure <figr fid="F2">2</figr>. <b>(c) </b>Venn diagrams of peak overlap between the indicated datasets. Most cases of non-overlapping peak calls are due to the peak in one of the datasets being below the cutoff, such as at 4.23 Mb is <it>S. pombe </it>and 1.55 Mb is <it>S. japonicus</it>. Figure S3: The Sap1 binding site. Logos for the Sap1 binding site derived from <b>(a) </b>MEME analysis of <it>S. japonicus </it>origins (Figure <figr fid="F3">3</figr>; Table S6 in Additional file <supplr sid="S2">2</supplr>) and <b>(b) </b>
<it>in vitro </it>selection <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp>. Figure S4: Orc2 and Orc4 domain structures. Domain structures of Orc2 and Orc4 as defined by PFAM <abbrgrp>
<abbr bid="B55">55</abbr>
</abbrgrp>. Figure S5: Origin motifs are nucleosome depleted are origin and non-origin sites. Nucleosome occupancy over motifs is depicted as in Figure <figr fid="F4">4</figr>, except motifs are divided into those within 1 kb of a replication peak and those farther away. Figure S6: Nucleosome alignment on transcriptional start sites. Nucleosome occupancy over all annotated transcriptional start sites is depicted.</p>
</text>
<file name="gb-2012-13-4-r27-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>To validate our replication peaks, we compared our data to confirmed origins. For <it>S. pombe</it>, we examined 37 origins that were previously shown to be active <it>in vivo </it>by two-dimensional gel analysis (collated in <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>) and have mappable reads in our datasets. Of the 37 origins, 22 overlap with an identified replication peak in our data sets; 8 are less than 1 kb away and 4 are between 1 and 2.5 kb away; for 3 we see no replication peak in the region (Figure <figr fid="F1">1a</figr>; Table S2 in Additional file <supplr sid="S2">2</supplr>). Overall, the enrichment of origins near replication peaks is highly significant (<it>P </it>= 4.41 &#215; 10<sup>-09</sup>). We also compared our replication peaks to AT islands, intergenes of high AT content, which are strong predictors of origin function in <it>S. pombe </it>
<abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>. We find a significant (<it>P </it>&lt; 10<sup>-10</sup>), but not perfect, overlap of AT islands with our replication peaks (Figure <figr fid="F1">1a</figr>), consistent with our sequence analysis of origin motifs (see below). We also see comparable overlap between our replication peaks and published <it>S. pombe </it>origin maps (see Materials and methods).</p>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Supplemental Tables S1 to S8</b>. Table S1: dataset analysis parameters. Table S2: validated <it>S. pombe </it>origins. Table S3: <it>S. japonicus </it>ARSs. Table S4: <it>k</it>-mer auROC scores. Table S5: cross-species SVM performance. Table S6: MEME results. Table S7: dataset overlap. Table S8: accession numbers.</p>
</text>
<file name="gb-2012-13-4-r27-S2.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>For <it>S. japonicus</it>, we compared our data to 11 ARSs isolated from a genomic library <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>. Six of the ARSs encompass a replication peak and four others were within 2.5 kb of a replication peak (<it>P </it>= 0.012); for one ARS, we found no associated replication peak (Figure <figr fid="F1">1d</figr>; Table S3 in Additional file <supplr sid="S2">2</supplr>). These results suggest that our replication peaks correspond to <it>in vivo </it>origins of replication.</p>
</sec>
<sec>
<st>
<p>Genome scale origin distribution</p>
</st>
<p>We compared the locations of replication peaks within syntenic regions among the three species and found no significant conservation of origin location (<it>P </it>= 0.22), consistent with the lack of conservation of origin locations among budding yeast species of similar evolutionary distance <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>.</p>
<p>Nonetheless, genome-scale patterns of origin firing are conserved between <it>S. pombe </it>and <it>S. octosporus </it>(Figure <figr fid="F2">2</figr>). The replication peaks on chromosomes 1 and 2 of these species are greatly enriched in the central, centromere-containing regions of the chromosomes and much diminished on the arms; in contrast, replication peaks are uniformly distributed across chromosome 3. Early firing of <it>S. pombe </it>centromeric origins has been reported <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>. This early firing is due to an interaction between the Dfp1-dependent kinase (DDK; the <it>S. pombe </it>homolog of the Cdc7-Dbf4 replication kinase) with the Swi6 heterochromatin-binding protein (the <it>S. pombe </it>homolog of the HP1 heterochromatin-binding protein) <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. The Swi6-binding motif in Dfp1 is conserved in <it>S. octosporus</it>, which also shows a bias towards chromosome centers, but not in <it>S. japonicus</it>, which does not (Figure <figr fid="F2">2b</figr>). To test if the Dpf1-Swi6 interaction is required for the centromere bias, we mapped origins in an <it>S. pombe </it>strain carrying a mutant Dfp1 that does not interact with Swi6 <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. The mutant does not affect the centromere bias (Figure <figr fid="F2">2a</figr>), consistent with the observation that the bias extends hundreds of kilobases from the centromeres, much farther than heterochromatin does, and encompasses all of chromosome 3.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Genome-wide patterns of origin firing</p></caption><text>
   <p><b>Genome-wide patterns of origin firing</b>. <b>(a) </b>The extent of replication in HU-arrested cells is depicted in red. The locations of centromeres are indicated by blue dots. <b>(b) </b>The Swi6(HP1)-binding region of Dfp1 from <it>S. pombe</it>, <it>S. octosporus </it>and <it>S. japonicus</it>. The <it>S. pombe </it>Swi6-binding motif is in bold. The three amino acids mutated in <it>dfp1-3A </it>are indicated by asterisks. Amino acids similar in two of the three species are boxed in gray. The presence of the <it>dfp1-3A </it>allele inHM1826 was confirmed from the genomic sequence used to generate the replication profiles. Wt, wild type.</p>
</text><graphic file="gb-2012-13-4-r27-2"/></fig>
<p>The central regions of chromosomes 1 and 2 have been previously reported to have more efficient origins than their arms <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>, but the effect is not as strong as the bias seen in our datasets. We investigated experimental factors that might account for this difference in the distribution of origin firing. Previous experiments had synchronized cells in G2 by inactivating a temperature-sensitive (ts) allele of the Cdc25 mitotic activator and then releasing the blocked culture into a synchronous cell cycle. This protocol arrests cells for up to 4 hours in G2 and causes them to grow to as much as twice their normal size before dividing. In contrast, we had used centrifugal elutriation to select a synchronous population of G2 cells from an asynchronous culture, a protocol that does not involve arresting the cell cycle. We repeated our origin mapping using a <it>cdc25-ts </it>block-and-release approach, and confirmed that origin firing is much more uniformly distributed across the genome (Figure <figr fid="F2">2</figr>). Nonetheless, the same origins are used in the two conditions (Figure S2 in Additional file <supplr sid="S1">1</supplr>). This result suggests that a prolonged G2 arrest causes origin firing potential to be more uniformly distributed.</p>
</sec>
<sec>
<st>
<p>Characterization of origin sequences</p>
</st>
<p>Although origin locations have been identified in a wide variety of eukaryotes, the sequence characteristics that confer origin activity to these loci have only been elucidated in <it>S. cerevisiae </it>and <it>S. pombe </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. We used two computational approaches to analyze the sequence characteristics of the origins we identified in the three <it>Schizosaccharomyces </it>species. The first approach used a support vector machine (SVM), a machine-learning algorithm that tests the ability of <it>k</it>-mer sequences (<it>k </it>= 1 to 6) to predict origin function (see Materials and methods) <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>. The results of such tests can be analyzed with a receiver operating characteristic (ROC) curve, which plots true-positive rate (sensitivity) as a function of false-positive rate (1 minus specificity). The area under the ROC curve (auROC) demonstrates the performance of the SVM model, with 1.0 being perfect and 0.5 being no better than random. auROC thus serves as a metric of the predictive power of the model.</p>
<p>We performed SVM analysis on all data sets (Figure <figr fid="F3">3</figr> and Table <tblr tid="T1">1</tblr>; Tables S4 and S5 in Additional file <supplr sid="S2">2</supplr>). For all three species, we are able to predict origins with high sensitivity and specificity, as demonstrated by auROC scores ranging from 0.76 to 0.89 (Table <tblr tid="T1">1</tblr>). To determine which sequences are the most predictive of origin function, we examined the auROC scores for individual <it>k</it>-mers. As expected for <it>S. pombe</it>, the AT richness is a strong predictor of origin function, as demonstrated by the high predictive value of the A nucleotide (auROC = 0.62 to 0.79; Table S4 in Additional file <supplr sid="S2">2</supplr>). However, simple AT richness is not as strong a predictor as polyA tracts (auROC up to 0.86), consistent with the fact that the AT-hook motif, of which <it>S. pombe </it>Orc4 has nine (Figure S4 in Additional file <supplr sid="S1">1</supplr>), binds polyA. Interestingly, the sequence character of the origins used in the <it>cdc25-ts</it>-synchronized culture is less biased towards strictly polyA and is less predictive of origin function than that of the unarrested, wild-type culture (Table <tblr tid="T1">1</tblr>; Table S4 in Additional file <supplr sid="S2">2</supplr>). This change in the sequence character of the origins used in the <it>cdc25-ts</it>-synchronized cells is consistent with a wider range of less well-defined origins being used in these experimental conditions. <it>S. octosporus </it>origins have similar sequence character, with polyA tracts dominating the SVM signal; however, the auROC scores are significantly lower than those for <it>S. pombe</it>, possible due to the lower resolution of the data set (Figures <figr fid="F2">2</figr> and Figure <figr fid="F4">4a</figr>, Table <tblr tid="T1">1</tblr>; Table S4 in Additional file <supplr sid="S2">2</supplr>).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Sequence characteristics of fission yeast origins</p></caption><text>
   <p><b>Sequence characteristics of fission yeast origins</b>. <b>(a) </b>Each <it>k</it>-mer (<it>k </it>= 1 to 6) is plotted on origin prediction (auROC) versus origin enrichment (log frequency ratio). <it>k</it>-mers of only A or T are red and those of only G or C blue; polyA <it>k</it>-mers are solid red and polyG <it>k</it>-mers blue and labeled. Complementary sequences are not distinguished (that is, 6A is equivalent to 6T). Motifs identified by MEME are represented as diamond symbols. For MEME motifs identified in more than one species, the species of origin is indicated. <b>(b) </b>Logos of the MEME motifs included in (a). For MEME motifs identified in more that one species, a representative logo is shown.</p>
</text><graphic file="gb-2012-13-4-r27-3"/></fig>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Fission yeast origins are depleted of nucleosomes</p></caption><text>
   <p><b>Fission yeast origins are depleted of nucleosomes</b>. <b>(a) </b>Average nucleosome occupancy for all replication peaks relative to the genome average is plotted log<sub>2</sub>. <b>(b) </b>Average nucleosome occupancy of the NFRs nearest to each replication peak is plotted in red; the nucleosome occupancy of the NFRs nearest to an equal number of random non-origin locus is plotted in black. <b>(c) </b>Average nucleosome occupancy over all instances of each of the identified origin motifs relative to the genome average is plotted log<sub>2</sub>. The logo for each motif is shown above.</p>
</text><graphic file="gb-2012-13-4-r27-4"/></fig>
<p>Surprisingly, AT content is not a positive predictor of origin function in <it>S. japonicus</it>. In fact, it is a strong negative predictor of origin function (auROC = 0.15). Instead, <it>S. japonicus </it>origins are characterized by high GC content and a variety of GC-rich <it>k</it>-mers. Interestingly, A is a much stronger negative predictor of origin function than 6A, which is a weakly positive character (auROC = 0.54). This result suggests that although <it>S. japonicus </it>origins are depleted in AT content, they maintain a higher relative polyA content, consistent with the existence of AT-hooks on <it>S. japonicus </it>Orc2 and Orc4 (Figure S4 in Additional file <supplr sid="S1">1</supplr>).</p>
<p>As expected, SMVs trained on one data set from a species performed well on other datasets from that species (Table S5 in Additional file <supplr sid="S2">2</supplr>). Conversely, the SMVs trained on <it>S. pombe </it>perform poorly on <it>S. japonicus</it>, which has very different origin characteristics, and vice versa. Interestingly, the <it>S. pombe </it>SVMs also performed poorly on <it>S. octosporus</it>, and vice versa. Therefore, it appears that even though <it>S. pombe </it>and <it>S. octosporus </it>have qualitatively similar origin characteristics, there are quantitative differences in their origin sequences.</p>
<p>Our second approach to characterizing origin sequence used the <it>ab initio </it>motif discover algorithm MEME, which is capable of identifying the enrichment of motifs that are longer than six residues. As for the SVM analysis, we used 1 kb from either side of the replication peaks as the positive loci. However, because fission yeast origins are generally intergenic <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>, our positive loci are enriched for promoter sequences. To avoid training on promoter sequences and thus inadvertently identifying promoter motifs, we masked coding sequence from both our positive and negative loci. We performed the analysis using two background models. The first approach uses only the mono-nucleotide composition of the input loci as background, and finds motifs that are enriched relative to that bias. This approach finds motifs that are unexpectedly frequent in the input loci, but does not compare the frequency between positive and negative loci. The second approach uses a fifth-order hidden Markov model of the background in the negative loci, which reduces the background noise and allows for more rigorous identification of enriched motifs. We initially limited our search to motifs of less than 17 bases, to increase sensitivity. No additional motifs were found upon extending the search space to 50 bases.</p>
<p>In both <it>S. pombe </it>and <it>S. octosporus</it>, the motifs identified by MEME using both background models reflected the polyA-rich nature of origins seen in the SVM analysis (Figure <figr fid="F3">3</figr>; Table S6 in Additional file <supplr sid="S2">2</supplr>). In addition, a polyAG motif was frequently found in <it>S. pombe </it>origins (Table S6 in Additional file <supplr sid="S2">2</supplr>).</p>
<p>In <it>S. japonicus</it>, using the mono-nucleotide composition background, we find polyA motifs, suggesting that origins have significant enrichment of polyA tracts above that expected from their AT content, consistent with the SVM results (Figure <figr fid="F3">3</figr>; Table S4 in Additional file <supplr sid="S2">2</supplr>). However, using the hidden Markov model background model, we find no polyA motifs, suggesting that although intergenes in general tend to have high numbers of polyA tracts, those tracts do not directly determine origin function. Instead, we find three highly enriched motifs (Figure <figr fid="F3">3</figr>): polyG, a head to tail repeat of CTCGCT and the binding site for the Sap1 protein (Figure S3 in Additional file <supplr sid="S1">1</supplr>), with 87% of all replication peaks containing at least one of the motifs. polyG motifs are known to exclude nucleosomes - like polyA, due to the intrinsic stiffness of the sequence <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp> - and Sap1 has been shown to exclude nucleosomes in <it>S. pombe </it>
<abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, suggesting that these motifs may function to deplete origins of nucleosomes in <it>S. japonicus</it>.</p>
</sec>
<sec>
<st>
<p>Nucleosome localization at origins</p>
</st>
<p>Previous work has demonstrated that origins tend to be nucleosome depleted <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B9">9</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>. To investigate the nucleosome occupancy of origins in the three fission yeast, we mapped mono-nucleosomes by deep sequencing of all three species. Average nucleosome occupancy profiles centered on replication peaks from <it>S. pombe </it>and <it>S. japonicus </it>show them to be nucleosome depleted, as expected (Figure <figr fid="F4">4a</figr>). The replication peaks for <it>S. octosporus </it>show no significant nucleosome depletion (data not shown), possibly due to the lower resolution of that replication data set. To compensate for the fact that our replication peaks are not perfectly aligned with their underlying origins, we identified the nearest NFR to each replication peak and to an equal number of control sites (Figure <figr fid="F4">4b</figr>). Average nucleosome occupancy profiles centered on origin-proximal NFRs tend to be deeper and broader than nucleosome profiles centered on origin-distal NFRs (<it>P </it>&lt; 5 &#215; 10<sup>-3</sup>). Note that both the origin NFRs and non-origin NFRs in <it>S. pombe </it>tend to be more nucleosome depleted than in <it>S. japonicus</it>. In <it>S. octosporus</it>, the non-origin NFRs are similar to those in <it>S. pombe</it>, but the <it>S. octosporus </it>origin NFRs appear to be less depleted than the origin NFRs in <it>S. pombe</it>, possibly because the lower resolution of the data set caused some of the replication peaks to be assigned to the wrong NFR.</p>
<p>Several of the identified sequence motifs associated with origins, in particular polyA and Sap1, are known to exclude histone binding <abbrgrp>
<abbr bid="B20">20</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>. To investigate the roles of other origin-associated motifs, we examined their <it>in vivo </it>nucleosome occupancy. The motifs tend to be nucleosome depleted in all three species, with three exceptions (Figure <figr fid="F4">4c</figr>). The first exception is the CTCGCT motif in <it>S. pombe </it>and <it>S. octosporus</it>, which is largely absent from these genomes. The second exception is polyG, which is not nucleosome depleted in <it>S. pombe </it>and <it>S. octosporus</it>. Although it seems counter-intuitive that a sequence believed to exclude nucleosomes because of its intrinsic stiffness should show species-specific effects, such effects have been seen before, due to the fact that polyG motifs in relatively G-rich genomes (like <it>S. japonicus</it>) tend to be embedded in longer G-rich sequences <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. The third exception is the lack of nucleosome exclusion for polyAG in <it>S. octosporus</it>. This result is consistent with the fact the polyAG is not enriched at <it>S. octosporus </it>origins and may reflect a different dinucleotide bias between the <it>S. pombe </it>and <it>S. octosporus </it>polyAG sequences.</p>
<p>To further investigate the roles of the identified origin motifs in nucleosome exclusion, we compared the nucleosome occupancy of motifs at origins and non-origin loci. The motifs all tend to be as nucleosome depleted at non-origin loci as they are at origins (Figure S5 in Additional file <supplr sid="S1">1</supplr>). This result suggests that the motifs exclude nucleosomes, as opposed to being nucleosome depleted because of their enrichment in origins. Therefore, the sequences that characterize origins in fission yeast appear to fulfill two roles: excluding nucleosomes and binding ORC. This conclusion is consistent with a general model for eukaryotic origins in which origins are simply the NFRs with the highest affinity ORC binding sites.</p>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<p>We have used a combination of experimental and bioinformatic approaches to identify and characterize replication origins in three distantly related fission yeast. We find that origins are plastic genetic elements, lacking conservation of primary sequence or location. Nonetheless, origins maintain conserved sequence characteristics that reflect the need for two origin functions: exclusion of nucleosomes and binding of ORC. Moreover, we have identified sequence determinants that predict origin function. Surprisingly, the sequences predictive of origin function are qualitatively different between <it>S. pombe </it>and <it>S. japonicus</it>, suggesting that not only are specific origins evolutionarily ephemeral but that the mechanisms that define origins are not strongly constrained.</p>
<p>Analysis of origin locations relative to conserved genes in the three species show that origins are not conserved. Origin locations are conserved amongst the much more closely related budding yeast of the <it>Saccharomyces </it>sensu stricto clade <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>. However, even in that study, origin heterogeneity was observed. Origin locations are not conserved between <it>Saccharomyces cerevisiae </it>and <it>Kluyveromyces lactis </it>
<abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>, which are less closely related (54.8% average amino acid identity) than <it>S. pombe </it>and <it>S. octosporus </it>(65.5% average amino acid identity) <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. In contrast to the rapid divergence of origins within the <it>Schizosaccharomyces </it>clade, coding gene order and structure are strongly conserved; the median block of conserved synteny between <it>S. pombe </it>and <it>S. octosporus </it>is 65 kb (43 genes) and 81% of genes in the clade have conserved exon structure <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. These results suggest that origins are dynamic genetic elements, any one of which is under weak evolutionary constraint.</p>
<p>In contrast to the lack of conservation of specific origins, the chromosome-wide pattern of origin firing is conserved between <it>S. pombe </it>and <it>S. octosporus</it>. In both species, early firing origins are concentrated in the central regions of the two larger chromosomes but spread across the smaller third chromosome (Figure <figr fid="F2">2</figr>). However, because of the numerous chromosomal rearrangements between the two species <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>, this conserved chromosomal pattern does not imply any conservation of replication timing between the genes of the two species. <it>S. pombe </it>centromeres replicate early <abbrgrp>
<abbr bid="B30">30</abbr>
<abbr bid="B31">31</abbr>
</abbrgrp>; however, the early replicating regions seen in our data extend hundreds of kilobases beyond the centromeric heterochromatin (Figure <figr fid="F2">2</figr>). Furthermore, the interaction between the DDK replication kinase and centromeric heterochromatin required for the early replication of <it>S. pombe </it>centromeres <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> is not required for the chromosome-wide pattern of origin firing we see in <it>S. pombe </it>(Figure <figr fid="F2">2</figr>). These results suggest that the distribution of origin firing is regulated by factors other than centromeric heterochromatin.</p>
<p>The difference in origin-firing distribution along chromosomes is reduced in magnitude when cells are arrested in G2 before replication (Figure <figr fid="F2">2</figr>). A similar effect is seen when cells are arrested in metaphase <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Since both G2 and metaphase arrests lead to a similar redistribution of origin-firing timing, we suspect that the effect is not due to the specific phase of the cell-cycle arrest. In fact, since origins are not licensed in G2, we suspect it is not a direct effect on origins at all. Instead, we speculate that it is the increased size of the cells during the arrest that affects origin firing. Origin efficiency in <it>S. pombe </it>is regulated by the rate limiting factors Dfp1 (the regulatory subunit of the DDK replication kinase) and Cdc45 (an origin initiation factor) <abbrgrp>
<abbr bid="B36">36</abbr>
<abbr bid="B37">37</abbr>
</abbrgrp>. Since cells continue to grow while arrested, these proteins will continue to accumulate. Upon release, the excess of these limiting factors may allow normally less efficient origins to compete more efficiently for firing. A similar situation is observed in human cells, in which synchronization by release from G1 or S block allows for the activation of normally inefficient or dormant origins <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>.</p>
<p>The sequences that predict origin function in the <it>Schizosaccharomyces </it>clade appear to have two functions: ORC binding and nucleosome exclusion. All three species have AT hook domains on their Orc4 subunits (Figure S4 in Additional file <supplr sid="S1">1</supplr>), which, in <it>S. pombe</it>, direct ORC binding to polyA sequences <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. As expected, origins in all three species are enriched for polyA motifs (Figure <figr fid="F3">3</figr>; Tables S4 and S5 in Additional file <supplr sid="S2">2</supplr>). In <it>S. pombe </it>and <it>S. octosporus</it>, origins are defined by polyA motifs <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>, which not only bind ORC but also intrinsically exclude nucleosomes, due to the biophysical stiffness of polyA sequence <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>. The lack of specific origin motifs in <it>S. pombe </it>is consistent with the finding that synthetic A-rich sequences confer origin function <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. In contrast, in <it>S. japonicus</it>, polyA is not predictive of origin function, suggesting it acts like the ACS in <it>S. cerevisiae</it>, which is required to bind ORC, but only functions in an NFR context <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. It is interesting to note that <it>S. japonicus </it>has a single amino-terminal AT hook on Orc2, a configuration shared with <it>S. cerevisiae</it>, but not with the other fission yeast. This evolutionary distribution suggests that the Orc2 AT hook is ancestral and was lost in the <it>S. pombe</it>-<it>S. octosporus </it>clade. Likewise, the presence of Orc4 AT hooks in <it>S. japonicus </it>suggests that these motifs arose before AT motifs evolved to define origins, allowing evolution of AT-rich origins and the loss of other origin defining motifs in the <it>S. pombe</it>-<it>S. octosporus </it>clade.</p>
<p>Instead of using polyA sequences to exclude nucleosomes, <it>S. japonicus </it>origins are defined by other nucleosome-excluding sequences (Figure <figr fid="F3">3</figr>): polyG, the Sap1 binding site and a CTCGTC motif. polyG motifs exclude nucleosomes, presumably due to the intrinsic stiffness of the sequence <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>. polyG motifs are nucleosome depleted in other yeast <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, but not in <it>S. pombe </it>or <it>S. octosporus </it>(Figure <figr fid="F4">4c</figr>). This discrepancy is likely due to the <it>S. japonicus </it>polyG motifs being embedded in larger G-rich stretches, an effect seen within the budding yeast clade <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. polyG is also enriched in human origins <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp> and is anti-nucleosomal in <it>C. elegans </it>
<abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, suggesting that polyG sequences may have a general role in excluding nucleosomes from origins in relatively G-rich metazoan genomes.</p>
<p>Sap1 is a <it>trans</it>-acting nucleosome-excluding factor <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp> with binding sites enriched at <it>S. japonicus </it>origins (Figure <figr fid="F3">3</figr>; Table S6 in Additional file <supplr sid="S2">2</supplr>). The fact that Sap1 is strongly anti-nucleosomal in <it>S. pombe </it>and <it>S. octosporus </it>
<abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp> (Figure <figr fid="F4">4c</figr>; Figure S5 in Additional file <supplr sid="S1">1</supplr>) but not enriched in <it>S. pombe </it>or <it>S. octosporus </it>origins (Figure <figr fid="F3">3a</figr>) suggests that Sap1 is a general nucleosome excluding factor, consistent with its roles in a number of other functions in <it>S. pombe </it>
<abbrgrp>
<abbr bid="B41">41</abbr>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
</abbrgrp>. Sap1 sites are found primarily in origins in <it>S. japonicus </it>(113/131, 86%), but not in <it>S. pombe </it>or <it>S. octosporus </it>(12/45, 27%). Interestingly, one of Sap1's essential functions in <it>S. pombe </it>involves Cbf1 <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>, a protein specific to the <it>S. pombe</it>-<it>S. octosporus </it>clade <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>, suggesting that Sap1 may have evolved new functions in the <it>S. pombe</it>-<it>S. octosporus </it>clade. However, Sap1 still retains a role in replication in <it>S. pombe </it>
<abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>, even if it no longer defines replication origins.</p>
<p>The CTCGCT motif is also likely to act by binding a <it>trans</it>-acting factor. The fact that the motif is essentially absent from the <it>S. pombe</it>-<it>S. octosporus </it>clade suggests that it is bound by an <it>S. japonicus</it>-specific nucleosome-excluding factor.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Our comparative analysis of origins in fission yeast has allowed us to identify the sequence characteristics that define origin function in these species. Our results are consistent with a general model for eukaryotic origin function in which origins are simply the nucleosome-free regions in the genome with the highest affinity for ORC <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. In metazoans, where ORC appears to have little, if any, sequence specificity <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>, this model suggests that any NFR could act as an origin, consistent with the distributed initiations seen at some loci <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp> and the correlation between origins and promoters <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B40">40</abbr>
</abbrgrp>, which are also nucleosome free. Furthermore, the model suggests that origins are not well-defined genetic elements, but simply the highest affinity ORC binding sites available in the genome. As long as there are no large regions of the genome devoid of ORC binding sites, the particular location or characteristics of these sites may not be important.</p>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>General methods</p>
</st>
<p>All three species were grown in rich medium (YES) and manipulated using standard fission yeast protocols <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp>. The strains used were yFS101 (<it>S. pombe h-</it>), HM1826 (<it>S. pombe h+ nmt1-TK dfp1-3A::kanMX6</it>) <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>, yFS128 (<it>S. pombe h- leu1-32 ura4-D18 cdc25-22</it>), yFS286 (<it>S. octosporus</it>), and yFS275 (<it>S. japonicus</it>).</p>
</sec>
<sec>
<st>
<p>DNA sequencing</p>
</st>
<p>DNA sequencing and sequence data analysis were performed as previously described <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. We used the January 2007 assembly of the <it>S. pombe </it>genome <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp> and SO3 and SJ1 assemblies of the <it>S. octosporus </it>and <it>S. japonicus </it>genomes <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. Reads that mapped to multiple locations of the genome were discarded. Post analysis, <it>S. japonicus </it>coordinates were mapped to genome assembly SJ4, the current release.</p>
</sec>
<sec>
<st>
<p>Construction of genome-wide replication profiles</p>
</st>
<p>To construct replication profiles, we extended the reads by 100 bp in the 3' direction, because the average length of DNA fragments was 200 bp. Then we smoothed the data using a 200-bp sliding window with a 20-bp step. We excluded regions of <it>S. octosporus </it>(contig6:1-400000, contig2:1-5600, contig7:766887-886887) and <it>S. japonicus </it>(contig1:1-62000, contig3:1-74000, contig3:1776000-1809320, contig4:1-16000) that had anomalously high read counts, presumably due to under-representation of repetitive sequences in the genome assemblies. To account for different sequencing depths in the various datasets, we normalized each dataset so that the mean across the genome was 1. Using flow cytometry analysis of percentage of the genome replicated in the S phase samples (Table S1 in Additional file <supplr sid="S2">2</supplr>), we applied the following formula to combine the S and G2 datasets into replication profiles: <it>R </it>= <it>S </it>&#215; (1 + <it>n</it>%) - <it>G2 </it>+ 1. Finally, we performed LOESS smoothing (Igor PRO v6.12, WaveMetrics (Lake Oswego, OR USA) on the replication profile <it>R </it>using empirically determined smoothing factors (Table S1 in Additional file <supplr sid="S2">2</supplr>).</p>
</sec>
<sec>
<st>
<p>Replication origin peak detection</p>
</st>
<p>We used the MATLAB ipeak program (MathWorks (Natick, MA USA) to detect peaks in each replication profile <it>R</it>. The ipeak algorithm first looks for downward zero crossings in smoothed first derivative, and then detects peaks by least square curve fitting. The ipeak parameters used were AmpT = 1, SlopeT = 0, SmoothW = 58, FitW = 58 for <it>S. pombe</it>, AmpT = 1, SlopeT = 0, SmoothW = 50, FitW = 50 for <it>S. octosporus</it>, and AmpT = 1, SlopeT = 0, SmoothW = 10, FitW = 10 for <it>S. japonicus</it>. We set the peaks to be the highest positions in <it>R </it>within a &#177; 500-bp window around the peak locations detected by ipeak. Peaks with anomalous heights (greater than 3 in <it>S</it>. <it>pombe </it>and <it>S. octosporus </it>or 10 in <it>S</it>. <it>japonicus</it>) were discarded; these regions are presumed to correspond to unannotated repetitive regions. We used the Genomic Signal Aggregator <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> to generate an average peak profile of all peaks detected by ipeak in the entire genome. The center of the average peak profile was then fitted to a Gaussian distribution, which was then used serve as a peak template for refining peak calling.</p>
<p>In order to remove split peaks and sub-peaks, we used the following strategy. First we sorted all peaks detected by the ipeak algorithm by their amplitudes. Then we went down the list and performed the following steps: 1) we scaled the peak template to the same height as the data peak under investigation and positioned the template such that its highest point coincides with the highest point of the data peak; 2) we slid the peak template in a &#177; 250-bp window to search for the position that would lead to maximal correlation between the peak template and the data peak; 3) we searched in the height dimension with a step of 0.01 for minimal root mean square deviation (RMSD) between the peak template and the data peak. To avoid noise in the data, the correlation and RMSD were computed only for the region that was within 60% of the highest point of the data peak. We assigned the peak location and height to those of the resulting template, saved the information, then subtracted the newly detected peak from the replication profile <it>R </it>so that minor side peaks would be ignored. We iterated this procedure until all peaks above the peak threshold had been detected.</p>
</sec>
<sec>
<st>
<p>Validation of origin peak detection</p>
</st>
<p>We compared our data to published <it>S. pombe </it>origin lists compiled by three different approaches: MCM ChIP-chip mapping, a microarray-based, HU-arrest, copy-number strategy similar to ours, and a microarray-based mapping of single-stranded DNA in HU arrested cells <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp>. We calculated the statistical significance of the overlap between these origin lists (the test sets) and two well-defined origin lists (the validation sets): a list of origins confirmed by two-dimensional gels (collated in <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>) and a list of AT-rich intergenes that are strongly correlated with origins <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>. We varied the resolution for the test sets, from 500 to 5,000 bp, by varying the distance from a validation set origin that a test set origin could be and still be considered to overlap it. We reasoned that the statistical significance of the overlap between the test and validation sets would increase as the size of the test sets increased up to the resolution of the test set. After that point the significance would decrease because the test set would get bigger without improving the overlap. By this analysis, our data are of similar resolution, between about 1 and 3 kb, to the other HU-arrest approaches (see Materials and methods; Table S7 in Additional file <supplr sid="S2">2</supplr>), which presumably reflects the resolution of the HU-arrested replication bubbles. The overlap between our replication peaks is comparable to previously published origin prediction, with approximately 50% overlap between any pairwise comparisons of datasets (Figure S2 in Additional file <supplr sid="S1">1</supplr>).</p>
</sec>
<sec>
<st>
<p>Motif finding</p>
</st>
<p>We used MEME v4.4 to find sequence motifs <it>de novo </it>in the replication origin peaks with parameters nmotifs = 30, minsites= 20, minw = 11, maxw = 17 <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp>. MEME is an expectation-maximization algorithm that identifies motifs that are enriched in a positive set of sequences, in our case the sequences from 1 kb on either side of the replication peaks, compared with a background set of sequences, which were a set of sequences from 1 kb on either side of loci chosen randomly from the genome excluding 8 kb around the peaks taller than 1.1, 1.14, 1.18, 1.2, 1.2, and 1.25 for Sp1, Sp2, Sp3, So1, Sj1, and Sj2, respectively. We ran the algorithm with two background models: fifth-order Markov model (defined as frequencies of all 6-mers) or simply mononucleotide frequencies. If two neighboring peaks were closer than 2 kb, the overlapping sequence was counted only once. We only used the intergenic portions for both positive and background sequences.</p>
</sec>
<sec>
<st>
<p>SVM peak prediction</p>
</st>
<p>In order to predict replication origins from sequence features, we constructed SVMs with a linear kernel using the libSVM algorithm in the R package (e1071). We used <it>k</it>-mer (<it>k </it>= 1 to 6; 2,772 <it>k</it>-mers in total) and significant motifs identified by MEME as sequence features. The frequency of each <it>k</it>-mer was calculated for each sequence in the peak set and the background set as described in the Motif Finding section, and was 2-norm normalized. For each MEME motif, we ran the CLOVER algorithm to obtain a cumulative score for each sequence <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp>. Then we normalized the CLOVER scores for each motif to have the same mean and standard deviation as the <it>k</it>-mer frequencies across the training sequences, so that <it>k</it>-mers and motifs could be combined with the SVM.</p>
<p>We used the positive and background sequences described in the Motif Finding section for training and testing the SVMs. We performed five-fold cross-validation, in which the sequences were divided into three equal sized portions and in each of five runs one portion is held out for testing while the other two portions were used for training. We computed the auROC score, which is a performance measure that combines sensitivity and specificity, using libSVM and colAUC (R package caTools).</p>
</sec>
<sec>
<st>
<p>Nucleosome mapping</p>
</st>
<p>Nucleosomal DNA libraries were constructed and sequenced as previously described <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp>. To assess the quality of the nucleosome maps, we aligned our nucleosome signal on the annotated transcriptional start site for each genome <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp> using the Genomic Signal Aggregator <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> (Figure S6 in Additional file <supplr sid="S1">1</supplr>). The profiles show a deep NFR at the transcriptional start site, with phased nucleosomes downstream and heterogeneous nucleosomes upstream, as previously reported for <it>S. pombe </it>
<abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>. Nucleosome mapping and motif occupancy analysis were performed as previously described, using default settings <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp>. We identified NFRs genome-wide as described <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp> but without the constraint of searching relative to the translational start site of each gene. We defined the NFRs nearest each species' replication peaks as the origin-proximal NFRs and computed their average nucleosome occupancy, aligning the data on the NFR centers. As controls, we chose an equal number of randomly distributed genomic coordinates and defined the nearest NFRs as non-origin-proximal and then measured the non-origin-proximal average nucleosome occupancy. We repeated this analysis for 10,000 randomly chosen genomic regions for each species. To test the significance of the hypothesis that origin-proximal NFRs are more nucleosome depleted than non-origin-proximal NFRs, we calculated a <it>P</it>-value of n/10,000, where n is the number of times that the non-origin-proximal NFR had an average nucleosome occupancy that was more depleted than the origin-proximal NFRs.</p>
</sec>
<sec>
<st>
<p>Data availability</p>
</st>
<p>Filtered reads for all datasets have been submitted to the Sequence Read Archive (Table S8 in Additional file <supplr sid="S2">2</supplr>). <it>S. pombe </it>replication profiles have been submitted to OriDB <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp> and ReplicationDomain <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>ACS: ARS consensus sequence; ARS: autonomously replicating sequence; auROC: area under the ROC curve; bp: base pair; DDK: Dbf4-depenent kinase; HU: hydroxyurea; NFR: nucleosome-free region; OCR: origin recognition complex; ROC: receiver operating characteristic; SVM: support vector machine; ts: temperature sensitive.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>CH, KS and PM were employees of Helicos BioSciences during the tenure of this research.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>JX performed most post-sequence-alignment data analysis. YY constructed nucleosomal DNA libraries. AT analyzed nucleosome occupancy. CH filtered and aligned the replication-profile sequence data. KA and HN isolated and characterized the <it>S. japonicus </it>ARSs. NK prepared the replication-profile DNA samples. KES prepared and sequenced the replication-profile DNA libraries. JB and CR provided additional DNA sequence and analysis. OJR analyzed nucleosome occupancy and interpreted data. ZW designed and facilitated data analysis. PM, CN and NR conceived the study and coordinated its execution. NR interpreted the data and wrote the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>This work was supported by an American Cancer Society Research Scholar Grant to NR.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Eukaryotic chromosome DNA replication: where, when, and how?</p></title><aug><au><snm>Masai</snm><fnm>H</fnm></au><au><snm>Matsumoto</snm><fnm>S</fnm></au><au><snm>You</snm><fnm>Z</fnm></au><au><snm>Yoshizawa-Sugata</snm><fnm>N</fnm></au><au><snm>Oda</snm><fnm>M</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>2010</pubdate><volume>79</volume><fpage>89</fpage><lpage>130</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.052308.103205</pubid><pubid idtype="pmpid" link="fulltext">20373915</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>The ARS309 chromosomal replicator of Saccharomyces cerevisiae depends on an exceptional ARS consensus sequence.</p></title><aug><au><snm>Theis</snm><fnm>JF</fnm></au><au><snm>Newlon</snm><fnm>CS</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1997</pubdate><volume>94</volume><fpage>10786</fpage><lpage>10791</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.94.20.10786</pubid><pubid idtype="pmcid">23486</pubid><pubid idtype="pmpid" link="fulltext">9380711</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Diversity of eukaryotic DNA replication origins revealed by genome-wide analysis of chromatin structure.</p></title><aug><au><snm>Berbenetz</snm><fnm>NM</fnm></au><au><snm>Nislow</snm><fnm>C</fnm></au><au><snm>Brown</snm><fnm>GW</fnm></au></aug><source>PLoS Genet</source><pubdate>2010</pubdate><volume>6</volume><fpage>e1001092</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1001092</pubid><pubid idtype="pmcid">2932696</pubid><pubid idtype="pmpid" link="fulltext">20824081</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Conserved nucleosome positioning defines replication origins.</p></title><aug><au><snm>Eaton</snm><fnm>ML</fnm></au><au><snm>Galani</snm><fnm>K</fnm></au><au><snm>Kang</snm><fnm>S</fnm></au><au><snm>Bell</snm><fnm>SP</fnm></au><au><snm>Macalpine</snm><fnm>DM</fnm></au></aug><source>Genes Dev</source><pubdate>2010</pubdate><volume>24</volume><fpage>748</fpage><lpage>753</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1913210</pubid><pubid idtype="pmcid">2854390</pubid><pubid idtype="pmpid" link="fulltext">20351051</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Prediction of Saccharomyces cerevisiae replication origins.</p></title><aug><au><snm>Breier</snm><fnm>AM</fnm></au><au><snm>Chatterji</snm><fnm>S</fnm></au><au><snm>Cozzarelli</snm><fnm>NR</fnm></au></aug><source>Genome Biol</source><pubdate>2004</pubdate><volume>5</volume><fpage>R22</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-5-4-r22</pubid><pubid idtype="pmcid">395781</pubid><pubid idtype="pmpid" link="fulltext">15059255</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Genome-wide identification of replication origins in yeast by comparative genomics.</p></title><aug><au><snm>Nieduszynski</snm><fnm>CA</fnm></au><au><snm>Knox</snm><fnm>Y</fnm></au><au><snm>Donaldson</snm><fnm>AD</fnm></au></aug><source>Genes Dev</source><pubdate>2006</pubdate><volume>20</volume><fpage>1874</fpage><lpage>1879</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.385306</pubid><pubid idtype="pmcid">1522085</pubid><pubid idtype="pmpid" link="fulltext">16847347</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Concerted loading of Mcm2-7 double hexamers around DNA during DNA replication origin licensing.</p></title><aug><au><snm>Remus</snm><fnm>D</fnm></au><au><snm>Beuron</snm><fnm>F</fnm></au><au><snm>Tolun</snm><fnm>G</fnm></au><au><snm>Griffith</snm><fnm>JD</fnm></au><au><snm>Morris</snm><fnm>EP</fnm></au><au><snm>Diffley</snm><fnm>JF</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>139</volume><fpage>719</fpage><lpage>730</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.10.015</pubid><pubid idtype="pmcid">2804858</pubid><pubid idtype="pmpid" link="fulltext">19896182</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The ARS consensus sequence is required for chromosomal origin function in Saccharomyces cerevisiae.</p></title><aug><au><snm>Deshpande</snm><fnm>AM</fnm></au><au><snm>Newlon</snm><fnm>CS</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1992</pubdate><volume>12</volume><fpage>4305</fpage><lpage>4313</lpage><xrefbib><pubidlist><pubid idtype="pmcid">360354</pubid><pubid idtype="pmpid" link="fulltext">1406623</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Drosophila ORC localizes to open chromatin and marks sites of cohesin complex loading.</p></title><aug><au><snm>MacAlpine</snm><fnm>HK</fnm></au><au><snm>Gordan</snm><fnm>R</fnm></au><au><snm>Powell</snm><fnm>SK</fnm></au><au><snm>Hartemink</snm><fnm>AJ</fnm></au><au><snm>MacAlpine</snm><fnm>DM</fnm></au></aug><source>Genome Res</source><pubdate>2010</pubdate><volume>20</volume><fpage>201</fpage><lpage>211</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.097873.109</pubid><pubid idtype="pmcid">2813476</pubid><pubid idtype="pmpid" link="fulltext">19996087</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Making sense of eukaryotic DNA replication origins.</p></title><aug><au><snm>Gilbert</snm><fnm>DM</fnm></au></aug><source>Science</source><pubdate>2001</pubdate><volume>294</volume><fpage>96</fpage><lpage>100</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1061724</pubid><pubid idtype="pmcid">1255916</pubid><pubid idtype="pmpid">11588251</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Genome-wide characterization of fission yeast DNA replication origins.</p></title><aug><au><snm>Heichinger</snm><fnm>C</fnm></au><au><snm>Penkett</snm><fnm>CJ</fnm></au><au><snm>Bahler</snm><fnm>J</fnm></au><au><snm>Nurse</snm><fnm>P</fnm></au></aug><source>EMBO J</source><pubdate>2006</pubdate><volume>25</volume><fpage>5171</fpage><lpage>5179</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.emboj.7601390</pubid><pubid idtype="pmcid">1630417</pubid><pubid idtype="pmpid" link="fulltext">17053780</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Genomic mapping of single-stranded DNA in hydroxyurea-challenged yeasts identifies origins of replication.</p></title><aug><au><snm>Feng</snm><fnm>W</fnm></au><au><snm>Collingwood</snm><fnm>D</fnm></au><au><snm>Boeck</snm><fnm>ME</fnm></au><au><snm>Fox</snm><fnm>LA</fnm></au><au><snm>Alvino</snm><fnm>GM</fnm></au><au><snm>Fangman</snm><fnm>WL</fnm></au><au><snm>Raghuraman</snm><fnm>MK</fnm></au><au><snm>Brewer</snm><fnm>BJ</fnm></au></aug><source>Nat Cell Biol</source><pubdate>2006</pubdate><volume>8</volume><fpage>148</fpage><lpage>155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ncb1358</pubid><pubid idtype="pmcid">1414058</pubid><pubid idtype="pmpid" link="fulltext">16429127</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Genome-wide localization of pre-RC sites and identification of replication origins in fission yeast.</p></title><aug><au><snm>Hayashi</snm><fnm>M</fnm></au><au><snm>Katou</snm><fnm>Y</fnm></au><au><snm>Itoh</snm><fnm>T</fnm></au><au><snm>Tazumi</snm><fnm>A</fnm></au><au><snm>Yamada</snm><fnm>Y</fnm></au><au><snm>Takahashi</snm><fnm>T</fnm></au><au><snm>Nakagawa</snm><fnm>T</fnm></au><au><snm>Shirahige</snm><fnm>K</fnm></au><au><snm>Masukata</snm><fnm>H</fnm></au></aug><source>EMBO J</source><pubdate>2007</pubdate><volume>26</volume><fpage>1327</fpage><lpage>1339</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.emboj.7601585</pubid><pubid idtype="pmcid">1817633</pubid><pubid idtype="pmpid" link="fulltext">17304213</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Global profiling of DNA replication timing and efficiency reveals that efficient replication/firing occurs late during S-phase in S. pombe.</p></title><aug><au><snm>Eshaghi</snm><fnm>M</fnm></au><au><snm>Karuturi</snm><fnm>RK</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Chu</snm><fnm>Z</fnm></au><au><snm>Liu</snm><fnm>ET</fnm></au><au><snm>Liu</snm><fnm>J</fnm></au></aug><source>PLoS ONE</source><pubdate>2007</pubdate><volume>2</volume><fpage>e722</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0000722</pubid><pubid idtype="pmcid">1934932</pubid><pubid idtype="pmpid" link="fulltext">17684567</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Checkpoint independence of most DNA replication origins in fission yeast.</p></title><aug><au><snm>Mickle</snm><fnm>KL</fnm></au><au><snm>Ramanathan</snm><fnm>S</fnm></au><au><snm>Rosebrock</snm><fnm>A</fnm></au><au><snm>Oliva</snm><fnm>A</fnm></au><au><snm>Chaudari</snm><fnm>A</fnm></au><au><snm>Yompakdee</snm><fnm>C</fnm></au><au><snm>Scott</snm><fnm>D</fnm></au><au><snm>Leatherwood</snm><fnm>J</fnm></au><au><snm>Huberman</snm><fnm>JA</fnm></au></aug><source>BMC Mol Biol</source><pubdate>2007</pubdate><volume>8</volume><fpage>112</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2199-8-112</pubid><pubid idtype="pmcid">2235891</pubid><pubid idtype="pmpid" link="fulltext">18093330</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>DNA replication origins in the Schizosaccharomyces pombe genome.</p></title><aug><au><snm>Dai</snm><fnm>J</fnm></au><au><snm>Chuang</snm><fnm>RY</fnm></au><au><snm>Kelly</snm><fnm>TJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2005</pubdate><volume>102</volume><fpage>337</fpage><lpage>342</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0408811102</pubid><pubid idtype="pmcid">539312</pubid><pubid idtype="pmpid" link="fulltext">15623550</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Genome-wide distribution of DNA replication origins at A+T-rich islands in Schizosaccharomyces pombe.</p></title><aug><au><snm>Segurado</snm><fnm>M</fnm></au><au><snm>de Luis</snm><fnm>A</fnm></au><au><snm>Antequera</snm><fnm>F</fnm></au></aug><source>EMBO Rep</source><pubdate>2003</pubdate><volume>4</volume><fpage>1048</fpage><lpage>1053</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.embor.7400008</pubid><pubid idtype="pmcid">1326378</pubid><pubid idtype="pmpid" link="fulltext">14566325</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>The fission yeast homologue of Orc4p binds to replication origin DNA via multiple AT-hooks.</p></title><aug><au><snm>Chuang</snm><fnm>RY</fnm></au><au><snm>Kelly</snm><fnm>TJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1999</pubdate><volume>96</volume><fpage>2656</fpage><lpage>2661</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.96.6.2656</pubid><pubid idtype="pmcid">15824</pubid><pubid idtype="pmpid" link="fulltext">10077566</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Poly(dA:dT), a ubiquitous promoter element that stimulates transcription via its intrinsic DNA structure.</p></title><aug><au><snm>Iyer</snm><fnm>V</fnm></au><au><snm>Struhl</snm><fnm>K</fnm></au></aug><source>EMBO J</source><pubdate>1995</pubdate><volume>14</volume><fpage>2570</fpage><lpage>2579</lpage><xrefbib><pubidlist><pubid idtype="pmcid">398371</pubid><pubid idtype="pmpid">7781610</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>DNA bending and its relation to nucleosome positioning.</p></title><aug><au><snm>Drew</snm><fnm>HR</fnm></au><au><snm>Travers</snm><fnm>AA</fnm></au></aug><source>J Mol Biol</source><pubdate>1985</pubdate><volume>186</volume><fpage>773</fpage><lpage>790</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0022-2836(85)90396-1</pubid><pubid idtype="pmpid" link="fulltext">3912515</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Schizosaccharomyces pombe genome-wide nucleosome mapping reveals positioning mechanisms distinct from those of Saccharomyces cerevisiae.</p></title><aug><au><snm>Lantermann</snm><fnm>AB</fnm></au><au><snm>Straub</snm><fnm>T</fnm></au><au><snm>Stralfors</snm><fnm>A</fnm></au><au><snm>Yuan</snm><fnm>GC</fnm></au><au><snm>Ekwall</snm><fnm>K</fnm></au><au><snm>Korber</snm><fnm>P</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2010</pubdate><volume>17</volume><fpage>251</fpage><lpage>257</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1741</pubid><pubid idtype="pmpid" link="fulltext">20118936</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Distinct modes of regulation by chromatin encoded through nucleosome positioning signals.</p></title><aug><au><snm>Field</snm><fnm>Y</fnm></au><au><snm>Kaplan</snm><fnm>N</fnm></au><au><snm>Fondufe-Mittendorf</snm><fnm>Y</fnm></au><au><snm>Moore</snm><fnm>IK</fnm></au><au><snm>Sharon</snm><fnm>E</fnm></au><au><snm>Lubling</snm><fnm>Y</fnm></au><au><snm>Widom</snm><fnm>J</fnm></au><au><snm>Segal</snm><fnm>E</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2008</pubdate><volume>4</volume><fpage>e1000216</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000216</pubid><pubid idtype="pmcid">2570626</pubid><pubid idtype="pmpid" link="fulltext">18989395</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Evaluating genome-scale approaches to eukaryotic DNA replication.</p></title><aug><au><snm>Gilbert</snm><fnm>DM</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2010</pubdate><volume>11</volume><fpage>673</fpage><lpage>684</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2830</pubid><pubid idtype="pmcid">2962615</pubid><pubid idtype="pmpid" link="fulltext">20811343</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Mapping of early firing origins on a replication profile of budding yeast.</p></title><aug><au><snm>Yabuki</snm><fnm>N</fnm></au><au><snm>Terashima</snm><fnm>H</fnm></au><au><snm>Kitada</snm><fnm>K</fnm></au></aug><source>Genes Cells</source><pubdate>2002</pubdate><volume>7</volume><fpage>781</fpage><lpage>789</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2443.2002.00559.x</pubid><pubid idtype="pmpid" link="fulltext">12167157</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Single-molecule DNA sequencing of a viral genome.</p></title><aug><au><snm>Harris</snm><fnm>TD</fnm></au><au><snm>Buzby</snm><fnm>PR</fnm></au><au><snm>Babcock</snm><fnm>H</fnm></au><au><snm>Beer</snm><fnm>E</fnm></au><au><snm>Bowers</snm><fnm>J</fnm></au><au><snm>Braslavsky</snm><fnm>I</fnm></au><au><snm>Causey</snm><fnm>M</fnm></au><au><snm>Colonell</snm><fnm>J</fnm></au><au><snm>Dimeo</snm><fnm>J</fnm></au><au><snm>Efcavitch</snm><fnm>JW</fnm></au><au><snm>Giladi</snm><fnm>E</fnm></au><au><snm>Gill</snm><fnm>J</fnm></au><au><snm>Healy</snm><fnm>J</fnm></au><au><snm>Jarosz</snm><fnm>M</fnm></au><au><snm>Lapen</snm><fnm>D</fnm></au><au><snm>Moulton</snm><fnm>K</fnm></au><au><snm>Quake</snm><fnm>SR</fnm></au><au><snm>Steinmann</snm><fnm>K</fnm></au><au><snm>Thayer</snm><fnm>E</fnm></au><au><snm>Tyurina</snm><fnm>A</fnm></au><au><snm>Ward</snm><fnm>R</fnm></au><au><snm>Weiss</snm><fnm>H</fnm></au><au><snm>Xie</snm><fnm>Z</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>320</volume><fpage>106</fpage><lpage>109</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1150427</pubid><pubid idtype="pmpid" link="fulltext">18388294</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Virtual terminator nucleotides for next-generation DNA sequencing.</p></title><aug><au><snm>Bowers</snm><fnm>J</fnm></au><au><snm>Mitchell</snm><fnm>J</fnm></au><au><snm>Beer</snm><fnm>E</fnm></au><au><snm>Buzby</snm><fnm>PR</fnm></au><au><snm>Causey</snm><fnm>M</fnm></au><au><snm>Efcavitch</snm><fnm>JW</fnm></au><au><snm>Jarosz</snm><fnm>M</fnm></au><au><snm>Krzymanska-Olejnik</snm><fnm>E</fnm></au><au><snm>Kung</snm><fnm>L</fnm></au><au><snm>Lipson</snm><fnm>D</fnm></au><au><snm>Lowman</snm><fnm>GM</fnm></au><au><snm>Marappan</snm><fnm>S</fnm></au><au><snm>McInerney</snm><fnm>P</fnm></au><au><snm>Platt</snm><fnm>A</fnm></au><au><snm>Roy</snm><fnm>A</fnm></au><au><snm>Siddiqi</snm><fnm>SM</fnm></au><au><snm>Steinmann</snm><fnm>K</fnm></au><au><snm>Thompson</snm><fnm>JF</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>593</fpage><lpage>595</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.1354</pubid><pubid idtype="pmcid">2719685</pubid><pubid idtype="pmpid" link="fulltext">19620973</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>DNA replication origins fire stochastically in fission yeast.</p></title><aug><au><snm>Patel</snm><fnm>PK</fnm></au><au><snm>Arcangioli</snm><fnm>B</fnm></au><au><snm>Baker</snm><fnm>SP</fnm></au><au><snm>Bensimon</snm><fnm>A</fnm></au><au><snm>Rhind</snm><fnm>N</fnm></au></aug><source>Mol Biol Cell</source><pubdate>2006</pubdate><volume>17</volume><fpage>308</fpage><lpage>316</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1345668</pubid><pubid idtype="pmpid" link="fulltext">16251353</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Novel episomal vectors and a highly efficient transformation procedure for the fission yeast Schizosaccharomyces japonicus.</p></title><aug><au><snm>Aoki</snm><fnm>K</fnm></au><au><snm>Nakajima</snm><fnm>R</fnm></au><au><snm>Furuya</snm><fnm>K</fnm></au><au><snm>Niki</snm><fnm>H</fnm></au></aug><source>Yeast</source><pubdate>2010</pubdate><volume>27</volume><fpage>1049</fpage><lpage>1060</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/yea.1815</pubid><pubid idtype="pmpid" link="fulltext">20737410</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>A comprehensive genome-wide map of autonomously replicating sequences in a naive genome.</p></title><aug><au><snm>Liachko</snm><fnm>I</fnm></au><au><snm>Bhaskar</snm><fnm>A</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><au><snm>Chung</snm><fnm>SC</fnm></au><au><snm>Tye</snm><fnm>BK</fnm></au><au><snm>Keich</snm><fnm>U</fnm></au></aug><source>PLoS Genet</source><pubdate>2010</pubdate><volume>6</volume><fpage>e1000946</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1000946</pubid><pubid idtype="pmcid">2869322</pubid><pubid idtype="pmpid" link="fulltext">20485513</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Early-replicating heterochromatin.</p></title><aug><au><snm>Kim</snm><fnm>SM</fnm></au><au><snm>Dubey</snm><fnm>DD</fnm></au><au><snm>Huberman</snm><fnm>JA</fnm></au></aug><source>Genes Dev</source><pubdate>2003</pubdate><volume>17</volume><fpage>330</fpage><lpage>335</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1046203</pubid><pubid idtype="pmcid">195982</pubid><pubid idtype="pmpid" link="fulltext">12569122</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>The heterochromatin protein Swi6/HP1 activates replication origins at the pericentromeric region and silent mating-type locus.</p></title><aug><au><snm>Hayashi</snm><fnm>MT</fnm></au><au><snm>Takahashi</snm><fnm>TS</fnm></au><au><snm>Nakagawa</snm><fnm>T</fnm></au><au><snm>Nakayama</snm><fnm>J</fnm></au><au><snm>Masukata</snm><fnm>H</fnm></au></aug><source>Nat Cell Biol</source><pubdate>2009</pubdate><volume>11</volume><fpage>357</fpage><lpage>362</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ncb1845</pubid><pubid idtype="pmpid" link="fulltext">19182789</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Support vector machine applications in computational biology</p></title><aug><au><snm>Noble</snm><fnm>WS</fnm></au></aug><source>Kernel Methods in Computational Biology</source><publisher>MIT Press</publisher><editor>Sch&#246;lkopf B, Tsuda K, Vert JP</editor><pubdate>2004</pubdate><fpage>71</fpage><lpage>92</lpage></bibl><bibl id="B33"><title><p>Evolutionary divergence of intrinsic and trans-regulated nucleosome positioning sequences reveals plastic rules for chromatin organization.</p></title><aug><au><snm>Tsankov</snm><fnm>A</fnm></au><au><snm>Yanagisawa</snm><fnm>Y</fnm></au><au><snm>Rhind</snm><fnm>N</fnm></au><au><snm>Regev</snm><fnm>A</fnm></au><au><snm>Rando</snm><fnm>OJ</fnm></au></aug><source>Genome Res</source><pubdate>2011</pubdate><volume>21</volume><fpage>1851</fpage><lpage>1862</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.122267.111</pubid><pubid idtype="pmcid">3205570</pubid><pubid idtype="pmpid" link="fulltext">21914852</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Pre-replication complex proteins assemble at regions of low nucleosome occupancy within the Chinese hamster dihydrofolate reductase initiation zone.</p></title><aug><au><snm>Lubelsky</snm><fnm>Y</fnm></au><au><snm>Sasaki</snm><fnm>T</fnm></au><au><snm>Kuipers</snm><fnm>MA</fnm></au><au><snm>Lucas</snm><fnm>I</fnm></au><au><snm>Le Beau</snm><fnm>MM</fnm></au><au><snm>Carignon</snm><fnm>S</fnm></au><au><snm>Debatisse</snm><fnm>M</fnm></au><au><snm>Prinz</snm><fnm>JA</fnm></au><au><snm>Dennis</snm><fnm>JH</fnm></au><au><snm>Gilbert</snm><fnm>DM</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><fpage>3141</fpage><lpage>3155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1276</pubid><pubid idtype="pmcid">3082903</pubid><pubid idtype="pmpid" link="fulltext">21148149</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Comparative functional genomics of the fission yeasts.</p></title><aug><au><snm>Rhind</snm><fnm>N</fnm></au><au><snm>Chen</snm><fnm>Z</fnm></au><au><snm>Yassour</snm><fnm>M</fnm></au><au><snm>Thompson</snm><fnm>DA</fnm></au><au><snm>Haas</snm><fnm>BJ</fnm></au><au><snm>Habib</snm><fnm>N</fnm></au><au><snm>Wapinski</snm><fnm>I</fnm></au><au><snm>Roy</snm><fnm>S</fnm></au><au><snm>Lin</snm><fnm>MF</fnm></au><au><snm>Heiman</snm><fnm>DI</fnm></au><au><snm>Young</snm><fnm>SK</fnm></au><au><snm>Furuya</snm><fnm>K</fnm></au><au><snm>Guo</snm><fnm>Y</fnm></au><au><snm>Pidoux</snm><fnm>A</fnm></au><au><snm>Chen</snm><fnm>HM</fnm></au><au><snm>Robbertse</snm><fnm>B</fnm></au><au><snm>Goldberg</snm><fnm>JM</fnm></au><au><snm>Aoki</snm><fnm>K</fnm></au><au><snm>Bayne</snm><fnm>EH</fnm></au><au><snm>Berlin</snm><fnm>AM</fnm></au><au><snm>Desjardins</snm><fnm>CA</fnm></au><au><snm>Dobbs</snm><fnm>E</fnm></au><au><snm>Dukaj</snm><fnm>L</fnm></au><au><snm>Fan</snm><fnm>L</fnm></au><au><snm>Fitzgerald</snm><fnm>MG</fnm></au><au><snm>French</snm><fnm>C</fnm></au><au><snm>Gujja</snm><fnm>S</fnm></au><au><snm>Hansen</snm><fnm>K</fnm></au><au><snm>Keifenheim</snm><fnm>D</fnm></au><au><snm>Levin</snm><fnm>JZ</fnm></au><etal/></aug><source>Science</source><pubdate>2011</pubdate><volume>332</volume><fpage>930</fpage><lpage>936</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1203357</pubid><pubid idtype="pmcid">3131103</pubid><pubid idtype="pmpid" link="fulltext">21511999</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Establishing the program of origin firing during S phase in fission yeast.</p></title><aug><au><snm>Wu</snm><fnm>PY</fnm></au><au><snm>Nurse</snm><fnm>P</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>136</volume><fpage>852</fpage><lpage>864</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.01.017</pubid><pubid idtype="pmcid">2787407</pubid><pubid idtype="pmpid" link="fulltext">19269364</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>The Hsk1(Cdc7) replication kinase regulates origin efficiency.</p></title><aug><au><snm>Patel</snm><fnm>PK</fnm></au><au><snm>Kommajosyula</snm><fnm>N</fnm></au><au><snm>Rosebrock</snm><fnm>A</fnm></au><au><snm>Bensimon</snm><fnm>A</fnm></au><au><snm>Leatherwood</snm><fnm>J</fnm></au><au><snm>Bechhoefer</snm><fnm>J</fnm></au><au><snm>Rhind</snm><fnm>N</fnm></au></aug><source>Mol Biol Cell</source><pubdate>2008</pubdate><volume>19</volume><fpage>5550</fpage><lpage>5558</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1091/mbc.E08-06-0645</pubid><pubid idtype="pmcid">2592646</pubid><pubid idtype="pmpid" link="fulltext">18799612</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Bubble-chip analysis of human origin distributions demonstrates on a genomic scale significant clustering into zones and significant association with transcription.</p></title><aug><au><snm>Mesner</snm><fnm>LD</fnm></au><au><snm>Valsakumar</snm><fnm>V</fnm></au><au><snm>Karnani</snm><fnm>N</fnm></au><au><snm>Dutta</snm><fnm>A</fnm></au><au><snm>Hamlin</snm><fnm>JL</fnm></au><au><snm>Bekiranov</snm><fnm>S</fnm></au></aug><source>Genome Res</source><pubdate>2011</pubdate><volume>21</volume><fpage>377</fpage><lpage>389</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.111328.110</pubid><pubid idtype="pmcid">3044852</pubid><pubid idtype="pmpid" link="fulltext">21173031</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Structural diversity and dynamics of genomic replication origins in Schizosaccharomyces pombe.</p></title><aug><au><snm>Cotobal</snm><fnm>C</fnm></au><au><snm>Segurado</snm><fnm>M</fnm></au><au><snm>Antequera</snm><fnm>F</fnm></au></aug><source>EMBO J</source><pubdate>2010</pubdate><volume>29</volume><fpage>934</fpage><lpage>942</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/emboj.2009.411</pubid><pubid idtype="pmcid">2837175</pubid><pubid idtype="pmpid" link="fulltext">20094030</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Genome-scale analysis of metazoan replication origins reveals their organization in specific but flexible sites defined by conserved features.</p></title><aug><au><snm>Cayrou</snm><fnm>C</fnm></au><au><snm>Coulombe</snm><fnm>P</fnm></au><au><snm>Vigneron</snm><fnm>A</fnm></au><au><snm>Stanojcic</snm><fnm>S</fnm></au><au><snm>Ganier</snm><fnm>O</fnm></au><au><snm>Peiffer</snm><fnm>I</fnm></au><au><snm>Rivals</snm><fnm>E</fnm></au><au><snm>Puy</snm><fnm>A</fnm></au><au><snm>Laurent-Chabalier</snm><fnm>S</fnm></au><au><snm>Desprat</snm><fnm>R</fnm></au><au><snm>Mechali</snm><fnm>M</fnm></au></aug><source>Genome Res</source><pubdate>2011</pubdate><volume>21</volume><fpage>1438</fpage><lpage>1449</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.121830.111</pubid><pubid idtype="pmcid">3166829</pubid><pubid idtype="pmpid" link="fulltext">21750104</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>CENP-B preserves genome integrity at replication forks paused by retrotransposon LTR.</p></title><aug><au><snm>Zaratiegui</snm><fnm>M</fnm></au><au><snm>Vaughn</snm><fnm>MW</fnm></au><au><snm>Irvine</snm><fnm>DV</fnm></au><au><snm>Goto</snm><fnm>D</fnm></au><au><snm>Watt</snm><fnm>S</fnm></au><au><snm>Bahler</snm><fnm>J</fnm></au><au><snm>Arcangioli</snm><fnm>B</fnm></au><au><snm>Martienssen</snm><fnm>RA</fnm></au></aug><source>Nature</source><pubdate>2011</pubdate><volume>469</volume><fpage>112</fpage><lpage>115</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature09608</pubid><pubid idtype="pmcid">3057531</pubid><pubid idtype="pmpid" link="fulltext">21151105</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Sap1 promotes the association of the replication fork protection complex with chromatin and is involved in the replication checkpoint in Schizosaccharomyces pombe.</p></title><aug><au><snm>Noguchi</snm><fnm>C</fnm></au><au><snm>Noguchi</snm><fnm>E</fnm></au></aug><source>Genetics</source><pubdate>2007</pubdate><volume>175</volume><fpage>553</fpage><lpage>566</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.106.065334</pubid><pubid idtype="pmcid">1800610</pubid><pubid idtype="pmpid" link="fulltext">17151242</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>A novel switch-activating site (SAS1) and its cognate binding factor (SAP1) required for efficient mat1 switching in Schizosaccharomyces pombe.</p></title><aug><au><snm>Arcangioli</snm><fnm>B</fnm></au><au><snm>Klar</snm><fnm>AJ</fnm></au></aug><source>EMBO J</source><pubdate>1991</pubdate><volume>10</volume><fpage>3025</fpage><lpage>3032</lpage><xrefbib><pubidlist><pubid idtype="pmcid">453017</pubid><pubid idtype="pmpid">1915277</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Initiation sites are distributed at frequent intervals in the Chinese hamster dihydrofolate reductase origin of replication but are used with very different efficiencies.</p></title><aug><au><snm>Dijkwel</snm><fnm>PA</fnm></au><au><snm>Wang</snm><fnm>S</fnm></au><au><snm>Hamlin</snm><fnm>JL</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2002</pubdate><volume>22</volume><fpage>3053</fpage><lpage>3065</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.22.9.3053-3065.2002</pubid><pubid idtype="pmcid">133756</pubid><pubid idtype="pmpid" link="fulltext">11940663</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Basic methods for fission yeast.</p></title><aug><au><snm>Forsburg</snm><fnm>SL</fnm></au><au><snm>Rhind</snm><fnm>N</fnm></au></aug><source>Yeast</source><pubdate>2006</pubdate><volume>23</volume><fpage>173</fpage><lpage>183</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/yea.1347</pubid><pubid idtype="pmpid" link="fulltext">16498704</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>The genome sequence of Schizosaccharomyces pombe.</p></title><aug><au><snm>Wood</snm><fnm>V</fnm></au><au><snm>Gwilliam</snm><fnm>R</fnm></au><au><snm>Rajandream</snm><fnm>MA</fnm></au><au><snm>Lyne</snm><fnm>M</fnm></au><au><snm>Lyne</snm><fnm>R</fnm></au><au><snm>Stewart</snm><fnm>A</fnm></au><au><snm>Sgouros</snm><fnm>J</fnm></au><au><snm>Peat</snm><fnm>N</fnm></au><au><snm>Hayles</snm><fnm>J</fnm></au><au><snm>Baker</snm><fnm>S</fnm></au><au><snm>Basham</snm><fnm>D</fnm></au><au><snm>Bowman</snm><fnm>S</fnm></au><au><snm>Brooks</snm><fnm>K</fnm></au><au><snm>Brown</snm><fnm>D</fnm></au><au><snm>Brown</snm><fnm>S</fnm></au><au><snm>Chillingworth</snm><fnm>T</fnm></au><au><snm>Churcher</snm><fnm>C</fnm></au><au><snm>Collins</snm><fnm>M</fnm></au><au><snm>Connor</snm><fnm>R</fnm></au><au><snm>Cronin</snm><fnm>A</fnm></au><au><snm>Davis</snm><fnm>P</fnm></au><au><snm>Feltwell</snm><fnm>T</fnm></au><au><snm>Fraser</snm><fnm>A</fnm></au><au><snm>Gentles</snm><fnm>S</fnm></au><au><snm>Goble</snm><fnm>A</fnm></au><au><snm>Hamlin</snm><fnm>N</fnm></au><au><snm>Harris</snm><fnm>D</fnm></au><au><snm>Hidalgo</snm><fnm>J</fnm></au><au><snm>Hodgson</snm><fnm>G</fnm></au><au><snm>Holroyd</snm><fnm>S</fnm></au><etal/></aug><source>Nature</source><pubdate>2002</pubdate><volume>415</volume><fpage>871</fpage><lpage>880</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature724</pubid><pubid idtype="pmpid" link="fulltext">11859360</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>ACT: aggregation and correlation toolbox for analyses of genome tracks.</p></title><aug><au><snm>Jee</snm><fnm>J</fnm></au><au><snm>Rozowsky</snm><fnm>J</fnm></au><au><snm>Yip</snm><fnm>KY</fnm></au><au><snm>Lochovsky</snm><fnm>L</fnm></au><au><snm>Bjornson</snm><fnm>R</fnm></au><au><snm>Zhong</snm><fnm>G</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Fu</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Weng</snm><fnm>Z</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>Bioinformatics</source><pubdate>2011</pubdate><volume>27</volume><fpage>1152</fpage><lpage>1154</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btr092</pubid><pubid idtype="pmcid">3072554</pubid><pubid idtype="pmpid" link="fulltext">21349863</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>MEME SUITE: tools for motif discovery and searching.</p></title><aug><au><snm>Bailey</snm><fnm>TL</fnm></au><au><snm>Boden</snm><fnm>M</fnm></au><au><snm>Buske</snm><fnm>FA</fnm></au><au><snm>Frith</snm><fnm>M</fnm></au><au><snm>Grant</snm><fnm>CE</fnm></au><au><snm>Clementi</snm><fnm>L</fnm></au><au><snm>Ren</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>WW</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>W202</fpage><lpage>208</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp335</pubid><pubid idtype="pmcid">2703892</pubid><pubid idtype="pmpid" link="fulltext">19458158</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Detection of functional DNA motifs via statistical over-representation.</p></title><aug><au><snm>Frith</snm><fnm>MC</fnm></au><au><snm>Fu</snm><fnm>Y</fnm></au><au><snm>Yu</snm><fnm>L</fnm></au><au><snm>Chen</snm><fnm>JF</fnm></au><au><snm>Hansen</snm><fnm>U</fnm></au><au><snm>Weng</snm><fnm>Z</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><fpage>1372</fpage><lpage>1381</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh299</pubid><pubid idtype="pmcid">390287</pubid><pubid idtype="pmpid" link="fulltext">14988425</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>The role of nucleosome positioning in the evolution of gene regulation.</p></title><aug><au><snm>Tsankov</snm><fnm>AM</fnm></au><au><snm>Thompson</snm><fnm>DA</fnm></au><au><snm>Socha</snm><fnm>A</fnm></au><au><snm>Regev</snm><fnm>A</fnm></au><au><snm>Rando</snm><fnm>OJ</fnm></au></aug><source>PLoS Biol</source><pubdate>2010</pubdate><volume>8</volume><fpage>e1000414</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.1000414</pubid><pubid idtype="pmcid">2897762</pubid><pubid idtype="pmpid" link="fulltext">20625544</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Identifying positioned nucleosomes with epigenetic marks in human from ChIP-Seq.</p></title><aug><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Shin</snm><fnm>H</fnm></au><au><snm>Song</snm><fnm>JS</fnm></au><au><snm>Lei</snm><fnm>Y</fnm></au><au><snm>Liu</snm><fnm>XS</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>537</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-537</pubid><pubid idtype="pmcid">2596141</pubid><pubid idtype="pmpid" link="fulltext">19014516</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>OriDB: a DNA replication origin database.</p></title><aug><au><snm>Nieduszynski</snm><fnm>CA</fnm></au><au><snm>Hiraga</snm><fnm>S</fnm></au><au><snm>Ak</snm><fnm>P</fnm></au><au><snm>Benham</snm><fnm>CJ</fnm></au><au><snm>Donaldson</snm><fnm>AD</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><fpage>D40</fpage><lpage>46</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl758</pubid><pubid idtype="pmcid">1781122</pubid><pubid idtype="pmpid" link="fulltext">17065467</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>ReplicationDomain: a visualization tool and comparative database for genome-wide replication timing data.</p></title><aug><au><snm>Weddington</snm><fnm>N</fnm></au><au><snm>Stuy</snm><fnm>A</fnm></au><au><snm>Hiratani</snm><fnm>I</fnm></au><au><snm>Ryba</snm><fnm>T</fnm></au><au><snm>Yokochi</snm><fnm>T</fnm></au><au><snm>Gilbert</snm><fnm>DM</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>530</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-530</pubid><pubid idtype="pmcid">2636809</pubid><pubid idtype="pmpid" link="fulltext">19077204</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>The essential DNA-binding protein sap1 of Schizosaccharomyces pombe contains two independent oligomerization interfaces that dictate the relative orientation of the DNA-binding domain.</p></title><aug><au><snm>Ghazvini</snm><fnm>M</fnm></au><au><snm>Ribes</snm><fnm>V</fnm></au><au><snm>Arcangioli</snm><fnm>B</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1995</pubdate><volume>15</volume><fpage>4939</fpage><lpage>4946</lpage><xrefbib><pubidlist><pubid idtype="pmcid">230740</pubid><pubid idtype="pmpid" link="fulltext">7651412</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>The Pfam protein families database.</p></title><aug><au><snm>Finn</snm><fnm>RD</fnm></au><au><snm>Mistry</snm><fnm>J</fnm></au><au><snm>Tate</snm><fnm>J</fnm></au><au><snm>Coggill</snm><fnm>P</fnm></au><au><snm>Heger</snm><fnm>A</fnm></au><au><snm>Pollington</snm><fnm>JE</fnm></au><au><snm>Gavin</snm><fnm>OL</fnm></au><au><snm>Gunasekaran</snm><fnm>P</fnm></au><au><snm>Ceric</snm><fnm>G</fnm></au><au><snm>Forslund</snm><fnm>K</fnm></au><au><snm>Holm</snm><fnm>L</fnm></au><au><snm>Sonnhammer</snm><fnm>EL</fnm></au><au><snm>Eddy</snm><fnm>SR</fnm></au><au><snm>Bateman</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>D211</fpage><lpage>222</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp985</pubid><pubid idtype="pmcid">2808889</pubid><pubid idtype="pmpid" link="fulltext">19920124</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>