<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2006-7-7-r53</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Comparative genomics of <it>Drosophila </it>and human core promoters</p>
			</title>
			<aug>
				<au id="A1">
					<snm>FitzGerald</snm>
					<mi>C</mi>
					<fnm>Peter</fnm>
					<insr iid="I1"/>
					<email>pcf@helix.nih.gov</email>
				</au>
				<au id="A2">
					<snm>Sturgill</snm>
					<fnm>David</fnm>
					<insr iid="I2"/>
					<email>sturgill@helix.nih.gov</email>
				</au>
				<au id="A3">
					<snm>Shyakhtenko</snm>
					<fnm>Andrey</fnm>
					<insr iid="I3"/>
					<email>shlyakha@mail.nih.gov</email>
				</au>
				<au id="A4">
					<snm>Oliver</snm>
					<fnm>Brian</fnm>
					<insr iid="I2"/>
					<email>oliver@helix.nih.gov</email>
				</au>
				<au id="A5" ca="yes">
					<snm>Vinson</snm>
					<fnm>Charles</fnm>
					<insr iid="I3"/>
					<email>vinsonc@dc37a.nci.nih.gov</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Genome Analysis Unit, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA</p>
				</ins>
				<ins id="I2">
					<p>Laboratory of Cellular and Developmental Biology National Institute of Diabetes and Digestive and Kidney, National Institutes of Health, Bethesda, MD 20892, USA</p>
				</ins>
				<ins id="I3">
					<p>Laboratory of Metabolism, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2006</pubdate>
			<volume>7</volume>
			<issue>7</issue>
			<fpage>R53</fpage>
			<url>http://genomebiology.com/2006/7/7/R53</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">16827941</pubid><pubid idtype="doi">10.1186/gb-2006-7-7-r53</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>22</day>
					<month>3</month>
					<year>2006</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>8</day>
					<month>5</month>
					<year>2006</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>6</day>
					<month>6</month>
					<year>2006</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>7</day>
					<month>7</month>
					<year>2006</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2006</year>
			<collab>FitzGerald et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Fly and human core promoters</p>
		</shorttitle>
		<shortabs>
			<p>Comparison of DNA sequence distributions in <it>Drosophila </it>and human promoters suggests that different motifs have distinct functional roles.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>The core promoter region plays a critical role in the regulation of eukaryotic gene expression. We have determined the non-random distribution of DNA sequences relative to the transcriptional start site in <it>Drosophila melanogaster </it>promoters to identify sequences that may be biologically significant. We compare these results with those obtained for human promoters.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>We determined the distribution of all 65,536 octamer (8-mers) DNA sequences in 10,914 <it>Drosophila </it>promoters and two sets of human promoters aligned relative to the transcriptional start site. In <it>Drosophila</it>, 298 8-mers have highly significant (<it>p </it>&#8804; 1 &#215; 10<sup>-16</sup>) non-random distributions peaking within 100 base-pairs of the transcriptional start site. These sequences were grouped into 15 DNA motifs. Ten motifs, termed directional motifs, occur only on the positive strand while the remaining five motifs, termed non-directional motifs, occur on both strands. The only directional motifs to localize in human promoters are TATA, INR, and DPE. The directional motifs were further subdivided into those precisely positioned relative to the transcriptional start site and those that are positioned more loosely relative to the transcriptional start site. Similar numbers of non-directional motifs were identified in both species and most are different. The genes associated with all 15 DNA motifs, when they occur in the peak, are enriched in specific Gene Ontology categories and show a distinct mRNA expression pattern, suggesting that there is a core promoter code in <it>Drosophila</it>.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p><it>Drosophila </it>and human promoters use different DNA sequences to regulate gene expression, supporting the idea that evolution occurs by the modulation of gene regulation.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010001">Biochemistry and structural biology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>The regulation of eukaryotic gene expression is a complex process involving many different control mechanisms, including chromatin structure and DNA sequences that bind specific proteins <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. For convenience, we divide DNA sequence motifs that are bound by proteins into three distinct classes: the core promoter region where the basal transcription machinery binds; motifs within the core promoter region that bind to transcription factors; and classic enhancer or silencer motifs, that function at large distances from the transcriptional start site (TSS). Two extremes of regulated gene expression may be envisioned. In one extreme, the general transcriptional machinery is identical for all promoters, and the binding of different transcription factors to the core promoter and more distant motifs recruits and regulates RNA polymerase activity to control gene expression. In the other extreme, different motifs within the core promoter direct the assembly of transcriptional machinery with different components. The latter system is used in prokaryotic systems where different sigma factors, a component of the polymerase complex, bind different motifs in the core promoter to regulate functionally related genes <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. This type of system also operates in sex specific tissues of <it>Drosophila </it>where the germ cells express variant isoforms of the general transcriptional complex <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> termed core promoter selectivity factors <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Furthermore, genetic studies in <it>Drosophila </it>indicate that the core promoter contains information that directs tissue-specific mRNA expression <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
			<p>A variety of computational methods have been used to identify DNA binding sites for transcription factors and core promoter elements in both <it>Drosophila </it>and human <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Previous full-genome-analysis of <it>Drosophila </it>core promoters has examined abundance, but not the precise positioning of motifs near the TSS. Here, we use the technique of examining non-random distribution relative to the TSS in <it>Drosophila melanogaster </it>promoter sequences to identify DNA motifs that are biologically significant. This study adds to our understanding of <it>Drosophila </it>core promoters by identifying new motifs and showing that motifs correlate with different biological functions. Comparing these results with those obtained with human indicate that the DNA motifs that localize are different except for the strand specific core promoter elements TATA, initiator element (INR), and downstream promoter element (DPE).</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<p>Genomic DNA sequences and gene annotation data for <it>Drosophila </it>and human were downloaded from the UCSC Genome Browser site <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Human gene annotation data were also obtained from the DBTSS <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. For each organism, we created a dataset corresponding to the region -1,001 to +499 base-pairs (bp) relative to the annotated TSS sequences of each RefSeq gene that had an annotated 5' untranslated region (UTR) of 10 or more bp. We created two human datasets, one using the UCSC annotations and one using the DBTSS annotations.</p>
			<sec>
				<st>
					<p>Distribution of mono-nucleotides is different between <it>Drosophila </it>and human promoters</p>
				</st>
				<p>To determine the gross structure of <it>Drosophila </it>and human promoters, we determined the abundance of the four mononucleotides (1-mer; Figure <figr fid="F1">1a</figr>) across the 1,500 bp from -1,000 bp to +499 bp for 10,914 <it>Drosophila </it>promoters and compared these to distributions in 15,011 (UCSC) and 12,926 (DBTSS) human promoters (Figure <figr fid="F1">1b,c</figr>). <it>Drosophila </it>promoters are more A and T rich (56%) than human promoters (44%). In addition, <it>Drosophila </it>promoters had a peak for both A and T between -200 bp and the TSS, while the human promoters had a broad peak for both G and C centered at the TSS, suggesting a fundamental difference in global promoter architecture. The two human datasets show the same general distribution patterns, but the DBTSS set has more pronounced peaks and valleys at the TSS.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>The distribution of nucleotides across <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>The distribution of nucleotides across <it>Drosophila </it>and human promoters. The distribution of mononucleotides across the <b>(a) </b>1,500 bp region of 10,914 <it>Drosophila </it>and <b>(b) </b>15,011 and <b>(c) </b>12,926 human promoters; the frequency of each mononucleotide is plotted against position (in 20 bp bins). The TSS occurs in bin 51 and its location is indicated. <b>(d) </b>The frequency of occurrence of the CA dinucleotide, at a single base-pair resolution across the 1,500 bp promoter region for all three datasets.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-1"/>
				</fig>
				<p>The CA dinucleotide is often associated with the TSS <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and is often associated with a unique TSS <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. RNA polymerase is known to prefer an adenine in the +1 position <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. This provides an important quality control metric. A tight cluster of CA sites at the TSS would indicate that enough TSSs have been accurately assigned to permit analysis of other motifs. Figure <figr fid="F1">1d</figr> presents the CA dinucleotide distribution plotted at a single nucleotide resolution, rather than the 20 bp bin shown in Figure <figr fid="F1">1a-c</figr>. The CA distribution in both <it>Drosophila </it>and human promoters showed a spike exactly at the TSS (the A of the CA dinucleotide is at position +1 in the peak). The <it>Drosophila </it>CA spike at the TSS occurs in approximately 20% of all promoters while the spike is less pronounced in the human (UCSC) dataset (approximately 10%) and more pronounced in the human (DBTSS) dataset (approximately 40%). This CA peak is part of the initiator (INR) motif (TCAGTY) that is positioned at the TSS (see below). That CA is often present at the TSS suggests that the TSS has been appropriately assigned in many of the transcripts in both the <it>Drosophila </it>and human promoter dataset. If the CA peak is taken as a relative measure of the quality, or precise alignment, of the datasets, then the two human sets bracket the <it>Drosophila </it>set with respect to the accuracy of the positioning of the TSS.</p>
			</sec>
			<sec>
				<st>
					<p>Distribution of all 8-mer DNA sequences in promoters</p>
				</st>
				<p>Having validated the quality of the TSS assignments, we determined the distribution of all 8-mers in the set of <it>Drosophila </it>and human putative promoters to identify potential DNA binding sites for transcription factors that are localized relative to the TSS. A clustering factor (CF), describing the presence of a peak in the distribution of each 8-mer, was calculated three ways, by examining the distribution on both strands (CF), on the positive strand (CF<sup>+</sup>), and on the negative strand (CF<sup>-</sup>). For these calculations we divided the 1,500 bp of genomic DNA, from -1,000 bp to +499 bp relative to the TSS, into 75 bins of 20 bp each (see Materials and methods).</p>
				<p>When CF values were plotted against the bin with the maximum number of members for the <it>Drosophila </it>and human promoters, respectively (Figure <figr fid="F2">2a-c</figr>), all distributions showed similar patterns, with a grouping of DNA sequences that peak within 100 bp of the TSS. The highest CF values for all plots is 20 to 30, indicating that these 8-mers are approximately 20 to 30 times more abundant at one position relative to the TSS than elsewhere in promoters. In contrast to the similarity in CF values, when the data were plotted for CF<sup>+</sup>, (Figure <figr fid="F2">2d-f</figr>), a profound difference between <it>Drosophila </it>and both human datasets was revealed. <it>Drosophila </it>8-mers have a maximum CF<sup>+ </sup>value of approximately 50 while the maximum CF<sup>+ </sup>for human sequences is approximately 20. This suggests that <it>Drosophila </it>has more 8-mers that occur preferentially on one strand of DNA, and that the <it>Drosophila </it>strand-dependent 8-mers have a higher degree of localization than their human counterparts. Control data, using 7th-order Markov random datasets, show a complete lack of clustering for any 8-mers for either human or <it>Drosophila </it>(data not shown).</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>The localization of all 65,536 8-mers in <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>The localization of all 65,536 8-mers in <it>Drosophila </it>and human promoters. The clustering factors (CF or CF<sup>+</sup>) calculated for 20 bp bins plotted at the position of the most populated bin for all 65,536 8-mers. <b>(a) </b>CF for 10,914 <it>Drosophila </it>promoters; <b>(b) </b>CF for 15,011 human (UCSC) promoters; <b>(c) </b>CF for 12,926 human (DBTSS) promoters; <b>(d) </b>CF<sup>+ </sup>for 10,914 <it>Drosophila </it>promoters; <b>(e) </b>CF<sup>+ </sup>for 15,011 human (UCSC) promoters; <b>(f) </b>CF<sup>+ </sup>for 12,926 human (DBTSS) promoters.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-2"/>
				</fig>
				<p>To determine if an 8-mer has a peak in its distribution on only one strand of DNA, we compared the CF<sup>+ </sup>with the CF on the opposite strand (CF<sup>-</sup>). In <it>Drosophila</it>, we identified two types of peaking 8-mers; those that peak on both strands and thus have similar CF<sup>+ </sup>and CF<sup>- </sup>values (termed non-directional motifs (NDMs)), and 8-mers that peak preferentially on one strand (termed directional motifs (DMs)) and thus have significantly different CF<sup>+ </sup>and CF<sup>- </sup>values (Figure <figr fid="F3">3a</figr>). Indeed, many motifs are randomly positioned on one strand and &gt;20-fold enriched at a given position of the opposite strand. These two distinct types of motifs are potentially bound by proteins that have different roles in transcription regulation. The 8-mers with a high CF<sup>+ </sup>but a low CF<sup>- </sup>contain directional information and could be binding sites for core promoter selectivity factors. In contrast, in both human promoter sets, we observed a significant number of 8-mers that peak on both strands (Figure <figr fid="F3">3b,c</figr>), and few that preferentially peak on one strand (as shown below, these are predominantly TATA and INR-like sequences). While the human DBTSS dataset contains a greater number of DMs than does the UCSC dataset, both sets are clearly more biased toward NDM than is the <it>Drosophila </it>dataset. These data suggest that there is a significant difference in the sequence organization of promoters between these human and <it>Drosophila </it>datasets.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Scatter plots showing the strand dependence of 8-mer localization, and the comparison of localization between different organisms (<it>Drosophila </it>and human)</p>
					</caption>
					<text>
						<p>Scatter plots showing the strand dependence of 8-mer localization, and the comparison of localization between different organisms (<it>Drosophila </it>and human). The clustering factors for all 8-mers, calculated for 20 bp bins, are plotted on the positive (CF<sup>+</sup>) versus the negative (CF<sup>-</sup>) strand for <b>(a) </b><it>Drosophila</it>, <b>(b) </b>human (UCSC), and <b>(c) </b>human (DBTSS) promoters. The 256 palindromic sequences have equivalent CF<sup>+</sup>/CF<sup>- </sup>values but are plotted with a CF<sup>- </sup>value of -1. Comparison of CF values of 8-mers for <b>(d) </b>human (UCSC) versus <it>Drosophila</it>, <b>(e) </b>human (DBTSS) versus <it>Drosophila</it>, and <b>(f) </b>human (UCSC) versus human (DBTSS). Common elements should lie along the diagonal.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p><it>Drosophila </it>and human 8-mers that peak are different</p>
				</st>
				<p>Are the motifs that peak in humans similar to the motifs that peak in <it>Drosophila</it>? To answer this, we directly compared the CF values for all 8-mers between human and <it>Drosophila </it>(Figure <figr fid="F3">3d,e</figr>). The majority of 8-mers with high CF values are different between the two species. In contrast, 8-mers with the largest CF values are common between the two human datasets (Figure <figr fid="F3">3f</figr>), lending confidence to the idea that the differences between the two species are real.</p>
			</sec>
			<sec>
				<st>
					<p>Fifteen DNA motifs that cluster in <it>Drosophila</it></p>
				</st>
				<p>To determine the statistical significance of the CF<sup>+ </sup>values, we converted the CF<sup>+ </sup>into a probability term using the 8-mer frequencies observed in the 10,914 <it>Drosophila </it>promoter dataset. The probability term, <it>P</it>, represents -log<sub>10</sub>(1 - <it>p</it>), where <it>p </it>is the area under the normalized curve of the distribution of CF<sub>expt</sub>. A high <it>P </it>value indicates that it is very unlikely that the peak for the 8-mer occurs by chance. A plot of the <it>P </it>values versus the most populated bin number (Figure <figr fid="F4">4a</figr>) shows a group of 8-mers near the TSS whose distributions are very unlikely to occur by chance. We analyzed the 298 8-mers that have a <it>P </it>value &#8805; 16. All these 8-mers had peaks centered between -100 bp and +40 bp. As illustrated in Figure <figr fid="F4">4a</figr>, <it>P </it>&#8805; 16 is a conservative cutoff. We plotted CF<sup>+ </sup>versus CF<sup>- </sup>for these 298 sequences to examine their strand specific localization (Figure <figr fid="F4">4b</figr>). DMs (black circles) predominate, but NDMs (red circles) were also identified.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>8-mer localization in <it>Drosophila </it>expressed as a probability term, and characteristics of the most statistically relevant 8-mers</p>
					</caption>
					<text>
						<p>8-mer localization in <it>Drosophila </it>expressed as a probability term, and characteristics of the most statistically relevant 8-mers. <b>(a) </b>The probability term P = -log<sub>10</sub>(1 - <it>p</it>) for the 13,552 8-mers with a maximum bin containing &#8805;15 members. The 298 DNA sequences above the line at <it>P </it>= 16, a 1 in 1 &#215; 10<sup>16 </sup>(single sampling) chance of being random, were analyzed in more detail. <b>(b) </b>Clustering factors for both the positive (CF<sup>+</sup>) and negative strand (CF<sup>-</sup>) were plotted for the 298 most significant peaking 8-mers. The distribution falls into two distinct groupings; those that display a symmetric distribution on both strands (red circles) and those that cluster on only one strand (black circles). <b>(c) </b>A histogram showing the number of promoters containing each of the 15 motifs, grouped into three classes, DMp1 to 5, DMv1 to 5, and NDM1 to 5. We also present the common name and the consensus sequence.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-4"/>
				</fig>
				<p>The 298 8-mer sequences were manually grouped into 15 families and a consensus motif was determined for each family (Figure <figr fid="F5">5</figr>). The placement of an 8-mer into a particular motif was guided by: the similarity amongst DNA sequences; the shape of the distribution histogram; the peak position relative to the TSS; and whether the 8-mer was directional or non-directional. The total number of 8-mers in each of the 15 motifs varied dramatically, with over one-third of the 298 8-mers representing variations of the INR motif (TCAGTY) and 8 motifs were represented by 5 or fewer 8-mers. We determined the abundance of the 15 motifs by counting unique promoters that contained a motif in the peak (Figure <figr fid="F4">4c</figr>). A total of 6,067 promoters contain one or more of the 15 motifs. The most abundant motif is the non-directional DRE, found in 15% (1,593) of <it>Drosophila </it>promoters, followed by directional INR, found in 14% (1,501) of promoters. The least abundant motif identified, DMp5, is found in 0.7% (80) of all promoters.</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>The 15 DNA motifs derived from grouping 298 octamers whose probability of having a non-random distribution was less than 1 &#215; 10<sup>-16</sup></p>
					</caption>
					<text>
						<p>The 15 DNA motifs derived from grouping 298 octamers whose probability of having a non-random distribution was less than 1 &#215; 10<sup>-16</sup>. The table is grouped into two panels. <b>(a) </b>presents the 10 directional motifs, while <b>(b) </b>shows the five non-directional motifs. We present: the sequence logo; the consensus sequence using IUPAC letters to represent degenerate bases - R (G, A), W (A, T), Y (T, C), K (G, T), M(A, C), S (G, C), N (A, T, G, C); the name assigned in this work; the common name if it exists; designations from previous work [10]; the number of 8-mers that peaked that were placed in the family; peak location as base-pairs relative to the TSS; clustering factor (CF<sup>+</sup>) on the positive strand; clustering factor (CF<sup>-</sup>) on the negative strand; the bins that were pooled to define the peak; and the unique genes in the peak.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-5"/>
				</fig>
				<p>Figure <figr fid="F6">6</figr> presents the distribution of each of the 15 consensus motifs, showing the number of occurrences on each DNA strand. To gain more insight into how constrained motif position is relative to the TSS, we examined the distribution of the 15 DNA motifs at a single base-pair resolution. The inserts in Figure <figr fid="F6">6</figr> show the single base-pair distribution plots for the motifs in the region -100 to +100 relative to the TSS. Five of the DMs (Figure <figr fid="F6">6a-e</figr>) are positioned at a single base-pair resolution relative to the TSS while the other five DMs (Figure <figr fid="F6">6f-j</figr>) and the five NDMs (Figure <figr fid="F6">6k-o</figr>) are spread across a broad region of up to 50 bp, though they all clustered near the TSS. We thus classified the DMs as either precise or variably positioned. The DMs are named DMp1 to 5 (for directional motif precise) and DMv1 to 5 (for directional motif variable). The NDMs are named NDM1 to 5. Where a motif has a previous common name we use that name, for example, DMp1 is TATA, DMp2 is INR, DMp4 and DMp5 are DPE-like, NDM1 is GAGA and NDM4 is downstream responsive element (DRE). The single base-pair resolution plots not only reveal the precise versus variable positioning of the motifs, they also reveal the power of the initial analysis based on 20 bp bins. Many of the motifs (DMvs and NDMs) would not have been identified at a single base-pair resolution. Also, the number of promoters identified that contain a specific motif is much greater at a 20 bp resolution than a 1 bp resolution (for example, for INR there are approximately 1,500 versus approximately 400).</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>The distribution of the 15 identified motifs in <it>Drosophila </it>promoters</p>
					</caption>
					<text>
						<p>The distribution of the 15 identified motifs in <it>Drosophila </it>promoters. <b>(a-o) </b>The number of occurrences of each motif, in each 20 bp bin, for the positive strand (solid red) and the negative strand (dashed black). The inserts show the same data plotted at a single nucleotide resolution from -100 bp to +100 bp relative to the TSS. Inserts for the directional motifs (DMp1 to 5 and DMv1 to 5) show the distribution on the positive strand only, while those for the non-directional motifs (NDM1 to 5) show the distribution for both strands. (a-e) The directional motifs that have a precise localization (DMp); (f-j) the directional motifs with a variable localization (DMv); (k-o) the non-directional motifs that all have a variable localization (NDM).</p>
					</text>
					<graphic file="gb-2006-7-7-r53-6"/>
				</fig>
				<p>To further examine the localization of DNA sequences at a single base-pair resolution, we examined the CF<sup>+ </sup>values of all 6-mers for both <it>Drosophila </it>and human promoters (Figure <figr fid="F7">7</figr>). We chose 6-mers to produce enough occurrences at each base pair position to be able to determine peaks reliably. The <it>Drosophila </it>data (Figure <figr fid="F7">7a</figr>) showed three distinct regions in which individual 6-mers were preferentially localized. Examination of the DNA sequences that cluster around each of these three positions indicated they can be grouped into a single motif that is localized at a specific base-pair position relative to the TSS. The three motifs are TATA, INR and DPE. Where promoters have two of these motifs, they are precisely positioned relative to each other (Figure <figr fid="F7">7d</figr>).</p>
				<fig id="F7">
					<title>
						<p>Figure 7</p>
					</title>
					<caption>
						<p>The localization, on the positive strand, of all 4,096 6-mers in <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>The localization, on the positive strand, of all 4,096 6-mers in <it>Drosophila </it>and human promoters. Clustering factor (CF<sup>+</sup>) for the positive strand, plotted at a single base-pair resolution, at the position of the most populated bp, for all 4,096 6-mers. <b>(a) </b>CF<sup>+ </sup>from 10,914 <it>Drosophila </it>promoters; <b>(b) </b>CF<sup>+ </sup>from 15,011 human (UCSC); <b>(c) </b>CF<sup>+ </sup>from 12,926 human (DBTSS) promoters; <b>(d) </b>the exact placement of <it>Drosophila </it>TATA, INR variants, and DPE variants relative to each other. The sequence is broken into 10 bp segments.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-7"/>
				</fig>
				<p>The clustering of 6-mers at a single base-pair resolution in the UCSC human promoters showed generally lower CF<sup>+ </sup>values and only two peaks corresponding to the TATA and INR positions (Figure <figr fid="F7">7b</figr>). While the DBTSS dataset (Figure <figr fid="F7">7c</figr>) showed more pronounced peaks than the UCSC dataset, it still failed to show a clear DPE peak. Examination of the sequences localized under the main human (DBTSS) peaks produced a result similar to that seen form <it>Drosophila</it>. The sequences lying under the TATA peak were exclusively TATA-like sequences. The sequences under the INR peak represented INR variants localized exactly at the TSS and other NDMs, predominantly erythroblast transformation specific (ETS), localized close to the TSS. However, the variety of INR sequences that localized in the human dataset was greater than that seen for the <it>Drosophila </it>data. Attempts to identify distinct human INR motifs six nucleotides or greater were unsuccessful due to the wide degeneracy in sequences that surround the prominent central CA core.</p>
			</sec>
			<sec>
				<st>
					<p>Comparison of <it>Drosophila </it>and human motifs that peak</p>
				</st>
				<p>We examined if motifs that peak in <it>Drosophila </it>also peak in human and vice-versa. Of the 15 <it>Drosophila </it>motifs that peaked, four also localized in human promoters (TATA, INR, DPE1 and NDM2; Figure <figr fid="F8">8a,b,d,l</figr>) with INR, DPE1 and NDM2 occurring at much lower frequency in human promoters. While both the human and <it>Drosophila </it>promoters showed a clear overabundance of the CA dimer at the TSS (Figure <figr fid="F1">1d</figr>), we were previously <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> unable to detect an INR signal in human promoters using the degenerate human consensus sequence (YYANWYY). However, mapping the <it>Drosophila </it>INR motif (TCAGTY) to human promoters does produce a weak peak at the TSS in the UCSC dataset and a more pronounced peak in the DBTSS dataset (Figure <figr fid="F8">8b</figr>). Analysis of this peak at a 1 bp resolution (Figure <figr fid="F8">8x</figr>) revealed that both human datasets contain significantly fewer of these precisely positioned elements than does the <it>Drosophila </it>dataset. This result suggests that this TCAGTY motif plays a less significant role in human gene transcription than it does in <it>Drosophila</it>, and agrees with previous findings that the human INR is more degenerate than its <it>Drosophila </it>counterpart. It should be noted that in all cases, the motifs that contained a peak in one human dataset also showed peaks in the other human dataset, although the DBTSS dataset showed more pronounced peaks. This confirms both the qualitative similarity of the two datasets and the suggestion that the DBTSS data contains greater numbers of accurately positioned TSSs. Of the eight motifs previously identified to abundantly peak in humans <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, only TATA also peaked in <it>Drosophila </it>promoters (Figure <figr fid="F9">9</figr>).</p>
				<fig id="F8">
					<title>
						<p>Figure 8</p>
					</title>
					<caption>
						<p>The distribution of 15 '<it>Drosophila </it>specific' motifs in <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>The distribution of 15 '<it>Drosophila </it>specific' motifs in <it>Drosophila </it>and human promoters. <b>(a-o) </b>The number of occurrences of each of the 15 identified <it>Drosophila </it>motifs in each 20 bp bin for <it>Drosophila </it>(dotted black), human (UCSC; solid red) and human (DBTSS; dashed blue) promoters. For the ten directional motifs, only the occurrences on the positive strand are represented. For the five non-directional elements, the occurrences on both the positive and negative strand are represented. <b>(x) </b>The distributions of the INR motif (TGACTY), from -100 to +100, for both <it>Drosophila </it>and human promoters at a single base-pair resolution. The number of occurrences of each element has been normalized, based on a dataset of 10,000 promoters, to compensate for the different sizes of the datasets.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-8"/>
				</fig>
				<fig id="F9">
					<title>
						<p>Figure 9</p>
					</title>
					<caption>
						<p>The distribution of 8 'human specific' motifs in <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>The distribution of 8 'human specific' motifs in <it>Drosophila </it>and human promoters. <b>(a-h) </b>The number of occurrences of each previously identified [11] human specific motif in each 20 bp bin for <it>Drosophila </it>(dotted black), human (UCSC; solid red) and human (DBTSS; dashed blue) promoters. The number of occurrences of each element has been normalized, based on a dataset of 10,000 promoters, to compensate for the different sizes of the datasets.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-9"/>
				</fig>
				<p>In comparing the distributions of the <it>Drosophila </it>and human motifs, it is apparent that some sequences, even when they occur outside of the peak, display different abundances for the two organisms. This is true for DRE (Figure <figr fid="F8">8n</figr>), which peaks in <it>Drosophila </it>but is also a highly abundant motif outside of the peak (total of 7,058 across 1,500 bp of 10,914 promoters). In humans, there is no indication of any clustering, and this element is also very rare (total of 1,015 across 1,500 bp of 15,011 promoters). The reciprocal observation is made for human promoters, where SP1 (Figure <figr fid="F9">9h</figr>) is characterized by a very large peak and is also abundant outside of the peak but is virtually absent from <it>Drosophila </it>core promoters. In contrast, the INR (Figure <figr fid="F8">8b</figr>), which peaks in both organisms, albeit on different scales, shows very similar total abundance in both organisms (a total of 17,377 and 20,320 occurrences across 1,500 bp, in 10,914 and 15,011 promoters, for <it>Drosophila </it>and human, respectively).</p>
			</sec>
			<sec>
				<st>
					<p>E-box motifs that peak in both <it>Drosophila </it>and humans</p>
				</st>
				<p>NDM5 (CAGCTSWW) is a derivative of the general DNA sequence termed an E-box (CANNTG) that is bound by B-HLH-ZIP transcription factors, including the oncogene Myc|Max. A recent paper <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> has shown that an E-box sequence is located near the TSS of <it>Drosophila </it>genes. The sequence CACGTG is the core of the upstream stimulatory factor (USF) sequence previously identified in humans to peak near the TSS <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. We compared the distribution of these related sequences in <it>Drosophila </it>and human. The USF consensus sequence (TCACGTGR) does not show any clustering in <it>Drosophila </it>(Figure <figr fid="F9">9b</figr>). However, the 6-mer E-box variants CACGTG and CAGCTG have peaks in both human and <it>Drosophila </it>promoters (Figure <figr fid="F10">10a,b</figr>). In <it>Drosophila</it>, the sequence CACGTG peaks downstream of the TSS while in human it peaks upstream of the TSS. The E-box variant CAGCTG peaks in both human and <it>Drosophila </it>just upstream of the TSS. Figures <figr fid="F9">9c,d</figr> highlight two E-box 8-mer variants with dramatically different peaking properties where sequences outside a conserved 6-mer define the peaking properties of the 8-mer. The sequence RCACGTCY peaks only in <it>Drosophila </it>while YCACGTGR peaks only in human, suggesting that distinct B-HLH proteins bind these related sequences.</p>
				<fig id="F10">
					<title>
						<p>Figure 10</p>
					</title>
					<caption>
						<p>E-box variants that peak in <it>Drosophila </it>and human promoters</p>
					</caption>
					<text>
						<p>E-box variants that peak in <it>Drosophila </it>and human promoters. <b>(a-d) </b>The number of occurrences of <b>(a) </b>CACGTG,<b>(b) </b>CAGCTG, <b>(c) </b>RCACGTGY and <b>(d) </b>YCACGTGR in each 20 bp bin for <it>Drosophila </it>(dotted black), human (UCSC; solid red), and human (DBTSS; dashed blue) promoters.</p>
					</text>
					<graphic file="gb-2006-7-7-r53-10"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Correlation of different DNA motifs in the same promoter</p>
				</st>
				<p>We examined correlations in the occurrence of the 15 peaking motifs in <it>Drosophila </it>to gain insight into their potential combinatorial or redundant function. Table <tblr tid="T1">1</tblr> presents a matrix showing: the number of promoters that contain one motif in a peak that also contain a second motif in a peak (a); the frequency of this co-occurrence (b); and the probability (c). There is a complex pattern of positive and negative correlation for individual motifs, suggesting that combinations of motifs act to regulate core promoter function.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>The co-occurrence in the same promoter of DNA motifs that cluster</p>
					</caption>
					<tblbdy cols="19">
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Motif</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>DMp1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMp2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMp3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMp4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMp5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMv1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMv2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMv3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMv4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DMv5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>NDM1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>NDM2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>NDM3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>NDM4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>NDM5</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Ohler no.</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="right">
								<p>
									<b>3</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>4</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="right">
								<p>
									<b>9</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="right">
								<p>
									<b>8</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>7</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>1</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>6</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="right">
								<p>
									<b>2</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>5</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Name</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>TATA</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>INR</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>INR1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DPE1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>DPE2</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>GAGA</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>DRE</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>E-box</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>8289</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>511</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1501</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>113</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>80</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>147</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>604</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>649</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>287</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>359</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>424</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>215</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1593</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1184</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="19">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>(a)</b>
								</p>
							</c>
							<c ca="left">
								<p>STATAAA</p>
							</c>
							<c ca="left">
								<p>DMp1</p>
							</c>
							<c ca="center">
								<p>511</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>98</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>10</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>28</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>21</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>26</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCAGTY</p>
							</c>
							<c ca="left">
								<p>DMp2</p>
							</c>
							<c ca="center">
								<p>1501</p>
							</c>
							<c ca="center">
								<p>98</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c ca="center">
								<p>25</p>
							</c>
							<c ca="center">
								<p>
									<b>43</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>15</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>18</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>34</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>17</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>12</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>100</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>108</b>
								</p>
							</c>
							<c ca="center">
								<p>38</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>67</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>112</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCATTCG</p>
							</c>
							<c ca="left">
								<p>DMp3</p>
							</c>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGGACGT</p>
							</c>
							<c ca="left">
								<p>DMp4</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>25</p>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>KCGGTTSK</p>
							</c>
							<c ca="left">
								<p>DMp5</p>
							</c>
							<c ca="center">
								<p>147</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>43</b>
								</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CARCCCT</p>
							</c>
							<c ca="left">
								<p>DMv1</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>15</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>
									<ul>13</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>18</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>79</b>
								</p>
							</c>
							<c ca="center">
								<p>46</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGYAACR</p>
							</c>
							<c ca="left">
								<p>DMv2</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>18</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>15</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>59</p>
							</c>
							<c ca="center">
								<p>
									<b>64</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAYCNCTA</p>
							</c>
							<c ca="left">
								<p>DMv3</p>
							</c>
							<c ca="center">
								<p>604</p>
							</c>
							<c ca="center">
								<p>
									<ul>10</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>34</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>13</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>18</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>16</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>282</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>63</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GGYCACAC</p>
							</c>
							<c ca="left">
								<p>DMv4</p>
							</c>
							<c ca="center">
								<p>649</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>17</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>18</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>15</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>18</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>64</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c ca="center">
								<p>95</p>
							</c>
							<c ca="center">
								<p>
									<ul>59</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGTATTT</p>
							</c>
							<c ca="left">
								<p>DMv5</p>
							</c>
							<c ca="center">
								<p>287</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>12</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>64</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>26</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>38</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAGAGCG</p>
							</c>
							<c ca="left">
								<p>NDM1</p>
							</c>
							<c ca="center">
								<p>359</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>
									<b>100</b>
								</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>
									<ul>5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>28</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGMYGYCR</p>
							</c>
							<c ca="left">
								<p>NDM2</p>
							</c>
							<c ca="center">
								<p>424</p>
							</c>
							<c ca="center">
								<p>28</p>
							</c>
							<c ca="center">
								<p>
									<b>108</b>
								</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>
									<ul>7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>16</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5</ul>
								</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>33</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>34</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAAAGCT</p>
							</c>
							<c ca="left">
								<p>NDM3</p>
							</c>
							<c ca="center">
								<p>215</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>38</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>
									<ul>1</ul>
								</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>12</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2</ul>
								</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>22</ul>
								</p>
							</c>
							<c ca="center">
								<p>33</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>ATCGATA</p>
							</c>
							<c ca="left">
								<p>NDM4</p>
							</c>
							<c ca="center">
								<p>1593</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>21</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>67</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>79</b>
								</p>
							</c>
							<c ca="center">
								<p>59</p>
							</c>
							<c ca="center">
								<p>
									<b>282</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>95</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>26</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>33</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>22</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>265</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAGCTSWW</p>
							</c>
							<c ca="left">
								<p>NDM5</p>
							</c>
							<c ca="center">
								<p>1184</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>26</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>112</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9</ul>
								</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>46</p>
							</c>
							<c ca="center">
								<p>
									<b>64</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>63</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>59</ul>
								</p>
							</c>
							<c ca="center">
								<p>38</p>
							</c>
							<c ca="center">
								<p>
									<ul>28</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>34</ul>
								</p>
							</c>
							<c ca="center">
								<p>33</p>
							</c>
							<c ca="center">
								<p>
									<b>265</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Unique</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>4156</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>304</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>932</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>58</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>30</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>48</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>146</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>146</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>220</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>366</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>141</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>165</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>195</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>88</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>783</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>534</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="19">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>8289</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>511</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1501</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>113</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>80</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>147</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>604</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>649</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>287</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>359</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>424</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>215</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1593</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1184</b>
								</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>(b)</b>
								</p>
							</c>
							<c ca="left">
								<p>STATAAA</p>
							</c>
							<c ca="left">
								<p>DMp1</p>
							</c>
							<c ca="center">
								<p>511</p>
							</c>
							<c ca="center">
								<p>4.7</p>
							</c>
							<c ca="center">
								<p>6.5</p>
							</c>
							<c ca="center">
								<p>8.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.9</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>5.3</p>
							</c>
							<c ca="center">
								<p>6.6</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>2.2</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCAGTY</p>
							</c>
							<c ca="left">
								<p>DMp2</p>
							</c>
							<c ca="center">
								<p>1501</p>
							</c>
							<c ca="center">
								<p>19.2</p>
							</c>
							<c ca="center">
								<p>13.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>10.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>31.3</p>
							</c>
							<c ca="center">
								<p>
									<b>29.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4.8</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.8</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>2.6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>27.9</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>25.5</b>
								</p>
							</c>
							<c ca="center">
								<p>17.7</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>9.5</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCATTCG</p>
							</c>
							<c ca="left">
								<p>DMp3</p>
							</c>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="center">
								<p>1.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.8</p>
							</c>
							<c ca="center">
								<p>1.2</p>
							</c>
							<c ca="center">
								<p>2.3</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGGACGT</p>
							</c>
							<c ca="left">
								<p>DMp4</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.7</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.7</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.8</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>KCGGTTSK</p>
							</c>
							<c ca="left">
								<p>DMp5</p>
							</c>
							<c ca="center">
								<p>147</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>2.9</b>
								</p>
							</c>
							<c ca="center">
								<p>4.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.9</p>
							</c>
							<c ca="center">
								<p>2.6</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CARCCCT</p>
							</c>
							<c ca="left">
								<p>DMv1</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.0</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.9</p>
							</c>
							<c ca="center">
								<p>5.1</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>
									<b>5.0</b>
								</p>
							</c>
							<c ca="center">
								<p>3.9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGYAACR</p>
							</c>
							<c ca="left">
								<p>DMv2</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>5.1</p>
							</c>
							<c ca="center">
								<p>2.9</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>
									<b>5.4</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAYCNCTA</p>
							</c>
							<c ca="left">
								<p>DMv3</p>
							</c>
							<c ca="center">
								<p>604</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>2.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>5.5</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>17.7</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5.3</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GGYCACAC</p>
							</c>
							<c ca="left">
								<p>DMv4</p>
							</c>
							<c ca="center">
								<p>649</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>6.0</p>
							</c>
							<c ca="center">
								<p>
									<b>22.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>5.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>6.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>5.0</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGTATTT</p>
							</c>
							<c ca="left">
								<p>DMv5</p>
							</c>
							<c ca="center">
								<p>287</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.8</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>9.9</b>
								</p>
							</c>
							<c ca="center">
								<p>2.6</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAGAGCG</p>
							</c>
							<c ca="left">
								<p>NDM1</p>
							</c>
							<c ca="center">
								<p>359</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>
									<b>6.7</b>
								</p>
							</c>
							<c ca="center">
								<p>8.9</p>
							</c>
							<c ca="center">
								<p>12.5</p>
							</c>
							<c ca="center">
								<p>9.5</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>6.1</p>
							</c>
							<c ca="center">
								<p>8.4</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>0.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.4</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGMYGYCR</p>
							</c>
							<c ca="left">
								<p>NDM2</p>
							</c>
							<c ca="center">
								<p>424</p>
							</c>
							<c ca="center">
								<p>5.5</p>
							</c>
							<c ca="center">
								<p>
									<b>7.2</b>
								</p>
							</c>
							<c ca="center">
								<p>4.4</p>
							</c>
							<c ca="center">
								<p>7.5</p>
							</c>
							<c ca="center">
								<p>7.5</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>7.2</p>
							</c>
							<c ca="center">
								<p>3.9</p>
							</c>
							<c ca="center">
								<p>2.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.9</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAAAGCT</p>
							</c>
							<c ca="left">
								<p>NDM3</p>
							</c>
							<c ca="center">
								<p>215</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.5</p>
							</c>
							<c ca="center">
								<p>4.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>4.8</p>
							</c>
							<c ca="center">
								<p>2.3</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>5.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.8</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>ATCGATA</p>
							</c>
							<c ca="left">
								<p>NDM4</p>
							</c>
							<c ca="center">
								<p>1593</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4.1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>4.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>7.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>2.7</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>25.4</b>
								</p>
							</c>
							<c ca="center">
								<p>19.0</p>
							</c>
							<c ca="center">
								<p>
									<b>46.7</b>
								</p>
							</c>
							<c ca="center">
								<p>14.6</p>
							</c>
							<c ca="center">
								<p>
									<ul>9.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>1.7</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>7.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>10.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>14.6</p>
							</c>
							<c ca="center">
								<p>
									<b>22.4</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAGCTSWW</p>
							</c>
							<c ca="left">
								<p>NDM5</p>
							</c>
							<c ca="center">
								<p>1184</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>11.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>12.2</p>
							</c>
							<c ca="center">
								<p>14.8</p>
							</c>
							<c ca="center">
								<p>
									<b>20.6</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>10.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>9.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>13.2</p>
							</c>
							<c ca="center">
								<p>
									<ul>7.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>8.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>15.4</p>
							</c>
							<c ca="center">
								<p>
									<b>16.6</b>
								</p>
							</c>
							<c ca="center">
								<p>10.9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Unique</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>59.5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>62.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>51.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>37.5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>32.7</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>47.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>47.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>36.4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>56.4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>49.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>46.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>46.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>40.9</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>49.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>45.1</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="19">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>8289</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>511</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1501</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>113</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>80</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>147</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>311</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>604</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>649</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>287</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>359</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>424</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>215</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1593</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1184</b>
								</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>(c)</b>
								</p>
							</c>
							<c ca="left">
								<p>STATAAA</p>
							</c>
							<c ca="left">
								<p>DMp1</p>
							</c>
							<c ca="center">
								<p>511</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>1.1</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>14.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.4</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCAGTY</p>
							</c>
							<c ca="left">
								<p>DMp2</p>
							</c>
							<c ca="center">
								<p>1501</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>4.1</p>
							</c>
							<c ca="center">
								<p>
									<b>5.9</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>10.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>22.6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.0</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>11.8</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>10.1</b>
								</p>
							</c>
							<c ca="center">
								<p>0.9</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>40.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.6</ul>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TCATTCG</p>
							</c>
							<c ca="left">
								<p>DMp3</p>
							</c>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.1</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGGACGT</p>
							</c>
							<c ca="left">
								<p>DMp4</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>4.1</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>0.7</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>KCGGTTSK</p>
							</c>
							<c ca="left">
								<p>DMp5</p>
							</c>
							<c ca="center">
								<p>147</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5.9</b>
								</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CARCCCT</p>
							</c>
							<c ca="left">
								<p>DMv1</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>6.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>
									<b>6.3</b>
								</p>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGYAACR</p>
							</c>
							<c ca="left">
								<p>DMv2</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.1</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>1.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<b>6.3</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAYCNCTA</p>
							</c>
							<c ca="left">
								<p>DMv3</p>
							</c>
							<c ca="center">
								<p>604</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>10.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.5</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.8</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>3.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>84.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GGYCACAC</p>
							</c>
							<c ca="left">
								<p>DMv4</p>
							</c>
							<c ca="center">
								<p>649</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.3</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>22.6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.6</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.1</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>19.9</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGGTATTT</p>
							</c>
							<c ca="left">
								<p>DMv5</p>
							</c>
							<c ca="center">
								<p>287</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.0</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>19.9</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>3.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAGAGCG</p>
							</c>
							<c ca="left">
								<p>NDM1</p>
							</c>
							<c ca="center">
								<p>359</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>
									<b>11.8</b>
								</p>
							</c>
							<c ca="center">
								<p>2.1</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>7.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>3.9</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>2.5</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>16.8</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CGMYGYCR</p>
							</c>
							<c ca="left">
								<p>NDM2</p>
							</c>
							<c ca="center">
								<p>424</p>
							</c>
							<c ca="center">
								<p>1.1</p>
							</c>
							<c ca="center">
								<p>
									<b>10.1</b>
								</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.7</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.9</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>2.5</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>GAAAGCT</p>
							</c>
							<c ca="left">
								<p>NDM3</p>
							</c>
							<c ca="center">
								<p>215</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.9</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.3</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>ATCGATA</p>
							</c>
							<c ca="left">
								<p>NDM4</p>
							</c>
							<c ca="center">
								<p>1593</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>14.2</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>40.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.3</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.5</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>6.3</b>
								</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>
									<b>84.2</b>
								</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>
									<ul>2.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>16.8</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>4.7</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.1</ul>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>13.5</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>CAGCTSWW</p>
							</c>
							<c ca="left">
								<p>NDM5</p>
							</c>
							<c ca="center">
								<p>1184</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.4</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<ul>5.6</ul>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.4</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.0</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
							<c ca="center">
								<p>
									<b>6.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.1</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>0.8</ul>
								</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>
									<ul>1.2</ul>
								</p>
							</c>
							<c ca="center">
								<p>1.3</p>
							</c>
							<c ca="center">
								<p>
									<b>13.5</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The 15 motifs are grouped into three groups, DMp1 to 5, DMv1 to 5, and NDM1 to 5. <b>(a) </b>The number of promoters that contain two motifs, each that occurs in a peak, was determined. To the left are the 15 motifs followed by the number of their occurrences in the peak. <b>(b) </b>The frequency of promoters containing one motif also containing a second motif. DMp1 (TATA) for example, is found in 4.7% of all promoters but occurs in 6.5% of promoters that contain DMp2 (INR). <b>(c) </b>The probability. Throughout all three panels of the table, positive correlations are shown as normal numbers, negative correlations are underlined and if the probability term has a value <it>p </it>&#8804; 10<sup>-5</sup>, one in 100,000, then the numbers are in bold. For example, INR is found in 1,501 promoters, which is 13.8% of all promoters. However, in the 1,593 DRE promoters, the INR only occurs in 4.2% of them. This observed under-representation or negative correlation has a one in 10<sup>40 </sup>probability occurring by chance.</p>
					</tblfn>
				</tbl>
				<p>For the precisely positioned directional motifs (DMp1 to 5: TATA, INR, INR1, DPE, and DPE1), promoters that contain INR also preferentially contain either the TATA or DPE sequence. However, TATA and DPE motifs negatively correlate. All five members of the DMp class negatively correlate with some or all of the DMv class. DMp1 to 5 positively correlate with three of the NDMs (NDM1 to 3) but negatively correlate with NDM4 and NDM5.</p>
				<p>The five variably positioned directional motifs (DMv1 to 5) have both positive and negative correlations amongst themselves and with the NDMs. The DMv class members positively correlate with NDM4 and NDM5 and negatively correlate with NDM1 to 3, correlations that are exactly the opposite of those observed for the DMp class (see above). On average, members of the NDM class positively correlate with each other. Positive correlations between motifs suggest the possibility of physical interactions between the proteins that bind the co-occurring DNA motifs. Negative correlations, as are observed between the precisely positioned DMs (DMp) and the variably positioned DMs (DMv), suggest that the proteins that bind them have distinct functions.</p>
			</sec>
			<sec>
				<st>
					<p>Consensus DNA motifs correlate with biological function</p>
				</st>
				<p>The non-random distribution of individual motifs and motif combinations at core promoters strongly suggests that the identified motifs are biologically significant and promoters that share the same motif in a peak may also share similar biological functions. To evaluate this possibility, we calculated statistical over- and under-representation of 5,200 Gene Ontology (GO) annotation terms <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> for <it>Drosophila </it>genes whose promoters contained any of the 15 motifs, either within the peak or elsewhere in the promoter region. We found highly significant correlations (<it>p </it>&lt; 10<sup>-4</sup>) for each motif only when they occurred in the peak (Figure <figr fid="F11">11a</figr>). With one exception, the simple presence elsewhere within the 1,500 bp promoter region does not correlate with GO terms, demonstrating that the position of a motif in the promoter is critical for predicting biological function, as was observed in human promoters <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The directional positioned motifs, DMp and DMv, not only co-occur in promoters with either NDM1 to 3 or NDM4 and NDM5, respectively, but also correlate with similar GO terms. This indicates a combinatorial code of motifs at core promoters directing batteries of genes.</p>
				<fig id="F11">
					<title>
						<p>Figure 11</p>
					</title>
					<caption>
						<p>Correlations between DNA motifs in promoters and function (GO terms and mRNA expression properties)</p>
					</caption>
					<text>
						<p>Correlations between DNA motifs in promoters and function (GO terms and mRNA expression properties). In both sections of the figure, promoter lists in blue are DMp, green are DMv, and red are NDM. Control groups with the DNA motifs not in the peak but between -1,000 bp and +499 bp are in black with an asterisk.<b>(a) </b>False-color image of representation bias in GO terms and mRNA expression clusters for the 15 DNA motifs, either in the peak or elsewhere in the promoter region. Values plotted are -log<sub>10</sub>(<it>p </it>value) calculated by Fisher's exact test. Data for the 54 most strongly correlated GO terms are shown (some redundant GO terms are removed). On the far left are results for over/under representation in self-organizing map (SOM) clusters identified from previously published expression data [20]. Over-represented categories are colored in red and under-represented categories are in blue. N values displayed at the top are total numbers of genes in the reference set assigned to that group. <b>(b) </b>False-color image of hierarchically clustered median percentile ranks of mRNA expression ratios, for previously published data for embryo and adult samples [21]. Each ratio represents expression relative to a global mean across arrays. Columns represent each of 89 array experiments, clustered so that embryo samples are at left and adult samples are at right. 'All Promoters' represents all genes and shows no preferences (median percentile rank = 50).</p>
					</text>
					<graphic file="gb-2006-7-7-r53-11"/>
				</fig>
				<p>Additional insight can be inferred by examining individual GO terms that correlate. For example, <it>Drosophila </it>mitochondrial ribosomal genes contain the E-box (<it>p </it>&lt; 10<sup>-8</sup>). In contrast, promoters of human mitochondrial ribosomal genes contain the ETS motif, a motif that peaks in human but not in <it>Drosophila</it>. Thus, even though the mitochondrial ribosomal genes are highly conserved, their regulation is evolving.</p>
				<p>If core promoter motifs are used to drive the expression of gene batteries participating in a common biological process, this should be evident in global gene expression profiles. We turned to <it>Drosophila </it>mRNA expression patterns determined by micoarray experiments <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp> to evaluate whether genes that are co-expressed have the same motif in their promoters. Figure <figr fid="F11">11a</figr> shows correlations between all 15 motifs, either in the peak or elsewhere in the promoter region, and gene expression in testis (male germline), ovary (female germline), and soma. The presence of TATA in the peak in the promoter positively correlates with gene expression in somatic tissue but negatively correlates with expression in germline tissue. The presence of positioned DMv3 to 5, and DRE in promoters positively correlates with female germline expression and negatively correlates with male germline expression. If the motif occurs outside the peak, few correlations are observed, supporting the conclusion that motif position is functionally important.</p>
				<p>We see more striking correlations between promoter motifs and mRNA expression in the embryonic and adult stages of <it>Drosophila </it>development that express different sets of genes. Figure <figr fid="F11">11b</figr> presents a hierarchal clustering of mRNA expression for 89 samples from a survey of gene expression in embryos and adults for promoters containing any of the 15 motifs (either in or outside the peak). Genes with motifs in the peak show strong mRNA expression differences between embryo and adult samples, suggesting that these motifs help direct the differential utilization of the genome between embryos and adult. Genes with promoters containing DMv1 to 5 and co-occurring NDM4 and NDM5 are preferentially active in the embryo. In contrast, genes with promoters containing the three abundant precisely positioned directional motifs (TATA, INR, and DPE) and the co-occurring NDM1 to 3 are preferentially active in the adult.</p>
			</sec>
			<sec>
				<st>
					<p>INR derivatives</p>
				</st>
				<p>Both <it>Drosophila </it>and human promoters have a CA peak exactly at the TSS in a significant number of promoters. About 2,100 <it>Drosophila </it>promoters contain the CA sequence at the TSS but only 400 of these are part of the consensus INR sequence (TCAGTY). We examined the remaining promoter sequences for related INR sequences and identified 4 more motifs, resulting in 1,080 promoters with INR related sequences exactly positioned at the TSS. To evaluate if these INR related sequences correlate with distinct functions or are variants of a single motif, we investigated the correlation of the INR variants with different biological properties by examining GO terms and mRNA expression properties. Figure <figr fid="F12">12a</figr> shows that the variant INR motifs have distinct patterns of enrichment with categories of GO terms. Similarly, the developmental mRNA expression analysis (Figure <figr fid="F12">12b</figr>) indicates that one of the INR motif variants (BCACWS) is preferentially associated with genes with embryonic expression while the other variants are preferentially associated with adult expression genes. While some of the GO categories enriched for specific INR variants (for example, mesoderm development) appear at odds with the adult/embryo expression patterns, the overall impression suggests that these variant INR sequences are functionally distinct and may be recognized by distinct proteins. The discrepancies between the GO term enrichment and adult/embryo expression patterns can be explained if one assumes that the preferential use of INR signals is not absolute. Thus, even though there is a general trend toward preferential use of different elements at different stages in development, certain genes may use the 'adult INRs' during embryogenesis.</p>
				<fig id="F12">
					<title>
						<p>Figure 12</p>
					</title>
					<caption>
						<p>Correlations between five INR variants localized exactly at the TSS in promoters and function (GO terms and mRNA expression properties)</p>
					</caption>
					<text>
						<p>Correlations between five INR variants localized exactly at the TSS in promoters and function (GO terms and mRNA expression properties). <b>(a) </b>False-color image of representation bias in GO terms and mRNA expression clusters for the five variants of the INR motif in the peak. Values are calculated and displayed as in Figure 11a. The 42 most strongly correlated GO terms are shown. Note that each INR variant correlates with different GO terms. <b>(b) </b>False-color image of hierarchically clustered median percentile ranks of mRNA expression ratios, for previously published data for embryo and adult samples 21. Data are calculated and displayed as in Figure 1</p>
					</text>
					<graphic file="gb-2006-7-7-r53-12"/>
				</fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>We have determined the localization of all 8-mers in 10,914 <it>Drosophila </it>and two sets of human promoters (UCSC, 15,011 promoters; DBTSS, 12,926 promoters) aligned relative to the TSS and have identified DNA motifs that are non-randomly distributed in each dataset. Though we examined the region between -1,000 bp and +499 bp, all peaks are within 100 bp of the TSS. Two dramatic differences are observed between <it>Drosophila </it>and human promoters. First, there is little overlap in the DNA motifs that localize in the promoters of these two species. Second, of the 15 motifs identified in <it>Drosophila </it>promoters, 10 are directional DNA motifs (DNA sequences that occur on the positive but not the negative strand of DNA), while in human, promoters TATA, INR and DPE1 are the only DMs. We suggest that these DMs may be binding sites for core promoter selectivity factors <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. While there is little overlap between motifs identified in <it>Drosophila </it>and human, both organisms contain identifiable TATA and INR core promoter elements, with humans having only a barely discernable DPE element. The identification of common elements in both species indicates a fundamental similarity in core promoter organization, as would be expected because the proteins that bind these sequences are conserved in both species.</p>
			<p>A comparison of the promoter structures of two organisms depends on the quality of the data being analyzed. In an attempt to ensure that our results were not biased by differences in the quality of annotation of the TSS of the <it>Drosophila </it>and human genomes, we have analyzed three datasets. We used the annotation from the UCSC Genome Browser for both <it>Drosophila </it>and human to construct a dataset of promoters that represents the standard view of these genomes. Additionally, we have constructed a set of promoters based on annotations from the human DBTSS <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, a database specifically aimed at correctly identifying the TSS through the use of full-length cDNA cloning methods. As shown in Figure <figr fid="F1">1d</figr>, all three datasets show distinct CA peaks at the TSS, with the <it>Drosophila </it>peak being intermediate in amplitude between the two human datasets. The qualitative similarity of the findings of the two human datasets suggests that the differences we observe between the <it>Drosophila </it>and human promoters are not due to differences in the quality of the underlying datasets. Additionally, the fact that both <it>Drosophila </it>and human datasets are sufficiently aligned with respect to the TSS is exemplified by our ability to readily identify over-represented, localized 8-mers in all datasets. We note that our technique is aimed at finding abundant over-represented, localized motifs that have a low degree of degeneracy. Thus, our inability to find a given motif in an organism could indicate one of four possibilities: the motif is absent; the motif is present in low abundance; the motif is present but is highly degenerate; or the motif is present but not significantly constrained with respect to its position relative to the TSS.</p>
			<p>Previous work has addressed the DNA sequence of <it>Drosophila </it>promoters. However, these studies have either examined a limited number of promoters or did not examine the position of motifs relative to the TSS. Kutach and Kadonaga <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> examined a set of 200 <it>Drosophila </it>promoters and identified four types of promoters characterized by containing TATA only (29%), DPE only (26%), TATA + DPE (14%), or neither DNA motif (31%). Our global analysis looks at a much larger set of <it>Drosophila </it>promoters and finds a lower proportion of genes with these sequences. Instead of 60% of promoters containing a TATA motif, we find only 4.7% and, instead of 40% of promoters containing a DPE motif, we find only 2.1% of promoters that contain these motifs. Kutach and Kadonaga <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> used a less stringent criterion to define the motifs and it is also possible that the 200 promoters examined were biased towards TATA and DPE. They observed a conserved distance between the INR and DPE motifs and experimentally demonstrated that the conserved distance is critical for optimal function. This conserved distance is confirmed in our global analysis.</p>
			<p>Another analysis of 2,000 <it>Drosophila </it>promoters identified 10 motifs that are conserved near the TSS <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>; we identified 15 motifs, including 9 of the 10 identified by Ohler <it>et al</it>. The motif that did not peak in our analysis is motif ten element (MTE), a downstream element important for initiation <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Our global analysis extends this analysis of 2,000 promoters. We show that many of the identified DNA motifs occur on only one strand of DNA and are uniquely positioned relative to the TSS. Furthermore, the 