<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2006-7-2-r15</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Comparison of dot chromosome sequences from <it>D. melanogaster </it>and <it>D. virilis </it>reveals an enrichment of DNA transposon sequences in heterochromatic domains</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Slawson</snm>
					<mi>E</mi>
					<fnm>Elizabeth</fnm>
					<insr iid="I1"/>
					<email>libbyslawson@gmail.com</email>
				</au>
				<au id="A2">
					<snm>Shaffer</snm>
					<mi>D</mi>
					<fnm>Christopher</fnm>
					<insr iid="I1"/>
					<email>shaffer@biology.wustl.edu</email>
				</au>
				<au id="A3">
					<snm>Malone</snm>
					<mi>D</mi>
					<fnm>Colin</fnm>
					<insr iid="I1"/>
					<email>cdmalone@artsci.wustl.edu</email>
				</au>
				<au id="A4">
					<snm>Leung</snm>
					<fnm>Wilson</fnm>
					<insr iid="I1"/>
					<email>wleung@artsci.wustl.edu</email>
				</au>
				<au id="A5">
					<snm>Kellmann</snm>
					<fnm>Elmer</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A6">
					<snm>Shevchek</snm>
					<mi>B</mi>
					<fnm>Rachel</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A7">
					<snm>Craig</snm>
					<mi>A</mi>
					<fnm>Carolyn</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A8">
					<snm>Bloom</snm>
					<mi>M</mi>
					<fnm>Seth</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A9">
					<snm>Bogenpohl</snm>
					<fnm>James</fnm>
					<suf>II</suf>
					<insr iid="I2"/>
				</au>
				<au id="A10">
					<snm>Dee</snm>
					<fnm>James</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A11">
					<snm>Morimoto</snm>
					<mi>TA</mi>
					<fnm>Emiko</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A12">
					<snm>Myoung</snm>
					<fnm>Jenny</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A13">
					<snm>Nett</snm>
					<mi>S</mi>
					<fnm>Andrew</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A14">
					<snm>Ozsolak</snm>
					<fnm>Fatih</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A15">
					<snm>Tittiger</snm>
					<mi>E</mi>
					<fnm>Mindy</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A16">
					<snm>Zeug</snm>
					<fnm>Andrea</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A17">
					<snm>Pardue</snm>
					<fnm>Mary-Lou</fnm>
					<insr iid="I3"/>
				</au>
				<au id="A18">
					<snm>Buhler</snm>
					<fnm>Jeremy</fnm>
					<insr iid="I4"/>
					<email>jbuhler@cse.wustl.edu</email>
				</au>
				<au id="A19">
					<snm>Mardis</snm>
					<mi>R</mi>
					<fnm>Elaine</fnm>
					<insr iid="I5"/>
					<email>emardis@watson.wustl.edu</email>
				</au>
				<au id="A20" ca="yes">
					<snm>Elgin</snm>
					<mi>CR</mi>
					<fnm>Sarah</fnm>
					<insr iid="I1"/>
					<email>selgin@biology.wustl.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Biology Department, Washington University, St Louis, MO 63130, USA</p>
				</ins>
				<ins id="I2">
					<p>Member, Bio 4342 class, Washington University, St Louis, MO 63130, USA</p>
				</ins>
				<ins id="I3">
					<p>Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA</p>
				</ins>
				<ins id="I4">
					<p>Computer Science and Engineering, Washington University, St Louis, MO 63130, USA</p>
				</ins>
				<ins id="I5">
					<p>Genome Sequencing Center and Department of Genetics, Washington University, St Louis, MO 63108, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2006</pubdate>
			<volume>7</volume>
			<issue>2</issue>
			<fpage>R15</fpage>
			<url>http://genomebiology.com/2006/7/2/R15</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">16507169</pubid><pubid idtype="doi">10.1186/gb-2006-7-2-r15</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>1</day>
					<month>8</month>
					<year>2005</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>15</day>
					<month>9</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>25</day>
					<month>1</month>
					<year>2006</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>20</day>
					<month>2</month>
					<year>2006</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2006</year>
			<collab>Slawson et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p><it>Drosophila </it>dot chromosomes</p>
		</shorttitle>
		<shortabs>
			<p>Sequencing and analysis of fosmid hybridization to the dot chromosomes of <it>Drosophila virilis </it>and <it>D. melanogaster </it>suggest that repetitive elements and density are important in determining higher-order chromatin packaging.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Chromosome four of <it>Drosophila melanogaster</it>, known as the dot chromosome, is largely heterochromatic, as shown by immunofluorescent staining with antibodies to heterochromatin protein 1 (HP1) and histone H3K9me. In contrast, the absence of HP1 and H3K9me from the dot chromosome in <it>D. virilis </it>suggests that this region is euchromatic. <it>D. virilis </it>diverged from <it>D. melanogaster </it>40 to 60 million years ago.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Here we describe finished sequencing and analysis of 11 fosmids hybridizing to the dot chromosome of <it>D. virilis </it>(372,650 base-pairs) and seven fosmids from major euchromatic chromosome arms (273,110 base-pairs). Most genes from the dot chromosome of <it>D. melanogaster </it>remain on the dot chromosome in <it>D. virilis</it>, but many inversions have occurred. The dot chromosomes of both species are similar to the major chromosome arms in gene density and coding density, but the dot chromosome genes of both species have larger introns. The <it>D. virilis </it>dot chromosome fosmids have a high repeat density (22.8%), similar to homologous regions of <it>D. melanogaster </it>(26.5%). There are, however, major differences in the representation of repetitive elements. Remnants of DNA transposons make up only 6.3% of the <it>D. virilis </it>dot chromosome fosmids, but 18.4% of the homologous regions from <it>D. melanogaster</it>; <it>DINE-1 </it>and <it>1360 </it>elements are particularly enriched in <it>D. melanogaster</it>. Euchromatic domains on the major chromosomes in both species have very few DNA transposons (less than 0.4 %).</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>Combining these results with recent findings about RNAi, we suggest that specific repetitive elements, as well as density, play a role in determining higher-order chromatin packaging.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>FDNA in the eukaryotic interphase nucleus can broadly be distinguished as packaged into two different forms of chromatin, heterochromatin and euchromatin <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Classically, heterochromatin has been described as the fraction that remains highly condensed in interphase, has high affinity for DNA-specific dyes, and is commonly seen around the periphery of the nucleus <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Heterochromatic regions of the genome have very low rates of meiotic recombination and generally replicate late in S phase. These regions are rich in repetitive sequences, including remnants of transposable elements and retroviruses, as well as simple repeats (satellite DNA). Heterochromatin tends to be gene poor, and those genes found in heterochromatin tend to be larger (longer transcripts) than genes found in euchromatin <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Introns of heterochromatic genes have a much higher density of transposable elements than introns of euchromatic genes, accounting for this shift <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The less densely packaged euchromatin contains most of the actively transcribed genes. In contrast to this general picture of repeat distribution, Pardue <it>et al</it>. <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> have found by <it>in situ </it>hybridization that the frequency of (dC-dA)&#183;(dG-dT) dinucleotide repeats is higher in euchromatin than in heterochromatin.</p>
			<p>Several biochemical marks have been identified that distinguish heterochromatin from euchromatin, including a distinctive pattern of histone modification and the association of particular chromosomal proteins <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. High concentrations of heterochromatin protein 1 (HP1) are found primarily in pericentric heterochromatin and associated with telomeres in organisms from the yeast <it>Schizosaccharomyces pombe </it>to mammals <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Histones in euchromatic domains are typically hyperacetylated, particularly the amino-terminal tails of H3 and H4. In contrast, methylation of histone H3 at lysine 9 (producing H3K9me) is a consistent mark of heterochromatin <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. HP1 binds to H3K9me through its chromo domain and to SU(VAR)3-9, a methyltransferase that specifically modifies histone H3 at K9, through its chromo shadow domain <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. These interactions are thought to contribute to heterochromatin maintenance and spreading <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The functional significance of this chromatin packaging is demonstrated by the observation that loss-of-function mutations in the gene for HP1, including one that disrupts binding of HP1 to H3K9me, result in a loss of silencing of reporter genes placed in or near heterochromatin (suppression of position effect variegation) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
			<p>Chromosome four of <it>Drosophila melanogaster</it>, also known as the dot chromosome or the F element, is unique in its chromatin composition. The banded portion (amplified during polytenization) is 1.2 Mb long with 82 genes; this gene density is similar to that of the euchromatic regions of the major (euchromatic) chromosome arms <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. However, the fourth chromosome also displays many characteristics of heterochromatin, including late replication <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and a complete lack of meiotic recombination <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The banded region of chromosome 4 is known to have an approximately ten-fold higher density of repetitive elements (for example, remnants of retroviruses, transposable elements) in comparison with the long arms of chromosomes 2, 3, and X <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, but has little or no (dC-dA)&#183;(dG-dT) dinucleotide repeats <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, again resembling heterochromatin rather than euchromatin. Immunofluorescent staining of polytene chromosomes with antibodies directed against HP1 shows an abundance of HP1 in a banded pattern on chromosome four <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A very similar pattern is seen with antibodies directed against H3K9me <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B21">21</abbr></abbrgrp>.</p>
			<p>A transposable <it>P </it>element containing an <it>hsp70</it>-driven <it>white </it>(<it>w</it>) gene has been a useful reporter of chromatin packaging, giving a uniform red eye phenotype when inserted into the euchromatic arms but a variegating phenotype when inserted into the pericentric heterochromatin or into telomere associated sequences <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The variegating phenotype is associated with packaging into a nucleosome array showing more uniform spacing, accompanied by a loss of DNase hypersensitive (DH) sites <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Transposition events resulting in insertions on the fourth chromosome produce both variegating and solid red eye phenotypes. The data suggest that while the fourth chromosome of <it>D. melanogaster </it>is largely heterochromatic, it also includes some euchromatic domains <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
			<p><it>P </it>element transposition-induced deletions and duplications of small genomic regions around the genes <it>Hcf </it>and <it>CG2052 </it>on chromosome four have been shown to cause switching of eye phenotypes from red to variegating and vice versa <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Mapping of the breakpoints has shown that the small deletions and duplications lead to changes in the distance of the reporter from a particular DNA transposon, <it>1360 (</it>also known as <it>hoppel </it>or PROTOP_A). In the region of the fourth chromosome studied, if the inserted <it>P </it>element is within approximately 10 kilobases (kb) of a <it>1360 </it>element, the <it>white </it>reporter gene has a greater than 90% chance of exhibiting variegating expression, suggesting it is in a heterochromatic domain. If the reporter is more than 10 kb away from a <it>1360 </it>element, it has a greater than 90% chance of generating a red eye phenotype, suggesting that it is in a euchromatic domain. Therefore, Sun <it>et al</it>. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> have suggested that proximity to the <it>1360 </it>element can influence the chromatin packaging state. Recent results from fungi and plants <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, as well as <it>Drosophila </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp> have shown that heterochromatin formation is dependent on the RNA interference (RNAi) system. Small double-stranded (ds)RNAs have been recovered from many of the repetitive elements in <it>Drosophila</it>, including <it>1360 </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, and might target repetitive elements in the genome for silencing by initiation and spreading of heterochromatin packaging.</p>
			<p>The small dot chromosome exists in many species of <it>Drosophila </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. It has long been recognized that phenotypes of similar mutations map to the dot chromosomes of both <it>D. melanogaster </it>and <it>D. virilis </it><abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Podemski <it>et al</it>. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> have shown that probes for several genes from the <it>D. melanogaster </it>fourth chromosome, including <it>ci </it>and <it>Caps</it>, hybridize to the dot chromosome in <it>D. virilis</it>. <it>D. virilis </it>is a member of a <it>Drosophila </it>genus that diverged from <it>D. melanogaster </it>40 to 60 million years ago <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In addition to the sex chromosomes, it has four large autosomes, rather than the two of <it>D. melanogaster</it>; thus, the dot chromosome of <it>D. virilis </it>is chromosome six. The polytenized regions of both dot chromosomes are similar in size. In this study, we will refer to chromosome six of <it>D. virilis </it>and chromosome four of <it>D. melanogaster </it>as dot chromosomes. Our analysis concerns the banded 1.2 Mb region of these chromosomes, estimated to contain approximately 80 genes.</p>
			<p>Prior reports indicated that the dot chromosome of <it>D. virilis </it>does not share the heterochromatic characteristics of the dot chromosome of <it>D. melanogaster</it>, despite the fact that it maintains a similar proximity to the heterochromatic chromocenter, as seen in polytene nuclei. <it>In situ </it>hybridizations performed by Lowenhaupt <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> demonstrated that the (dC-dA)&#183;(dG-dT) dinucleotide repeat frequency of the <it>D. virilis </it>dot chromosome is similar to that in its euchromatic arms. In contrast to the observations using <it>D. melanogaster</it>, recombination is observed on the <it>D. virilis </it>dot chromosome <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B34">34</abbr></abbrgrp>. Further, the polytenized portion of the dot chromosome in <it>D. virilis </it>fails to stain with antibodies directed against HP1 <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> (Figure <figr fid="F1">1b</figr>).</p>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>Immunofluorescent staining of the polytene chromosomes</p>
				</caption>
				<text>
					<p>Immunofluorescent staining of the polytene chromosomes. Polytene chromosomes from <b>(a) </b><it>D. melanogaster </it>and <b>(b) </b><it>D. virilis </it>are shown. Top left, phase contrast; others as labeled. Panels on the right provide a close-up of the chromocenter and the dot chromosome. In the merge picture, yellow represents equal staining, red represents more H3K9me staining, and green represents more HP1 staining. The dot chromosome is indicated with an arrow. In <it>D. melanogaster</it>, antibodies for HP1 and H3K9me stain both the chromocenter and the dot chromosome, although the HP1 staining is slightly stronger than the H3K9me staining on the dot. In <it>D. virilis</it>, both antibodies stain the chromocenter but neither stains the dot chromosome.</p>
				</text>
				<graphic file="gb-2006-7-2-r15-1" hint_layout="double"/>
			</fig>
			<p>Comparative genomics has been invaluable in discovering new functional and regulatory elements in the genomes of a cluster of yeast species, using <it>Saccharomyces cerevisiae </it>as the reference point <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. We believe this comparative approach will be equally valuable as comparisons of <it>Drosophila </it>species become possible <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. If the gene compositions of the dot chromosomes of <it>D. melanogaster </it>and <it>D. virilis </it>are similar, what other differences in the DNA sequence could lead to the apparent difference in higher-order chromatin structure? To address this question, we have generated a finished, clone-based sequence for a sample from the <it>D. virilis </it>dot chromosome and from the long chromosome arms; finished sequence leads to more accurate inferences about repetitive sequences <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. By comparing similar regions of the two dot chromosomes, we show that while the overall repeat density of the dot chromosomes is similar, the density of DNA transposon remnants is significantly higher in <it>D. melanogaster </it>than in <it>D. virilis</it>; the difference is particularly striking for the <it>DINE-1 </it>elements and <it>1360 </it>elements, discussed above. These results, combined with recent findings about RNAi, lead us to suggest that the difference in chromatin packaging between the dot chromosomes of these two species of <it>Drosophila </it>could be a function of the density and distribution of a subclass of repetitive elements.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Immunofluorescent staining indicates that the <it>D. virilis </it>dot chromosome is largely euchromatic, in contrast to the heterochromatic <it>D. melanogaster </it>dot chromosome</p>
				</st>
				<p>The dot chromosome of <it>D. melanogaster </it>is largely heterochromatic, with some interspersed domains of euchromatin <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Immunofluorescent staining of <it>D. melanogaster </it>polytene chromosomes using HP1 antibody shows a banded pattern on the dot chromosome. Many species in the <it>Drosophila </it>genus closely related to <it>D. melanogaster </it>share this staining pattern, including <it>D. simulans</it>, <it>D. yakuba</it>, and <it>D. pseudoobscura </it>(data not shown). In <it>D. melanogaster</it>, staining with an antibody against histone H3 methylated at lysine 9 (anti-H3K9me) coincides with the HP1 staining, at a level slightly less than seen in the pericentric heterochromatin <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> (Figure <figr fid="F1">1a</figr>). In contrast, the dot chromosome of <it>D. virilis </it>does not stain with either anti-HP1 or anti-H3K9me (Figure <figr fid="F1">1b</figr>), supporting the inference that the banded portion of the dot chromosome of <it>D. virilis </it>is generally euchromatic.</p>
			</sec>
			<sec>
				<st>
					<p>Identification of fosmids from the dot chromosome of <it>D. virilis</it></p>
				</st>
				<p>The chromosomes of <it>D. virilis </it>tend to map to corresponding portions of the chromosomes of <it>D. melanogaster </it><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. We compared the recently posted genomic sequence for <it>D. pseudoobscura </it><abbrgrp><abbr bid="B37">37</abbr><abbr bid="B40">40</abbr></abbrgrp> with the <it>D. melanogaster </it>dot chromosome genes to look for regions of sufficient sequence similarity to act as conserved hybridization probes. The desired probes (see Materials and methods) were radiolabeled and used to screen a <it>D. virilis </it>genomic library (BDVIF01 fosmids, Tucson strain 15010-1001.10, available spotted on a single filter) at low stringency. Positive clones were verified and characterized by <it>in situ </it>hybridizations to the polytene chromosomes from third instar larval salivary glands of <it>D. virilis</it>. Sample results are shown in Figure <figr fid="F2">2</figr>. Eleven fosmids were recovered with homology to the dot chromosome of <it>D. virilis</it>, and seven fosmids were recovered with homology to the major chromosome arms. Based on the <it>in situ </it>hybridization results, the order of the fosmid clones on the dot chromosome is as follows: contigs 30, 103, and 106 appear to cluster near the centromere; contigs 67, 72, and 91 are in the middle of the chromosome; and contigs 50 and 113 hybridize near the telomere. There is also a minor signal with the contig 30 probe near the telomere; this may be the result of a repetitive element present in multiple regions in the chromosome.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p><it>In situ </it>hybridizations of fosmids to <it>D. virilis </it>polytene chromosomes</p>
					</caption>
					<text>
						<p><it>In situ </it>hybridizations of fosmids to <it>D. virilis </it>polytene chromosomes. Fosmid DNA was labeled and used for <it>in situ </it>hybridization on denatured polytene chromosomes from <it>D. virilis</it>. Three examples are shown (left to right: contigs 106, 72, 113) demonstrating hybridization to a specific band on the dot chromosome (arrowhead). In some cases, signal is associated with the chromocenter, presumably due to repetitive sequences shared with the band on the dot. <it>In situ </it>hybridizations were performed with at least one fosmid from every contig from the dot chromosome with similar results (data not shown). See Table 1 for the chromosome locations of the other fosmids.</p>
					</text>
					<graphic file="gb-2006-7-2-r15-2" hint_layout="single"/>
				</fig>
				<tbl id="T1" hint_layout="double">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Annotation of the <it>D. virilis </it>contigs</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="right">
								<p>Contig</p>
							</c>
							<c ca="left">
								<p>BACPAC</p>
							</c>
							<c ca="left">
								<p>Genes</p>
							</c>
							<c ca="center">
								<p>Size (bp)</p>
							</c>
							<c ca="center">
								<p>Repeat analysis</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="5" ca="left">
								<p><it>D. virilis </it>dot chromosome fosmids</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>30</p>
							</c>
							<c ca="left">
								<p>15E14, 12E24</p>
							</c>
							<c ca="left">
								<p><it>pan </it>(4), <it>CG32005 </it>(4), <it>Caps </it>(4)</p>
							</c>
							<c ca="center">
								<p>61,074</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>103</p>
							</c>
							<c ca="left">
								<p>44I5</p>
							</c>
							<c ca="left">
								<p><it>CG5367 </it>(2L), <it>CG11093 </it>(4), <it>CG32016 </it>(4), <it>Glu-RA </it>(4)</p>
							</c>
							<c ca="center">
								<p>39,850</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>106</p>
							</c>
							<c ca="left">
								<p>39O6</p>
							</c>
							<c ca="left">
								<p><it>toy </it>(4), <it>plexA </it>(4)</p>
							</c>
							<c ca="center">
								<p>40,734</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>67</p>
							</c>
							<c ca="left">
								<p>23A13, 15G13</p>
							</c>
							<c ca="left">
								<p><it>Ephrin </it>(4), <it>CG1970 </it>(4), <it>Pur-alpha </it>(4), <it>Thd1 </it>(4), <it>zfh2 </it>(4)</p>
							</c>
							<c ca="center">
								<p>54,154</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>72</p>
							</c>
							<c ca="left">
								<p>3G18</p>
							</c>
							<c ca="left">
								<p><it>sv </it>(4), <it>lgs </it>(4), <it>onecut </it>(4), <it>CG1909 </it>(4), <it>Ephrin </it>(4)</p>
							</c>
							<c ca="center">
								<p>43,948</p>
							</c>
							<c ca="center">
								<p>No</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>91</p>
							</c>
							<c ca="left">
								<p>42I6</p>
							</c>
							<c ca="left">
								<p>predicted gene, <it>CG31992 </it>(4), <it>Eph </it>(4), <it>CaMKI </it>(4)</p>
							</c>
							<c ca="center">
								<p>39,292</p>
							</c>
							<c ca="center">
								<p>No</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>50</p>
							</c>
							<c ca="left">
								<p>38M22, 34I22</p>
							</c>
							<c ca="left">
								<p><it>bt </it>(4), <it>Arc70 </it>(4), <it>CG11148 </it>(4), C <it>G11152 </it>(4)</p>
							</c>
							<c ca="center">
								<p>56,333</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="left">
								<p>47B4</p>
							</c>
							<c ca="left">
								<p><it>CG2052 </it>(4)</p>
							</c>
							<c ca="center">
								<p>37,265</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="5" ca="left">
								<p><it>D. virilis </it>fosmids from major chromosomes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="left">
								<p>43O10</p>
							</c>
							<c ca="left">
								<p><it>CG32521 </it>(X), <it>Tim13 </it>(3L)</p>
							</c>
							<c ca="center">
								<p>40,809</p>
							</c>
							<c ca="center">
								<p>No</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="left">
								<p>26E5</p>
							</c>
							<c ca="left">
								<p>pseudogene, <it>CG31337 </it>(3R)</p>
							</c>
							<c ca="center">
								<p>40,479</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="left">
								<p>22L1, 42E12</p>
							</c>
							<c ca="left">
								<p><it>CG14129 </it>(3L), <it>CG5917 </it>(3L), <it>CG1732 </it>(4), <it>CG14130 </it>(3L), <it>CG9384 </it>(3L), <it>Trl </it>(3L), <it>CG9343 </it>(3L), <it>ome </it>(3L)</p>
							</c>
							<c ca="center">
								<p>68,774</p>
							</c>
							<c ca="center">
								<p>No</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>112</p>
							</c>
							<c ca="left">
								<p>10J19</p>
							</c>
							<c ca="left">
								<p><it>CG10440 </it>(2R), <it>Egfr </it>(2R)</p>
							</c>
							<c ca="center">
								<p>34,783</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>121</p>
							</c>
							<c ca="left">
								<p>18G4</p>
							</c>
							<c ca="left">
								<p><it>CG17267 </it>(3R), <it>cdc2c </it>(3R), <it>Oamb </it>(3R)</p>
							</c>
							<c ca="center">
								<p>47,154</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>122</p>
							</c>
							<c ca="left">
								<p>36E24</p>
							</c>
							<c ca="left">
								<p><it>Syn </it>(3R), <it>CG12814 </it>(3R), <it>Best1 </it>(3R), <it>CG6995 </it>(3R)</p>
							</c>
							<c ca="center">
								<p>41,111</p>
							</c>
							<c ca="center">
								<p>Yes</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The table lists contigs sequenced from <it>D. virilis</it>. The top section lists contigs from the dot chromosome of <it>D. virilis </it>in approximate order on the chromosome from centromere to telomere (as determined by <it>in situ </it>hybridization). The bottom section lists contigs from major chromosomes of <it>D. virilis </it>in an arbitrary order. The contig name is followed by the number(s) of the fosmid clone(s) sequenced (BACPAC Center at CHORI [69]). Genes are listed in the order in which they occur in the contig, with the number in parentheses representing the chromosome in which the homologous gene is found in the <it>D. melanogaster </it>genome. The total size of the contig is given; the final column indicates whether the contig was used in the repeat analysis (see Materials and methods).</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Fosmid sequencing and annotation</p>
				</st>
				<p>The 18 fosmids recovered from the screen were sequenced in collaboration with the Genome Sequencing Center at Washington University School of Medicine. Plasmid subclone libraries were prepared and approximately 600 subclones from each fosmid were end sequenced. The sequences were assembled and finished to high quality by Washington University undergraduate students in the Bio 4342 'Research Explorations in Genomics' course, using <it>phred</it>, <it>phrap</it>, and <it>consed </it><abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Finished sequences had an estimated error rate of less than 0.01%, and showed <it>in silico </it>restriction digests that matched digests obtained from the starting fosmid with a minimum of two enzymes. Students annotated the finished sequences by looking for genes, repetitive elements, and other features as described in Materials and methods. Four pairs of fosmids have significant sequence overlap; each pair was collapsed into a single contig of non-redundant sequence (contigs 30, 50, 67, and 80).</p>
				<p>Initial annotation focused on gene finding. <it>D. virilis </it>is evolutionarily close enough to <it>D. melanogaster </it>that the protein coding regions are well conserved. Gene prediction algorithms and local alignment search tools (such as GENSCAN and BLAST; see Materials and methods) were used to annotate genes and determine intron-exon boundaries. In most cases, it was possible to identify the entire coding region of the gene, but the high level of sequence divergence made defining untranslated regions impossible <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Comparison of the <it>D. virilis </it>contigs with homologous regions of the <it>D. melanogaster </it>dot chromosome identified specific regions where synteny has been maintained, as well as those regions where inversions have occurred. Figure <figr fid="F3">3</figr> shows a comparison of two <it>D. virilis </it>contigs with the homologous regions from the <it>D. melanogaster </it>chromosomes. Detailed annotation results and comparisons between the other individual <it>D. virilis </it>fosmids and their homologous regions in <it>D. melanogaster </it>are available as Additional data file 1 (dot chromosome sequences) and Additional data file 2 (non-dot chromosome sequences). Note that the strain of <it>D. virilis </it>used here is a different strain from that recently sequenced (by Agencourt Bioscience Corporation, Beverly, MA, USA). The two strains differ by about 1% base substitutions, with numerous insertions or deletions (indels), but show similar organization at the gene level (CDS, unpublished observation). The clone-based sequencing used here results in more accurate inferences in regions that are highly repetitive; the sequences most likely to be missed in whole genome shotgun techniques are the repeats <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Map for two sample contigs from <it>D. virilis </it>(Dv) in comparison with homologous regions of the <it>D. melanogaster </it>(Dm) genome. Shown are two contigs from <it>D. virilis </it>with the corresponding regions from <it>D. melanogaster</it></p>
					</caption>
					<text>
						<p>Map for two sample contigs from <it>D. virilis </it>(Dv) in comparison with homologous regions of the <it>D. melanogaster </it>(Dm) genome. Shown are two contigs from <it>D. virilis </it>with the corresponding regions from <it>D. melanogaster</it>. Coding sequences (dark blue boxes) are indicated above each diagram. In the case of <it>D. melanogaster</it>, the thick dark blue bar indicates open reading frames (ORFs), and the thin aqua bar indicates UTRs; only ORFs are identified for <it>D. virilis</it>. Repeat sequences are shown below: red boxes are DNA transposon fragments, while other repetitive elements are represented as yellow boxes. <b>(a) </b>Contig 112 represents a clone from one of the large chromosomes of <it>D. virilis</it>. While the orientations of <it>Egfr </it>and <it>CG10440 </it>are the same with respect to each other, there is a large tandem repeat between the two genes in <it>D. virilis</it>, but not in <it>D. melanogaster</it>. <b>(b) </b>Contig 67 represents a clone from the dot chromosome of <it>D. virilis</it>. The structure of the genomic region is similar to the corresponding region in <it>D. melanogaster</it>, but there is more intergenic space in <it>D. virilis</it>, whereas in <it>D. melanogaster</it>, there are more transposable elements in the introns. All of the fosmids described here with homologous regions in <it>D. melanogaster </it>have been annotated in a similar manner; the maps are available in the Additional data files. Scale: one division equals 5 kb.</p>
					</text>
					<graphic file="gb-2006-7-2-r15-3" hint_layout="double"/>
				</fig>
				<p>Table <tblr tid="T1">1</tblr> shows all contigs sequenced, giving their total sizes, listing annotated genes, and providing clone names (BACPAC Center). <it>In situ </it>hybridization results identified the fosmids as either on the dot chromosome or on a major <it>D. virilis </it>chromosome. In parentheses following each gene is the chromosome position of the gene in the genome of <it>D. melanogaster</it>. Figure <figr fid="F4">4</figr> maps the contigs from the dot chromosome of <it>D. virilis </it>to the dot chromosome of <it>D. melanogaster </it>based on the presence of orthologous genes. Three of the contigs (67, 106, and 113) are completely syntenic with respect to the <it>D. melanogaster </it>dot chromosome. One contig, 103, is completely syntenic with respect to its genes from the dot chromosome, but also contains <it>CG5367</it>, a gene from the second chromosome of <it>D. melanogaster</it>. Four contigs (30, 72, 50, and 91) contain genes that are exclusively from the dot chromosome of <it>D. melanogaster </it>but show evidence of a high number of inversions with respect to the <it>D. melanogaster </it>chromosome. For example, contig 30 contains both <it>pan </it>and <it>Caps</it>, genes that come from opposite sides of the banded portion of the <it>D. melanogaster </it>dot chromosome. (This rearrangement was also observed in earlier studies <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.) Of the 28 genes identified in the <it>D. virilis </it>dot chromosome clones, only one lies elsewhere in the <it>D. melanogaster </it>genome. In the <it>D. virilis </it>contigs from major chromosomes, four (contigs 13, 112, 121, 122) are completely syntenic compared to homologous gene regions from <it>D. melanogaster</it>, and two (contigs 11 and 80) show inversions within the chromosomes. Only one major chromosome contig (80) contains a gene that is found on the dot chromosome in <it>D. melanogaster</it>. Contig 80 maps to a major arm of <it>D. virilis; </it>it contains <it>D. melanogaster </it>dot chromosome gene <it>CG1732 </it>flanked by several genes from <it>D. melanogaster </it>chromosome 3. In total, the fosmids sequenced represent 372,650 bp of sequence from the dot chromosome of <it>D. virilis </it>and 273,110 bp of sequence from the major chromosomes. <it>D. virilis </it>contigs 72 and 91 from the dot chromosome and 11 and 80 from the major arms showed so much rearrangement that it was impossible to define precise homologous area(s) from <it>D. melanogaster</it>. These contigs were not used in comparisons for intron size, percent DNA transcribed, or in any of the repeat density calculations. Maps representing locations and sizes of genes and repeats in each contig are available in Additional data files 1 and 2.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Map of the <it>D. virilis </it>(Dv) dot chromosome contigs in relation to the dot chromosome of <it>D. melanogaster </it>(Dm)</p>
					</caption>
					<text>
						<p>Map of the <it>D. virilis </it>(Dv) dot chromosome contigs in relation to the dot chromosome of <it>D. melanogaster </it>(Dm). Shown at the bottom is a map of the genes on the <it>D. melanogaster </it>dot chromosome. Colored bars with labels represent genes for which we have identified a (complete or partial) homologue in the <it>D. virilis </it>fosmids sequenced. Colored boxes above the scale bar are schematic (not to scale) representations of the <it>D. virilis </it>contigs. Immediately above the scale bar is a representation of those sequenced contigs that contain syntenic regions from <it>D. virilis</it>, where genes are in the same order and orientation as in <it>D. melanogaster</it>. In the uppermost portion of the figure are the contigs mapping to the <it>D. virilis </it>dot chromosome that are rearranged with respect to the <it>D. melanogaster </it>dot chromosome. Boxes are color-coded to represent the genes present in the contig, with dashed lines connecting to show the extent of rearrangement. Notably, contig 30 contains both <it>pan </it>and <it>Caps</it>, which lie on opposite sides of the banded portion of the <it>D. melanogaster </it>dot chromosome.</p>
					</text>
					<graphic file="gb-2006-7-2-r15-4" hint_layout="double"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Average intron size and percent DNA transcribed</p>
				</st>
				<p>While centromeric regions are rich in satellite DNA and relatively gene poor <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, gene density (defined as the number of genes per Mb) in the banded portion of the dot chromosome is similar to the major chromosomes of <it>D. melanogaster </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp> (66.5 genes/Mb for the dot and 74.6 genes/Mb for the major chromosomes for the regions analyzed here). This is also true for the regions of the <it>D. virilis </it>genome we have sequenced (62.2 genes/Mb for the dot and 67.3 genes/Mb for major chromosomes). Observation of those few heterochromatic genes that have been cloned and sequenced (for example, <it>light </it><abbrgrp><abbr bid="B44">44</abbr></abbrgrp>) suggests that these genes may have larger introns on average, and this has been reported for <it>D. melanogaster </it>dot chromosome genes <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Average intron size, defined as total intron length divided by total number of introns, is 448 bp (&#177; 126 bp) for our sample from the major <it>D. virilis </it>chromosomes and 405 bp (&#177; 110 bp) for the corresponding regions of <it>D. melanogaster</it>. <it>D. virilis </it>dot chromosome genes in our sample have an average intron length of 890 bp (&#177; 179 bp); in homologous regions of the <it>D. melanogaster </it>genome, it is 859 bp (&#177; 115 bp). Figure <figr fid="F5">5</figr> shows a graph that compares the intron size cumulative distribution functions of the dot chromosomes with the major chromosomes. Due to the non-normal distribution of intron sizes, the non-parametric Kolmogorov-Smirnov (KS) test is used to evaluate the statistical significance in the pairwise comparisons. The KS test indicates that the difference in the distribution of intron sizes between the two dot chromosomes is not statistically significant (D = 0.1237, <it>p</it> = 0.2816). However, the distribution of intron sizes for the dot chromosomes is significantly different from those for the major chromosomes for both species (D = 0.223, <it>p</it> = 0.0496 and D = 0.245, <it>p</it> = 0.0291 for <it>D. virilis </it>and <it>D. melanogaster</it>, respectively).</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Distribution of intron sizes in <it>D. virilis </it>compared to <it>D. melanogaster</it></p>
					</caption>
					<text>
						<p>Distribution of intron sizes in <it>D. virilis </it>compared to <it>D. melanogaster</it>. Introns from all <it>D. virilis </it>and <it>D. melanogaster </it>genes in the contigs studied were separated into groups based on size. The number on the <it>x </it>axis represents the minimal intron size; an intron is counted in that bin if it has that many bases or fewer. The <it>y </it>axis tallies the percent of total introns that fall into that bin. The two dot chromosomes have significantly similar intron size distributions, which differ significantly from those of the major chromosome arms.</p>
					</text>
					<graphic file="gb-2006-7-2-r15-5" hint_layout="double"/>
				</fig>
				<p>Percent DNA transcribed, defined as primary transcript length over total sequence length, is more similar between the homologous chromosomes than between the dot chromosomes and the major chromosomes. (In this instance, 5' and 3' untranslated regions (UTRs) were not scored in calculations of percent DNA transcribed, as these regions could not be identified in the putative <it>D. virilis </it>genes.) The sequenced regions of the <it>D. virilis </it>and comparable regions of the <it>D. melanogaster </it>dot chromosomes have transcript densities of 58.7% and 51.0%, respectively, while transcript densities of the major chromosomes are 22.2% for <it>D. virilis </it>and 25.9% for <it>D. melanogaster</it>. The difference in percent DNA transcribed between the dot and non-dot contigs reflects the larger average size of introns in the dot chromosome genes.</p>
			</sec>
			<sec>
				<st>
					<p>(dC-dA)&#183;(dG-dT) dinucleotide repeat frequency</p>
				</st>
				<p>One marker of euchromatin is the presence of abundant (dC-dA)&#183;(dG-dT) dinucleotide repeats, also known as CA/GT repeats. <it>In situ </it>hybridization shows that these repeats are widely distributed in euchromatin, but that the dot chromosome of <it>D. melanogaster </it>has a much lower density of these repeats <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The dot chromosome of <it>D. virilis </it>has a CA/GT repeat frequency similar to its major autosomes, as shown by <it>in situ </it>hybridization <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Dinucleotide repeat analysis of the sequences from the <it>D. virilis </it>fosmids in comparison with the homologous regions of the <it>D. melanogaster </it>genome supports the <it>in situ </it>hybridization results. The fosmids from the dot chromosome of <it>D. virilis </it>have CA/GT repeats with an average length of 36 bp and a total density of 0.15%. Regions of the <it>D. melanogaster </it>dot chromosome homologous to these fosmids have only one CA/GT repeat, which is 21 bp long, giving a total CA/GT density of 0.0069%. In the <it>D. virilis </it>clones mapping to major chromosomes, 0.96% of the DNA is made up of CA/GT, with the average repeat being 32 bp long. In homologous regions of the <it>D. melanogaster </it>genome, 0.32% of the DNA is CA/GT, with the average length of dinucleotide regions being 24 bp. Thus, while the <it>D. virilis </it>dot chromosome has a lower level of CA/GT than the major chromosome arms (about six-fold less than <it>D. virilis </it>and about two-fold less than <it>D. melanogaster</it>), it has a approximately 20-fold higher level of this repeat than is found in the dot chromosome of <it>D. melanogaster</it>.</p>
			</sec>
			<sec>
				<st>
					<p>Repeat analysis</p>
				</st>
				<p>Initial analysis of known repetitive elements in the <it>D. virilis </it>contigs was performed using RepeatMasker <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. RepBase 8.12 <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> contains previously characterized repeats from the <it>D. virilis </it>species group. As a simple initial approach we searched for <it>de novo </it>repeats by comparing the fosmid sequences to each other, looking for regions of high similarity by BLASTN <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Most apparently novel repeated sequences identified by this technique were immediately adjacent to known repeats identified by RepeatMasker and were, therefore, assumed to be unmasked extensions of those repeats. A few novel repeats were identified that were not similar to any other known repetitive element, expressed sequence tag (EST), or protein sequence. Using this simple technique, novel repeats constituted less than 1% of the total repetitive DNA; however, given the small size of our dataset (0.65 Mb) it is possible that repetitive elements could be missed.</p>
				<p>Figure <figr fid="F6">6a</figr> shows the repeat density of different classes of repetitive elements in the <it>D. virilis </it>contigs and the comparable regions of the <it>D. melanogaster </it>genome using RepeatMasker/RepBase (<it>Drosophila </it>default parameters) plus this simple <it>de novo </it>BLASTN technique. While there is some variation in repeat density between the contigs of a given region (dot chromosome or major chromosome), the totals appear to represent an average value of the contigs studied. Using this analysis, the overall repeat density of the <it>D. virilis </it>dot chromosome contigs is 14.6%; the average of the individual repeat densities is 15.4% &#177; 7.9%. The overall repeat density of the homologous <it>D. melanogaster </it>regions is 25.3%; the average of the individual repeat densities is 24.7% &#177; 5.4%. Fosmids from the dot chromosome of <it>D. melanogaster </it>show a consistently higher density of DNA transposons and <it>DINE-1 </it>elements than do the fosmids from the dot chromosome of <it>D. virilis</it>. Comparison of the sample from the dot chromosome of <it>D. melanogaster </it>analyzed here to the entire banded portion of the dot chromosome (using RepeatMasker and RepBase 8.12) shows very similar results (Figure <figr fid="F6">6a</figr>). In contrast, the euchromatic arms of the large chromosomes of <it>D. melanogaster </it>and <it>D. virilis </it>have similar repeat densities, with approximately 6% of the sequence classified as repetitive. (Quesneville <it>et al</it>. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> estimate the total repeat density of <it>D. melanogaster </it>to be 5.3%.) Other repeat types differed between the two species as well. In our sample from these chromosome arms, <it>D. virilis </it>has more simple repeats and <it>D. melanogaster </it>has more retroelements. Overall, these results suggest that both the higher repeat density and the overrepresentation of DNA transposons contribute to heterochromatin formation on the <it>D. melanogaster </it>dot chromosome. However, because <it>D. virilis </it>is not as well studied as <it>D. melanogaster</it>, it is possible that this approach misses some uncharacterized repeats. To address this issue, we undertook several different strategies.</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Repeat analysis of <it>D. virilis </it>contigs compared to the <it>D. melanogaster </it>genome</p>
					</caption>
					<text>
						<p>Repeat analysis of <it>D. virilis </it>contigs compared to the <it>D. melanogaster </it>genome. The repeat density, defined as the percentage of total sequence (in base-pairs) that has been annotated as repetitive has been calculated using the <it>D. virilis </it>fosmid sequence obtained in this study and homologous regions from <it>D. melanogaster </it>(see Materials and methods). <it>D. melanogaster </it>and <it>D. virilis </it>have a very similar low repeat density on the major chromosome arms, and a similar but much higher repeat density on the dot chromosomes. <b>(a) </b>Percent repeat for each type identified by RepeatMasker using RebBase 8.12 with additional repeats identified in a BLASTN all-by-all comparison of the fosmid sequences presented here. <b>(b) </b>Percent repeat for each type identified by RepeatMasker using the Superlibrary (see text for description). The dot chromosome of <it>D. melanogaster </it>has about three times more DNA transposon sequence than does the <it>D. virilis </it>dot chromosome. 'Unknown' repeats are those from both RebBase 8.12 and the <it>D. virilis </it>PILER-DF library that have not been classified as to type.</p>
					</text>
					<graphic file="gb-2006-7-2-r15-6" hint_layout="double"/>
				</fig>
				<p>Recent investigations have developed multiple search tools for <it>de novo </it>identification of novel repetitive sequences in genome assemblies <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. Using such tools, we created a 'Superlibrary' in which we added sequences from species-specific libraries from both <it>D. melanogaster </it>and <it>D. virilis </it>to the RebBase 8.12 Drosophila transposable element (TE) library to generate a library with as little bias as possible. The additional repeats came from three sources. Two novel repetitive elements that were identified in <it>D. melanogaster </it>using the PILER-TR program were added <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. We also added a complete set of 66 elements from <it>D. virilis </it>identified by PILER-DF analysis (C Smith and G Karpen, personal communication) of the posted <it>D. virilis </it>whole genome assembly <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Finally, a recently identified sequence of <it>DINE-1 </it>from <it>D. yakuba </it>was added <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
				<p>All of the <it>D. virilis </it>and <it>D. melanogaster </it>sequences used in this study were then analyzed for repetitive DNA using RepeatMasker with this Superlibrary. This approach identified a total repeat density of the <it>D. virilis </it>contigs from the dot chromosome of 22.8%, while homologous regions of the <it>D. melanogaster </it>dot chromosome have 26.5% repetitive DNA (Figure <figr fid="F6">6b</figr>). Using the same Superlibrary, the segments from the major chromosomes of <it>D. virilis </it>have a total repeat density of 8.4%, compared to <it>D. melanogaster </it>major chromosomes, which have a density of 6.8%. This analysis shows that the overall density of repeats on the <it>D. virilis </it>and <it>D. melanogaster </it>dot chromosome fosmids is similar, and significantly higher than the density of repeats on the major chromosomes from either species. Other analysis techniques used to assess the difference between the <it>D. virilis </it>and <it>D. melanogaster </it>sequences, including a TBLASTX comparison using a RebBase 8.12 library from which invertebrate sequences had been removed <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B54">54</abbr></abbrgrp>, and a Repeat Scout library assembly <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, also showed little difference in the total amount of repetitive sequence found in the <it>D. virilis </it>and <it>D. melanogaster </it>dot sequences (not shown). Thus, all of the follow-up techniques applied indicate that the sequences from the dot chromosomes of both <it>D. virilis </it>and <it>D. melanogaster </it>are enriched for repetitive sequences compared to the sequences derived from the major chromosomes of both species. The analysis of each contig as well as the total representation of each type of repeat is presented in Table <tblr tid="T2">2</tblr> and in Figure <figr fid="F6">6b</figr>. The contrast between the results shown in Figure <figr fid="F6">6a</figr> and those shown in Figure <figr fid="F6">6b</figr> illustrates the problem posed by biased repeat libraries, an issue that must be carefully considered in studies of this type. The observation that three different analyses (discussed above) support the results shown in Figure <figr fid="F6">6b</figr> lends confidence to the conclusions derived here.</p>
				<tbl id="T2" hint_layout="double">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Repeat analysis of individual contigs from <it>D. virilis </it>compared to <it>D. melanogaster</it></p>
					</caption>
					<tblbdy cols="9">
						<r>
							<c ca="center">
								<p>Contig</p>
							</c>
							<c ca="center">
								<p>Size (bp)</p>
							</c>
							<c ca="center">
								<p><it>1360 </it>%</p>
							</c>
							<c ca="center">
								<p><it>DINE-1 </it>%</p>
							</c>
							<c ca="center">
								<p>DNA %</p>
							</c>
							<c ca="center">
								<p>Retro %</p>
							</c>
							<c ca="center">
								<p>Simple %</p>
							</c>
							<c ca="center">
								<p>Unk %</p>
							</c>
							<c ca="center">
								<p>Total %</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="9" ca="left">
								<p><it>Drosophila virilis </it>dot chromosome contigs</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>30</p>
							</c>
							<c ca="center">
								<p>61,074</p>
							</c>
							<c ca="center">
								<p>1.9</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>5.6</p>
							</c>
							<c ca="center">
								<p>15.1</p>
							</c>
							<c ca="center">
								<p>5.6</p>
							</c>
							<c ca="center">
								<p>1.8</p>
							</c>
							<c ca="center">
								<p>30.1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>50</p>
							</c>
							<c ca="center">
								<p>56,333</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>2.8</p>
							</c>
							<c ca="center">
								<p>5.3</p>
							</c>
							<c ca="center">
								<p>11.9</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>103</p>
							</c>
							<c ca="center">
								<p>39,850</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>7.8</p>
							</c>
							<c ca="center">
								<p>5.6</p>
							</c>
							<c ca="center">
								<p>5.7</p>
							</c>
							<c ca="center">
								<p>4.3</p>
							</c>
							<c ca="center">
								<p>23.3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>106</p>
							</c>
							<c ca="center">
								<p>40,734</p>
							</c>
							<c ca="center">
								<p>2.6</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>7.4</p>
							</c>
							<c ca="center">
								<p>4.0</p>
							</c>
							<c ca="center">
								<p>5.5</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
							<c ca="center">
								<p>20.1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>67</p>
							</c>
							<c ca="center">
								<p>54,154</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>3.8</p>
							</c>
							<c ca="center">
								<p>10.3</p>
							</c>
							<c ca="center">
								<p>6.0</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>20.3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="center">
								<p>37,265</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>5.8</p>
							</c>
							<c ca="center">
								<p>20.5</p>
							</c>
							<c ca="center">
								<p>6.2</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>33.0</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>289,410</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.8</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5.5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>9.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>2.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>22.8</b>
								</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>72</p>
							</c>
							<c ca="center">
								<p>43,948</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>0.4</p>
							</c>
							<c ca="center">
								<p>9.7</p>
							</c>
							<c ca="center">
								<p>1.4</p>
							</c>
							<c ca="center">
								<p>10.5</p>
							</c>
							<c ca="center">
								<p>1.1</p>
							</c>
							<c ca="center">
								<p>23.4</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>91</p>
							</c>
							<c ca="center">
								<p>39,292</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>9.2</p>
							</c>
							<c ca="center">
								<p>6.8</p>
							</c>
							<c ca="center">
								<p>0.5</p>
							</c>
							<c ca="center">
								<p>16.7</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="9" ca="left">
								<p><it>Drosophila melanogaster </it>dot chromosome contigs</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>30.1</p>
							</c>
							<c ca="center">
								<p>40,013</p>
							</c>
							<c ca="center">
								<p>2.0</p>
							</c>
							<c ca="center">
								<p>6.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>6.9</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>18.2</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>30.2</p>
							</c>
							<c ca="center">
								<p>23,681</p>
							</c>
							<c ca="center">
								<p>14.4</p>
							</c>
							<c ca="center">
								<p>6.7</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>25.6</p>
							</c>
							<c ca="center">
								<p>2.9</p>
							</c>
							<c ca="center">
								<p>4.6</p>
							</c>
							<c ca="center">
								<p>54.3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>50.1</p>
							</c>
							<c ca="center">
								<p>43,617</p>
							</c>
							<c ca="center">
								<p>10.0</p>
							</c>
							<c ca="center">
								<p>4.2</p>
							</c>
							<c ca="center">
								<p>5.4</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>2.4</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>22.0</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>50.2</p>
							</c>
							<c ca="center">
								<p>34,148</p>
							</c>
							<c ca="center">
								<p>4.8</p>
							</c>
							<c ca="center">
								<p>9.3</p>
							</c>
							<c ca="center">
								<p>12.1</p>
							</c>
							<c ca="center">
								<p>4.7</p>
							</c>
							<c ca="center">
								<p>2.9</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>34.0</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>103</p>
							</c>
							<c ca="center">
								<p>40,295</p>
							</c>
							<c ca="center">
								<p>5.4</p>
							</c>
							<c ca="center">
								<p>14.6</p>
							</c>
							<c ca="center">
								<p>6.1</p>
							</c>
							<c ca="center">
								<p>0.7</p>
							</c>
							<c ca="center">
								<p>2.7</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>29.8</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>106</p>
							</c>
							<c ca="center">
								<p>38,285</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>21.8</p>
							</c>
							<c ca="center">
								<p>0.8</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>2.0</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>28.1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>67</p>
							</c>
							<c ca="center">
								<p>44,243</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>8.1</p>
							</c>
							<c ca="center">
								<p>4.6</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>16.7</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>113</p>
							</c>
							<c ca="center">
								<p>36,538</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>13.0</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>6.2</p>
							</c>
							<c ca="center">
								<p>2.7</p>
							</c>
							<c ca="center">
								<p>0.4</p>
							</c>
							<c ca="center">
								<p>22.4</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>300,820</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>4.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>10.5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>3.8</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>4.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>3.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>26.5</b>
								</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>Entire 4</it>
									<sup>th</sup>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>1,237,870</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>3.8</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>9.2</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>3.0</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>9.5</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>2.9</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>0.0</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>28.4</it>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="9" ca="left">
								<p><it>Drosophila virilis </it>contigs from major chromosomes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>40,479</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.4</p>
							</c>
							<c ca="center">
								<p>1.0</p>
							</c>
							<c ca="center">
								<p>8.6</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>10.3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>112</p>
							</c>
							<c ca="center">
								<p>34,783</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.7</p>
							</c>
							<c ca="center">
								<p>17.8</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>18.6</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>121</p>
							</c>
							<c ca="center">
								<p>47,154</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.4</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.4</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>122</p>
							</c>
							<c ca="center">
								<p>41,111</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>3.1</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.8</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>163,527</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>7.7</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>8.4</b>
								</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>40,809</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>7.6</p>
							</c>
							<c ca="center">
								<p>0.3</p>
							</c>
							<c ca="center">
								<p>8.3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="center">
								<p>68,774</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.1</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
							<c ca="center">
								<p>4.4</p>
							</c>
							<c ca="center">
								<p>0.6</p>
							</c>
							<c ca="center">
								<p>6.0</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="9" ca="left">
								<p><it>Drosophila melanogaster </it>contigs from major chromosomes</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>42,664</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>4.1</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>7.8</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>112</p>
							</c>
							<c ca="center">
								<p>29,021</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.2</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>3.9</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>121</p>
							</c>
							<c ca="center">
								<p>46,784</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>1.1</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>1.1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>122</p>
							</c>
							<c ca="center">
								<p>55,801</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>10.6</p>
							</c>
							<c ca="center">
								<p>1.6</p>
							</c>
							<c ca="center">
								<p>0.0</p>
							</c>
							<c ca="center">
								<p>12.2</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Totals</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>174,270</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>4.4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>2.3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>0.0</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>6.8</b>
								</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Individual contigs from <it>D. virilis </it>are shown in each row with the homologous region from <it>D. melanogaster </it>shown below. Total size of the contig is shown in bp, followed by percentage of the contig made up of DNA transposons (separated into <it>1360</it>, <it>DINE-1 </it>and all other DNA transposons), retroelements, simple repeats, and repeats of unknown type (see Materials and methods). Total repeats in the last column is for the contig in that row, and the bold row at the bottom of each group represents the totals for that group. Rows below the bold row represent <it>D. virilis </it>contigs where a homologous region from <it>D. melanogaster </it>could not be defined, so they were not used in the calculations of total repeat density (see Materials and methods).</p>
					</tblfn>
				</tbl>
				<p>While the overall density of repetitious elements is similar, there is a major difference in the density of DNA transposons (Table <tblr tid="T2">2</tblr>). Of the <it>D. melanogaster </it>dot chromosome DNA from our sample, 18.6% consists of remnants of DNA transposons, including sequences from <it>1360 </it>elements, <it>P </it>elements (artifacts and related fragments), <it>Tc1 </it>elements and <it>DINE-1</it>. Only 6.4% of these regions from the dot chromosome of <it>D. virilis </it>consists of remnants of DNA transposons, about a three-fold reduction. The bulk of the repetitive sequence in the <it>D. virilis </it>dot fosmids tentatively classified as DNA transposons are the dvir.16.2 centroid and the dvir.16.17 centroid, sequences identified in the PILER-DF analysis. Table <tblr tid="T3">3</tblr> shows the repeat element and class of the most common repeats in the <it>D. virilis </it>and <it>D. melanogaster </it>dot chromosome contigs studied here, as identified by RepeatMasker/Superlibrary. DNA transposon families are preferentially represented in <it>D. melanogaster</it>, while retroelements (LINEs and LTRs) are more common in <it>D. virilis</it>. Examination of the quantitative results in Table <tblr tid="T2">2</tblr> suggests that the dot chromosome of <it>D. virilis </it>has an increase in retroelements (9.1%) in comparison with homologous regions of <it>D. melanogaster </it>(4.2%). However, this difference appears to be due to sample bias, as RepeatMasker/RebBase 8.12 classifies 8.7% of the whole <it>D. melanogaster </it>dot chromosome as retroelements.</p>
				<tbl id="T3" hint_layout="double">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Frequency of individual repetitive elements in <it>D. virilis</it></p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="left">
								<p>Repeat element</p>
							</c>
							<c ca="left">
								<p>Class</p>
							</c>
							<c ca="center">
								<p>Number</p>
							</c>
							<c ca="center">
								<p>Average size (bp)</p>
							</c>
							<c ca="center">
								<p>Largest size (bp)</p>
							</c>
							<c ca="center">
								<p>Percent of total repeats</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="6" ca="left">
								<p>Ten most frequent repeat types (<it>D. virilis </it>dot)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.16.2.centroid</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>94</p>
							</c>
							<c ca="center">
								<p>178</p>
							</c>
							<c ca="center">
								<p>1,071</p>
							</c>
							<c ca="center">
								<p>23.4</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>Penelope</it>
								</p>
							</c>
							<c ca="left">
								<p>LINE</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
							<c ca="center">
								<p>199</p>
							</c>
							<c ca="center">
								<p>427</p>
							</c>
							<c ca="center">
								<p>9.8</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.2.37.centroid</p>
							</c>
							<c ca="left">
								<p>LTR</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>6,832</p>
							</c>
							<c ca="center">
								<p>6,832</p>
							</c>
							<c ca="center">
								<p>9.6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.0.85.centroid</p>
							</c>
							<c ca="left">
								<p>LINE</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>764</p>
							</c>
							<c ca="center">
								<p>3,711</p>
							</c>
							<c ca="center">
								<p>8.6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.11.33.centroid</p>
							</c>
							<c ca="left">
								<p>Tandem Repeat</p>
							</c>
							<c ca="center">
								<p>31</p>
							</c>
							<c ca="center">
								<p>191</p>
							</c>
							<c ca="center">
								<p>615</p>
							</c>
							<c ca="center">
								<p>8.3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.5.67.centroid</p>
							</c>
							<c ca="left">
								<p>LTR</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>5,406</p>
							</c>
							<c ca="center">
								<p>5,406</p>
							</c>
							<c ca="center">
								<p>7.6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.7.88.centroid</p>
							</c>
							<c ca="left">
								<p>LINE</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>3,370</p>
							</c>
							<c ca="center">
								<p>3,370</p>
							</c>
							<c ca="center">
								<p>4.7</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.16.17.centroid</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>15</p>
							</c>
							<c ca="center">
								<p>207</p>
							</c>
							<c ca="center">
								<p>739</p>
							</c>
							<c ca="center">
								<p>4.3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.11.23.centroid</p>
							</c>
							<c ca="left">
								<p>Tandem Repeat</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>186</p>
							</c>
							<c ca="center">
								<p>430</p>
							</c>
							<c ca="center">
								<p>3.4</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>dvir.22.25.centroid</p>
							</c>
							<c ca="left">
								<p>Unknown</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>391</p>
							</c>
							<c ca="center">
								<p>669</p>
							</c>
							<c ca="center">
								<p>3.3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Other</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>16.9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="6" ca="left">
								<p>Ten most frequent repeat types (<it>D. melanogaster </it>dot)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DNAREP1_DM (<it>DINE-1</it>)</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>126</p>
							</c>
							<c ca="center">
								<p>201</p>
							</c>
							<c ca="center">
								<p>631</p>
							</c>
							<c ca="center">
								<p>34.5</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>yakuba_cons (<it>DINE-1</it>)</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>54</p>
							</c>
							<c ca="center">
								<p>170</p>
							</c>
							<c ca="center">
								<p>722</p>
							</c>
							<c ca="center">
								<p>12.4</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>FB4_DM</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>630</p>
							</c>
							<c ca="center">
								<p>1,215</p>
							</c>
							<c ca="center">
								<p>6.8</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PROTOP</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>938</p>
							</c>
							<c ca="center">
								<p>3,615</p>
							</c>
							<c ca="center">
								<p>6.4</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PROTOP_A (<it>1360</it>)</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>518</p>
							</c>
							<c ca="center">
								<p>1,111</p>
							</c>
							<c ca="center">
								<p>6.3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DMCR1A</p>
							</c>
							<c ca="left">
								<p>LINE</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>528</p>
							</c>
							<c ca="center">
								<p>1,253</p>
							</c>
							<c ca="center">
								<p>5.7</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PROTOP_B</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>221</p>
							</c>
							<c ca="center">
								<p>1,083</p>
							</c>
							<c ca="center">
								<p>4.2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TC1_DM (<it>Tc1</it>)</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>343</p>
							</c>
							<c ca="center">
								<p>990</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TART</p>
							</c>
							<c ca="left">
								<p>LINE</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>520</p>
							</c>
							<c ca="center">
								<p>663</p>
							</c>
							<c ca="center">
								<p>3.5</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BARI1_DM (<it>Bari</it>)</p>
							</c>
							<c ca="left">
								<p>DNA</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>1,728</p>
							</c>
							<c ca="center">
								<p>1,728</p>
							</c>
							<c ca="center">
								<p>2.3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Other</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>14.0</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p><it>DINE-1</it>, also known as <it>DNAREP-1 </it>or <it>INE-1</it>, is a repetitive element that is very common in the genome of <it>D. melanogaster </it><abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The density of <it>DINE-1 </it>elements is especially high on the dot chromosome of <it>D. melanogaster</it>, more so than on the major chromosome arms or on the dot chromosome of <it>D. virilis </it><abbrgrp><abbr bid="B17">17</abbr><abbr bid="B56">56</abbr></abbrgrp>. Using our Superlibrary and repeat identification process, RepeatMasker identifies 0.1% of the <it>D. virilis </it>contigs as sequences with significant similarity to <it>DINE-1 </it>elements, while in the homologous regions of the <it>D. melanogaster </it>dot the density is 10.5%. (The entire <it>D. melanogaster </it>dot has a 9.2% incidence of <it>DINE-1 </it>elements, assessed using RepeatMasker/Superlibrary.) There has been considerable debate as to the origin of <it>DINE-1 </it>elements <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. Kapitonov and Jurka <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> have recently suggested that <it>DINE-1 </it>is a retrotransposon based on homology to a <it>D. virilis Penelope </it>GenBank accession, but sequences with homology to <it>DINE-1 </it>in this accession fall outside of the canonical <it>Penelope </it>sequence <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> (C Bergman, personal communication). Analysis of <it>DINE-1 </it>elements in <it>D. yakuba </it>suggests a relatively recent burst of transposition in that species. A consensus sequence based on these recent <it>DINE-1 </it>elements contains no long terminal repeats nor a poly-A tail (suggestive of a retroelement), but does have a terminal 12 bp perfect repeat, a characteristic of transposons <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Thus, while we have provided separate statistics for this class, we consider <it>DINE-1 </it>elements to be DNA transposon remnants. Separate statistics are also provided in Table <tblr tid="T2">2</tblr> for the <it>1360 </it>DNA transposon fragments, as this class is of particular interest as a potential target for heterochromatin formation, as discussed above. Again, this family is significantly enriched in <it>D. melanogaster </it>dot chromosome fosmids, making up 4.1% of the sample DNA, in comparison to 0.8% in <it>D. virilis </it>dot chromosome fosmids and 0% in the samples from the major chromosome arms.</p>
				<p>Transposable elements are much more prevalent in the introns of heterochromatic genes than in the introns of euchromatic genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>; this may contribute to the evolution and structure of genes in heterochromatin. Maintaining a focus on total repeat density (and not repeat type), we analyzed the introns of all of the contigs with a repeat database generated by combining the RepeatScout output from both the <it>D. melanogaster </it>and the <it>D. virilis </it>whole genome assemblies. Using RepeatMasker with this library (omitting low complexity and simple repeats), one finds that introns of the <it>D. virilis </it>dot chromosome genes studied here contain 27.0% repetitive elements, while in homologous regions of the <it>D. melanogaster </it>dot, 33.1 % of the introns are made up of repetitive elements. Analysis of the contigs from the major chromosomes of <it>D. virilis </it>and the homologous regions from <it>D. melanogaster </it>did not find any recognizable transposable elements in the introns. Thus the two dot chromosomes are in this respect more similar to each other than they are to the major chromosomes from either species.</p>
				<p>Comparing Figures <figr fid="F6">6a</figr> and <figr fid="F6">6b</figr>, it is apparent that the two repeat-finding strategies represented gave very different results. <it>D. melanogaster </it>and <it>D. virilis </it>are fairly close together phylogenetically, but use of the previously defined RepBase library, which has good representation of <it>D. melanogaster </it>repeats, was insufficient to find all of the <it>D. virilis </it>repeats, particularly on the dot chromosome. This result stresses the importance of using techniques such as PILER to find species-specific repeats as new species are sequenced, even when repeat sequences are available from a well-studied nearby species. Relying on existing repeat databases can lead to erroneously low estimates of repeat content.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<sec>
				<st>
					<p>The dot chromosomes of <it>D. melanogaster </it>and <it>D. virilis </it>differ in the density of DNA transposons</p>
				</st>
				<p>While one of the conspicuous characteristics of pericentric heterochromatin is a low gene density, previous sequence analysis has shown that the heterochromatic <it>D. melanogaster </it>dot chromosome resembles euchromatic domains in this regard, having a gene density (number of genes per Mb) similar to the long arms of the major autosomes <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B19">19</abbr></abbrgrp>. Interestingly, the <it>D. melanogaster </it>dot chromosome does have an approximately two-fold higher percentage of DNA transcribed (percentage of DNA between the start sites and stop sites for transcription) than the major chromosomes, due primarily to longer introns in the dot chromosome genes. Introns of dot chromosome genes of both species examined here were longer than introns from the major chromosomes (Figure <figr fid="F5">5</figr>), apparently reflecting the higher repeat content of the dot chromosomes (see above). Thus, the heterochromatic <it>D. melanogaster </it>and the euchromatic <it>D. virilis </it>dot chromosomes are very similar to each other in gene density, percent DNA transcribed, and gene/intron size, suggesting that these parameters are not critical in determining chromatin packaging decisions.</p>
				<p>Total repeat density (percentage of the DNA in repetitive sequences) for the <it>D. virilis </it>dot chromosome fosmids has a value of 22.8%, while the homologous regions of the dot chromosome of <it>D. melanogaster </it>have a total density of 26.5%. Kaminker <it>et al</it>. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> analyzed the distribution of transposable elements (not including simple or tandem repeats) in the <it>D. melanogaster </it>genome. This analysis indicated that the number of repetitive elements per Mb is five to ten times higher on the dot chromosome than in the rest of the sequenced genome, which includes very little heterochromatin. The repeat analysis of our region of study agrees with the whole chromosome results from Kaminker <it>et al</it>. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> in that the level of repetitive elements (predominantly partial or dead TEs) shows a large difference (about three- to four-fold) between the dot chromosomes and the major chromosomes. Our analysis reported here shows that there is only a small difference in the total repeat density on the heterochromatic <it>D. melanogaster </it>dot chromosome and on the euchromatic <it>D. virilis </it>dot chromosome. This finding suggests that the higher density of repetitive elements probably does not play a deciding role in driving the heterochromatic packaging of the dot chromosome in <it>D. melanogaster</it>. This does not preclude the possibility that high repeat densities are a necessary precondition for heterochromatin formation, but argues that a high repeat density is not sufficient in and of itself to drive formation of heterochromatin.</p>
				<p>This analysis rather focuses attention on the high level of DNA transposons found in the <it>D. melanogaster </it>dot chromosome, but lacking in the <it>D. virilis </it>dot chromosome. Prominent elements of this type in <it>D. melanogaster </it>include <it>1360 </it>(aka <it>hoppel </it>or PROTOP_A) and <it>DINE-1</it>. It has previously been suggested that <it>DINE-1 </it>might contribute to heterochromatin packaging on the dot chromosome of <it>D. melanogaster </it><abbrgrp><abbr bid="B17">17</abbr><abbr bid="B56">56</abbr></abbrgrp>. In our computational analysis, we found that sequences homologous to <it>DINE-1 </it>were also present on the dot chromosome of <it>D. virilis</it>, but at a much lower concentration (0.1%) compared to <it>D. melanogaster </it>(10.5%), in agreement with the <it>in situ </it>hybridization analysis previously reported <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Our computer homology searches and analysis by others <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> indicates that portions of the <it>DINE-1 </it>element found in RebBase 8.12 show high similarity to a genomic fragment containing <it>Penelope </it>elements from <it>D. virilis</it>, but this similarity falls outside of the region defined by Evgen'ev <it>et al</it>. to be required for <it>Penelope </it>activity <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B58">58</abbr></abbrgrp> (C Bergman, personal communication). Thus, in our analysis, <it>DINE-1 </it>has been treated as a DNA transposon type, but has been reported separately in Figure <figr fid="F6">6</figr> for clarity. <it>DINE-1 </it>is absent from the major chromosome arms in this sample. In contrast, <it>Penelope </it>elements are very common in the dot chromosome (approximately 9%), and are present in the major chromosomes of <it>D. virilis </it>(0.4%). Retrotransposons such as <it>Penelope </it>are important in determining chromosome rearrangements <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, but have not been associated with heterochromatin formation.</p>
				<p>It has been suggested that the buildup of repetitive elements on the dot chromosome may be due in part to the lack of recombination <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B60">60</abbr></abbrgrp>. However, we find an overabundance of repetitive sequences on the dot chromosomes of both <it>D. melanogaster </it>and <it>D. virilis</it>. Recombination does occur on the dot chromosome of <it>D. virilis</it>, albeit at a lower rate <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. This observation suggests that there may be a selective advantage in maintaining a higher than average density of repetitive sequences (and larger than average genes) in this small chromosome, regardless of the chromatin packaging status.</p>
			</sec>
			<sec>
				<st>
					<p>DNA transposons may be targets for heterochromatin formation</p>
				</st>
				<p>Work by Sun <it>et al</it>. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> suggests a particular DNA transposon that may act as an initiator of heterochromatin formation on the dot chromosome of <it>D. melanogaster</it>. If a <it>white </it>reporter <it>P </it>element insertion site is within 10 kb of a <it>1360 </it>element on the dot chromosome, there is a high probability of a variegating phenotype <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Hence remnants of this DNA transposon may serve as a <it>cis</it>-acting determinant of heterochromatin formation on the dot chromosome of <it>D. melanogaster</it>, presumably acting as targets of an RNAi-directed process <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> analogous to that reported in <it>S. pombe </it><abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
				<p><it>1360 </it>elements are fragments of a DNA transposon that has been recognized in many studies to have a high concentration on the dot chromosome and in the pericentric heterochromatin <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr></abbrgrp>. Coelho <it>et al</it>. <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> studied <it>1360 </it>in many different strains of <it>D. melanogaster </it>and found that the association with heterochromatin is very consistent, again suggesting that <it>1360 </it>elements play an important role in the structure and function of heterochromatin. Given a lack of introns, it has been suggested that <it>1360 </it>elements are derivatives of a retrotranscription event <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>, but because this element has terminal inverted repeats at its ends and encodes a transposase with similarity to the <it>P </it>enzyme <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B61">61</abbr></abbrgrp>, it is likely to function as a DNA transposon. <it>1360 </it>may be a very recent invader of the <it>D. melanogaster </it>genome <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>; some differences in insertion sites are observed in different stocks of <it>D. melanogaster </it><abbrgrp><abbr bid="B24">24</abbr><abbr bid="B60">60</abbr></abbrgrp>. The origin of RNAi is thought to be as a silencing mechanism for retroviral genome invaders or transposons with multiple exact copies. Thus, it is possible that recent invaders are most likely to be targets of RNAi-induced heterochromatin formation.</p>
				<p>Some regions of the <it>D. melanogaster </it>dot chromosome have significantly lower density of DNA transposons other than <it>DINE-1 </it>than the chromosome as a whole, particularly contigs 30.1, 106, and 67 (Table <tblr tid="T2">2</tblr>). The density of <it>DINE-1 </it>elements in these contigs is similar to that in the rest of the dot chromosome, but the level of other DNA transposons is less than 2%. Interestingly, these regions appear to be euchromatic domains of the dot chromosome of <it>D. melanogaster </it>as shown by a <it>white </it>reporter. The region around contig 30.1 is within 25 kb of an insertion site for a <it>P </it>element with full red eye expression, and the region around contig 67 is within 20 kb of six <it>P </it>elements with full red eye expression <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. This suggests that the local density of repetitive elements other than <it>DINE-1 </it>may be important in driving changes in chromatin structure, or that another factor might be countering the influence of <it>DINE-1 </it>in these regions.</p>
				<p>Why might DNA transposons be a preferred target for heterochromatin formation? DNA transposons contain inverted repeats at each end that facilitate their mobilization within the genome; these could intrinsically lead to dsRNA if both ends are transcribed. This has been reported in <it>C. elegans </it><abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, but we are not aware of any similar reports for <it>D. melanogaster</it>. Using a <it>P </it>element reporter, Dorer and Henikoff <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> showed that DNA transposon mobilization events can lead to tandem and inverse duplications of the <it>P </it>element. These tandem arrays can lead to heterochromatin formation and silencing of a reporter gene within the <it>P </it>element construct. It is also possible that an endogenous transposable element could be present as inverted copies in an intron; transcription could then produce hairpins that could be targeted by the RNAi machinery for degradation and result in heterochromatin-mediated transcriptional gene silencing. (Fragments of LINE elements in inverted orientations in the introns of mammalian genes have been found to be a source of miRNAs <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>.) Thus, the mode of mobilization might generate configurations of DNA transposon sequence that make these elements a preferred target for RNAi-directed gene silencing and heterochromatin formation.</p>
				<p>Our computational analysis has shown that inverted fragments of TEs can readily be found in introns in both species. Screening for inverted repeats (IRs; using RepeatMasker/RebBase 8.12) located within a single intron and within 100 bp of each other revealed 87 copies of inverted TEs (81 <it>DINE-1</it>, 2 <it>1360 </it>elements, 4 <it>S2_DM </it>elements) within the <it>D. melanogaster </it>genome and three (all <it>Penelope</it>) within our <it>D. virilis </it>fosmids. Some of these candidates are predicted to form stable hairpin structures by mfold <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Among these hairpin candidates are the <it>1360 </it>IR found in an intron of <it>Caps </it>on the <it>D. melanogaster </it>dot chromosome, and the <it>Penelope </it>IR found in an intron of <it>toy </it>on the <it>D. virilis </it>dot chromosome. Transcription through these loci could create hairpin structures that might be subsequently processed by the Drosha and Dicer machinery to produce short dsRNA, leading to initiation of heterochromatin formation. Hence the potential exists for both DNA transposons and retroelements to act as targets for RNAi-directed gene silencing and heterochromatin formation; why the former appears to be favored is not clear.</p>
				<p>Empirically, the absence of heterochromatin formation on the <it>D. virilis </it>dot chromosome appears not to be related to a lower density of repetitive elements in this genomic domain, but may be a consequence of the low density of DNA tranposons. Gene density, percent DNA transcribed, and size of introns do not seem to be critical discriminatory factors. However, a higher frequency of CA/GT repeats on the <it>D. virilis </it>dot chromosome is associated with euchromatic chromatin packaging. No mechanism has been suggested to explain the significance of this correlation. Other differences in primary sequence and density of particular transposable elements between the dot chromosomes of these two species, as yet unidentified, could also play a role. Genomic data is forthcoming on many <it>Drosophila </it>species <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. A comparison of several heterochromatic dot chromosomes with several euchromatic dot chromosomes will no doubt further elucidate the basis of heterochromatin formation in <it>Drosophila</it>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Chromosome staining</p>
				</st>
				<p>Immunofluorescent staining of polytene chromosomes from <it>D. melanogaster </it>(Oregon R) and <it>D. virilis </it>(Tucson strain 15010-1001.10) third instar larvae was carried out as described <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. HP1 antibody was monoclonal mouse C1A9 antibody (cell supernatant) used at a dilution of 1:1. Rabbit antibody for histone H3 methylated at lysine 9 was from Upstate Biotechnology (Lake Placid, NY, USA) used at a dilution of 1:25. Secondary antibodies were labeled with Alexa fluor 488 (Molecular Probes, Eugene, OR, USA) for goat anti-mouse and Alexa fluor 594 (Molecular Probes) for goat anti-rabbit, both used at a dilution of 1:400.</p>
			</sec>
			<sec>
				<st>
					<p>Identification of fosmids from the dot chromosome of <it>D. virilis</it></p>
				</st>
				<p>Coding sequences from all genes from the dot chromosome of <it>D. melanogaster </it>were used in BLASTN <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> searches against the <it>D. pseudoobscura </it>NCBI trace archive <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Similar regions were entered into Block Maker <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> to find regions of highest conservation; PCR primers were designed around these regions using CODEHOP <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. Criteria for inclusion were a length of at least 200 bp, with 80% homology between <it>D. melanogaster </it>and <it>D. pseudoobscura</it>, with screening to the <it>D. melanogaster </it>genome to ensure that the region did not contain any repetitive elements. PCR was performed using <it>D. melanogaster </it>genomic DNA as the template; the PCR products were subsequently labeled and used to probe the BDVIF01 library (of Tucson strain 15010-1001.10) originally described in Bergman <it>et al</it>. <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, now available spotted on a single filter from BACPAC <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. Hybridizations and washes were performed at low stringency. Positive clones were verified using Southern blots, slot blots, and restriction mapping. Some of the recovered clones mapped to chromosomes other than the <it>D. virilis </it>dot; these appear to have been identified by cross hybridization resulting from low stringency hybridization and washing conditions.</p>
			</sec>
			<sec>
				<st>
					<p><it>In situ </it>hybridization of fosmids to polytene chromosomes</p>
				</st>
				<p><it>In situ </it>hybridization probes were digoxygenin-dUTP labeled fosmid inserts. Hybridizations were performed on polytene chromosomes of third instar larvae of <it>D. virilis </it>(Tucson strain 15010-1001.10) as described by Casacuberta and Pardue <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Sequencing of <it>D. virilis </it>fosmid clones</p>
				</st>
				<p><it>D. virilis </it>fosmid DNA was prepared by streaking the glycerol stocks onto selective media agar plates, picking three isolated colonies and preparing a mini-prep of DNA from each. Mini-prep DNA was digested using <it>Hin</it>dIII and analyzed by agarose gel electrophoresis to compare restriction patterns to those obtained initially from the clones. Colonies verified by the restriction pattern were then inoculated into 200 ml of liquid media and grown in culture. Large-scale fosmid DNA isolation was performed from these cultures. The DNA was then sheared with the Hydroshear&#8482; (manufactured by Genomic Solutions, Ann Arbor, MI, USA) to a nominal size range of 3 to 4 kb, end-repaired, and separated on an agarose sizing gel. The 3 to 4 kb band was excised from the gel and purified by phenol:chloroform extraction. The sheared insert DNA was subcloned into pZero2.1 (Invitrogen, Frederick, MD, USA), electroporated into DH10B cells, and the cells plated onto solid media. For each fosmid project, 768 subclones were picked into glycerol-containing media and archive stocks were grown for 24 hours at 37&#176;C. Of these, 384 subclones were processed through the Genome Sequencing Center production sequencing pipeline, including magnetic bead-based DNA purification, dual end sequencing with Big Dye version 3.1 terminator chemistry (Applied Biosystems Inc., Foster City, CA, USA), and analysis on ABI 3730xl sequencers. Sequence assembly was performed using <it>phred</it>, <it>phrap</it>, and <it>consed </it>to design finishing strategies <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. All fosmids were finished to the same quality standard as used for the human genome <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. Sequences were confirmed by comparison of <it>in silico </it>digests of the finished sequence to restriction digest patterns of the purified fosmid DNA for at least two separate restriction enzymes. The nucleotide sequences and predicted protein sequences reported here have been submitted to GenBank (accession numbers <ext-link ext-link-type="gen" ext-link-id="DQ378280">DQ378280</ext-link>-<ext-link ext-link-type="gen" ext-link-id="DQ378293">DQ378293</ext-link>).</p>
			</sec>
			<sec>
				<st>
					<p>Curation strategy</p>
				</st>
				<p>Gene sequences were initially identified by similarity in BLASTALL <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, comparing finished <it>D. virilis </it>fosmids to <it>D. melanogaster </it>protein, EST, cDNA and genomic sequences in GenBank. <it>D. virilis </it>fosmids were also compared to <it>D. pseudoobscura </it>genomic sequence <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> using BLASTN and TBLASTN <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Intron-exon boundaries were determined by visual inspection of coding matches, aided by Genscan <abbrgrp><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr></abbrgrp>, in conjunction with other annotated features. BLAT <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> was also used in this process, comparing intron-exon boundaries of predicted coding sequence across <it>D. melanogaster</it>, <it>D. yakuba </it>and <it>D. pseudoobscura</it>. Known protein-coding sequences, exon boundaries and intron sizes were obtained from Ensembl <abbrgrp><abbr bid="B76">76</abbr></abbrgrp> and Flybase <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B77">77</abbr></abbrgrp>.</p>
				<p><it>D. virilis </it>retroelements, DNA transposons, low-complexity and simple repeat sequences were masked by RepeatMasker <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> using default parameters for <it>Drosophila </it>sequences. The RepeatMasker version used was from 03/06/04 with crossmatch version 0.990329 (RebBase update 8.12). Repeats unique to the <it>D. virilis </it>fosmids were predicted by BLASTN of each fosmid against a database of all the <it>D. virilis </it>sequences obtained, excluding regions that had significant <it>D. melanogaster </it>EST and protein BLASTALL matches.</p>
				<p>The Superlibrary was created by combining repeats identified by the Drosophila TE library in RepBase 8.12, novel repeats identified using PILER-DF on the <it>D. virilis </it>dvirAra08 assembly <abbrgrp><abbr bid="B78">78</abbr></abbrgrp> (C Smith and G Karpen, personal communication), novel <it>D. melanogaster </it>repeats identified using PILER-TR <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> and the <it>D. yakuba DINE-1 </it>element <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. RepeatMasker was used with the Superlibrary to identify the portion of each fosmid or equivalent region that contained repetitive sequences.</p>
				<p>RepeatScout (1.0.1) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> was run with default parameters against the dmelWGS2 and dvirAra08 assemblies to generate the RepeatScout libraries for <it>D. melanogaster </it>and <it>D. virilis</it>. Tandem repeats and simple repeats in each RepeatScout library were removed using trf <abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr></abbrgrp> and nseg <abbrgrp><abbr bid="B81">81</abbr><abbr bid="B82">82</abbr></abbrgrp>. The results were then combined to create a custom <it>Drosophila </it>RepeatScout library.</p>
			</sec>
			<sec>
				<st>
					<p>Sequence comparison</p>
				</st>
				<p>Sequence comparisons were made of <it>D. virilis </it>fosmids with corresponding regions from the <it>D. melanogaster </it>(Release 3.2) genome, as determined by a set of reproducible guidelines involving syntenic features and equivalent flanking sequence. If a region of non-coding sequence from <it>D. virilis </it>was interrupted in the corresponding <it>D. melanogaster </it>sequence by an annotated gene, then only sequence up to that gene was included in the comparison. Where non-coding sequence flanked a corresponding gene in <it>D. melanogaster</it>, with no other identifying features that would indicate a loss of synteny, the extracted sequence extended an equal number of bases from the gene as seen in <it>D. virilis</it>. Several of the <it>D. virilis </it>fosmids (contigs 11, 72, 80, and 91) had no appreciable synteny with the <it>D. melanogaster </it>genome and were, therefore, not included in repeat comparisons between the two species. The gene density and intron size calculations take into account all fosmids.</p>
			</sec>
			<sec>
				<st>
					<p>Kolmogorov-Smirnov test</p>
				</st>
				<p>The KS test is a non-parametric, distribution-free statistical test that can determine if two datasets differ significantly. It produces a D statistic that represents the maximum difference of the two distributions in an empirical cumulative distribution plot, allowing one to accept or reject a hypothesis that the datasets are from the same distribution. Statistical analysis was done using program R from the R Foundation for Statistical Computing <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>CA/GT repeat analysis</p>
				</st>
				<p>BLASTN <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> was used to search the contigs for CA/GT repeats. Blast databases containing either all of the <it>D. virilis </it>sequence obtained here or all of the corresponding regions from <it>D. melanogaster </it>were constructed using formatdb with default parameters. The databases were searched with a sequence of 100 (CA) repeats using BLASTALL default parameters except that low complexity filtering was turned off and the (E)xpect value was set to 0.1. The location of all hits was analyzed to remove any duplicate hits prior to assignment to either the dot or major chromosomes.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> provides maps of each fosmid from <it>D. virilis </it>and the homologous regions from <it>D. melanogaster </it>(if available) showing the genes and identified repetitive elements for dot chromosome sequences. Additional data file <supplr sid="S2">2</supplr> provides maps of each fosmid from <it>D. virilis </it>and the homologous regions from <it>D. melanogaster </it>(if available) showing the genes and identified repetitive elements for non-dot chromosome sequences. Additional data file <supplr sid="S3">3</supplr> provides a fasta file pre-formatted for use with RepeatMasker containing the PILER-DF identified repeats from the <it>D. virilis </it>assembly dvirAra08, the <it>D. yakuba DINE-1 </it>element, and the PILER-TR identified novel repeats from <it>D. melanogaster</it>, which were added to RepBase 8.12 <it>Drosophila </it>TE library to generate the Superlibrary used to analyze repeats.</p>
			<suppl id="S1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>Maps of each fosmid from <it>Drosophila virilis </it>and the homologous regions from <it>Drosophila melanogaster</it>, dot chromosome sequences</p>
				</caption>
				<text>
					<p>Maps of each fosmid from <it>Drosophila virilis </it>and the homologous regions from <it>Drosophila melanogaster </it>(if available) showing the genes and identified repetitive elements for dot chromosome sequences</p>
				</text>
				<file name="gb-2006-7-2-r15-S1.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional data file 2</p>
				</title>
				<caption>
					<p>Maps of each fosmid from <it>D. virilis </it>and the homologous regions from <it>D. melanogaster</it>, non-dot chromosome sequnces</p>
				</caption>
				<text>
					<p>Maps of each fosmid from <it>D. virilis </it>and the homologous regions from <it>D. melanogaster </it>(if available) showing the genes and identified repetitive elements for non-dot chromosome sequences</p>
				</text>
				<file name="gb-2006-7-2-r15-S2.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional data file 3</p>
				</title>
				<caption>
					<p>PILER-DF identified repeats from the <it>D. virilis </it>assembly dvirAra08, the <it>D. yakuba DINE-1 </it>element, and the PILER-TR identified novel repeats from <it>D. melanogaster</it></p>
				</caption>
				<text>
					<p>A fasta file pre-formatted for use with RepeatMasker containing the PILER-DF identified repeats from the <it>D. virilis </it>assembly dvirAra08, the <it>D. yakuba DINE-1 </it>element, and the PILER-TR identified novel repeats from <it>D. melanogaster</it>, which were added to RepBase 8.12 <it>Drosophila </it>TE library to generate the Superlibrary used to analyze repeats</p>
				</text>
				<file name="gb-2006-7-2-r15-S3.txt">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We would like to thank the Washington University Genome Sequencing Center for generating raw sequences and providing training and support for members of Bio 4342. Ginger Fewell and Catrina Fronick coordinated library construction and sequencing, Darren O'Brien directed wet lab work, while Cynthia Madsen-Strong, J Phillip Latreille, Charlene Pearman, and Joelle Viezer provided training and support in the use of <it>consed</it>. Michael Brent and Mani Arumugam aided in annotation and analysis of fosmid sequences. We would like to thank Alan Templeton for his help in the statistical analysis of intron size and Casey Bergman (University of Manchester, Manchester, UK) for review and advice on repetitive sequence analysis. We would also like to thank Chris Smith and Gary Karpen (Lawrence Berkeley National Lab, Berkeley, CA, USA) for access to and use of their <it>D. virilis </it>whole genome PILER-DF library and HP Yang (National Yang-Ming University, Taipei, Taiwan, ROC) for making available her analysis of <it>D. yakuba DINE-1 </it>sequence prior to publication. Members of the Elgin lab contributed throughout the process with criticism and suggestions. This work was supported primarily by Grant #52003904 from the Howard Hughes Medical Institute to Washington University (for SCRE) with additional funding from NIH grant GM050315 to MLP and NIH grant GM068388 to SCRE.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Heterochromatin: new possibilities for the inheritance of structure.</p>
				</title>
				<aug>
					<au>
						<snm>Grewal</snm>
						<fnm>SI</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Curr Opin Genet Dev</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>178</fpage>
				<lpage>187</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-437X(02)00284-8</pubid>
						<pubid idtype="pmpid" link="fulltext">11893491</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Das Heterochromatin der Moose.</p>
				</title>
				<aug>
					<au>
						<snm>Heitz</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Jahrbucher fur Wissenschaftliche Botanik</source>
				<pubdate>1928</pubdate>
				<volume>69</volume>
				<fpage>762</fpage>
				<lpage>818</lpage>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Heterochromatic sequences in a <it>Drosophila </it>whole-genome shotgun assembly.</p>
				</title>
				<aug>
					<au>
						<snm>Hoskins</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Carlson</snm>
						<fnm>JW</fnm>
					</au>
					<au>
						<snm>Carvalho</snm>
						<fnm>AB</fnm>
					</au>
					<au>
						<snm>Halpern</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kaminker</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Kennedy</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Mungall</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Sullivan</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Sutton</snm>
						<fnm>GG</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>R1</fpage>
				<lpage>0085</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2002-3-12-research0085</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Colonization of heterochromatic genes by transposable elements in <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Dimitri</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Junakovic</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Arca</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2003</pubdate>
				<volume>20</volume>
				<fpage>503</fpage>
				<lpage>512</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msg048</pubid>
						<pubid idtype="pmpid" link="fulltext">12654931</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>(dC-dA)n.(dG-dT)n sequences have evolutionarily conserved chromosomal locations in <it>Drosophila </it>with implications for roles in chromosome structure and function.</p>
				</title>
				<aug>
					<au>
						<snm>Pardue</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Lowenhaupt</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Rich</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Nordheim</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>EMBO J</source>
				<pubdate>1987</pubdate>
				<volume>6</volume>
				<fpage>1781</fpage>
				<lpage>1789</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">553555</pubid>
						<pubid idtype="pmpid">3111846</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Epigenetic codes for heterochromatin formation and silencing: rounding up the usual suspects.</p>
				</title>
				<aug>
					<au>
						<snm>Richards</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2002</pubdate>
				<volume>108</volume>
				<fpage>489</fpage>
				<lpage>500</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(02)00644-X</pubid>
						<pubid idtype="pmpid" link="fulltext">11909520</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Identification of a nonhistone chromosomal protein associated with heterochromatin in <it>Drosophila melanogaster </it>and its gene.</p>
				</title>
				<aug>
					<au>
						<snm>James</snm>
						<fnm>TC</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1986</pubdate>
				<volume>6</volume>
				<fpage>3862</fpage>
				<lpage>3872</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">367149</pubid>
						<pubid idtype="pmpid">3099166</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>The HP1 protein family: getting a grip on chromatin.</p>
				</title>
				<aug>
					<au>
						<snm>Eissenberg</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Curr Opin Genet Dev</source>
				<pubdate>2000</pubdate>
				<volume>10</volume>
				<fpage>204</fpage>
				<lpage>210</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-437X(00)00058-7</pubid>
						<pubid idtype="pmpid" link="fulltext">10753776</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Central role of <it>Drosophila </it>SU(VAR)3-9 in histone H3-K9 methylation and heterochromatic gene silencing.</p>
				</title>
				<aug>
					<au>
						<snm>Schotta</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Ebert</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Krauss</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Fischer</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hoffmann</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Rea</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Jenuwein</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Dorn</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Reuter</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>EMBO J</source>
				<pubdate>2002</pubdate>
				<volume>21</volume>
				<fpage>1121</fpage>
				<lpage>1131</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">125909</pubid>
						<pubid idtype="pmpid" link="fulltext">11867540</pubid>
						<pubid idtype="doi">10.1093/emboj/21.5.1121</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain.</p>
				</title>
				<aug>
					<au>
						<snm>Bannister</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Zegerman</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Partridge</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Miska</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Thomas</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Allshire</snm>
						<fnm>RC</fnm>
					</au>
					<au>
						<snm>Kouzarides</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>410</volume>
				<fpage>120</fpage>
				<lpage>124</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35065138</pubid>
						<pubid idtype="pmpid" link="fulltext">11242054</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Mutation in a heterochromatin-specific chromosomal protein is associated with suppression of position-effect variegation in <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Eissenberg</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>James</snm>
						<fnm>TC</fnm>
					</au>
					<au>
						<snm>Foster-Hartnett</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Hartnett</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Ngan</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1990</pubdate>
				<volume>87</volume>
				<fpage>9923</fpage>
				<lpage>9927</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">55286</pubid>
						<pubid idtype="pmpid" link="fulltext">2124708</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Annotation of the <it>Drosophila melanogaster </it>euchromatic genome: a systematic review.</p>
				</title>
				<aug>
					<au>
						<snm>Misra</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Crosby</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Mungall</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Matthews</snm>
						<fnm>BB</fnm>
					</au>
					<au>
						<snm>Campbell</snm>
						<fnm>KS</fnm>
					</au>
					<au>
						<snm>Hradecky</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kaminker</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Millburn</snm>
						<fnm>GH</fnm>
					</au>
					<au>
						<snm>Prochnik</snm>
						<fnm>SE</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>R3</volume>
				<fpage>1</fpage>
				<lpage>0083</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2002-3-12-research0083</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>FlyBase: genes and gene models.</p>
				</title>
				<aug>
					<au>
						<snm>Drysdale</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Crosby</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<issue>Database</issue>
				<fpage>D390</fpage>
				<lpage>395</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540000</pubid>
						<pubid idtype="pmpid" link="fulltext">15608223</pubid>
						<pubid idtype="doi">10.1093/nar/gki046</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p><it>In vitro </it>study of the DNA replication patterns of somatic chromosomes of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Barigozzi</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Dolfini</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Fraccaro</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Raimondi</snm>
						<fnm>GR</fnm>
					</au>
					<au>
						<snm>Tiepolo</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Exp Cell Res</source>
				<pubdate>1966</pubdate>
				<volume>43</volume>
				<fpage>231</fpage>
				<lpage>234</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0014-4827(66)90399-5</pubid>
						<pubid idtype="pmpid">5915613</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>The mutants and linkage data of chromosome four of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Bridges</snm>
						<fnm>CB</fnm>
					</au>
				</aug>
				<source>Biol Zh</source>
				<pubdate>1935</pubdate>
				<volume>4</volume>
				<fpage>401</fpage>
				<lpage>420</lpage>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Microcloning reveals a high frequency of repetitive sequences characteristic of chromosome 4 and the beta-heterochromatin of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Miklos</snm>
						<fnm>GL</fnm>
					</au>
					<au>
						<snm>Yamamoto</snm>
						<fnm>MT</fnm>
					</au>
					<au>
						<snm>Davies</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Pirrotta</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1988</pubdate>
				<volume>85</volume>
				<fpage>2051</fpage>
				<lpage>2055</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">279926</pubid>
						<pubid idtype="pmpid" link="fulltext">3127823</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Analysis of two cosmid clones from chromosome 4 of Drosophila <it>melanogaster </it>reveals two new genes amid an unusual arrangement of repeated sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Locke</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Podemski</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Pilgrim</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Hodgetts</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1999</pubdate>
				<volume>9</volume>
				<fpage>137</fpage>
				<lpage>149</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">310724</pubid>
						<pubid idtype="pmpid" link="fulltext">10022978</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>The transposable elements of the <it>Drosophila melanogaster </it>euchromatin: a genomics perspective.</p>
				</title>
				<aug>
					<au>
						<snm>Kaminker</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Bergman</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Kronmiller</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Carlson</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Svirskas</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Patel</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Frise</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Wheeler</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>R1</fpage>
				<lpage>0084</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2002-3-12-research0084</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>On the abundance and distribution of transposable elements in the genome of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Bartolome</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Maside</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Charlesworth</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2002</pubdate>
				<volume>19</volume>
				<fpage>926</fpage>
				<lpage>937</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12032249</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Distribution patterns of HP1, a heterochromatin-associated nonhistone chromosomal protein of <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>James</snm>
						<fnm>TC</fnm>
					</au>
					<au>
						<snm>Eissenberg</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Craig</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Dietrich</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Hobson</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Eur J Cell Biol</source>
				<pubdate>1989</pubdate>
				<volume>50</volume>
				<fpage>170</fpage>
				<lpage>180</lpage>
				<xrefbib>
					<pubid idtype="pmpid">2515059</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Analyzing heterochromatin formation using chromosome 4 of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Haynes</snm>
						<fnm>KA</fnm>
					</au>
					<au>
						<snm>Leibovitch</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Rangwala</snm>
						<fnm>SH</fnm>
					</au>
					<au>
						<snm>Craig</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Cold Spring Harb Symp Quant Biol</source>
				<pubdate>2004</pubdate>
				<volume>69</volume>
				<fpage>267</fpage>
				<lpage>272</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/sqb.2004.69.267</pubid>
						<pubid idtype="pmpid" link="fulltext">16117658</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Position effect variegation in <it>Drosophila </it>is associated with an altered chromatin structure.</p>
				</title>
				<aug>
					<au>
						<snm>Wallrath</snm>
						<fnm>LL</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Genes Dev</source>
				<pubdate>1995</pubdate>
				<volume>9</volume>
				<fpage>1263</fpage>
				<lpage>1277</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7758950</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>The fourth chromosome of <it>Drosophila melanogaster </it>: interspersed euchromatic and heterochromatic domains.</p>
				</title>
				<aug>
					<au>
						<snm>Sun</snm>
						<fnm>FL</fnm>
					</au>
					<au>
						<snm>Cuaycong</snm>
						<fnm>MH</fnm>
					</au>
					<au>
						<snm>Craig</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Wallrath</snm>
						<fnm>LL</fnm>
					</au>
					<au>
						<snm>Locke</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2000</pubdate>
				<volume>97</volume>
				<fpage>5340</fpage>
				<lpage>5345</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">25830</pubid>
						<pubid idtype="pmpid" link="fulltext">10779561</pubid>
						<pubid idtype="doi">10.1073/pnas.090530797</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p><it>cis </it>-Acting determinants of heterochromatin formation on <it>Drosophila melanogaster </it>chromosome four.</p>
				</title>
				<aug>
					<au>
						<snm>Sun</snm>
						<fnm>FL</fnm>
					</au>
					<au>
						<snm>Haynes</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Simpson</snm>
						<fnm>CL</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>SD</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wuller</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Eissenberg</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>2004</pubdate>
				<volume>24</volume>
				<fpage>8210</fpage>
				<lpage>8220</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">515050</pubid>
						<pubid idtype="pmpid" link="fulltext">15340080</pubid>
						<pubid idtype="doi">10.1128/MCB.24.18.8210-8220.2004</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>RNAi-mediated pathways in the nucleus.</p>
				</title>
				<aug>
					<au>
						<snm>Matzke</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Birchler</snm>
						<fnm>JA</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>24</fpage>
				<lpage>35</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1500</pubid>
						<pubid idtype="pmpid" link="fulltext">15630419</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Heterochromatic silencing and HP1 localization in <it>Drosophila </it>are dependent on the RNAi machinery.</p>
				</title>
				<aug>
					<au>
						<snm>Pal-Bhadra</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Leibovitch</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Gandhi</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>Rao</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bhadra</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Birchler</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SC</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>303</volume>
				<fpage>669</fpage>
				<lpage>672</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1092653</pubid>
						<pubid idtype="pmpid" link="fulltext">14752161</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>The small RNA profile during <it>Drosophila melanogaster </it>development.</p>
				</title>
				<aug>
					<au>
						<snm>Aravin</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Lagos-Quintana</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Yalcin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Zavolan</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Marks</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Snyder</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Gaasterland</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Meyer</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tuschl</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Dev Cell</source>
				<pubdate>2003</pubdate>
				<volume>5</volume>
				<fpage>337</fpage>
				<lpage>350</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1534-5807(03)00228-4</pubid>
						<pubid idtype="pmpid" link="fulltext">12919683</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Overview of chromosomal evolution in the family Drosophilidae.</p>
				</title>
				<aug>
					<au>
						<snm>Clayton</snm>
						<fnm>FE</fnm>
					</au>
					<au>
						<snm>Guest</snm>
						<fnm>WC</fnm>
					</au>
				</aug>
				<source>The Genetics and Biology of Drosophila</source>
				<pubdate>1986</pubdate>
				<volume>3E</volume>
				<fpage>1</fpage>
				<lpage>38</lpage>
			</bibl>
			<bibl id="B29">
				<title>
					<p>The homologies of the chromosome elements in the genus <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Sturtevant</snm>
						<fnm>AH</fnm>
					</au>
					<au>
						<snm>Novitski</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>1941</pubdate>
				<volume>26</volume>
				<fpage>517</fpage>
				<lpage>538</lpage>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Cytological and linkage maps of <it>Drosophila virilis </it>chromosomes.</p>
				</title>
				<aug>
					<au>
						<snm>Gubenko</snm>
						<fnm>IS</fnm>
					</au>
					<au>
						<snm>Evgen'ev</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>Genetica</source>
				<pubdate>1984</pubdate>
				<volume>65</volume>
				<fpage>127</fpage>
				<lpage>139</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1007/BF00135277</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Whole arm inversions of chromosome 4 in <it>Drosophila </it>species.</p>
				</title>
				<aug>
					<au>
						<snm>Podemski</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Ferrer</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Locke</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>2001</pubdate>
				<volume>110</volume>
				<fpage>305</fpage>
				<lpage>312</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11534822</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Drosophila molecular phylogenies and their uses.</p>
				</title>
				<aug>
					<au>
						<snm>Powell</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>DeSalle</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Evol Biol</source>
				<pubdate>1995</pubdate>
				<volume>28</volume>
				<fpage>87</fpage>
				<lpage>138</lpage>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Nonrandom distribution of long mono- and dinucleotide repeats in <it>Drosophila </it>chromosomes: correlations with dosage compensation, heterochromatin, and recombination.</p>
				</title>
				<aug>
					<au>
						<snm>Lowenhaupt</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Rich</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Pardue</snm>
						<fnm>ML</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1989</pubdate>
				<volume>9</volume>
				<fpage>1173</fpage>
				<lpage>1182</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">362708</pubid>
						<pubid idtype="pmpid">2725493</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Mutants and crossing over in the dot-like chromosome of <it>Drosophila virilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Chino</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kikkawa</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>1933</pubdate>
				<volume>18</volume>
				<fpage>111</fpage>
				<lpage>116</lpage>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Surveying <it>Saccharomyces </it>genomes to identify functional elements by comparative DNA sequence analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Cliften</snm>
						<fnm>PF</fnm>
					</au>
					<au>
						<snm>Hillier</snm>
						<fnm>LW</fnm>
					</au>
					<au>
						<snm>Fulton</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Graves</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Miner</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>WR</fnm>
					</au>
					<au>
						<snm>Waterston</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Johnston</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2001</pubdate>
				<volume>11</volume>
				<fpage>1175</fpage>
				<lpage>1186</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.182901</pubid>
						<pubid idtype="pmpid" link="fulltext">11435399</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Assessing the impact of comparative genomic sequence data on the functional annotation of the <it>Drosophila </it>genome.</p>
				</title>
				<aug>
					<au>
						<snm>Bergman</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Pfeiffer</snm>
						<fnm>BD</fnm>
					</au>
					<au>
						<snm>Rincon-Limas</snm>
						<fnm>DE</fnm>
					</au>
					<au>
						<snm>Hoskins</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Gnirke</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Mungall</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Kronmiller</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Pacleb</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Park</snm>
						<fnm>S</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>R1</fpage>
				<lpage>0086</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2002-3-12-research0086</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Comparative genome sequencing of <it>Drosophila pseudoobscura </it>: chromosomal, gene, and <it>cis</it>-element evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Richards</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Bettencourt</snm>
						<fnm>BR</fnm>
					</au>
					<au>
						<snm>Hradecky</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Letovsky</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nielsen</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Hubisz</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Meisel</snm>
						<fnm>RP</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2005</pubdate>
				<volume>15</volume>
				<fpage>1</fpage>
				<lpage>18</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540289</pubid>
						<pubid idtype="pmpid" link="fulltext">15632085</pubid>
						<pubid idtype="doi">10.1101/gr.3059305</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>A whole-genome assembly of <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
					<au>
						<snm>Sutton</snm>
						<fnm>GG</fnm>
					</au>
					<au>
						<snm>Delcher</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Dew</snm>
						<fnm>IM</fnm>
					</au>
					<au>
						<snm>Fasulo</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Flanigan</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Kravitz</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Mobarry</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Reinert</snm>
						<fnm>KH</fnm>
					</au>
					<au>
						<snm>Remington</snm>
						<fnm>KA</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2000</pubdate>
				<volume>287</volume>
				<fpage>2196</fpage>
				<lpage>2204</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.287.5461.2196</pubid>
						<pubid idtype="pmpid" link="fulltext">10731133</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Genome structure and evolution in <it>Drosophila </it>: applications of the framework P1 map.</p>
				</title>
				<aug>
					<au>
						<snm>Hartl</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Nurminsky</snm>
						<fnm>DI</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Lozovskaya</snm>
						<fnm>ER</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1994</pubdate>
				<volume>91</volume>
				<fpage>6824</fpage>
				<lpage>6829</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">44290</pubid>
						<pubid idtype="pmpid" link="fulltext">8041703</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p><it>Drosophila </it>Genome Project at Baylor College of Medicine</p>
				</title>
				<url>http://www.hgsc.bcm.tmc.edu/projects/drosophila/</url>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Base-calling of automated sequencer traces using phred. I. Accuracy assessment.</p>
				</title>
				<aug>
					<au>
						<snm>Ewing</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Hillier</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wendl</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>175</fpage>
				<lpage>185</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9521921</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Base-calling of automated sequencer traces using phred. II. Error probabilities.</p>
				</title>
				<aug>
					<au>
						<snm>Ewing</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>186</fpage>
				<lpage>194</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9521922</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Consed: a graphical tool for sequence finishing.</p>
				</title>
				<aug>
					<au>
						<snm>Gordon</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Abajian</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>195</fpage>
				<lpage>202</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9521923</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>The organization and expression of the light gene, a heterochromatic gene of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Devlin</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Bingham</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Wakimoto</snm>
						<fnm>BT</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>1990</pubdate>
				<volume>125</volume>
				<fpage>129</fpage>
				<lpage>140</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">2111263</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>RepeatMasker</p>
				</title>
				<url>http://repeatmasker.org</url>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Repeats in genomic DNA: mining and meaning.</p>
				</title>
				<aug>
					<au>
						<snm>Jurka</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Curr Opin Struct Biol</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>333</fpage>
				<lpage>337</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-440X(98)80067-5</pubid>
						<pubid idtype="pmpid">9666329</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Repbase update: a database and an electronic journal of repetitive elements.</p>
				</title>
				<aug>
					<au>
						<snm>Jurka</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2000</pubdate>
				<volume>16</volume>
				<fpage>418</fpage>
				<lpage>420</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(00)02093-X</pubid>
						<pubid idtype="pmpid" link="fulltext">10973072</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Basic local alignment search tool.</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1990</pubdate>
				<volume>215</volume>
				<fpage>403</fpage>
				<lpage>410</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1990.9999</pubid>
						<pubid idtype="pmpid" link="fulltext">2231712</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Combined evidence annotation of transposable elements in genome sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Quesneville</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Bergman</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Andrieu</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Autard</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Nouaud</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Anxolabehere</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>PLoS Comput Biol</source>
				<pubdate>2005</pubdate>
				<volume>1</volume>
				<fpage>166</fpage>
				<lpage>175</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1185648</pubid>
						<pubid idtype="pmpid" link="fulltext">16110336</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>PILER: Identification and classification of genomic repeats.</p>
				</title>
				<aug>
					<au>
						<snm>Edgar</snm>
						<fnm>RC</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>Suppl 1</issue>
				<fpage>i152</fpage>
				<lpage>i158</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti1003</pubid>
						<pubid idtype="pmpid" link="fulltext">15961452</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p><it>De novo </it>identification of repeat families in large genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Price</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>NC</fnm>
					</au>
					<au>
						<snm>Pevzner</snm>
						<fnm>PA</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>Suppl 1</issue>
				<fpage>i351</fpage>
				<lpage>i358</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti1018</pubid>
						<pubid idtype="pmpid" link="fulltext">15961478</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Assembly/Alignment/Annotation of 12 related <it>Drosophila </it>species</p>
				</title>
				<url>http://rana.lbl.gov/drosophila</url>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Genome-wide comparative analysis of the highly abundant transposable element DINE-1 suggests a recent transpositional burst in <it>Drosophila yakuba</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Yang</snm>
						<fnm>HP</fnm>
					</au>
					<au>
						<snm>Hung</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>You</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>TH</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2005</pubdate>
				<note>doi:10.1534/genetics.105.051714</note>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Detection of new transposable element families in <it>Drosophila melanogaster </it>and <it>Anopheles gambiae </it>genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Quesneville</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Nouaud</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Anxolabehere</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>2003</pubdate>
				<volume>57</volume>
				<issue>Suppl 1</issue>
				<fpage>S50</fpage>
				<lpage>59</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00239-003-0007-2</pubid>
						<pubid idtype="pmpid" link="fulltext">15008403</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>Genomic heterogeneity of background substitutional patterns in <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Singh</snm>
						<fnm>ND</fnm>
					</au>
					<au>
						<snm>Arndt</snm>
						<fnm>PF</fnm>
					</au>
					<au>
						<snm>Petrov</snm>
						<fnm>DA</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2005</pubdate>
				<volume>169</volume>
				<fpage>709</fpage>
				<lpage>722</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1534/genetics.104.032250</pubid>
						<pubid idtype="pmpid" link="fulltext">15520267</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>The characterization of DINE-1, a short, interspersed repetitive element present on chromosome and in the centric heterochromatin of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Locke</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Howard</snm>
						<fnm>LT</fnm>
					</au>
					<au>
						<snm>Aippersbach</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Podemski</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Hodgetts</snm>
						<fnm>RB</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>1999</pubdate>
				<volume>108</volume>
				<fpage>356</fpage>
				<lpage>366</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s004120050387</pubid>
						<pubid idtype="pmpid" link="fulltext">10591995</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>Molecular paleontology of transposable elements in the <it>Drosophila melanogaster </it>genome.</p>
				</title>
				<aug>
					<au>
						<snm>Kapitonov</snm>
						<fnm>VV</fnm>
					</au>
					<au>
						<snm>Jurka</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>6569</fpage>
				<lpage>6574</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">164487</pubid>
						<pubid idtype="pmpid" link="fulltext">12743378</pubid>
						<pubid idtype="doi">10.1073/pnas.0732024100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B58">
				<title>
					<p>Penelope retroelements from <it>Drosophila virilis </it>are active after transformation of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Pyatkov</snm>
						<fnm>KI</fnm>
					</au>
					<au>
						<snm>Shostak</snm>
						<fnm>NG</fnm>
					</au>
					<au>
						<snm>Zelentsova</snm>
						<fnm>ES</fnm>
					</au>
					<au>
						<snm>Lyozin</snm>
						<fnm>GT</fnm>
					</au>
					<au>
						<snm>Melekhin</snm>
						<fnm>MI</fnm>
					</au>
					<au>
						<snm>Finnegan</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Kidwell</snm>
						<fnm>MG</fnm>
					</au>
					<au>
						<snm>Evgen'ev</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>16150</fpage>
				<lpage>16155</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">138580</pubid>
						<pubid idtype="pmpid" link="fulltext">12451171</pubid>
						<pubid idtype="doi">10.1073/pnas.252641799</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B59">
				<title>
					<p>Invasion of <it>Drosophila virilis </it>by the Penelope transposable element.</p>
				</title>
				<aug>
					<au>
						<snm>Evgen'ev</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Zelentsova</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Mnjoian</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Poluectova</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kidwell</snm>
						<fnm>MG</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>2000</pubdate>
				<volume>109</volume>
				<fpage>350</fpage>
				<lpage>357</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11007494</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B60">
				<title>
					<p>Pattern of chromosomal localization of the Hoppel transposable element family in the <it>Drosophila melanogaster </it>subgroup.</p>
				</title>
				<aug>
					<au>
						<snm>Coelho</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Queiroz-Machado</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hartl</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Sunkel</snm>
						<fnm>CE</fnm>
					</au>
				</aug>
				<source>Chromosome Res</source>
				<pubdate>1998</pubdate>
				<volume>6</volume>
				<fpage>385</fpage>
				<lpage>395</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1009277322626</pubid>
						<pubid idtype="pmpid" link="fulltext">9872668</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B61">
				<title>
					<p>Hoppel, a P-like element without introns: a P-element ancestral structure or a retrotranscription derivative?</p>
				</title>
				<aug>
					<au>
						<snm>Reiss</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Quesneville</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Nouaud</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Andrieu</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Anxolabehere</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2003</pubdate>
				<volume>20</volume>
				<fpage>869</fpage>
				<lpage>879</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msg090</pubid>
						<pubid idtype="pmpid" link="fulltext">12717001</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B62">
				<title>
					<p>Transposon silencing in the <it>Caenorhabditis elegans </it>germ line by natural RNAi.</p>
				</title>
				<aug>
					<au>
						<snm>Sijen</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Plasterk</snm>
						<fnm>RH</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2003</pubdate>
				<volume>426</volume>
				<fpage>310</fpage>
				<lpage>314</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02107</pubid>
						<pubid idtype="pmpid" link="fulltext">14628056</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B63">
				<title>
					<p>Expansions of transgene repeats cause heterochromatin formation and gene silencing in <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Dorer</snm>
						<fnm>DR</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>1994</pubdate>
				<volume>77</volume>
				<fpage>993</fpage>
				<lpage>1002</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0092-8674(94)90439-1</pubid>
						<pubid idtype="pmpid" link="fulltext">8020105</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B64">
				<title>
					<p>Mammalian microRNAs derived from genomic repeats.</p>
				</title>
				<aug>
					<au>
						<snm>Smalheiser</snm>
						<fnm>NR</fnm>
					</au>
					<au>
						<snm>Torvik</snm>
						<fnm>VI</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>322</fpage>
				<lpage>326</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2005.04.008</pubid>
						<pubid idtype="pmpid" link="fulltext">15922829</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B65">
				<title>
					<p>Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure.</p>
				</title>
				<aug>
					<au>
						<snm>Matthews</snm>
						<fnm>DH</fnm>
					</au>
					<au>
						<snm>Sabina</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zuker</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Turner</snm>
						<fnm>DH</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1999</pubdate>
				<volume>288</volume>
				<fpage>911</fpage>
				<lpage>940</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1999.2700</pubid>
						<pubid idtype="pmpid" link="fulltext">10329189</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B66">
				<title>
					<p>Immunofluorescent staining of polytene chromosomes: exploiting genetic tools.</p>
				</title>
				<aug>
					<au>
						<snm>Stephens</snm>
						<fnm>GE</fnm>
					</au>
					<au>
						<snm>Craig</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Wallrath</snm>
						<fnm>LL</fnm>
					</au>
					<au>
						<snm>Elgin</snm>
						<fnm>SCR</fnm>
					</au>
				</aug>
				<source>Methods Enzymol</source>
				<pubdate>2004</pubdate>
				<volume>376</volume>
				<fpage>372</fpage>
				<lpage>393</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14975319</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B67">
				<title>
					<p>Automated construction and graphical presentation of protein blocks from unaligned sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Alford</snm>
						<fnm>WJ</fnm>
					</au>
					<au>
						<snm>Pietrokovski</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>1995</pubdate>
				<volume>163</volume>
				<fpage>GC17</fpage>
				<lpage>26</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0378-1119(95)00486-P</pubid>
						<pubid idtype="pmpid" link="fulltext">7590261</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B68">
				<title>
					<p>Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Rose</snm>
						<fnm>TM</fnm>
					</au>
					<au>
						<snm>Schultz</snm>
						<fnm>ER</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Pietrokovski</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>McCallum</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1998</pubdate>
				<volume>26</volume>
				<fpage>1628</fpage>
				<lpage>1635</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">147464</pubid>
						<pubid idtype="pmpid" link="fulltext">9512532</pubid>
						<pubid idtype="doi">10.1093/nar/26.7.1628</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B69">
				<title>
					<p>BACPAC Center at CHORI</p>
				</title>
				<url>http://bacpac.chori.org</url>
			</bibl>
			<bibl id="B70">
				<title>
					<p>Coevolution of the telomeric retrotransposons across <it>Drosophila </it>species.</p>
				</title>
				<aug>
					<au>
						<snm>Casacuberta</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Pardue</snm>
						<fnm>ML</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2002</pubdate>
				<volume>161</volume>
				<fpage>1113</fpage>
				<lpage>1124</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12136015</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B71">
				<title>
					<p>Finishing the euchromatic sequence of the human genome.</p>
				</title>
				<aug>
					<au>
						<cnm>International Human Genome Sequencing Consortium</cnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>431</volume>
				<fpage>931</fpage>
				<lpage>945</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature03001</pubid>
						<pubid idtype="pmpid" link="fulltext">15496913</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B72">
				<title>
					<p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Madden</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Schaffer</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1997</pubdate>
				<volume>25</volume>
				<fpage>3389</fpage>
				<lpage>3402</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">146917</pubid>
						<pubid idtype="pmpid" link="fulltext">9254694</pubid>
						<pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B73">
				<title>
					<p>GENSCAN</p>
				</title>
				<url>http://genes.mit.edu/GENSCAN.html</url>
			</bibl>
			<bibl id="B74">
				<title>
					<p>Prediction of complete gene structures in human genomic DNA.</p>
				</title>
				<aug>
					<au>
						<snm>Burge</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Karlin</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1997</pubdate>
				<volume>268</volume>
				<fpage>78</fpage>
				<lpage>94</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1997.0951</pubid>
						<pubid idtype="pmpid" link="fulltext">9149143</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B75">
				<title>
					<p>BLAT - the BLAST-like alignment tool.</p>
				</title>
				<aug>
					<au>
						<snm>Kent</snm>
						<fnm>WJ</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>656</fpage>
				<lpage>664</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">187518</pubid>
						<pubid idtype="pmpid" link="fulltext">11932250</pubid>
						<pubid idtype="doi">10.1101/gr.229202. Article published online before March 2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B76">
				<title>
					<p>Ensembl</p>
				</title>
				<url>http://www.ensembl.org</url>
			</bibl>
			<bibl id="B77">
				<title>
					<p>Flybase</p>
				</title>
				<url>http://www.flybase.org</url>
			</bibl>
			<bibl id="B78">
				<title>
					<p><it>Drosophila </it>Heterochromatin Genome Project PILER-DF Libraries</p>
				</title>
				<url>ftp://ftp.dhgp.org/pub/DHGP/PILER-DF/</url>
			</bibl>
			<bibl id="B79">
				<title>
					<p>Tandem Repeat Finder (trf)</p>
				</title>
				<url>http://tandem.bu.edu/trf/trf.html</url>
			</bibl>
			<bibl id="B80">
				<title>
					<p>Tandem repeats finder: a program to analyze DNA sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Benson</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1999</pubdate>
				<volume>27</volume>
				<fpage>573</fpage>
				<lpage>580</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">148217</pubid>
						<pubid idtype="pmpid" link="fulltext">9862982</pubid>
						<pubid idtype="doi">10.1093/nar/27.2.573</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B81">
				<title>
					<p>Segment sequence(s) by local complexity (seg)</p>
				</title>
				<url>ftp://ftp.ncbi.nih.gov/pub/seg/nseg/</url>
			</bibl>
			<bibl id="B82">
				<title>
					<p>Statistics of local complexity in amino acid sequences and sequence databases.</p>
				</title>
				<aug>
					<au>
						<snm>Wooton</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Federhen</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Comput Chem</source>
				<pubdate>1993</pubdate>
				<volume>17</volume>
				<fpage>149</fpage>
				<lpage>163</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/0097-8485(93)85006-X</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B83">
				<title>
					<p>R project</p>
				</title>
				<url>http://www.r-project.org</url>
			</bibl>
		</refgrp>
	</bm>
</art>
