<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-6-r51</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Natural antisense transcripts with coding capacity in <it>Arabidopsis </it>may have a regulatory role that is not linked to double-stranded RNA degradation</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Jen</snm>
					<fnm>Chih-Hung</fnm>
					<insr iid="I1"/>
					<email>bmbcj@leeds.ac.uk</email>
				</au>
				<au id="A2">
					<snm>Michalopoulos</snm>
					<fnm>Ioannis</fnm>
					<insr iid="I1"/>
					<email>I.Michalopoulos@leeds.ac.uk</email>
				</au>
				<au id="A3">
					<snm>Westhead</snm>
					<mi>R</mi>
					<fnm>David</fnm>
					<insr iid="I1"/>
					<email>D.R.Westhead@leeds.ac.uk</email>
				</au>
				<au id="A4" ca="yes">
					<snm>Meyer</snm>
					<fnm>Peter</fnm>
					<insr iid="I2"/>
					<email>p.meyer@leeds.ac.uk</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>School of Biochemistry and Microbiology, University of Leeds, Leeds LS2 9JT, UK</p>
				</ins>
				<ins id="I2">
					<p>Centre for Plant Science, The University of Leeds, Leeds LS2 9JT, UK</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>6</issue>
			<fpage>R51</fpage>
			<url>http://genomebiology.com/2005/6/6/R51</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15960803</pubid><pubid idtype="doi">10.1186/gb-2005-6-6-r51</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>16</day>
					<month>12</month>
					<year>2004</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>9</day>
					<month>3</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>5</day>
					<month>5</month>
					<year>2005</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>1</day>
					<month>6</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>Jen et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>New regulatory role of natural antisense transcripts in <it>Arabidopsis</it></p>
		</shorttitle>
		<shortabs>
			<p>Transcription data analysis of overlapping gene pairs in <it>Arabidopsis thaliana </it>argues against a predominant RNA degradation effect induced by dsRNA formation. Instead, it suggests alternative roles for dsRNAs such as regulation of alternative splicing in polyadenylation.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Overlapping transcripts in antisense orientation have the potential to form double-stranded RNA (dsRNA), a substrate for a number of different RNA-modification pathways. One prominent route for dsRNA is its breakdown by Dicer enzyme complexes into small RNAs, a pathway that is widely exploited by RNA interference technology to inactivate defined genes in transgenic lines. The significance of this pathway for endogenous gene regulation remains unclear.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>We have examined transcription data for overlapping gene pairs in <it>Arabidopsis thaliana</it>. On the basis of an analysis of transcripts with coding regions, we find the majority of overlapping gene pairs to be convergently overlapping pairs (COPs), with the potential for dsRNA formation. In all tissues, COP transcripts are present at a higher frequency compared to the overall gene pool. The probability that both the sense and antisense copy of a COP are co-transcribed matches the theoretical value for coexpression under the assumption that the expression of one partner does not affect the expression of the other. Among COPs, we observe an over-representation of spliced (intron-containing) genes (90%) and of genes with alternatively spliced transcripts. For loci where antisense transcripts overlap with sense transcript introns, we also find a significant bias in favor of alternative splicing and variation of polyadenylation.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>The results argue against a predominant RNA degradation effect induced by dsRNA formation. Instead, our data support alternative roles for dsRNAs. They suggest that at least for a subgroup of COPs, antisense expression may induce alternative splicing or polyadenylation.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Genome-wide searches in the genomes of several species have identified a surprisingly high proportion of overlapping gene pairs. Depending on the sample sizes analyzed and search criteria, the frequencies for overlapping gene pairs vary between 4% and 9% for the human genome, 1.7%-14% in the murine genome, and up to 22% in the fly genome <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The predominant composition of overlapping gene pairs is an antiparallel convergent arrangement <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, where sense and antisense genes overlap within their 3' regions. Joint expression of both these genes in the same cell would allow the partly overlapping transcripts to associate as dsRNA molecules, which may interfere with RNA processing, transport, stability or other molecular mechanisms. Convergently overlapping gene pairs (COPs) can therefore provide the source for natural antisense transcripts (NATs) that may act as regulators of the sense gene. In addition to NATs being transcribed from the same locus as the sense transcript (<it>cis</it>-NATs), NATs can be transcribed from a different locus (<it>trans</it>-NATs), as illustrated by a search for overlapping transcripts with coding capacity in the human genome, which identified 87 <it>cis</it>-NATs and 80 <it>trans-</it>NATs <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
			<p>In bacteria, more than 100 NATs are involved in the regulation of a variety of biological functions, including the control of copy number, conjugation and post-segregational killing in plasmids, lysis/lysogeny switches in phages, and transposition frequency in transposons <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. In eukaryotes, a very detailed characterization of the molecular role of specific NATs has only been achieved for a few examples.</p>
			<p>NAT-mediated interference with splicing is illustrated by the alternative processing of mRNAs of the gene for the thyroid hormone receptor ErbA&#945;, which is regulated by an antisense transcript <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Overlapping genes can share a bidirectional poly(A) region as demonstrated for the human genes <it>ABHD1 </it>and <it>Sec12 </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Several examples document the fact that antisense transcripts can increase sense transcript stability, when dsRNA regions cover the 3' untranslated region (UTR) and possibly mask out target sequences for RNA cleavage <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Alternatively, RNA duplex formation can increase transcript sensitivity and induce site-specific cleavage, as shown for the human <it>TYMS </it>mRNA and <it>TRS </it>antisense transcripts <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
			<p>An example of RNA interference (RNAi)-based regulation of an endogenous gene via NATs is the repression of the testis-expressed <it>Stellate </it>gene in <it>Drosophila </it>by paralogous <it>Su(Ste) </it>tandem repeats <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Both strands of repressor <it>Su(Ste) </it>repeats are transcribed, producing sense and antisense RNA, most probably as part of a dsRNA-based silencing mechanism, as <it>Stellate </it>silencing is associated with the presence of short <it>Su(Ste) </it>RNAs. Antisense expression can also affect translation, as illustrated by the influence of an antisense transcript on the translation of different isoforms of fibroblast growth factor-2 (FGF2) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In the nucleus, dsRNA can be edited by dsRNA-dependent adenosine deaminases, which convert about 50% of adenosine residues into inosines, leading to the unwinding of the RNA duplex <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Inosine-containing RNAs are not translated as they are retained in the nucleus <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
			<p>In mice about 35% of overlapping genes transcribe noncoding RNA. Overlapping genes are scattered around the genome with no apparent bias. Overlaps range from 20 to 3,400 base-pairs (bp) with an average of 372 bp, as far as the quality of transcript annotation allows such predictions. There is some evidence for an over-representation of overlapping genes among specific functional categories, that is, imprinted genes and DNA repair genes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Twenty-two out of 58 known imprinted murine genes are transcribed from both strands. Frequently, one partner transcribes a noncoding RNA. Antisense transcripts may regulate imprinting states of the sense promoter (<it>Kcnq1</it>/<it>Kcnq1ot1</it>) or may induce dsRNA-based gene silencing as proposed for <it>Ifd2R</it>/<it>Air</it>. About 20% of known human DNA repair genes overlap either convergently or divergently in an antiparallel arrangement <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
			<p>Mammalian mRNAs that form sense-antisense pairs frequently exhibit reciprocal expression patterns, but permanent coexpression of sense and antisense transcripts can also occur in some tissues, although it is difficult to prove that both genes are transcribed in the same cell. Coexistence of sense and antisense transcripts may indicate a stabilizing effect of dsRNA, or it may depict cases where RNA duplex formation is impaired as a result of secondary structures, or because sense and antisense transcripts or the enzymes required for duplex formation are separated by compartmentalization <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
			<p>To gain an insight into the existence and role of overlapping antisense pairs in plants, we have screened the <it>Arabidopsis thaliana </it>genome for COPs with sense and antisense genes that encode a protein, and have compared the expression profile of the associated genes.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Overlapping gene pairs in the <it>Arabidopsis </it>genome</p>
				</st>
				<p>A screen of the <it>Arabidopsis </it>genome for protein-coding genes with overlapping orientations identified 1,083 groups containing a total of 2,147 overlapping genes. For 26 groups, the overlap involves three genes, and for 1,057 groups, two genes are arranged as overlapping pairs (Figure <figr fid="F1">1</figr> and Additional data file 1). The majority of overlapping gene pairs are organized as COPs. The size of the overlapping region for these 956 COPs varies between 1 and 2,820 bp, with an average length of 431 bp. The genes are scattered around all five chromosomes with no obvious clustering bias (data not shown). Among the 1,912 COP genes, we found nine transposable elements. This is in contrast to the presence of 2,372 transposons among the 30,624 <it>Arabidopsis </it>genes. Transposons are therefore strongly under-represented among COPs, therefore transposon-derived antiparallel gene pairs are under heavy selective pressure. Among the 1,912 genes of the 956 COPs, 954 genes have antisense transcripts that extend into the open reading frame (ORF) region of the sense transcript (Figure <figr fid="F2">a-c</figr> and Additional data file 2) but the ORF regions of the sense and antisense transcript overlap for only 13 COPs (Figure <figr fid="F2">2a</figr> and Table <tblr tid="T1">1</tblr>).</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>A comparison of the arrangements of overlapping gene pairs in <it>Arabidopsis thaliana</it></p>
					</caption>
					<text>
						<p>A comparison of the arrangements of overlapping gene pairs in <it>Arabidopsis thaliana</it>. A and A' label the start and end of the sense transcript, B' and B label the start and end of the antisense transcript. The total number of genes involved in group 1, 2 and 3 is 2,157, of which 2,147 are unique; the remaining 10 comprise four genes that are members of both group 1 and group 2 pairs, five genes that are members of both group 1 and group 3 pairs, and 1 gene that is a member of both a group 1 and group 3 pair.</p>
					</text>
					<graphic file="gb-2005-6-6-r51-1"/>
				</fig>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>The organization of convergent overlapping gene pairs with respect to the protein coding capacity of the sense and antisense transcripts</p>
					</caption>
					<text>
						<p>The organization of convergent overlapping gene pairs with respect to the protein coding capacity of the sense and antisense transcripts.</p>
					</text>
					<graphic file="gb-2005-6-6-r51-2"/>
				</fig>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>COPs with sense-antisense overlaps within the coding regions</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Sense gene ID</p>
							</c>
							<c ca="left">
								<p>Annotation</p>
							</c>
							<c ca="left">
								<p>Antisense gene ID</p>
							</c>
							<c ca="left">
								<p>Annotation</p>
							</c>
							<c ca="center">
								<p>ORF overlap (bp)</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT1G08260</p>
							</c>
							<c ca="left">
								<p>DNA-directed DNA polymerase epsilon catalytic subunit, putative</p>
							</c>
							<c ca="left">
								<p>AT1G08270</p>
							</c>
							<c ca="left">
								<p>Expressed protein</p>
							</c>
							<c ca="center">
								<p>45</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT1G52010</p>
							</c>
							<c ca="left">
								<p>Mutator-like transposase family</p>
							</c>
							<c ca="left">
								<p>AT1G52020</p>
							</c>
							<c ca="left">
								<p>Pseudogene, Ulp1 protease family</p>
							</c>
							<c ca="center">
								<p>44</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT1G52087</p>
							</c>
							<c ca="left">
								<p>Hypothetical protein</p>
							</c>
							<c ca="left">
								<p>AT1G52090</p>
							</c>
							<c ca="left">
								<p>Hypothetical protein</p>
							</c>
							<c ca="center">
								<p>72</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT1G68935</p>
							</c>
							<c ca="left">
								<p>Expressed protein</p>
							</c>
							<c ca="left">
								<p>AT1G68940</p>
							</c>
							<c ca="left">
								<p>Armadillo/beta-catenin repeat protein-related/U-box domain-containing protein</p>
							</c>
							<c ca="center">
								<p>698</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT2G12855</p>
							</c>
							<c ca="left">
								<p>Gypsy-like retrotransposon family</p>
							</c>
							<c ca="left">
								<p>AT2G12860</p>
							</c>
							<c ca="left">
								<p>Gypsy-like retrotransposon family</p>
							</c>
							<c ca="center">
								<p>116</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT2G19330</p>
							</c>
							<c ca="left">
								<p>Leucine-rich repeat family protein</p>
							</c>
							<c ca="left">
								<p>AT2G19340</p>
							</c>
							<c ca="left">
								<p>Membrane protein, putative</p>
							</c>
							<c ca="center">
								<p>141</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT3G59940</p>
							</c>
							<c ca="left">
								<p>Kelch repeat-containing F-box family protein</p>
							</c>
							<c ca="left">
								<p>AT3G59950</p>
							</c>
							<c ca="left">
								<p>Autophagy 4b (APG4b)</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT4G02200</p>
							</c>
							<c ca="left">
								<p>Drought-responsive family protein</p>
							</c>
							<c ca="left">
								<p>AT4G02210</p>
							</c>
							<c ca="left">
								<p>Expressed protein</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT4G21366</p>
							</c>
							<c ca="left">
								<p>S-locus protein kinase-related</p>
							</c>
							<c ca="left">
								<p>AT4G21370</p>
							</c>
							<c ca="left">
								<p>S-locus protein kinase, putative</p>
							</c>
							<c ca="center">
								<p>72</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT4G29830</p>
							</c>
							<c ca="left">
								<p>Transducin family protein/WD-40 repeat family protein</p>
							</c>
							<c ca="left">
								<p>AT4G29840</p>
							</c>
							<c ca="left">
								<p>Threonine synthase, chloroplast</p>
							</c>
							<c ca="center">
								<p>587</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT5G18210</p>
							</c>
							<c ca="left">
								<p>Short-chain dehydrogenase/reductase (SDR) family protein</p>
							</c>
							<c ca="left">
								<p>AT5G18220</p>
							</c>
							<c ca="left">
								<p>Glycosyl hydrolase family 17 protein</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT5G28232</p>
							</c>
							<c ca="left">
								<p>Mutator-like transposase family</p>
							</c>
							<c ca="left">
								<p>AT5G28235</p>
							</c>
							<c ca="left">
								<p>Ulp1 protease family protein</p>
							</c>
							<c ca="center">
								<p>29</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AT5G48200</p>
							</c>
							<c ca="left">
								<p>Hypothetical protein</p>
							</c>
							<c ca="left">
								<p>AT5G48205</p>
							</c>
							<c ca="left">
								<p>Hypothetical protein</p>
							</c>
							<c ca="center">
								<p>334</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>To examine the degree of sequence conservation for COPs sense and antisense genes, we used BLAST to search for homologs of each COPs gene. For a subset of 242 genes, we can define 89 homology groups with 2-11 members. The proteins encoded by the sense members of each group are at least 20% identical, with an E-value less than 0.001. An analysis of the degree of sequence conservation among family members showed very low conservation among the coding or promoter regions of the antisense partners of homologous sense genes. With the exception of the largest family, the promoter regions of homologous sense genes are also poorly conserved (Table <tblr tid="T2">2</tblr> and Additional data files 4, 5 and 6). With a few exceptions, possibly representing relatively recent duplications, the data indicate that homologous sense genes do not in general have homologous antisense partners.</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Homology assessment for 89 COPs families that contain 2-11 family members</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="left">
								<p>Number of COPs family</p>
							</c>
							<c ca="center">
								<p>Number of family members</p>
							</c>
							<c ca="center">
								<p>Sense-gene-encoded proteins with a similarity E-value &lt; 0.001</p>
							</c>
							<c ca="center">
								<p>1 kb sense promoter regions with a similarity E-value &lt; 0.001</p>
							</c>
							<c ca="center">
								<p>Antisense-gene-encoded proteins with a similarity E-value &lt; 0.001</p>
							</c>
							<c ca="center">
								<p>1 kb antisense promoter regions with a similarity E-value &lt; 0.001</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>2</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>3</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>4</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>5-6</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>7</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>8-9</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>10</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>11-12</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>13-14</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>15</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>16-21</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>22-72</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>73-75</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>76-77</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>78-89</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The numbers refer to the family members that share sequence similarity of an E-value below 0.001 with at least one other family member. Among the COPs families, the homology is well conserved among sense-gene-encoded proteins, while sequence conservation is rare among antisense-gene-encoded proteins. With the exception of family 1 sense gene promoters, the homology is also poorly conserved among promoter regions of sense and antisense genes.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>COPs gene-expression profiles</p>
				</st>
				<p>To analyze the transcriptional activity of COPs sense and antisense genes, we used the GSE636 annotated gene-expression database <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, which provides expression data for 1,866 COPs genes in suspension culture, in 7-day old seedlings, in roots and in flowers. If antisense arrangements are predominantly responsible for dsRNA-mediated transcript degradation, we would expect that a significant proportion of COPs genes would be under-represented among the transcript pool. For the total pool of 26,939 genes represented in the GSE636 database, we find a representation of 49.8-53.1% of these genes among the detectable transcripts (Table <tblr tid="T3">3</tblr>). Of the 1,866 COPs genes represented among this pool, 63.4-67.9% are expressed, which argues against a specific depletion of COPs transcripts in any of the four sample tissues.</p>
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Expression analysis of 1,866 COPs genes based on expression data from the GSE636 annotated gene-expression database</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Tissue</p>
							</c>
							<c ca="center">
								<p>% of expressed genes among 26,939 <it>Arabidopsis </it>genes</p>
							</c>
							<c ca="center">
								<p>% of expressed genes among 1,866 overlapping COPs genes</p>
							</c>
							<c ca="center">
								<p>% of COPs with jointly expressed sense and antisense genes (observed value)</p>
							</c>
							<c ca="center">
								<p>% of COPs with jointly expressed sense and antisense genes (expected value)</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Flowers</p>
							</c>
							<c ca="center">
								<p>52.5%</p>
							</c>
							<c ca="center">
								<p>67.9%</p>
							</c>
							<c ca="center">
								<p>45.6%</p>
							</c>
							<c ca="center">
								<p>46.1%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Roots</p>
							</c>
							<c ca="center">
								<p>51.4%</p>
							</c>
							<c ca="center">
								<p>63.4%</p>
							</c>
							<c ca="center">
								<p>38.5%</p>
							</c>
							<c ca="center">
								<p>40.2%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Suspension culture</p>
							</c>
							<c ca="center">
								<p>53.1%</p>
							</c>
							<c ca="center">
								<p>66.3%</p>
							</c>
							<c ca="center">
								<p>42.7%</p>
							</c>
							<c ca="center">
								<p>44.0%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>7 day old seedlings</p>
							</c>
							<c ca="center">
								<p>49.8%</p>
							</c>
							<c ca="center">
								<p>64.1%</p>
							</c>
							<c ca="center">
								<p>40.4%</p>
							</c>
							<c ca="center">
								<p>41.1%</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>This assumption is further supported by the lack of any bias against the joint expression of sense and antisense copies from the same COP. We can calculate a theoretical value for the joint expression of a sense and an antisense member of the same COP based on the representation of the COP genes in the transcriptional pool. If, for example, the probability that a COP gene is expressed in flowers is 67.9%, the probability that any two COP genes are jointly expressed is 67.9% &#215; 67.9%. The expected value of 46.1% matches the observed value of 45.6% determined for the joint expression of both genes of a COP (Table <tblr tid="T3">3</tblr>). We observe a similar match for the other tissues, which suggests that there is no bias against the joint expression of both COP partner genes. For about 20% of all COPs, both members are jointly expressed in all tissues tested.</p>
				<p>We also examined the microarray data provided by the Nottingham <it>Arabidopsis </it>Stock Centre (NASC) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Table <tblr tid="T4">4</tblr> compiles 21 Affymetrix ATH1 arrays for seven different <it>Arabidopsis </it>tissues based on three replicates for each assay. The datasets were retrieved by searching for BioSource_ID on <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
				<tbl id="T4">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>Expression of 1,596 COPs genes based on the NASC microarray database</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Tissue</p>
							</c>
							<c ca="center">
								<p>% of expressed genes among 22,746 <it>Arabidopsis </it>genes</p>
							</c>
							<c ca="center">
								<p>% of expressed genes among 1,596 overlapping COPs genes</p>
							</c>
							<c ca="center">
								<p>% of COPs with jointly expressed sense and antisense genes (observed value)</p>
							</c>
							<c ca="center">
								<p>% of COPs with jointly expressed sense and antisense genes (expected value)</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Flowers</p>
							</c>
							<c ca="center">
								<p>62.9%</p>
							</c>
							<c ca="center">
								<p>82.5%</p>
							</c>
							<c ca="center">
								<p>67.7%</p>
							</c>
							<c ca="center">
								<p>68.1%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Pollen</p>
							</c>
							<c ca="center">
								<p>31.7%</p>
							</c>
							<c ca="center">
								<p>36.0%</p>
							</c>
							<c ca="center">
								<p>12.3%</p>
							</c>
							<c ca="center">
								<p>13.0%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Seedlings, green parts</p>
							</c>
							<c ca="center">
								<p>57.3%</p>
							</c>
							<c ca="center">
								<p>76.7%</p>
							</c>
							<c ca="center">
								<p>57.8%</p>
							</c>
							<c ca="center">
								<p>58.8%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cotyledons</p>
							</c>
							<c ca="center">
								<p>55.7%</p>
							</c>
							<c ca="center">
								<p>75.3%</p>
							</c>
							<c ca="center">
								<p>56.1%</p>
							</c>
							<c ca="center">
								<p>56.7%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Leaves</p>
							</c>
							<c ca="center">
								<p>55.6%</p>
							</c>
							<c ca="center">
								<p>74.8%</p>
							</c>
							<c ca="center">
								<p>54.5%</p>
							</c>
							<c ca="center">
								<p>56.0%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Roots</p>
							</c>
							<c ca="center">
								<p>62.6%</p>
							</c>
							<c ca="center">
								<p>76.9%</p>
							</c>
							<c ca="center">
								<p>58.3%</p>
							</c>
							<c ca="center">
								<p>59.1%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Hypocotyl</p>
							</c>
							<c ca="center">
								<p>62.9%</p>
							</c>
							<c ca="center">
								<p>82.1%</p>
							</c>
							<c ca="center">
								<p>66.8%</p>
							</c>
							<c ca="center">
								<p>67.4%</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>Although the expression probabilities of both the total gene pool and the COPs gene pool differ significantly among individual tissues, the expected and observed values for the coexpression of COPs sense and antisense genes again match, reinforcing the lack of any indication of a transcript degradation mechanism (Table <tblr tid="T4">4</tblr>). To assess whether transcript degradation depends on a specific experimental condition, we assembled the expression data for all 1,437 microarrays that were available. For each microarray, we calculated the proportion of transcripts that were expressed, both for the total gene pool comprising 22,746 genes and for the 1,596 COPs genes represented in the total pool. A depletion of COPs-specific transcripts for any of the 1,437 microarrays should result in a significant reduction of the expressed COPs pool. We do not find a single case where the COPs genes are under-represented among the transcripts detectable in an array experiment. Compared to the transcriptional activity of the whole gene pool, the transcriptional activity of the COPs gene pool is between 1.008 and 1.485 times higher. This argues against a specific reduction in COPs gene activity under any of the experimental conditions used for the array experiments (Additional data file 7).</p>
			</sec>
			<sec>
				<st>
					<p>Indications for a role of antisense transcripts in sense transcript splicing</p>
				</st>
				<p>Among the 1,912 COPs genes we find a considerable bias for splicing. Of the COPs genes, 1,723 (90.1%) are spliced, while among a total of 30,624 <it>Arabidopsis </it>genes only 21,157 (69.1%) genes are spliced. If antisense transcripts played a role in sense transcript alternative splicing, we would expect an enrichment among COPs genes in alternative splicing. Out of the total pool of 30,624 genes, 7.6% encode more than one transcript, which are all alternatively spliced. This proportion rises to 14.5% among the 1,912 COPs genes (Table <tblr tid="T5">5</tblr>). This increase is even more pronounced in the rice genome, where it increases from 4.4% to 20.9% COPs.</p>
				<tbl id="T5">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>Representation of spliced genes among COPs, and correlation analysis for transcript modifications among these genes</p>
					</caption>
					<tblbdy cols="4">
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total genes</p>
							</c>
							<c ca="left">
								<p>COPs</p>
							</c>
							<c ca="left">
								<p><it>p</it>-value</p>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="4" ca="left">
								<p>COP genes show a strong positive bias for splicing</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Total</p>
							</c>
							<c ca="left">
								<p>30,624</p>
							</c>
							<c ca="left">
								<p>1,912</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Spliced</p>
							</c>
							<c ca="left">
								<p>21,157 (69.1%)</p>
							</c>
							<c ca="left">
								<p>1,723 (90.1%)</p>
							</c>
							<c ca="left">
								<p>4.7e-113</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="4" ca="left">
								<p>Spliced COP genes show a positive bias for alternative splicing</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Spliced</p>
							</c>
							<c ca="left">
								<p>21,157</p>
							</c>
							<c ca="left">
								<p>1,723</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternatively spliced</p>
							</c>
							<c ca="left">
								<p>2,331 (11.0%)</p>
							</c>
							<c ca="left">
								<p>268 (15.6%)</p>
							</c>
							<c ca="left">
								<p>1.3e-9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="4" ca="left">
								<p>Alternatively spliced COP genes do not show a significant bias for alternative splicing at the last intron, TSS variation or polyadenylation site variation</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternatively spliced</p>
							</c>
							<c ca="left">
								<p>2,331</p>
							</c>
							<c ca="left">
								<p>268</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Last intron alternative splicing</p>
							</c>
							<c ca="left">
								<p>1,662 (71.3%)</p>
							</c>
							<c ca="left">
								<p>195 (72.8%)</p>
							</c>
							<c ca="left">
								<p>0.31</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TSS variation</p>
							</c>
							<c ca="left">
								<p>1,424 (61%)</p>
							</c>
							<c ca="left">
								<p>158 (59.0%)</p>
							</c>
							<c ca="left">
								<p>0.24</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Polyadenylation site variation</p>
							</c>
							<c ca="left">
								<p>1,019 (43.7%)</p>
							</c>
							<c ca="left">
								<p>107 (39.9%)</p>
							</c>
							<c ca="left">
								<p>0.10</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>To assess if the over-representation of multiple transcripts among COPs was linked to a variation in splicing, transcriptional start site (TSS) or polyadenylation, we analyzed the representation of these modifications among the spliced COPs genes. The results showed a positive bias for alternative splicing (Table <tblr tid="T5">5</tblr>). Interestingly, this bias was restricted to COPs with an antisense transcript that overlaps the intron region of the sense transcript (Table <tblr tid="T6">6</tblr>). Moreover, among the COPs with antisense transcripts overlapping at least 40 bp of the sense transcript, we also detected a positive bias for a variation of the polyadenylation sites (Table <tblr tid="T6">6</tblr>), whereas no positive or negative bias was observed for TSS variation (see Additional data file 8).</p>
				<tbl id="T6">
					<title>
						<p>Table 6</p>
					</title>
					<caption>
						<p>Analysis of preferences for alternative splicing and polyadenylation site variation among spliced COPs genes, in dependence of the termination site of the antisense transcript</p>
					</caption>
					<tblbdy cols="4">
						<r>
							<c cspan="4" ca="left">
								<p>Spliced COP genes with an antisense transcript not overlapping a sense transcript intron region, show a significant negative bias for alternative splicing</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>COPs genes</p>
							</c>
							<c ca="left">
								<p>COPs with antisense gene ending 3,000-0 bp before the sense I/E boundary</p>
							</c>
							<c ca="left">
								<p><it>p</it>-value</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Spliced genes</p>
							</c>
							<c ca="left">
								<p>1,723</p>
							</c>
							<c ca="left">
								<p>1,497</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternatively spliced genes</p>
							</c>
							<c ca="left">
								<p>268 (15.6%)</p>
							</c>
							<c ca="left">
								<p>217 (14.5%)</p>
							</c>
							<c ca="left">
								<p>0.0018</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="4" ca="left">
								<p>Spliced COPs genes with an antisense transcript overlapping a sense transcript intron region, show a significant positive bias for alternative splicing</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>COPs genes</p>
							</c>
							<c ca="left">
								<p>COPs with an antisense gene ending 0-3,000 bp behind the sense I/E boundary</p>
							</c>
							<c ca="left">
								<p><it>p</it>-value</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Gene with splicing</p>
							</c>
							<c ca="left">
								<p>1,723</p>
							</c>
							<c ca="left">
								<p>226</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternative splicing</p>
							</c>
							<c ca="left">
								<p>268 (15.6%)</p>
							</c>
							<c ca="left">
								<p>51 (22.6%)</p>
							</c>
							<c ca="left">
								<p>0.0018</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>COPs genes</p>
							</c>
							<c ca="left">
								<p>COPs with an antisense gene ending &gt; 40 bp behind the sense I/E boundary</p>
							</c>
							<c ca="left">
								<p><it>p</it>-value</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Gene with splicing</p>
							</c>
							<c ca="left">
								<p>1,723</p>
							</c>
							<c ca="left">
								<p>129</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternative splicing</p>
							</c>
							<c ca="left">
								<p>268 (15.6%)</p>
							</c>
							<c ca="left">
								<p>35 (27.1%)</p>
							</c>
							<c ca="left">
								<p>0.00032</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="4" ca="left">
								<p>Alternatively spliced COPs sense genes with an antisense transcript ending more than 40 bp behind their last I/E boundary, show a positive bias for polyadenylation site variation</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>COPs genes</p>
							</c>
							<c ca="left">
								<p>COPs with an antisense gene ending &gt; 40 bp behind the sense I/E boundary</p>
							</c>
							<c ca="left">
								<p><it>p</it>-value</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alternatively spliced</p>
							</c>
							<c ca="left">
								<p>268</p>
							</c>
							<c ca="left">
								<p>35</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Polyadenylation site variation</p>
							</c>
							<c ca="left">
								<p>107 (39.9%)</p>
							</c>
							<c ca="left">
								<p>25 (71.4%)</p>
							</c>
							<c ca="left">
								<p>5.5e-05</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>For 146 genes, the antisense transcript terminates within 10 bp away from the intron-exon boundary of the sense transcript (Figure <figr fid="F3">3</figr> and Additional data file 3). The proportion of alternatively spliced genes among this group increases to 21.2%. We tested whether these COPs genes were specifically prone to alternative splicing of the final exon but could not find any evidence for this assumption (see Additional data file 8).</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Illustration of the distance between the end of the antisense transcript and the last intron-exon boundary of the sense transcript</p>
					</caption>
					<text>
						<p>Illustration of the distance between the end of the antisense transcript and the last intron-exon boundary of the sense transcript. Negative values refer to a termination of the antisense transcript 5' to the intron-exon boundary.</p>
					</text>
					<graphic file="gb-2005-6-6-r51-3"/>
				</fig>
				<p>Overall, the enhanced likelihood that members of overlapping gene pairs contain introns, the enrichment in genes encoding alternatively spliced transcripts, and the increased frequency of alternatively spliced and variably polyadenylated transcripts when an intron overlaps with an antisense transcript, suggest that, at least for the majority of overlapping gene pairs, the antisense transcript could play a role in the regulation of splicing and/or polyadenylation.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>We have characterised the organization and expression profiles of 956 convergent overlapping gene pairs of <it>A. thaliana </it>to assess the potential molecular mechanisms associated with this unusual gene organization, which provided the opportunity for dsRNA formation as a result of the annealing of a sense and antisense transcript.</p>
			<p>In animal genomes especially, a number of different mechanisms have been described that involve dsRNA formation. dsRNA formation can interfere with biological activities that require binding of RNA or proteins to the transcript <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. This may include processes such as RNA splicing, editing, transport, degradation or translation. Alternatively, dsRNA could function as a trigger for an RNAi process, providing a target for Dicer enzymes, and be specifically degraded to small interfering RNA (siRNA) molecules <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The latter mechanism would lead to mutual destruction of the sense and antisense transcripts, whereas antisense transcript-mediated effects on sense RNA processing would not necessarily alter the primary transcript levels, although they could still influence the potential for primary transcript expression.</p>
			<p>As in other species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B19">19</abbr></abbrgrp>, the majority of overlapping gene pairs in <it>A. thaliana </it>are arranged in a convergent orientation. Only a very small group of 13 out of 956 COPs show an overlap between the ORFs of the sense and antisense transcripts, which probably reflects the associated evolutionary stress of such an arrangement, as any mutation in the overlapping region would affect both genes. For 50.1% of COPs the sense-antisense overlap does not include any protein-coding region, which makes it unlikely that in this subgroup of COPs the antisense transcript plays a role in regulating the coding region of the sense gene. Antisense transcripts for the members of this group are more likely to jointly use bidirectional poly(A) signals <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> or to regulate transcript stability <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
			<p>We do observe a very high level of joint expression of sense and antisense transcript from overlapping gene pairs. This is in marked contrast to data from <it>Plasmodium falciparum</it>, where only 5% of sense-antisense loci show joint expression <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and thus support models for a direct regulation of sense transcript by antisense expression via dsRNA degradation. In contrast, the relatively high expression frequency of COPs in <it>Arabidopsis</it>, and the joint presence of sense and antisense transcripts in the same tissue, do not support a dsRNA degradation model. Even a detailed analysis of 1,437 microarrays does not imply that under any conditions or for any specific tissue the COPs gene pool is significantly depleted. While dsRNA-based transcript degradation may occur for some COPs, our data suggest that for the majority of COPs, antisense expression is not linked to transcript degradation pathways.</p>
			<p>An interesting observation, which may hint at an alternative interference mechanism between sense and antisense transcripts, is the significant bias for COPs to be spliced, and the enrichment among COPs of alternatively spliced transcripts. These features may indicate a role for antisense transcripts in alternative splicing. Such a mechanism would resemble the effect of antisense expression for the thyroid hormone receptor gene, <it>erbA&#945;</it>, which leads to alternative RNA processing <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The bias for antisense transcripts to terminate close to the final intron-exon boundary remains a mystery. One could assume that the termination of the antisense transcript near the final sense intron-exon boundary might reflect a selection for antisense transcripts that interfere with splicing of the last intron. However, we could not observe any positive bias for such events among this COPs gene group.</p>
			<p>The assumption that antisense transcripts can interfere with splicing events is further supported by the observation that overlaps between antisense transcripts and a sense intron region generate a bias for alternative splicing and also for polyadenylation variation. This may reflect a linkage between these two mechanisms, which has been demonstrated for animal systems where polyadenylation and the splicing of the final intron especially can be coupled <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
			<p>During the review process of this paper, a similar study by Wang and co-workers <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> was published. The main differences between the two studies are in the sets of overlapping genes considered, and the nature of the experimental evidence of gene expression. While we consider only gene pairs where both partners show evidence of protein-coding capacity, Wang and co-workers also considered cases where at least one transcript may not be protein coding. We base our studies of expression on two large microarray datasets, while Wang and co-workers use data from a massively parallel signature sequencing (MPSS) study. While we find no evidence in the microarray data of exclusive transcription relationships for COPs gene pairs, the MPSS evidence of exclusive transcription of the gene pairs in the Wang study is clear. This apparent contradiction may be explained by differences in the gene sets studied, particularly as expression data is only available for a subset of the genes in each study, or by differences in the quality of the expression data. Nevertheless, the two studies taken together give evidence of various significant biological consequences of gene overlaps, including effects on sense gene splicing or polyadenylation (our study) and coexpression of gene pairs <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
			<p>It is important to remember that in our study, we have concentrated on a specific subgroup of convergently overlapping genes, with both sense and antisense transcripts encoding an ORF. Among this group there may be an over-representation of gene pairs for which sense and antisense transcription jointly regulate the production efficiency of both proteins, for example via the common use of a bidirectional polyadenylation region, or by co-editing of both strands associated as dsRNA. Such mechanisms would require the joint transcription of both genes in the same tissue, and our data do suggest that sense and antisense transcripts are frequently coexpressed. On the other hand, one would assume that co-regulation would preferentially be used for genes encoding proteins that are linked in their biological role. One would therefore expect a high degree of conservation for both proteins among homologous COPs, whereas our data show that COPs with homologous proteins encoded by their sense genes do not show the same conservation for the proteins encoded by their antisense genes.</p>
			<p>The selection of sense/antisense transcripts with coding capacity may also be the reason for the lack of an indication of dsRNA-based degradation of COPs. A considerable proportion of overlapping antisense genes are noncoding RNAs <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> or are <it>trans</it>-NATs transcribed from different genetic loci <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. These overlapping transcript types may contain a much higher proportion of genes regulated by transcript degradation than the COPs analyzed in this study. A final confirmation of the role of dsRNA for individual genes will require a more detailed experimental analysis. Our analysis should, however, provide a useful first step in defining distinct groups of COP genes as a basis for a more detailed molecular characterization.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>The <it>Arabidopsis </it>genome contains 956 COPs with coding capacity that have the potential to form dsRNA. In contrast to data from other species, a comparative expression analysis indicates that sense and antisense transcripts of COPs loci can coexist in the same tissue at the same frequency as the transcripts of any other unlinked genes, with no indication of specific degradation of such sense-antisense transcript pairs. This observation does not exclude the presence of dsRNA degradation pathways for individual loci, but it refocuses the attention on alternative roles for natural antisense transcripts in plants, preferentially those that do not lead to an overall change in transcript levels but rather affect transcript processing or localization. In line with this view, we observe a high proportion of intron-containing genes in COPs, in both <it>Arabidopsis </it>and rice, and an enrichment for genes with alternatively spliced transcripts, indicative of a role for some COP antisense transcripts in splicing modification. In addition, we detect a potential link between alternative splicing and poly(A) site variation. This work provides a set of databases for COPs, based on the degree of sense-antisense overlap and expression, which should provide a basis for the selection of individual candidate loci for a detailed molecular analysis of the different dsRNA pathways.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Analysis of overlapping transcripts</p>
				</st>
				<p>All <it>Arabidopsis </it>genome information, including gene ID (AGI code), transcript orientation, and gene and exon position coordinates of transcripts with coding regions, was downloaded from The <it>Arabidopsis </it>Information Resource (TAIR) ftp website <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. These data were stored in a MySQL database <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> designed for general genomic analysis, and the overlapping transcript analysis was implemented with custom SQL language queries and Perl scripts using the Perl DBI module. The rice genome data were obtained from <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Analysis of gene variation</p>
				</st>
				<p>Genes with more than one transcript were further analyzed for variation in TSS, polyadenylation site, and alternative splicing. TSS variation was detected by comparing the starting genomic positions of the first exons of genes with more than one transcript, and variation in the polyadenylation site by comparing the ends of the last exons. Genes with more than one transcript that had identical TSSs and polyadenylation sites, or had different numbers of exons, were considered as alternatively spliced. Genes with more than one transcript that had the same number of exons and variation in TSS and polyadenylation site, underwent comparison of their intron boundaries to detect alternative splicing. To detect alternative splicing in the last intron, alternatively spliced transcripts underwent comparison of the borders of their last intron.</p>
			</sec>
			<sec>
				<st>
					<p>Hypergeometric distribution</p>
				</st>
				<p><it>p</it>-values for over- or under-representations of genes were calculated as the upper or lower tail of the hypergeometric distribution <it>p</it>(<it>x </it>&#8805; X) or <it>p</it>(<it>x </it>&#8804; <it>X</it>), respectively, where <it>p</it>(<it>x;N,R,k</it>) = <it>C</it>(<it>R,x</it>)<it>C</it>(<it>N </it>- <it>R,k </it>- <it>x</it>)/<it>C</it>(<it>N,k</it>). Here <it>p </it>refers to the probability that a list of <it>k </it>genes should contain <it>x </it>genes with a particular property (for example, alternative splicing), when the list has been selected randomly without replacement from a set of <it>N </it>genes in which <it>R </it>genes exhibit the same property. <it>C</it>(<it>n,m</it>) is the number of distinct combinations of m objects that can be drawn from a set of <it>n </it>objects. The hypergeometric distribution was calculated with the <it>R </it>package <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Microarray data</p>
				</st>
				<p><it>Arabidopsis </it>microarray data were obtained from two sources. The dataset from Gene Expression Omnibus <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp> with accession number GSE636 is a collection of microarray experiments using high-density oligonucleotide arrays. It contains transcriptional activity information (detection call only) for the complete set of all protein-coding genes in different tissues. The Affymetrix ATH1 array data were acquired using the Nottingham <it>Arabidopsis </it>Stock Centre (NASC) AffyWatch service <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. In NASC's datasets, including 1,437 arrays for 93 experimental purposes, the transcription information for each gene consists of detection call and signal value, as calculated from the Affymetrix MAS 5.0 analysis algorithms <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The analysis of expression data reported in Results was achieved using a combination of Perl script processing and Microsoft Excel spreadsheet analysis.</p>
			</sec>
			<sec>
				<st>
					<p>Coding region and upstream sequence similarity analysis of homologous COP genes</p>
				</st>
				<p>The COPs genes were clustered on the basis of their protein sequences with a 20% similarity threshold using the program BLASTclust <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The similarity of the associated coding and upstream regulatory regions within each cluster were tested by pairwise searches using BLAST2P <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional files are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is a supplement to Figure <figr fid="F1">1</figr> listing all overlapping genes with ID, annotation and size of overlapping region. Additional data file <supplr sid="S2">2</supplr> is a supplement to Figure <figr fid="F2">2</figr> classifying 1,912 COPs genes according to their overlapping regions. Additional data file <supplr sid="S3">3</supplr> is a supplement to Figure <figr fid="F3">3</figr>, calculating the antisense transcript end position in relation to the sense intron-exon boundary for the 956 COPs pairs. Additional data file <supplr sid="S4">4</supplr> is a supplement A to Table <tblr tid="T2">2</tblr>, listing 242 COPs member genes of 89 COPs families. Additional data file <supplr sid="S5">5</supplr> is a supplement B to Table <tblr tid="T2">2</tblr>, comparing homology among 1 kb promoter regions of COPs family members. Additional data file <supplr sid="S6">6</supplr> is a supplement C to Table <tblr tid="T2">2</tblr>, comparing homology among sense and antisense encoded proteins for members of 89 COPs families. Additional data file <supplr sid="S7">7</supplr> is a supplement to Table <tblr tid="T4">4</tblr>, showing expression analysis of the total gene pool and the COPs gene pool for 1,437 microarray experiments. Additional data file <supplr sid="S8">8</supplr> is a supplement to Table <tblr tid="T6">6</tblr>, including a correlation analysis of alternative splicing, TSS variation and polyadenylation variation for COPs with respect to the termination of the antisense transcript in relation to the sense intron-exon boundary.</p>
			<suppl id="S1">
				<title>
					<p>Additional File 1</p>
				</title>
				<caption>
					<p>Supplement to Figure <figr fid="F1">1</figr>. List of all overlapping genes with ID, annotation and size of overlapping region</p>
				</caption>
				<text>
					<p>Supplement to Figure <figr fid="F1">1</figr>. List of all overlapping genes with ID, annotation and size of overlapping region. List of all overlapping genes with ID, annotation and size of overlapping region</p>
				</text>
				<file name="gb-2005-6-6-r51-S1.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional File 2</p>
				</title>
				<caption>
					<p>Supplement to Figure <figr fid="F2">2</figr>. Classification of 1,912 COPs genes according to their overlapping regions</p>
				</caption>
				<text>
					<p>Supplement to Figure <figr fid="F2">2</figr>. Classification of 1,912 COPs genes according to their overlapping regions. Classification of 1,912 COPs genes according to their overlapping regions</p>
				</text>
				<file name="gb-2005-6-6-r51-S2.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional File 3</p>
				</title>
				<caption>
					<p>Supplement to Figure <figr fid="F3">3</figr>. Calculation of the antisense transcript end position in relation to the sense intron-exon boundary for the 956 COPs pairs</p>
				</caption>
				<text>
					<p>Supplement to Figure <figr fid="F3">3</figr>. Calculation of the antisense transcript end position in relation to the sense intron-exon boundary for the 956 COPs pairs. Calculation of the antisense transcript end position in relation to the sense intron-exon boundary for the 956 COPs pairs</p>
				</text>
				<file name="gb-2005-6-6-r51-S3.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S4">
				<title>
					<p>Additional File 4</p>
				</title>
				<caption>
					<p>Supplement A to Table <tblr tid="T2">2</tblr>. 242 COPs member genes of 89 COPs families</p>
				</caption>
				<text>
					<p>Supplement A to Table <tblr tid="T2">2</tblr>. 242 COPs member genes of 89 COPs families. 242 COPs member genes of 89 COPs families</p>
				</text>
				<file name="gb-2005-6-6-r51-S4.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S5">
				<title>
					<p>Additional File 5</p>
				</title>
				<caption>
					<p>Supplement B to Table <tblr tid="T2">2</tblr>. Comparison of homology among 1 kb promoter regions of COPs family members</p>
				</caption>
				<text>
					<p>Supplement B to Table <tblr tid="T2">2</tblr>. Comparison of homology among 1 kb promoter regions of COPs family members. Comparison of homology among 1 kb promoter regions of COPs family members</p>
				</text>
				<file name="gb-2005-6-6-r51-S5.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S6">
				<title>
					<p>Additional File 6</p>
				</title>
				<caption>
					<p>Supplement C to Table <tblr tid="T2">2</tblr>. Homology comparison among sense and antisense encoded proteins for members of 89 COPs families</p>
				</caption>
				<text>
					<p>Supplement C to Table <tblr tid="T2">2</tblr>. Homology comparison among sense and antisense encoded proteins for members of 89 COPs families. Homology comparison among sense and antisense encoded proteins for members of 89 COPs families</p>
				</text>
				<file name="gb-2005-6-6-r51-S6.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S7">
				<title>
					<p>Additional File 7</p>
				</title>
				<caption>
					<p>Supplement to Table <tblr tid="T4">4</tblr>. Expression analysis of the total gene pool and the COPs gene pool for 1,437 microarray experiments</p>
				</caption>
				<text>
					<p>Supplement to Table <tblr tid="T4">4</tblr>. Expression analysis of the total gene pool and the COPs gene pool for 1,437 microarray experiments. Expression analysis of the total gene pool and the COPs gene pool for 1,437 microarray experiments</p>
				</text>
				<file name="gb-2005-6-6-r51-S7.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S8">
				<title>
					<p>Additional File 8</p>
				</title>
				<caption>
					<p>Supplement to Table <tblr tid="T6">6</tblr>. Correlation analysis of alternative splicing, TSS variation and polyadenylation variation for COPs with respect to the termination of the antisense transcript in relation to the sense intron-exon boundary</p>
				</caption>
				<text>
					<p>Supplement to Table <tblr tid="T6">6</tblr>. Correlation analysis of alternative splicing, TSS variation and polyadenylation variation for COPs with respect to the termination of the antisense transcript in relation to the sense intron-exon boundary. Correlation analysis of alternative splicing, TSS variation and polyadenylation variation for COPs with respect to the termination of the antisense transcript in relation to the sense intron-exon boundary</p>
				</text>
				<file name="gb-2005-6-6-r51-S8.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We are grateful to Athanasios Theologis for providing access to the GSE636 database. This work was supported by a European Commission grant to the Epigenome Network of Excellence (503433).</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Shedding light on the dark side of the genome: overlapping genes in higher eukaryotes.</p>
				</title>
				<aug>
					<au>
						<snm>Boi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Solda</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Tenchini</snm>
						<fnm>ML</fnm>
					</au>
				</aug>
				<source>Curr Genomics</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>509</fpage>
				<lpage>524</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2174/1389202043349020</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Overlapping antisense transcription in the human genome.</p>
				</title>
				<aug>
					<au>
						<snm>Fahey</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Moore</snm>
						<fnm>TF</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
				</aug>
				<source>Comp Funct Genomics</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>244</fpage>
				<lpage>253</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1002/cfg.173</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Antisense transcripts in the human genome.</p>
				</title>
				<aug>
					<au>
						<snm>Lehner</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Williams</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Campbell</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Sanderson</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<fpage>63</fpage>
				<lpage>65</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(02)02598-2</pubid>
						<pubid idtype="pmpid" link="fulltext">11818131</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Antisense RNAs everywhere?</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>EGH</fnm>
					</au>
					<au>
						<snm>Flardh</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<fpage>223</fpage>
				<lpage>226</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(02)02658-6</pubid>
						<pubid idtype="pmpid" link="fulltext">12047936</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Expression of the thyroid hormone receptor gene, erbA&#945;, in B lymphocytes: alternative mRNA processing is independent of differentiation but correlates with antisense RNA levels.</p>
				</title>
				<aug>
					<au>
						<snm>Hastings</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Milcarek</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Martincic</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Peterson</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Munroe</snm>
						<fnm>SH</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1997</pubdate>
				<volume>25</volume>
				<fpage>4296</fpage>
				<lpage>4300</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">147039</pubid>
						<pubid idtype="pmpid" link="fulltext">9336460</pubid>
						<pubid idtype="doi">10.1093/nar/25.21.4296</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>The gene structure and expression of human ABHD1: overlapping polyadenylation signal sequence with Sec12.</p>
				</title>
				<aug>
					<au>
						<snm>Edgar</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<fpage>18</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">156608</pubid>
						<pubid idtype="pmpid" link="fulltext">12735795</pubid>
						<pubid idtype="doi">10.1186/1471-2164-4-18</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Phylogenetic conservation of the Makorin-2 gene, encoding a multiple zinc-finger protein, antisense to the RAF1 proto-oncogene.</p>
				</title>
				<aug>
					<au>
						<snm>Gray</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Azama</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Whitmore</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Min</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Abe</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nicholls</snm>
						<fnm>RD</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>2001</pubdate>
				<volume>77</volume>
				<fpage>119</fpage>
				<lpage>126</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/geno.2001.6627</pubid>
						<pubid idtype="pmpid" link="fulltext">11597136</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Natural antisense (rTS[&#945;]) RNA induces site-specific cleavage of thymidylate synthase mRNA.</p>
				</title>
				<aug>
					<au>
						<snm>Chu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Dolnick</snm>
						<fnm>BJ</fnm>
					</au>
				</aug>
				<source>Biochim Biophys Acta</source>
				<pubdate>2002</pubdate>
				<volume>1587</volume>
				<fpage>183</fpage>
				<lpage>193</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12084460</pubid>
						<pubid idtype="doi">10.1016/S0925-4439(02)00081-9</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Double-stranded RNA-mediated silencing of genomic tandem repeats and transposable elements in the <it>D. melanogaster </it>germline.</p>
				</title>
				<aug>
					<au>
						<snm>Aravin</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Naumova</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Tulin</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Vagin</snm>
						<fnm>VV</fnm>
					</au>
					<au>
						<snm>Rozovsky</snm>
						<fnm>YM</fnm>
					</au>
					<au>
						<snm>Gvozdev</snm>
						<fnm>VA</fnm>
					</au>
				</aug>
				<source>Curr Biol</source>
				<pubdate>2001</pubdate>
				<volume>11</volume>
				<fpage>1017</fpage>
				<lpage>1027</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0960-9822(01)00299-8</pubid>
						<pubid idtype="pmpid" link="fulltext">11470406</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Erratum to expression of alternatively spliced FGF-2 antisense RNA transcripts in the central nervous system: regulation of FGF-2 mRNA translation.</p>
				</title>
				<aug>
					<au>
						<snm>Li</snm>
						<fnm>AW</fnm>
					</au>
					<au>
						<snm>Murphy</snm>
						<fnm>PR</fnm>
					</au>
				</aug>
				<source>Mol Cell Endocrinol</source>
				<pubdate>2000</pubdate>
				<volume>170</volume>
				<fpage>231</fpage>
				<lpage>242</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/S0303-7207(00)00440-8</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>An unwinding activity that covalently modifies its double-stranded RNA substrate.</p>
				</title>
				<aug>
					<au>
						<snm>Bass</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Weintraub</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>1988</pubdate>
				<volume>55</volume>
				<fpage>1089</fpage>
				<lpage>1098</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0092-8674(88)90253-X</pubid>
						<pubid idtype="pmpid">3203381</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>The fate of dsRNA in the nucleus: a p54nrb-containing complex mediates the nuclear retention of promiscuously A-to-I edited RNAs.</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Carmichael</snm>
						<fnm>GG</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2001</pubdate>
				<volume>106</volume>
				<fpage>465</fpage>
				<lpage>475</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(01)00466-4</pubid>
						<pubid idtype="pmpid" link="fulltext">11525732</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Do natural antisense transcripts make sense in eukaryotes?</p>
				</title>
				<aug>
					<au>
						<snm>Vanhee-Brossollet</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Vaquero</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>1998</pubdate>
				<volume>211</volume>
				<fpage>1</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(98)00093-6</pubid>
						<pubid idtype="pmpid" link="fulltext">9573333</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Gene Expression Omnibus</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/geo</url>
			</bibl>
			<bibl id="B15">
				<title>
					<p>NCBI GEO: mining millions of expression profiles - database and tools.</p>
				</title>
				<aug>
					<au>
						<snm>Barrett</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Suzek</snm>
						<fnm>TO</fnm>
					</au>
					<au>
						<snm>Troup</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Wilhite</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Ngau</snm>
						<fnm>WC</fnm>
					</au>
					<au>
						<snm>Ledoux</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Rudnev</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lash</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Fujibuchi</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Edgar</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<issue>33 Database</issue>
				<fpage>D562</fpage>
				<lpage>D566</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">539976</pubid>
						<pubid idtype="pmpid" link="fulltext">15608262</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>NASCArrays: a repository for microarray data generated by NASC's transcriptomics service.</p>
				</title>
				<aug>
					<au>
						<snm>Craigon</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>James</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Okyere</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Jotham</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>May</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nucleic Acid Res</source>
				<pubdate>2004</pubdate>
				<issue>32 Database</issue>
				<fpage>D575</fpage>
				<lpage>D577</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308867</pubid>
						<pubid idtype="pmpid" link="fulltext">14681484</pubid>
						<pubid idtype="doi">10.1093/nar/gkh133</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>NASCA Arrays: Affymetrix ATH1 arrays database</p>
				</title>
				<url>http://affymetrix.arabidopsis.info/narrays/experimentbrowse.pl</url>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Potent and specific genetic interference by double-stranded RNA in <it>Caenorhabditis elegans</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Fire</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Montgomery</snm>
						<fnm>MK</fnm>
					</au>
					<au>
						<snm>Kostas</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Driver</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Mello</snm>
						<fnm>CC</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>1998</pubdate>
				<volume>391</volume>
				<fpage>806</fpage>
				<lpage>811</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35888</pubid>
						<pubid idtype="pmpid" link="fulltext">9486653</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>In search of antisense.</p>
				</title>
				<aug>
					<au>
						<snm>Lavorgna</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Dahary</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lehner</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Sorek</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Sanderson</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Casari</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>2004</pubdate>
				<volume>29</volume>
				<fpage>88</fpage>
				<lpage>94</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tibs.2003.12.002</pubid>
						<pubid idtype="pmpid" link="fulltext">15102435</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Widespread distribution of antisense transcripts in the <it>Plasmodium falciparum </it>genome</p>
				</title>
				<aug>
					<au>
						<snm>Gunasekera</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Patankar</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Schug</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Roos</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Wirth</snm>
						<fnm>DF</fnm>
					</au>
				</aug>
				<source>Mol Biochem Parasitol</source>
				<pubdate>2004</pubdate>
				<volume>136</volume>
				<fpage>35</fpage>
				<lpage>42</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.molbiopara.2004.02.007</pubid>
						<pubid idtype="pmpid" link="fulltext">15138065</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Transcriptional termination in the Balbiani ring 1gene is closely coupled to 3'-end formation and excision of the 3'-terminal intron.</p>
				</title>
				<aug>
					<au>
						<snm>Bauren</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Belikov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Wieslander</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Genes Dev</source>
				<pubdate>1998</pubdate>
				<volume>12</volume>
				<fpage>2759</fpage>
				<lpage>2769</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">317118</pubid>
						<pubid idtype="pmpid" link="fulltext">9732273</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Genome-wide prediction and identification of cis-natural antisense transcripts in <it>Arabidopsis thaliana</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>X-J</fnm>
					</au>
					<au>
						<snm>Gaasterland</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Chua</snm>
						<fnm>N-H</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>R30</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15833117</pubid>
						<pubid idtype="doi">10.1186/gb-2005-6-4-r30</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Antisense transcripts with FANTOM2 clone set and their implications for gene regulation.</p>
				</title>
				<aug>
					<au>
						<snm>Kiyosawa</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Yamanaka</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Osato</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Kondo</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>1324</fpage>
				<lpage>1334</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403655</pubid>
						<pubid idtype="pmpid" link="fulltext">12819130</pubid>
						<pubid idtype="doi">10.1101/gr.982903</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>TAIR ftp website</p>
				</title>
				<url>ftp://ftp.arabidopsis.org/Maps/seqviewer_data/sv_gene_feature.data</url>
			</bibl>
			<bibl id="B25">
				<title>
					<p>MySQL.</p>
				</title>
				<aug>
					<au>
						<snm>DuBois</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<publisher>Indianapolis, IN: New Riders Publishing</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B26">
				<title>
					<p>TIGR rice: rice expression database</p>
				</title>
				<url>http://www.tigr.org/tdb/e2k1/osa1/expression/alt_spliced.info.shtml</url>
			</bibl>
			<bibl id="B27">
				<title>
					<p><it>O. sativa </it>database</p>
				</title>
				<url>ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/pseudomolecules/version_3.0/all_chrs/all.TU_model.brief_info.3.1</url>
			</bibl>
			<bibl id="B28">
				<title>
					<p>R:A language for data analysis and graphics.</p>
				</title>
				<aug>
					<au>
						<snm>Ihaka</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Gentlement</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Comp Graph Statist</source>
				<pubdate>1996</pubdate>
				<volume>5</volume>
				<fpage>299</fpage>
				<lpage>314</lpage>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Robust estimators for expression analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Hubbell</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>WM</fnm>
					</au>
					<au>
						<snm>Mei</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<fpage>1585</fpage>
				<lpage>1592</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/18.12.1585</pubid>
						<pubid idtype="pmpid" link="fulltext">12490442</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>BLAST: at the core of a powerful and diverse set of sequence analysis tools.</p>
				</title>
				<aug>
					<au>
						<snm>McGinnis</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Madden</snm>
						<fnm>TL</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<fpage>W20</fpage>
				<lpage>W25</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">441573</pubid>
						<pubid idtype="pmpid" link="fulltext">15215342</pubid>
						<pubid idtype="doi">10.1093/nar/gnh003</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Enhanced homology searching through genome reading frame predetermination.</p>
				</title>
				<aug>
					<au>
						<snm>Yuan</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bush</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Elbrecht</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Zhao</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Blevins</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<fpage>1416</fpage>
				<lpage>1427</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth115</pubid>
						<pubid idtype="pmpid" link="fulltext">14976033</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
