<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-6-r52</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Tiling microarray analysis of rice chromosome 10 to identify the transcriptome and relate its expression to chromosomal architecture</p>
			</title>
			<aug>
				<au id="A1" ce="yes">
					<snm>Li</snm>
					<fnm>Lei</fnm>
					<insr iid="I1"/>
					<email>lei.li.ll326@yale.edu</email>
				</au>
				<au id="A2" ce="yes">
					<snm>Wang</snm>
					<fnm>Xiangfeng</fnm>
					<insr iid="I2"/>
					<insr iid="I3"/>
					<insr iid="I4"/>
					<email>wangxiangfeng@genomics.org.cn</email>
				</au>
				<au id="A3">
					<snm>Xia</snm>
					<fnm>Mian</fnm>
					<insr iid="I5"/>
					<email>mianxia@hotmail.com</email>
				</au>
				<au id="A4">
					<snm>Stolc</snm>
					<fnm>Viktor</fnm>
					<insr iid="I1"/>
					<insr iid="I6"/>
					<email>vstolc@mail.arc.nasa.gov</email>
				</au>
				<au id="A5">
					<snm>Su</snm>
					<fnm>Ning</fnm>
					<insr iid="I1"/>
					<email>ning.su@yale.edu</email>
				</au>
				<au id="A6">
					<snm>Peng</snm>
					<fnm>Zhiyu</fnm>
					<insr iid="I2"/>
					<email>pengzhy@mail.cbi.pku.edu.cn</email>
				</au>
				<au id="A7">
					<snm>Li</snm>
					<fnm>Songgang</fnm>
					<insr iid="I3"/>
					<email>Lisg@pku.edu.cn</email>
				</au>
				<au id="A8">
					<snm>Wang</snm>
					<fnm>Jun</fnm>
					<insr iid="I4"/>
					<email>Wangj@genomics.org.cn</email>
				</au>
				<au id="A9">
					<snm>Wang</snm>
					<fnm>Xiping</fnm>
					<insr iid="I5"/>
					<email>xipingwang@hotmail.com</email>
				</au>
				<au id="A10" ca="yes">
					<snm>Deng</snm>
					<mnm>Wang</mnm>
					<fnm>Xing</fnm>
					<insr iid="I1"/>
					<email>xingwang.deng@yale.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA</p>
				</ins>
				<ins id="I2">
					<p>National Institute of Biological Sciences, Zhongguancun Life Science Park, Beijing 102206, China</p>
				</ins>
				<ins id="I3">
					<p>Peking-Yale Joint Research Center of Plant Molecular Genetics and Agrobiotechnology, College of Life Sciences, Peking University, Beijing 100871, China</p>
				</ins>
				<ins id="I4">
					<p>Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China</p>
				</ins>
				<ins id="I5">
					<p>National Center of Crop Design, China Bioway Biotech Group Co., LTD, Beijing 100085, China</p>
				</ins>
				<ins id="I6">
					<p>Genome Research Facility, NASA Ames Research Center, MS 239-11, Moffett Field, CA 94035, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>6</issue>
			<fpage>R52</fpage>
			<url>http://genomebiology.com/2005/6/6/R52</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15960804</pubid><pubid idtype="doi">10.1186/gb-2005-6-6-r52</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>14</day>
					<month>1</month>
					<year>2005</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>1</day>
					<month>4</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>25</day>
					<month>4</month>
					<year>2005</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>27</day>
					<month>5</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>Li et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Tiling microarray analysis of rice chromosome 10</p>
		</shorttitle>
		<shortabs>
			<p>A transcriptome analysis of chromosome 10 of 2 rice subspecies identifies 549 new gene models and gives experimental evidence for around 75% of the previously unsupported predicted genes.
</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Sequencing and annotation of the genome of rice (<it>Oryza sativa</it>) have generated gene models in numbers that top all other fully sequenced species, with many lacking recognizable sequence homology to known genes. Experimental evaluation of these gene models and identification of new models will facilitate rice genome annotation and the application of this knowledge to other more complex cereal genomes.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>We report here an analysis of the chromosome 10 transcriptome of the two major rice subspecies, <it>japonica </it>and <it>indica</it>, using oligonucleotide tiling microarrays. This analysis detected expression of approximately three-quarters of the gene models without previous experimental evidence in both subspecies. Cloning and sequence analysis of the previously unsupported models suggests that the predicted gene structure of nearly half of those models needs improvement. Coupled with comparative gene model mapping, the tiling microarray analysis identified 549 new models for the <it>japonica </it>chromosome, representing an 18% increase in the annotated protein-coding capacity. Furthermore, an asymmetric distribution of genome elements along the chromosome was found that coincides with the cytological definition of the heterochromatin and euchromatin domains. The heterochromatin domain appears to associate with distinct chromosome level transcriptional activities under normal and stress conditions.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>These results demonstrated the utility of genome tiling microarray in evaluating annotated rice gene models and in identifying novel transcriptional units. The tiling microarray sanalysis further revealed a chromosome-wide transcription pattern that suggests a role for transposable element-enriched heterochromatin in shaping global transcription in response to environmental changes in rice.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>As one of the most important crop species in the world and a model for the Gramineae family, rice (<it>Oryza sativa</it>) was selected as the first monocotyledonous plant to have its genome completely sequenced. Draft genome sequences of the two major subspecies of rice, <it>indica </it>and <it>japonica</it>, were made available in 2002 <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. These were followed by the advanced sequences of <it>japonica </it>chromosomes 1, 4 and 10 <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The finish-quality whole-genome sequences of <it>indica </it>and <it>japonica </it>have recently been obtained <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>.</p>
			<p>Available rice sequences have been subjected to extensive annotation using <it>ab initio </it>gene prediction, comparative genomics, and a variety of other methods. These analyses revealed abundant compositional and structural features of the predicted rice genes that deviate from genes in other model organisms. For example, distinctive negative gradients of GC content, codon usage, and amino-acid usage along the direction of transcription were observed in many rice gene models <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B9">9</abbr></abbrgrp>. On the other hand, many predicted rice genes that lack significant homology to genes in other organisms also exhibit characteristics such as unusual GC composition and distribution, suggesting that they might not be true genes <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Furthermore, the abundance and diversity of transposable elements (TEs) within the rice genome that possess a coding capacity pose an additional challenge to accurate annotation of the rice genome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
			<p>As such, our understanding of the rice genome is largely limited to the state-of-the-art gene prediction and annotation programs. This is probably best reflected by the lack of a consensus of the estimation of the total gene number in rice <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Estimated total gene number based on the draft sequences of <it>japonica </it>and <it>indica </it>ranged widely from 30,000 to 60,000 <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Finished sequences of chromosome 1, 4 and 10 allowed a more finely tuned estimate that placed the total number of rice genes between 57,000 and 62,500 <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. These estimates included a large number of gene models that contain TE-related open reading frames (ORFs). Excluding the TE-related ORFs could reduce the gene number to about 45,000 <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Even then, between one third and one half of the predicted genes appear to have no recognizable homologs in the other model plant <it>Arabidopsis thaliana </it><abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Further, aggressive manual annotations of portions of the finished rice sequence have disqualified many of the low-homology gene models as TE-related or artifacts, arguing that there are no more than 40,000 nonredundant genes in rice <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
			<p>Experimental evidence such as full-length cDNA sequences and expressed sequence tags (ESTs) is critical for evaluation and improvement of the genome annotation <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Large collections of rice full-length cDNA and ESTs are available <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B17">17</abbr></abbrgrp>; however, given the large number of rice genes, current methods for collecting expressed sequences do not provide the necessary depth of coverage. For example, based on high-stringency alignments to EST sequences available at that time, only 24.7% of the 3,471 initially predicted genes of chromosome 10 were matched <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Conversely, other experiment-oriented approaches, such as massively parallel signature sequencing <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, are able to provide sufficient coverage of the transcriptome but by their nature are limited in their ability to define gene structures. Thus, it is important to survey the transcriptome using additional experimental means that permit detailed analyses of current gene models and the identification of new models.</p>
			<p>Recent studies in several model organisms have demonstrated the utility of tiling microarrays in transcriptome identification <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Armed with new microarray technologies, it is now possible to prepare high-density oligonucleotide tiling microarrays to interrogate genomic sequences irrespective of their annotations. Consequently, results from these studies indicate that a significant portion of the transcriptome resides outside the predicted coding regions <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. In addition, these studies show that tiling microarrays are able to improve or correct the predicted gene structures <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B23">23</abbr><abbr bid="B26">26</abbr></abbrgrp>. Based on considerations of feature density, versatility of modification, and compatibility with our existing conventional microarray facility, the maskless array synthesizer (MAS) platform <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp> was chosen for our rice transcriptome analysis.</p>
			<p>Here we report the construction and analysis of two independent sets of custom high-density oligonucleotide tiling microarrays with unique 36-mer probe sequences tiled throughout the nonrepetitive sequences of chromosome 10 for both <it>japonica </it>and <it>indica </it>rice. Hybridized with a mixed pool of cDNA targets, these tiling microarrays detected over 80% of the annotated nonredundant gene models in both <it>japonica </it>and <it>indica</it>, and identified a large number of transcriptionally active intergenic regions. These results, coupled with comparative gene model mapping and reverse transcription PCR (RT-PCR) analysis, allowed the first comprehensive identification and analysis of a rice chromosomal transcriptome. These results further revealed an association of chromosome 10 transcriptome regulation with the euchromatin-heterochromatin organization at the chromosomal level.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Rice chromosome 10 oligonucleotide tiling microarrays</p>
				</st>
				<p>Based on recent studies using MAS oligonucleotide tiling microarrays to obtain gene expression and structure information <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>, we designed two independent sets of 36-mer probes, with 10-nucleotide intervals, tiled throughout both strands of <it>japonica </it>and <it>indica </it>chromosome 10, respectively. After filtering out those probes that represent sequences with a high copy number or a high degree of complementarity, 750,282 and 838,816 probes were retained to interrogate the entire nonrepetitive sequences of <it>japonica </it>and <it>indica </it>chromosome 10 and were synthesized in two sets of MAS microarrays <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B29">29</abbr></abbrgrp>. The arrays were hybridized with target cDNA prepared from equal amounts of four selected poly(A)<sup>+ </sup>RNA populations (the N Arrays), namely, seedling roots, seedling shoots, panicles, and suspension cultured cells of the respective rice subspecies. In addition, a set of <it>japonica </it>arrays was hybridized to shoot poly(A)<sup>+ </sup>RNA derived from seedlings with a mineral/nutrient disturbance (the S Arrays).</p>
				<p>Our MAS microarrays utilize a 'chessboard' design, meaning that each positive feature, which contains an interrogating probe, is surrounded by four negative features and vice versa <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr></abbrgrp>. Given that both positive and negative features contain a linker oligo to which the interrogating probes were synthesized, it was possible to determine signal probes (those that detect an RNA target) using a two-step procedure. After normalization (Figure <figr fid="F1">1a,b</figr>), positive features with fluorescence intensities lower than the mean intensity of the four surrounding negative features were masked. A characteristic bimodal intensity distribution of the remaining positive features was observed for each microarray (Figure <figr fid="F1">1c</figr>). Based on a statistical model to reject noise probes at a 90% confidence (see Materials and methods), signal probes and their normalized fluorescence intensities were determined (Figure <figr fid="F1">1c</figr>). Signal probes were correlated with the transcriptionally active regions (TARs) of the chromosome by alignment of the probes to the chromosomal coordinates (Figure <figr fid="F2">2</figr>). Experimental identification of the transcriptome was then achieved by systematically examining the expression of the annotated gene models and screening for intergenic TARs.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Processing the rice chromosome 10 tiling microarray hybridization data</p>
					</caption>
					<text>
						<p>Processing the rice chromosome 10 tiling microarray hybridization data. <b>(a) </b>Distribution of fluorescence intensity of all positive and negative features of the four <it>indica </it>N Arrays. <b>(b) </b>All eight distributions were scaled to have a uniform intensity peak value at 8 (log<sub>2</sub>). <b>(c) </b>Mathematic model for determination of signal probes. A bimodal distribution of log<sub>2 </sub>background-adjusted intensity of all positive features is used to model the noise as a normal distribution by mirroring the distribution of low intensity (&lt; 6 of log<sub>2</sub>). A cutoff value corresponding to a 90% confidence level to reject noise probes according to the modeled noise distribution is indicated. <b>(d) </b>Distribution of hybridization rate in the exonic and intronic regions of rice chromosome 10. Hybridization rate (HR) is calculated as the ratio of the number of signal probes against the total number of interrogating probes per kilobase of sequence.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-1"/>
				</fig>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Tiling microarray analysis of the rice chromosome 10 transcriptome</p>
					</caption>
					<text>
						<p>Tiling microarray analysis of the rice chromosome 10 transcriptome. <b>(a) </b>Schematic representation of rice chromosome 10. The purple oval denotes the centromere. <b>(b) </b>A region from the long arm of chromosome 10 displaying the three sets of gene models used: BGI <it>indica</it>; TIGR <it>japonica </it>and BGI <it>japonica</it>. The nonredundant protein-coding gene models are aligned to the chromosomal sequences and color-coded on the basis of their classification (see text). <b>(c) </b>Detailed tiling profile of one representative CG model. The model is represented here as block arrows, which point in the direction of transcription. Signal oligos are aligned according to their chromosomal coordinates. The fluorescence intensity value of each signal oligo, capped at 2,500, is depicted as a vertical bar. The shade of the bar represents the oligo index score (see Materials and methods). The red blocks underneath the bars indicate the presence of an interrogating oligo in the microarray.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-2"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Rice chromosome 10 gene models</p>
				</st>
				<p>Finished sequences have been determined for both <it>japonica </it>and <it>indica </it>chromosome 10 <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Initial annotation of <it>japonica </it>chromosome 10 produced 3,471 protein-coding gene models <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, which was updated to 3,856 in the release 2 of the Rice Pseudomolecules from The Institute for Genomic Research (TIGR) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Of these, 829 (21.5%) were found to be TE-related models. Eight gene models were mapped to other chromosomes, and were not included in this study. Classification of the 3,019 nonredundant protein-coding gene models was based on alignments to the rice full-length cDNA and ESTs <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B17">17</abbr></abbrgrp>. These analyses led to the identification of 935 (31.0%) cDNA-supported gene (CG) and 321 (10.6%) EST-supported gene (EG) models. The remaining 1763 (58.4%) models were classified as unsupported gene (UG) models. This model set is designated TIGR <it>japonica </it>(Table <tblr tid="T1">1</tblr>, Figure <figr fid="F2">2</figr> and see Additional data file 1).</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Classification and array detection of rice chromosome 10 gene models</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="left">
								<p>Annotation</p>
							</c>
							<c cspan="4" ca="left">
								<p>Nonredundant protein-coding gene model</p>
							</c>
							<c ca="center">
								<p>TE</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Type</p>
							</c>
							<c ca="left">
								<p>Annotated</p>
							</c>
							<c ca="left">
								<p>Detected</p>
							</c>
							<c ca="left">
								<p>Percentage</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BGI <it>indica</it></p>
							</c>
							<c ca="left">
								<p>CG</p>
							</c>
							<c ca="left">
								<p>821</p>
							</c>
							<c ca="left">
								<p>784</p>
							</c>
							<c ca="left">
								<p>95.5%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>EG</p>
							</c>
							<c ca="left">
								<p>328</p>
							</c>
							<c ca="left">
								<p>290</p>
							</c>
							<c ca="left">
								<p>88.4%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>UG</p>
							</c>
							<c ca="left">
								<p>1,660</p>
							</c>
							<c ca="left">
								<p>1,354</p>
							</c>
							<c ca="left">
								<p>81.6%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total</p>
							</c>
							<c ca="left">
								<p>2,809</p>
							</c>
							<c ca="left">
								<p>2,428</p>
							</c>
							<c ca="left">
								<p>86.4%</p>
							</c>
							<c ca="center">
								<p>574</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BGI <it>japonica</it></p>
							</c>
							<c ca="left">
								<p>CG</p>
							</c>
							<c ca="left">
								<p>943</p>
							</c>
							<c ca="left">
								<p>879</p>
							</c>
							<c ca="left">
								<p>93.2%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>EG</p>
							</c>
							<c ca="left">
								<p>272</p>
							</c>
							<c ca="left">
								<p>238</p>
							</c>
							<c ca="left">
								<p>87.5%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>UG</p>
							</c>
							<c ca="left">
								<p>1,549</p>
							</c>
							<c ca="left">
								<p>1,202</p>
							</c>
							<c ca="left">
								<p>77.6%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total</p>
							</c>
							<c ca="left">
								<p>2,764</p>
							</c>
							<c ca="left">
								<p>2,319</p>
							</c>
							<c ca="left">
								<p>83.9%</p>
							</c>
							<c ca="center">
								<p>851</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TIGR <it>japonica</it></p>
							</c>
							<c ca="left">
								<p>CG</p>
							</c>
							<c ca="left">
								<p>935</p>
							</c>
							<c ca="left">
								<p>871</p>
							</c>
							<c ca="left">
								<p>93.2%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>EG</p>
							</c>
							<c ca="left">
								<p>321</p>
							</c>
							<c ca="left">
								<p>291</p>
							</c>
							<c ca="left">
								<p>90.7%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>UG</p>
							</c>
							<c ca="left">
								<p>1,763</p>
							</c>
							<c ca="left">
								<p>1,310</p>
							</c>
							<c ca="left">
								<p>74.3%</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total</p>
							</c>
							<c ca="left">
								<p>3,019</p>
							</c>
							<c ca="left">
								<p>2,472</p>
							</c>
							<c ca="left">
								<p>81.9%</p>
							</c>
							<c ca="center">
								<p>829</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Rice chromosome 10 protein-coding gene models were divided into TE and nonredundant models based on available annotations. Because of their repetitiveness, expression of TE models was not assessed. The nonredundant models were further divided into CG, EG and UG models based on their alignment to rice full-length cDNAs and ESTs and their expression assessed by tiling microarray analysis.</p>
					</tblfn>
				</tbl>
				<p>For comparison, the so-called BGI <it>japonica </it>gene models were included, whereby the <it>japonica </it>chromosome 10 sequence was independently annotated by the Beijing Genomics Institute (BGI) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp>. This model set, generated by the FGENESH output with limited full-length cDNA/EST input, contains 851 TE, 943 CG, 272 EG, and 1,549 UG models (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F2">2</figr>). To analyze the <it>indica </it>chromosome 10 transcriptome, and for comparative analysis, the BGI <it>indica </it>models were also examined <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp>. Classification of the <it>indica </it>models identified 574 TE, 821 CG, 328 EG, and 1,660 UG models (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F2">2</figr> and see Additional data file 2).</p>
			</sec>
			<sec>
				<st>
					<p>Tiling microarray detection of rice chromosome 10 gene models</p>
				</st>
				<p>Analysis of the N arrays detected 2,428 out of 2,809 BGI <it>indica </it>(86.4%), 2,319 out of 2,764 BGI <it>japonica </it>(83.9%), and 2,472 out of 3,019 TIGR <it>japonica </it>(81.9%) nonredundant gene models (Table <tblr tid="T1">1</tblr>). Although no technical replication was performed, several observations indicate that tiling microarray analysis provides a reliable evaluation of the expression of the gene models. First, consistent with their classification, gene models with previous experimental support (CG and EG) showed a higher detection rate than the unsupported models (Table <tblr tid="T1">1</tblr>). For example, 93.2% and 90.7% of the TIGR <it>japonica </it>CG and EG models were detected, respectively, whereas only 74.3% of the UG models were (Table <tblr tid="T1">1</tblr>). Second, supported models (CG and EG) exhibited very similar array detection rates across the three sets of gene models. Because the same cDNA and ESTs were used to classify the three sets of gene models, this result implies a strong correlation between tiling microarray detection and expressed sequences. In supporting of this conclusion, TIGR <it>japonica </it>models with at least one match with rice EST sequences exhibited a 92.7% (1,010 of 1,089) detection rate whereas only 75.7% (1,458 of 1,925) models without a matching EST were detected. Third, examination of signal probe distribution, measured by hybridization rate (HR, see Materials and methods), in the annotated exonic and intronic regions indicates that the tiling microarrays detected transcription predominantly locate in the exons. Across the three annotations, the HRs of both the intronic regions (dashed lines) and exonic regions (solid lines) showed bimodal distributions, with their respective major peaks well separated (Figure <figr fid="F1">1d</figr>). The minor intronic HR peak likely reflects transcriptional activities of exons misidentified as introns or in uncharacterized splice variants. Conversely, the minor exonic HR peak is likely to be due to misinterpretation of introns as exons, or exons or genes not expressed at all in the RNA populations used (Figure <figr fid="F1">1d</figr>).</p>
			</sec>
			<sec>
				<st>
					<p>Analysis of previously unsupported gene models</p>
				</st>
				<p>The relatively poor detection rate for the unsupported models suggests that their expression may be more restricted to specific cell types or developmental stages, thus eluding tiling array detection. Alternatively, some of these UG models might be false and do not represent real genes. For further analysis, gene models were classified as high homology (HH) and low homology (LH) models based on comparison using an expect value of e<sup>-7 </sup>for predicted protein homology between rice and <it>Arabidopsis </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. It should be noted that the simple sequence alignment is likely to fail to detect some structural homology. However, this simple division is useful for separating two groups of gene models for expression comparison. For example, in the BGI <it>japonica </it>annotation, there are 589 UG/HH and 960 UG/LH models. By comparison, our tiling microarray detected 495 (84.0%) UG/HH models, but only 707 (73.7%) UG/LH models. Because the UG/LH models lack any previous supporting evidence (either homology or expression), concerns have been raised as to whether they represent real genes <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>; therefore, the expression properties of the UG/LH models are of particular interest for further evaluation.</p>
				<p>To investigate the possibility that expression of some UG/LH models is restricted to special conditions, we analyzed the S Arrays with regard to UG model expression. Of the gene models in the BGI <it>japonica </it>annotation, 63.4% were detected in seedling shoots under a variety of stress conditions that are known to significantly alter gene expression profiles <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. These included 39 (2 CG/HH, 2 EG/HH, 8 UG/HH, 2 CG/LH, 2 EG/LH and 23 UG/LH) models that eluded detection by the N Arrays. The enrichment of UG/LH models in S Arrays-specific models indicates that some UG/LH models indeed have specialized expression. Though it is entirely possible that additional UG/LH models could be detected under other stress conditions, the small number of UG/LH models specifically detected from the S Arrays (23 of 960, or 2.4%) suggests that specialized expression of UG/LH models alone may not account for the overall low detection rate of the UG/LH models.</p>
				<p>In a separate approach to verify UG model annotation, 589 UG models were randomly selected for a high throughput RT-PCR analysis. Overall, 196 (33.3%) of the selected UG models were cloned and sequence-confirmed from the same RNA samples used for the N Arrays (Figure <figr fid="F3">3a</figr> and Additional data file 3). Given that only 62% (49/79) of CG models were successfully cloned and sequence-confirmed in a control experiment, these results suggest that expression of approximately half (33% over 62%) of the UG models can be confirmed in our experimental conditions. Closer inspection of the confirmed UG transcripts showed that only 102 (52%) contain an identical ORF as predicted, whilst 94 (48%) exhibit different ORFs compared to the predictions (Figure <figr fid="F3">3a,c</figr>), suggesting that the gene structure of about half of the UG models need to be corrected or improved. Since the tiling microarrays used in this study have limited ability to pinpoint precise intron-exon junctions, transcript cloning and sequence analysis are still required to verify the annotated gene structures.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Cloning and sequence analysis of <it>japonica </it>chromosome 10 UG models and intergenic TARs</p>
					</caption>
					<text>
						<p>Cloning and sequence analysis of <it>japonica </it>chromosome 10 UG models and intergenic TARs. <b>(a) </b>Summary of RT-PCR analysis of selected UG models. ORF identical, annotated ORF is the same as determined from the cloned sequence; ORF different, annotated ORF is different from that in the cloned sequence. <b>(b) </b>Summary of RT-PCR analysis of selected intergenic TARs. Gene model, cloned TARs overlapping with TIGR models; BGF prediction, cloned TARs overlapping with BGF predictions; unique, cloned TARs not overlapping with any annotated feature. <b>(c) </b>Representative UG models whose cloned sequences either differ from (OsJN02936) or are the same as (OsJN03072) the annotated ones. <b>(d) </b>Representative intergenic TARs whose cloned sequences either overlap with a TIGR model (OsJN01855) or are completely intergenic (C10_ZN376). Representation of microarray data in this figure is the same as in Figure 2 except that the oligo index is omitted.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Identification and analysis of intergenic TARs</p>
				</st>
				<p>We found that 10.26% and 11.75% of the probes in the <it>japonica </it>and <it>indica </it>N Arrays were considered signal probes, respectively (Figure <figr fid="F1">1c</figr>). Approximately 55% and 15% of these signal probes were found to locate in the intergenic and intronic regions, respectively, of the TIGR <it>japonica</it>, BGI <it>japonica</it>, and BGI <it>indica </it>annotations. These results indicate that, irrespective of different annotations, significant transcriptional activities locate in the annotated intergenic regions. A sliding-window-based approach was used to systematically identify intergenic TARs (see Materials and methods). Through this analysis, 574 and 522 intergenic TARs in <it>indica </it>and <it>japonica </it>were identified from the N Arrays, respectively. In addition, 466 unique intergenic TARs were identified from the S Arrays, bringing the total number of <it>japonica </it>intergenic TARs to 988. These TARs have a cumulative length of approximately 700 Kb or 3% of the chromosome. The average length of the intergenic TARs was about 700 bp (Figure <figr fid="F4">4a</figr> and Additional data file 4).</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Analysis of intergenic TARs of <it>japonica </it>chromosome 10</p>
					</caption>
					<text>
						<p>Analysis of intergenic TARs of <it>japonica </it>chromosome 10. <b>(a) </b>The 988 <it>japonica </it>chromosome 10 intergenic TARs distributed by length. <b>(b) </b>RNA gel blotting analysis of selected <it>japonica </it>intergenic TARs. Probes for the intergenic TARs shown in this panel were derived from corresponding PCR-amplified TAR sequences from <it>japonica </it>rice genomic DNA. <b>(c) </b>Probes shown in this panel were derived from RT-PCR amplification of the corresponding TARs from poly(A)<sup>+ </sup>RNA. <b>(d) </b>The rice cDNAs for <it>eIF4A </it>and <it>actin2 </it>were used as loading controls. 5 &#956;g of RNA from the four sources - root, shoot, panicle, and suspension cell culture - that were used for probing tiling microarrays were used for RNA blot analysis here.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-4"/>
				</fig>
				<p>Several lines of evidence support the idea that the majority of intergenic TARs represent legitimate elements of the rice transcriptome. Sequence analysis revealed that 301 (55.0%) <it>indica </it>and 455 (46.0%) <it>japonica </it>intergenic TARs possess a significant coding capacity (more than 50 amino acids). Selected intergenic TARs were used as probes in RNA gel-blot analysis to confirm expression of these TARs. Overall, 26 out of 34 probes detected a discrete band, with tissue specificity, whereas the rest failed to detect any, suggesting that the majority of the intergenic TARs correspond to <it>in vivo </it>transcripts rather than being caused by cross hybridization (Figure <figr fid="F4">4b-d</figr>). A total of 280 intergenic TARs were selected for further analysis using an RT-PCR strategy designed to clone transcripts containing an intergenic TAR and its entire downstream (3') sequence (see Materials and methods and Additional data file 5). Of the 77 cloned transcripts whose sequences could be unambiguously confirmed, 37 overlap with existing gene models (Figure <figr fid="F3">3b,d</figr>), suggesting they are uncharacterized portions, such as 5' or 3' untranslated regions (UTRs), or splice variants of the neighboring gene models. The rest of the confirmed transcripts (40 out of 77) were located entirely in intergenic regions, suggesting that they likely represent independent novel transcriptional units (Figure <figr fid="F3">3b,d</figr>).</p>
				<p>To further characterize the 988 <it>japonica </it>intergenic TARs, they were aligned to the output of the rice gene finder BGF <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp> using the <it>japonica </it>chromosome 10 sequence, and 72 novel gene models were identified (Additional data file 1). Comparison with the cloned intergenic TARs showed that 23 of the 40 cloned novel transcripts (57.5%) were also predicted in the novel BGF models (Figure <figr fid="F3">3b</figr>), indicating that the BGF program was able to detect half of the potential novel genes represented by the intergenic TARs. However, the incomplete nature of the 17 unaccounted transcripts (Figure <figr fid="F3">3b</figr>) made it difficult to unambiguously determine whether they encode proteins.</p>
			</sec>
			<sec>
				<st>
					<p>Tiling microarray-based gene model comparison and integration</p>
				</st>
				<p>The TIGR model set contained 200-250 more gene models than the BGI sets (Table <tblr tid="T1">1</tblr>). These extra models were evenly distributed into HH and LH models (Figure <figr fid="F5">5a</figr>). The TIGR/HH models showed a similar array-detection rate, while the TIGR/LH models were detected at a lower rate (but of a similar number) in comparison with the two BGI sets (Figure <figr fid="F5">5a</figr>). This result suggests that the extra TIGR/LH models may be of low confidence and need to be further examined. Comparison of the BGI and TIGR <it>japonica </it>models indicates that there were 2323 (84.0%) and 2488 (82.4%) common to each annotation, respectively, based on ORF sequence overlaps (Additional data file 6). Meanwhile, 441 (16.1%) BGI models and 531 (17.6%) TIGR models were regarded as unique to each annotation (Additional data file 6). Naturally, the common models are more reliable, and were consequently enriched with expression- or homology-supported models. For example, only 64.5% of the unique TIGR models were detected by tiling microarrays. However, expression of 363 of the unique BGI models was confirmed by tiling array and/or cDNA and EST alignment, indicating that they are part of the <it>japonica </it>chromosome 10 transcriptome (Figure <figr fid="F5">5b</figr>).</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Comparison and integration of chromosome 10 gene models</p>
					</caption>
					<text>
						<p>Comparison and integration of chromosome 10 gene models. <b>(a) </b>Number of annotated and array-detected high homology (HH) and low homology (LH) models in the BGI <it>indica</it>, BGI <it>japonica</it>, and TIGR <it>japonica </it>annotations. <b>(b) </b>The 549 new gene models were combined with the 3,019 TIGR models. Origins of the new models are shown on the left. Expression support for the TIGR models is shown on the right. Expressed, models matching full-length cDNA/EST; array-detected, models not supported by the expressed sequences but detected by microarray; undetected, models neither supported by expressed sequences nor detected by microarray. <b>(c) </b>Classification of integrated <it>japonica </it>chromosome 10 gene models based on tiling array detection and exon number (left), homology to <it>Arabidopsis </it>genes (middle), and previous expression or homology support to the models (right).</p>
					</text>
					<graphic file="gb-2005-6-6-r52-5"/>
				</fig>
				<p>The <it>indica </it>gene models were more evenly distributed along the chromosome, and the number and distribution of array-detected models was similar to that of <it>japonica </it>(Figure <figr fid="F6">6a-c</figr>). Exceptions were noted in certain regions, such as at approximately10 Mb, where <it>indica </it>models showed increased array detection rates. Such a disparity is likely to be caused by the skewed distance between corresponding <it>japonica</it>/<it>indica </it>model pairs (see below). Comparative gene model mapping indicates that 97.6% of the <it>japonica </it>chromosome10 CG/HH models had their counterparts in <it>indica</it>, while 98.3% of the <it>indica </it>CG/HH models were mapped to <it>japonica </it>(Additional data file 6 and data not shown). As the full-length cDNAs were derived from <it>japonica </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, this result suggests that roughly 2% of either genome sequence was erroneous or incomplete, thereby disrupting the integrity of the affected genes such that they could not be recognized. However, only 85.3% and 88.1% of <it>japonica </it>and <it>indica </it>UG/LH models could be mapped to their reciprocal genomes. These results indicate that the unmapped UG models between <it>japonica </it>and <it>indica </it>were common but not recognized in the reciprocal genomes, or subspecies specific, or false predictions. Thus, identification of the first group of models would facilitate a better recognition of the transcriptome of both genomes. Indeed, 2,640 <it>indica </it>models were mapped to <it>japonica </it>chromosome 10 (Additional data file 7). Among those mapped <it>indica </it>models, 114 were detected by tiling array, with corresponding genome sequences that were more than 95% identical to that of <it>japonica </it>chromosome 10, but were not annotated in <it>japonica</it>. These results suggest that the counterparts of these 114 <it>indica </it>models may exist in the <it>japonica </it>chromosome 10 transcriptome (Figure <figr fid="F5">5b</figr>).</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Rice chromosome 10 gene model distribution and expression</p>
					</caption>
					<text>
						<p>Rice chromosome 10 gene model distribution and expression. <b>(a) </b>Characterization of TIGR nonredundant protein-coding gene models. Model density, array detection rate, number of signal oligos, number of intergenic TARs, and cumulative length (in kilobases) of masked oligos are calculated in 100-kb windows along the length of chromosome 10, and are represented by color-coded vertical bars. A scale representing the physical length of chromosome 10 is shown at the bottom of the panel. The arrowhead delimits the division of domain I and domain II as indicated in the text. Note that the centromere is located at a position around 7 to 8 Mb in chromosome 10. <b>(b) </b>Gene model density and array detection rate of the BGI <it>japonica </it>annotation. <b>(c) </b>Gene model density and array detection rate of the BGI <it>indica </it>annotation. <b>(d) </b>Comparison of the S Arrays and the N Arrays using the BGI <it>japonica </it>annotation. Log<sub>2 </sub>(S/N) of the hybridization intensity was calculated for individual models (top) and the mean intensity of all models in 100-kb windows along the length of chromosome 10 (bottom).</p>
					</text>
					<graphic file="gb-2005-6-6-r52-6"/>
				</fig>
				<p>To provide a comprehensive representation of the <it>japonica </it>chromosome 10 transcriptome, the 549 new models, including 363 BGI <it>japonica </it>models, 114 BGI <it>indica </it>models, and 72 novel BGF models (see above), were integrated with the TIGR <it>japonica </it>gene models (Figure <figr fid="F5">5b</figr>). The resulting 3,568 nonredundant protein-coding gene models, including the 3,019 TIGR models, represent an 18% increase in the annotated coding capacity of <it>japonica </it>chromosome 10 (Figure <figr fid="F5">5b</figr>). The integrated models included 3005 (84.2%) that were detected by tiling arrays, of which, 1,120 (31.4%) were not previously supported by expression data or homology. Thus, 3,255 (91.2%) models in the integrated set now have at least one piece of supporting evidence (for example, expressed sequences, homology, or tiling microarray) (Figure <figr fid="F5">5c</figr>). Classification of the array-detected and undetected models, based on exon number, homology to <it>Arabidopsis </it>genes, and previous supporting evidence, indicates that detection by our tiling microarray was not biased regarding gene structure and was in general agreement with all other annotation information (Figure <figr fid="F5">5c</figr>). These results demonstrate tiling microarray analysis as a useful platform to validate and incorporate information from multiple sources to fully identify the rice transcriptome.</p>
			</sec>
			<sec>
				<st>
					<p>Heterochromatin-associated regulation of chromosome-wide transcriptional activity</p>
				</st>
				<p>We applied the tiling microarrays to study chromosomal position effects on gene expression. As shown in Figure <figr fid="F6">6</figr>, chromosome-wide gene model distribution and expression suggests that chromosome 10 can be divided into two roughly equal-sized domains, with domain I consisting of the short arm and the proximal end of the long arm, while domain II encompasses the rest of the chromosome. This division was based on transcriptional profiles of the two domains, as revealed by tiling microarray analysis (Figure <figr fid="F6">6</figr>). Domain II had a higher density of nonredundant gene models (Figure <figr fid="F7">7a</figr>). Under normal growth conditions (the N Arrays), it also contained more signal oligos and more array-detected models and thus was more transcriptionally active relative to domain I (Figure <figr fid="F6">6</figr>). Such a distinction between the two domains was further supported by the higher number of CG models in domain II, which are presumably highly expressed (Figure <figr fid="F7">7b</figr>). Interestingly, although only a small number of gene models were specifically detected from the S Arrays (see above), overall transcriptional activity in domain I was elevated under the examined stress conditions (Figure <figr fid="F6">6d</figr>). The activation was observed both at the individual gene model level and in 100 kb windows across domain I (Figure <figr fid="F6">6d</figr>). Such a general derepression of transcription under stress conditions may imply another layer of gene regulation at the chromosomal level in rice.</p>
				<fig id="F7">
					<title>
						<p>Figure 7</p>
					</title>
					<caption>
						<p>Chromosome-wide distribution of gene models and chromosomal elements</p>
					</caption>
					<text>
						<p>Chromosome-wide distribution of gene models and chromosomal elements. <b>(a) </b>Distribution of TIGR <it>japonica </it>nonredundant protein-coding gene models (non-TE) and transposable element-related models (TE) in 1-Mb windows across chromosome 10. The division between domain I and II is indicated by the arrowhead. Note that the centromere is located at around 7 to 8 Mb in chromosome 10. <b>(b) </b>Distribution of BGI <it>japonica </it>CG and UG models in 1-Mb windows across chromosome 10. <b>(c) </b>Distribution of BGI <it>japonica </it>HH and LH models in 1-Mb windows across chromosome 10. <b>(d) </b>Numbers of the TIGR <it>japonica </it>nonredundant protein-coding gene models (TIGR Non-TE) and tiling array-detected intergenic TARs in 1-Mb windows across chromosome 10.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-7"/>
				</fig>
				<p>The observed transcriptional profiles of the two domains were associated with several architectural features of the chromosome. In general, domain I was more enriched with TE and LH models (Figure <figr fid="F7">7a,c</figr>). Domain I also harbored more repetitive sequence, as was evident from the greater number of oligos masked during array design (Figure <figr fid="F6">6a</figr>). To further examine the two domains, colinearity of the CG models in chromosome 10 of <it>japonica </it>and <it>indica </it>rice was calculated. Mapping chromosomal positions of corresponding orthologous CG model pairs along chromosome 10 of <it>japonica </it>(blue) and <it>indica </it>(red) against the sequential orders of the CG pairs resulted in two apparently smooth parallel curves (Figure <figr fid="F8">8a</figr>). This observation indicates that the order of CG models is well preserved between chromosome 10 of <it>japonica </it>and <it>indica </it>rice. However, calculation of the physical distance between corresponding <it>japonica </it>and <it>indica </it>CG models along the chromosome indicated that the positions of the CG models were more skewed in domain I, with many CG models shuffled more than 1 Mb away from their orthologous counterparts in the reciprocal chromosome (Figure <figr fid="F8">8b</figr>).</p>
				<fig id="F8">
					<title>
						<p>Figure 8</p>
					</title>
					<caption>
						<p>Colinearity of the CG models for chromosome 10 in <it>japonica </it>and <it>indica </it>rice</p>
					</caption>
					<text>
						<p>Colinearity of the CG models for chromosome 10 in <it>japonica </it>and <it>indica </it>rice. <b>(a) </b>Chromosomal positions of corresponding CG model pairs along chromosome 10 in <it>japonica </it>(blue) and <it>indica </it>(red) rice are plotted against the sequential orders of the CG pairs. <b>(b) </b>Physical distance between corresponding CG pairs is plotted against their sequential orders along the chromosome.</p>
					</text>
					<graphic file="gb-2005-6-6-r52-8"/>
				</fig>
				<p>These results coincide with cytological data showing that domain I is primarily heterochromatin, whereas domain II is primarily euchromatin <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B33">33</abbr></abbrgrp>. Although it remains to be seen whether the phenomena mentioned above are general features associated with the division of heterochromatin and euchromatin in rice, these results collectively indicate that the heterochromatic domain of chromosome 10 is more evolutionarily active and compositionally dynamic. Our results further indicate that the genomic characteristics of the heterochromatin domain are associated with its transcriptional activities (Figure <figr fid="F6">6</figr>).</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>Sequencing of the rice genome provides a cornerstone to understand the biology of this agriculturally important crop <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. A first step in fully realizing the potential of available genome sequence is to understand its coding information and expression; however, current annotated gene models and other functional elements of a genome by and large represent hypotheses that must be experimentally tested and validated. Importantly, approximately 20,000 predicted rice genes exhibit no recognizable sequence homology to genes in other organisms, especially <it>Arabidopsis</it>, the first model plant sequenced <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. The unusual compositional and structural features, as well as the lack of EST coverage for a large number of novel genes, require high-throughput experimental means that are not limited by the current annotations.</p>
			<sec>
				<st>
					<p>Identification of the rice chromosome 10 transcriptome by tiling microarrays</p>
				</st>
				<p>In this study, we developed whole-chromosome oligonucleotide tiling microarrays, and demonstrated their utility in experimentally identifying the transcriptome of both <it>japonica </it>and <it>indica </it>chromosome 10. Because oligonucleotide tiling microarrays provide unbiased end-to-end coverage of the entire chromosome and measure transcriptional activity of gene models from multiple independent probes (Figure <figr fid="F2">2</figr>), they can detect the transcriptome in a comprehensive and unbiased way <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. The tiling microarray analysis of rice chromosome 10 detected transcription of 86.4% BGI <it>indica </it>(2,428/2,809), 83.9% BGI <it>japonica </it>(2,319/2,764), and 81.9% TIGR <it>japonica </it>(2,472/3,019) gene models (Table <tblr tid="T1">1</tblr>). Using a set of the least reliable gene models (UG models, see below), RT-PCR analysis revealed disparity in gene structure of close to 50% of these models (Figure <figr fid="F3">3</figr>). These results are consistent with previous assessments of current computational gene finders, which can reliably locate a gene model in the correct chromosome locus, but are less than satisfactory to predict the fine gene structure <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>.</p>
				<p>Based on alignment to rice full-length cDNA and EST sequences, the gene models for both <it>japonica </it>and <it>indica </it>chromosome 10 were classified as UG, EG, and CG models (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F2">2</figr>). This classification places the gene models in three groups with an ascending order of confidence, because the presence of an expressed sequence provides strong support to the corresponding model. In keeping with this idea, these three classes of gene models were also detected by tiling microarrays in an ascending order (Table <tblr tid="T1">1</tblr>). This result, together with the high detection rate of CG models, suggests that the chromosome 10 transcriptomes identified by the tiling microarrays are rather exhaustive. In support of this conclusion, tiling array analysis of rice seedlings which had undergone severe stress treatments only identified an additional 39 (less than 1.7% of the total detected) models. These results likely can be attributed to the high sensitivity of the tiling microarrays such that even if activation of certain genes is conditional, the basal level transcripts could still be detected by the tiling microarray.</p>
				<p>Therefore, the UG models (particularly UG/LH) that failed to be detected by the tiling microarray need to be more closely inspected (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F3">3</figr>). We did find that the gene models specifically detected following the stress treatments were enriched with UG/LH models (23/39), suggesting that some UG/LH might be stress responsive and their expression is not readily detectable under normal conditions. It should be noted that though redundant gene models such as those derived from long terminal repeat (LTR) retrotransposons and Pack-MULEs are generally under-represented in the expressed sequence collections <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B39">39</abbr></abbrgrp>, many are stress responsive and share similar <it>cis</it>-elements with plant defense genes <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Thus, it cannot be ruled out that some of the UG/LH models are related to low copy number retrotransposons with unusual structures.</p>
				<p>Reasoning that the tiling microarray-detected transcriptome is both exhaustive and reliable, tiling microarray-supported gene models were mapped and integrated. This analysis identified 363 unique BGI <it>japonica</it>, 114 unique BGI <it>indica</it>, and 72 novel models that could be integrated into the TIGR <it>japonica </it>gene model set to comprehensively represent the <it>japonica </it>chromosome 10 transcriptome (Figure <figr fid="F5">5</figr>). Note that the added gene models do not necessarily increase the number of <it>japonica </it>chromosome 10 genes, even if their transcription was detected. As elaborated above, some of these gene models could be unrecognized TEs, uncharacterized UTRs or alternative exons. However, as all these extra gene models are transcribed, their identification will not only better represent the transcriptome, but further examination of these elements will also yield insight into rice genome composition and structure.</p>
				<p>Extensive antisense transcription was observed for the rice chromosome 10 gene models. For instance, in a preliminary analysis whereby regions of the antisense strand covering the 3,019 TIGR <it>japonica </it>gene models were examined, excluding those that contain less than three signal oligos, 591 (19.6%) were found to have antisense expression. The proportion of rice gene models showing antisense transcription is consistent with that reported from tiling microarray analyses in <it>Arabidopsis </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and human <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, adding to an increasing body of evidence that indicates antisense transcription as an inherent property of the genomes. However, it should be cautioned that the potential effects of several experimental artifacts such as unintended second-strand synthesis, formation of specific RNA-DNA hybrids, or spurious priming events during target preparation have to be precisely assessed before a final conclusion on the nature and extent of antisense transcription in rice can be drawn.</p>
				<p>Transcriptional activities outside the annotated gene models in the form of intergenic TARs, accounted for approximately 3% of the chromosome size (Figure <figr fid="F4">4a</figr>). RNA gel blotting and RT-PCR analyses confirmed only a portion of the selected TARs (Figure <figr fid="F3">3</figr>, <figr fid="F4">4</figr>), suggesting that the unconfirmed TARs could be experimental artifacts or correspond to transcripts of extreme low abundance <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr></abbrgrp>. Transcriptome components outside of previously annotated gene models are expected to correspond to: novel genes with unusual sequence composition; under-represented UTRs or exons of splice variants; nonprotein coding RNA transcripts; or uncharacterized transcribed TEs. RT-PCR analysis of selected <it>japonica </it>intergenic TARs suggests that the majority of the TARs belong to the first two groups (Figure <figr fid="F3">3b</figr>). This conclusion is consistent with the observation that the intergenic TARs were slightly enriched in regions of the chromosome with lower gene density (Figure <figr fid="F7">7d</figr>). A preliminary analysis whereby 214 plant miRNAs (including 122 from rice and 92 from <it>Arabidopsis</it>) <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp> were used in a BLAST search against the intergenic TARs revealed no significant hits, suggesting that the TARs do not contain known plant microRNAs.</p>
				<p>We thus focused our efforts on further analyzing the first two groups of TARs. For the current rice annotation, five different gene finders (primarily FGENESH) were used to generate gene models <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. To annotate the intergenic TARs, we used the relatively new rice gene-finder program BGF <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp>, which identified 72 novel gene models (Figure <figr fid="F5">5</figr>). Sequence comparison between the 40 cloned intergenic TAR transcripts and the novel BGF models showed that 23 (57.5%) were predicted (Figure <figr fid="F3">3b</figr>), indicating that the BGF program was able to detect slightly more than half of the novel transcriptional units that might be represented by the intergenic TARs. Extrapolation from these observations suggests that there might be up to 2,000 novel genes yet to be recognized by current rice gene finders; however, the incomplete nature of the cloned transcripts made it difficult to unambiguously determine whether they encode proteins. Thus, it is possible that some of these transcripts may correspond to noncoding RNAs.</p>
			</sec>
			<sec>
				<st>
					<p>Association of chromosomal architecture with transcriptional activity</p>
				</st>
				<p>Eukaryotic genomes contain heterochromatin as cytologically intensely staining nuclear materials that are thought to be composed mainly of noncoding DNA and silent transposons <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B43">43</abbr></abbrgrp>. A salient feature of rice chromosome 10 is that its heterochromatin is not limited to the pericentric regions, but includes the entire short arm as well as the proximal portion of the long arm <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Comparison of cytological and sequence data suggests that this heterochromatin region is roughly 11-12 Mb in length <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B33">33</abbr></abbrgrp>. Although recent genetic and microarray studies in plants have indicated a role for gene regulation by well defined small heterochromatin regions <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>, virtually no data are available regarding the association of transcriptional activity with large-scale heterochromatin domains in regulating gene expression, chromosome behavior, and genome evolution.</p>
				<p>Profiling the transcriptional activities of rice chromosome 10 using tiling microarrays revealed that gene expression in the heterochromatin region is generally low under normal growth conditions (the N Arrays) relative to the euchromatin (Figure <figr fid="F6">6a-c</figr>). Consistent with this observation, gene model distribution showed that the heterochromatin domain is relatively low in CG models but more abundant in UG models (Figure <figr fid="F7">7b</figr>). In support of the cytological data, an enrichment of TE models in the heterochromatin domain is evident (Figure <figr fid="F7">7a</figr>) <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Exclusion of the high copy number TEs and repetitive sequences from the tiling microarray analysis might contribute to the lower gene model density in the heterochromatin (Figure <figr fid="F7">7a-c</figr>); however, the generally lower detection rate of gene expression indicates that expression of many non-TE models is also somewhat repressed (Figure <figr fid="F7">7a-c</figr>). Interestingly, when plants were subjected to mineral or nutrient stresses, a general activation of transcription was observed in the heterochromatin (Figure <figr fid="F6">6d</figr>). These results are consistent with findings that heterochromatin stability and heterochromatin-mediated gene silencing can be regulated by development <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp> or by modulating levels of specific transcription factors <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
				<p>The distribution of TE and non-TE gene models in the heterochromatic and euchromatic regions was a near mirror image (Figure <figr fid="F7">7a</figr>). This result suggests that the heterochromatin and euchromatin may have similar capacities to accommodate protein-coding gene models (TE and non-TE), even though the heterochromatin is enriched with repetitive sequences (Figure <figr fid="F6">6a</figr>) <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Furthermore, the heterochromatin is relatively enriched with LH models and low in CG models compared with the euchromatin (Figure <figr fid="F7">7b, c</figr>). Thus it is likely that the differential packaging of genome elements in heterochromatin and euchromatin might enable rice to regulate and coordinate gene expression at the chromosomal level. Although the underlying molecular mechanism of this regulation is currently unknown, DNA methylation, histone modifications, and small interfering RNAs have all been implicated <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>.</p>
				<p>The distance between corresponding <it>japonica </it>and <it>indica </it>CG models along the chromosome was more skewed in the heterochromatin, with many CG genes shuffled more than 1 Mb in physical distance from the location of their orthologous counterparts. In contrast, the gene distance in the euchromatin is largely homogeneous (Figure <figr fid="F8">8</figr>). Previous studies have shown a mosaic organization of grass genomes where conserved sequences are disrupted by nonconserved sequences, and that gene amplification, movement, and activity of retrotransposons account for the bulk of the interspersing nonconserved sequences <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. Thus, these results collectively indicate that the heterochromatin domain is more evolutionarily active and compositionally dynamic. Such a conclusion is in keeping with the genomic stress hypothesis that TEs are involved in host adaptation to environmental changes <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B59">59</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Plant materials and treatments</p>
				</st>
				<p><it>Oryza sativa </it>ssp. <it>japonica </it>cv. Nipponbare and <it>Oryza sativa </it>ssp. <it>indica </it>cv. <it>93-11 </it>were used for all experiments. Seeds were surface-sterilized, imbibed at 37&#176;C for 2 days, and then transferred to MS medium (Invitrogen) solidified with 0.8% (w/v) agar. Seedlings were kept under continuous light at 28&#176;C for seven days before harvest for total RNA isolation. Alternatively, 7-day-old seedlings were transferred to soil and maintained under long-day conditions (16 h light/8 h dark) at 26-28&#176;C in the greenhouse until flowering. Heading and filling stage panicles were then collected from these plants. Suspension-cultured cells were prepared and maintained as previously described <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. For stress treatment, <it>japonica </it>seedlings were grown for seven days on MS medium under four different conditions: MS medium deprived of nitrogen; MS medium deprived of phosphorus, or supplemented with 150 mM NaCl or 100 &#956;M CdSO<sub>4</sub>. For RNA isolation, plant materials were frozen in liquid nitrogen and homogenized. Total RNA and mRNA were isolated using the RNeasy Plant Mini kit (Qiagen) and the Oligotex mRNA kit (Qiagen) according to the manufacturer's recommendations, respectively.</p>
			</sec>
			<sec>
				<st>
					<p>MAS microarray design, production, and hybridization</p>
				</st>
				<p>Based on the MAS platform, a minimal tiling strategy was designed to effectively represent the nonrepetitive sequences of rice chromosome 10 <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr></abbrgrp>. Briefly, 36-mer oligonucleotides were designed using an algorithm based on sequence-dependent factors such as length, extent of complementarity, and the overall base composition. Oligos that could form a stem-loop structure with stem length greater than seven bases and those that have an oligo index score greater than 5 were excluded. To calculate the index score for each oligo, the 20 possible consecutive 17-mer sequences within each oligo were searched against the whole genome. The average copy number of the 17-mer sequences was scored as the oligo index. MAS microarray production was performed as previously described <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B29">29</abbr></abbrgrp> using the sequences of chromosome 10 for <it>japonica </it>and <it>indica </it>rice as were available on 12 April, 2004 <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and 1 August, 2003 <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp>, respectively. Oligos were synthesized at a density of 389,000 oligos per array in a chessboard design wherein each positive feature, which contains an interrogating oligo, was surrounded by four negative features and vice versa.</p>
				<p>The <it>japonica </it>and <it>indica </it>N Arrays both included four individual MAS arrays that contain oligos representing other portions of the genome (other than chromosome 10) not analyzed in the current study. The N Arrays were hybridized to cDNA target mixtures derived in equal amounts from seedling roots, seedling shoots, panicles, and suspension-cultured cells of both <it>japonica </it>(cv. Nipponbare) and <it>indica </it>(cv. <it>93-11</it>) rice. Additionally, a set of two <it>japonica </it>arrays (S Arrays) were hybridized to targets derived from pooled poly(A)<sup>+ </sup>RNA isolated from leaves of stress-treated <it>japonica </it>seedlings. Target preparation, array hybridization, and hybridization intensity value acquisition were carried out as previously described <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B29">29</abbr><abbr bid="B61">61</abbr></abbrgrp>. Tiling microarray design and experimental data are available in the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus under series GSE2500.</p>
			</sec>
			<sec>
				<st>
					<p>Chromosome 10 gene model compilation</p>
				</st>
				<p>The <it>japonica </it>(TIGR Rice Pseudomolecule released on 12 April 2004) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and <it>indica </it>(released by BGI on 1 August 2003) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B30">30</abbr></abbrgrp> chromosome 10 annotations were used in this study. In addition, the <it>japonica </it>chromosome 10 sequence was annotated using the BGI gene prediction flow to generate the BGI <it>japonica </it>gene model set. All gene models were aligned to a collection of rice full-length cDNA sequences <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and all available rice EST sequences in GenBank <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> as of 15 April 2004 by the BLAT program <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> using cutoff criteria of 100 bp overlap and 90% identity over the entire length of each match. The predicted genes without matches to cDNA and EST sequences, excluding those with coding capacities of less than 50 amino acids, were classified as UG models.</p>
			</sec>
			<sec>
				<st>
					<p>Determination of gene model expression and identification of intergenic TARs</p>
				</st>
				<p>Hybridization intensity of all positive and all negative features within each array was plotted separately and then scaled to have a peak log<sub>2 </sub>intensity of 8.0 (Figure <figr fid="F1">1a,b</figr>). Signal and noise probe determination is shown in Figure <figr fid="F1">1c</figr> and discussed in main text. Expression level of a given gene model was represented by the value of hybridization intensity (<it>HI</it>) of this model locus that takes into account two parameters: <it>FI</it>, which is the mean of fluorescence intensity of all signal probes of a given gene model, and hybridization rate (<it>HR</it>), which is defined as the percentage of signal probes over total interrogating probes per kilobase of genomic sequence. <it>HI </it>is calculated using the formula <it>HI </it>= <it>FI </it>+ <it>FI </it>&#215; (<it>HR</it><sub><it>E </it></sub>- <it>HR</it><sub><it>M</it></sub>) in which <it>HR</it><sub><it>E </it></sub>is <it>HR </it>of the exon regions whilst <it>HR</it><sub><it>M </it></sub>is the mean <it>HR </it>of all intron regions. <it>HI </it>value of each model was then compared against a threshold designated as the mean fluorescence intensity plus twice the standard deviation (95% confidence) of all noise probes within each array.</p>
				<p>To identify intergenic TARs, HR was calculated in a sliding window of 500 nucleotides across the intergenic regions of chromosome 10 with a bandwidth equal to an interrogating probe. Windows with HR above a threshold of 0.4 were considered positive. Contiguously transcribed regions (TARs) were generated by joining overlapping positive windows that were delineated by the 5' probe of the first window and 3' probe of the last. TARs less than 220 bp (five consecutive probes) long were discarded. The <it>japonica </it>intergenic TARs were first identified using the BGI <it>japonica </it>annotation, followed by comparison with TIGR models. TARs overlapping with TIGR models were masked. Sequences of all retained intergenic TARs were aligned to the BGF gene predictions, and were used to BLASTX search the nonredundant protein database SWISS-PROT. Those BGF-predicted genes that overlap more than 100 bp with the sequence of intergenic TARs on the same strand of DNA were considered positive.</p>
			</sec>
			<sec>
				<st>
					<p>Cloning and verification of UG models and intergenic TARs</p>
				</st>
				<p>Selected UG models were cloned by means of RT-PCR. The PCR products were cloned into the pGEM-T vector (Promega) and sequenced. To clone intergenic TARs with downstream sequence, reverse transcription was performed on mixed poly(A)<sup>+ </sup>RNA derived from seedling roots, seedling shoots, panicles and suspension-cultured cells of <it>japonica </it>rice using the primer RT-CPK (5'-TGCAGTCTAGCTGGAATGACCTCATTGCAGAAT<sub>24</sub>). The PCR procedure to clone the TARs was carried out using a cascade of thermal asymmetric interlaced PCR cycles <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> that employ three consecutively nested gene-specific primers to pair with primer RT-1 (5'-GCAGTCTAGCTGGAAT), RT-2 (5'-CTGGAATGACCTCATT), and RT-3 (5'-GCTGGAATGACCTCATTGCAGAAT), which anneal to overlapping regions of RT-CPK. Sequences of all the cloned PCR products were aligned back to <it>japonica </it>chromosome 10 using BLAT <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> to confirm their identify and to map their corresponding gene structure. RNA gel-blot analysis of intergenic TARs was conducted as previously described <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Integration of <it>japonica </it>chromosome 10 gene models</p>
				</st>
				<p>All <it>japonica </it>chromosome 10 related gene models were sorted, and only those that met certain criteria were retained. The TIGR nonredundant gene models that can be mapped to the <it>japonica </it>chromosome 10 sequence were all retained. The additional models included BGI <it>japonica</it>, BGI <it>indica </it>models mapped to <it>japonica </it>chromosome 10, and tiling array-derived novel BGF models. From these models, those without previous full-length cDNA/EST or tiling microarray support, or those overlapping with TIGR models were discarded. All retained models were aligned back to the <it>japonica </it>chromosome 10 sequences to further confirm their identities and were combined with the TIGR <it>japonica </it>models.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data files are available with the online verison of this paper. Additional data file <supplr sid="S1">1</supplr> contains a table of integrated <it>japonica </it>chromosome 10 nonredundant gene models. Additional data file <supplr sid="S2">2</supplr> contains a table of <it>indica </it>chromosome 10 nonredundant gene models. Additional data file <supplr sid="S3">3</supplr> contains a table of the sequence analysis of cloned UG models. Additional data file <supplr sid="S4">4</supplr> contains <it>japonica </it>chromosome 10 intergenic TARs. Additional data file <supplr sid="S5">5</supplr> contains the sequence analysis of cloned intergenic TARs. Additional data file <supplr sid="S6">6</supplr> contains a comparison of BGI and TIGR <it>japonica </it>chromosome 10 gene models. Additional data file <supplr sid="S7">7</supplr> contains a comparison of BGI <it>indica </it>and <it>japonica </it>chromosome 10 gene models.</p>
			<suppl id="S1">
				<title>
					<p>Additional File 1</p>
				</title>
				<caption>
					<p>Table S1. Integrated <it>japonica </it>chromosome 10 nonredundant gene models</p>
				</caption>
				<text>
					<p>Table S1. Integrated <it>japonica </it>chromosome 10 nonredundant gene models. Integrated <it>japonica </it>chromosome 10 nonredundant gene models.</p>
				</text>
				<file name="gb-2005-6-6-r52-S1.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional File 2</p>
				</title>
				<caption>
					<p>Table S2: <it>Indica </it>chromosome 10 nonredundant gene models</p>
				</caption>
				<text>
					<p>Table S2: <it>Indica </it>chromosome 10 nonredundant gene models. <it>Indica </it>chromosome 10 nonredundant gene models.</p>
				</text>
				<file name="gb-2005-6-6-r52-S2.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional File 3</p>
				</title>
				<caption>
					<p>Table S3: Sequence analysis of cloned UG models. Sequence analysis of cloned UG models</p>
				</caption>
				<text>
					<p>Table S3: Sequence analysis of cloned UG models. Sequence analysis of cloned UG models.</p>
				</text>
				<file name="gb-2005-6-6-r52-S3.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S4">
				<title>
					<p>Additional File 4</p>
				</title>
				<caption>
					<p>Table S4: <it>Japonica </it>chromosome 10 intergenic TARs</p>
				</caption>
				<text>
					<p>Table S4: <it>Japonica </it>chromosome 10 intergenic TARs. <it>Japonica </it>chromosome 10 intergenic TARs.</p>
				</text>
				<file name="gb-2005-6-6-r52-S4.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S5">
				<title>
					<p>Additional File 5</p>
				</title>
				<caption>
					<p>Table S5: Sequence analysis of cloned intergenic TARs</p>
				</caption>
				<text>
					<p>Table S5: Sequence analysis of cloned intergenic TARs. Sequence analysis of cloned intergenic TARs.</p>
				</text>
				<file name="gb-2005-6-6-r52-S5.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S6">
				<title>
					<p>Additional File 6</p>
				</title>
				<caption>
					<p>Table S6: Comparison of BGI and TIGR <it>japonica </it>chromosome 10 gene models</p>
				</caption>
				<text>
					<p>Table S6: Comparison of BGI and TIGR <it>japonica </it>chromosome 10 gene models. Comparison of BGI and TIGR <it>japonica </it>chromosome 10 gene models.</p>
				</text>
				<file name="gb-2005-6-6-r52-S6.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S7">
				<title>
					<p>Additional File 7</p>
				</title>
				<caption>
					<p>Table S7: Comparison of BGI <it>indica </it>and <it>japonica </it>chromosome 10 gene models</p>
				</caption>
				<text>
					<p>Table S7: Comparison of BGI <it>indica </it>and <it>japonica </it>chromosome 10 gene models. Comparison of BGI <it>indica </it>and <it>japonica </it>chromosome 10 gene models.</p>
				</text>
				<file name="gb-2005-6-6-r52-S7.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank Jessica Habashi for critical reading of the manuscript. The rice tiling microarray project at Yale University was supported by a grant from the NSF Plant Genome Program (DBI-0421675). The collaborative research effort in China was supported by the 863-rice functional genomics program from the Ministry of Science and Technology of China, and by the National Institute of Biological Sciences at Beijing. L.L. was initially supported by a Yale University Brown postdoctoral fellowship.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>A draft sequence of the rice genome (<it>Oryza sativa </it>L. <it>ssp japonica</it>).</p>
				</title>
				<aug>
					<au>
						<snm>Goff</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Ricke</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lan</snm>
						<fnm>TH</fnm>
					</au>
					<au>
						<snm>Presting</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Dunn</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Glazebrook</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sessions</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Oeller</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Varma</snm>
						<fnm>H</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>296</volume>
				<fpage>92</fpage>
				<lpage>100</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1068275</pubid>
						<pubid idtype="pmpid" link="fulltext">11935018</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>A draft sequence of the rice genome (<it>Oryza sativa </it>L. ssp. <it>indica</it>).</p>
				</title>
				<aug>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Deng</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Zhou</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>X</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>296</volume>
				<fpage>79</fpage>
				<lpage>92</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1068037</pubid>
						<pubid idtype="pmpid" link="fulltext">11935017</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Sequence and analysis of rice chromosome 4.</p>
				</title>
				<aug>
					<au>
						<snm>Feng</snm>
						<fnm>Q</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Hao</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Fu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Hu</snm>
						<fnm>X</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<fpage>316</fpage>
				<lpage>320</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01183</pubid>
						<pubid idtype="pmpid" link="fulltext">12447439</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>The genome sequence and structure of rice chromosome 1.</p>
				</title>
				<aug>
					<au>
						<snm>Sasaki</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Matsumoto</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Yamamoto</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Sakata</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Baba</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Katayose</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Niimura</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Nagamura</snm>
						<fnm>Y</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<fpage>312</fpage>
				<lpage>316</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01184</pubid>
						<pubid idtype="pmpid" link="fulltext">12447438</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>In-depth view of structure, activity, and evolution of rice chromosome 10.</p>
				</title>
				<aug>
					<au>
						<cnm>The Rice Chromosome 10 Sequencing Consortium</cnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>300</volume>
				<fpage>1566</fpage>
				<lpage>1569</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1083523</pubid>
						<pubid idtype="pmpid" link="fulltext">12791992</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>The genomes of <it>Oryza sativa</it>: a history of duplications.</p>
				</title>
				<aug>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Zhou</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ni</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Dong</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Hu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Zeng</snm>
						<fnm>C</fnm>
					</au>
					<etal/>
				</aug>
				<source>PLoS Biol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>e38</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">546038</pubid>
						<pubid idtype="pmpid" link="fulltext">15685292</pubid>
						<pubid idtype="doi">10.1371/journal.pbio.0030038</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>IRGSP releases the assembled rice genome sequences</p>
				</title>
				<url>http://rgp.dna.affrc.go.jp/IRGSP/Build2/build2.html</url>
			</bibl>
			<bibl id="B8">
				<title>
					<p>TIGR Rice Genome Annotation</p>
				</title>
				<url>http://www.tigr.org/tdb/e2k1/osa1/pseudomolecules/info.shtml</url>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Compositional gradients in <it>Gramineae </it>genes</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tao</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Tan</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Passey</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>851</fpage>
				<lpage>856</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12045139</pubid>
						<pubid idtype="doi">10.1101/gr.189102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Consistent over-estimation of gene number in complex plant genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Bennetzen</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Coleman</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ma</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ramakrishna</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Curr Opin Plant Biol</source>
				<pubdate>2004</pubdate>
				<volume>7</volume>
				<fpage>732</fpage>
				<lpage>736</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.pbi.2004.09.003</pubid>
						<pubid idtype="pmpid" link="fulltext">15491923</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>The new genes of rice: a closer look.</p>
				</title>
				<aug>
					<au>
						<snm>Jabbari</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Cruveiller</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Clay</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Le Saux</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bernardi</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Trends Plant Sci</source>
				<pubdate>2004</pubdate>
				<volume>9</volume>
				<fpage>281</fpage>
				<lpage>285</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tplants.2004.04.006</pubid>
						<pubid idtype="pmpid" link="fulltext">15165559</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Pack-MULE transposable elements mediate gene evolution in plants.</p>
				</title>
				<aug>
					<au>
						<snm>Jiang</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Bao</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Eddy</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Wessler</snm>
						<fnm>SR</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>431</volume>
				<fpage>569</fpage>
				<lpage>573</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02953</pubid>
						<pubid idtype="pmpid" link="fulltext">15457261</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Transposable element annotation of the rice genome.</p>
				</title>
				<aug>
					<au>
						<snm>Juretic</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Bureau</snm>
						<fnm>TE</fnm>
					</au>
					<au>
						<snm>Bruskiewich</snm>
						<fnm>RM</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<fpage>155</fpage>
				<lpage>160</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth019</pubid>
						<pubid idtype="pmpid" link="fulltext">14734305</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Full-length messenger RNA sequences greatly improve genome annotation.</p>
				</title>
				<aug>
					<au>
						<snm>Hass</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Volfovsky</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Town</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Troukhan</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Alexandrov</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Feldmann</snm>
						<fnm>KA</fnm>
					</au>
					<au>
						<snm>Flavell</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Salzberg</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>research0029.1</fpage>
				<lpage>0029.12</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1186/gb-2002-3-6-research0029</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Collection, mapping, and annotation of over 28,000 cDNA clones from <it>japonica </it>rice.</p>
				</title>
				<aug>
					<au>
						<snm>Kikuchi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Satoh</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Nagata</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kawagashira</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Doi</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kishimoto</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Yazaki</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ishikawa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Yamada</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Ooka</snm>
						<fnm>H</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>301</volume>
				<fpage>376</fpage>
				<lpage>379</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1083523</pubid>
						<pubid idtype="pmpid" link="fulltext">12791992</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Whole genome sequence comparisons and 'full-length' cDNA sequences: a combined approach to evaluate and improve <it>Arabidopsis </it>genome annotation.</p>
				</title>
				<aug>
					<au>
						<snm>Castelli</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Aury</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Jaillon</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Wincker</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Clepet</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Menard</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Cruaud</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Quetier</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Scarpelli</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Schachter</snm>
						<fnm>V</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>406</fpage>
				<lpage>413</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">353228</pubid>
						<pubid idtype="pmpid" link="fulltext">14993207</pubid>
						<pubid idtype="doi">10.1101/gr.1515604</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>NCBI Expressed Sequence Tags Database</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/dbEST</url>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Analysis of the transcriptional complexity of <it>Arabidopsis thaliana </it>by massively parallel signature sequencing.</p>
				</title>
				<aug>
					<au>
						<snm>Meyers</snm>
						<fnm>BC</fnm>
					</au>
					<au>
						<snm>Vu</snm>
						<fnm>TH</fnm>
					</au>
					<au>
						<snm>Tej</snm>
						<fnm>SS</fnm>
					</au>
					<au>
						<snm>Ghazal</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Matvienko</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Agrawal</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Ning</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Haudenschild</snm>
						<fnm>CD</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2004</pubdate>
				<volume>22</volume>
				<fpage>1006</fpage>
				<lpage>1011</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt992</pubid>
						<pubid idtype="pmpid" link="fulltext">15247925</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Experimental annotation of the human genome using microarray technology.</p>
				</title>
				<aug>
					<au>
						<snm>Shoemaker</snm>
						<fnm>DD</fnm>
					</au>
					<au>
						<snm>Schadt</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Armour</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>YD</fnm>
					</au>
					<au>
						<snm>Garrett-Engele</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>McDonagh</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Loerch</snm>
						<fnm>PM</fnm>
					</au>
					<au>
						<snm>Leonardson</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lum</snm>
						<fnm>PY</fnm>
					</au>
					<au>
						<snm>Cavet</snm>
						<fnm>G</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>409</volume>
				<fpage>922</fpage>
				<lpage>927</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35057141</pubid>
						<pubid idtype="pmpid" link="fulltext">11237012</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>RNA expression analysis using a 30 base pair resolution <it>Escherichia coli </it>genome array.</p>
				</title>
				<aug>
					<au>
						<snm>Selinger</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Cheung</snm>
						<fnm>KJ</fnm>
					</au>
					<au>
						<snm>Mei</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Johansson</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>CS</fnm>
					</au>
					<au>
						<snm>Blattner</snm>
						<fnm>FR</fnm>
					</au>
					<au>
						<snm>Lockhart</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Church</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Nature Biotechnol</source>
				<pubdate>2000</pubdate>
				<volume>18</volume>
				<fpage>1262</fpage>
				<lpage>1268</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1038/82367</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Large-scale transcriptional activity in chromosomes 21 and 22.</p>
				</title>
				<aug>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Cawley</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Drenkow</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bekiranov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Strausberg</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Fodor</snm>
						<fnm>SP</fnm>
					</au>
					<au>
						<snm>Gingeras</snm>
						<fnm>TR</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>296</volume>
				<fpage>916</fpage>
				<lpage>919</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1068597</pubid>
						<pubid idtype="pmpid" link="fulltext">11988577</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>The transcriptional activity of human chromosome 22.</p>
				</title>
				<aug>
					<au>
						<snm>Rinn</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Euskirchen</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bertone</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Martone</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Luscombe</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Hartman</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Harrison</snm>
						<fnm>PM</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>FK</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>P</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genes Dev</source>
				<pubdate>2003</pubdate>
				<volume>17</volume>
				<fpage>529</fpage>
				<lpage>540</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">195998</pubid>
						<pubid idtype="pmpid" link="fulltext">12600945</pubid>
						<pubid idtype="doi">10.1101/gad.1055203</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Empirical analysis of transcriptional activity in the <it>Arabidopsis </it>genome.</p>
				</title>
				<aug>
					<au>
						<snm>Yamada</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Lim</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Dale</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Shinn</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Palm</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Southwick</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>HC</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Nguyen</snm>
						<fnm>M</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>302</volume>
				<fpage>842</fpage>
				<lpage>846</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1088305</pubid>
						<pubid idtype="pmpid" link="fulltext">14593172</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Global identification of human transcribed sequences with genome tiling arrays.</p>
				</title>
				<aug>
					<au>
						<snm>Bertone</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Stolc</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Royce</snm>
						<fnm>TE</fnm>
					</au>
					<au>
						<snm>Rozowsky</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Urban</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Tongprasit</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Samanta</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Weissman</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Rinn</snm>
						<fnm>JL</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<fpage>2242</fpage>
				<lpage>2246</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1103388</pubid>
						<pubid idtype="pmpid" link="fulltext">15539566</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22.</p>
				</title>
				<aug>
					<au>
						<snm>Kampa</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Yamanaka</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Brubaker</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Cawley</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Drenkow</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Piccolboni</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bekiranov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Helt</snm>
						<fnm>G</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>331</fpage>
				<lpage>342</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">353210</pubid>
						<pubid idtype="pmpid" link="fulltext">14993201</pubid>
						<pubid idtype="doi">10.1101/gr.2094104</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>A gene expression map for the euchromatic genome of <it>Drosophila melanogaster</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Stolc</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Gauhar</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Mason</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Halasz</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>vanBatenburg</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Rifkin</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Hua</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Herreman</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tongprasit</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Barbano</snm>
						<fnm>PE</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<fpage>655</fpage>
				<lpage>660</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1101312</pubid>
						<pubid idtype="pmpid" link="fulltext">15499012</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Applications of DNA tiling arrays for whole-genome analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Mockler</snm>
						<fnm>TC</fnm>
					</au>
					<au>
						<snm>Ecker</snm>
						<fnm>JR</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>2005</pubdate>
				<volume>85</volume>
				<fpage>1</fpage>
				<lpage>15</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.ygeno.2004.10.005</pubid>
						<pubid idtype="pmpid" link="fulltext">15607417</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Maskless fabrication of light-directed oligonucleotide microarrays using a digital micromirror array.</p>
				</title>
				<aug>
					<au>
						<snm>Singh-Gasson</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Yue</snm>
						<fnm>YJ</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Blattner</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Sussman</snm>
						<fnm>MR</fnm>
					</au>
					<au>
						<snm>Cerrina</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>1999</pubdate>
				<volume>17</volume>
				<fpage>974</fpage>
				<lpage>978</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/13664</pubid>
						<pubid idtype="pmpid" link="fulltext">10504697</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Gene expression analysis using oligonucleotide arrays produced by maskless photolithography.</p>
				</title>
				<aug>
					<au>
						<snm>Nuwaysir</snm>
						<fnm>EF</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Albert</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Singh</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Nuwaysir</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Pitas</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Gorski</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Berg</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Ballin</snm>
						<fnm>J</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>1749</fpage>
				<lpage>1755</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">187555</pubid>
						<pubid idtype="pmpid" link="fulltext">12421762</pubid>
						<pubid idtype="doi">10.1101/gr.362402</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>BGI-RIS: an integrated information resource and comparative analysis workbench for rice genomics.</p>
				</title>
				<aug>
					<au>
						<snm>Zhao</snm>
						<fnm>WM</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Jiao</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Wei</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Fu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Ren</snm>
						<fnm>X</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<fpage>D377</fpage>
				<lpage>D382</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308819</pubid>
						<pubid idtype="pmpid" link="fulltext">14681438</pubid>
						<pubid idtype="doi">10.1093/nar/gkh085</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Phosphate starvation triggers distinct alterations of genome expression in <it>Arabidopsis </it>roots and leaves.</p>
				</title>
				<aug>
					<au>
						<snm>Wu</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Ma</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Hou</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Deng</snm>
						<fnm>XW</fnm>
					</au>
				</aug>
				<source>Plant Physiol</source>
				<pubdate>2003</pubdate>
				<volume>132</volume>
				<fpage>1260</fpage>
				<lpage>1271</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">167066</pubid>
						<pubid idtype="pmpid" link="fulltext">12857808</pubid>
						<pubid idtype="doi">10.1104/pp.103.021022</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Monitoring expression profiles of rice genes under cold, drought, and high-salinity stresses and abscisic acid application using cDNA microarray and RNA gel-blot analyses.</p>
				</title>
				<aug>
					<au>
						<snm>Rabbani</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Maruyama</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Abe</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Khan</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Katsura</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ito</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Yoshiwara</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Seki</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Shinozaki</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Yamaguchi-Shinozaki</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Plant Physiol</source>
				<pubdate>2003</pubdate>
				<volume>133</volume>
				<fpage>1755</fpage>
				<lpage>1767</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">300730</pubid>
						<pubid idtype="pmpid" link="fulltext">14645724</pubid>
						<pubid idtype="doi">10.1104/pp.103.025742</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Toward a cytological characterization of the rice genome.</p>
				</title>
				<aug>
					<au>
						<snm>Cheng</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Buell</snm>
						<fnm>CR</fnm>
					</au>
					<au>
						<snm>Wing</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Gu</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Jiang</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2001</pubdate>
				<volume>11</volume>
				<fpage>2133</fpage>
				<lpage>2141</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">311230</pubid>
						<pubid idtype="pmpid" link="fulltext">11731505</pubid>
						<pubid idtype="doi">10.1101/gr.194601</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Comparative genetics in the grasses.</p>
				</title>
				<aug>
					<au>
						<snm>Gale</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Devos</snm>
						<fnm>KM</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<fpage>1971</fpage>
				<lpage>1974</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">33824</pubid>
						<pubid idtype="pmpid" link="fulltext">9482816</pubid>
						<pubid idtype="doi">10.1073/pnas.95.5.1971</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Rice as a model for comparative genomics of plants.</p>
				</title>
				<aug>
					<au>
						<snm>Shimamoto</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kyozuka</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Annu Rev Plant Biol</source>
				<pubdate>2002</pubdate>
				<volume>53</volume>
				<fpage>399</fpage>
				<lpage>419</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.arplant.53.092401.134447</pubid>
						<pubid idtype="pmpid">12221982</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p><it>Arabidopsis </it>to rice. Applying knowledge from a weed to enhance our understanding of a crop species.</p>
				</title>
				<aug>
					<au>
						<snm>Rensink</snm>
						<fnm>WA</fnm>
					</au>
					<au>
						<snm>Buell</snm>
						<fnm>CR</fnm>
					</au>
				</aug>
				<source>Plant Physiol</source>
				<pubdate>2004</pubdate>
				<volume>135</volume>
				<fpage>622</fpage>
				<lpage>629</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1104/pp.104.040170</pubid>
						<pubid idtype="pmpid" link="fulltext">15208410</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Current methods of gene prediction, their strengths and weaknesses.</p>
				</title>
				<aug>
					<au>
						<snm>Math&#233;</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Sagot</snm>
						<fnm>M-F</fnm>
					</au>
					<au>
						<snm>Schiex</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Rouz&#233;</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>4103</fpage>
				<lpage>4117</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12364589</pubid>
						<pubid idtype="doi">10.1093/nar/gkf543</pubid>
						<pubid idtype="pmcid">140543</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Computational prediction of eukaryotic protein-coding genes.</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>698</fpage>
				<lpage>709</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg890</pubid>
						<pubid idtype="pmpid" link="fulltext">12209144</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Plant transposable elements: where genetics meets genomics.</p>
				</title>
				<aug>
					<au>
						<snm>Feschotte</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jiang</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Wessler</snm>
						<fnm>SR</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>329</fpage>
				<lpage>341</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg793</pubid>
						<pubid idtype="pmpid" link="fulltext">11988759</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Activation of plant retrotransposons under stress conditions.</p>
				</title>
				<aug>
					<au>
						<snm>Grandbastien</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>Trends Plant Sci</source>
				<pubdate>1998</pubdate>
				<volume>3</volume>
				<fpage>181</fpage>
				<lpage>187</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/S1360-1385(98)01232-1</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Computational identification of plant microRNAs and their targets, including a stress-induced miRNA.</p>
				</title>
				<aug>
					<au>
						<snm>Jones-Rhoades</snm>
						<fnm>MW</fnm>
					</au>
					<au>
						<snm>Bartel</snm>
						<fnm>DP</fnm>
					</au>
				</aug>
				<source>Mol Cell</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>787</fpage>
				<lpage>799</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.molcel.2004.05.027</pubid>
						<pubid idtype="pmpid" link="fulltext">15200956</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Sorghum genome sequencing by methylation filtration.</p>
				</title>
				<aug>
					<au>
						<snm>Bedell</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Budiman</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Nunberg</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Citek</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Robbins</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Flick</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Rohlfing</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Fries</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bradford</snm>
						<fnm>K</fnm>
					</au>
					<etal/>
				</aug>
				<source>PLoS Biol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>e13</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">539327</pubid>
						<pubid idtype="pmpid" link="fulltext">15660154</pubid>
						<pubid idtype="doi">10.1371/journal.pbio.0030013</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Heterochromatin.</p>
				</title>
				<aug>
					<au>
						<snm>Hennig</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>1999</pubdate>
				<volume>108</volume>
				<fpage>1</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s004120050346</pubid>
						<pubid idtype="pmpid" link="fulltext">10199951</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Developmental patterns of chromatin structure and DNA methylation responsible for epigenetic expression of a maize regulatory gene.</p>
				</title>
				<aug>
					<au>
						<snm>Hoekenga</snm>
						<fnm>OA</fnm>
					</au>
					<au>
						<snm>Muszynski</snm>
						<fnm>MG</fnm>
					</au>
					<au>
						<snm>Cone</snm>
						<fnm>KC</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2000</pubdate>
				<volume>155</volume>
				<fpage>1889</fpage>
				<lpage>1902</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10924483</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Differential chromatin structure within a tandem array 100 kb upstream of the maize b1 locus is associated with paramutation.</p>
				</title>
				<aug>
					<au>
						<snm>Stam</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Belele</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Dorweiler</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Chandler</snm>
						<fnm>VL</fnm>
					</au>
				</aug>
				<source>Genes Dev</source>
				<pubdate>2002</pubdate>
				<volume>16</volume>
				<fpage>1906</fpage>
				<lpage>1918</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">186425</pubid>
						<pubid idtype="pmpid" link="fulltext">12154122</pubid>
						<pubid idtype="doi">10.1101/gad.1006702</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Gene expression analyses of <it>Arabidopsis </it>chromosome 2 using a genomic DNA amplicon microarray.</p>
				</title>
				<aug>
					<au>
						<snm>Kim</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Snesrud</snm>
						<fnm>EC</fnm>
					</au>
					<au>
						<snm>Haas</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Cheung</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Town</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Quackenbush</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>327</fpage>
				<lpage>340</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">430289</pubid>
						<pubid idtype="pmpid" link="fulltext">12618363</pubid>
						<pubid idtype="doi">10.1101/gr.552003</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Formation of stable epialleles and their paramutation-like interaction in tetraploid <it>Arabidopsis thaliana</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Mittelsten Scheid</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Afsar</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Paszkowski</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<fpage>450</fpage>
				<lpage>454</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1210</pubid>
						<pubid idtype="pmpid" link="fulltext">12847525</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Chromatin silencing and <it>Arabidopsis </it>development: a role for polycomb protein.</p>
				</title>
				<aug>
					<au>
						<snm>Preuss</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Plant Cell</source>
				<pubdate>1999</pubdate>
				<volume>11</volume>
				<fpage>765</fpage>
				<lpage>768</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1105/tpc.11.5.765</pubid>
						<pubid idtype="pmpid" link="fulltext">10330463</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Transcriptional transgene silencing and chromatin components.</p>
				</title>
				<aug>
					<au>
						<snm>Meyer</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Plant Mol Biol</source>
				<pubdate>2000</pubdate>
				<volume>43</volume>
				<fpage>221</fpage>
				<lpage>234</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1006483428789</pubid>
						<pubid idtype="pmpid" link="fulltext">10999406</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>Modulation of a transcription factor counteracts heterochromatic gene silencing in <it>Drosophila</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Ahmad</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Henikof</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2001</pubdate>
				<volume>104</volume>
				<fpage>839</fpage>
				<lpage>847</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(01)00281-1</pubid>
						<pubid idtype="pmpid" link="fulltext">11290322</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>Analysis of histone acetyltransferase and histone deacetylase families of <it>Arabidopsis thaliana </it>suggests functional diversification of chromatin modification among multicellular eukaryotes.</p>
				</title>
				<aug>
					<au>
						<snm>Pandey</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Muller</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Napoli</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Selinger</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Pikaard</snm>
						<fnm>CS</fnm>
					</au>
					<au>
						<snm>Richards</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Bender</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Mount</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Jorgensen</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>5036</fpage>
				<lpage>5055</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">137973</pubid>
						<pubid idtype="pmpid" link="fulltext">12466527</pubid>
						<pubid idtype="doi">10.1093/nar/gkf660</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Chromatin-remodeling and memory factors. New regulators of plant development.</p>
				</title>
				<aug>
					<au>
						<snm>Reyes</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Hennig</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Gruissem</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Plant Physiol</source>
				<pubdate>2002</pubdate>
				<volume>130</volume>
				<fpage>1090</fpage>
				<lpage>1101</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1104/pp.006791</pubid>
						<pubid idtype="pmpid" link="fulltext">12427976</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>DNA methylation controls histone H3 lysine 9 methylation and heterochromatin assembly in <it>Arabidopsis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Soppe</snm>
						<fnm>WJ</fnm>
					</au>
					<au>
						<snm>Jasencakova</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Houben</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kakutani</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Meister</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Jacobsen</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Schubert</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Fransz</snm>
						<fnm>PF</fnm>
					</au>
				</aug>
				<source>EMBO J</source>
				<pubdate>2002</pubdate>
				<volume>21</volume>
				<fpage>6549</fpage>
				<lpage>6559</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">136960</pubid>
						<pubid idtype="pmpid">12456661</pubid>
						<pubid idtype="doi">10.1093/emboj/cdf657</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Role of transposable elements in heterochromatin and epigenetic control.</p>
				</title>
				<aug>
					<au>
						<snm>Lippman</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gendrel</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Black</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Vaughn</snm>
						<fnm>MW</fnm>
					</au>
					<au>
						<snm>Dedhia</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>McCombie</snm>
						<fnm>WR</fnm>
					</au>
					<au>
						<snm>Lavine</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Mittal</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>May</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kasschau</snm>
						<fnm>KD</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>430</volume>
				<fpage>471</fpage>
				<lpage>476</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02651</pubid>
						<pubid idtype="pmpid" link="fulltext">15269773</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>The role of RNA interference in heterochromatic silencing.</p>
				</title>
				<aug>
					<au>
			