<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-3-r57</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Transcriptional analysis of highly syntenic regions between <it>Medicago truncatula </it>and <it>Glycine max </it>using tiling microarrays</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Li</snm>
               <fnm>Lei</fnm>
               <insr iid="I1"/>
               <insr iid="I8"/>
               <email>ll4jn@virginia.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>He</snm>
               <fnm>Hang</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>hang.he@yale.edu</email>
            </au>
            <au id="A3">
               <snm>Zhang</snm>
               <fnm>Juan</fnm>
               <insr iid="I4"/>
               <email>jzhang@danforthcenter.org</email>
            </au>
            <au id="A4">
               <snm>Wang</snm>
               <fnm>Xiangfeng</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>xiangfeng.wang@yale.edu</email>
            </au>
            <au id="A5">
               <snm>Bai</snm>
               <fnm>Sulan</fnm>
               <insr iid="I5"/>
               <email>sulanb@sina.com</email>
            </au>
            <au id="A6">
               <snm>Stolc</snm>
               <fnm>Viktor</fnm>
               <insr iid="I6"/>
               <email>vstolc@mail.arc.nasa.gov</email>
            </au>
            <au id="A7">
               <snm>Tongprasit</snm>
               <fnm>Waraporn</fnm>
               <insr iid="I6"/>
               <email>wtongprasit@mail.arc.nasa.gov</email>
            </au>
            <au id="A8">
               <snm>Young</snm>
               <mi>D</mi>
               <fnm>Nevin</fnm>
               <insr iid="I7"/>
               <email>neviny@umn.edu</email>
            </au>
            <au id="A9">
               <snm>Yu</snm>
               <fnm>Oliver</fnm>
               <insr iid="I4"/>
               <email>oyu@danforthcenter.org</email>
            </au>
            <au id="A10" ca="yes">
               <snm>Deng</snm>
               <fnm>Xing-Wang</fnm>
               <insr iid="I1"/>
               <email>xingwang.deng@yale.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA</p>
            </ins>
            <ins id="I2">
               <p>National Institute of Biological Sciences, Beijing 102206, China</p>
            </ins>
            <ins id="I3">
               <p>Peking-Yale Joint Research Center of Plant Molecular Genetics and Agrobiotechnology, Peking University, Beijing 100871, China</p>
            </ins>
            <ins id="I4">
               <p>Donald Danforth Plant Science Center, St Louis, MO 63132, USA</p>
            </ins>
            <ins id="I5">
               <p>College of Life Sciences, Capital Normal University, Beijing 100037, China</p>
            </ins>
            <ins id="I6">
               <p>Genome Research Facility, NASA Ames Research Center, Moffett Field, CA 94035, USA</p>
            </ins>
            <ins id="I7">
               <p>Department of Plant Pathology, University of Minnesota, St Paul, MN 55108, USA</p>
            </ins>
            <ins id="I8">
               <p>Current address: Department of Biology, University of Virginia, Charlottesville, VA 22904, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>3</issue>
         <fpage>R57</fpage>
         <url>http://genomebiology.com/2008/9/3/R57</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18348734</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-3-r57</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>30</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>19</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>19</day>
               <month>03</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Li et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Profiling barrel medic and soybean syntenic regions</p>
      </shorttitle>
      <shortabs>
         <p>The comparative transcriptional analysis of highly syntenic regions in six different organ types between <it>Medicago truncatula</it> (barrel medic) and <it>Glycine max</it> (soybean), using nucleotide tiling microarrays, provides insights into genome organization and transcriptional regulation in these legume plants.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Legumes are the third largest family of flowering plants and are unique among crop species in their ability to fix atmospheric nitrogen. As a result of recent genome sequencing efforts, legumes are now one of a few plant families with extensive genomic and transcriptomic data available in multiple species. The unprecedented complexity and impending completeness of these data create opportunities for new approaches to discovery.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We report here a transcriptional analysis in six different organ types of syntenic regions totaling approximately 1 Mb between the legume plants barrel medic (<it>Medicago truncatula</it>) and soybean (<it>Glycine max</it>) using oligonucleotide tiling microarrays. This analysis detected transcription of over 80% of the predicted genes in both species. We also identified 499 and 660 transcriptionally active regions from barrel medic and soybean, respectively, over half of which locate outside of the predicted exons. We used the tiling array data to detect differential gene expression in the six examined organ types and found several genes that are preferentially expressed in the nodule. Further investigation revealed that some collinear genes exhibit different expression patterns between the two species.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>These results demonstrate the utility of genome tiling microarrays in generating transcriptomic data to complement computational annotation of the newly available legume genome sequences. The tiling microarray data was further used to quantify gene expression levels in multiple organ types of two related legume species. Further development of this method should provide a new approach to comparative genomics aimed at elucidating genome organization and transcriptional regulation.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The rapidly increasing number of genome and transcript sequences in recent years is having two marked, complementary effects on the relatively new discipline of plant genomics and transcriptomics. The newly available sequences need to be fully annotated to identify all the functional and structural elements. Because genome annotation is a reiterative process that is heavily dependent on large-scale, high-throughput experimental data, each additional genome sequence comes as a new challenge. On the other hand, the availability of multiple genomic and transcriptomic datasets fosters comparative analyses that improve structural annotation of the genomes and generate new insight into the function and evolution of protein-coding and non-coding regions of the genomes.</p>
         <p>One approach to systematically characterize genome transcription is to use high feature-density tiling microarrays on which a given genome sequence is represented <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Genome tiling arrays have been used in a number of model species for which the full genome sequence is available <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Results from these studies have shown that for well-documented transcripts, such as those of polyadenylated RNAs from annotated genes, hybridization signals from tiling arrays identify the transcriptional start and stop sites, the locations of introns, and the events of alternative splicing <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Tiling arrays therefore provide a valuable means for confirming the large number of predicted genes that otherwise lack supportive experimental evidence. However, tiling array signals also reveal a large number of putative novel transcripts for which no conventional explanations are yet available.</p>
         <p>With respect to plants, the <it>Arabidopsis thaliana </it>genome was the first to be probed by tiling microarrays <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Tiling array analysis of the more complex rice genome has been carried out as well <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. The rice tiling array data were used to detect transcription of the majority of the annotated genes. For example, of the 43,914 non-transposable element protein-coding genes from the improved <it>indica</it> whole genome shotgun sequence <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, transcription of 35,970 (81.9%) was detected <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. On the other hand, comprehensive identification of transcriptionally active regions (TARs) from tiling array profiles revealed significant transcriptional activities outside of the annotated exons <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Subsequent analyses indicate that about 80% of the non-exonic TARs can be assigned to various putatively functional or structural elements of the rice genome, ranging from splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, to potential non-coding RNAs <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>In addition to detecting transcriptome components, genome tiling arrays in theory can be used to directly quantify the expression levels of individual transcription units. As an alternative approach to the surrogate expression arrays, tiling arrays offer two potential advantages. First, in tiling arrays each transcription unit is interrogated by hundreds of probes according to the actual genomic sequence. This strategy eliminates the need to arbitrarily select a small number of supposedly gene-specific probes and thus alleviates probe bias and improves cross-platform comparability in microarray experiments. Second, measurement of gene expression using tiling arrays allows averaging of the results from multiple probes per gene, which can reduce inconsistent probe behavior and thus provide improved statistical confidence.</p>
         <p>Using DNA microarrays to study gene expression in closely related species has become an important approach to identify the genetic basis for phenotypic variation and to trace evolution of gene regulation <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. However, expression levels as well as sequences may differ between species, creating additional technical challenges for inter-species comparisons. Current approaches to control for the effect of sequence divergence are either to mask probes with sequence mismatches <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> or to use probes derived from the various species of interest to cancel out the sequence mismatch effect <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Both approaches, however, rely on a few empirically or computationally selected probes for each gene of interest. Consequently, the effectiveness and accuracy of these approaches is still a matter of debate <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. In related species for which genome sequences have all been determined, genomic tiling arrays could provide an alternative approach to inter-species comparison of gene expression. Again, the inclusion of multiple probes per transcription unit in tiling arrays could potentially improve the accuracy and fairness of the estimation of gene expression levels in each species, which in turn could improve cross-species comparison of the expression patterns of orthologous genes.</p>
         <p>As the third largest family of flowering plants, legumes (Fabaceae) are unique among crop species in their ability to fix atmospheric nitrogen through symbiotic relationships with rhizobia bacteria <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Extensive expressed sequence tags have been collected for a number of legume species, including soybean (<it>Glycine max</it>), lotus (<it>Lotus japonicus</it>), common bean (<it>Phaseolus vulgaris</it>), and barrel medic (<it>Medicago truncatula</it>) <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Genomes of barrel medic, soybean, and lotus are being sequenced because all are models for studying nitrogen fixation and symbiosis, tractable to genetic manipulation, and exhibit diploid genetics and modest genome sizes. Both barrel medic and lotus have a diploid genome of approximately 475 Mb while soybean has a diploidized tetraploid genome estimated at 950 Mb <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Recently, preliminary genome assembly and annotation of barrel medic (Mt2.0) and soybean (Glyma0) became publicly available <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. As a result, legumes are now one of a few plant families in which extensive genome sequences in multiple species are available.</p>
         <p>Comparisons of genome sequences have revealed various degrees of synteny (conservation of gene content and order) among species related at different taxonomic levels. For legume plants, early work based on DNA markers demonstrated substantial genome conservation among some Phasoloid species, including mungbean (<it>Vigna radiata</it>) and cowpea (<it>V. unguiculata</it>) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and between <it>Vigna </it>and the common bean <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Genome-wide gene-based analysis among legumes using a large set of cross-species genetic markers produced chromosome alignments from five species of the Papilionoid subfamily, including barrel medic and soybean <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. More recently, direct synteny comparison of the finished and anchored genome sequences from barrel medic and lotus was made. Results from this study indicated that three-quarters of the genome of each species may reside in conserved syntenic segments in the genome of the other <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, which share at least ten large-scale synteny blocks that frequently extend the length of whole chromosome arms <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         <p>Two soybean regions comprising approximately 0.5 Mb each surrounding the soybean cyst nematode resistance loci, <it>rhg1 </it>and <it>Rhg4</it>, were extensively characterized <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Using these sequences, Mudge <it>et al</it>. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> identified the syntenic regions from barrel medic. They found that many predicted genes in the syntenic regions were conserved and collinear between the two species. Here, we used tiling microarray analysis to verify the predicted genes, to identify additional transcripts, and to compare transcription patterns in six different organ types in each species. Our results provide transcriptional support to over 80% of the predicted genes and identified 499 and 660 TARs from barrel medic and soybean, respectively. The gene expression patterns in the six organ types of some collinear genes showed significant differences between the two species despite synteny at the DNA level, demonstrating the usefulness of genomic tiling analysis in comparative genomics.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Genes in the syntenic regions between barrel medic and soybean</p>
            </st>
            <p>In a previous study, two regions in the soybean genome comprising approximately 0.5 Mb each surrounding the soybean cyst nematode resistance loci, <it>rhg1 </it>and <it>Rhg4</it>, were used to identify syntenic regions in the <it>Medicago </it>genome <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Because there was a 2 cM gap in the first region, these sequences were referred to as synteny blocks 1a, 1b, and 2 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The syntenic regions in barrel medic also totaled about 1 Mb, though they were scattered into smaller contigs. For example, synteny block 1b in barrel medic contained two additional gaps <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In barrel medic, there were two segmental duplications (block 2i and 2ii) that were both syntenic to soybean synteny block 2 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
            <p>Genes were predicted in the 1 Mb barrel medic and soybean sequence contigs using FGENESH <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Both the dicot plants (<it>Arabidopsis</it>) and the <it>Medicago </it>(legume plant) matrixes were used and their outputs compared <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Using the legume matrix, 229 and 217 genes were predicted for the barrel medic and soybean sequences, respectively (Additional data file 1). These represent significantly more but shorter genes (exons) compared with the <it>Arabidopsis </it>matrix outputs. However, the legume matrix prediction also resulted in more base-pairs in the exons (increases of 10.3% and 8.2% for barrel medic and soybean, respectively; Additional data file 1). These results clearly demonstrate that gene prediction output is sensitive to the training matrix and highlight the importance of experimental means in verifying and improving computational gene prediction. For simplicity, we selected the gene prediction from the legume matrix for further analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Tiling microarray detection of predicted genes</p>
            </st>
            <p>We designed two independent sets of overlapping 36-mer oligonucleotide probes offset by five nucleotides to represent both DNA strands of the 1 Mb syntenic barrel medic and soybean sequences (see Materials and methods). Each set of probes was synthesized into a single array based on Maskless Array Synthesis technology <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B34">34</abbr></abbrgrp>. The barrel medic and soybean arrays were hybridized in parallel with target cDNA prepared from six organ types of each plant, namely, root, nodule, stem, leaf, flower and developing seed. Fluorescence intensity of the probes was correlated with the genome position by alignment of the probes to the chromosomal coordinates (Figure <figr fid="F1">1</figr>). Transcriptional analysis of the syntenic regions was then achieved by examining expression of the predicted genes and systematically screening for TARs.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Tiling microarray analysis of the 1 Mb syntenic regions</p>
               </caption>
               <text>
                  <p>Tiling microarray analysis of the 1 Mb syntenic regions. A representative Gene Browser window is shown in which predicted genes are aligned to the chromosomal coordinates. Arrows indicate the direction of transcription. The interrogating tiling probes are also aligned to the chromosome coordinates with the fluorescence intensity value depicted as a vertical bar in the six organ types. From top to bottom: nodule, root, stem, leaf, flower and seed.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-1"/>
            </fig>
            <p>We used a method based on the binomial theorem to score the tiling array data obtained from the six organ types to detect transcription of the predicted genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Analysis of the tiling array data detected 193 out of 229 (84%) and 176 out of 217 (81%) predicted genes in at least one of the six organ types in barrel medic and soybean, respectively (Figure <figr fid="F2">2a</figr>), indicating that most predicted gene loci are transcribed. Among the six organ types, detection rates of predicted genes ranged from 48% (flower) to 75% (nodule) in barrel medic, and from 60% (root) to 76% (flower) in soybean (Figure <figr fid="F2">2b</figr>). Interestingly, the gene detection rate in the nodule was the most similar between both species (74.7% and 73.3% in barrel medic and soybean, respectively; Figure <figr fid="F2">2b</figr>). These results suggest that transcription of the predicted genes from the 1 Mb syntenic sequences between barrel medic and soybean is, to a large extent, differentially regulated in the two species, which was further investigated (see below).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Tiling microarray detection of the predicted genes in the 1 Mb region syntenic between barrel medic and soybean</p>
               </caption>
               <text>
                  <p>Tiling microarray detection of the predicted genes in the 1 Mb region syntenic between barrel medic and soybean. <b>(a) </b>Pie charts showing the number and percentage of genes detected by tiling arrays in at least one of the six examined organ types. <b>(b) </b>Tiling array detection rates of predicted genes in the six organ types in barrel medic and soybean.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Identification and characterization of TARs</p>
            </st>
            <p>We next scored tiling microarray data blind to the annotated genes and identified 499 and 660 unique TARs in barrel medic and soybean, respectively (see Materials and methods). The barrel medic and soybean TARs exhibited distinct overall organ specificity. Compared with TARs in barrel medic, soybean TARs in general were detected in more tissue types (Figure <figr fid="F3">3a</figr>), implying a more constitutive expression pattern. Furthermore, roughly equal numbers of barrel medic (181) and soybean (187) TARs were detected in just one organ type. These TARs were detected in barrel medic mainly from stem and leaf while nodule and root were the most abundance source in soybean (Figure <figr fid="F3">3b</figr>). Thus, these TARs appear to represent organ-specific transcriptional activities that differ in the examined sequences between barrel medic and soybean.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Analysis of the frequency of TARs in different organ types</p>
               </caption>
               <text>
                  <p>Analysis of the frequency of TARs in different organ types. <b>(a) </b>Percentage and number of TARs detected by tiling arrays in one, two, three, four, five and all six organ types in barrel medic and soybean. <b>(b) </b>Organ-specific number of TARs detected from only one organ type by tiling arrays in barrel medic and soybean.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-3"/>
            </fig>
            <p>Aligning against the predicted genes, 188 (38%) and 305 (46%) barrel medic and soybean TARs intersect with an exon. The remaining 311 (62%) barrel medic and 355 (54%) soybean TARs are located outside of or antisense to the predicted exons and are referred to as non-exonic TARs. The distributions of TARs detected in barrel medic and soybean in the different annotated genome components are illustrated in Figure <figr fid="F4">4a</figr>. Interestingly, the relative proportion of TARs in each annotated genome component is largely comparable to results from a whole-genome tiling array analysis in rice <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. This observation indicates that predicted exons account for less than half of the transcriptome detected by tiling arrays in rice and legume plants, despite their different genome sizes and distinct genome organization. Furthermore, a significant portion of TARs was found antisense to the predicted genes in both barrel medic (14%) and soybean (16%) (Figure <figr fid="F4">4a</figr>), which adds to previous tiling array analysis in <it>Arabidopsis </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp> and rice <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp> in showing that antisense transcription is an inherent property of the plant genomes.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Classification of TARs based on physical location relative to the predicted genes</p>
               </caption>
               <text>
                  <p>Classification of TARs based on physical location relative to the predicted genes. <b>(a) </b>Pie charts showing percentage of all identified TARs in different genome components relative to the predicted gene structures in barrel medic and soybean. <b>(b) </b>Number of non-exonic TARs in different sub-genic regions in barrel medic and soybean.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-4"/>
            </fig>
            <p>The non-exonic TARs were further analyzed in terms of their physical location relative to the predicted genes. In this analysis, genome regions were divided into eight different configurations against the predicted exons (Figure <figr fid="F4">4b</figr>). Interestingly, in almost all antisense configurations, there were more TARs in soybean than in barrel medic (Figure <figr fid="F4">4b</figr>), suggesting that antisense transcription is more prevalent in soybean than in barrel medic. This analysis also revealed a surprisingly large number of intergenic TARs (36 in barrel medic and 45 in soybean) located in close proximity on the antisense strand 5' to the start of a predicted gene (Figure <figr fid="F4">4b</figr>). Because the predicted genes do not include untranslated regions, it is conceivable that transcripts derived from these TARs and the corresponding genes are arranged in a divergent antisense orientation and could potentially form duplex transcript pairs.</p>
         </sec>
         <sec>
            <st>
               <p>Differential gene expression in the syntenic regions</p>
            </st>
            <p>The binomial theorem-based method used to detect gene transcription does not assign a value to the expression level and is only useful for present calls <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Therefore, we used a median polishing-based method that fits an additive linear model <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> to determine differential expression of the predicted genes in the six examined organ types and to assess the relative deviation of gene expression level in each organ type (see Materials and methods). In barrel medic, 67 (29%) of the 229 predicted genes were identified as differentially expressed (<it>p </it>&lt; 0.001) among the six examined organ types (Figure <figr fid="F5">5</figr>). In soybean, 72 (33%) of the 217 predicted genes displayed differential expression (Figure <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Analysis of differentially expressed genes</p>
               </caption>
               <text>
                  <p>Analysis of differentially expressed genes. Heat maps represent unsupervised clustering of differentially expressed genes in barrel medic and soybean. The red, yellow, and blue colors depict positive deviation, no deviation, and negative deviation of the transcription level, respectively.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-5"/>
            </fig>
            <p>Precise transcriptional and developmental controls are required for the establishment of the complex interaction between the nitrogen-fixing rhizobia and plant cells in the nodule. To begin to understand the transcriptional program in nodules, we identified and compared genes specifically expressed in the nodule. Within the syntenic regions in barrel medic, 11 (16%, including one duplicated gene) differentially expressed genes showed higher transcription levels in the nodule than in the other five organ types (Additional data file 2). In soybean, there were 10 (14%) differentially expressed genes showing higher transcription levels in the nodule (Additional data file 3). Nodule-enhanced expression levels of six randomly selected genes in soybean were all confirmed by RT-PCR analysis (Figure <figr fid="F6">6a</figr>), indicating that the median polishing-based method used to score the tiling data is accurate in detecting organ type-specific transcripts. A particular example is illustrated in Figure <figr fid="F6">6b</figr>. This gene (Gm_121) is homologous to the <it>Ljsbp </it>gene from <it>Lotus japonicus </it>that encodes a putative selenium binding protein <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. <it>In situ </it>hybridization analysis revealed that the <it>Ljsbp </it>transcripts were localized in the young nodules, the vascular tissues of young seedpods and embryos <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, which is consistent with the tiling array and RT-PCR data on the soybean ortholog (Figure <figr fid="F6">6b</figr>).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Verification of tiling array detected differentially expressed genes</p>
               </caption>
               <text>
                  <p>Verification of tiling array detected differentially expressed genes. <b>(a) </b>RT-PCR analysis of the transcript abundance in six organ types for six selected soybean genes that are preferentially expressed in the nodule. Total RNA (5 &#956;g) was reverse transcribed and 5% of the product used as template for PCR, which was carried out for 35 cycles. <b>(b) </b>Organ type-specific variation of the expression level of the gene Gm_121, as determined by median-polishing of the tiling array data. Dashed lines indicate the deviation value at <it>p </it>= 0.001.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-6"/>
            </fig>
            <p>In soybean, all but one of the detected nodule-enhanced genes are known genes (Additional data file 3). In contrast, only three of the 11 nodule-enhanced genes detected in barrel medic match with a known gene while the other eight genes have no assigned functions (Additional data file 2). When the nodule-enhanced genes detected in barrel medic and soybean were compared for synteny, six of the ten soybean genes were found to have a collinear counterpart in barrel medic, although transcription of the collinear genes in barrel medic was not nodule-enhanced (Additional data file 3). Consequently, there was only one gene encoding a TGACG-binding transcription factor that is collinear as well as specifically expressed in the nodule in both species.</p>
         </sec>
         <sec>
            <st>
               <p>Transcriptional pattern of collinear genes in the syntenic regions</p>
            </st>
            <p>The barrel medic and soybean sequences interrogated by the tiling microarray are highly syntenic. In the previous report, a total of 68 pairs of genes were found to be collinear with both the gene order and orientation conserved between barrel medic and soybean homologs <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In the current study, we were able to identify 78 collinear gene pairs based on the gene prediction output from the legume matrix.</p>
            <p>To begin to obtain information on the variation in gene expression between barrel medic and soybean, which is important for defining transcriptional regulatory networks that contribute to their phenotypic variations <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, we examined the expression pattern of the collinear genes. To this end, we used the transcription level deviation in the six organ types for each collinear gene as a parameter to profile gene expression patterns. Consistent with the fact that most genes were not differentially expressed in different organ types, a majority of the collinear genes showed relatively small organ type deviation (Figure <figr fid="F7">7</figr>). However, a number of collinear genes exhibited drastic variation in transcription levels across the organ types. In barrel medic, the most conspicuous example is a group of genes that are down-regulated in the seed but up-regulated in the stem. In soybean, the root exhibited the greatest gene expression variation (Figure <figr fid="F7">7</figr>). Importantly, the transcription pattern of these collinear genes is not conserved in the reciprocal species, suggesting that the regulatory sequence of these genes is under positive selection.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Analysis of the transcription patterns of collinear genes</p>
               </caption>
               <text>
                  <p>Analysis of the transcription patterns of collinear genes. The collinear genes in both barrel medic and soybean are ordered by chromosome position. For each gene, the deviation of transcription level was calculated based on median polishing for the six organ types (see Materials and methods). The gene order was then plotted against the corresponding deviation value in each of the six organ types, which is color-coded.</p>
               </text>
               <graphic file="gb-2008-9-3-r57-7"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The rapidly accumulating amount of genome and transcriptome data in recent years is having profound effects on biological research. Elucidating all the functional and structural elements of the genome sequences and how they are organized and regulated, and how they evolved has thus become the focus of the next phase of genome projects. In these regards, genome tiling microarray analysis is emerging as a new powerful approach, which involves the development of tiling arrays containing progressive oligonucleotide tiles that represent a target genome. Recent advances in microarray technologies allow oligonucleotide arrays to be made with several hundred thousand to several million discrete features per array, which permits tiling complex genomes with a manageable number of arrays <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. This in turn has resulted in transcriptomic tiling data for a large number of model species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Application of tiling array analysis in genomics studies has significantly broadened our understanding of the genetic information encoded in the genome sequences. When probed against various RNA samples, tiling array hybridization patterns identify transcript ends and intron locations <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Tiling array analysis thus provides a valuable means for verifying genome annotation, which is a challenge that must be met for each new genome sequence. In the current study, we generated tiling array data for a 1 Mb region syntenic between barrel medic and soybean in six different organ types (Figure <figr fid="F1">1</figr>). Analysis of the tiling array data detected 193 out of 229 (84%) and 176 out of 217 (81%) predicted genes in barrel medic and soybean, respectively (Figure <figr fid="F2">2</figr>), similar to results reported from tiling array analysis of the rice genome <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Because genome annotation is a highly reiterative process that improves with the parallel refinement of gene-finding programs and the availability of experimental evidence, we anticipate further application of tiling array analysis to facilitate annotation of the fast emerging legume genome sequences <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B30">30</abbr><abbr bid="B39">39</abbr></abbrgrp>.</p>
         <p>Another use of the tiling array data is to identify transcription units in addition to the predicted genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Previous tiling analyses indeed documented large numbers of putative novel transcripts in virtually all the genomes examined <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. For example, detailed characterization of the non-exonic TARs identified in the <it>japonica </it>rice genome showed that they could be assigned to various putatively functional or structural elements of the genome, ranging from splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, to potential non-coding RNAs <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In carrying out tiling array analysis of the legume sequences, we identified 499 and 660 unique TARs in barrel medic and soybean, respectively (Figure <figr fid="F3">3</figr>). Aligning against the predicted genes, 311 (62%) barrel medic and 355 (54%) soybean TARs were found to locate outside of or antisense to the predicted exons (Figure <figr fid="F4">4</figr>). Interestingly, in a promoter trapping study in lotus in which a promoter-less GUS reporter system was used, GUS activation, often tissue-specific, was found beyond the predicted genic regions <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Together, these observations indicate that novel transcripts missed by gene annotation account for a significant portion of the transcriptome in legume plants.</p>
         <p>As a novel use of tiling array data for transcriptomic profiling, we used a median polishing-based procedure <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B36">36</abbr></abbrgrp> to determine the relative transcription levels and differential expression of the predicted genes. Because there are multiple probes involved in tiling a given gene, the median polishing-based procedure will have the corollary benefit of improved statistical confidence. Based on this method, approximately 30% of genes were found to be differentially expressed among the six examined organ types (Figure <figr fid="F5">5</figr>). The nodule-enhanced expression pattern of six selected soybean genes was subsequently verified by RT-PCR analysis (Figure <figr fid="F6">6</figr>). Collectively, these results indicate that genomic tiling array analysis can be extended to quantitatively examine the transcription levels of individual genes. This may prove particularly useful for quantifying transcription levels of members of paralogous gene families, which are notoriously hard to discriminate in conventional expression arrays that employ relatively fewer probes per gene.</p>
         <p>Interestingly, 11 and 10 genes were identified as preferentially expressed in the nodule in barrel medic and soybean, respectively. These genes exhibited little overlap between the two species (Additional data files 2 and 3). Barrel medic and soybean diverged from a common ancestor approximately 50 million years ago, and represent two distinct groups of nodulating plants <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Barrel medic forms indeterminate nodules, which maintain an active meristem inside nodule primordia during the early stages of nodule development; while soybean forms determinate nodules that, after initial cell divisions, grow by cell expansions. These morphological differences may thus affect the architecture and gene expression in the nodules <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
         <p>The availability of multiple genomic and transcriptomic datasets fosters comparative analyses that improve structural annotation and generate new insight into the function and evolution of coding and non-coding regions of the genomes <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. A major principle of comparative genomics is that the functional DNA sequences in related species conserved from the last common ancestor are preserved in contemporary genome sequences, which encode the proteins and RNAs and the regulatory sequences controling genes with similar expression patterns <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Alignment of primary DNA sequences is the core process in most comparative analyses. The resulting information on sequence similarity among genomes is a major resource for infering gene functions, identifying other candidate funcational elements, and finding conserved genes missed from annotation in one genome or another.</p>
         <p>Transcriptomic data from multiple species are also being extensively used in comparative analysis. For example, direct comparison of multiple transcript datasets using genome annotation tools has been shown as an effective way to uncover 'unannotated' genes. In rice, 255 new candidate genes were identified by cross-species spliced alignment of expressed sequence tags and cDNA to the genome sequence <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. In this regard, the rich transcriptional activity documented from genomic tiling analysis constitutes an excellent complement to other tag-based transcriptome data. In the present tiling analysis of syntenic regions between two legume species, we identified over 300 unique TARs in both barrel medic and soybean in addition to the predicted exons. Transcripts tagged by these TARs should be useful for further comparison aimed at improving genome annotation and elucidating the transcriptome.</p>
         <p>Furthermore, comparison of transcription levels in six different organ types revealed that a large portion of the collinear genes between barrel medic and soybean exhibit different expression patterns (Figure <figr fid="F7">7</figr>). It should be noted that there is a segmental duplication of synteny block 2 (block 2i and 2ii) in barrel medic <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The process of subfunctionalization following gene duplication, where degenerative mutations in both genes result in the partitioning of ancestral functions or expression patterns in the duplicated genes, could, therefore, contribute to the observed expression divergence among the examined collinear genes between barrel medic and soybean. Further analysis of the <it>cis</it>-regulatory regions of the syntenic genes should help to identify the key regulatory sequence divergence that accounts for the differences in related legume species and add to our general knowledge of plant genome evolution and regulation.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We report here a transcriptional analysis using high-resolution tiling microarrays of syntenic regions totaling 1 Mb between the legume plants barrel medic and soybean in six different organ types. This analysis generated transcriptomic data that is useful for three purposes. First, we detected transcription of over 80% of the predicted genes in the interrogated genome regions in both legume species. As genome annotation is a reiterative process that is heavily dependent on experimental data, genomic tiling analysis is thus one valid option to meet the challenge of analyzing large-scale transcriptomic datasets for newly sequenced legume genomes. Second, we identified 499 and 660 TARs from barrel medic and soybean, respectively, over half of which are outside of the predicted exons. Further functional characterization of these candidate transcripts should be useful to better our understanding of the complexity and dynamics of the transcriptome of legume plants. Third, we used the tiling array data to detect differential gene expression and to compare transcription patterns of collinear genes. This novel approach was validated by the high confirmation rate by RT-PCR analysis of genes that are preferentially expressed in the nodule. Further investigation revealed that some collinear genes exhibited drastically different transcription patterns between the two species. Collectively, these results demonstrate that genomic tiling analysis is an effective approach to simultaneously complement computational annotation of newly available genome sequences and to facilitate comparative genomics aimed at elucidating genome organization and transcriptional regulation in closely related species.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Plant materials and treatments</p>
            </st>
            <p>Barrel medic (<it>Medicago truncatula </it>cv. <it>Jemalong A17</it>) seed was treated with concentrated H<sub>2</sub>SO<sub>4 </sub>for 10 minutess, rinsed with water and then allowed to germinate on moist filter paper at room temperature for a week. Seedlings 1-2 cm in length were planted in soil and maintained in the greenhouse with nitrogen-free plant nutrient solution as previously described <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Soybean (<it>Glycine max </it>cv. <it>William 82</it>) seed was directly sown in soil and maintained in the greenhouse with nitrogen-free plant nutrient solution as described by Subramanian <it>et al</it>. <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <p>The <it>Rhizobium </it>bacterium <it>Sinorhizobium meliloti </it>1021 and <it>Bradyrhizobium japonicum </it>USDA110 was used to inoculate barrel medic and soybean plants, respectively. The bacteria were grown in a yeast extract-mannitol medium for three days at 28&#176;C as previously described <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The bacterial cells were then suspended in nitrogen-free nutrient solution to an OD<sub>600 </sub>of 0.08 and used to water four-week-old plants. This flood-inoculation step was repeated after two weeks. The nodules were collected three weeks after the second treatment. Each nodule was separated from the roots with sharp tweezers and placed on dry ice immediately. The stem, root, and leaf organs were harvested from four-week-old plants that were maintained with nitrogen containing plant nutrient solution. The same plants were maintained until maturity for collection of the flower and seed organs.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence selection and gene prediction</p>
            </st>
            <p>Soybean sequences from four bacterial artificial chromosomes (BACs) were obtained from GenBank (accession numbers: <ext-link ext-link-type="gen" ext-link-id="AX196294.1">AX196294.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AX196295.1">AX196295.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AX196297.1">AX196297.1</ext-link>, and <ext-link ext-link-type="gen" ext-link-id="AX197417.1">AX197417.1</ext-link>). The BACs AX196294 and AX196295, and AX196297 and AX197417 form two contigs. There is a physical gap (represented by 100 Ns) between AX196294 and AX196295, and an approximately 50 Kb overlap between AX196297 and AX197417. Thus, the two contigs represent a total of 977 Kb of non-redundant sequences. Putative homologs to these soybean sequences in barrel medic were identified from sequenced barrel medic BACs as previously reported <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. A total of 12 BACs (accession numbers: <ext-link ext-link-type="gen" ext-link-id="AC141115.22">AC141115.22</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC149303.10">AC149303.10</ext-link>, <ext-link ext-link-type="gen" ext-link-id="CR378662.1">CR378662.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="CR378661.1">CR378661.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC142498.20">AC142498.20</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC146585.18">AC146585.18</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY224188.1">AY224188.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC146706.8">AC146706.8</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY224189.1">AY224189.1</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC146705.11">AC146705.11</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AC144644.3">AC144644.3</ext-link>, and <ext-link ext-link-type="gen" ext-link-id="AC146683.9">AC146683.9</ext-link>) were identified. The sequences were aligned and merged in regions of sequence overlap on the basis of &#8805;99% identity <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The merged barrel medic sequences form 7 contigs and total 1,060 Kb. Genes were predicted from soybean and barrel medic sequence contigs using FGENESH <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Both the dicot plants (<it>Arabidopsis</it>) and the <it>Medicago </it>(legume plant) matrix was used and their output compared.</p>
         </sec>
         <sec>
            <st>
               <p>Tiling microarray design, production, and hybridization</p>
            </st>
            <p>Both the barrel medic and soybean tiling arrays were produced on the Maskless Array Synthesizer platform as previously described <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B8">8</abbr><abbr bid="B34">34</abbr></abbrgrp>. Briefly, tiling paths consisting of 36-mer oligonucleotides offset by five nucleotides were designed to represent both DNA strands of the selected barrel medic and soybean genome sequence. Probes were synthesized at a feature-density of 390,000 probes per array in a 'chessboard' design <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Microarray production and storage were carried out previously described <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B8">8</abbr><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <p>Total RNA and mRNA were sequentially isolated using the RNeasy Plant Mini kit (Qiagen, Valencia, CA, USA) and the Oligotex mRNA kit (Qiagen), respectively, according to the manufacturers' recommendations. mRNA from different organ types was reverse transcribed using a mixture of oligo(dT)<sub>18 </sub>and random nonamer primers <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B34">34</abbr></abbrgrp>, during which amino-allyl-modified dUTP (aa-dUTP) was incorporated. The aa-dUTP decorated cDNA was fluorescent labeled by conjugating the monofunctional Cy3 dye (GE Healthcare, Piscataway, NJ, USA) to the amino-allyl functional groups in the cDNA. We used 2 &#956;g dye-labeled targets for hybridization as previously described <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B8">8</abbr><abbr bid="B34">34</abbr></abbrgrp>. Tiling microarray design and experimental data are available in the NCBI Gene Expression Omnibus under the SuperSeries GSE10151, which is composed of subset series GSE10055 and GSE10056 for the barrel medic and soybean arrays, respectively.</p>
         </sec>
         <sec>
            <st>
               <p>
                  <b>Tiling array analysis of gene expression</b>
               </p>
            </st>
            <p>Raw microarray data were first Log<sub>2 </sub>transformed and then quantile normalized for the six organ types. A sign-test was used to determine transcription of the predicted gene <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B35">35</abbr></abbrgrp>. First, probes lying within the exons of a predicted gene were checked to determine if their intensity was greater than the median of all probes in the array. Next, we determined whether or not the number of probes with intensity above the median was more than expected by chance alone. The probability, p, of obtaining h probes with intensity above median out of N probes is given by the equation:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2008-9-3-r57-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtext>p</m:mtext>
                           <m:mo>=</m:mo>
                           <m:mtext>0</m:mtext>
                           <m:msup>
                              <m:mrow>
                                 <m:mtext>.5</m:mtext>
                              </m:mrow>
                              <m:mtext>N</m:mtext>
                           </m:msup>
                           <m:mtext>&#8201;</m:mtext>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mtext>i=h</m:mtext>
                                 </m:mrow>
                                 <m:mtext>N</m:mtext>
                              </m:munderover>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mtext>N</m:mtext>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mtext>i</m:mtext>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8GiVeY=Pipec8Eeeu0xXdbba9frFj0xb9Lqpepeea0xd9q8qiYRWxGi6xij=hbbc9s8aq0=yqpe0xbbG8A8frFve9Fve9Fj0dmeaabaqaciGacaGaaeqabaqabeGadaaakeaacaqGWbaccaGae8xpa0Jaaeimaiaab6cacaqG1aWaaWbaaSqabeaacaqGobaaaOGaaGPaVpaaqahabaWaaeWaaeaafaqabeGabaaabaGaaeOtaaqaaiaabMgaaaaacaGLOaGaayzkaaaaleaacaqGPbGaaeypaiaabIgaaeaacaqGobaaniabggHiLdaaaa@3FED@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Finally, genes with a <it>p</it>-value smaller than 0.05 were considered as detected.</p>
            <p>Determination of differential gene expression was carried out based on the Tukey's median polish procedure, which fits an additive linear model <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B36">36</abbr></abbrgrp>. The expression level of a given gene in the six organ types, <it>M</it>, is estimated by the equation:</p>
            <p>
               <display-formula><it>O</it><sub><it>ij </it></sub>= <it>M </it>+ <it>a</it><sub><it>i </it></sub>+ <it>v</it><sub><it>j </it></sub>+ <it>e</it><sub><it>j</it></sub></display-formula>
            </p>
            <p>where <it>O</it><sub><it>ij </it></sub>is the observed intensity of the <it>i</it><sup>th </sup>probe in the <it>j</it><sup>th </sup>organ type, <it>a</it><sub><it>i </it></sub>is the probe affinity effect of the <it>i</it><sup>th </sup>probe, <it>v</it><sub><it>j </it></sub>is the organ type variation of the <it>j</it><sup>th </sup>organ type, and <it>e</it><sub><it>j </it></sub>is the experimental error. A series of iterations of residue subtraction was performed using Tukey's median polishing until the matrix reaches a stable status, and the organ type deviation <it>v </it>was determined for each organ type. To obtain the false negative error rate, a permutation test was applied. The intensities of every probe were randomly shuffled among the organ types and median polishing repeated 1,000 times. A significant level at <it>p </it>&lt; 0.001 was selected for organ-specific expression calling.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of TARs</p>
            </st>
            <p>To identify TARs in each organ type, the Log<sub>2</sub>-transformed, quantile-normalized data that were anchored to the chromosome coordinates were scanned from the 5' end of each DNA strand one probe at a time. TARs were identified as regions with length >70 nucleotides (spanning roughly 14 probes) with no 2 consecutive probes having an intensity below the cutoff of 11.0.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of collinear genes</p>
            </st>
            <p>To identify collinear genes, repetitive sequences were masked and tandemly duplicated genes were counted as one. The BLAT <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> program was then used to search the protein sequences of the predicted genes between barrel medic and soybean. The gene pairs with more than 10 amino acids mapped and more than 40% identity over the entire mapped sequences were identified as potential collinear genes, which were further manually inspected for gene order and orientation.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>BAC, bacterial artificial chromosome; TAR, transcriptionally active region.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>X-WD and NDY conceived the project; X-WD and LL designed the research; LL performed experiments; HH, LL, XW, VS, and WT analyzed the data; LL wrote the paper; JZ, OY, and SB provided plant materials.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available. Additional data file <supplr sid="S1">1</supplr> is a table showing a summary of predicted genes in the 1 Mb syntenic regions between <it>Medicago truncatula </it>and <it>Glycine max</it>. Additional data file <supplr sid="S2">2</supplr> is a table listing the <it>Medicago truncatula </it>genes preferentially expressed in the nodule. Additional data file <supplr sid="S3">3</supplr> is a table listing <it>Glycine max </it>genes preferentially expressed in the nodule.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Predicted genes in the 1 Mb syntenic regions between <it>Medicago truncatula </it>and <it>Glycine max</it></p>
            </caption>
            <text>
               <p>Predicted genes in the 1 Mb syntenic regions between <it>Medicago truncatula </it>and <it>Glycine max</it>.</p>
            </text>
            <file name="gb-2008-9-3-r57-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p><it>Medicago truncatula </it>genes preferentially expressed in the nodule</p>
            </caption>
            <text>
               <p><it>Medicago truncatula </it>genes preferentially expressed in the nodule.</p>
            </text>
            <file name="gb-2008-9-3-r57-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p><it>Glycine max </it>genes preferentially expressed in the nodule</p>
            </caption>
            <text>
               <p><it>Glycine max </it>genes preferentially expressed in the nodule.</p>
            </text>
            <file name="gb-2008-9-3-r57-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Xuanli Yao at Yale University for assistance in plant sample collection, and Terry Graham at Ohio State University for the <it>Sinorhizobium </it>strain. This work was supported by a grant from the NSF Plant Genome Program (DBI-0421675) to X-WD.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Applications of DNA tiling arrays for whole-genome analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Mockler</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sundaresan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Ecker</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2005</pubdate>
            <volume>85</volume>
            <fpage>1</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2004.10.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15607417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Dark matter in the genome: evidence of widespread transcription detected by microarray tiling experiments.</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schadt</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>93</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.12.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15661355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>RNA expression analysis using a 30 base pair resolution <it>Escherichia coli </it>genome array.</p>
            </title>
            <aug>
               <au>
                  <snm>Selinger</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Mei</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Richmond</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>1262</fpage>
            <lpage>1268</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/82367</pubid>
                  <pubid idtype="pmpid" link="fulltext">11101804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Large-scale transcriptional activity in chromosomes 21 and 22.</p>
            </title>
            <aug>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cawley</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Strausberg</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fodor</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>916</fpage>
            <lpage>919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068597</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Empirical analysis of transcriptional activity in the <it>Arabidopsis </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Yamada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dale</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shinn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Palm</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Southwick</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pham</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cheuk</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Karlin-Newmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>SX</fnm>
               </au>
               <au>
                  <snm>Lam</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sakano</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Miranda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Quach</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Tripp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Toriumi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Onodera</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ansari</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>842</fpage>
            <lpage>846</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1088305</pubid>
                  <pubid idtype="pmpid" link="fulltext">14593172</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Global identification of human transcribed sequences with genome tiling arrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Bertone</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Royce</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Rozowsky</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Urban</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rinn</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Samanta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weissman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <fpage>2242</fpage>
            <lpage>2246</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1103388</pubid>
                  <pubid idtype="pmpid" link="fulltext">15539566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dike</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tammana</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Helt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sementchenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Ganesh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Gerhard</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>308</volume>
            <fpage>1149</fpage>
            <lpage>1154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1108625</pubid>
                  <pubid idtype="pmpid" link="fulltext">15790807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genome-wide transcription analyses in rice using tiling microarrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>124</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1704</pubid>
                  <pubid idtype="pmpid" link="fulltext">16369532</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Tiling microarray analysis of rice chromosome 10 to identify the transcriptome and relate its expression to chromosomal architecture.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R52</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2005-6-6-r52</pubid>
                  <pubid idtype="pmpid" link="fulltext">15960804</pubid>
                  <pubid idtype="pmcid">1175972</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Global identification and characterization of transcriptionally active regions in the rice genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Sasidharan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Korbel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ronald</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e294</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1808428</pubid>
                  <pubid idtype="pmpid" link="fulltext">17372628</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000294</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The genomes of <it>Oryza sativa</it>: a history of duplications.</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cong</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Geng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lv</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ran</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e38</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">546038</pubid>
                  <pubid idtype="pmpid" link="fulltext">15685292</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030038</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Intra- and interspecific variation in primate gene expression patterns.</p>
            </title>
            <aug>
               <au>
                  <snm>Enard</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Khaitovich</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Klose</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Z&#246;llner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Heissig</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Giavalisco</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nieselt-Struwe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Muchmore</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Varki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ravid</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Doxiadis</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Bontrop</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>P&#228;&#228;bo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>340</fpage>
            <lpage>343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068996</pubid>
                  <pubid idtype="pmpid" link="fulltext">11951044</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Sex-dependent gene expression and evolution of the <it>Drosophila </it>transcriptome.</p>
            </title>
            <aug>
               <au>
                  <snm>Ranz</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Castillo-Davis</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Meiklejohn</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <fpage>1742</fpage>
            <lpage>1745</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1085881</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Common pattern of evolution of gene expression level and protein sequence in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Nuzhdin</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Wayne</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Harmon</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>McIntyre</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1308</fpage>
            <lpage>1317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh128</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>From wild wolf to domestic dog: gene expression changes in the brain.</p>
            </title>
            <aug>
               <au>
                  <snm>Saetre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lindberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Leonard</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Olsson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pettersson</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Ellegren</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bergstrom</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Vila</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jazin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Brain Res Mol Brain Res</source>
            <pubdate>2004</pubdate>
            <volume>126</volume>
            <fpage>198</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molbrainres.2004.05.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">15249144</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Expression profiling in primates reveals a rapid evolution of human transcription factors.</p>
            </title>
            <aug>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Oshlack</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <fpage>242</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04559</pubid>
                  <pubid idtype="pmpid" link="fulltext">16525476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees.</p>
            </title>
            <aug>
               <au>
                  <snm>Khaitovich</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hellmann</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Enard</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Nowick</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Leinweber</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Franz</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lachmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>P&#228;&#228;bo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>1850</fpage>
            <lpage>1854</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1108296</pubid>
                  <pubid idtype="pmpid" link="fulltext">16141373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genetic diversity contribution to errors in short oligonucleotide microarray analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Kirst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Caldo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Casati</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tanimoto</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Walbot</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Buckler</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Plant Biotechnol J</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>489</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17309725</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Multi-species microarrays reveal the effect of sequence divergence on gene expression profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rifkin</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Bertone</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>674</fpage>
            <lpage>680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088295</pubid>
                  <pubid idtype="pmpid" link="fulltext">15867429</pubid>
                  <pubid idtype="doi">10.1101/gr.3335705</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Using DNA microarrays to study gene expression in closely related species.</p>
            </title>
            <aug>
               <au>
                  <snm>Oshlack</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chabot</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>1235</fpage>
            <lpage>1242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm111</pubid>
                  <pubid idtype="pmpid" link="fulltext">17384014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The rest of the iceberg. Legume diversity and evolution in a phylogenetic context.</p>
            </title>
            <aug>
               <au>
                  <snm>Doyle</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Luckow</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>131</volume>
            <fpage>900</fpage>
            <lpage>910</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1540290</pubid>
                  <pubid idtype="pmpid" link="fulltext">12644643</pubid>
                  <pubid idtype="doi">10.1104/pp.102.018150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Sequencing <it>Medicago truncatula </it>expressed sequenced tags using 454 Life Sciences technology.</p>
            </title>
            <aug>
               <au>
                  <snm>Cheung</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>272</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1635983</pubid>
                  <pubid idtype="pmpid" link="fulltext">17062153</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-272</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The Gene Index Project</p>
            </title>
            <url>http://compbio.dfci.harvard.edu/tgi/plant.html</url>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Sequencing the genespaces of <it>Medicago truncatula </it>and <it>Lotus japonicus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Cannon</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>137</volume>
            <fpage>1174</fpage>
            <lpage>1181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088310</pubid>
                  <pubid idtype="pmpid" link="fulltext">15824279</pubid>
                  <pubid idtype="doi">10.1104/pp.104.057034</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Legume genome evolution viewed through the <it>Medicago truncatula </it>and <it>Lotus japonicus </it>genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Cannon</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Sterck</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gouzy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Mudge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vasdewani</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schiex</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Spannagl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Monaghan</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nicholson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Humphray</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schoof</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Mayer</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Qu&#233;tier</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Oldroyd</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Debell&#233;</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Retzel</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>14959</fpage>
            <lpage>14964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1578499</pubid>
                  <pubid idtype="pmpid" link="fulltext">17003129</pubid>
                  <pubid idtype="doi">10.1073/pnas.0603228103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p><it>Medicago truncatula </it>Sequencing Resources</p>
            </title>
            <url>http://www.medicago.org/genome/downloads/Mt2/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The Phytozome Project</p>
            </title>
            <url>http://www.phytozome.net/soybean</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Comparative genome analysis of mungbean (<it>Vigna radiata </it>L. Wilczek) and cowpea (<it>V. unguiculata </it>L. Walpers) using RFLP mapping data.</p>
            </title>
            <aug>
               <au>
                  <snm>Menancio-Hautea</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fatokun</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Danesh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>1993</pubdate>
            <volume>86</volume>
            <fpage>797</fpage>
            <lpage>810</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00212605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Genome conservation among three legume genera detected with DNA markers.</p>
            </title>
            <aug>
               <au>
                  <snm>Boutin</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z-H</fnm>
               </au>
               <au>
                  <snm>Vallejos</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Genome</source>
            <pubdate>1995</pubdate>
            <volume>38</volume>
            <fpage>928</fpage>
            <lpage>937</lpage>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Estimating genome conservation between crop and model legume species.</p>
            </title>
            <aug>
               <au>
                  <snm>Choi</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Mun</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Baek</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Mudge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kiss</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>15289</fpage>
            <lpage>15294</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524433</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489274</pubid>
                  <pubid idtype="doi">10.1073/pnas.0402251101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A decade of QTL mapping for cyst nematode resistance in soybean.</p>
            </title>
            <aug>
               <au>
                  <snm>Concibido</snm>
                  <fnm>VC</fnm>
               </au>
               <au>
                  <snm>Diers</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Arelli</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>Crop Science</source>
            <pubdate>2004</pubdate>
            <volume>44</volume>
            <fpage>1121</fpage>
            <lpage>1131</lpage>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Highly syntenic regions in the genomes of soybean, <it>Medicago truncatula</it>, and <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mudge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cannon</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Kalo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Oldroyd</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
            </aug>
            <source>BMC Plant Biol</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1201151</pubid>
                  <pubid idtype="pmpid" link="fulltext">16102170</pubid>
                  <pubid idtype="doi">10.1186/1471-2229-5-15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>FGENESH Gene Prediction Program</p>
            </title>
            <url>http://www.softberry.com</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A pilot study of transcription unit analysis in rice using oligonucleotide tiling-path microarray.</p>
            </title>
            <aug>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>137</fpage>
            <lpage>149</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-005-6164-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">16217608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping.</p>
            </title>
            <aug>
               <au>
                  <snm>Royce</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Rozowsky</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bertone</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Samanta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Weissman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>466</fpage>
            <lpage>475</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1855044</pubid>
                  <pubid idtype="pmpid" link="fulltext">15979196</pubid>
                  <pubid idtype="doi">10.1016/j.tig.2005.06.007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>NMPP: a user-customized NimbleGen microarray data processing pipeline.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>2955</fpage>
            <lpage>2957</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl525</pubid>
                  <pubid idtype="pmpid" link="fulltext">17038341</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p><it>Lotus japonicus </it>gene Ljsbp is highly conserved among plants and animals and encodes a homologue to the mammalian selenium-binding proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Flemetakis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Agalou</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kavroulakis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dimou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Martsikovskaya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Spaink</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Roussis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Katinakis</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Plant Microbe Interact</source>
            <pubdate>2002</pubdate>
            <volume>15</volume>
            <fpage>313</fpage>
            <lpage>322</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1094/MPMI.2002.15.4.313</pubid>
                  <pubid idtype="pmpid" link="fulltext">12026169</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Comparative analysis of gene-expression patterns in human and African great ape cultured fibroblasts.</p>
            </title>
            <aug>
               <au>
                  <snm>Karaman</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Houck</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Chemnick</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Nagpal</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chawannakul</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sudano</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pike</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Hacia</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1619</fpage>
            <lpage>1630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403735</pubid>
                  <pubid idtype="pmpid" link="fulltext">12840040</pubid>
                  <pubid idtype="doi">10.1101/gr.1289803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>CGKB: an annotation knowledge base for cowpea (<it>Vigna unguiculata </it>L.) methylation filtered genomic genespace sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Laudeman</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Rushton</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Spraggins</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Timko</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>129</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1868039</pubid>
                  <pubid idtype="pmpid" link="fulltext">17445272</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-8-129</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Promoter trapping in <it>Lotus japonicus </it>reveals novel root and nodule GUS expression domains.</p>
            </title>
            <aug>
               <au>
                  <snm>Buzas</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Lohar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vickers</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Stiller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gresshoff</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Plant Cell Physiol</source>
            <pubdate>2005</pubdate>
            <volume>46</volume>
            <fpage>1202</fpage>
            <lpage>1212</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/pcp/pci129</pubid>
                  <pubid idtype="pmpid" link="fulltext">15899881</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Paleopolyploidy and gene duplication in soybean and other legumes.</p>
            </title>
            <aug>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Schlueter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2006</pubdate>
            <volume>9</volume>
            <fpage>104</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pbi.2006.01.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">16458041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Distinct, crucial roles of flavonoids during legume nodulation.</p>
            </title>
            <aug>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stacey</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>2007</pubdate>
            <volume>12</volume>
            <fpage>282</fpage>
            <lpage>285</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tplants.2007.06.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">17591456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <fpage>e58</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">261895</pubid>
                  <pubid idtype="pmpid" link="fulltext">14624258</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0000058</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Improvement of whole-genome annotati