<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-2-r34</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Ancora: a web resource for exploring highly conserved noncoding elements and their association with developmental regulatory genes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Engstr&#246;m</snm>
               <mi>G</mi>
               <fnm>P&#228;r</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>par.engstrom@bccs.uib.no</email>
            </au>
            <au id="A2">
               <snm>Fredman</snm>
               <fnm>David</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>david.fredman@bccs.uib.no</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Lenhard</snm>
               <fnm>Boris</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>boris.lenhard@bccs.uib.no</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Computational Biology Unit, Bergen Center for Computational Science, University of Bergen, Thorm&#248;hlensgate, N-5008 Bergen, Norway</p>
            </ins>
            <ins id="I2">
               <p>Sars Centre for Marine Molecular Biology, University of Bergen, N-5008 Bergen, Norway</p>
            </ins>
            <ins id="I3">
               <p>Programme for Genomics and Bioinformatics, Department of Cell and Molecular Biology, Karolinska Institutet, S-17177 Stockholm, Sweden</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>2</issue>
         <fpage>R34</fpage>
         <url>http://genomebiology.com/2008/9/2/R34</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18279518</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-2-r34</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>12</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>20</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>15</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>02</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Engstr&#246;m et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Exploring highly conserved noncoding elements</p>
      </shorttitle>
      <shortabs>
         <p>Ancora is a web resource that provides data and tools for exploring genomic organization of highly conserved noncoding elements for multiple genomes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Metazoan genomes contain arrays of highly conserved noncoding elements (HCNEs) that span developmental regulatory genes and define regulatory domains. We describe Ancora <url>http://ancora.genereg.net</url>, a web resource that provides data and tools for exploring genomic organization of HCNEs for multiple genomes. Ancora includes a genome browser that shows HCNE locations and features novel HCNE density plots as a powerful tool to discover developmental regulatory genes and distinguish their regulatory elements and domains.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010005">Development</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Rationale</p>
         </st>
         <p>Comparisons of metazoan genome sequences have revealed an abundance of genomic segments that are highly conserved across large evolutionary distances even though they do not encode proteins and do not tend to be near transcription start sites. For example, 256 non-exonic segments longer than 200 bp were found to be perfectly conserved between human, mouse and rat genomes; 140 of these were more than 10 kb away from any known gene <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Using less stringent criteria for length and sequence similarity, other investigators have found thousands of non-exonic segments in the human genome that are conserved in organisms as distant as fugu <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> and shark <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Several lines of evidence indicate that these highly conserved noncoding elements (HCNEs) play a fundamental role in regulating animal development and constraining genome evolution. In vertebrates, insects and worms, HCNEs tend to cluster in the vicinity of developmental regulatory genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Through experiments in transgenic animals in which cloned HCNEs are tested for the ability to drive transcription of a reporter gene, many HCNE sequences have shown the ability to induce part of the embryonic expression pattern of a developmental regulatory gene located in the genomic neighborhood of the endogenous HCNE <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. These experiments have associated HCNEs and developmental genes separated by considerable genomic distances, up to 800 kb in human <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, suggesting that many HCNEs act as long-range regulatory elements. Hundreds of HCNEs have now been characterized as developmental enhancers in transgenic mice, frogs or zebrafish, and the list is growing rapidly <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>The emerging model for explaining these observations is that an array of HCNEs defines a region of regulatory inputs of its target gene(s), and that the full complement of those inputs results in the expression pattern of the gene <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. If this notion that HCNE arrays constitute regulatory domains is correct, chromosomal rearrangements within HCNE arrays should be selected against in evolution <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Accordingly, large HCNE arrays have been found to correspond to the largest and most deeply conserved blocks of synteny across vertebrates <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and across insects <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. In addition to HCNE arrays and their target genes, many of these synteny blocks contain unrelated (bystander) genes that do not appear to be regulated by the HCNEs, although they can be situated between HCNEs and target genes, as well as contain HCNEs in their introns. Kikuta <it>et al</it>. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> termed these synteny blocks 'genomic regulatory blocks' (GRBs) and demonstrated that, for some GRBs, it is possible to distinguish bystander from target genes by comparing mammalian genome sequences with those of teleost fish (such as fugu and zebrafish). This is facilitated by a whole-genome duplication event that occurred in the teleost lineage <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and caused each GRB to be present in two copies, thereby allowing some bystander genes to be disentangled from HCNE arrays during the subsequent rediploidization <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>Despite a rising interest in HCNEs in the genomics and evo-devo community, there has been a lack of resources that provide information about HCNEs and allow researchers to explore the distribution of HCNEs along chromosomes. Here, we describe Ancora <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, a web resource consisting of: a genome browser where HCNE locations and HCNE density plots can be viewed over different genomes, with a number of adjustable parameters; data files that allow users to easily view HCNE locations and densities in the UCSC Genome Browser <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>; and a service that allows users to view HCNE data in the Ensembl browser <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> through the distributed annotation system (DAS) protocol for sharing sequence annotations <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We demonstrate how Ancora can be used to discover developmental regulatory genes and distinguish their chromosomal regulatory domains that correspond to the GRBs described above. The visualization of these regulatory domains is the most powerful and novel function of Ancora. We anticipate that Ancora will be particularly useful for assigning distal regulatory elements to their target genes, and for the discovery of hitherto unknown developmental regulatory genes, including noncoding RNAs.</p>
      </sec>
      <sec>
         <st>
            <p>A comprehensive HCNE database</p>
         </st>
         <p>Ancora rests on a database of HCNEs conserved between various metazoan genomes (Figure <figr fid="F1">1</figr>). Building on our previously described strategies for detecting HCNEs <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B18">18</abbr></abbrgrp> we have created a refined procedure that is not biased against a chosen base genome and better captures HCNEs duplicated in genome evolution. We identify HCNEs by scanning pairwise BLASTZ net whole-genome alignments (nets) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> downloaded from the UCSC Genome Browser database <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> for regions with at least <it>I </it>identities over <it>C </it>alignment columns. Because different similarity criteria may be appropriate for different loci and investigations, we scan for conserved elements using at least two different window sizes (<it>C </it>= 30 and <it>C </it>= 50) and several different similarity thresholds (<it>I</it>/<it>C</it>) in the range 70-100% for each species pair. The algorithm that creates net alignments is designed to retain only the best alignment for each position in one of the genomes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. For each pairwise comparison, we therefore scan two sets of nets (one from the perspective of each genome) in order not to miss elements duplicated in either lineage. This is particularly important for comparisons between teleost fish and other vertebrates, because of the whole-genome duplication that occurred in the teleost lineage <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. We subsequently merge highly conserved elements that overlap on both genomes, but not elements that coincide on only one of the genomes, so that duplicated elements remain distinct. After discarding elements whose genome coordinates overlap by one or more base-pairs with annotated exons, we remove repetitive sequences by considering overlap with known repeats and the number of high-identity alignments obtained by realignment of each sequence against the two respective genomes. We consider remaining elements as HCNEs. The exon and repeat annotations we use, and the realignment parameters we employ, are listed on the Ancora web site, where an up-to-date description of our HCNE detection procedure is maintained. To illustrate the effect of parameter changes on the number of HCNEs detected, Table <tblr tid="T1">1</tblr> lists HCNE counts for some selected settings and genomes.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Counts for selected HCNE sets</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c cspan="2" ca="left">
                     <p>Criteria for HCNE detection</p>
                  </c>
                  <c cspan="4" ca="right">
                     <p>Number of HCNEs detected in indicated comparison</p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Minimum identity</p>
                  </c>
                  <c ca="right">
                     <p>Minimum size (bp)</p>
                  </c>
                  <c ca="right">
                     <p>Human vs mouse</p>
                  </c>
                  <c ca="right">
                     <p>Human vs chicken</p>
                  </c>
                  <c ca="right">
                     <p>Human vs zebrafish</p>
                  </c>
                  <c ca="right">
                     <p>Zebrafish vs <it>Tetraodon</it></p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>80% over 30 columns</p>
                  </c>
                  <c ca="right">
                     <p>30</p>
                  </c>
                  <c ca="right">
                     <p>NC</p>
                  </c>
                  <c ca="right">
                     <p>125,174</p>
                  </c>
                  <c ca="right">
                     <p>19,596</p>
                  </c>
                  <c ca="right">
                     <p>57,681</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>90% over 30 columns</p>
                  </c>
                  <c ca="right">
                     <p>30</p>
                  </c>
                  <c ca="right">
                     <p>NC</p>
                  </c>
                  <c ca="right">
                     <p>78,831</p>
                  </c>
                  <c ca="right">
                     <p>8,260</p>
                  </c>
                  <c ca="right">
                     <p>26,157</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>96% over 30 columns</p>
                  </c>
                  <c ca="right">
                     <p>30</p>
                  </c>
                  <c ca="right">
                     <p>305,015</p>
                  </c>
                  <c ca="right">
                     <p>50,478</p>
                  </c>
                  <c ca="right">
                     <p>3,656</p>
                  </c>
                  <c ca="right">
                     <p>10,205</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>100%</p>
                  </c>
                  <c ca="right">
                     <p>30</p>
                  </c>
                  <c ca="right">
                     <p>150,487</p>
                  </c>
                  <c ca="right">
                     <p>35,338</p>
                  </c>
                  <c ca="right">
                     <p>1,721</p>
                  </c>
                  <c ca="right">
                     <p>4,737</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>70% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>NC</p>
                  </c>
                  <c ca="right">
                     <p>93,162</p>
                  </c>
                  <c ca="right">
                     <p>16,725</p>
                  </c>
                  <c ca="right">
                     <p>45,828</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>80% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>NC</p>
                  </c>
                  <c ca="right">
                     <p>63,304</p>
                  </c>
                  <c ca="right">
                     <p>7,169</p>
                  </c>
                  <c ca="right">
                     <p>25,997</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>90% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>265,537</p>
                  </c>
                  <c ca="right">
                     <p>36,794</p>
                  </c>
                  <c ca="right">
                     <p>3,127</p>
                  </c>
                  <c ca="right">
                     <p>8,610</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>95% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>107,860</p>
                  </c>
                  <c ca="right">
                     <p>22,530</p>
                  </c>
                  <c ca="right">
                     <p>1,228</p>
                  </c>
                  <c ca="right">
                     <p>3,078</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>98% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>68,600</p>
                  </c>
                  <c ca="right">
                     <p>17,579</p>
                  </c>
                  <c ca="right">
                     <p>763</p>
                  </c>
                  <c ca="right">
                     <p>1,782</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>100%</p>
                  </c>
                  <c ca="right">
                     <p>50</p>
                  </c>
                  <c ca="right">
                     <p>34,785</p>
                  </c>
                  <c ca="right">
                     <p>11,934</p>
                  </c>
                  <c ca="right">
                     <p>330</p>
                  </c>
                  <c ca="right">
                     <p>754</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>90% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>100</p>
                  </c>
                  <c ca="right">
                     <p>81,065</p>
                  </c>
                  <c ca="right">
                     <p>15,339</p>
                  </c>
                  <c ca="right">
                     <p>733</p>
                  </c>
                  <c ca="right">
                     <p>1,695</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>95% over 50 columns</p>
                  </c>
                  <c ca="right">
                     <p>100</p>
                  </c>
                  <c ca="right">
                     <p>25,801</p>
                  </c>
                  <c ca="right">
                     <p>7,901</p>
                  </c>
                  <c ca="right">
                     <p>188</p>
                  </c>
                  <c ca="right">
                     <p>450</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>100%</p>
                  </c>
                  <c ca="right">
                     <p>100</p>
                  </c>
                  <c ca="right">
                     <p>4,919</p>
                  </c>
                  <c ca="right">
                     <p>2,475</p>
                  </c>
                  <c ca="right">
                     <p>20</p>
                  </c>
                  <c ca="right">
                     <p>61</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>100%</p>
                  </c>
                  <c ca="right">
                     <p>200</p>
                  </c>
                  <c ca="right">
                     <p>494</p>
                  </c>
                  <c ca="right">
                     <p>365</p>
                  </c>
                  <c ca="right">
                     <p>0</p>
                  </c>
                  <c ca="right">
                     <p>2</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Counts indicate the number of HCNEs obtained by collapsing HCNEs onto the assembled chromosomes of selected reference genomes. HCNEs were counted in this way to reduce redundancy and thereby make counts more comparable between data sets. The underlying Ancora data sets are not biased by selecting either genome as a reference. Note that HCNEs are generally larger than the window size (30 or 50) used to identify them because the procedure that detects HCNEs merges overlapping conserved elements. NC, not calculated.</p>
            </tblfn>
         </tbl>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Comparisons available in Ancora</p>
            </caption>
            <text>
               <p>Comparisons available in Ancora. Shaded boxes correspond to genomes shown in the Ancora genome browser. Connecting lines indicate pairwise genome comparisons for which HCNEs are available in Ancora. The following genome assemblies underlie the current data sets: human NCBI 36, mouse NCBI 36 and 37, chicken v2.1 [41], <it>Xenopus tropicalis </it>v4.1 (US DoE Joint Genome Institute), zebrafish Zv6 and Zv7 (The Wellcome Trust Sanger Institute), fugu v4.0 [42], <it>Tetraodon nigroviridis </it>V7 [19], stickleback v1.0 (The Broad Institute), medaka v1.0 [43], <it>D. melanogaster </it>rel. 5 [44], <it>D. pseudoobscura </it>rel. 2 [45] and the February 2006 releases of <it>D. ananassae</it>, <it>D. virilis and D. mojavensis </it>[46].</p>
            </text>
            <graphic file="gb-2008-9-2-r34-1"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Exploring HCNEs and GRBs with the Ancora genome browser</p>
         </st>
         <p>Ancora contains a genome browser designed to explore the distribution of HCNEs on metazoan chromosomes (Figure <figr fid="F2">2a</figr>). The browser is currently set up to show the genomes of human, mouse, zebrafish and <it>Drosophila melanogaster</it>; we aim to expand this list in the future.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>A 1.7 Mb region around the human <it>SHOX2 </it>gene</p>
            </caption>
            <text>
               <p>A 1.7 Mb region around the human <it>SHOX2 </it>gene. <b>(a) </b>Ancora genome browser main view. <it>SHOX2</it>, a homeobox gene implicated in limb development [47], is embedded in an array of HCNEs detected by comparison with mouse and zebrafish genomes. Overlaid density plots show densities of HCNEs detected at similarity thresholds of 95% (yellow), 98% (orange) and 100% (red) in the mouse comparison and similarity thresholds of 70%, 80% and 90% in the zebrafish comparison, over a 50 column sliding window. Note that the density of the most strongly conserved HCNEs (red) peaks around <it>SHOX2</it>. Synteny blocks are based on net alignments with the zebrafish genome [18]; boxes indicate aligned segments, connecting lines indicate gaps and labels indicate alignment orientation and position in the zebrafish genome assembly. The centrally shown synteny block encompasses <it>SHOX2</it>, <it>RSRC1 </it>(a gene of unknown function) and the array of HCNEs conserved in zebrafish. <b>(b) </b>Conservation profiles for the same region in the UCSC Genome Browser [21]. Comparison between (a) and (b) demonstrates qualitatively different information provided by the HCNE density plots in (a).</p>
            </text>
            <graphic file="gb-2008-9-2-r34-2"/>
         </fig>
         <sec>
            <st>
               <p>Basic usage</p>
            </st>
            <p>To put HCNEs in context, the browser also shows gene annotation from NCBI <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, Ensembl <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, the UCSC Genome Browser <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, Mouse Genome Informatics <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, the Zebrafish Information Network <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, FlyBase <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> and miRBase <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, as well as a selection of other annotation tracks from UCSC. The user can click on gene models to bring up detailed gene information pages from the original data sources. By default, the HCNEs are colored by the chromosome they align to in the other genome. This simplifies the identification of conserved HCNE arrays: a stretch of HCNEs in the same color suggests a conserved array. To visualize the tendency of HCNE arrays to correspond to large synteny blocks, we have included tracks showing human-zebrafish synteny blocks and <it>Drosophila </it>synteny blocks from recent analyses <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B18">18</abbr></abbrgrp>. (The human-zebrafish synteny blocks should be interpreted with caution, however, because of artifacts in the underlying zebrafish genome assembly - in particular artificial segmental duplications, which may appear as overlapping synteny blocks on the human genome.) The user can move between the vertebrate genomes that the genome browser displays by clicking on HCNEs and synteny blocks, which link aligned regions from the different genomes. Ancora also provides links that bring up the same region in other major genome browsers (Ensembl, UCSC and FlyBase) and the VISTA browser, which is useful for detailed examination of sequence conservation <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>GBrowse extensions in Ancora</p>
            </st>
            <p>The Ancora genome browser was built using the GBrowse software <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, which is used by most model organism databases. The basic user interface should thus be familiar to most users. To visualize HCNE data in the most informative manner and to efficiently plot HCNE densities along entire chromosomes, we have extended GBrowse with a number of plugins and custom glyphs. The plugins that retrieve and render HCNE data can be configured by selecting a HCNE data set of interest from the 'Reports &amp; Analysis' menu above the 'Scroll/Zoom' controls or by clicking on a HCNE density plot (Figure <figr fid="F2">2a</figr>). On the configuration page (Figure <figr fid="F3">3</figr>), the user can select which similarity thresholds to show HCNEs and HCNE densities for, and configure additional properties of the density plots (see below).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>HCNE track configuration page</p>
               </caption>
               <text>
                  <p>HCNE track configuration page. Up to three HCNE sets from each pairwise comparison can be shown simultaneously. A set is selected by choosing a similarity threshold (for example, 70% identity over 50 alignment columns), and can be further restricted by an arbitrary threshold on HCNE size. Note that HCNEs may be larger than the window size (30 or 50 columns) used to identify them because the procedure that detects HCNEs merges overlapping conserved elements. For each selected set, the user can choose to see HCNE densities, HCNE locations, or both. Density plots for the different sets will be overlaid (Figure 2a), so that the plot for set two is drawn on top of that for set one, and the plot for set three drawn on top of that for set two. If the option to separate densities based on chromosomes in other genomes is used, the browser will attempt to create one density plot for each chromosome (in the other genome) for which there are HCNEs in the displayed region, or within half a sliding window to either side. If the resulting number of plots exceeds the number of plots requested on this configuration page, densities for the chromosomes with least HCNE sequence in this region will be combined into one plot labeled 'other' (Figure 5).</p>
               </text>
               <graphic file="gb-2008-9-2-r34-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Unique information revealed by HCNE density plots</p>
            </st>
            <p>Plots of HCNE density along chromosomes highlight regions that harbor large HCNE arrays and, thus, are likely to contain key developmental regulatory genes and correspond to regulatory domains <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B18">18</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Unlike conservation profiles, which can be seen in several other genome browsers <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B30">30</abbr><abbr bid="B34">34</abbr></abbrgrp>, HCNE density plots do not directly reflect conservation on the sequence/alignment level; instead, they show density distributions of HCNEs on a larger scale. The result is qualitatively different from a sequence based conservation plot such as the Conservation track in the UCSC Genome Browser (Figure <figr fid="F2">2</figr>, compare (a) and (b)): it clearly reveals chromosomal regions of extensive noncoding conservation (Figure <figr fid="F4">4</figr>) and points to the approximate extent of GRBs, as well as the most likely target gene(s) within those regions <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>HCNE density distributions on human chromosome 3</p>
               </caption>
               <text>
                  <p>HCNE density distributions on human chromosome 3. Shown are densities of HCNEs identified from comparison with mouse, chicken and three different fish genomes. This genome browser screenshot has been manually labeled with likely target genes of HCNE enhancer activity at major density peaks. Target genes were identified by zooming in to inspect gene annotations at each peak.</p>
               </text>
               <graphic file="gb-2008-9-2-r34-4"/>
            </fig>
            <p>We compute HCNE densities as the percentage of bases covered by HCNEs within a window of a given size. Because the genome browser computes HCNE densities on demand, the window size can be set by the user. The algorithm that computes the densities moves a window across the displayed chromosomal segment in steps of a size that is adapted to the size of the displayed segment. If the user zooms in to single-base resolution, densities are computed for every base shown. At lower resolutions, the step size is at least one step per pixel and ten steps per window. In our experience, this is more than sufficient for detecting peaks of interest. At resolutions where several density values are computed for each pixel, the plot shows the maximum density value per pixel, so that peaks are not omitted. By default, the browser displays overlaid density curves for HCNEs detected at three different sequence identity thresholds (Figure <figr fid="F2">2a</figr>). This allows users to easily locate regions with the most strongly conserved HCNEs and simultaneously delineate other HCNE-dense regions. The default window size for vertebrate genomes is 300 kb. It is important to note that this large window size leads to slopes of GRB signals extending outside the actual HCNE-spanned regions. To estimate the edges better, the user should consult synteny and HCNE location tracks, or decrease the window size in density plots. Despite this side effect, large window sizes are more appropriate for outlining GRB distribution along chromosomes, as well as for the determination of most likely target genes. It should also be noted that extremely high densities of HCNEs detected at the most stringent identity thresholds (high red density peaks) can originate from (rare) cross-species contamination of genome sequences. Users of the Ancora genome browser can identify such contamination as high HCNE densities coming from near-identical sequence segments confined to a single compared species. For example, much of <it>Xenopus tropicalis </it>scaffold 7291 is composed of fragments of near-identity to human chromosome 5, even though these regions have no HCNEs conserved in mouse, chicken or fish.</p>
         </sec>
         <sec>
            <st>
               <p>Discovering genes that encode developmental regulators</p>
            </st>
            <p>Since there is a strong association between HCNE arrays and developmental regulatory genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, it is likely that most regions of high HCNE-density contain at least one developmental regulatory gene, even in cases where no such gene has been annotated. Inspection of HCNE density can thus be used to formulate hypotheses about gene function and identify likely target genes of putative enhancer activity of HCNEs. In a study from 2004, Sandelin <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> identified HCNEs conserved among human, mouse and fugu, and closely inspected the 50 most HCNE-rich regions for the presence of developmental regulatory genes. They found 41 of these regions to contain a gene known to be involved in embryonic development. Of the remaining nine regions, seven contained a gene known to be a transcription factor or predicted as such based on homology. In a recent study, one of these transcription factor genes (FLJ20321) was recognized as a homolog of the <it>Drosophila </it>gene <it>castor </it>and found to be upregulated in cell differentiation <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, confirming the prediction from HCNE density. Sandelin <it>et al</it>. focused on the 50 HCNE-densest regions they detected in the human genome. Inspection of other HCNE-dense regions has revealed that several coincide with microRNA gene loci <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, a class of regulators implicated in multiple aspects of development <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. We predict that many additional HCNE-dense regions will be found to contain developmental regulators. By plotting HCNE densities along entire chromosomes, the Ancora genome browser makes it easy to survey genomes for HCNE-dense regions (Figure <figr fid="F4">4</figr>). HCNE density curves from multiple pairwise genome comparisons can be shown simultaneously, so that users can identify regions rich in HCNEs that are specific to a subset of species, or shared across many species, if so desired. By zooming in, the user can investigate these regions in detail by inspecting the genome annotation available in Ancora as well as annotation in the other genome browsers to which direct links are provided. As a demonstration of the immediate utility of Ancora, we identified 129 genomic regions in the human genome in which the density of human-zebrafish HCNEs (70% identity over 50 columns) surpassed 0.5% and, using the principles outlined here, identified putative target genes in 120 of these regions (Additional data file 1). The regions in which no target gene could be assigned are prime candidates for discovery of novel genes or non-coding RNA involved in developmental regulation.</p>
         </sec>
         <sec>
            <st>
               <p>Detecting and interpreting duplicated GRBs</p>
            </st>
            <p>As a result of whole-genome duplication in teleosts, many mammalian GRBs have two orthologous GRBs in teleost genomes. The Ancora genome browser makes it easy to locate such GRBs by coloring HCNEs according to the chromosome they align to in the other genome. For example, when viewing human-zebrafish HCNEs along human chromosomes, the hallmark of a GRB present in two copies in zebrafish is a HCNE-dense region where HCNEs occur mainly in two different colors. Such regions can also be discovered by activating an option that makes the genome browser separate HCNE density plots based on chromosomes in the other genome (Figure <figr fid="F3">3</figr>). Figure <figr fid="F5">5a</figr> shows an example: the GRB of <it>PAX7</it>, a transcription factor gene implicated in muscle development <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> and situated within an array of HCNEs. Most of the human-zebrafish HCNEs in this region are colored either gray or light green in the genome browser (Figure <figr fid="F5">5a</figr>) and align to orthologous loci on zebrafish chromosomes 23 and 11, respectively. Thus, this view quickly suggests that noncoding putative regulatory sequences have been preserved to a similar extent at both of the <it>pax7 </it>loci in zebrafish. In contrast, Figure <figr fid="F5">5b</figr> shows an example where duplicate GRBs have diverged to a large extent in zebrafish. Human <it>LHX1</it>, a LIM homeobox transcription factor gene implicated in head, neural and reproductive development <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> is within an array of HCNEs that extends into the neighboring genes <it>AATF</it>, which encodes a transcription factor involved in cell cycle control, and <it>ACACA</it>, which encodes a carboxylase involved in fatty acid synthesis. Most of the human-zebrafish HCNEs in this region are colored blue in the genome browser (Figure <figr fid="F5">5b</figr>) and align to the region around <it>lhx1a </it>on zebrafish chromosome 15. Thus, noncoding putative regulatory sequences appear to have been preserved to a much larger extent around <it>lhx1a </it>than around <it>1hx1b</it>. A detailed inspection of the zebrafish loci reveals that orthologs of <it>AATF </it>and <it>ACACA </it>have been retained at the <it>lhx1b </it>locus, but lost from the <it>lhx1a </it>locus, where there are more HCNEs (in Ancora, the zebrafish loci can be inspected by jumping from the displayed human locus to corresponding loci in the zebrafish genome by clicking on HCNEs or synteny blocks). Following the rationale in <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> this confirms that the HCNE array is unrelated to <it>AATF </it>and <it>ACACA</it>, and allows the classification of these two genes as bystanders.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Duplicated GRBs</p>
               </caption>
               <text>
                  <p>Duplicated GRBs. Zebrafish HCNEs and their density distribution are shown for the human <b>(a) </b><it>PAX7 </it>and <b>(b) </b><it>LHX1 </it>loci. HCNEs are colored by the zebrafish chromosome they map to. In (a), most HCNEs are colored light green or gray and map to zebrafish chromosomes 11 or 23, respectively. In (b), most HCNEs are colored blue and map to zebrafish chromosome 15 (because this region contains many HCNEs, they are collapsed on a single row in this screenshot). The density plots are also separated based on zebrafish chromosomes. Comparison of synteny blocks to exon locations indicate that orthologs of <it>AATF </it>and <it>ACACA </it>are present next to the <it>LHX1 </it>ortholog (<it>lhx1b</it>) on zebrafish chromosome 5, but not on zebrafish chromosome 15 where <it>lhx1a </it>is located; this can be confirmed by detailed inspection of the zebrafish loci.</p>
               </text>
               <graphic file="gb-2008-9-2-r34-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Distinguishing chromosomal regulatory domains</p>
            </st>
            <p>By comparing HCNE arrays and synteny blocks, we have observed that the extent of a HCNE array often provides a good approximation of the extent of the corresponding GRB <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B18">18</abbr></abbrgrp>. However, unless synteny conservation is taken into account, partitioning of HCNEs into separate arrays becomes arbitrary in regions with high noncoding conservation. In the Ancora genome browser, it is easy to visualize synteny conservation of HCNE arrays over large genomic segments by activating the option that separates HCNE density plots based on chromosome in the other genome. The result is an overview of how HCNE-dense regions have been partitioned over different chromosomes in evolution (Figure <figr fid="F6">6</figr>). Based on the assumption that fundamental regulatory domains have been maintained in evolution <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>, the displayed separation of HCNE-dense regions across chromosomes should correspond to a separation of distinct regulatory domains. We expect the resolution of this approach to increase as more genomes are sequenced and assembled.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Distinguishing regulatory domains</p>
               </caption>
               <text>
                  <p>Distinguishing regulatory domains. Screenshot from the Ancora genome browser showing 26 Mb of human chromosome 10 and HCNE densities from comparisons with mouse, zebrafish and medaka. HCNE density curves are separated based on chromosomes in these organisms (Zv7_NA53 is a contig that has not been assigned to a chromosome). To illustrate the use of this view for distinguishing chromosomal regulatory domains, rectangles have been manually added to the screenshot around density peaks indicating clusters of HCNEs in conserved synteny. Rectangles are labeled with regulatory genes annotated in the corresponding genomic regions.</p>
               </text>
               <graphic file="gb-2008-9-2-r34-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Viewing HCNEs and density plots in other genome browsers</p>
         </st>
         <p>The Ancora genome browser provides the most flexible way to explore the HCNE data in Ancora. However, it is often useful to view these data in other browsers where it can be compared to other types of annotation. We aimed to make it as straightforward as possible to view HCNE data in the UCSC Genome Browser <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and Ensembl <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> (Figure <figr fid="F7">7</figr>).</p>
         <fig id="F7">
            <title>
               <p>Figure 7</p>
            </title>
            <caption>
               <p>Ancora tracks in the UCSC and Ensembl genome browsers</p>
            </caption>
            <text>
               <p>Ancora tracks in the UCSC and Ensembl genome browsers. Genome browser views of region around the human <it>SHOX2 </it>gene. Added tracks show locations and densities for HCNEs detected at similarity thresholds of 95% in the mouse comparison and 70% in the zebrafish comparison. <b>(a) </b>UCSC, same region as in Figure 2. <b>(b) </b>Ensembl, displaying 1 Mb (the maximum allowed size in ContigView) of the same region. Additional tracks from Ensembl show conserved elements ('Conservation') and transcripts ('Ensembl trans.').</p>
            </text>
            <graphic file="gb-2008-9-2-r34-7"/>
         </fig>
         <p>HCNE locations and precomputed density curves are available for download in the 'bed' and 'wig' formats used for UCSC Genome Browser custom tracks <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. It is not necessary to download the .bed and .wig files to use them as custom tracks in the UCSC Genome Browser: the user can simply copy the URLs for track files of interest from the Ancora downloads section and paste them into the 'add custom tracks' form on the UCSC Genome Browser web site.</p>
         <p>The Ensembl browser can display sequence annotations provided over the web through DAS, a method for data exchange <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Much of the Ancora data are available through DAS. Ancora provides an interface where the user can add HCNE tracks to Ensembl ContigView. Tracks added in this way are stored as part of the user's Ensembl preferences. Users who are familiar with DAS can also retrieve data directly from the DAS server. For example, the URL given in reference <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> provides a list of available tracks.</p>
      </sec>
      <sec>
         <st>
            <p>Comparison to other tools</p>
         </st>
         <p>While the genome browsers at UCSC and Ensembl provide rich and diverse annotation sets including information about sequence conservation, they do not distinguish coding from noncoding conserved elements. To our knowledge, the Ancora genome browser is the first tool that makes it easy to visualize HCNE distributions on large genomic regions, up to whole chromosomes, and the browser is tailored to show data in a flexible manner at this level.</p>
         <p>The ECR Browser <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and VISTA Browser <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> allow detailed inspection of sequence conservation profiles across many genomes, highlight conserved elements in a user-customizable manner and distinguish noncoding from coding conservation. In the ECR Browser, one drawback is that thresholds for detection of conserved elements are uniform across all comparisons shown, irrespective of evolutionary distance. In contrast, Ancora and VISTA browsers can show results for multiple different thresholds simultaneously. A limitation of both the ECR and VISTA browsers is that they are not designed for visualizing the distribution of conserved elements on segments larger than a few megabases. The VISTA Browser can only display regions up to 5 Mb in size and the ECR Browser's display of large regions is difficult to interpret because conserved elements are drawn close together. In contrast, the HCNE density plots in Ancora make it possible to view and intuitively interpret HCNE content at any scale. Ancora is therefore better suited for exploring conservation genome-wide and discovering regulatory domains at loci not known beforehand, while the ECR and VISTA browsers provide more functionality for close examination of sequence-level conservation profiles.</p>
         <p>The CONDOR database <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> holds information on about 6,800 HCNEs from about 120 blocks of conserved synteny between human and fugu and provides a graphical interface to view the distribution of HCNEs in those regions. While there are several similarities between Ancora and CONDOR, Ancora has the advantage of providing HCNE data for entire genomes. Another difference between the two resources is that the Ancora HCNE sets are not as stringently defined in terms of conservation as those in CONDOR, where HCNEs are required to be conserved among four diverged vertebrates. In Ancora, we have chosen to provide a range of HCNE data sets from different pairwise comparisons and with different similarity thresholds (Figure <figr fid="F1">1</figr> and Table <tblr tid="T1">1</tblr>), so that users can choose to look at the data appropriate for their questions. A valuable section of CONDOR provides developmental expression patterns for about 100 HCNEs that have been investigated by reporter assays in zebrafish. We are preparing to link similar data to Ancora.</p>
      </sec>
      <sec>
         <st>
            <p>Summary</p>
         </st>
         <p>Ancora is a new web resource that provides data and tools for exploring HCNEs and their association with developmental regulatory genes. Built upon a database of HCNEs conserved between various metazoan genomes, Ancora provides a genome browser for visualizing the distribution of those elements on chromosomes in the context of other types of annotation integrated from different sources. One of the novel features of Ancora is the possibility to display highly customizable plots of HCNE density along chromosomes. HCNE density plots are qualitatively different from conservation profiles available in other genome browsers <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B30">30</abbr><abbr bid="B34">34</abbr></abbrgrp>: they clearly reveal regions of extensive noncoding conservation and highlight larger chromosomal regulatory domains (GRBs) that have been maintained in evolution. The GRBs typically coincide with loci of developmental regulatory genes, for which HCNEs appear to act as enhancers <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Consequently, we anticipate that Ancora will be highly useful for discovering developmental regulatory genes and their distal <it>cis</it>-regulatory elements. We have illustrated how Ancora can be used to define the chromosomal regulatory domains of those genes and distinguish genes that appear to be functionally associated with HCNEs from unrelated 'bystander' genes within the same GRB. The HCNE data in Ancora are also available for download and can easily be displayed in the popular general-purpose genome browsers at UCSC <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and Ensembl <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>DAS, distributed annotation system; GRB, genomic regulatory block; HCNE, highly conserved noncoding element.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BL conceived of and supervised the study. PGE and DF designed and implemented the pipeline for detecting HCNEs. PGE implemented the web resource. All authors participated in writing the paper and approved the final version.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Human genomic regions in which the density (in a 300 kb sliding window) of human-zebrafish HCNEs (70% identity over 50 columns) surpassed 0.5% and putative target genes in 120 of these regions.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Human genomic regions in which the density (in a 300 kb sliding window) of human-zebrafish HCNEs (70% identity over 50 columns) surpassed 0.5% and putative target genes in 120 of these regions</p>
            </caption>
            <text>
               <p>Human genomic regions in which the density (in a 300 kb sliding window) of human-zebrafish HCNEs (70% identity over 50 columns) surpassed 0.5% and putative target genes in 120 of these regions.</p>
            </text>
            <file name="gb-2008-9-2-r34-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the Functional Genomics Programme (FUGE) of the Research Council of Norway, Bergen Research Foundation (Bergen Forskningsstiftelse, BFS), and a core grant from the Sars Centre. We thank Ying Sheng, Xianjun Dong and Altuna Akalin for comments on the genome browser.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Ultraconserved elements in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pheasant</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Makunin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Stephen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>1321</fpage>
            <lpage>1325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1098119</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bruce</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Klos</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Ericson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">544600</pubid>
                  <pubid idtype="pmpid" link="fulltext">15613238</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Highly conserved non-coding sequences are associated with vertebrate development.</p>
            </title>
            <aug>
               <au>
                  <snm>Woolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goodson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McEwen</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Vavouri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>North</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Callaway</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Abnizova</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Gilks</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Elgar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">526512</pubid>
                  <pubid idtype="pmpid" link="fulltext">15630479</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Ancient noncoding elements conserved in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Venkatesh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Loh</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dandona</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Viswanathan</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Tay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Strausberg</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>314</volume>
            <fpage>1892</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1130708</pubid>
                  <pubid idtype="pmpid" link="fulltext">17185593</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Ultraconserved elements in insect genomes: a highly conserved intronic sequence implicated in the control of homothorax mRNA splicing.</p>
            </title>
            <aug>
               <au>
                  <snm>Glazov</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Pheasant</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>McGraw</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>800</fpage>
            <lpage>808</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142470</pubid>
                  <pubid idtype="pmpid" link="fulltext">15899965</pubid>
                  <pubid idtype="doi">10.1101/gr.3545105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genomic regulatory blocks underlie extensive microsynteny conservation in insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Ho Sui</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Drivenes</snm>
                  <fnm>&#216;</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1898</fpage>
            <lpage>1908</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099597</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989259</pubid>
                  <pubid idtype="doi">10.1101/gr.6669607</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Parallel evolution of conserved non-coding elements that target a common set of developmental regulatory genes from worms to humans.</p>
            </title>
            <aug>
               <au>
                  <snm>Vavouri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gilks</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elgar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1852409</pubid>
                  <pubid idtype="pmpid" link="fulltext">17274809</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-2-r15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Scanning human gene deserts for long-range enhancers.</p>
            </title>
            <aug>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>413</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1088328</pubid>
                  <pubid idtype="pmpid" link="fulltext">14563999</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Characterization of the pufferfish Otx2 cis-regulators reveals evolutionarily conserved genetic mechanisms for vertebrate head specification.</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura-Yoshida</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kitajima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Oda-Ishii</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aizawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matsuo</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2004</pubdate>
            <volume>131</volume>
            <fpage>57</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/dev.00877</pubid>
                  <pubid idtype="pmpid" link="fulltext">14645121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts.</p>
            </title>
            <aug>
               <au>
                  <snm>de la Calle-Mustienes</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Feijoo</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Manzanares</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tena</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Seguel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Letizia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Allende</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Gomez-Skarmeta</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1061</fpage>
            <lpage>1072</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182218</pubid>
                  <pubid idtype="pmpid" link="fulltext">16024824</pubid>
                  <pubid idtype="doi">10.1101/gr.4004805</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p><it>In vivo </it>enhancer analysis of human conserved non-coding sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Ahituv</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Shoukry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Minovitsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Plajzer-Frick</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Val</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Visel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>444</volume>
            <fpage>499</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05295</pubid>
                  <pubid idtype="pmpid" link="fulltext">17086198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Human-zebrafish non-coding conserved elements act <it>in vivo </it>to regulate transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Shin</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Priest</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ronco</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Burns</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>MacRae</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>5437</fpage>
            <lpage>5445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1236720</pubid>
                  <pubid idtype="pmpid" link="fulltext">16179648</pubid>
                  <pubid idtype="doi">10.1093/nar/gki853</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>VISTA Enhancer Browser - a database of tissue-specific human enhancers.</p>
            </title>
            <aug>
               <au>
                  <snm>Visel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Minovitsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D88</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716724</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130149</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>CONDOR: a database resource of developmentally associated conserved non-coding elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Woolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Callaway</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McEwen</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Elgar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Dev Biol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>100</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2020477</pubid>
                  <pubid idtype="pmpid" link="fulltext">17760977</pubid>
                  <pubid idtype="doi">10.1186/1471-213X-7-100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Mapping cis-regulatory domains in the human genome using multi-species conservation of synteny.</p>
            </title>
            <aug>
               <au>
                  <snm>Ahituv</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>3057</fpage>
            <lpage>3063</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/ddi338</pubid>
                  <pubid idtype="pmpid" link="fulltext">16155111</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Long-range control of gene expression: emerging mechanisms and disruption in disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Kleinjan</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>van Heyningen</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2005</pubdate>
            <volume>76</volume>
            <fpage>8</fpage>
            <lpage>32</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1196435</pubid>
                  <pubid idtype="pmpid" link="fulltext">15549674</pubid>
                  <pubid idtype="doi">10.1086/426833</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The random versus fragile breakage models of chromosome evolution: a matter of resolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Becker</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2007</pubdate>
            <volume>278</volume>
            <fpage>487</fpage>
            <lpage>491</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-007-0287-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">17851692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates.</p>
            </title>
            <aug>
               <au>
                  <snm>Kikuta</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Laplante</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Navratilova</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Komisarczuk</snm>
                  <fnm>AZ</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Fredman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Akalin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sealy</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ghislain</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pezeron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mourrain</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ellingsen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Oates</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Thisse</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Thisse</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Foucher</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Adolf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Geling</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>TS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>545</fpage>
            <lpage>555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1855176</pubid>
                  <pubid idtype="pmpid" link="fulltext">17387144</pubid>
                  <pubid idtype="doi">10.1101/gr.6086307</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genome duplication in the teleost fish <it>Tetraodon nigroviridis </it>reveals the early vertebrate proto-karyotype.</p>
            </title>
            <aug>
               <au>
                  <snm>Jaillon</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Aury</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brunet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Petit</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bouneau</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ozouf-Costaz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bernot</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nicaud</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fisher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lutfalla</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dossat</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Segurens</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dasilva</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Salanoubat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boudet</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Castellano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jubin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Castelli</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Katinka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vacherie</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bi&#233;mont</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Skalli</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Poulain</snm>
                  <fnm>J</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>946</fpage>
            <lpage>957</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03025</pubid>
                  <pubid idtype="pmpid" link="fulltext">15496914</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Ancora</p>
            </title>
            <url>http://ancora.genereg.net</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The UCSC genome browser database: update 2007.</p>
            </title>
            <aug>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Trumbower</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Thakkapallayil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Stanke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Rhead</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D668</fpage>
            <lpage>673</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669757</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142222</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl928</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Ensembl 2007.</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ballester</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fernandez-Banet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Megy</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D610</fpage>
            <lpage>617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761443</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148474</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl996</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The distributed annotation system.</p>
            </title>
            <aug>
               <au>
                  <snm>Dowell</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Jokerst</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">58584</pubid>
                  <pubid idtype="pmpid" link="fulltext">11667947</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-2-7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>11484</fpage>
            <lpage>11489</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208784</pubid>
                  <pubid idtype="pmpid" link="fulltext">14500911</pubid>
                  <pubid idtype="doi">10.1073/pnas.1932072100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Database resources of the National Center for Biotechnology Information.</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chetvernin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>DiCuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Kapustin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Landsman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Souvorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yaschenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D5</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781113</pubid>
                  <pubid idtype="pmpid" link="fulltext">17170002</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl1031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The mouse genome database (MGD): new features facilitating a model system.</p>
            </title>
            <aug>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kadin</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D630</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1751527</pubid>
                  <pubid idtype="pmpid" link="fulltext">17135206</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl940</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The Zebrafish Information Network: the zebrafish model organism database provides expanded support for genotypes and phenotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>Sprague</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bayraktaroglu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bradford</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Conlin</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fashena</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Haendel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mani</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Pich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schaper</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Segerdell</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Singer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sprunger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Van Slyke</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Westerfield</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>36 Database</issue>
            <fpage>D768</fpage>
            <lpage>772</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2238839</pubid>
                  <pubid idtype="pmpid" link="fulltext">17991680</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm956</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>FlyBase: genomes by the dozen.</p>
            </title>
            <aug>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Strelets</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gelbart</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D486</fpage>
            <lpage>491</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669768</pubid>
                  <pubid idtype="pmpid" link="fulltext">17099233</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl827</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>miRBase: microRNA sequences, targets and gene nomenclature.</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grocock</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D140</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347474</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381832</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Multiple whole genome alignments and novel biomedical applications at the VISTA portal.</p>
            </title>
            <aug>
               <au>
                  <snm>Brudno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Poliakov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Minovitsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ratnere</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>W669</fpage>
            <lpage>674</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1933192</pubid>
                  <pubid idtype="pmpid" link="fulltext">17488840</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The generic genome browser: a building block for a model organism system database.</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Shu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Caudy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mangone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Arva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>1599</fpage>
            <lpage>1610</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187535</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368253</pubid>
                  <pubid idtype="doi">10.1101/gr.403602</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Genome sequence, comparative analysis and haplotype structure of the domestic dog.</p>
            </title>
            <aug>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wade</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Karlsson</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Kulbokas</snm>
                  <fnm>EJ</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Breen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wayne</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Ostrander</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>DeJong</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Alvarez</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Biagi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brockman</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chin</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>DeCaprio</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gnerre</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <fpage>803</fpage>
            <lpage>819</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04338</pubid>
                  <pubid idtype="pmpid" link="fulltext">16341006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Genome of the marsupial <it>Monodelphis domestica </it>reveals innovation in non-coding sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Wakefield</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Amemiya</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Goodstadt</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Heger</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Sharpe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Benos</snm>
                  <fnm>PV</fnm>
               </au>
               <au>
                  <snm>Belov</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Das</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Davidow</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Deakin</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Fazzari</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Grabherr</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Greally</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>W</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <fpage>167</fpage>
            <lpage>177</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05805</pubid>
                  <pubid idtype="pmpid" link="fulltext">17495919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>ECR Browser: a tool for visualizing and accessing data from comparisons of multiple vertebrate genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Stubbs</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W280</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441493</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215395</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Molecular cloning and characterization of human Castor, a novel human gene upregulated during cell differentiation.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cullion</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Thiele</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2006</pubdate>
            <volume>344</volume>
            <fpage>834</fpage>
            <lpage>844</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2006.03.207</pubid>
                  <pubid idtype="pmpid" link="fulltext">16631614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>A developmental view of microRNA function.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Srivastava</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2007</pubdate>
            <volume>32</volume>
            <fpage>189</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tibs.2007.02.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">17350266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Skeletal muscle progenitor cells and the role of Pax genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Buckingham</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>C R Biol</source>
            <pubdate>2007</pubdate>
            <volume>330</volume>
            <fpage>530</fpage>
            <lpage>533</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.crvi.2007.03.015</pubid>
                  <pubid idtype="pmpid" link="fulltext">17631448</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>LIM-homeodomain genes in mammalian development and human disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Hunter</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Rhodes</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Rep</source>
            <pubdate>2005</pubdate>
            <volume>32</volume>
            <fpage>67</fpage>
            <lpage>77</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11033-004-7657-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">16022279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>UCSC Genome Bioinformatics - FAQ: Data File Formats</p>
            </title>
            <url>http://genome.ucsc.edu/FAQ/FAQformat</url>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Ancora DAS Tracks</p>
            </title>
            <url>http://ancora.genereg.net/das/dsn</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.</p>
            </title>
            <aug>
               <au>
                  <cnm>International Chicken Genome Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>432</volume>
            <fpage>695</fpage>
            <lpage>716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03154</pubid>
                  <pubid idtype="pmpid" link="fulltext">15592404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Whole-genome shotgun assembly and analysis of the genome of <it>Fugu rubripes</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Aparicio</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stupka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Dehal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Christoffels</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rash</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hoon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gelpke</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Roach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>IY</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Detter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Verhoef</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Predki</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lucas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Doggett</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zharkikh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tavtigian</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Pruss</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barnstead</snm>
                  