<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-9-r91</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Method</dochead>
      <bibl>
         <title>
            <p>Genome-wide prioritization of disease genes and identification of disease-disease associations from an integrated human functional linkage network</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Linghu</snm>
               <fnm>Bolan</fnm>
               <insr iid="I1"/>
               <email>blinghu@bu.edu</email>
            </au>
            <au id="A2">
               <snm>Snitkin</snm>
               <mi>S</mi>
               <fnm>Evan</fnm>
               <insr iid="I1"/>
               <email>esnitkin@gmail.com</email>
            </au>
            <au id="A3">
               <snm>Hu</snm>
               <fnm>Zhenjun</fnm>
               <insr iid="I1"/>
               <email>zjhu@bu.edu</email>
            </au>
            <au id="A4">
               <snm>Xia</snm>
               <fnm>Yu</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>yuxia@bu.edu</email>
            </au>
            <au ca="yes" id="A5">
               <snm>DeLisi</snm>
               <fnm>Charles</fnm>
               <insr iid="I1"/>
               <email>delisi@bu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Program, Boston University, 24 Cummington Street, Boston, MA 02215, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, MA 02215, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>9</issue>
         <fpage>R91</fpage>
         <url>http://genomebiology.com/2009/10/9/R91</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2009-10-9-r91</pubid>
               <pubid idtype="pmpid">19728866</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>2</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>9</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>3</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>3</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Linghu et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p> Functional-linkage network </p>
      </shorttitle>
      <shortabs>
         <p> An evidence-weighted functional-linkage network of human genes reveals associations among diseases that share no known disease genes and have dissimilar phenotypes
</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>We integrate 16 genomic features to construct an evidence-weighted functional-linkage network comprising 21,657 human genes. The functional-linkage network is used to prioritize candidate genes for 110 diseases, and to reliably disclose hidden associations between disease pairs having dissimilar phenotypes, such as hypercholesterolemia and Alzheimer's disease. Many of these disease-disease associations are supported by epidemiology, but with no previous genetic basis. Such associations can drive novel hypotheses on molecular mechanisms of diseases and therapies.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010012">Medicine</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Recently, a number of computational approaches have been developed to predict or prioritize candidate disease genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Most approaches are based on the idea that genes associated with the same or related disease phenotypes tend to participate in common functional modules (such as protein complexes, metabolic pathways, developmental or organogenesis processes, and so on) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. This concept is supported by functional analysis of genes associated with diverse diseases <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, and by the success of various disease gene prioritization studies based on the concept <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B29">29</abbr></abbrgrp>.</p>
         <p>Network-based approaches have also been employed to infer new candidate disease genes based upon network linkages with known disease genes <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. These methods typically first construct a gene-gene association network based on one or more types of genomic and proteomic data, and subsequently rank candidate genes based on network proximity to known disease associated genes. Although some of these methods perform well using just one specific type of evidence for functional association, such as protein-protein physical-interaction data or co-expression data, the restriction to only one type of functional association potentially limits their predictive ability <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. To address this issue, Franke <it>et al</it>. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> constructed a functional linkage network (FLN) by integrating multiple types of data, and utilized the FLN for disease gene prioritization. However, their results indicate that the performance was highly dependent on Gene Ontology (GO) annotations, in addition to functional associations from curated databases such as the Kyoto Encyclopaedia of Genes and Genomes (KEGG) and Reactome <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. As a result, the predictions tend to be biased towards well-characterized genes, and thus limit potential inferences. In a second study, Kohler <it>et al</it>. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> constructed an FLN from heterogeneous data sources, and used a random walk algorithm for disease gene prioritization. However, their network did not incorporate linkage weight to differentiate confidences in functional associations among genes. Therefore, the FLN-based disease gene prioritization still needs to be further explored.</p>
         <p>In addition to identifying genes associated with different diseases, other work has explored relationships among human diseases <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B37">37</abbr></abbrgrp>. Recent studies indicate that human diseases tend to form an interrelated landscape, whereby different diseases are linked together based on perturbing the same biological processes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Perhaps unsurprising is the finding that diseases with similar phenotypes tend to be caused by dysfunctions of the same genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Less anticipated was the finding that diseases with dissimilar phenotypes can also be related at the molecular level <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. To study disease-disease relationships, some previous methods used the similarity of phenotype descriptions or examined the hospital diagnosis records to quantify the disease-disease associations <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B37">37</abbr></abbrgrp>. However, because these approaches characterize disease-disease associations entirely at the phenotypic level, they have the potential limitation of missing those disease-disease associations that can be easily detected at the molecular level but not at the phenotypic level.</p>
         <p>Recently, Goh <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> proposed a method to identify disease-disease associations at the molecular level based on shared disease genes, which therefore may capture associations missed by the phenotype-based approaches. However, the breadth of this method is limited by the relative paucity of knowledge of disease causing genes. A potential solution to this problem is the use of functional linkages to identify associations between genes involved in different diseases. This can result in the identification of relationships between diseases that while they may not be associated with the same genes, are associated with functionally related sets of genes.</p>
         <p>Here, we construct an integrated FLN in human for two purposes: to prioritize new (not previously recognized) genes that are potentially associated with a given disease; and to explore the inter-relationships between diverse diseases revealed by considering functional associations between genes associated with different diseases (Figure <figr fid="F1">1</figr>). We use a na&#239;ve Bayes classifier <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp> to integrate 16 functional genomics features assembled from 32 sub-features. The result of this integration is a genome-scale FLN (composed of 21,657 genes and 22,388,609 links), in which nodes represent genes, and edge weights the likelihood that the linked nodes participate in a common biological process. Our integrated FLN has a higher coverage and increased accuracy compared to networks based on individual data sources.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Construction of an integrated functional linkage network (FLN) with applications in prioritizing candidate disease genes and quantifying the disease-disease associations</p>
            </caption>
            <text>
               <p>Construction of an integrated functional linkage network (FLN) with applications in prioritizing candidate disease genes and quantifying the disease-disease associations. Functional associations between genes are retrieved from diverse data sources (Table 1). These functional associations are then integrated into one single FLN using a na&#239;ve Bayes classifier, in which the nodes represent individual genes and the weighted edges represent the degree of their overall functional association upon combining all contributing data sources. Green arrows represent the two steps of using of the FLN for candidate disease gene prioritization: step 1, given a particular disease (Disease I), label genes known to be associated with this disease as seeds (pink colored nodes); step 2, prioritize all other genes in terms of their association with the disease based on the sum of the weights of their network links to the seed genes. The purple arrows represent the two steps of using the FLN to quantify the disease-disease associations: step 1, label genes known to be associated with different diseases with different colors (gene K is labeled with two colors since it is associated with two diseases); step 2, quantify the associations between any two diseases based on the degree of association between the two corresponding disease gene sets within the FLN.</p>
            </text>
            <graphic file="gb-2009-10-9-r91-1"/>
         </fig>
         <p>Next, we use this FLN to predict new candidate disease genes for 110 diverse diseases from the Online Mendelian Inheritance in Man Database (OMIM) database <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. For each disease, we quantify the degree of association between each gene and the disease by considering how tightly the candidate gene is connected to known disease genes in the FLN. This then allows us to rank the probabilities of all genes being involved in a particular disease, based upon their degree of functional relatedness to genes known to be associated with a given disease.</p>
         <p>Finally, using the FLN, we identify disease-disease associations based on functional correlations between disease-related genes. Specifically, our approach considers not only whether diseases share associated genes, but also whether gene sets from different diseases are tightly linked in the FLN. We show that the FLN can be used to identify associations between phenotypically diverse diseases, and to reveal associations even in the absence of common known disease genes or common pathological symptoms. With knowledge of such disease-disease associations, prior knowledge gained from one disease can shed light on the underlying molecular mechanisms and relevant therapies of related diseases.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>A genome-scale human functional linkage network built through data integration</p>
            </st>
            <p>Our goals are to exploit the functional coherence of genes involved in a given disease to identify genes that underlie diverse disorders, and to find previously unknown links between phenotypically dissimilar human diseases. We pursue these goals by first integrating genomic features from disparate data sources to establish quantitative functional links among human genes. Since each data source usually characterizes only one type of functional association between genes, and covers a relatively limited set of genes, functional associations from various sources need to be combined to attain maximal coverage and accuracy. We systematically assemble a set of 16 genomic features, which incorporates 32 sub-features. These genomic features include diverse functional genomics data in human, as well as functional associations mapped through orthology from five model organisms (yeast, worm, fly, mouse, and rat; Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Data sources for FLN construction</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Data sources</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Description</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of unique gene pairs</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of unique genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Curated PPI</p>
                     </c>
                     <c ca="left">
                        <p>Curated human PPI from HPRD, BIND, BIOGRID, INTACT, MIPS, DIPS, and MINT <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>90,352</p>
                     </c>
                     <c ca="center">
                        <p>10,281</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Y2H</p>
                     </c>
                     <c ca="left">
                        <p>PPI from high-throughput yeast two-hybrid experiments <abbrgrp><abbr bid="B52">52</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>2,611</p>
                     </c>
                     <c ca="center">
                        <p>1,522</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Masspec</p>
                     </c>
                     <c ca="left">
                        <p>PPI from large-scale mass spectrometry experiments <abbrgrp><abbr bid="B53">53</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>2,046</p>
                     </c>
                     <c ca="center">
                        <p>1,159</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DDI</p>
                     </c>
                     <c ca="left">
                        <p>Protein pairs containing interacting protein domains <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>6,933,469</p>
                     </c>
                     <c ca="center">
                        <p>13,454</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Co-exp</p>
                     </c>
                     <c ca="left">
                        <p>Expression correlation from multiple large-scale expression datasets <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>5,110,798</p>
                     </c>
                     <c ca="center">
                        <p>16,287</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DS</p>
                     </c>
                     <c ca="left">
                        <p>Proteins pairs sharing same protein domains <abbrgrp><abbr bid="B91">91</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>2,064,262</p>
                     </c>
                     <c ca="center">
                        <p>17,328</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PG</p>
                     </c>
                     <c ca="left">
                        <p>Gene pairs having correlated phylogenetic profiles <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>18,086</p>
                     </c>
                     <c ca="center">
                        <p>2,607</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GN</p>
                     </c>
                     <c ca="left">
                        <p>Gene pairs located close to each other along the chromosome <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>10,070</p>
                     </c>
                     <c ca="center">
                        <p>1,365</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fusion</p>
                     </c>
                     <c ca="left">
                        <p>Protein pairs fused into one single protein in other species <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>361</p>
                     </c>
                     <c ca="center">
                        <p>361</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Yeast</p>
                     </c>
                     <c ca="left">
                        <p>Functional associations mapped from seven types of functional genomics data in yeast through gene orthology <abbrgrp><abbr bid="B92">92</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>123,380</p>
                     </c>
                     <c ca="center">
                        <p>3,809</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Worm</p>
                     </c>
                     <c ca="left">
                        <p>Functional associations mapped from four types of functional genomics data in worm through gene orthology <abbrgrp><abbr bid="B41">41</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>96,911</p>
                     </c>
                     <c ca="center">
                        <p>5,737</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fly</p>
                     </c>
                     <c ca="left">
                        <p>Functional associations mapped from three types of functional genomics data in fly through gene orthology <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>139,984</p>
                     </c>
                     <c ca="center">
                        <p>5,966</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse-rat</p>
                     </c>
                     <c ca="left">
                        <p>Functional associations mapped from three types of functional genomics data in mouse and rat through gene orthology <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>254,477</p>
                     </c>
                     <c ca="center">
                        <p>11,789</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TexM</p>
                     </c>
                     <c ca="left">
                        <p>Co-occurrence in PubMed abstracts <abbrgrp><abbr bid="B56">56</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>518,716</p>
                     </c>
                     <c ca="center">
                        <p>12,286</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MF</p>
                     </c>
                     <c ca="left">
                        <p>Gene pairs sharing same molecular function terms in GO <abbrgrp><abbr bid="B93">93</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>6,937,725</p>
                     </c>
                     <c ca="center">
                        <p>7,863</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CC</p>
                     </c>
                     <c ca="left">
                        <p>Gene pairs sharing same cellular component terms in GO <abbrgrp><abbr bid="B93">93</abbr></abbrgrp></p>
                     </c>
                     <c ca="center">
                        <p>5,591,796</p>
                     </c>
                     <c ca="center">
                        <p>12,503</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>See Additional data file 5 for detailed descriptions of data sources for FLN construction. CC, cellular component; Co-exp, co-expressed; DDI, domain-domain interaction; DS, protein domain sharing; GN, gene neighbor; HPRD, Human Protein Reference Database; Masspec, mass spectrometry; MF, molecular function; MIPS, Munich Information Center for Protein Sequences; PG, phylogenetic profiles; PPI, protein-protein interaction; TexM, text mining; Y2H, yeast two hybrid experiments.</p>
               </tblfn>
            </tbl>
            <p>We then use a na&#239;ve Bayes classifier to compute functional links between human genes by integrating these genomic features. Each functional link is weighted by a log likelihood ratio (LR) score, which reflects the probability of the linked gene pair sharing the same biological process after summing over evidence from all available data sources (see Materials and methods). Such extensive data integration outperforms individual data sources in terms of inferring functional linkages (Figure <figr fid="F2">2</figr>), demonstrating the importance of data integration.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Data integration outperforms individual data sources in terms of quantifying functional links between human genes</p>
               </caption>
               <text>
                  <p>Data integration outperforms individual data sources in terms of quantifying functional links between human genes. The x-axis represents linkage sensitivity, defined as the fraction of the gold standard positive (GSP) gene pairs that are linked at different linkage weight cutoffs (see Materials and methods). The y-axis represents linkage precision, defined as the fraction of the linked gold standard gene pairs that belong to the GSP set (see Materials and methods). GSPs are defined as gene pairs sharing the same biological process term in Gene Ontology (GO). Gold-standard negatives (GSNs) are defined as gene pairs annotated with GO biological process terms that do not share any term. To generate the random control curve, we randomize the class labels in the gold standard datasets and then perform the same evaluation. In Figure S1 of Additional data file 5, we provide the same plot with the x-axis in log scale to show details for individual data sources. CC, cellular component; Co-exp, co-expressed; DDI, domain-domain interaction; DS, protein domain sharing; GN, gene neighbor; Masspect, mass spectrometry; MF, molecular function; PG, phylogenetic profiles; PPI, protein-protein interaction; TexM, text mining; Y2H, yeast two hybrid experiments. The descriptions of the 16 individual data sources are listed in Table 1.</p>
               </text>
               <graphic file="gb-2009-10-9-r91-2"/>
            </fig>
            <p>After data integration, we choose a permissive linkage weight cutoff (LR score is higher than 1; Equation 1 in the Materials and methods) such that two genes are linked if the overall evidence supports the functional linkage. This threshold is very intuitive, as it retains edges with more evidence for functional association than against it, and removes edges with more evidence against functional association than for it. In addition, the same cutoff has also been successfully used by Lee <it>et al</it>. <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> to predict perturbation phenotypes of genes based on an integrated FLN in worm. The resulting genome-scale FLN network consists of 21,657 genes (covering approximately 85% of RefSeq annotated human genes <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>) and 22,388,609 weighted links (Additional data file 1). Despite its high coverage, our network retains its accuracy because each link is weighted and the linkage weight is proportional to the linkage precision (Figure <figr fid="F2">2</figr>). The average number of linked neighbours per gene is around 2,000. Such high linkage density together with linkage weighting allows a quantification of functional associations between thousands of genes. Such high coverage is critical to the successful utilization of the FLN for both disease gene prioritization and mapping the disease-disease associations at the molecular level.</p>
         </sec>
         <sec>
            <st>
               <p>Identifying candidate disease-associated genes</p>
            </st>
            <p>Given a network of functionally linked genes, our first goal is to use the information in this network to identify genes most likely to be associated with a particular disease. The motivation for using the FLN to identify potentially disease-related-genes is the hypothesis that genes whose dysfunction contributes to a disease phenotype tend to be functionally related <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Our approach exploits this concept by using genes known to be associated with a particular disease as network 'seeds', and identifying those genes whose connectivity with the seeds indicates a strong functional relation. In particular, for a given disease, each gene in the network is prioritized according to the sum of the weights of its network links to the known disease (seed) genes (Equation 4; see Materials and methods) <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B43">43</abbr></abbrgrp>. This prioritization rule is referred to as neighbourhood weighting.</p>
         </sec>
         <sec>
            <st>
               <p>Validation of identified candidate disease-associated genes</p>
            </st>
            <p>To test this approach, we first extract from the OMIM database <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> 1,025 known disease genes, and assemble them into 110 seed gene sets, representing 110 disorders covering a wide spectrum of human disease phenotypes (Additional data file 2). Each seed set contains at least 5 genes, and the average seed count is 11 (with some seed genes associated with more than one disease). Next we use the FLN to identify new candidate disease genes for each of the 110 diseases, based on the neighbourhood weighting rule <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. As a result, on average, nearly half of the genome is prioritized for each disease. In Additional data file 3, we list the top 100 ranked new candidate disease genes for each disease. To help investigators estimate the prediction precision at a particular rank cutoff, for each disease we provide a plot of precision estimate versus different rank cutoffs (see Materials and methods; Additional data file 4). We assess the performance of our FLN-based disease gene prediction method using leave-one-out cross validation, with so-called disease-centric <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B43">43</abbr></abbrgrp> and gene-centric approaches <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B23">23</abbr></abbrgrp> (see Materials and methods).</p>
            <sec>
               <st>
                  <p>Disease centric assessment</p>
               </st>
               <p>The disease-centric evaluation approach first ranks each gene based on the neighborhood weighting rule for a particular disease, and then for each disease computes the area under the receiver operating characteristic (ROC) curve (AUC), which is obtained by varying the rank cutoff (see Materials and methods) <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. The AUC is an indication of how highly in the ranked list the known disease genes are, where the AUC will be 1 if all disease genes are at the top of the list and 0.5 if the disease genes are randomly distributed in the list. Examples of ROC curves for seven diseases are provided in Figure <figr fid="F3">3</figr>. Additionally, we also provide the same plot using just the extreme left side of the ROC curve, which represents the top ranking predictions (Figure S2 of Additional data file 5).</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Predictability of seven example diseases evaluated by ROC curves in disease-centric assessment</p>
                  </caption>
                  <text>
                     <p>Predictability of seven example diseases evaluated by ROC curves in disease-centric assessment. Prediction performance for individual diseases is measured by the true positive rate (sensitivity) versus false positive rate (1 - specificity). In particular, for each given disease, each gene in the network is ranked based on the disease association score (<it>S</it><sub><it>i</it></sub>; Equation 4). The <it>S</it><sub><it>i </it></sub>for each known disease (seed) gene is computed using leave-one-out cross-validation, based on its connectivity to other seeds. Next the performance for each disease is assessed by calculating the sensitivity (True positives/(True positives + False negatives)) and 1 - specificity (False positives/(True negatives + False positives)) at different <it>S</it><sub><it>i </it></sub>cutoffs. Here True positives is the number of seed genes above the <it>S</it><sub><it>i </it></sub>cutoff, False positives is the number of non-seed genes above the cutoff, True negatives is the number of non-seed genes below the cutoff, and False negatives is the number of seed genes below the cutoff. Random prediction performance is indicated by the diagonal.</p>
                  </text>
                  <graphic file="gb-2009-10-9-r91-3"/>
               </fig>
               <p>Disease-centric evaluation shows that FLN-based disease gene prioritization has an extremely high median AUC of 0.98 for the 110 diseases tested, indicating a high predictive capacity across a large number of diseases (Figure <figr fid="F4">4a</figr>). In light of this very high performance, we next consider an issue that may potentially inflate our performance. Specifically, one of the data sources used to construct our FLN is text mining of PubMed abstracts. The potential issue with this text mining feature is the possibility that some of the gene-disease associations in the OMIM database, which we use to evaluate our method, could be originally derived from the same literature references that text mining is based on <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B24">24</abbr></abbrgrp>. To assess the impact of this potential bias, we create a FLN excluding text mining, and find that the resulting AUCs have a median value of 0.85, lower than full FLN, but still far superior to the random expectation of 0.5. In particular, when we exclude text mining data from the FLN, 80%, 65%, and 39% of the diseases still have an AUC of over 0.75, 0.8, and 0.9, respectively. Additionally, we have also performed the disease centric analysis using only the area of the extreme left side of the ROC curve, which represents the top ranking predictions (see Additional data file 5 for the ROC-50 analysis). The results are consistent with those using the whole ROC curve (Figure S3 of Additional data file 5). Therefore, our FLN is capable of predicting candidate genes for diverse diseases, even in the absence of text mining data.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>FLN-based disease gene prioritization significantly outperforms random control</p>
                  </caption>
                  <text>
                     <p>FLN-based disease gene prioritization significantly outperforms random control. Performances are compared between FLN (inclusion or exclusion of text mining data) based disease-gene prioritization and the random control. The random control is generated using the FLN to prioritize randomly assembled disease gene sets (see Materials and methods). <b>(a) </b>Box plots of AUCs of disease gene prioritization performances for 110 diseases, based on disease-centric assessment (see Materials and methods). For each box plot, the bottom, middle, and top lines of the box represent the first quartile, the median, and the third quartile, respectively; whiskers represent 1.5 times the inter-quartile range; red plus signs represent outliers. <b>(b) </b>Disease gene prioritization performance based on gene-centric assessment using the artificial chromosome region background (see Materials and methods). Gene-centric assessment treats each known gene-disease association as a test case. For each test case, the task is to assess how well the known disease (seed) gene ranks relative to a background gene set according to the disease-association score (<it>S</it><sub><it>i</it></sub>; Equation 4). The <it>S</it><sub><it>i </it></sub>for each gene in each test case is calculated in leave-one-out setting based on the connectivity to the remaining seed genes. The background gene set used is referred to as the artificial chromosomal region, which is composed of a collection of 100 nearest genes flanking the tested disease gene physically on the chromosome. Finally, after the rank of each tested disease gene for each test case is determined, all the test cases are pooled together and the overall performance is assessed by evaluating the fraction of the tested disease genes ranked above various rank cutoffs. <b>(c) </b>Same evaluation as (b) using the background gene set composed of all the genes represented in the FLN, as opposed to just those proximate on the chromosome.</p>
                  </text>
                  <graphic file="gb-2009-10-9-r91-4"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Gene-centric assessment</p>
               </st>
               <p>This evaluation treats each known gene-disease association as a test case, and assesses how well each known disease gene ranks relative to a background set of genes not known to be associated with the particular disease (see Materials and methods). Then, all test cases are pooled together, and the overall performance is evaluated by calculating the fraction of tested disease genes that are ranked above various rank cutoffs. We use two background sets to define the background pool of candidate genes from which we pick out disease-associated genes. One set is a collection of 100 nearest genes flanking the test disease gene physically on the chromosome. This background is referred to as the artificial chromosome region background, and is intended to mimic the common scenario in which a chromosomal region is known to be associated with a disease through genetic association studies, but the specific disease-causing genes are unknown. The other background set contains all genes in the network and is intended to mimic the common scenario where the set of potential candidate genes cannot be narrowed down. This set is referred to as the genome background.</p>
               <p>With the artificial chromosome region background, 85% (with text mining) or 62% (without text mining) of disease genes are ranked in the top 10 out of 100 (Figure <figr fid="F4">4b</figr>). Moreover, we calculate the fold enrichment score, defined as the average rank of a gene before prioritization divided by the rank after prioritization (see Materials and methods). The average fold enrichments are 35.5 (with text mining) or 20.5 (without text mining). Finally, we also carry out gene-centric evaluation using the genome background, and the results are similar (Figure <figr fid="F4">4c</figr>).</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Monogeneic versus polygeneic disorders</p>
            </st>
            <p>Next, we investigate the difference in prioritization performance between monogenic diseases and complex diseases. It is potentially important to distinguish these two classes of diseases, as complex diseases tend to be caused by dysfunctions in multiple biological processes, and this lack of functional coherence may reduce the utility of the FLN in predicting novel disease genes. Kohler <it>et al</it>. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> published a gene-disease association benchmark dataset that explicitly separates monogenic diseases (83 diseases), polygenic diseases (12 diseases) and cancers (12 diseases). We adopt their categorization so that we can evaluate the three disease groups separately. In particular, using the gene-centric evaluation (see Materials and methods), we evaluate how well each known disease gene is ranked relative to a background gene set in a leave-one-out cross-validation (see Materials and methods). The background gene set is composed of the 100 nearest genes flanking the test disease gene physically on the chromosome. As expected, the results are best for monogeneic diseases (Figure S4 in Additional data file 5). The lower performance for complex diseases and cancers is not surprising, since disease gene prediction methods are based on the assumption of functional coherence among genes contributing to the same disease, while as mentioned above, complex diseases tend to perturb multiple biological processes, making the contributing genes less functionally coherent. Despite the lower performance for complex disorders, the results are still far better than random control; for example, 65% of tested disease genes ranked in the top 20 among the background gene set composed of 100 genes.</p>
         </sec>
         <sec>
            <st>
               <p>The importance of data integration for gene prioritization</p>
            </st>
            <p>Disease genes have previously been prioritized using network-based strategies that used only protein-protein interactions (PPI) <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B23">23</abbr></abbrgrp>. Here we have integrated multiple data sources with the expectation that such integration will improve performance. To assess whether this is in fact the case, we compare disease-prioritization performances between our FLN and a PPI network that combines human PPI links from seven major curated PPI databases <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>, along with high-throughput PPI data from yeast two-hybrid and mass spectrometry <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>, and interactions mapped from PPI of other model organisms <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. To avoid bias, interactions from different sources in the PPI network are weighted using the same procedure as FLN construction (Equation S1 in Additional data file 5). In the end, a total of 105,361 interactions among 11,886 genes are included in the PPI network.</p>
            <p>As expected, data integration does improve performance (Figure <figr fid="F5">5</figr>). Using the gene-centric evaluation with the artificial chromosome region background, 62% of disease genes rank in the top 10 among 100 using the integrated FLN (excluding text miming), in contrast to 40% in the PPI network. Similar results were also found using the disease-centric assessment (Figure S5a in Additional data file 5; see Materials and methods for the description of disease-centric assessment). Further support for using an FLN-based approach is the increased gene coverage. In the PPI network only 40% of disease genes are connected to seed genes, and thus only 40% can be prioritized. In contrast, in the integrated network more than 92% of disease genes are linked to seeds and can, therefore, be prioritized. Finally, the benefit of data integration is also evident when we evaluate the prioritization performance of the FLN at different linkage weight cutoffs. After the application of the permissive linkage weight cutoff (LR > 1; Equation 1), we explore other higher cutoffs but find no improvement in the prioritization performance (Figure S5b, c in Additional data file 5). This further demonstrates that functional links are assigned proper weights after data integration, and that the neighbourhood weighting decision rule (Equation 4) allows links with lower weights to contribute to performance.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>FLN-based disease gene prioritization shows improvement over PPI network</p>
               </caption>
               <text>
                  <p>FLN-based disease gene prioritization shows improvement over PPI network. Gene-centric assessment (as described in the legend of Figure 4b) is used to compare the disease-gene prioritization performances between the integrated FLN and a representative PPI network. The PPI network is composed of curated PPI databases <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>, along with high-throughput PPI data from yeast two-hybrid and mass spectrometry <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>, and interactions mapped from PPI of other model organisms. The artificial chromosomal region composed of a collection of 100 nearest genes flanking each tested disease gene physically on the chromosome is used as the background gene set. Here, we exclude text mining data from the FLN.</p>
               </text>
               <graphic file="gb-2009-10-9-r91-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Evaluation of new predictions using recently identified disease genes</p>
            </st>
            <p>The performance evaluations described above are based on leave-one-out cross-validation. Here we evaluate the predictive performance for unknown disease genes by simulating the search for new disease genes. We first manually check the date of the landmark reference for each gene-disease association recorded in the OMIM database. Next, disease genes with references published after January 2007 are set aside for testing, while all other disease genes with reference dates before 2007 are used for seed genes. For the purpose of this evaluation we included text mining from the STRING database, which was curated before January 2007 <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>.</p>
            <p>Within the FLN, there are a total of 61 disease genes associated with 31 diseases that are published after January 2007. These are used for evaluating the FLN-based predictions. Among them, 45 disease genes associated with 24 diseases are also present in the representative PPI network. These genes are used for evaluating PPI-based predictions. These recently identified disease genes and their landmark references are listed in Additional data file 6.</p>
            <p>Again, the FLN shows improvement over the PPI network (Figure <figr fid="F6">6</figr>). For instance, using the gene-centric evaluation with the artificial chromosome region background, 45% of disease genes are ranked in the top 5 in the FLN, in contrast to fewer than 25% in the PPI network. It is noteworthy that there is a drop in performance for both PPI and FLN relative to the cross-validation analysis presented above. In particular, the fold enrichment drops from 16 to 8.2 for PPI and from 35.5 to 16.5 for FLN. This indicates that cross-validation tends to overestimate performance, and it is important to consider this when interpreting cross-validation results.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>FLN shows improvement over PPI network for predicting 'new' disease genes</p>
               </caption>
               <text>
                  <p>FLN shows improvement over PPI network for predicting 'new' disease genes. Disease genes whose disease-association landmark references were published before January 2007 are considered 'known' and are used as seed genes, and disease genes that were published after January 2007 are considered 'new' and are used as test genes. Performances are compared among FLN, PPI network, and the random control of the FLN. Gene-centric assessment is used to evaluate the performance using the artificial chromosome region background composed of 100 genes, as described in the legend of Figure 4b.</p>
               </text>
               <graphic file="gb-2009-10-9-r91-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Obesity: a case study</p>
            </st>
            <p>Obesity is a polygenic disorder involving genes from various processes, such as nutrient catabolism and appetite control <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Our FLN includes 24 obesity-associated genes in the OMIM database, and 334 additional obesity-associated genes collected from the literature by Hancock <it>et al</it>. <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. We will subsequently refer to this set of 334 genes as 'ObesHancock' genes. There is no overlap between the 24 OMIM obesity genes and the 334 ObesHancock genes. Here, we rank obesity-related genes using the 24 OMIM genes as seeds, then evaluate the utility of our ranking using the non-overlapping set of ObesHancock genes. Since ObesHancock genes are collected from the literature, we exclude the text mining data source from FLN construction.</p>
            <p>We find that the ObesHanecok set is overrepresented in the top scoring FLN genes, with 22 of them occurring in the top 100 (<it>P </it>&lt; 1.0 &#215; 10<sup>-13</sup>; see Additional data file 5 for <it>P</it>-value calculation). The list of the 22 ObesHancock genes and their supporting evidence are provided in Additional data file 7. Detailed analysis of the subset of the top 100 ranked obesity genes that does not overlap with the ObesHancock set reveals additional genes with potential roles in obesity. For instance, <it>NR1H3</it>, ranked 24th, is predominantly expressed in adipose tissue and plays an important role in cholesterol, lipid, and carbohydrate metabolism <abbrgrp><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr></abbrgrp>. Recently, Dahlman <it>et al</it>. <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> also found that one <it>NR1H3 </it>single nucleotide polymorphism (SNP), rs2279238, is associated with the obesity phenotype. Similarly, <it>NMUR2</it>, ranked 35th, is exclusively expressed in the central nervous system as a receptor for neuromedin U, a neuropeptide regulating feeding behavior and body weight <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Additionally, Schmolz <it>et al</it>. <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> found that a <it>NMUR2 </it>variant potentially related to obesity in a mouse model.</p>
         </sec>
         <sec>
            <st>
               <p>FLN-based identification of disease-disease associations at the molecular level</p>
            </st>
            <p>As described in the Introduction, human diseases tend to form an interrelated landscape. We hypothesize that the basis for these relationships stems from multiple diseases resulting from dysfunctions in the same genes, and more broadly, multiple diseases resulting from dysfunction of the same or related biological processes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Associations between diseases potentially stemming from common causal genes were previously reported by Goh <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Here we focus on quantifying associations between diseases based on perturbation in common biological processes by developing the concept of 'mutual predictability' (see Materials and methods). The mutual predictability between two diseases measures the extent to which genes known to be associated with either member of a disease pair can be used to identify genes known to be associated with the other member (see Materials and methods). We hypothesize that disease pairs with high mutual predictability will be closely related to each other, as a high mutual predictability should be indicative of high connectivity in the FLN between the two gene sets associated with two diseases, and hence should quantify the functional relatedness between diseases.</p>
            <p>We validate our mutual-predictability-based disease-disease associations at the molecular and gene network level, using disease-disease associations based on the classification in Goh <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, where the diseases in OMIM were manually partitioned into 22 classes based on physiological system-level phenotypic observations. After calculating the mutual predictability (Equation 7) between every possible disease pair (all mutual predictability scores are provided in Additional data file 8), we threshold pair selection with increasing score cutoffs. At each cutoff, we examine the fraction of disease pairs belonging to the same disease class (excluding 'unclassified class' and 'multiple class'; the former has insufficient information for disease class assignment, and the latter lacks physiological system specificity). As seen in Figure <figr fid="F7">7</figr>, the fraction of pairs placed in the categories defined by Goh <it>et al</it>. increases rapidly with increasing score cutoff. This demonstrates that FLN-based mutual predictability can capture disease-disease association in a quantitative way.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Fraction of related disease pairs increases as mutual predictability cutoffs increase</p>
               </caption>
               <text>
                  <p>Fraction of related disease pairs increases as mutual predictability cutoffs increase. Disease pairs are considered to be related if they belong to the same disease class based on Goh <it>et al</it>.'s manual classification <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2009-10-9-r91-7"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The FLN discloses hidden associations between diseases sharing no known disease genes and having dissimilar phenotypes</p>
            </st>
            <p>To visualize disease-disease association estimated by mutual predictability, we create a network of disease associations, in which the nodes represent individual diseases and the weighted edges represent mutual predictability. Figure <figr fid="F8">8a</figr> shows a high confidence subset of the disease network obtained by selecting the top 100 pairs (out of 5,995, that is, the top 1.7%; Figure S9 of Additional data file 5), corresponding to a mutual predictability cutoff of approximately 0.85. These 100 pairs cover a total of 66 diseases. At this cutoff, the disease pairs are four times more likely to share the same disease class than expected at random (Figure <figr fid="F7">7</figr>). Moreover, 97 of the 100 disease pairs are supported by various types of evidence, such as the classification scheme of Goh <it>et al</it>. (within the same disease class) or other literature evidence (Additional data file 9). These results suggest that disease pairs with high mutual predictability tend to be related.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>FLN-based disease-disease association network</p>
               </caption>
               <text>
                  <p>FLN-based disease-disease association network. <b>(a) </b>An example network of 66 diseases (nodes) linked by the top 100 edges with the highest mutual predictability scores. Many of these edges cannot be identified using the simple disease-gene sharing method, and these are colored red. The rest of the edges are colored blue, which link disease pairs with one or more overlapping disease genes. This disease-disease association network has a modular topology, and diseases tend to form clusters (rectangles). The disease clusters are labeled based on enriched disease class or underlying molecular mechanisms. The following disease nodes mentioned in the text are labeled: ovarian cancer (OC), prostate cancer (PC), Leber congenital amaurosis (LC), cone or cone-rod dystrophy (CD), retinitis pigmentosa (RP), night blindness (NB), Leigh syndrome (LS), mitochondrial DNA depletion syndrome (MD), combined oxidative phosphorylation deficiency (CO), mitochondrial complex I/II/III deficiency (MC), Alzheimer's disease (AZ), hypercholesterolemia (HC), pseudohypoaldosteronism (PH), Bartter syndrome (BS), holoprosencephaly (HP), Waardenburg syndrome (WS), central hypoventilation syndrome (CH), Hirschsprung disease (HD), ceroid-lipofuscinosis (CL), and glaucoma (GC). <b>(b) </b>Functional links among Alzheimer's disease genes (purple nodes) and hypercholesterolemia disease genes (blue nodes) in the FLN. Green edges represent functional links between Alzheimer's disease genes and hypercholesterolemia genes. Light blue edges represent functional links connecting genes associated with the same disease. Edge thickness is proportional to linkage weight. Both networks are generated by Visant <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2009-10-9-r91-8"/>
            </fig>
            <p>We compare our method with another available disease-disease association identification method proposed by Goh <it>et al</it>., which identifies the associations between two diseases by counting their overlapping disease genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. However, Goh <it>et al</it>.'s method could only identify the associations between diseases with known overlapping disease genes. In contrast, our method is able to identify additional associations for those diseases sharing no known disease genes but having dense functional links between their corresponding disease gene sets by taking advantage of the FLN.</p>
            <p>Among the 97 potentially related disease pairs with literature support, 48 pairs share known disease genes and are identified by both methods (pairs connected by blue links in Figure <figr fid="F8">8a</figr>). However, the remaining 49 pairs share no known disease genes, and their associations are identified solely based upon functional links among associated genes (pairs connected by red links in Figure <figr fid="F8">8a</figr>). An example of a non-trivial disease-disease linkage in the latter group is the association between Alzheimer's disease, a neurological disorder, and hypercholesterolemia, a metabolic disorder. Since the two diseases share no disease genes in OMIM, their predicted association is based entirely on the strong and dense functional links between the corresponding disease gene sets (Figure <figr fid="F8">8b</figr>). Importantly, associations between diseases identified using the FLN provide immediate insight into the molecular mechanisms underlying different diseases, and thus generate novel hypotheses for therapeutic strategies. For instance, based on the association of hypercholesterolemia and Alzheimer's diseases, we propose that high cholesterol may play an important role in the development of Alzheimer's disease and that modulation of cholesterol levels might help to reduce or delay the risk of Alzheimer's disease, which is indeed supported by recent literature <abbrgrp><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. Besides Alzheimer's disease and hypercholesterolemia, there are diverse disease-disease associations that are identified only by the FLN but not by the disease gene sharing method. These include night blindness/Leber's congenital amaurosis, which are both ophthalmological; pseudohypoaldosteronism/Bartter syndrome - both involved in ion transport deficiency); and holoprosencephaly/Waardenburg syndrome - both involved in developmental deficiencies (Additional data file 9). We provide a more quantitative comparison between our mutual predictability method and disease gene sharing method in Additional data file 5.</p>
            <p>Since our disease-disease association identified at the molecular level correlates with disease-disease associations based on phenotypic level classification (Figure <figr fid="F7">7</figr>), it is not surprising that some diseases in the same disease class are found to be connected in our disease network. For example, prostate cancer and ovarian cancer both belong to the cancer class, and are connected in our network. Potentially of more interest is the observation that among the 97 potentially associated disease pairs identified by high mutual predictability and supported by the literature, 54 disease pairs belong to different disease classes, as defined by Goh <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. This finding suggests that diseases with different phenotypes can share common etiology at the molecular level <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Examples of such disease pairs include Alzheimer's disease (neurological class) and hypercholesterolemia (metabolic class) described above, central hypoventilation syndrome (respiratory class) and Hirschsprung disease (gastrointestinal class), both of which are potentially caused by a defective developmental process <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and ceroid-lipofuscinosis (neurological class) and glaucoma (ophthamological class), both of which are involved in abnormalities in lysosomal function <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp> (Additional data file 9). These results suggest that standard disease classifications may be much less informative than currently thought.</p>
            <p>In addition to validating individual links, we visualize the topology of our disease-disease association network based on the network connectivity. We find that some network modules (such as those defined by rectangles in Figure <figr fid="F8">8a</figr>) can be readily identified through visual inspection. Specifically, diseases within a module tend to be more connected than diseases between different network modules. These disease modules tend to be enriched with homogeneous diseases belonging to the same or related classes, or sharing similar molecular mechanisms (Figure <figr fid="F8">8a</figr>, homogeneous modules are colored correspondingly; cluster details are listed in Additional data file 10 and Figure S9 in Additional data file 5). Examples are the ophthalmological disease clique of night blindness/Leber's congenital amaurosis/retinitis pigmentosa/cone or cone-rod dystrophy, and the mitochondrion deficiency clique of mitochondrial complex I/II/III deficiency/combined oxidative phosphorylation deficiency/Leigh syndrome/mitochondrial DNA depletion syndrome. These results confirm that the FLN-based mutual predictability measure can be used to identify biologically meaningful disease-disease associations.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Comparison with other disease gene prioritization methods</p>
            </st>
            <p>We have demonstrated that our FLN-based disease gene prioritization approach successfully predicts new candidate disease genes for diverse diseases. The method compares favorably to previous methods, providing 50% more coverage than <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> with accuracy comparable to <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B24">24</abbr><abbr bid="B72">72</abbr></abbrgrp>. (Detailed descriptions of the performance comparisons are provided in Additional data file 5.)</p>
            <p>A potential strategy for further improving on our predictive performance is to consider not only functional linkages between neighbors as was done here, but to also incorporate global network topology into disease gene identification <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. It should be noted, however, that the use of a global propagation scheme should be done with extra care. A global propagation scheme may more comprehensively exploit the information contained in the FLN by considering non-adjacent nodes, but at the same time it can be more susceptible to noise. Such issues are exemplified in recent work by Wu <it>et al</it>. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, where they compared a direct interaction based local rule and a shortest path rule, based on their abilities to prioritize disease genes. Surprisingly, the shortest path rule, which uses global network topological information, performs worse than the direct interaction rule <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. A likely reason is that although, in principle, rules utilizing global network topological information are preferred, they are more sensitive to the false positive and false negative links in the network. In practice, however, a properly implemented local rule can perform well, when combined with a comprehensive and weighted FLN.</p>
         </sec>
         <sec>
            <st>
               <p>Further considerations for disease gene prioritization</p>
            </st>
            <p>Among the data sources used for constructing the integrated FLN, literature-based text mining, GO annotation (GO annotations in the molecular function and cellular component categories are included in the integration), and curated PPI might be biased towards well characterized genes (Table <tblr tid="T1">1</tblr>). Similarly, text mining and curated PPI data mapped from mouse and rat (organisms close to human in the evolutionary tree) via orthology may also have the same bias. Consequently, predictions based on these sources tend to be biased towards well characterized genes. On the other hand, the remaining data sources are relatively unbiased, and include sequence data, high-throughput experiments, and functional associations mapped from yeast, worm, and fly (organisms distant to human in the evolutionary tree). Predictions from these latter sources can reveal new disease genes that are not well characterized. To test the extent to which our predictions depend on the relatively biased data sources, we remove them and use the remaining unbiased sources for FLN construction and disease gene prioritization. The prediction performance remains far better than random control (Figure S6 in Additional data file 5). Using the gene-centric assessment, over 40% of disease genes are still ranked within the top 10 positions in the artificial chromosome region background composed of 100 genes. The disease-centric assessment also shows similar results. Therefore, our predictions do not solely rely on the potentially biased text mining, molecular function and cellular component annotations of GO, and curated PPI data sources; other relatively unbiased sources also make significant contributions to the disease gene prediction and biological insights that go beyond well characterized genes.</p>
            <p>Our current framework assumes that known disease genes ('seed genes') contribute equally to the disease, but in reality some seed genes may contribute more than others, especially for complex diseases. An example is homozygosity in the <it>APOE4 </it>allele, which confers a 40% chance of late-onset Alzheimer's disease <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. Incorporation of this notion of 'seed strength' into our framework would be desirable. In addition, our framework requires the knowledge of a number of known disease genes as seed genes in order to predict new candidate genes. A further improvement might be to borrow seed genes from closely related diseases when the disease under study does not have enough seeds, adjusting the strength in accordance with how closely related the two diseases are.</p>
            <p>Additionally, FLN has been made available in VisANT <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, a web-based platform for the integrative visualization and analysis of biological networks and pathways <abbrgrp><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr></abbrgrp>. Users can interactively query the FLN for genes of interest by choosing FLN as the current database in the VisANT toolbar; compare the FLN with other interaction data by updating gene interactions stored in the Predictome database; and filter the FLN with preferred weights and visualize the weight using edge color, edge thickness or both. In addition, we are developing a series of functions to integrate the methods of FLN-based prediction of new disease genes and disease-disease associations into VisANT, including making predictions for the new seed sets provided by users. Each disease will be represented as a metanode <abbrgrp><abbr bid="B78">78</abbr></abbrgrp> - a special type of node that contains associated sub-nodes (disease genes), which allows direct integration of disease information and the FLN. These new functions will extend the applications developed here to other areas, such as the prediction of new proteins for protein complexes, cross-talk between pathways and so on.</p>
         </sec>
         <sec>
            <st>
               <p>Further considerations for FLN-based identification of disease-disease associations</p>
            </st>
            <p>Here, we carry out a proof-of-principle study with our mutual predictability measure to demonstrate that the functional links provided by the FLN can be used to quantify disease-disease associations at the molecular level as illustrated in the Results section. Further studies are needed to improve our measures as well as to explore related measures of disease-disease associations based on the FLN. Our approach for assessing disease-disease associations requires the knowledge of a few known disease genes for each disease, and cannot be applied to diseases with no known disease genes. In these cases, phenotype-based methods such as <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> are good alternatives if there is enough disease phenotype information available, bearing in mind that it is possible for two diseases to share related molecular pathology mechanisms but be less similar at the phenotypic level <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. In general, it is likely that the identification of disease-disease associations can be further improved by combing our FLN-based approach with phenotype-based approaches.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In this work, we build a comprehensive and high quality weighted FLN by integrating 16 functional genomics features assembled from 32 sub-features from 6 model organisms. To our knowledge, our integrated human FLN is the most comprehensive to date.</p>
         <p>We show that genome-wide disease gene prioritization for diverse diseases can be obtained using our integrated FLN. Top ranking candidate disease genes can be used to guide the design of new genetic association studies. Conversely, in the future, results from genetic association studies can be further integrated into our framework for better disease gene predictions.</p>
         <p>We also show that the integrated FLN can be used to identify disease-disease associations, even among diseases that share no known disease genes or have dissimilar phenotypes. Thus, we show that by considering functional linkages between genes involved in different diseases, relationships between diseases that are based on underlying molecular mechanisms can be revealed. Such associations can potentially be used to generate novel hypotheses on the molecular mechanisms of human diseases, and can in turn guide the development of relevant therapy as illustrated in the case of Alzheimer's disease and hypercholesterolemia.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Human dataset</p>
            </st>
            <p>All methods are applied to a common set of 25,564 protein encoding genes (downloaded from RefSeq in November 2007 <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>) with the Entrez gene ID as the unique identifier <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>. For data sources using different gene or protein identifiers, the original identifiers are all mapped to Entrez gene IDs using the cross-reference files from Entrez Gene <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>, the International Protein Index (IPI) database <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>, or Biomart <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>FLN construction</p>
            </st>
            <sec>
               <st>
                  <p>Gold standard</p>
               </st>
               <p>We construct gold standard datasets for gene-gene functional association in the following way. Gold-standard positives (GSPs) are defined as gene pairs sharing the same biological process term in GO. Gold-standard negatives (GSNs) are defined as gene pairs annotated by GO biological process terms but that do not share any term. As a result, 1,456,585 pairs are included in GSP, and 3,345,661 pairs are included in GSN. These gene pairs are composed of 5,955 human genes. Details are provided in Additional data files 5 and 11.</p>
            </sec>
            <sec>
               <st>
                  <p>Na&#239;ve Bayes classifier for FLN construction</p>
               </st>
               <p>We use a na&#239;ve Bayes classifier <abbrgrp><abbr bid="B84">84</abbr></abbrgrp> to predict gene pairs that participate in the same GO biological process. The inputs to the classifier include diverse genomic features assembled from various data sources (Table <tblr tid="T1">1</tblr>). Similar to other studies <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B85">85</abbr></abbrgrp>, we use a log likelihood ratio (LR) score to quantify the degree of functional association between two genes. The associated gene pairs are then assembled into a functional linkage network with individual genes as nodes and the LR score as the linkage weight.</p>
               <p>There are two primary reasons that we choose na&#239;ve Bayes as the integration approach. First, it allows for the direct integration of heterogeneous data sources in an easily interpretable model. Second, it calculates the probability that two genes participate in the same biological process given the input features. Na&#239;ve Bayes classifiers have been used to successfully combine heterogeneous data in human <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. It is possible, and perhaps likely, that other multivariate methods would perform equally as well <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>.</p>
               <p>In the Bayesian framework, we use the LR score to quantify the degree of functional association between a gene pair:</p>
               <p>
                  <display-formula id="M1">
                     <graphic file="gb-2009-10-9-r91-i1.gif"/>
                  </display-formula>
               </p>
               <p>where <it>I </it>is a binary variable representing the existence of functional association (GSP pairs), ~<it>I </it>represents the absence of functional association (GSN pairs), <it>f</it><sub>1 </sub>through <it>f</it><sub><it>n </it></sub>are the genomic features, and LR is the likelihood ratio.</p>
               <p>In na&#239;ve Bayes, we assume that features are conditionally independent. As a result, <it>LR</it>(<it>f</it><sub>1</sub>,..., <it>f</it><sub><it>n</it></sub>) can be further written as:</p>
               <p>
                  <display-formula id="M2">
                     <graphic file="gb-2009-10-9-r91-i2.gif"/>
                  </display-formula>
               </p>
               <p>Equation 2 is equivalent to the following, where LLR represents log likelihood ratios:</p>
               <p>
                  <display-formula id="M3">
                     <graphic file="gb-2009-10-9-r91-i3.gif"/>
                  </display-formula>
               </p>
               <p>Na&#239;ve Bayes assumes conditional independence among features, but the na&#239;ve Bayes classifier can still be applied even when this assumption is not strictly satisfied <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B85">85</abbr><abbr bid="B87">87</abbr></abbrgrp>. In our work, we minimize conditional dependence among features by combining related datasets or features into single features before final integration (Additional data file 5). Indeed, no strong correlations exist among our final features after this procedure (Additional data file 12).</p>
               <p>As listed in Table <tblr tid="T1">1</tblr>, a total of 16 features assembled from 32 sub-features are utilized for the final integration (see Additional data file 5 for details).</p>
            </sec>
            <sec>
               <st>
                  <p>FLN quality assessment</p>
               </st>
               <p>After the na&#239;ve Bayes integration, each gene pair is assigned a probability for sharing a GO biological process. The quality of this FLN is evaluated by plotting linkage precision versus linkage sensitivity at various linkage weight cutoffs. Given a pre-specified linkage weight cutoff, 'linkage precision' is defined as the fraction of the linked gold-standard gene pairs that belong to the GSP set, and 'linkage sensitivity' is defined as the fraction of the GSP pairs that are linked. We perform threefold cross-validation by randomly partitioning the gold standard datasets into three equal segments, two serving as training sets and the third as the test set. This procedure is repeated three times such that each segment serves once as the test set. Next, the three test sets are combined and the performance is assessed by plotting linkage precision versus linkage sensitivity. Finally the whole gold standard dataset is used for training to generate one final FLN, which is then used for disease gene prioritization and prediction of disease-disease associations.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>FLN-based disease gene prioritization</p>
            </st>
            <p>For a given disease, the known disease associated genes are used as seeds. New candidate genes can then be ranked based on their association with these seed genes in the FLN.</p>
            <sec>
               <st>
                  <p>Seed genes</p>
               </st>
               <p>The seed disease genes are obtained from the OMIM database, a compendium of human disease genes and phenotypes <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The disease-gene associations are downloaded from the Morbid Map (April 2008) <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. To obtain reliable test statistics, we include in the analysis only diseases with at least five seed genes in the FLN. This results in 110 unique diseases with a total of 1,025 seed genes (see Additional data files 2 and 5 for details).</p>
            </sec>
            <sec>
               <st>
                  <p>Neighborhood-weighting decision rule</p>
               </st>
               <p>Given a particular disease and its seed gene set, we quantify the association of each candidate gene <it>i </it>with the disease using the following disease association score <it>S</it><sub><it>i</it></sub>:</p>
               <p>
                  <display-formula id="M4">
                     <graphic file="gb-2009-10-9-r91-i4.gif"/>
                  </display-formula>
               </p>
               <p>where <it>w</it><sub><it>ij </it></sub>is the linkage weight connecting gene <it>i </it>and seed <it>j </it><abbrgrp><abbr bid="B41">41</abbr><abbr bid="B43">43</abbr></abbrgrp>. This score is 0 for genes not connected to any seed. The justifications of the rule selection are described in Additional data file 5.</p>
            </sec>
            <sec>
               <st>
                  <p>Performance assessment for disease gene prioritization</p>
               </st>
               <sec>
                  <st>
                     <p>Disease-centric assessment</p>
                  </st>
                  <p>Disease-centric assessment evaluates an overall prediction performance for each individual disease <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B43">43</abbr></abbrgrp>. In particular, for a given disease, first each gene is ranked based on the disease association score (<it>S</it><sub><it>i</it></sub>; Equation 4). The <it>S</it><sub><it>i </it></sub>for each seed gene is computed using leave-one-out cross-validation, based on its connectivity to other seeds. Next a ROC curve is generated to assess how well known disease (seed) genes are ranked relative to non-seed genes in the ranked gene list. The ROC curve plots true positive rate (sensitivity) versus false positive rate (1 - specificity) at different <it>S</it><sub><it>i </it></sub>cutoffs:</p>
                  <p>
                     <display-formula id="M5">
                        <graphic file="gb-2009-10-9-r91-i5.gif"/>
                     </display-formula>
                  </p>
                  <p>
                     <display-formula id="M6">
                        <graphic file="gb-2009-10-9-r91-i6.gif"/>
                     </display-formula>
                  </p>
                  <p>where TP (true positive) is the number of seed genes above the <it>S</it><sub><it>i </it></sub>cutoff, FP (false positive) is the number of non-seed genes above the cutoff, TN (true negative) is the number of non-seed genes below the cutoff, and FN (false negative) is the number of seed genes below the cutoff.</p>
                  <p>Finally, an AUC is calculated for each ROC curve to represent the predictability of a disease. AUC scores range between 0 and 1, with 0.5 and 1.0 indicating random and perfect predictive performance, respectively. Here we treat non-seed genes as non-disease genes, but some of these non-seed genes can be new disease-associated genes. Hence, our performance evaluation is likely an underestimate.</p>
               </sec>
               <sec>
                  <st>
                     <p>Gene-centric assessment</p>
                  </st>
                  <p>Gene-centric assessment treats each known gene-disease association as a test case <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B23">23</abbr></abbrgrp>. For each test case, the task is to assess how well the known disease (seed) gene ranks relative to a background gene set. Gene centric assessment is also carried out using leave-one-out cross-validation. For each test case, the tested seed gene is ranked according to its connectivity to the remaining seed genes (Equation 4), and compared against a background gene set.</p>
                  <p>To assess the overall performance of a method, we first determine the rank of each tested disease gene among the background gene set within each test case, and then pool together all test cases and compute the fraction of the tested disease genes ranked above various cutoffs. A second commonly used measure is fold enrichment, defined as the average rank of a gene before prioritization divided by the rank after prioritization <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. For instance, to rank a test disease gene with a background gene set of 100 genes, the average rank before prioritization is 50. If the test disease gene is ranked top 1, then its fold enrichment is 50/1 = 50.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Random control</p>
               </st>
               <p>Similar to McGary <it>et al</it>. <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, we assemble 100 random gene sets from the FLN for each given disease. Each gene set has the same number of genes as the seed gene set for the given disease. Next we perform the same FLN-based disease gene prioritization and evaluation using these randomly assembled seed sets.</p>
            </sec>
            <sec>
               <st>
                  <p>Estimate precision for disease gene predictions</p>
               </st>
               <p>To estimate the prediction for disease gene predictions, we first rank the predicted candidate disease genes by their disease association scores (<it>S</it><sub><it>i</it></sub>; Equation 4). Precision is then estimated at different rank cutoffs.</p>
               <p>For each particular disease, each known disease gene (seed) is assigned a disease association score (<it>S</it><sub><it>i</it></sub>) based on its functional linkage to other seeds. We then rank the seed genes together with other non-seed genes based on their <it>S</it><sub><it>i </it></sub>scores. At each rank cutoff, we calculate the precision as the fraction of genes above the cutoff that are seed genes. This is likely an underestimate, since some non-seed genes can be new disease-related genes.</p>
               <p>For each of the 110 diseases, we provide the precision estimate for its top 100 predicted new candidate genes by plotting precision versus rank cutoff.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Mutual predictability as an estimate for disease-disease association at the molecular level</p>
            </st>
            <p>Given two diseases, I and II, each of which is associated with a set of known disease genes, we define a mutual predictability score that reflects the degree of association between the two diseases (Figure S7 in Additional data file 5). The mutual predictability score accounts for both the number of genes in common between the two disease gene sets, and the functional links between the two sets in the FLN. In essence, the mutual predictability score evaluates the degree to which genes associated with disease I can be used as seed genes to predict genes associated with disease II (predictability <sub>I-II</sub>), and vice versa (predictability <sub>II-I</sub>). High mutual predictability implies close association between the two disease gene sets in the FLN, suggesting related molecular mechanisms.</p>
            <p>To quantify predictability <sub>I-II</sub>, we first use genes associated with disease I as seeds, and rank all other genes based on the <it>S</it><sub><it>i </it></sub>score (Equation 4). In addition, disease II genes that overlap with the seed genes from disease I are given a <it>S</it><sub><it>i </it></sub>score of infinity and ranked on the top. Next, we plot a ROC curve of sensitivity versus 1 - specificity on how well the disease II genes are ranked in the sorted gene list. Sensitivity and (1 - specificity) are defined in Equations 5 and 6, where TP is the number of disease II genes above a particular <it>S</it><sub><it>i </it></sub>cutoff, TN is the number of genes associated with neither disease below the cutoff, FP is the number of genes associated with neither disease above the cutoff, and FN is the number of disease II genes below the cutoff. Finally, we calculate the AUC of the ROC curve (AUC<sub>I-II</sub>) as a measure for predictability <sub>I-II</sub>.</p>
            <p>Similarly, we can calculate AUC<sub>II-I </sub>as a measure of predictability<sub>II-I</sub>. The mutual predictability between disease I and II is defined as the geometric mean of AUC<sub>I-II </sub>and AUC<sub>II-I</sub>:</p>
            <p>
               <display-formula id="M7">
                  <graphic file="gb-2009-10-9-r91-i7.gif"/>
               </display-formula>
            </p>
            <p>Since AUC ranges from 0 to 1, the mutual predictability also ranges from 0 to 1.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>AUC: area under receiver operating characteristic curve; CC: cellular component; DDI: domain-domain interaction; DS: protein domain sharing; FLN: functional linkage network; GO: Gene Ontology; GS: gold standard; GSP: gold standard positive; GSN: gold standard negative; GN: gene neighbour; HPRD: Human Protein Reference Database; KEGG: Kyoto Encyclopaedia of Genes and Genomes; LR: likelihood ratio; LLR: log likelihood ratio; Masspec: Mass Spectrometry; MF: molecular function; MIPs: The Munich Information Center for Protein Sequences; OMIM: Online Mendelian Inheritance in Man Database; PG: phylogenetic profiles; PPI: protein-protein interaction; RefSeq: Reference Sequence Database; ROC: receiver operating characteristic; TexM: text mining; Y2H: yeast two hybrid experiments.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BL designed and implemented the whole computation framework. ESS provided constructive discussions and revised the manuscript. ZH provided constructive discussions. YX monitored the whole framework. CD directed the whole project and is Principal Investigator on the NIH grant that funded the project. All the authors have read and agreed to the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: a text file listing edges in the integrated human functional linkage network (Additional data file <supplr sid="S1">1</supplr>); a table listing known (seed) disease genes associated with each of the 110 diseases (Additional data file <supplr sid="S2">2</supplr>); a list of the top 100 predicted candidate disease genes for each of the 110 diseases (Additional data file <supplr sid="S3">3</supplr>); a plot of precision versus rank cutoff for the top 100 predicted candidate disease genes for each of the 110 diseases (Additional data file <supplr sid="S4">4</supplr>); supplementary text and figures (Additional data file <supplr sid="S5">5</supplr>); a list of recently identified disease genes and landmark references dated after January 2007 (Additional data file <supplr sid="S6">6</supplr>); a list of the 22 predicted obesity genes among the top 100 predicted gene list that overlap with the obesity genes collected from literature by Hanock et al. <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> (Additional data file <supplr sid="S7">7</supplr>); a list of mutual predictability scores for all the possible disease pairs between the 110 diseases (Additional data file <supplr sid="S8">8</supplr>); a list of the top 100 disease pairs with the highest mutual predictability scores and the supporting evidence for the association (Additional data file <supplr sid="S9">9</supplr>); a list of the disease clusters, their disease members, and evidence supporting the associations in Figure S9 of Additional data file <supplr sid="S5">5</supplr> and Figure <figr fid="F8">8a</figr> (Additional data file <supplr sid="S10">10</supplr>); a list of the 53 informative GO terms used to define the gold standard sets for the na&#239;ve Bayes integration (Additional data file <supplr sid="S11">11</supplr>); a list of Pearson correlation coefficients between the 16 features used for the na&#239;ve Bayes integration (Additional data file <supplr sid="S12">12</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Edges in the integrated human functional linkage network</p>
            </caption>
            <text>
               <p>Column 1, Entrez Gene ID for gene A; column 2, Entrez Gene ID for gene B; column 3, functional linkage weight (log likelihood ratio of the na&#239;ve Bayes integration).</p>
            </text>
            <file name="gb-2009-10-9-r91-S1.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Known (seed) disease genes associated with each of the 110 diseases</p>
            </caption>
            <text>
               <p>Known (seed) disease genes associated with each of the 110 diseases.</p>
            </text>
            <file name="gb-2009-10-9-r91-S2.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>The top 100 predicted candidate disease genes for each of the 110 diseases</p>
            </caption>
            <text>
               <p>A WinRAR archive composed of a readme.txt file and 110 prediction files for the 110 diseases. For each disease, we list the top 100 ranked new candidate disease genes not included in the disease seed gene set. The description of the prediction files is provided in the readme.txt file.</p>
            </text>
            <file name="gb-2009-10-9-r91-S3.rar">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Plot of precision versus rank cutoff for the top 100 predicted candidate disease genes for each of the 110 diseases</p>
            </caption>
            <text>
               <p>Plot of precision versus rank cutoff for the top 100 predicted candidate disease genes for each of the 110 diseases.</p>
            </text>
            <file name="gb-2009-10-9-r91-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Supplementary text and figures</p>
            </caption>
            <text>
               <p>Supplemental methods, supplementary results, supplementary Figures S1 to S9, and supplementary Table S1 to S4.</p>
            </text>
            <file name="gb-2009-10-9-r91-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>Recently identified disease genes and landmark references dated after January 2007</p>
            </caption>
            <text>
               <p>Recently identified disease genes and landmark references dated after January 2007.</p>
            </text>
            <file name="gb-2009-10-9-r91-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>The 22 predicted obesity genes among the top 100 predicted gene list that overlap with the obesity genes collected from literature by Hanock <it>et al</it>. <abbrgrp><abbr bid="B58">58</abbr></abbrgrp></p>
            </caption>
            <text>
               <p>The 22 predicted obesity genes among the top 100 predicted gene list that overlap with the obesity genes collected from literature by Hanock <it>et al</it>. <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>.</p>
            </text>
            <file name="gb-2009-10-9-r91-S7.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>Mutual predictability scores for all the possible disease pairs between the 110 diseases</p>
            </caption>
            <text>
               <p>Mutual predictability scores for all the possible disease pairs between the 110 diseases.</p>
            </text>
            <file name="gb-2009-10-9-r91-S8.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>The top 100 disease pairs with the highest mutual predictability scores and the supporting evidence for the association</p>
            </caption>
            <text>
               <p>The top 100 disease pairs with the highest mutual predictability scores and the supporting evidence for the association.</p>
            </text>
            <file name="gb-2009-10-9-r91-S9.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S10">
            <title>
               <p>Additional data file 10</p>
            </title>
            <caption>
               <p>Disease clusters, their disease members, and evidence supporting the associations in Figure S9 of Additional data file 5 and Figure <figr fid="F8">8a</figr></p>
            </caption>
            <text>
               <p>Disease clusters, their disease members, and evidence supporting the associations in Figure S9 of Additional data file 5 and Figure <figr fid="F8">8a</figr>.</p>
            </text>
            <file name="gb-2009-10-9-r91-S10.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S11">
            <title>
               <p>Additional data file 11</p>
            </title>
            <caption>
               <p>The 53 informative GO terms used to define the gold standard sets for the na&#239;ve Bayes integration</p>
            </caption>
            <text>
               <p>The 53 informative GO terms used to define the gold standard sets for the na&#239;ve Bayes integration.</p>
            </text>
            <file name="gb-2009-10-9-r91-S11.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S12">
            <title>
               <p>Additional data file 12</p>
            </title>
            <caption>
               <p>Pearson correlation coefficients between the 16 features used for the na&#239;ve Bayes integration</p>
            </caption>
            <text>
               <p>Pearson correlation coefficients between the 16 features used for the na&#239;ve Bayes integration.</p>
            </text>
            <file name="gb-2009-10-9-r91-S12.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>CD acknowledges funding from the NIH R01 RR022971. YX is supported by a Research Starter Grant in Informatics from the PhRMA Foundation.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Phenome connections.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>103</fpage>
            <lpage>106</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2007.12.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">18243400</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The modular nature of genetic diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Clin Genet</source>
            <pubdate>2007</pubdate>
            <volume>71</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1399-0004.2006.00708.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17204041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A text-mining analysis of the human phenome.</p>
            </title>
            <aug>
               <au>
                  <snm>van surDriel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bruggeman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vriend</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Leunissen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Eur J Hum Genet</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <fpage>535</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201585</pubid>
                  <pubid idtype="pmpid" link="fulltext">16493445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The human disease network.</p>
            </title>
            <aug>
               <au>
                  <snm>Goh</snm>
                  <fnm>KI</fnm>
               </au>
               <au>
                  <snm>Cusick</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Valle</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Childs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>8685</fpage>
            <lpage>8690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0701361104</pubid>
                  <pubid idtype="pmcid">1885563</pubid>
                  <pubid idtype="pmpid" link="fulltext">17502601</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Network-based model weighting to detect multiple loci influencing complex diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Hum Genet</source>
            <pubdate>2008</pubdate>
            <volume>124</volume>
            <fpage>225</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00439-008-0545-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">18719944</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Protein networks in disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>644</fpage>
            <lpage>652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.071852.107</pubid>
                  <pubid idtype="pmpid" link="fulltext">18381899</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>High-dimensional and large-scale phenotyping of yeast mutants.</p>
            </title>
            <aug>
               <au>
                  <snm>Ohya</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sese</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yukawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Nakatani</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Saka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fukuda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ishihara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Oka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirata</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ohtani</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sawai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fraysse</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fraysse</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Francois</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Aebi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sonoike</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nogami</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morishita</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>19015</fpage>
            <lpage>19020</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0509436102</pubid>
                  <pubid idtype="pmcid">1316885,1316885</pubid>
                  <pubid idtype="pmpid" link="fulltext">16365294</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A global view of pleiotropy and phenotypically derived gene function in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Dudley</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Janse</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Tanay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shamir</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <note>2005 0001.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/msb4100004</pubid>
                  <pubid idtype="pmcid">1681449</pubid>
                  <pubid idtype="pmpid" link="fulltext">16729036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A similarity-based method for genome-wide prediction of disease-relevant human genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Freudenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Propping</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>Suppl 2</issue>
            <fpage>S110</fpage>
            <lpage>115</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12385992</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A protein-protein interaction network for human inherited ataxias and disorders of Purkinje cell degeneration.</p>
            </title>
            <aug>
               <au>
                  <snm>Lim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Szabo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rual</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Fisk</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Smolyar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zoghbi</snm>
                  <fnm>HY</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>801</fpage>
            <lpage>814</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.03.032</pubid>
                  <pubid idtype="pmpid" link="fulltext">16713569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A protein interaction network links GIT1, an enhancer of huntingtin aggregation, to Huntington's disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Goehler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lalowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stelzl</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Waelter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stroedicke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Worm</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Droege</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lindenberg</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Knoblich</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haenig</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Herbst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Suopanki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Scherzinger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Abraham</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hasenbank</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fritzsche</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ludewig</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Bussow</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Coleman</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Gutekunst</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Landwehrmeyer</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wanker</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <fpage>853</fpage>
            <lpage>865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.09.016</pubid>
                  <pubid idtype="pmpid" link="fulltext">15383276</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A network-based analysis of systemic inflammation in humans.</p>
            </title>
            <aug>
               <au>
                  <snm>Calvano</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Felciano</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>HV</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>RO</fnm>
               </au>
               <au>
                  <snm>Brownstein</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Cobb</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Tschoeke</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Miller-Graziano</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moldawer</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Mindrinos</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Tompkins</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Lowry</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1032</fpage>
            <lpage>1037</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03985</pubid>
                  <pubid idtype="pmpid" link="fulltext">16136080</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Integrating genetic and network analysis to characterize genes related to mouse weight.</p>
            </title>
            <aug>
               <au>
                  <snm>Ghazalpour</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Doss</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Plaisier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Castellanos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Brozell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schadt</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Drake</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Lusis</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e130</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.0020130</pubid>
                  <pubid idtype="pmcid">1550283,1550283</pubid>
                  <pubid idtype="pmpid" link="fulltext">16934000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Network modeling links breast cancer susceptibility and centrosome dysfunction.</p>
            </title>
            <aug>
               <au>
                  <snm>Pujana</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Starita</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Tewari</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ahn</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Rennert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kirchhoff</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gold</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Assmann</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Elshamy</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Rual</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rozek</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Gelman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Gunsalus</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Sobhian</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bertin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Venkatesan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ayivi-Guedehoussou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sole</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hernandez</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lazaro</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nathanson</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Cusick</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Offit</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>1338</fpage>
            <lpage>1349</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.2007.2</pubid>
                  <pubid idtype="pmpid" link="fulltext">17922014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Franke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bakel</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fokkens</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>de surJong</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Egmont-Petersen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wijmenga</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2006</pubdate>
            <volume>78</volume>
            <fpage>1011</fpage>
            <lpage>1025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/504300</pubid>
                  <pubid idtype="pmcid">1474084</pubid>
                  <pubid idtype="pmpid" link="fulltext">16685651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The genomic landscapes of human breast and colorectal cancers.</p>
            </title>
            <aug>
               <au>
                  <snm>Wood</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Parsons</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sjoblom</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Leary</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Boca</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ptak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Silliman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Szabo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dezso</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ustyanksky</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nikolsky</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Karchin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Kaminker</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Croshaw</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Willis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shipitsin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Willson</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Sukumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Polyak</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Pethiyagoda</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Pant</snm>
                  <fnm>PV</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <fpage>1108</fpage>
            <lpage>1113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1145720</pubid>
                  <pubid idtype="pmpid" link="fulltext">17932254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Prediction of human disease genes by human-mouse conserved coexpression analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Ala</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Piro</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Grassi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Damasco</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Silengo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Provero</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Di surCunto</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e1000043</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pcbi.1000043</pubid>
                  <pubid idtype="pmcid">2268251</pubid>
                  <pubid idtype="pmpid" link="fulltext">18369433</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Using protein complexes to predict phenotypic effects of gene mutation.</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Plotkin</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R252</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-11-r252</pubid>
                  <pubid idtype="pmcid">2258176</pubid>
                  <pubid idtype="pmpid" link="fulltext">18042286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Walking the interactome for prioritization of candidate disease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kohler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>PN</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2008</pubdate>
            <volume>82</volume>
            <fpage>949</fpage>
            <lpage>958</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ajhg.2008.02.013</pubid>
                  <pubid idtype="pmcid">2427257</pubid>
                  <pubid idtype="pmpid" link="fulltext">18371930</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A human phenome-interactome network of protein complexes implicated in genetic disorders.</p>
            </title>
            <aug>
               <au>
                  <snm>Lage</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karlberg</snm>
                  <fnm>EO</fnm>
               </au>
               <au>
                  <snm>Storling</snm>
                  <fnm>ZM</fnm>
               </au>
               <au>
                  <snm>Olason</snm>
                  <fnm>PI</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Rigina</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hinsby</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Tumer</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Pociot</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tommerup</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>309</fpage>
            <lpage>316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1295</pubid>
                  <pubid idtype="pmpid" link="fulltext">17344885</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Predicting disease genes using protein-protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>J Med Genet</source>
            <pubdate>2006</pubdate>
            <volume>43</volume>
            <fpage>691</fpage>
            <lpage>698</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/jmg.2006.041376</pubid>
                  <pubid idtype="pmcid">2564594</pubid>
                  <pubid idtype="pmpid" link="fulltext">16611749</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Conserved co-expression for candidate disease gene prioritization.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>van surReeuwijk</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>208</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-9-208</pubid>
                  <pubid idtype="pmcid">2383918</pubid>
                  <pubid idtype="pmpid" link="fulltext">18433471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Network-based global inference of human disease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>189</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/msb.2008.27</pubid>
                  <pubid idtype="pmcid">2424293</pubid>
                  <pubid idtype="pmpid" link="fulltext">18463613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Gene prioritization through genomic data fusion.</p>
            </title>
            <aug>
               <au>
                  <snm>Aerts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lambrechts</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Maity</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van surLoo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Coessens</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>De surSmet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tranchevent</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>De surMoor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Marynen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hassan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Carmeliet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2006</pubdate>
            <volume>24</volume>
            <fpage>537</fpage>
            <lpage>544</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1203</pubid>
                  <pubid idtype="pmpid" link="fulltext">16680138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Association of genes to genetically inherited diseases using data mining.</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>316</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12006977</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Integration of text- and data-mining using ontologies successfully selects disease gene candidates.</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kelso</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bajic</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>1544</fpage>
            <lpage>1552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki296</pubid>
                  <pubid idtype="pmcid">1065256</pubid>
                  <pubid idtype="pmpid" link="fulltext">15767279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>POCUS: mining genomic sequence annotation to predict disease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Clutterbuck</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R75</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-11-r75</pubid>
                  <pubid idtype="pmcid">329128</pubid>
                  <pubid idtype="pmpid" link="fulltext">14611661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Speeding disease gene discovery by sequence based candidate prioritization.</p>
            </title>
            <aug>
               <au>
                  <snm>Adie</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Porteous</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>55</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-6-55</pubid>
                  <pubid idtype="pmcid">1274252</pubid>
                  <pubid idtype="pmpid" link="fulltext">15766383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Systematic identification of human mitochondrial disease genes through integrative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Calvo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jain</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Sheth</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Goldberger</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Spinazzola</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zeviani</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carr</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Mootha</snm>
                  <fnm>VK</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>576</fpage>
            <lpage>582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1776</pubid>
                  <pubid idtype="pmpid" link="fulltext">16582907</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A computational system to select candidate genes for complex human traits.</p>
            </title>
            <aug>
               <au>
                  <snm>Gaulton</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Mohlke</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Vision</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>1132</fpage>
            <lpage>1140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm001</pubid>
                  <pubid idtype="pmpid" link="fulltext">17237041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Genome-wide identification of genes likely to be involved in human genetic disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>3108</fpage>
            <lpage>3114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh605</pubid>
                  <pubid idtype="pmcid">434425</pubid>
                  <pubid idtype="pmpid" link="fulltext">15181176</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Shriner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baye</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Padilla</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vaughan</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Loraine</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>e26</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn007</pubid>
                  <pubid idtype="pmcid">2275105</pubid>
                  <pubid idtype="pmpid" link="fulltext">18263617</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A genomewide functional network for the laboratory mouse.</p>
            </title>
            <aug>
               <au>
                  <snm>Guan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lemischka</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e1000165</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pcbi.1000165</pubid>
                  <pubid idtype="pmcid">2527685</pubid>
                  <pubid idtype="pmpid" link="fulltext">18818725</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>An en masse phenotype and function prediction system for <it>Mus musculus </it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Tasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>Suppl 1</issue>
            <fpage>S8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s8</pubid>
                  <pubid idtype="pmcid">2447542</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613952</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The KEGG resource for deciphering the genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okuno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D277</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh063</pubid>
                  <pubid idtype="pmcid">308797</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681412</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Reactome: a knowledge base of biologic pathways and processes.</p>
            </title>
            <aug>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Joshi-Tope</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gopinath</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Croft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>de surBono</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jassal</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R39</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-3-r39</pubid>
                  <pubid idtype="pmcid">1868929</pubid>
                  <pubid idtype="pmpid" link="fulltext">17367534</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Probing genetic overlap among complex human phenotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>Rzhetsky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wajngurt</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>11694</fpage>
            <lpage>11699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0704820104</pubid>
                  <pubid idtype="pmcid">1906727,1906727</pubid>
                  <pubid idtype="pmpid" link="fulltext">17609372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Probabilistic prediction and ranking of human protein-protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Scott</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>239</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-8-239</pubid>
                  <pubid idtype="pmcid">1939716</pubid>
                  <pubid idtype="pmpid" link="fulltext">17615067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Probabilistic model of the human protein-protein interaction network.</p>
            </title>
            <aug>
               <au>
                  <snm>Rhodes</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Tomlins</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Varambally</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mahavisno</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Barrette</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kalyana-Sundaram</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <fpage>951</fpage>
            <lpage>959</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1103</pubid>
                  <pubid idtype="pmpid" link="fulltext">16082366</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.</p>
            </title>
            <aug>
               <au>
                  <snm>Hamosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Amberger</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bocchini</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>McKusick</snm>
                  <fnm>VA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D514</fpage>
            <lpage>517</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki033</pubid>
                  <pubid idtype="pmcid">539987</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608251</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A single gene network accurately predicts phenotypic effects of gene perturbation in <it>Caenorhabditis elegans </it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Crombie</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <fpage>181</fpage>
            <lpage>188</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.2007.70</pubid>
                  <pubid idtype="pmpid" link="fulltext">18223650</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D61</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl842</pubid>
                  <pubid idtype="pmcid">1716718</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Broad network-based predictability of <it>Saccharomyces cerevisiae </it>gene loss-of-function phenotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>McGary</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R258</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-12-r258</pubid>
                  <pubid idtype="pmcid">2246260</pubid>
                  <pubid idtype="pmpid" link="fulltext">18053250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Functional annotation from predicted protein interaction networks.</p>
            </title>
            <aug>
               <au>
                  <snm>McDermott</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Samudrala</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3217</fpage>
            <lpage>3226</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti514</pubid>
                  <pubid idtype="pmpid" link="fulltext">15919725</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Human protein reference database--2006 update.</p>
            </title>
            <aug>
               <au>
                  <snm>Mishra</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Suresh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kumaran</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kannabiran</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Suresh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bala</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shivakumar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Anuradha</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Reddy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Raghavan</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Menon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hanumanthu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Upendran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mahesh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jacob</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mathew</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chatterjee</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Arun</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chandrika</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Deshpande</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Palvankar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Raghavnath</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Krishnakanth</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Karathia</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rekha</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nayak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Vishnupriya</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D411</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkj141</pubid>
                  <pubid idtype="pmcid">1347503</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381900</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>BIND: the Biomolecular Interaction Network Database.</p>
            </title>
            <aug>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Betel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CW</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>248</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkg056</pubid>
                  <pubid idtype="pmcid">165503</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The BioGRID Interaction Database: 2008 update.</p>
            </title>
            <aug>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Reguly</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Livstone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oughtred</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lackner</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Bahler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D637</fpage>
            <lpage>640</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm1001</pubid>
                  <pubid idtype="pmcid">2238873</pubid>
                  <pubid idtype="pmpid" link="fulltext">18000002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>IntAct - open source resource for molecular interaction data.</p>
            </title>
            <aug>
               <au>
                  <snm>Kerrien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Alam-Faruque</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Aranda</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bancarz</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bridge</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Derow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dimmer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Feuermann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Friedrichsen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Huntley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kohler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Khadake</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Leroy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Liban</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lieftink</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Montecchi-Palazzi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Orchard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Risse</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Robbe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Roechert</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Thorneycroft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D561</fpage>
            <lpage>565</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl958</pubid>
                  <pubid idtype="pmcid">1751531</pubid>
                  <pubid idtype="pmpid" link="fulltext">17145710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>MIPS: analysis and annotation of genome information in 2007.</p>
            </title>
            <aug>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Dietmann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gregory</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mayer</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Munsterkotter</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ruepp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Spannagl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stumpflen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rattei</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D196</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm980</pubid>
                  <pubid idtype="pmcid">2238900</pubid>
                  <pubid idtype="pmpid" link="fulltext">18158298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The Database of Interacting Proteins: 2004 update.</p>
            </title>
            <aug>
               <au>
                  <snm>Salwinski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Pettit</snm>
                  <fnm>FK</fnm>
               </au>
               <au>
                  <snm>Bowie</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D449</fpage>
            <lpage>451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh086</pubid>
                  <pubid idtype="pmcid">308820</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681454</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Searching the protein interaction space through the MINT Database.</p>
            </title>
            <aug>
               <au>
                  <snm>Chatr-Aryamontri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zanzoni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ceol</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cesareni</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>484</volume>
            <fpage>305</fpage>
            <lpage>317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">full_text</pubid>
                  <pubid idtype="pmpid" link="fulltext">18592188</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Towards a proteome-scale map of the human protein-protein interaction network.</p>
            </title>
            <aug>
               <au>
                  <snm>Rual</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Venkatesan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hirozane-Kishikawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dricot</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Berriz</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Dreze</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ayivi-Guedehoussou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Klitgord</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boxem</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Milstein</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Franklin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Albala</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fraughton</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Llamosas</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cevik</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bex</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sikorski</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Vandenhaute</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zoghbi</snm>
                  <fnm>HY</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1173</fpage>
            <lpage>1178</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04209</pubid>
                  <pubid idtype="pmpid" link="fulltext">16189514</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Large-scale mapping of human protein-protein interactions by mass spectrometry.</p>
            </title>
            <aug>
               <au>
                  <snm>Ewing</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Elisma</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Climie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McBroom-Cerajewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>O'Connor</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dharsee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Heilbut</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ornatsky</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bukhman</snm>
                  <fnm>YV</fnm>
               </au>
               <au>
                  <snm>Ethier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sheng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Vasilescu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abu-Farha</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lambert</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Duewel</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>II</fnm>
               </au>
               <au>
                  <snm>Kuehl</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Colwill</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gladwish</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Muskat</snm>
                  <fnm>B</fnm>
               </au>
               <etal/>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>89</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/msb4100134</pubid>
                  <pubid idtype="pmcid">1847948</pubid>
                  <pubid idtype="pmpid" link="fulltext">17353931</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>A human protein-protein interaction network: a resource for annotating the proteome.</p>
            </title>
            <aug>
               <au>
                  <snm>Stelzl</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Worm</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Lalowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haenig</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Brembeck</snm>
                  <fnm>FH</fnm>
               </au>
               <au>
                  <snm>Goehler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Stroedicke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zenkner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schoenherr</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Koeppen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Timm</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mintzlaff</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Abraham</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bock</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kietzmann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goedde</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Toksoz</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Droege</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Krobitsch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Korn</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Birchmeier</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wanker</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <fpage>957</fpage>
            <lpage>968</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.08.029</pubid>
                  <pubid idtype="pmpid" link="fulltext">16169070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A first-draft human protein-interaction map.</p>
            </title>
            <aug>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R63</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2004-5-9-r63</pubid>
                  <pubid idtype="pmcid">522870</pubid>
                  <pubid idtype="pmpid" link="fulltext">15345047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>STRING 7 - recent developments in the integration and prediction of protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>von surMering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chaffron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D358</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl825</pubid>
                  <pubid idtype="pmcid">1669762</pubid>
                  <pubid idtype="pmpid" link="fulltext">17098935</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Obesity, metabolic syndrome, and cardiovascular disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>J Clin Endocrinol Metab</source>
            <pubdate>2004</pubdate>
            <volume>89</volume>
            <fpage>2595</fpage>
            <lpage>2600</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1210/jc.2004-0372</pubid>
                  <pubid idtype="pmpid" link="fulltext">15181029</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Adaptations to climate in candidate genes for common metabolic disorders.</p>
            </title>
            <aug>
               <au>
                  <snm>Hancock</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Witonsky</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Eshel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Coop</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Di surRienzo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e32</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.0040032</pubid>
                  <pubid idtype="pmcid">2242814,2242814</pubid>
                  <pubid idtype="pmpid" link="fulltext">18282109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>LXRbeta is required for adipocyte growth, glucose homeostasis, and beta cell function.</p>
            </title>
            <aug>
               <au>
                  <snm>Gerin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dolinsky</snm>
                  <fnm>VW</fnm>
               </au>
               <au>
                  <snm>Shackman</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Burant</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Steffensen</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Gustafsson</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>MacDougald</snm>
                  <fnm>OA</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>280</volume>
            <fpage>23024</fpage>
            <lpage>23031</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M412564200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15831500</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Liver X receptor signaling pathways in cardiovascular disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Tontonoz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mangelsdorf</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Mol Endocrinol</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <fpage>985</fpage>
            <lpage>993</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1210/me.2003-0061</pubid>
                  <pubid idtype="pmpid" link="fulltext">12690094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>PPAR- and LXR-dependent pathways controlling lipid metabolism and the development of atherosclerosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>CK</fnm>
               </au>
            </aug>
            <source>J Lipid Res</source>
            <pubdate>2004</pubdate>
            <volume>45</volume>
            <fpage>2161</fpage>
            <lpage>2173</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1194/jlr.R400010-JLR200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489539</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Putative metabolic effects of the liver X receptor (LXR).</p>
            </title>
            <aug>
               <au>
                  <snm>Steffensen</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Gustafsson</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Diabetes</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <issue>Suppl 1</issue>
            <fpage>S36</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2337/diabetes.53.2007.S36</pubid>
                  <pubid idtype="pmpid" link="fulltext">14749264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Liver X receptors and the control of cholesterol homeostasis: potential therapeutic targets for the treatment of atherosclerosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Millatt</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bocher</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fruchart</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Staels</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>2003</pubdate>
            <volume>1631</volume>
            <fpage>107</fpage>
            <lpage>118</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12633677</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Liver X receptor gene polymorphisms and adipose tissue expression levels in obesity.</p>
            </title>
            <aug>
               <au>
                  <snm>Dahlman</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jiao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hoffstedt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lindgren</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Humphreys</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kere</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gustafsson</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Arner</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dahlman-Wright</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Pharmacogenet Genomics</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>881</fpage>
            <lpage>889</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.fpc.0000236334.49422.48</pubid>
                  <pubid idtype="pmpid" link="fulltext">17108812</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Chemical genetic analysis reveals the effects of NMU2R on the expression of peptide hormones.</p>
            </title>
            <aug>
               <au>
                  <snm>Fang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Neurosci Lett</source>
            <pubdate>2006</pubdate>
            <volume>404</volume>
            <fpage>148</fpage>
            <lpage>153</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.neulet.2006.05.034</pubid>
                  <pubid idtype="pmpid" link="fulltext">16781063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Role of neuromedin-U in the central control of feeding behavior: a variant of the neuromedin-U receptor 2 contributes to hyperphagia in the New Zealand obese mouse.</p>
            </title>
            <aug>
               <au>
                  <snm>Schmolz</snm>
                  <fnm>KMP</fnm>
               </au>
               <au>
                  <snm>Bufe</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nogueiras</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Scherneck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nestler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zahn</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>R&#252;schendorf</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tsch&#246;p</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meyerhof</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Joost</snm>
                  <fnm>H-G</fnm>
               </au>
               <au>
                  <snm>Sch&#252;rmann</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Obesity Metabolism</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>28</fpage>
            <lpage>37</lpage>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Cholesterol retention in Alzheimer's brain is responsible for high beta- and gamma-secretase activities and Abeta production.</p>
            </title>
            <aug>
               <au>
                  <snm>Xiong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Callaghan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Lue</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Beach</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Sue</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Woulfe</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Stanimirovic</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Neurobiol Dis</source>
            <pubdate>2008</pubdate>
            <volume>29</volume>
            <fpage>422</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.nbd.2007.10.005</pubid>
                  <pubid idtype="pmcid">2720683</pubid>
                  <pubid idtype="pmpid" link="fulltext">18086530</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Fatty acids, lipid metabolism and Alzheimer pathology.</p>
            </title>
            <aug>
               <au>
                  <snm>Hooijmans</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Kiliaan</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Eur J Pharmacol</source>
            <pubdate>2008</pubdate>
            <volume>585</volume>
            <fpage>176</fpage>
            <lpage>196</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ejphar.2007.11.081</pubid>
                  <pubid idtype="pmpid" link="fulltext">18378224</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Cholesterol as a risk factor for dementia and cognitive decline: a systematic review of prospective studies with meta-analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Anstey</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Lipnicki</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Low</snm>
                  <fnm>LF</fnm>
               </au>
            </aug>
            <source>Am J Geriatr Psychiatry</source>
            <pubdate>2008</pubdate>
            <volume>16</volume>
            <fpage>343</fpage>
            <lpage>354</lpage>
            <xrefbib>
               <pubid idtype="pmpid">18448847</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>[Neurological presentations of lysosomal diseases in adult patients].</p>
            </title>
            <aug>
               <au>
                  <snm>Sedel</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Turpin</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Rev Neurol (Paris)</source>
            <pubdate>2007</pubdate>
            <volume>163</volume>
            <fpage>919</fpage>
            <lpage>929</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18033028</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Lysosomal enzymes in ocular tissues and diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Hayasaka</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Surv Ophthalmol</source>
            <pubdate>1983</pubdate>
            <volume>27</volume>
            <fpage>245</fpage>
            <lpage>258</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0039-6257(83)90125-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">6342190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>SUSPECTS: enabling fast and effective prioritization of positional candidates.</p>
            </title>
            <aug>
               <au>
                  <snm>Adie</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Porteous</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>773</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btk031</pubid>
                  <pubid idtype="pmpid" link="fulltext">16423925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Apolipoprotein E: high-avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Strittmatter</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Schmechel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pericak-Vance</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Enghild</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Salvesen</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Roses</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1993</pubdate>
            <volume>90</volume>
            <fpage>1977</fpage>
            <lpage>1981</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.90.5.1977</pubid>
                  <pubid idtype="pmcid">46003</pubid>
                  <pubid idtype="pmpid">8446617</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Integrative Visual Analysis Tool for Biological Networks and Pathways</p>
            </title>
            <url>http://visant.bu.edu/</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>VisANT: an integrative framework for networks in systems biology.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Snitkin</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>317</fpage>
            <lpage>325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/bbn020</pubid>
                  <pubid idtype="pmcid">2743399</pubid>
                  <pubid idtype="pmpid" link="fulltext">18463131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>VisANT 3.0: new modules for pathway visualization, editing, prediction and construction.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mellor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Linghu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>W625</fpage>
            <lpage>632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm295</pubid>
                  <pubid idtype="pmcid">1933155</pubid>
                  <pubid idtype="pmpid" link="fulltext">17586824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>VisANT: data-integrating visual framework for biological networks and modules.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mellor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Holloway</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Delisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W352</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki431</pubid>
                  <pubid idtype="pmcid">1160192</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980487</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Towards zoomable multidimensional maps of the cell.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mellor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>547</fpage>
            <lpage>554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1304</pubid>
                  <pubid idtype="pmpid" link="fulltext">17483841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>VisANT: an online visualization and analysis tool for biological interaction data.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mellor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>17</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-5-17</pubid>
                  <pubid idtype="pmcid">368431</pubid>
                  <pubid idtype="pmpid" link="fulltext">15028117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology.</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Huyck</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>W115</fpage>
            <lpage>121</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkp406</pubid>
                  <pubid idtype="pmcid">2703932</pubid>
                  <pubid idtype="pmpid" link="fulltext">19465394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Entrez Gene: gene-centered information at NCBI.</p>
            </title>
            <aug>
               <au>
                  <snm>Maglott</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D26</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl993</pubid>
                  <pubid idtype="pmcid">1761442</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Annotating single amino acid polymorphisms in the UniProt/Swiss-Prot knowledgebase.</p>
            </title>
            <aug>
               <au>
                  <snm>Yip</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Famiglietti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gos</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Duek</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Gateau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>2008</pubdate>
            <volume>29</volume>
            <fpage>361</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/humu.20671</pubid>
                  <pubid idtype="pmpid" link="fulltext">18175334</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>EnsMart: a generic system for fast and flexible access to biological data.</p>
            </title>
            <aug>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rocca-Serra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>160</fpage>
            <lpage>169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.1645104</pubid>
                  <pubid idtype="pmcid">314293</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <aug>
               <au>
                  <snm>Duda</snm>
                  <fnm>RO</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Stork</snm>
                  <fnm>aDG</fnm>
               </au>
            </aug>
            <source>Pattern Classification</source>
            <publisher>New York, NY: Wiley-Interscience</publisher>
            <edition>2</edition>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Genome-wide prediction of <it>C. elegans </it>genetic interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sternberg</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <fpage>1481</fpage>
            <lpage>1484</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1123287</pubid>
                  <pubid idtype="pmpid" link="fulltext">16527984</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>High-precision high-coverage functional inference from integrated data sources.</p>
            </title>
            <aug>
               <au>
                  <snm>Linghu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Snitkin</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Holloway</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Gustafson</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>119</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-9-119</pubid>
                  <pubid idtype="pmcid">2292694</pubid>
                  <pubid idtype="pmpid" link="fulltext">18298847</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Evaluation of different biological data and computational classification methods for use in protein interaction prediction.</p>
            </title>
            <aug>
               <au>
                  <snm>Qi</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Klein-Seetharaman</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2006</pubdate>
            <volume>63</volume>
            <fpage>490</fpage>
            <lpage>500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20865</pubid>
                  <pubid idtype="pmpid" link="fulltext">16450363</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Coexpression analysis of human genes across many microarray data sets.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Sajdak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pavlidis</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1085</fpage>
            <lpage>1094</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.1910904</pubid>
                  <pubid idtype="pmcid">419787</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>ArrayProspector: a web resource of functional associations inferred from microarray expression data.</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Lagarde</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>von surMering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W445</fpage>
            <lpage>448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh407</pubid>
                  <pubid idtype="pmcid">441545</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215427</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Assessment and integration of publicly available SAGE, cDNA microarray, and oligonucleotide microarray expression data for global coexpression analyses.</p>
            </title>
            <aug>
               <au>
                  <snm>Griffith</snm>
                  <fnm>OL</fnm>
               </au>
               <au>
                  <snm>Pleasance</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Oveisi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ester</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Siddiqui</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2005</pubdate>
            <volume>86</volume>
            <fpage>476</fpage>
            <lpage>488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2005.06.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">16098712</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>InterPro, progress and status in 2005.</p>
            </title>
            <aug>
               <au>
                  <snm>Mulder</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Binns</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bradley</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bucher</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Courcelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Das</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Krestyaninova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lonsdale</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Madera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maslen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>McDowall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D201</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki106</pubid>
                  <pubid idtype="pmcid">540060</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608177</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>An improved, bias-reduced probabilistic functional gene network of baker's yeast, <it>Saccharomyces cerevisiae </it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e988</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0000988</pubid>
                  <pubid idtype="pmcid">1991590</pubid>
                  <pubid idtype="pmpid" link="fulltext">17912365</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>The Gene Ontology (GO) database and informatics resource.</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ireland</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lomax</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Foulger</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eilbeck</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Richter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Drabkin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Balakrishnan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Christie</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Costanzo</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fisk</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Hirschman</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Nash</snm>
                  <fnm>RS</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D258</fpage>
            <lpage>261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh066</pubid>
                  <pubid idtype="pmcid">308770</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681407</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
