<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-10-r219</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Method</dochead>
      <bibl>
         <title>
            <p>Harnessing naturally randomized transcription to infer regulatory relationships among genes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Chen</snm>
               <mi>S</mi>
               <fnm>Lin</fnm>
               <insr iid="I1"/>
               <email>chenlin@u.washington.edu</email>
            </au>
            <au id="A2">
               <snm>Emmert-Streib</snm>
               <fnm>Frank</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>fes99@u.washington.edu</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Storey</snm>
               <mi>D</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>jstorey@u.washington.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biostatistics, University of Washington, 1705 NE Pacific St, Seattle, WA 98195, USA.</p>
            </ins>
            <ins id="I2">
               <p>Department of Genome Sciences, University of Washington, 1705 NE Pacific St, Seattle, WA 98195, USA.</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>10</issue>
         <fpage>R219</fpage>
         <url>http://genomebiology.com/2007/8/10/R219</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17931418</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-10-r219</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>24</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>11</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>11</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Chen et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Inferring regulatory relationships among genes</p>
      </shorttitle>
      <shortabs>
         <p>An approach is developed that utilizes randomized genotypes to rigorously infer causal regulatory relationships among genes at the transcriptional level. The approach is applied to an experiment in yeast, yielding new insights into the topology of the yeast transcriptional regulatory network.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>We develop an approach utilizing randomized genotypes to rigorously infer causal regulatory relationships among genes at the transcriptional level, based on experiments in which genotyping and expression profiling are performed. This approach can be used to build transcriptional regulatory networks and to identify putative regulators of genes. We apply the method to an experiment in yeast, in which genes known to be in the same processes and functions are recovered in the resulting transcriptional regulatory network.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>It is now possible to measure DNA variation, RNA expression levels, and protein expression levels from thousands of genes in a given biologic sample <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Of great interest is inferring the 'wiring diagram', or the way in which many genes regulate one another and interact, from these sources of high-throughput data <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. However, this goal is complicated by the fact that RNA levels, protein levels, phenotypes, and environmental conditions may all affect one another <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, creating intractable sources of confounding. This has made it difficult to distinguish correlation from causal regulatory effects, limiting the success and applicability of constructed genome-wide regulatory networks <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>A number of integrative genomics studies have recently been conducted, in which large-scale genotyping and expression profiling is performed on individuals with randomized genetic backgrounds <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. Typically, linkage analyses have been performed on these studies in order to detect quantitative trait loci (QTLs) underlying gene 'expression traits' <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Although these studies have shown that expression variation is highly heritable, this approach does not typically directly identify specific genes or mechanisms that are responsible for expression variation without additional experimentation. Instead of employing this experimental approach to genetically dissect expression traits, we have developed a method called 'Trigger' (Transcriptional Regulation Inference from Genetics of Gene ExpRession) for inferring causal regulatory relationships among all possible pairs of genes.</p>
         <p>Randomization is the 'gold standard' for inferring causality of one variable on another <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. This concept has successfully been applied in clinical trials to establish the causal effects of drugs on disease. Because DNA variation has a substantial and widespread effect on transcriptional variation <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, we show that randomizing DNA content provides a natural mechanism for randomizing RNA levels. By utilizing this randomization, we present a new theoretical result defining three testable conditions that, when true, imply that a directed causal relationship exists among a pair of transcripts, where this causal relationship is robust against confounding caused by hidden variables. Using this theoretical result, we develop a method to test directly for this causal relationship, which allows us to estimate the probability that the specific causal model is true. These probabilities can in turn be used to build meaningful regulatory networks, in which the certainty of any such network is easily quantified by the false discovery rate (FDR) <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. In addition, the proposed approach explicitly identifies genes whose expression levels are responsible for variation of expression traits, overcoming a limitation of identifying only their QTLs.</p>
         <p>The concept of causal modeling has previously been considered within the context of genetic variation <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Several of these existing approaches search for the best-fitting causal model among genes or traits linked to a common locus. The consideration of causality in those papers is justified by the joint linkage of traits to a common locus, thereby reducing the total number of causal models <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, but it is not justified by a randomization process. Whereas it has clearly been recognized that changes in linkage status when conditioning on traits in a specific order is strong evidence for a causal relationship among the traits <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B32">32</abbr></abbrgrp>, Trigger directly uses the 'Mendelian randomized' genotypes to test rigorously for causality. This allows for a strict definition of causality that can be directly tested. The proposed method has the notable feature that the test for causality is robust against false positives due to common hidden causal variables. The proposed method also provides a single significance measure for each potential causal relationship in such a way that they can be individually interpreted as well as combined to estimate an overall FDR of the network. Trigger avoids the ambiguities caused by selecting among several models by an often subjectively chosen model selection criterion.</p>
         <p>We apply the proposed method to an experiment on yeast <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr></abbrgrp>, in which two distinct strains were crossed to produce 112 independent recombinant segregant lines, and genome-wide genotyping and expression profiling were performed on each segregant line. Applying Trigger to this study yields genome-wide regulatory probabilities that can be used to construct networks with any desired FDR. We identify regulatory relationships among genes that recapitulate previous findings, provide new predictions, and yield new information about the topology of the yeast transcriptional regulatory network.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>For an individual organism, DNA has the useful feature that it is usually a static variable, meaning that it is fixed and will not change with changing RNA levels, protein levels, phenotypes, or environmental conditions. By performing designed crosses of genetically distinct inbred or isogenic lines, one can randomize the genotypes of an organism from two or more genetic backgrounds, thereby producing independent realizations of DNA content from offspring to offspring <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. At the same time, one may measure gene expression, or any other molecular or clinical phenotype of interest, on each resulting recombinant line.</p>
         <p>We have developed Trigger as an approach for inferring regulatory relationships among all pairs of genes at the genome-wide level, based on these genetic cross experiments in which high-throughput expression profiling is also performed (Figure <figr fid="F1">1</figr>). However, one may also incorporate any other molecular or clinical phenotype of interest into the algorithm.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>An illustration of the properties required to infer the causal relationship <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub></p>
            </caption>
            <text>
               <p>An illustration of the properties required to infer the causal relationship <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>. <b>(a) </b>All gene expression traits are normalized to follow a <it>N</it>(0,1) distribution. By the causality equivalence theorem, in order to conclude that <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>, it must be the case that <b>(b) </b><it>T</it><sub><it>i </it></sub>is linked to <it>L</it>, where the mean expression among segregants with allele at <it>L </it>inherited from the BY parental strain is different from the mean expression among segregants with allele at <it>L </it>inherited from the RM parental strain; <b>(c) </b><it>T</it><sub><it>j </it></sub>is also linked to <it>L</it>; and <b>(d) </b>the expression of <it>T</it><sub><it>j </it></sub>given <it>T</it><sub><it>i </it></sub>is no longer linked to <it>L</it>. Trigger is an algorithm to estimate the probability that all three conditions (shown in panels b to d) hold simultaneously.</p>
            </text>
            <graphic file="gb-2007-8-10-r219-1"/>
         </fig>
         <sec>
            <st>
               <p>Probabilities of transcriptional regulation</p>
            </st>
            <p>Suppose that there are <it>m </it>genes with transcription levels measured on recombinant offspring from an experimental genetic cross. (In the yeast experiment we consider, <it>m </it>= 6,216.) The goal is to use the data from such an experiment to estimate the probability that the transcription of gene <it>i </it>has a causal regulatory effect on the transcription of any other gene <it>j</it>, which we denote by <it>P</it><sub><it>ij</it></sub>, where 'causal regulatory effect' means that a change in the transcription level of gene <it>i </it>results in a predictable change in the level of gene <it>j</it>. This is not necessarily through a direct molecular interaction; however, if we directly modulate the transcriptional level of gene <it>i</it>, then this should result in a corresponding change in the transcriptional level of gene <it>j</it>. Trigger provides a conservative estimate of these probabilities, denoted by <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> for <it>i </it>= 1, ..., <it>m </it>and <it>j </it>= 1, ..., <it>m</it>.</p>
            <p>These estimated regulatory probabilities can be used to build a regulatory network based on a directed graph. The probability that a directed edge exists from gene <it>i </it>to gene <it>j </it>in the network is estimated by <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula>. One can directly threshold the entries, essentially setting those not meeting the threshold equal to zero. For example, one could remove all potential edges with <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &lt; 90% while including those with <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; 90%. Therefore, a directed edge would be drawn from gene <it>i </it>to gene <it>j </it>if and only if <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; 90% (Figure <figr fid="F2">2</figr>). The resulting network has an easily quantified and interpretable FDR, and each directed edge has an estimated probability that it is true (see Materials and methods [below] and Additional data file 1).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>A transcriptional regulatory network drawn from a Trigger probability threshold of 90%</p>
               </caption>
               <text>
                  <p>A transcriptional regulatory network drawn from a Trigger probability threshold of 90%. The network consists of 4,394 genes, 2,145 causal relationships, and 127 causal genes. Genes are represented by orange circles and causal relationships are represented by directed edges with black arrows.</p>
               </text>
               <graphic file="gb-2007-8-10-r219-2"/>
            </fig>
            <p>In addition to constructing a regulatory network from these estimated probabilities, each gene <it>i </it>can be examined as a putative regulator, and hence a quantitative trait gene or 'quantitative trait transcript' <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Specifically, the probability that a specific gene <it>i </it>is a regulator for each other gene <it>j </it>is estimated as <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula>. A threshold can be applied to these estimated probabilities to obtain the FDR of the significant genes (see Materials and methods [below] and Additional data file 1). This particular application of Trigger allows one to move beyond identifying QTL of expression traits to identifying a specific underlying causal quantitative trait transcript.</p>
         </sec>
         <sec>
            <st>
               <p>Causal models of transcriptional regulation</p>
            </st>
            <p>Trigger is based on a rigorous mathematical framework that we developed for utilizing randomized genetic backgrounds and genome-wide expression in order to test rigorously for causality among transcription levels. The approach starts with a pair of transcripts and a locus to which both are linked. Let <it>L </it>be the locus, <it>T</it><sub><it>i </it></sub>transcript <it>i</it>, and <it>T</it><sub><it>j </it></sub>transcript <it>j</it>.</p>
            <p>The goal is to identify triplets (<it>L</it>, <it>T</it><sub><it>i</it></sub>, <it>T</it><sub><it>j</it></sub>) such that <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>, where the arrow '&#8594;' means causation. The definition of 'causal' has been a topic of much interest <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Although definitions of causality differ slightly among the many articles published on this topic, in essence <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>means that the ideal manipulation of <it>T</it><sub><it>i </it></sub>will change the distribution of <it>T</it><sub><it>j</it></sub>, whereas the ideal manipulation of <it>T</it><sub><it>j </it></sub>will not disturb the distribution of <it>T</it><sub><it>i</it></sub>. 'Ideal manipulation' of a variable means to change the variable in a manner that leaves every other variable unchanged, at the moment when the manipulation occurs <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. This framework also applies to causality among random variables.</p>
            <p>With the genetic cross experimental design, the genotype at a fixed locus <it>L </it>is a random variable, whose random outcome occurs before and independently from the subsequently measured expression values. For example, in the yeast experiment analyzed below, two haploid parental strains (BY and RM) were crossed to produce 112 recombinant haploid segregant strains. Because of the random segregation of chromosomes during meiosis, the inheritance of <it>L </it>= <it>BY </it>or <it>L </it>= <it>RM </it>is random. Therefore, when measuring the alleles at a single locus <it>L </it>across 112 segregants, we observe 112 genotypes being generated from some probability distribution. (See Materials and methods [below] for explicit details on the assumptions we make about the randomized genotypes among the loci.)</p>
            <p>Because the randomization of <it>L </it>takes place before the expression levels of <it>T</it><sub><it>i </it></sub>are measured, this implies that if <it>T</it><sub><it>i </it></sub>is linked to locus <it>L </it>then <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>. This property is due to the well established principles in statistics showing that an association between two variables when one of them is properly randomized implies causation <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Additionally, the randomization of <it>L </it>is carried through to the variation in <it>T</it><sub><it>i </it></sub>whenever <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>. If <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>, then segregants with <it>L </it>= BY have a different mean expression for <it>T</it><sub><it>i </it></sub>than segregants with <it>L </it>= RM. Therefore, the randomization of <it>L </it>provides a randomization of the mean level of expression for <it>T</it><sub><it>i</it></sub>. Figure <figr fid="F1">1a</figr> shows the transcriptional levels for a given gene, and Figure <figr fid="F1">1b</figr> shows a case in which it is linked to some locus <it>L</it>. Because the inherited allele <it>L </it>= BY or <it>L </it>= RM is random for each segregant, the mean level of expression for <it>T</it><sub><it>i </it></sub>is random when <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>.</p>
            <p>Importantly, some of the variation in <it>T</it><sub><it>i </it></sub>will not be explained by <it>L</it>, specifically the random fluctuations of the transcription levels within each genotype (Figure <figr fid="F1">1b</figr>). Therefore, it is not possible to conclude that <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>whenever <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>are significantly associated to <it>L</it>. This follows because there could be a common hidden variable affecting both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. (Note that if <it>T</it><sub><it>i </it></sub>were perfectly randomized, then there would be no causal hidden variable for <it>T</it><sub><it>i</it></sub>, which demonstrates the power of randomization.) Suppose that a hidden variable <it>H </it>is such that <it>H </it>&#8594; <it>T</it><sub><it>i </it></sub>and <it>H </it>&#8594; <it>T</it><sub><it>j</it></sub>. Because of this common hidden causal variable, any association between <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>would not allow us to conclude that <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>even though <it>T</it><sub><it>i </it></sub>has been partially randomized. In other words, the partial randomization of <it>T</it><sub><it>i </it></sub>caused by <it>L </it>is now confounded by the effect of <it>H</it>. The common causal hidden variable <it>H </it>does not prevent <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>from occurring; rather, we just are unable to draw any conclusion when this is the case, unless we are willing to model common hidden causal variables. Modeling common hidden causal variables has been shown to be particularly challenging in this high-dimensional setting <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, and doing so would require much additional work.</p>
            <p>If there is a common causal hidden variable <it>H </it>that affects both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>, then the Trigger method is designed to not make any conclusions about causality. However, if there is not a common hidden causal variable, then it is now possible, in a straightforward manner, to determine whether <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>. The following new theorem identifies three conditions that are equivalent to the case in which both <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>and no common causal hidden variable affects both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. (See Materials and methods [below] for a mathematical proof.)</p>
            <sec>
               <st>
                  <p>Causality equivalence theorem</p>
               </st>
               <p>The causal relationship <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>exists and there are no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>if and only if the following three conditions hold: <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>, <it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>, and <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>.</p>
               <p>This theorem is used in the following manner. If <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>, <it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>, and <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, then we may conclude that <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>exists and there are no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. The fact that 'there are no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>' is not an assumption. Rather, it is a verified fact that follows when the three properties are true, as we show in the proof given in Materials and methods (below). We would prefer to detect all cases where <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>; however, as explained above, it is not yet possible to do so in the presence of common causal hidden variables.</p>
               <p>Figure <figr fid="F1">1</figr> provides a graphical representation of the three properties that must be satisfied. The last condition, <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, denotes that <it>T</it><sub><it>j </it></sub>conditioned on the information in <it>T</it><sub><it>i </it></sub>is independent from <it>L</it>. The first two conditions basically ensure that both transcripts are subjected to a common randomization. The third condition is the key one for inferring causality based on these randomizations. Basically, what the third condition determines is whether the causal effect from <it>L </it>on <it>T</it><sub><it>j </it></sub>can entirely be captured by <it>T</it><sub><it>i</it></sub>. If so, then <it>T</it><sub><it>i </it></sub>is indeed a causal factor for variation in <it>T</it><sub><it>j</it></sub>, with no hidden variables.</p>
               <p>For computational and statistical efficiency, we limit <it>L </it>to be the locus of gene <it>i </it>(see Additional data file 1), which we denote as <it>L</it><sub><it>i</it></sub>. We call <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>the primary <it>cis </it>linkage and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>for any other gene <it>j </it>the 'secondary linkage' here. Because Pr(<it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>) &#8805; Pr(<it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>), we can obtain a conservative estimate of <it>P</it><sub><it>ij </it></sub>by estimating Pr(<it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>). From the causality equivalence theorem it follows that:</p>
               <p>
                  <display-formula>Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>)</display-formula>
               </p>
               <p>
                  <display-formula>= Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>and <it>L</it><sub><it>i </it></sub>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>)</display-formula>
               </p>
               <p>
                  <display-formula>= Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>) &#215; Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>) &#215; Pr(<it>L</it><sub><it>i </it></sub>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>)</display-formula>
               </p>
               <p>The Trigger algorithm conservatively estimates <it>P</it><sub><it>ij </it></sub>by estimating each probability in the above product from left to right and taking their product. (See Materials and methods [below] and Additional data file 1.)</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Application to yeast</p>
            </st>
            <p>We applied the Trigger algorithm to the yeast experiment (Materials and methods [below]) and found several interesting characteristics of the resulting regulatory probability matrix. Table <tblr tid="T1">1</tblr> lists the overall significance results with different probability thresholds and Additional data file 2 contains the entire regulatory probability matrix. For example, at a probability threshold of 90%, we found 4,394 significant regulatory relationships among 2,145 genes where 127 are causal. Figure <figr fid="F2">2</figr> shows a regulatory network drawn from the Trigger results at this threshold, where a directed edge is drawn from gene <it>i </it>to gene <it>j </it>if and only if <it>P</it><sub><it>ij </it></sub>&#8805; 90%. It can be seen from Figure <figr fid="F2">2</figr> that we have constructed a highly interconnected network where there is clearly a 'hub structure'.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Overall significance of the regulatory probability matrix at different probability thresholds</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Probability threshold</p>
                     </c>
                     <c ca="left">
                        <p>Number of putative regulators</p>
                     </c>
                     <c ca="left">
                        <p>Total number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Number of edges</p>
                     </c>
                     <c ca="left">
                        <p>FDR (%)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>0.95</p>
                     </c>
                     <c ca="left">
                        <p>76</p>
                     </c>
                     <c ca="left">
                        <p>1,075</p>
                     </c>
                     <c ca="left">
                        <p>1,499</p>
                     </c>
                     <c ca="left">
                        <p>2.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>0.90</p>
                     </c>
                     <c ca="left">
                        <p>127</p>
                     </c>
                     <c ca="left">
                        <p>2,145</p>
                     </c>
                     <c ca="left">
                        <p>4,394</p>
                     </c>
                     <c ca="left">
                        <p>6.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>0.85</p>
                     </c>
                     <c ca="left">
                        <p>194</p>
                     </c>
                     <c ca="left">
                        <p>3,150</p>
                     </c>
                     <c ca="left">
                        <p>8,826</p>
                     </c>
                     <c ca="left">
                        <p>9.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>0.80</p>
                     </c>
                     <c ca="left">
                        <p>255</p>
                     </c>
                     <c ca="left">
                        <p>4,044</p>
                     </c>
                     <c ca="left">
                        <p>15,448</p>
                     </c>
                     <c ca="left">
                        <p>12.9</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>FDR, false discovery rate.</p>
               </tblfn>
            </tbl>
            <p>We examined in detail four genes as putative regulators: <it>CNS1 </it>on chromosome 2, <it>ILV6 </it>on chromosome 3, <it>SAL1 </it>on chromosome 14, and <it>NAM9 </it>on chromosome 14. Each was highly significant for <it>cis </it>linkage, and the locus of each putative regulator had many significant secondary linking genes. At a 90% posterior probability cut-off (FDR = 6%), 144, 51 and 36 genes were significant for being regulated by <it>CNS1</it>, <it>ILV6</it>, and <it>SAL1</it>, respectively. At an 80% posterior probability cut-off (FDR = 11%), 14 genes were significant for being regulated by <it>NAM9</it>. The significant genes, posterior probabilities, and other relevant information for each putative regulator can be found in Additional data file 3. Note that each of these putative regulators is also a significant quantitative trait gene (or quantitative trait transcript) for each expression trait that it significantly regulates. Figure <figr fid="F3">3</figr> shows heat maps of the four putative regulators and their corresponding significantly regulated genes. It can be seen that each significant gene is both linked to the locus of the putative regulator and has correlated expression with the regulator within each genotype, both of which are necessary but not sufficient for causality.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Heat-map display and hierarchical clustering of genes significantly regulated by the four putative regulators considered</p>
               </caption>
               <text>
                  <p>Heat-map display and hierarchical clustering of genes significantly regulated by the four putative regulators considered. The top row is the expression of the putative regulator (red indicates high expression, and blue low expression). All remaining rows are the hierarchically clustered significant genes. Each column represents a single segregant, where the segregants have been separated by genotype at the putative regulator's locus (black line). The columns have been ordered according to increasing expression of the putative regulator within each genotype. <b>(a) </b><it>CNS1 </it>and its 144 significant genes. <b>(b) </b><it>ILV6 </it>and its 51 significant genes. <b>(c) </b><it>SAL1 </it>and its 36 significant genes. <b>(d) </b><it>NAM9 </it>and its 14 significant genes.</p>
               </text>
               <graphic file="gb-2007-8-10-r219-3"/>
            </fig>
            <p>In order to determine whether the genes that are significant for each putative regulator show a coherent functional relationship, we employed the Gene Ontology (GO) database <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. For each putative regulator, we queried the database among all significant genes and the regulator itself. This approach takes independently performed experiments and synthesizes the information obtained from those. The GO searches allowed us to test specifically whether common processes, functions, and components are present among each set of genes. Indeed, we found an abundance of significance for enriched GO terms for each set of genes corresponding to a putative regulator.</p>
            <p>Figure <figr fid="F4">4</figr> shows the results of GO analysis for the putative regulator <it>NAM9</it>, which is a mitochondrial ribosomal component of the small subunit and inviable under deletion <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. It is a structural constituent of ribosome, involved in translation and mitochondrial small ribosome subunit <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. For the 14 genes significant at an 80% posterior probability threshold (FDR = 11%), 13 are known to be in the same or similar pathway as <it>NAM9</it>. The other significant gene is heretofore uncharacterized. Translation, structural constituent of ribosome, and mitochondrial small ribosome subunit are all highly significant terms in the GO tree.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>GO trees for <it>NAM9 </it>and 14 significantly regulated genes at 80% posterior probability threshold (FDR 11%)</p>
               </caption>
               <text>
                  <p>GO trees for <it>NAM9 </it>and 14 significantly regulated genes at 80% posterior probability threshold (FDR 11%). The colors of the boxes indicate the significance of the various Gene Ontology (GO) terms. <it>NAM9 </it>encodes a mitochondrial ribosomal component of the small subunit, involved in translation and mitochondrial small ribosome subunit [39-41]. Yeast is unviable under <it>NAM9 </it>deletion [38]. <it>NAM9 </it>is a structural constituent of ribosome, and it can be seen that seven out of the 14 genes, together with <it>NAM9</it>, are involved in translation. Five of them are also a ribosomal structural constituent and encode mitochondrial ribosomal subunits. Among the 14 putatively regulated genes, all except one uncharacterized gene are associated with mitochondria. FDR, false discovery rate.</p>
               </text>
               <graphic file="gb-2007-8-10-r219-4"/>
            </fig>
            <p>Additional data file 1 (Figure S1) shows the results for the putative regulator <it>CNS1</it>, which is an essential tetratricopeptide repeat (TPR)-containing co-chaperone, deletion of which is inviable <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. It binds both heat shock protein 82p (Hsp82p) and Ssa1p (Hsp70), and stimulates the ATPase activity of <it>SSA1</it>. <it>CNS1 </it>is involved in the protein binding process, and its cellular component is associated with cytoplasm <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Of the 144 genes significant at the 90% joint posterior probability cut-off (FDR = 6%), a substantial subset is involved in transferase activity and ribosome biogenesis and assembly, which coincides with the key role played by <it>CNS1 </it>in yeast. Many of the 144 genes were also found to be in the same pathway as <it>CNS1</it>; for example, <it>TRM8 </it>and <it>CNS1 </it>are both involved in a pathway for protein binding <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>.</p>
            <p>Additional data file 1 (Figure S2) shows the significant GO results for <it>ILV6 </it>and its 51 genes under statistically significant regulation. <it>ILV6 </it>is a regulatory subunit of acetolactate synthase, which catalyzes the first step of branched-chain amino acid biosynthesis <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. Amino acid biosynthesis and its associated pathways are significantly enriched GO terms with <it>P </it>values below 10<sup>-10</sup>. Cyclohydrolase activity and lyase activity are some other significant pathways identified by GO analysis.</p>
            <p>The putative regulator <it>SAL1 </it>is a probable transporter and a member of the calcium-binding subfamily of the mitochondrial carrier family, with two EF-hand motifs. It works in transporter activity and calcium ion binding <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, with its corresponding cellular component involved in the mitochondrial inner membrane <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. From the GO analysis (Additional data file 1 [Figure S3]), we can see that a number of the 36 genes significantly regulated by <it>SAL1 </it>are associated with the mitochondrian and membrane GO terms. Six of the 36 significantly regulated genes are involved in mitochondrial inner membrane with high statistical significance (<it>P </it>&lt; 10<sup>-8</sup>), a trend that is consistent with previous findings <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>.</p>
            <p>It should be noted that in the case of <it>SAL1 </it>no polymorphism exists in the immediate 500 base regions upstream or downstream of the <it>SAL1 </it>open reading frame. The linkage peaks occur approximately 13 kilobases and 21 kilobases on either side. This illustrates that linkage does not have to be due to an unequivocally <it>cis</it>-acting regulatory polymorphism in order for Trigger to work. On the contrary, there must simply be some locus to which both expression traits <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>are linked. We justified limiting the locus <it>L </it>to be in the 50 kilobases region of <it>T</it><sub><it>i </it></sub>based on computational and statistical increases in efficiency (Additional data file 1).</p>
            <p>In addition to these four well characterized putative regulators, we noticed that expression levels of a number of genes with relatively unknown function (for instance, <it>YSW1</it>, <it>PHM7</it>, and so on), were predicted to regulate a number of genes, with significant GO terms appearing for each set. Therefore, our results can potentially be used to predict properties of relatively unknown genes as well. Furthermore, several transcription factors significantly regulated a number of genes, including <it>HAP1 </it><abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp> and <it>RAD16 </it><abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>. In previous work it was found that mutations in <it>GPA1 </it>and <it>AMN1 </it>lead to expression changes in genes whose expression exhibits linkage to each respective locus <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Missense mutations (leading to amino acid changes in the protein product) were identified in both <it>GPA1 </it>and <it>AMN1 </it>that appear to be the cause of the expression changes in the linking genes. In work to be reported in the future we examine the <it>GPA1 </it>and <it>AMN1 </it>cases in detail, showing that there appears to be common causal hidden variables involved. The Trigger approach is extended to take into account these common causal hidden variables, allowing us to recapitulate the previous findings regarding <it>GPA1 </it>and <it>AMN1</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison with other approaches</p>
            </st>
            <sec>
               <st>
                  <p>Mendelian randomization</p>
               </st>
               <p>Recently, 'Mendelian randomization' was proposed as a technique in genetic epidemiology to study the environmental determinants of disease <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Trigger builds upon this concept in the sense that it also employs the randomization of genotypes as a starting point to infer causality. Essentially, we have extended this idea by deriving precise conditions under which the causality of one trait on another can be confirmed and by providing a statistical technique for estimating the probability that one trait is causal for another, among potentially thousands of traits.</p>
            </sec>
            <sec>
               <st>
                  <p>Model selection approaches</p>
               </st>
               <p>The concepts of 'causality' and 'regulation' have been utilized in different ways in previous reports concerning the construction of biologic networks <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B32">32</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>. Among those using the more rigorous definition of causality <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B61">61</abbr></abbrgrp>, most published approaches have been to choose among the best fitting causal models by partial correlation or by model selection. The difference between our work and most previous work is that we explicitly test for and quantify each causal relationship of interest by using the randomization of genetic backgrounds built into the genetic cross experimental system. Furthermore, we assess the significance of each causal relationship by estimating the probability that the causal relationship is true, so that it can be considered in a straightforward manner with millions of other potential causal relationships.</p>
               <p>We have made some simple comparisons between Trigger and the model selection and correlation based approaches (Figure <figr fid="F5">5</figr>). In addition to Trigger showing different significance rankings relative to these approaches, it offers an increase in specificity. Most of the papers employing model selection have used the 'Akaike information criterion' (AIC) or derivatives thereof <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Among the about 38 million triplets (<it>L</it><sub><it>i</it></sub>, <it>T</it><sub><it>i</it></sub>, <it>T</it><sub><it>j</it></sub>), the AIC model selection method <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> classifies about 15.4 million as causal, whereas Trigger identifies about 4,400 causal relationships with probability exceeding 90%. For the putative regulator <it>CNS1</it>, about 2,800 genes are classified as having a causal relationship with <it>CNS1 </it>by model selection, as opposed to the 144 Trigger found to be significant with probability exceeding 90%. The advantages that Trigger has over AIC and other model selection criteria are as follows: there is no generally applicable method to obtain an interpretable measure of significance based on these criteria (which is especially problematic when considering thousands of traits); and these approaches force one to model directly all possible hidden variables, making typically unverifiable assumptions about their underlying model <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>A comparison of Trigger with correlation and model selection for inferring existence causal relationship with <it>CNS1</it></p>
                  </caption>
                  <text>
                     <p>A comparison of Trigger with correlation and model selection for inferring existence causal relationship with <it>CNS1</it>. <b>(a) </b>Significance ranking according to Trigger versus the ranking according to correlation. Although this plot is not calculated conditional on linkage to the <it>CNS1 </it>locus, the plot conditional on linkage yields an equivalent qualitative conclusion. <b>(b) </b>Significance ranking according to Trigger versus the ranking according to model selection. For <it>CNS1 </it>and each gene, AIC was employed to selection among models capturing causality (M1), an inconclusive relationship (M2), linkage only (M3), and independence (M4). The x-axis is broken up into models M1 to M4; within each model type the genes were ranked according to their AIC score. For both correlation and model selection, it can be seen that there is not a strong relationship with Trigger in terms of the ranking, although a ranking in both is clearly necessary for a high Trigger probability. Note that many Trigger probabilities are zero, so the ranking does not extend all of the way to 6,216.</p>
                  </text>
                  <graphic file="gb-2007-8-10-r219-5"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Extensions to other data types</p>
            </st>
            <p>We have presented Trigger within the context of inferring regulatory relationships based on gene expression data from organisms with randomized genetic backgrounds. However, this method may actually be applied to a much broader class of data types. Because the estimation is done in a nonparametric and scale-free manner (Materials and methods [below] and Additional data file 1), it is possible to combine any combination of expression, proteomic, metabolomic, and phenotypic data as the variables among which causal relationships are inferred. These may be considered separately or simultaneously, allowing one to discover regulatory relationships, say, among protein levels and transcriptions levels. The general requirement is that one must acquire organisms with random genetic backgrounds that are essentially stable as the expression levels and other potential traits are measured. The computational approach and statistical principles underlying the method remain the same for all of these data types.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The Trigger algorithm allows one to infer transcriptional regulatory relationships among genes at the genome-wide level, based on experiments in which large-scale genotyping and expression profiling are performed among individuals with randomized genetic backgrounds. Moreover, the algorithm can be applied to any high-throughput phenotypic data in which genotypes or some other static regulatory mechanism has been randomized. Trigger works by identifying pairs of genes with expression levels both affected by a common randomized genotype and then testing for three key properties that we have mathematically demonstrated to be equivalent to a directed causal relationship among the pair of gene expression traits.</p>
         <p>We applied Trigger to an experiment in yeast in which 112 independent recombinant segregants were subjected to genome-wide expression monitoring. The Trigger algorithm produced a regulatory probability matrix from this experiment that has been made available (Additional data file 2). This matrix can be used to build networks by a variety of techniques in which the noise level of any resulting network is easily assessed by the FDR. Our analysis of the results indicates that the proposed algorithm produces rich and biologically coherent information, mainly through a GO analysis of four putative regulators (<it>CNS1</it>, <it>ILV6</it>, <it>SAL1</it>, and <it>NAM9</it>).</p>
         <p>Some caveats and limitations of the proposed approach are apparent. First, for any gene to be identified in a causal relationship, it must be linked to some locus. This is because the expression levels must be subjected to randomization based on the randomization of the genotypes. Therefore, this approach will not find all causal relationships. Second, a comprehensive genetic network requires additional measurements beyond transcriptional levels. Although it is straightforward to include all quantitative information in Trigger, such as transcription, protein, metabolite, and phenotype levels, it is not clear how to include important qualitative information, such as known protein interactions or transcription factor binding sites. The Trigger approach would have to be extended or combined with an existing approach to incorporate such data types.</p>
         <p>The approach we have proposed is an early step toward moving beyond correlation and model selection based analyses of high-throughput molecular profiling data. Trigger offers a rigorous approach to inferring causality, based on the highly successful concept of randomized experiments, which has played a key role in science and medicine since its inception. This work also contributes to a better understanding of the ways in which multiple high-throughput data types can be combined to produce more informative estimates of the highly complex molecular networks underlying organisms.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Expression measurements and genotyping</p>
            </st>
            <p>The expression and genotype data were recently reported elsewhere <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr></abbrgrp>. In that work, 112 segregants (one from each tetrad) were grown from a cross involving parental strains BY4716 (isogenic to the laboratory strain S288C) and the wild isolate RM11-1a. RNA was isolated and cDNA was hybridized to microarrays in the presence of the same BY reference material. Each array assayed 6216 yeast open reading frames. GeneChip Yeast Genome S98 microarrays were purchased from Affymetrix (Santa Clara, CA, USA). Genotyping was performed using GeneChip Yeast Genome S98 microarrays (Affymetrix) on all 112 F<sub>1 </sub>segregants. The resulting genetic map of 3,312 markers covered more than 99% of the genome.</p>
         </sec>
         <sec>
            <st>
               <p>Assumptions regarding random genotypes</p>
            </st>
            <p>We simply point out here that the main assumption regarding random genotypes is that the <it>L</it><sub><it>i </it></sub>are random variables occurring before and independently from the subsequently measured expression values. We also assume that the alleles inherited by different individuals at a fixed locus occurs independently; in other words, we assume that the crosses have been carried out independently. (If related segregants or offspring are collected, then Trigger can be adjusted to account for this.) However, we do not assume that the inheritance at several loci on a given chromosome occurs independently, and we make no other assumptions about independence of inheritance among loci. Segregation distortion, selection, and other traditionally problematic issues arising when performing genetic crosses for the purpose of genetic mapping do not invalidate Trigger.</p>
            <p>As in all genetic crosses, the more independent the inheritance of the loci is, the more information there is in the experiment. For example, suppose that loci <it>L</it><sub><it>i </it></sub>and <it>L</it><sub><it>k </it></sub>are dependent (for instance, they are located on the same chromosome, or their segregation is dependent because of selection). Suppose also that L<sub>i </sub>&#8594; T<sub>i </sub>&#8594; T<sub>j </sub>and L<sub>k </sub>&#8594; T<sub>j</sub>, but it is not the case that <it>L</it><sub><it>k </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>. Because <it>L</it><sub><it>i </it></sub>and <it>L</it><sub><it>k </it></sub>are dependent, it will not be the case that <it>L</it><sub><it>i </it></sub>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, as not all linkage information for <it>T</it><sub><it>j </it></sub>is captured by <it>T</it><sub><it>i</it></sub>. Specifically, <it>L</it><sub><it>i </it></sub>contains some information about <it>L</it><sub><it>k </it></sub>because of their dependence, so <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>is not independent from <it>L</it><sub><it>i</it></sub>. This is an example of how dependence of inheritance of different loci can reduce the power of Trigger. However, Trigger does not produce false positives because of this, so it is robust to linkage among loci on the same chromosome or other forms of dependence among loci.</p>
         </sec>
         <sec>
            <st>
               <p>Proof of causality equivalence theorem</p>
            </st>
            <p>The proof of the theorem follows from well-established theory in graphical and causal modeling <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B61">61</abbr><abbr bid="B63">63</abbr></abbrgrp>. Several basic assumptions are typically made in causal modeling to avoid nonsensical situations. The 'causal Markov assumption' states that in a causal model, each variable is independent of all of its non-descendants given information about all of its direct causes. The 'faithfulness assumption' states that any conditional independence relationships in the population exist in the presence of the causal Markov assumption. Under the faithfulness assumption, conditional independence of two variables implies there is no direct edge between the two. Our proof also relies on the known result that if a hidden variable is causal for both <it>X </it>and <it>Y</it>, then the directed graph associated with <it>X </it>and <it>Y </it>can be represented by <it>X </it>&#8594; <it>Y </it><abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
            <p>We first show that if <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>with no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>, then <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>, <it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>, and <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>. Under these assumptions, the first two properties (<it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>) are trivially true. Because there are no hidden variables involved, <it>T</it><sub><it>i </it></sub>is the only direct cause of <it>T</it><sub><it>j</it></sub>, and <it>L </it>is a non-descendant of <it>T</it><sub><it>j</it></sub>, it follows by the causal Markov assumption that the third property (<it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>) holds.</p>
            <p>We now show the more important direction of this equivalence: if <it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>, <it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>, and <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, then <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>and there are no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. The third property (<it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>) implies that there is no direct edge between <it>L </it>and <it>T</it><sub><it>j </it></sub>by the faithfulness assumption.</p>
            <p>Let us first consider the case when there are no hidden variables causal for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>, so that the only variables involved in this causal graph are <it>L</it>, <it>T</it><sub><it>i</it></sub>, and T<sub><it>j</it></sub>. Because of the second property (<it>L </it>&#8594; <it>T</it><sub><it>j</it></sub>), and there is no <it>direct </it>edge between <it>L </it>and <it>T</it><sub><it>j</it></sub>, it must follow that there is a direct edge between <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. Otherwise, <it>T</it><sub><it>j </it></sub>is completely independent of <it>L</it>, which violates the second property. Thus, <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>- <it>T</it><sub><it>j</it></sub>, where an edge without arrowheads implies dependence. If any two variables are dependent, then one is a cause of the other or there must be a third variable causal for both <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Thus, either <it>T</it><sub><it>i </it></sub>is causal for <it>T</it><sub><it>j</it></sub>, or <it>T</it><sub><it>j </it></sub>is causal for <it>T</it><sub><it>i</it></sub>, or both cases are true. <it>L </it>cannot be the common direct cause for both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>, because no direct edge exists between <it>L </it>and <it>T</it><sub><it>j</it></sub>. If <it>L </it>is an indirect cause of <it>T</it><sub><it>j</it></sub>, then <it>T</it><sub><it>i </it></sub>as the only other variable in the graph must be a direct cause of <it>T</it><sub><it>j</it></sub>, implying that <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>. If <it>T</it><sub><it>j </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and the first property (<it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>) holds, then it cannot be the case that the third property (<it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>) holds. Thus, <it>T</it><sub><it>j </it></sub>is not causal for <it>T</it><sub><it>i </it></sub>but it is true that <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>, implying that <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>.</p>
            <p>Now consider the second case in which there might be causal hidden variables in the graph. Because <it>L </it>is an independently randomized, static variable, there cannot be any hidden variables causal for both <it>L </it>and <it>T</it><sub><it>i </it></sub>or both <it>L </it>and <it>T</it><sub><it>j</it></sub>. The only possible existence of hidden causal variable in this graph is one affecting both <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>. However, if there is a common hidden cause for <it>T</it><sub><it>i </it></sub>and <it>T</it><sub><it>j</it></sub>, then <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub><abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. If this is true, then <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>is dependent with <it>L</it>, contradicting the third property (<it>L T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>). Therefore, <it>L </it>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>with no hidden variables affecting either of the two.</p>
            <p>Note that it can be shown that the second and third properties (<it>L </it>&#8594; <it>T</it><sub><it>j </it></sub>and <it>L </it>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, respectively) imply the first property (<it>L </it>&#8594; <it>T</it><sub><it>i</it></sub>). However, we have designed Trigger to test for all three properties because conditioning on the first property increases the power to detect the state of the second and third properties.</p>
            <p>Text</p>
         </sec>
         <sec>
            <st>
               <p>Estimation of regulatory probabilities</p>
            </st>
            <p>The following method was developed to estimate the regulatory probabilities. Recall that by the causality equivalence theorem:</p>
            <p>
               <display-formula><it>P</it><sub><it>ij </it></sub>= Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>)</display-formula>
            </p>
            <p>
               <display-formula>= Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>) &#215; Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>) &#215; Pr(<it>L</it><sub><it>i </it></sub>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>)</display-formula>
            </p>
            <p>To compute the joint posterior probability, the probabilities on the right hand side of the equation are estimated from left to right in that respective order. The basic algorithm works as follows (with specific details following) (Note that further details about steps 1 to 6 can be found in Additional data file 1.)</p>
            <sec>
               <st>
                  <p>Step 1</p>
               </st>
               <p>Transform the expression data for each gene to follow a Normal distribution with mean 0 and variance 1.</p>
            </sec>
            <sec>
               <st>
                  <p>Step 2</p>
               </st>
               <p>For each transcript, <it>T</it><sub><it>i </it></sub>(<it>i </it>= 1, 2, ..., <it>m</it>), test the null hypothesis of no <it>cis </it>linkage to <it>L</it><sub><it>i </it></sub>versus the alternative hypothesis of <it>cis </it>linkage to <it>L</it><sub><it>i </it></sub>by performing a standard likelihood ratio test to obtain observed statistics <it>X</it><sub><it>i </it></sub>(<it>i </it>= 1, 2, ..., <it>m</it>). Permute the expression data <it>B </it>times and perform the test on the permuted data to obtain null statistics <inline-formula><m:math name="gb-2007-8-10-r219-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>X</m:mi><m:mi>i</m:mi><m:mrow><m:mn>0</m:mn><m:mi>b</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaamiwamaaDaaaleaacaWGPbaabaGaaGimaiaadkgaaaaaaa@333A@</m:annotation></m:semantics></m:math></inline-formula> (<it>b </it>= 1, 2, ..., <it>B</it>). This is equivalent to testing <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>.</p>
            </sec>
            <sec>
               <st>
                  <p>Step 3</p>
               </st>
               <p>For each pair (<it>L</it><sub><it>i</it></sub>, <it>T</it><sub><it>i</it></sub>) from step 2, carry out the following. For all other transcripts <it>T</it><sub><it>j </it></sub>(<it>j </it>&#8800; <it>i</it>), test the null hypothesis of no linkage to <it>L</it><sub><it>i </it></sub>versus the alternative hypothesis of linkage to <it>L</it><sub><it>i </it></sub>under the assumption that <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>. Similarly to above, apply a standard likelihood ratio test to obtain observed statistics <it>Y</it><sub><it>ij</it></sub>. Permute the expression data <it>B </it>times under the assumption that <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>, and perform the test on the permuted data to obtain null statistics <inline-formula><m:math name="gb-2007-8-10-r219-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mn>0</m:mn><m:mi>b</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaamywamaaDaaaleaacaWGPbGaamOAaaqaaiaaicdacaWGIbaaaaaa@342A@</m:annotation></m:semantics></m:math></inline-formula> (<it>b </it>= 1, 2, ..., <it>B</it>).</p>
            </sec>
            <sec>
               <st>
                  <p>Step 4</p>
               </st>
               <p>For each triplet (<it>L</it><sub><it>i</it></sub>, <it>T</it><sub><it>i</it></sub>, <it>T</it><sub><it>j</it></sub>), carry out the following. Estimate the conditional distribution of <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>, which is tractable under the Normal transformation. Test the null hypothesis of independence between <it>L</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>versus the alternative hypothesis of dependence between <it>L</it><sub><it>i </it></sub>and <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i</it></sub>. Again, apply a standard likelihood ratio test to obtain observed statistics <it>Z</it><sub><it>ij </it></sub>for this test. Permute the expression data <it>B </it>times under the assumption that <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>, and perform the test on the permuted data to obtain null statistics <inline-formula><m:math name="gb-2007-8-10-r219-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Z</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mn>0</m:mn><m:mi>b</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaamOwamaaDaaaleaacaWGPbGaamOAaaqaaiaaicdacaWGIbaaaaaa@342B@</m:annotation></m:semantics></m:math></inline-formula> (<it>b </it>= 1, 2, ..., <it>B</it>).</p>
            </sec>
            <sec>
               <st>
                  <p>Step 5</p>
               </st>
               <p>For each test from steps 2 to 4, the set of observed statistics and null statistics can be used to estimate the probability that the hypothesis of interest is true, based on previous methodology <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B26">26</abbr><abbr bid="B64">64</abbr></abbrgrp>. For example, the observed statistics <it>X</it><sub><it>i </it></sub>(<it>i </it>= 1, 2, ..., <it>m</it>) and null statistics <inline-formula><m:math name="gb-2007-8-10-r219-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>X</m:mi><m:mi>i</m:mi><m:mrow><m:mn>0</m:mn><m:mi>b</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaamiwamaaDaaaleaacaWGPbaabaGaaGimaiaadkgaaaaaaa@333A@</m:annotation></m:semantics></m:math></inline-formula> (<it>i </it>= 1, 2, ..., <it>m</it>; <it>b </it>= 1, 2, ..., <it>B</it>) from step 2 can be used to form an empirical Bayes estimate of Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>), which is equivalent to an estimate of the probability that the alternative hypothesis is true for each <it>i </it>= 1, 2, ..., <it>m</it>. The statistics from step 3 are used to estimate Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i</it></sub>), and the statistics from step 4 are used to estimate Pr(<it>L</it><sub><it>i </it></sub>&#8869; <it>T</it><sub><it>j </it></sub>| <it>T</it><sub><it>i </it></sub>| <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>and <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>).</p>
            </sec>
            <sec>
               <st>
                  <p>Step 6</p>
               </st>
               <p>Multiply the three estimated probabilities together to get an estimate of <it>P</it><sub><it>ij </it></sub>= Pr(<it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j</it></sub>), where:</p>
               <p>
                  <display-formula>
                     <m:math name="gb-2007-8-10-r219-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>P</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mover accent="true">
                                 <m:mtext>P</m:mtext>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mtext>r</m:mtext>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8594;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>&#215;</m:mo>
                              <m:mover accent="true">
                                 <m:mtext>P</m:mtext>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mtext>r</m:mtext>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8594;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>j</m:mi>
                              </m:msub>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8594;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>&#215;</m:mo>
                              <m:mover accent="true">
                                 <m:mtext>P</m:mtext>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mtext>r</m:mtext>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8869;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>j</m:mi>
                              </m:msub>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8594;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mtext>&#160;and&#160;</m:mtext>
                              <m:msub>
                                 <m:mi>L</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8594;</m:mo>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>j</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8GiVeY=Pipec8Eeeu0xXdbba9frFj0xb9Lqpepeea0xd9q8qiYRWxGi6xij=hbbc9s8aq0=yqpe0xbbG8A8frFve9Fve9Fj0dmeaabaqaciaacaGaaeqabaqabeGadaaakeaaceWGqbGbaKaadaWgaaWcbaGaamyAaiaadQgaaeqaaOGaeyypa0JabeiuayaajaGaaeOCaiaacIcacaWGmbWaaSbaaSqaaiaadMgaaeqaaOGaeyOKH4QaamivamaaBaaaleaacaWGPbaabeaakiaacMcacqGHxdaTceqGqbGbaKaacaqGYbGaaiikaiaadYeadaWgaaWcbaGaamyAaaqabaGccqGHsgIRcaWGubWaaSbaaSqaaiaadQgaaeqaaOGaaiiFaiaadYeadaWgaaWcbaGaamyAaaqabaGccqGHsgIRcaWGubWaaSbaaSqaaiaadMgaaeqaaOGaaiykaiabgEna0kqabcfagaqcaiaabkhacaGGOaGaamitamaaBaaaleaacaWGPbaabeaatqvzynuttLxBI9gBaeHbJ12C5fdmaGabaOGae8xPI8JaamivamaaBaaaleaacaWGQbaabeaakiaacYhacaWGubWaaSbaaSqaaiaadMgaaeqaaOGaaiiFaiaadYeadaWgaaWcbaGaamyAaaqabaGccqGHsgIRcaWGubWaaSbaaSqaaiaadMgaaeqaaOGaaeiiaiaabggacaqGUbGaaeizaiaabccacaWGmbWaaSbaaSqaaiaadMgaaeqaaOGaeyOKH4QaamivamaaBaaaleaacaWGQbaabeaakiaacMcaaaa@7446@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>False discovery rate estimation</p>
            </st>
            <p>A significance threshold can be applied to the probabilities for either the entire regulatory probability matrix or for a specific putative regulator. For the entire probability matrix, this would entail applying a threshold &#955; to the <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula>, where we call <it>L</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>i </it></sub>&#8594; <it>T</it><sub><it>j </it></sub>significant if and only if <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; &#955;. For a given putative regulator, the exact same thresholding would take place, except only the <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> for a fixed putative regulator, gene <it>i</it>, would be considered. The estimate of the FDR corresponding to &#955;, FDR(&#955;), is as follows:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2007-8-10-r219-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>F</m:mi>
                           <m:mover accent="true">
                              <m:mi>D</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mi>R</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>&#955;</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>j</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>P</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>P</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>j</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8805;</m:mo>
                                 <m:mi>&#955;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>#</m:mo>
                                 <m:mo>{</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>P</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>j</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8805;</m:mo>
                                 <m:mi>&#955;</m:mi>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8GiVeY=Pipec8Eeeu0xXdbba9frFj0xb9Lqpepeea0xd9q8qiYRWxGi6xij=hbbc9s8aq0=yqpe0xbbG8A8frFve9Fve9Fj0dmeaabaqaciaacaGaaeqabaqabeGadaaakeaacaWGgbGabmirayaajaGaamOuaiaacIcaiiGacqWF7oaBcaGGPaGaeyypa0tcfa4aaSaaaeaadaaeqaqaaiaacIcacaaIXaGaeyOeI0IabmiuayaajaWaaSbaaeaacaWGPbGaamOAaaqabaaabaGaamyAaiaacYcacaWGQbaabeGaeyyeIuoacaGGPaGaaGymaiaacIcaceWGqbGbaKaadaWgaaqaaiaadMgacaWGQbaabeaacqGHLjYScqWF7oaBcaGGPaaabaGaai4iaiaacUhaceWGqbGbaKaadaWgaaqaaiaadMgacaWGQbaabeaacqGHLjYScqWF7oaBcaGG9baaaaaa@531D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Where 1(<inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; &#955;) is 1 or 0 according to whether <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; &#955; or not, respectively, and # {<inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; &#955;} is the total number of <inline-formula><m:math name="gb-2007-8-10-r219-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>P</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8viVeY=Nipec8Eeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGabmiuayaajaWaaSbaaSqaaiaadMgacaWGQbaabeaaaaa@328F@</m:annotation></m:semantics></m:math></inline-formula> &#8805; &#955; <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B65">65</abbr></abbrgrp>. Further details and justification can be found in Additional data file 1.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>FDR, false discovery rate; GO, Gene Ontology; Hsp, heat shock protein; QTL, quantitative trait locus; Trigger, Transcriptional Regulation Inference from Genetics of Gene ExpRession.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LSC and JDS conceived the research, developed the methods, and wrote the paper. LSC analyzed the data. FES provided the visual organization of the network drawn in Figure <figr fid="F2">2</figr>.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> contains the supplementary text and figures. Additional data file <supplr sid="S2">2</supplr> contains the entire matrix of regulatory probabilities for all genes, where the rows are genes acting as regulators and the columns are genes under regulation. Thus, the (<it>i</it>, <it>j</it>) entry of this matrix is the probability that the expression level of gene <it>i </it>is causal for the expression level of gene <it>j</it>. Additional data file <supplr sid="S3">3</supplr> contains the list of significantly regulated genes, posterior probabilities, and other relevant information for each of the four putative regulators considered in detail.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Supplementary text and figures</p>
            </caption>
            <text>
               <p>Presented are supplementary text and figures, as referenced in the main text.</p>
            </text>
            <file name="gb-2007-8-10-r219-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Entire matrix of regulatory probabilities for all genes</p>
            </caption>
            <text>
               <p>Presented is the entire matrix of regulatory probabilities for all genes, where the rows are genes acting as regulators and the columns are genes under regulation. Thus, the (<it>i</it>,<it>j</it>) entry of this matrix is the probability that the expression level of gene <it>i </it>is causal for the expression level of gene <it>j</it>.</p>
            </text>
            <file name="gb-2007-8-10-r219-S2.zip">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Significantly regulated genes, posterior probabilities, and other relevant information</p>
            </caption>
            <text>
               <p>Presented is a list of significantly regulated genes, posterior probabilities, and other relevant information for each of the four putative regulators considered in detail.</p>
            </text>
            <file name="gb-2007-8-10-r219-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Leonid Kruglyak for generously sharing data. We would also like to thank Joshua Akey, Troels Marstrand, Thomas Richardson, and James Ronald for several helpful conversations. This research was supported in part by NIH grant R01 HG002913.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Quantitative monitoring of gene expression patterns with a complementary DNA microarray.</p>
            </title>
            <aug>
               <au>
                  <snm>Schena</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shalon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>467</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.270.5235.467</pubid>
                  <pubid idtype="pmpid" link="fulltext">7569999</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Printing proteins as microarrays for high-throughput function determination.</p>
            </title>
            <aug>
               <au>
                  <snm>MacBeath</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>289</volume>
            <fpage>1760</fpage>
            <lpage>1763</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10976071</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Matsuzaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Loi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Di</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hubbell</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Law</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Berntsen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chadha</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hui</snm>
                  <fnm>H</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2004</pubdate>
            <volume>1</volume>
            <fpage>109</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth718</pubid>
                  <pubid idtype="pmpid" link="fulltext">15782172</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Network biology: Understanding the cell's functional organization.</p>
            </title>
            <aug>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1272</pubid>
                  <pubid idtype="pmpid" link="fulltext">14735121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Systems biology 101: what you need to know.</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <fpage>473</fpage>
            <lpage>475</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt0404-473</pubid>
                  <pubid idtype="pmpid">15085805</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genetics and Analysis of Quantitative Traits</source>
            <publisher>Sinauer Associates, Sunderland, MA USA</publisher>
            <pubdate>1998</pubdate>
         </bibl>
         <bibl id="B7">
            <aug>
               <au>
                  <snm>Weinzierl</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mechanisms of Gene Expression: Structure, Function and Evolution of the Basal Transcriptional Machinery</source>
            <publisher>World Scientific Publishing Company, Hackensack, NJ USA</publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genomic expression programs in the response of yeast cells to environmental changes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gasch</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Kao</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Carmel-Harel</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>2000</pubdate>
            <volume>11</volume>
            <fpage>4241</fpage>
            <lpage>4257</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15070</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae </it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genetic interactions between polymorphisms that affect gene expression in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Brem</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Whittle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>436</volume>
            <fpage>701</fpage>
            <lpage>703</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1409747</pubid>
                  <pubid idtype="pmpid" link="fulltext">16079846</pubid>
                  <pubid idtype="doi">10.1038/nature03865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A statistical problem for inference to regulatory structure from associations of gene expression measurements with microarrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Chu</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Glymour</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scheines</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Spirtes</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1147</fpage>
            <lpage>1152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg011</pubid>
                  <pubid idtype="pmpid" link="fulltext">12801876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genetic dissection of transcriptional regulation in budding yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Brem</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Yvert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Clinton</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>752</fpage>
            <lpage>755</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1069516</pubid>
                  <pubid idtype="pmpid" link="fulltext">11923494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genetics of gene expression surveyed in maize, mouse, and man.</p>
            </title>
            <aug>
               <au>
                  <snm>Schadt</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Monks</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Drake</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Lusis</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Che</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Colinayo</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ruff</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Milligan</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Lamb</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Cavet</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>422</volume>
            <fpage>297</fpage>
            <lpage>302</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01434</pubid>
                  <pubid idtype="pmpid" link="fulltext">12646919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p><it>Trans</it>-acting regulatory variation in <it>Saccharomyces cerevisiae </it>and the role of transcription factors.</p>
            </title>
            <aug>
               <au>
                  <snm>Yvert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brem</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Whittle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Foss</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Mackelprang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>35</volume>
            <fpage>57</fpage>
            <lpage>64</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1222</pubid>
                  <pubid idtype="pmpid" link="fulltext">12897782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Natural variation in human gene expression assessed in lymphoblastoid cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Cheung</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Conlin</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Arcaro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jen</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Morley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spielman</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <fpage>422</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1094</pubid>
                  <pubid idtype="pmpid" link="fulltext">12567189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Dimension reduction for mapping mRNA abundance as quantitative traits.</p>
            </title>
            <aug>
               <au>
                  <snm>Lan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Stoehr</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Nadler</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Schueler</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Attie</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <fpage>1607</fpage>
            <lpage>1614</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462655</pubid>
                  <pubid idtype="pmpid" link="fulltext">12930764</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Multiple locus linkage analysis of genomewide expression in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e267</fpage>
            <lpage/>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0030267</pubid>
                  <pubid idtype="pmpid" link="fulltext">16035920</pubid>
                  <pubid idtype="pmcid">1180512</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Estimating causal effects of treatments in randomized and nonrandomized studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Educ Psychol</source>
            <pubdate>1974</pubdate>
            <volume>66</volume>
            <fpage>688</fpage>
            <lpage>701</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1037/h0037350</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Statistics and Causal Inference.</p>
            </title>
            <aug>
               <au>
                  <snm>Holland</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Am Stat Assoc</source>
            <pubdate>1986</pubdate>
            <volume>81</volume>
            <fpage>945</fpage>
            <lpage>960</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2289064</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Randomization, statistics, and causal inference.</p>
            </title>
            <aug>
               <au>
                  <snm>Greenland</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>1990</pubdate>
            <volume>1</volume>
            <fpage>421</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00001648-199011000-00003</pubid>
                  <pubid idtype="pmpid">2090279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Detection of regulatory variation in mouse genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Cowles</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Hirschhorn</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>432</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng992</pubid>
                  <pubid idtype="pmpid" link="fulltext">12410233</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Variation in gene expression within and among natural populations.</p>
            </title>
            <aug>
               <au>
                  <snm>Oleksiak</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>261</fpage>
            <lpage>266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng983</pubid>
                  <pubid idtype="pmpid" link="fulltext">12219088</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The contributions of sex, genotype and age to transcriptional variance in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Jin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Wolfinger</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Passador-Gurgel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>389</fpage>
            <lpage>395</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng766</pubid>
                  <pubid idtype="pmpid" link="fulltext">11726925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Allelic variation in human gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Yan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Velculescu</snm>
                  <fnm>VE</fnm>
               </au>
               <au>
                  <snm>Vogelstein</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kinzler</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>1143</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1072545</pubid>
                  <pubid idtype="pmpid" link="fulltext">12183620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Abundant raw material for <it>cis</it>-regulatory evolution in humans.</p>
            </title>
            <aug>
               <au>
                  <snm>Rockman</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Wray</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1991</fpage>
            <lpage>2004</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12411608</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Statistical significance for genome-wide studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>9440</fpage>
            <lpage>9445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170937</pubid>
                  <pubid idtype="pmpid" link="fulltext">12883005</pubid>
                  <pubid idtype="doi">10.1073/pnas.1530509100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>How to avoid bias when comparing bone marrow transplantation with chemotherapy.</p>
            </title>
            <aug>
               <au>
                  <snm>Gray</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wheatley</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Bone Marrow Transplant</source>
            <pubdate>1991</pubdate>
            <issue>Suppl 3</issue>
            <fpage>9</fpage>
            <lpage>12</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease?</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Ebrahim</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Int J Epidemiol</source>
            <pubdate>2003</pubdate>
            <volume>32</volume>
            <fpage>1</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/ije/dyg070</pubid>
                  <pubid idtype="pmpid" link="fulltext">12689998</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>An integrative genomics approach to infer causal associations between gene expression and disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Schadt</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Lamb</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Guhathakurta</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sieberts</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Monks</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reitman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>710</fpage>
            <lpage>717</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1589</pubid>
                  <pubid idtype="pmpid" link="fulltext">15965475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Genetical genomics analysis of a yeast segregant population for transcription network inference.</p>
            </title>
            <aug>
               <au>
                  <snm>Bing</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hoeschele</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>170</volume>
            <fpage>533</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1450429</pubid>
                  <pubid idtype="pmpid" link="fulltext">15781693</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.041103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Causal inference of regulator-target pairs by gene mapping of expression phenotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kulp</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jagular</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>125</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1481560</pubid>
                  <pubid idtype="pmpid" link="fulltext">16719927</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-125</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Structural model analysis of multiple quantitative traits.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tsaih</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Shockley</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Stylianou</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Wergedal</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Paigen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
        