<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-7-r62</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Method</dochead>
		<bibl>
			<title>
				<p>Validation and refinement of gene-regulatory pathways on a network of physical interactions</p>
			</title>
			<aug>
				<au id="A1" ce="yes">
					<snm>Yeang</snm>
					<fnm>Chen-Hsiang</fnm>
					<insr iid="I1"/>
					<email>chyeang@soe.ucsc.edu</email>
				</au>
				<au id="A2" ce="yes">
					<snm>Mak</snm>
					<fnm>H Craig</fnm>
					<insr iid="I2"/>
					<email>cmak@biomail.ucsd.edu</email>
				</au>
				<au id="A3">
					<snm>McCuine</snm>
					<fnm>Scott</fnm>
					<insr iid="I2"/>
					<email>scott@bioeng.ucsd.edu</email>
				</au>
				<au id="A4">
					<snm>Workman</snm>
					<fnm>Christopher</fnm>
					<insr iid="I2"/>
					<email>cworkman@bioeng.ucsd.edu</email>
				</au>
				<au id="A5">
					<snm>Jaakkola</snm>
					<fnm>Tommi</fnm>
					<insr iid="I3"/>
					<email>tommi@csail.mit.edu</email>
				</au>
				<au id="A6" ca="yes">
					<snm>Ideker</snm>
					<fnm>Trey</fnm>
					<insr iid="I2"/>
					<email>trey@bioeng.ucsd.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Center for Biomolecular Science and Engineering, Baskin School of Engineering, University of California at Santa Cruz, Santa Cruz, CA 95064, USA</p>
				</ins>
				<ins id="I2">
					<p>Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093, USA</p>
				</ins>
				<ins id="I3">
					<p>Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>7</issue>
			<fpage>R62</fpage>
			<url>http://genomebiology.com/2005/6/7/R62</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15998451</pubid><pubid idtype="doi">10.1186/gb-2005-6-7-r62</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>9</day>
					<month>3</month>
					<year>2005</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>3</day>
					<month>5</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>3</day>
					<month>6</month>
					<year>2005</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>1</day>
					<month>7</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>Yeang et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Validation and refinement of gene-regulatory pathways on a network of physical interactions</p>
		</shorttitle>
		<shortabs>
			<p>A new automated procedure for prioritizing genetic perturbations was used to evaluate 38 candidate regulatory networks in yeast. Further analysis of four high-priority gene knockout experiments provided new insights into two regulatory pathways</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>As genome-scale measurements lead to increasingly complex models of gene regulation, systematic approaches are needed to validate and refine these models. Towards this goal, we describe an automated procedure for prioritizing genetic perturbations in order to discriminate optimally between alternative models of a gene-regulatory network. Using this procedure, we evaluate 38 candidate regulatory networks in yeast and perform four high-priority gene knockout experiments. The refined networks support previously unknown regulatory mechanisms downstream of <it>SOK2 </it>and <it>SWI4</it>.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Recent advances in genomics and computational biology are enabling construction of large-scale models of gene-regulatory networks. High-throughput technologies such as automated sequencing <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, gene-expression arrays <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, chromatin immunoprecipitation <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, and yeast two-hybrid assays <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, each probe different aspects of the gene-regulatory system through genome-wide datasets. These data have spawned a variety of methods to infer the structure of gene-regulatory networks or to study their high-level properties, as recently reviewed <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
			<p>Regulatory network models generated thus far in <it>Escherichia coli </it>and budding yeast (<it>Saccharomyces cerevisiae</it>) have been most often validated against functional databases or previous literature <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. In contrast, only a few studies have attempted to validate or refine models systematically <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. However, if we are to accurately model large gene networks in complex organisms, including fly, worm, mouse, and human, automated procedures will be essential for analyzing the network, choosing the best new experiments to test the model, conducting the experiments, and integrating the resulting data.</p>
			<p>The problem of choosing the best experiments to estimate a model, termed 'experimental design' or 'active learning', has been a significant area of research in statistics and machine learning <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Automating the experimental design process can greatly accelerate data collection and model building, leading to substantial savings in time, materials, and human effort. For these reasons, many industries such as electronic circuit fabrication and airplane manufacturing incorporate experimental design as an integral step in the design process <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. A promising application of experimental design for biological systems was presented by King <it>et al</it>. <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, who integrated computational modeling and experimental design to reconstruct a small, well studied metabolic pathway. Whether automated experimental design can be useful in a large and poorly characterized biological system with noisy data remains an open question.</p>
			<p>We recently reported a procedure for inferring gene-regulatory network models by integrating gene-expression profiles with high-throughput measurements of protein interactions <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Here we extend this procedure to incorporate automated design of new experiments. First, we use the previously described modeling procedure to generate a library of models corresponding to different gene-regulatory systems in yeast. Many of these models contain transcriptional interactions for which the regulatory effects (inducer versus repressor) are ambiguous and cannot be determined from publicly available expression profiles. Next, to address these ambiguities we implement a score function that ranks possible genetic perturbation experiments on the basis of their projected information content over the models. We perform four of the highest-ranking perturbations experimentally and integrate the data back into the model. The new data support two out of three novel regulatory pathways predicted to mediate expression changes downstream of the yeast transcriptional regulator <it>SWI4</it>.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Summary of physical regulatory models</p>
				</st>
				<p>We applied a previously described network-modeling procedure <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to integrate three complementary sources of gene-regulatory information in yeast: 5,558 promoter-binding interactions for 106 transcription factors measured using chromatin immunoprecipitation followed by microarray chip hybridization (ChIP-chip) <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>; the set of all 15,116 pairwise protein-protein interactions recorded in the Database of Interacting Proteins as of April 2004 <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>; and a panel of mRNA expression profiles for 273 individual gene-deletion experiments <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Software for performing the network-modeling procedure is available as a plug-in to the Cytoscape package <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> on our supplementary website <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
				<p>For each gene-deletion experiment, the modeling procedure identified the most probable paths of protein-protein and promoter-binding interactions that connect the deleted gene (the perturbation) to genes that were differentially expressed in response to the deletion (the effects of perturbation). Thus, a path represented one possible physical explanation by which a deleted gene regulates a second gene downstream. From the expression data, each interaction on a path was annotated with its probable direction of information flow and its probable regulatory effect as an inducer or repressor.</p>
				<p>For example, the model in Figure <figr fid="F1">1a</figr> (top center) includes a path from <it>GLN3 </it>through <it>GCN4 </it>to a block of downstream affected genes. This model integrates evidence that: Gln3p binds the promoter of <it>GCN4 </it>with high significance in a ChIP-chip assay <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> (<it>p </it>&#8804; 8 &#215; 10<sup>-4</sup>); Gcn4p binds the promoters of many genes in the ChIP-chip assay (<it>RIB5</it>, <it>YJL200C</it>, and others in the downstream block); and a significant number of genes in the block are upregulated in a <it>gln3</it>&#916; knockout but downregulated in a <it>gcn4</it>&#916;  knockout <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Together, this evidence confirms Gcn4p as an activator of downstream genes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and leads to a (novel) annotation that Gln3p is likely to regulate <it>GCN4 </it>via transcriptional repression.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Wiring diagrams for example network models</p>
					</caption>
					<text>
						<p>Wiring diagrams for example network models. <b>(a) </b>Model 0, showing regulatory pathways that have unique functional annotations. <b>(b,c) </b>Model 1, showing regulatory pathways downstream of <it>SWI4 </it>and <it>SOK2 </it>with ambiguous functional annotations (several would be consistent with the observed expression responses: two possibilities are shown in (b) and (c), respectively). In the models, a connection from gene <it>a </it>to <it>b </it>represents the experimental observation that the proteins encoded by <it>a </it>and <it>b </it>physically interact in a protein-protein interaction (dotted links), or that the protein encoded by <it>a </it>binds the promoter of <it>b </it>(solid links). Each gene is either defined by an original knockout (red nodes), a differentially expressed effect (yellow nodes), or a signal transducer that was chosen for follow-up perturbation (gray nodes). Functional annotations (edge colors) are uniquely determined in (a) whereas multiple annotations are possible in (b) and (c) based on the available data. Diagram layout is performed automatically using the Cytoscape package [21].</p>
					</text>
					<graphic file="gb-2005-6-7-r62-1"/>
				</fig>
				<p>In total, the modeling process generated 4,836 paths, each explaining expression changes for a particular gene in one or more knockout experiments. Of the 965 interactions covered by paths, 194 had regulatory effects that were uniquely determined by the data, while regulatory effects of the remaining 771 interactions were ambiguous. For example, Figure <figr fid="F1">1b</figr> includes ambiguous interaction paths through <it>SWI4</it>, <it>SOK2</it>, and <it>MSN4</it>, explaining the observation that many genes for which the promoters are bound by Msn4p are upregulated in a <it>swi4</it>&#916;  knockout. This observation can be explained by several alternative annotations: one scenario is that <it>SWI4 </it>activates <it>SOK2 </it>and <it>SOK2 </it>represses <it>MSN4 </it>(Figure <figr fid="F1">1b</figr>), whereas another is that <it>SWI4 </it>represses <it>SOK2 </it>and <it>SOK2 </it>activates <it>MSN4 </it>(Figure <figr fid="F1">1c</figr>). These regulatory annotations could be uniquely determined by measuring the expression changes of genes downstream of <it>MSN4 </it>in the model in response to a <it>sok2</it>&#916;  deletion and an <it>msn4</it>&#916;  deletion (see below).</p>
				<p>Paths with ambiguous interactions were partitioned into 37 independent network models (numbered 1-37), where each model contained a distinct region of the physical network (see Materials and methods and Additional data file 1). The remaining non-ambiguous paths were grouped into a single model (Model 0). As shown in Table <tblr tid="T1">1</tblr>, 21 of the models (55%) contained pathways that are well documented in the literature or are significantly enriched for genes belonging to specific Munich Information Center for Protein Sequences (MIPS) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> functional categories. Of 132 protein-DNA interactions incorporated into Model 0, we found that 50 had been confirmed in classical (low-throughput) assays as reported in the Proteome BioKnowledge Library <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Moreover, the inferred regulatory roles (induction or repression) for 48 out of 50 of these interactions agreed with their experimentally determined roles (96%, binomial <it>p</it>-value &lt; 1.22 &#215; 10<sup>-7</sup>). Wiring diagrams for Models 0 and 1 are given in Figure <figr fid="F1">1</figr>; diagrams for all other regulatory network models are provided in Additional data file 1 and at <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Internal validation for 21 of the 38 inferred models</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Model</p>
							</c>
							<c ca="center">
								<p>Number of genes</p>
							</c>
							<c ca="center">
								<p>Number of variants</p>
							</c>
							<c ca="left">
								<p>Validated literature pathway</p>
							</c>
							<c ca="left">
								<p>Enriched MIPS functions</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>0</p>
							</c>
							<c ca="center">
								<p>130</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="left">
								<p>Kss1/Fus3-Ste12 (mating response and filamentous growth)</p>
							</c>
							<c ca="left">
								<p>Cell fate (1.48 &#215; 10-7); metabolism (0.0067)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1</p>
							</c>
							<c ca="center">
								<p>69</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="left">
								<p>Sok2-Msn4 (PKA pathway)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>2</p>
							</c>
							<c ca="center">
								<p>63</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="left">
								<p>Tup1-Hhf1 (histone regulation)</p>
							</c>
							<c ca="left">
								<p>Protein synthesis (7.13 &#215; 10-8)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>3</p>
							</c>
							<c ca="center">
								<p>44</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="left">
								<p>Tup1/Ssn6-Nrg1 (glucose metabolism)</p>
							</c>
							<c ca="left">
								<p>Transport (1.05 x 10.5); metabolism (5.41 &#215; 10-4)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>4</p>
							</c>
							<c ca="center">
								<p>58</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="left">
								<p>Tup1/Ssn6-&#945;2/Mcm1 (mating response)</p>
							</c>
							<c ca="left">
								<p>Cell fate (1.12 &#215; 10-5);</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>5</p>
							</c>
							<c ca="center">
								<p>58</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Rpd3-Abf1 (histone modification)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6</p>
							</c>
							<c ca="center">
								<p>44</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="left">
								<p>Swi4-Ndd1-Ace2 (cell cycle)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>7</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cell cycle (0.035)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>8</p>
							</c>
							<c ca="center">
								<p>36</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="left">
								<p>Slt2-Rlm1/Swi4 (PKC pathway)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>10</p>
							</c>
							<c ca="center">
								<p>45</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="left">
								<p>Med2-Gal4/Gcn4 (general transcription)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>15</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Cmd1-Cna1-Skn7 (calcium signaling)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>19</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cell defense (6.33 &#215; 10-6)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>23</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Metabolism (1.49 &#215; 10-6);energy (0.04)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>26</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cell defense (9.62 &#215; 10-5)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>29</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Yap1-Cad1 (metal response)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>30</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Med2-Srb6-Gal4 (general transcription)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>32</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Med2-Gal11-Gal4 (general transcription)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>33</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Med2-Srb5-Gal4 (general transcription)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>34</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="left">
								<p>Ste12-Mcm1 (mating response)</p>
							</c>
							<c ca="left">
								<p>Cell fate (4.55 &#215; 10-8); homeostasis (0.0012); cell communication (0.0345)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>36</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Metabolism (0.0258)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>40</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Metabolism (0.0017)</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The number of genes and variants are shown for each model along with the results of our preliminary validations. Each variant corresponds to a distinct set of functional annotations on the interactions in the model (directions and regulatory effects, see text). For Model 0, the expression data implied a unique set of annotations; for all other models multiple sets of annotations were possible. Each model was validated if its pathways were (wholly or partially) cited in previous studies or its downstream genes were significantly enriched for MIPS functional categories (<it>p </it>&#8804; 0.05; hypergeometric test with Bonferroni correction).</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Experiment selection</p>
				</st>
				<p>As shown in Figure <figr fid="F2">2</figr>, we implemented an information-theoretic approach to discriminate between ambiguous model annotations using the fewest additional gene-expression experiments. All non-lethal single-gene knockout experiments were ranked by their projected information content based on the inferred models (see Materials and methods). Table <tblr tid="T2">2</tblr> reports the list of top-ranking experiments. This list coincides roughly with biological intuition, in the sense that informative target genes typically encode proteins that are network 'hubs', each having a large number of regulatory interactions with downstream genes in the models. However, as discussed later, knocking out hubs only is not as effective as using the information-theoretic criteria.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Schematic of the experimental design approach</p>
					</caption>
					<text>
						<p>Schematic of the experimental design approach. The input to the approach is a set of alternative representations of a gene-regulatory model, each of which is equally likely given current expression data. In the present work, the alternatives arise as a result of ambiguities in the regulatory roles of interactions in the model as inducers or repressors of downstream genes. Next, a scoring procedure is used to rank candidate perturbations according to their expected information gain over the model alternatives. High-ranking perturbations are applied to the system and characterized using gene-expression microarrays. The resulting expression profiles validate or invalidate particular connections in the model and reduce the set of model alternatives to those that are consistent with both old and new expression measurements.</p>
					</text>
					<graphic file="gb-2005-6-7-r62-2"/>
				</fig>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Top-ranking knock-out experiments proposed for model discrimination</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="center">
								<p>Gene</p>
							</c>
							<c ca="left">
								<p>Function</p>
							</c>
							<c ca="center">
								<p>Score</p>
							</c>
							<c ca="center">
								<p>Downstream genes</p>
							</c>
							<c ca="center">
								<p>Rank</p>
							</c>
							<c ca="center">
								<p>Model</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>HHF1</it>
								</p>
							</c>
							<c ca="left">
								<p>Histone</p>
							</c>
							<c ca="center">
								<p>52.1429</p>
							</c>
							<c ca="center">
								<p>74</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>SOK2*</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator for meiosis and PKA pathway</p>
							</c>
							<c ca="center">
								<p>45.0279</p>
							</c>
							<c ca="center">
								<p>64</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>CKA1</it>
								</p>
							</c>
							<c ca="left">
								<p>Protein kinase of cell cycle</p>
							</c>
							<c ca="center">
								<p>45.0075</p>
							</c>
							<c ca="center">
								<p>64</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>A2</it>
								</p>
							</c>
							<c ca="left">
								<p>Mating response</p>
							</c>
							<c ca="center">
								<p>40.9023</p>
							</c>
							<c ca="center">
								<p>58</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>YAP6*</it>
								</p>
							</c>
							<c ca="left">
								<p>Stress response regulator</p>
							</c>
							<c ca="center">
								<p>35.1652</p>
							</c>
							<c ca="center">
								<p>50</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>1, 3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>NRG1</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator of glucose dependent genes</p>
							</c>
							<c ca="center">
								<p>31.6501</p>
							</c>
							<c ca="center">
								<p>45</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>FKH1</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator of cell cycle</p>
							</c>
							<c ca="center">
								<p>29.1194</p>
							</c>
							<c ca="center">
								<p>41</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>FKH2</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator of cell cycle</p>
							</c>
							<c ca="center">
								<p>26.7131</p>
							</c>
							<c ca="center">
								<p>38</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>SLT2</it>
								</p>
							</c>
							<c ca="left">
								<p>Protein kinase of cell wall integrity pathway</p>
							</c>
							<c ca="center">
								<p>23.4727</p>
							</c>
							<c ca="center">
								<p>31</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>MSN4*</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator of stress response</p>
							</c>
							<c ca="center">
								<p>21.8224</p>
							</c>
							<c ca="center">
								<p>31</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<it>HAP4*</it>
								</p>
							</c>
							<c ca="left">
								<p>Regulator of cellular respiration</p>
							</c>
							<c ca="center">
								<p>6.3310</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>34</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Each proposed target gene is reported, along with its function, mutual information score, rank, and the model(s) it informs. All target genes are non-lethal in rich media. *Gene knockouts selected in this study.</p>
					</tblfn>
				</tbl>
				<p>Among the highest-priority experiments, Model 1 (Figure <figr fid="F1">1b</figr>) was the most often targeted, containing three of the top 10 highest-scoring genetic perturbations: <it>sok2</it>&#916;, <it>yap6</it>&#916;, and <it>msn4</it>&#916;. A fourth perturbation to Model 1, <it>hap4</it>&#916;, was also highly ranked (rank 34). Therefore, Model 1 was chosen for further experimentation.</p>
			</sec>
			<sec>
				<st>
					<p>Model validation</p>
				</st>
				<p>Knockout strains corresponding to the high-ranking perturbations <it>sok2</it>&#916;, <it>yap6</it>&#916;, <it>hap4</it>&#916;, and <it>msn4</it>&#916;  were grown in quadruplicate under conditions identical to those for the initial 273 knockouts by Hughes <it>et al. </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Gene-expression profiles were obtained for each knockout culture versus wild type using yeast genome microarrays. We sought to test the three regulatory cascades leading from <it>SWI4 </it>to <it>SOK2 </it>to either <it>MSN4</it>, <it>HAP4</it>, or <it>YAP6 </it>(Figure <figr fid="F1">1b</figr>). To verify these cascades independently of the model, we analyzed the expression patterns of gene sets known to be directly regulated by <it>MSN4</it>, <it>HAP4</it>, or <it>YAP6 </it>(obtained from the Proteome BioKnowledge Library <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>; see Additional data file 1). To normalize between our microarray procedures and those of Hughes <it>et al</it>., we also repeated the original <it>swi4</it>&#916;  expression profile, and filtered the above sets to select only those genes with expression changes that were reproducible (that is, same direction of change) between the Hughes <it>et al</it>. <it>swi4</it>&#916;  profile and our new profile. Expression changes were reproducible for 28 of 42 Msn4p-regulated genes, 11 of 29 Hap4p-regulated genes, and 64 of 119 Yap6p-regulated genes. Expression similarity among the genes in each filtered set was captured formally in a measure called 'coherence'; details of the computation of expression coherence and the selection of the gene sets are described further in Materials and methods and <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
				<p>As shown in Figure <figr fid="F3">3a</figr>, the gene set downstream of <it>MSN4 </it>showed coherent upregulation in the <it>swi4</it>&#916;  (<it>p </it>&#8804; 10<sup>-4</sup>) and <it>sok2</it>&#916;  (<it>p </it>&#8804; 10<sup>-4</sup>) knockouts, but downregulation in the <it>msn4</it>&#916;  (<it>p </it>&#8804; 8 &#215; 10<sup>-4</sup>) knockout. This result supports the existence of a regulatory cascade leading from <it>SWI4 </it>to <it>SOK2 </it>to <it>MSN4</it>. Furthermore, in the context of the present regulatory cascade, <it>MSN4 </it>appears to be an inducer as its downstream gene set was downregulated in the <it>msn4</it>&#916;  experiment. In contrast, <it>SOK2 </it>appears to be a repressor of <it>MSN4 </it>as a <it>sok2</it>&#916;  deletion experiment upregulates the same set of genes. Finally, <it>SWI4 </it>appears to be an inducer of <it>SOK2 </it>as the <it>swi4</it>&#916;  knockout has the same effect as <it>sok2</it>&#916;  (that is, upregulation).</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Validation and refinement of Swi4 transcriptional cascades</p>
					</caption>
					<text>
						<p>Validation and refinement of Swi4 transcriptional cascades. Yeast genome microarrays were used to explore three transcriptional cascades from Model 1 involving the transcriptional regulators Swi4p, Sok2p, and either <b>(a) </b>Msn4p, <b>(b) </b>Hap4p, or <b>(c) </b>Yap6p. Bar charts show the expression coherence of genes regulated by Msn4p, Hap4p, or Yap6p in knockout strains <it>swi4</it>&#916;, <it>sok2</it>&#916;, <it>msn4</it>&#916;, <it>hap4</it>&#916;, and <it>yap6</it>&#916;. Coherence scores more extreme than &#177; 0.7 are significant (<it>p </it>&lt; 0.01, dotted lines). <b>(d) </b>Results are also shown for genes bound by Msn1p as representative of an unrelated model not targeted by these perturbations. This analysis provides validation for the Msn4 and Hap4 pathways and disambiguates the role of each pathway interaction as activating (Swi4 interactions) or repressing (Sok2 interactions) downstream genes <b>(e)</b>. The Yap6 pathway hypothesis is not supported by this analysis.</p>
					</text>
					<graphic file="gb-2005-6-7-r62-3"/>
				</fig>
				<p>Results were qualitatively similar for the <it>HAP4 </it>pathway (Figure <figr fid="F3">3b</figr>). The gene set downstream of <it>HAP4 </it>was upregulated in the <it>swi4</it>&#916;  (<it>p </it>&#8804; 10<sup>-2</sup>) and <it>sok2</it>&#916;  (<it>p </it>&#8804; 9 &#215; 10<sup>-4</sup>) knockouts but downregulated in <it>hap4</it>&#916;  (<it>p </it>&#8804; 10<sup>-4</sup>). These results suggest that <it>swi4</it>&#916;, <it>sok2</it>&#916;, and <it>hap4</it>&#916;  deletions affect the set of genes immediately downstream of <it>HAP4</it>, supporting the <it>SWI4-SOK2-HAP4 </it>regulatory pathway hypothesis. In contrast to the <it>MSN4 </it>and <it>HAP4 </it>pathways, the gene set downstream of <it>YAP6 </it>had insignificant responses to all follow-up knockout experiments (Figure <figr fid="F3">3c</figr>). Thus, the existence of the <it>SWI4-SOK2-YAP6 </it>regulatory pathway was not supported by our validation experiments.</p>
			</sec>
			<sec>
				<st>
					<p>Automated model refinement</p>
				</st>
				<p>We used our modeling procedure to construct a new physical network model using the original 273 knockout gene-expression experiments of Hughes <it>et al</it>. combined with the new <it>sok2</it>&#916;, <it>hap4</it>&#916;, <it>msn4</it>&#916;, and <it>yap6</it>&#916;  profiles. Overall, 60 protein-DNA interactions were disambiguated by our data: 50 interactions were resolved as definite inducers or repressors, whereas ten interactions were removed from the model because the expression of downstream genes did not change as a result of the knockout. In the updated Model 1, <it>MSN4 </it>and <it>HAP4 </it>were unambiguously annotated as inducers of downstream genes, <it>SOK2 </it>was annotated as a repressor of <it>MSN4 </it>and <it>HAP4</it>, and <it>SWI4 </it>was annotated as an inducer of <it>SOK2 </it>(Figure <figr fid="F3">3e</figr>). These results agree with our previous manually derived annotations (see 'Model validation' above).</p>
			</sec>
			<sec>
				<st>
					<p>Learning-curve analysis</p>
				</st>
				<p>We quantified the efficiency of our information-based approach by comparing it to two other methods of prioritizing gene knockout experiments: prioritizing hubs and prioritizing genes randomly. First, we generated a 'reference' model by fixing each ambiguous interaction in Models 1-37 to be an inducer or repressor. Assignments were chosen arbitrarily from the set of annotations that were consistent with the original knockout data. Next, we used each method (information, hub, or random) to iteratively 'learn' these assignments. In each iteration, we selected the highest-priority knockout experiment, simulated the resulting expression changes (up/down) using the reference model, updated the inferred model, and recorded the number of ambiguous interactions that were resolved. This iterative learning procedure was repeated 100 times.</p>
				<p>As shown in Figure <figr fid="F4">4</figr>, the mutual information criterion significantly outperformed hub-based and random selection. The learning curves also provide an estimate of the number of additional experiments needed to reduce model ambiguity below a given level. For example, using the information-based score, ten knockout experiments are needed to reduce the number of ambiguous interactions by 50%. In contrast, over 25 experiments are needed according to the hub-based method. Figure <figr fid="F4">4</figr> suggests that performing 40 additional experiments selected using the information-based score will clarify the regulatory roles of about 70% of the ambiguous interactions. The learning rate of the final 30% becomes very slow because these interactions are isolated in the physical network, unconnected to others, and thus require separate knockouts to decipher each of them.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Simulated learning curves of three experimental design methods</p>
					</caption>
					<text>
						<p>Simulated learning curves of three experimental design methods. Three different methods of selecting experiments are compared: mutual information scores (triangles), hub selection (circles), and random selection (squares). We performed 100 simulated trials and show the average number of ambiguous interactions remaining in the inferred model after each simulated knockout experiment. Vertical bars indicate the standard deviations for the random selection method. The standard deviations for the information and hub selection curves are less than five and are not shown for clarity.</p>
					</text>
					<graphic file="gb-2005-6-7-r62-4"/>
				</fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>We have used global expression profiles to validate models of transcriptional regulation inferred from protein-protein interactions, genome-wide location analysis, and expression data. A previously described network inference algorithm <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> identifies probable paths of physical interactions connecting a gene knockout to genes that are differentially expressed as a result of that knockout. The proposed validation strategy uses information gain as a criterion for choosing optimal knockouts to profile using microarray experiments. This strategy agrees with intuition, in that optimal knockouts typically target intermediate genes along the pathways under consideration. If an intermediate gene knockout fails to affect downstream genes in a pathway, that pathway is removed from the model.</p>
			<p>The validated pathways point to a combination of previously documented and novel findings. First, in agreement with previous literature, we confirm that <it>MSN4 </it>and <it>HAP4 </it>are inducers <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp> and that <it>SOK2 </it>is a repressor <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. For instance, <it>SOK2 </it>is known to act downstream of protein kinase A (PKA) to repress genes involved in stress response, glycogen storage, and pseudohyphal growth <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. However, although <it>SOK2 </it>is thought to control these pathways via a transcriptional cascade, the components of this cascade have remained unclear. Here, we provide evidence for a model in which <it>SOK2 </it>acts as a negative regulator upstream of <it>MSN4 </it>and <it>HAP4</it>. Interestingly, <it>MSN4 </it>has been shown to activate stress-response genes <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and <it>HAP4 </it>has been shown to activate genes involved in energy conservation and oxidative carbohydrate metabolism <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Thus, we have identified a candidate model for the transcriptional cascade downstream of PKA signaling that mediates stress response. This model includes two novel regulatory pathways from <it>SWI4 </it>to <it>SOK2 </it>to <it>MSN4 </it>and from <it>SWI4 </it>to <it>SOK2 </it>to <it>HAP4</it>. The validation experiments do not support the third predicted pathway from <it>SWI4 </it>to <it>SOK2 </it>to <it>YAP6</it>.</p>
			<p>In model simulations, choosing new gene knockout experiments with an information-theoretic approach significantly outperformed both random and hub-based selection. It also outperformed the observed experimental results: approximately 280 interactions were disambiguated after four simulated knockouts (Figure <figr fid="F4">4</figr>), whereas only 60 interactions were resolved due to the four actual knockouts <it>sok2</it>&#916;, <it>hap4</it>&#916;, <it>msn4</it>&#916;, and <it>yap6</it>&#916;. This difference in performance stems from key differences between the simulated and actual scenarios. In simulation, the four experiments are performed independently and iteratively, selecting the absolute highest-ranking knockout each time. In the actual study, four high-ranking experiments (but not the highest) are chosen to interrogate and maximally resolve a single pathway model, resulting in experiments that are highly co-dependent and performed simultaneously without intervening rounds of inference and experimental design. In addition, the simulation assumes that all interactions in the model are correct, along with one of the initial sets of inducer/repressor annotations. It therefore isolates the process of learning regulatory role annotations, whereas the actual procedure also serves to distinguish interactions as true versus false positives. Nevertheless, the simulation provides a useful comparison of experimental design methods relative to each other.</p>
			<p>An important limitation of the single-gene knockout approach is that single perturbations do not identify pathway intermediates for which loss of function can be compensated by another gene. Furthermore, our approach may not identify regulatory pathways in which several transcription factors independently activate gene expression. Applying knockouts in combination may prove fruitful in these cases. For instance, approximately 4,000 double knockouts have been reported in yeast that lead to synthetic lethality: that is, a lethal phenotype that is not observed in either of the single knockouts individually <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. These interactions suggest regulatory relationships which could be incorporated into future work.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>Scientific discovery is an iterative process of building models to explain experimental observations and validating models with new experiments <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Experimental design is the essential link between these two aspects. Here we have explored a framework for modeling transcriptional networks in which experimental design and validation are central features. This framework is based on computational analysis and expression microarrays, both of which are amenable to automation, suggesting a high-throughput strategy for mapping gene-regulatory pathways.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Model building and inference</p>
				</st>
				<p>Physical mechanisms of transcriptional regulation were modeled using an approach described previously <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Briefly, we postulated that the regulatory effects of deleting a gene are propagated along paths of physical interactions (protein-protein and protein-DNA). We formalized the properties of these paths and interactions using a factor graph <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and found the most probable set of paths using the max-product algorithm <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The resulting set of paths was partitioned into independent network models, also as described previously <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The raw data used in the modeling procedure included 5,558 promoter-binding interactions (at <it>p</it>-value &lt; 0.001) for 106 transcription factors <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, the set of all 15,166 pairwise protein-protein interactions recorded in the Database of Interacting Proteins as of April 2004 <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and mRNA expression profiles for 273 individual gene deletion experiments <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Expression changes with a <it>p</it>-value &lt; 0.02 were considered significant.</p>
			</sec>
			<sec>
				<st>
					<p>Experiment scoring</p>
				</st>
				<p>We calculated the expected information gain for each of the 4,756 possible non-lethal single-gene deletion experiments that were not included in the set of 273 deletions used to generate our network models. Intuitively, information gain measures (the logarithm of) the number of ambiguous annotations in the model that are likely to be determined after generating a yeast-genome expression profile in response to a particular gene deletion under consideration. Each gene-deletion experiment predicts a distinct expression profile given a particular configuration of model annotations. Experiments with high information gain are those for which the predicted expression profiles are highly variable over the set of possible annotations. In these cases, only one (or at most a few) of the predicted profiles will match the true observed profile, efficiently constraining the space of possible model annotations.</p>
				<p>The information gain discussed above arises from the expected value of information calculations in statistical decision theory <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Here we describe the score more directly in terms of reduction of model entropy. The entropy of a set of ambiguous model annotations is given by:</p>
				<p>
					<graphic file="gb-2005-6-7-r62-i1.gif"/>
				</p>
				<p>The expected information gain is the difference between the entropies before and after a hypothetical experiment:</p>
				<p>
					<graphic file="gb-2005-6-7-r62-i2.gif"/>
				</p>
				<p>where <it>Y</it><sup><it>e </it></sup>denotes the vector of predicted expression changes for each gene in the model under experiment <it>e</it>. The conditional entropy <it>H</it>(<it>M</it>|<it>Y</it><sup><it>e</it></sup>) requires us to consider all possible models and corresponding outcomes resulting from experiment <it>e</it>. Direct enumeration of all values of <it>M </it>and <it>Y</it><sup><it>e </it></sup>is impractical; instead, we make several simplifying approximations as described at <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Expression profiling</p>
				</st>
				<p>Expression profiling experiments were based on the wild-type diploid BY4743 and homozygous gene knockout strains derived from this parent <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> (Invitrogen), with cultures grown identically to those of Hughes <it>et al</it>. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Labeled cDNA from each gene knockout strain was co-hybridized versus wild type cDNA in quadruplicate two-color microarray hybridizations. Total RNA was isolated by hot acid phenol extraction, purified to mRNA (Ambion PolyAPure kits), and labeled with Cy3 or Cy5 by direct incorporation (Amersham CyScribe First-Strand cDNA Labeling Kit). DNA microarrays were spotted from the Yeast Genome Oligo Set v1.1 (Qiagen) on Corning UltraGAPS slides using an OmniGrid 100 robot (Genomic Solutions). Lyophilized Cy3- and Cy5-labeled samples were resuspended in 50 &#956;l buffer (5&#215; SSC, 0.1% SDS, 1&#215; Denhardt's solution, 25% formamide) and co-hybridized at 42&#176;C beneath a coverslip for 15 h. Arrays were imaged at 10 &#956;m resolution using a ScanArray Lite instrument (PerkinElmer). Raw quantitated background intensities were smoothed using a 7 &#215; 7 median filter, separately for the Cy3 and Cy5 channels, and data were corrected for cyanine-dye dependent bias using a Qspline normalization <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The VERA/SAM package <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> was used to assign a log-likelihood statistic &#955; with each gene, indicating its significance of differential expression in each experiment. Microarray expression data are deposited in the ArrayExpress database <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> under accession numbers A-MEXP-217 (Arrays) and E-MEXP-351 (Experiments).</p>
			</sec>
			<sec>
				<st>
					<p>Expression coherence</p>
				</st>
				<p>The expression coherence of a set of genes measures whether the expression levels of these genes behave similarly in a particular experiment. Each gene <it>i </it>in gene-deletion experiment <it>e </it>has an expression ratio <it>r</it><sub><it>ie </it></sub>(versus wild type) and associated <it>p</it>-value <it>p</it><sub><it>ie </it></sub>of differential expression. First, we filter out insignificant expression changes with a <it>p</it>-value &gt; 0.5. Then, we use the inverse Gaussian cumulative distribution function, &#934;<sup>-1</sup>, to convert each remaining <it>p</it>-value into a z-score <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>:</p>
				<p><it>z</it><sub><it>ie </it></sub>= &#934;<sup>-1 </sup>(1 - <it>p</it><sub><it>ie</it></sub>)</p>
				<p>Next, we compute a 'signed z-score' by multiplying z by +1 if the expression level is increasing and by -1 if it is decreasing. The average signed z-score for a gene subset of size <it>N </it>is computed as:</p>
				<p>
					<graphic file="gb-2005-6-7-r62-i3.gif"/>
				</p>
				<p>Gene sets with expression changes that are significant and in the same direction result in large <it>Z</it>-values. A distribution of <it>Z </it>values obtained from random gene sets of size <it>N </it>was used to determine a <it>p</it>-value for each expression coherence score.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>Additonal data is available with the online version of this paper. Additional data file 1 contains Tables S1-S4 and wiring diagram illustrations for Models 0-44. Table S1 gives the internal validation for 17 out of 24 restricted network models; Table S2 lists the correlations between swi4&#948; and gcn4&#948; data and Rosetta and the new experiments; Table S3 gives the restricted subsets used to evaluate the reproducibility; and Table S4 gives the gene sets for external validation.</p>
			<suppl id="S1">
				<title>
					<p>Additional File 1</p>
				</title>
				<caption>
					<p>Tables S1-S4 and wiring diagram illustrations for Models 0-44</p>
				</caption>
				<text>
					<p>Table S1 gives the internal validation for 17 out of 24 restricted network models; Table S2 lists the correlations between swi4&#948; and gcn4&#948; data and Rosetta and the new experiments; Table S3 gives the restricted subsets used to evaluate the reproducibility; and Table S4 gives the gene sets for external validation.</p>
				</text>
				<file name="gb-2005-6-7-r62-S1.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We are grateful to Owen Ozier, Ryan Kelley, and Rowan Christmas for their valuable assistance with model visualization, and to Julia Zeitlinger for commenting on the manuscript. C.M., C.W., and S.M. were supported by NIGMS grant GM070743-01 and NSF grant CCF-0425926. T.I. was supported by a David and Lucille Packard Fellowship award. C.Y. and T.J. were supported in part by NIH grant(s) GM68762 and GM69676.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Automated DNA sequencing and analysis of the human genome.</p>
				</title>
				<aug>
					<au>
						<snm>Hood</snm>
						<fnm>LE</fnm>
					</au>
					<au>
						<snm>Hunkapiller</snm>
						<fnm>MW</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>LM</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>1987</pubdate>
				<volume>1</volume>
				<fpage>201</fpage>
				<lpage>212</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0888-7543(87)90046-2</pubid>
						<pubid idtype="pmpid">3328736</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Genomics, gene expression and DNA arrays.</p>
				</title>
				<aug>
					<au>
						<snm>Lockhart</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Winzeler</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2000</pubdate>
				<volume>405</volume>
				<fpage>827</fpage>
				<lpage>836</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35015701</pubid>
						<pubid idtype="pmpid" link="fulltext">10866209</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Lee</snm>
						<fnm>TI</fnm>
					</au>
					<au>
						<snm>Rinaldi</snm>
						<fnm>NJ</fnm>
					</au>
					<au>
						<snm>Robert</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Odom</snm>
						<fnm>DT</fnm>
					</au>
					<au>
						<snm>Bar-Joseph</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gerber</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Hannett</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Harbison</snm>
						<fnm>CT</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Simon</snm>
						<fnm>I</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>298</volume>
				<fpage>799</fpage>
				<lpage>804</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1075090</pubid>
						<pubid idtype="pmpid" link="fulltext">12399584</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>A comprehensive analysis of protein-protein interactions in <it>Saccharomyces cerevisiae</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Uetz</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Giot</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Cagney</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Mansfield</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Judson</snm>
						<fnm>RS</fnm>
					</au>
					<au>
						<snm>Knight</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Lockshon</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Narayan</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Srinivasan</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Pochart</snm>
						<fnm>P</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2000</pubdate>
				<volume>403</volume>
				<fpage>623</fpage>
				<lpage>627</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35001009</pubid>
						<pubid idtype="pmpid" link="fulltext">10688190</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Modeling and simulation of genetic regulatory systems: a literature review.</p>
				</title>
				<aug>
					<au>
						<snm>de Jong</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2002</pubdate>
				<volume>9</volume>
				<fpage>67</fpage>
				<lpage>103</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/10665270252833208</pubid>
						<pubid idtype="pmpid" link="fulltext">11911796</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Cluster analysis and display of genome-wide expression patterns.</p>
				</title>
				<aug>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>Spellman</snm>
						<fnm>PT</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<fpage>14863</fpage>
				<lpage>14868</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">24541</pubid>
						<pubid idtype="pmpid" link="fulltext">9843981</pubid>
						<pubid idtype="doi">10.1073/pnas.95.25.14863</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data.</p>
				</title>
				<aug>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Shapira</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<fpage>166</fpage>
				<lpage>176</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12740579</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>A system for identifying genetic networks in gene expression patterns produced by gene disruptions and overexpressions.</p>
				</title>
				<aug>
					<au>
						<snm>Akutsu</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Maruyama</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Genome Inform Ser Workshop</source>
				<pubdate>1998</pubdate>
				<volume>9</volume>
				<fpage>151</fpage>
				<lpage>160</lpage>
			</bibl>
			<bibl id="B9">
				<title>
					<p>How to reconstruct a large genetic network from n gene perturbations in fewer than n(2) easy steps.</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>1183</fpage>
				<lpage>1197</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/17.12.1183</pubid>
						<pubid idtype="pmpid" link="fulltext">11751227</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Discovery of regulatory interactions through perturbation: inference and experimental design.</p>
				</title>
				<aug>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Karp</snm>
						<fnm>RM</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2000</pubdate>
				<fpage>305</fpage>
				<lpage>316</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10902179</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Integrated genomic and proteomic analysis of a systematically perturbed metabolic network.</p>
				</title>
				<aug>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Ranish</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Christmas</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Buhler</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Eng</snm>
						<fnm>JK</fnm>
					</au>
					<au>
						<snm>Bumgarner</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Goodlett</snm>
						<fnm>DR</fnm>
					</au>
					<au>
						<snm>Aebersold</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Hood</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2001</pubdate>
				<volume>292</volume>
				<fpage>929</fpage>
				<lpage>934</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.292.5518.929</pubid>
						<pubid idtype="pmpid" link="fulltext">11340206</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<aug>
					<au>
						<snm>Raiffa</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Shlaifer</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Applied Statistical Decision Theory</source>
				<publisher>Cambridge, MA: MIT Press</publisher>
				<pubdate>1962</pubdate>
			</bibl>
			<bibl id="B13">
				<aug>
					<au>
						<snm>Fedorov</snm>
						<fnm>FF</fnm>
					</au>
				</aug>
				<source>Theory of Optimal Experimental Design</source>
				<publisher>New York: Academic Press</publisher>
				<pubdate>1972</pubdate>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Active learning for parameter estimation in Bayesian networks.</p>
				</title>
				<aug>
					<au>
						<snm>Tong</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc 13th Conf Neural Information Processing</source>
				<publisher>T&#252;bingen: Neural Information Processing Systems</publisher>
				<pubdate>2000</pubdate>
				<fpage>647</fpage>
				<lpage>563</lpage>
				<url>http://www.nips.cc</url>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Logic design verification via test generation.</p>
				</title>
				<aug>
					<au>
						<snm>Abadir</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Ferguson</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kirkland</snm>
						<fnm>TE</fnm>
					</au>
				</aug>
				<source>IEEE Trans Computer-Aided Design</source>
				<pubdate>1988</pubdate>
				<volume>7</volume>
				<fpage>138</fpage>
				<lpage>148</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1109/43.3141</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>An automated test approach for US Air Force fighter engines.</p>
				</title>
				<aug>
					<au>
						<snm>Rea</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Settle</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>IEEE Aerospace Electron Syst Mag</source>
				<pubdate>1996</pubdate>
				<volume>11</volume>
				<fpage>24</fpage>
				<lpage>28</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1109/62.486733</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Functional genomic hypothesis generation and experimentation by a robot scientist.</p>
				</title>
				<aug>
					<au>
						<snm>King</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Whelan</snm>
						<fnm>KE</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>FM</fnm>
					</au>
					<au>
						<snm>Reiser</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Bryant</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Muggleton</snm>
						<fnm>SH</fnm>
					</au>
					<au>
						<snm>Kell</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Oliver</snm>
						<fnm>SG</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>427</volume>
				<fpage>247</fpage>
				<lpage>252</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02236</pubid>
						<pubid idtype="pmpid" link="fulltext">14724639</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Physical network models.</p>
				</title>
				<aug>
					<au>
						<snm>Yeang</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2004</pubdate>
				<volume>11</volume>
				<fpage>243</fpage>
				<lpage>262</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/1066527041410382</pubid>
						<pubid idtype="pmpid" link="fulltext">15285891</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Protein interactions: two methods for assessment of the reliability of high throughput observations.</p>
				</title>
				<aug>
					<au>
						<snm>Deane</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Salwinski</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Xenarios</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Eisenberg</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Cell Proteomics</source>
				<pubdate>2002</pubdate>
				<volume>1</volume>
				<fpage>349</fpage>
				<lpage>356</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/mcp.M100037-MCP200</pubid>
						<pubid idtype="pmpid" link="fulltext">12118076</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Functional discovery via a compendium of expression profiles.</p>
				</title>
				<aug>
					<au>
						<snm>Hughes</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Marton</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Stoughton</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Armour</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Bennett</snm>
						<fnm>HA</fnm>
					</au>
					<au>
						<snm>Coffey</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>YD</fnm>
					</au>
					<etal/>
				</aug>
				<source>Cell</source>
				<pubdate>2000</pubdate>
				<volume>102</volume>
				<fpage>109</fpage>
				<lpage>126</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(00)00015-5</pubid>
						<pubid idtype="pmpid">10929718</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Cytoscape: a software environment for integrated models of biomolecular interaction networks.</p>
				</title>
				<aug>
					<au>
						<snm>Shannon</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Markiel</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Ozier</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Baliga</snm>
						<fnm>NS</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>JT</fnm>
					</au>
					<au>
						<snm>Ramage</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Amin</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Schwikowski</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>2498</fpage>
				<lpage>2504</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">14597658</pubid>
						<pubid idtype="doi">10.1101/gr.1239303</pubid>
						<pubid idtype="pmcid">403769</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Cytoscape</p>
				</title>
				<url>http://www.cytoscape.org</url>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Cell Circuits Pathway Database</p>
				</title>
				<url>http://www.cellcircuits.org/Yeang2005</url>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Transcriptional profiling shows that Gcn4p is a master regulator of gene expression during amino acid starvation in yeast.</p>
				</title>
				<aug>
					<au>
						<snm>Natarajan</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Meyer</snm>
						<fnm>MR</fnm>
					</au>
					<au>
						<snm>Jackson</snm>
						<fnm>BM</fnm>
					</au>
					<au>
						<snm>Slade</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Hinnebusch</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Marton</snm>
						<fnm>MJ</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>2001</pubdate>
				<volume>21</volume>
				<fpage>4347</fpage>
				<lpage>4368</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">87095</pubid>
						<pubid idtype="pmpid" link="fulltext">11390663</pubid>
						<pubid idtype="doi">10.1128/MCB.21.13.4347-4368.2001</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Munich Information Center for Protein Sequences</p>
				</title>
				<url>http://mips.gsf.de</url>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Proteome BioKnowledge Library</p>
				</title>
				<url>http://www.proteome.com</url>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Redirection of the respiro-fermentative flux distribution in <it>Saccharomyces cerevisiae </it>by overexpression of the transcription factor Hap4p.</p>
				</title>
				<aug>
					<au>
						<snm>Blom</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>De Mattos</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Grivell</snm>
						<fnm>LA</fnm>
					</au>
				</aug>
				<source>Appl Environ Microbiol</source>
				<pubdate>2000</pubdate>
				<volume>66</volume>
				<fpage>1970</fpage>
				<lpage>1973</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">101441</pubid>
						<pubid idtype="pmpid" link="fulltext">10788368</pubid>
						<pubid idtype="doi">10.1128/AEM.66.5.1970-1973.2000</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Yeast PKA represses Msn2p/Msn4p-dependent gene expression to regulate growth, stress response and glycogen accumulation.</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Ward</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Garrett</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>EMBO J</source>
				<pubdate>1998</pubdate>
				<volume>17</volume>
				<fpage>3556</fpage>
				<lpage>3564</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/emboj/17.13.3556</pubid>
						<pubid idtype="pmpid" link="fulltext">9649426</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>SOK2 may regulate cyclic AMP-dependent protein kinase-stimulated growth and pseudohyphal development by repressing transcription.</p>
				</title>
				<aug>
					<au>
						<snm>Ward</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Gimeno</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Fink</snm>
						<fnm>GR</fnm>
					</au>
					<au>
						<snm>Garrett</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1995</pubdate>
				<volume>15</volume>
				<fpage>6854</fpage>
				<lpage>6863</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">230940</pubid>
						<pubid idtype="pmpid" link="fulltext">8524252</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Global mapping of the yeast genetic interaction network.</p>
				</title>
				<aug>
					<au>
						<snm>Tong</snm>
						<fnm>AH</fnm>
					</au>
					<au>
						<snm>Lesage</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bader</snm>
						<fnm>GD</fnm>
					</au>
					<au>
						<snm>Ding</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Xin</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Young</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Berriz</snm>
						<fnm>GF</fnm>
					</au>
					<au>
						<snm>Brost</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Chang</snm>
						<fnm>M</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>303</volume>
				<fpage>808</fpage>
				<lpage>813</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1091317</pubid>
						<pubid idtype="pmpid" link="fulltext">14764870</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<aug>
					<au>
						<snm>Popper</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>The Logic of Scientific Discovery</source>
				<publisher>New York: Basic Books</publisher>
				<pubdate>1959</pubdate>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Factor graphs and the sum-product algorithm.</p>
				</title>
				<aug>
					<au>
						<snm>Kschischang</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Frey</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Loeliger</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>IEEE Trans Inform Theory</source>
				<pubdate>2001</pubdate>
				<volume>47</volume>
				<fpage>498</fpage>
				<lpage>519</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1109/18.910572</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Functional characterization of the <it>S. cerevisiae </it>genome by gene deletion and parallel analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Winzeler</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Shoemaker</snm>
						<fnm>DD</fnm>
					</au>
					<au>
						<snm>Astromoff</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Liang</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Anderson</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Andre</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Bangham</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Benito</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Boeke</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Bussey</snm>
						<fnm>H</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>1999</pubdate>
				<volume>285</volume>
				<fpage>901</fpage>
				<lpage>906</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.285.5429.901</pubid>
						<pubid idtype="pmpid" link="fulltext">10436161</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>A new non-linear normalization method for reducing variability in DNA microarray experiments.</p>
				</title>
				<aug>
					<au>
						<snm>Workman</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jensen</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Jarmer</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Berka</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Gautier</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Nielser</snm>
						<fnm>HB</fnm>
					</au>
					<au>
						<snm>Saxild</snm>
						<fnm>HH</fnm>
					</au>
					<au>
						<snm>Nielsen</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Brunak</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Knudsen</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>research 0048.1</fpage>
				<lpage>0048.16</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1186/gb-2002-3-9-research0048</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>ArrayExpress Gene Expression Database</p>
				</title>
				<url>http://www.ebi.ac.uk/arrayexpress</url>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Testing for differentially-expressed genes by maximum likelihood analysis of microarray data.</p>
				</title>
				<aug>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Siegel</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hood</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2000</pubdate>
				<volume>7</volume>
				<fpage>805</fpage>
				<lpage>817</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/10665270050514945</pubid>
						<pubid idtype="pmpid" link="fulltext">11382363</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Discovering regulatory and signalling circuits in molecular interaction networks.</p>
				</title>
				<aug>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Ozier</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Schwikowski</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Siegel</snm>
						<fnm>AF</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<issue>Suppl 1</issue>
				<fpage>S233</fpage>
				<lpage>S240</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12169552</pubid>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
