<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>gb-2010-11-12-r123</ui><ji>GBJ</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>Predictive network modeling of the high-resolution dynamic plant transcriptome in response to nitrate</p>
</title>
<aug>
<au id="A1"><snm>Krouk</snm><fnm>Gabriel</fnm><insr iid="I1"/><insr iid="I2"/><email>gk40@nyu.edu</email></au>
<au id="A2"><snm>Mirowski</snm><fnm>Piotr</fnm><insr iid="I3"/><email>mirowski@cs.nyu.edu</email></au>
<au id="A3"><snm>LeCun</snm><fnm>Yann</fnm><insr iid="I3"/><email>yann@cs.nyu.edu</email></au>
<au id="A4"><snm>Shasha</snm><mi>E</mi><fnm>Dennis</fnm><insr iid="I3"/><email>shasha@courant.nyu.edu</email></au>
<au ca="yes" id="A5"><snm>Coruzzi</snm><mi>M</mi><fnm>Gloria</fnm><insr iid="I1"/><email>gloria.coruzzi@nyu.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Center for Genomics and Systems Biology, Department of Biology, New York University, 100 Washington Square East, 1009 Main Building, New York, NY 10003, USA</p></ins>
<ins id="I2"><p>Biochimie et Physiologie Mol&#233;culaire des Plantes, UMR 5004 CNRS/INRA/SupAgro-M/UM2, Institut de Biologie Int&#233;grative des Plantes, Place Viala, 34060 Montpellier Cedex, France</p></ins>
<ins id="I3"><p>Courant Institute of Mathematical Sciences, New York University, New York, NY 10003, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2010</pubdate>
<volume>11</volume>
<issue>12</issue>
<fpage>R123</fpage>
<url>http://genomebiology.com/content/11/12/R123</url>
<xrefbib><pubidlist><pubid idtype="pmpid">21182762</pubid><pubid idtype="doi">10.1186/gb-2010-11-12-r123</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>22</day><month>5</month><year>2010</year></date></rec><revrec><date><day>14</day><month>9</month><year>2010</year></date></revrec><acc><date><day>23</day><month>12</month><year>2010</year></date></acc><pub><date><day>23</day><month>12</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Krouk et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Nitrate, acting as both a nitrogen source and a signaling molecule, controls many aspects of plant development. However, gene networks involved in plant adaptation to fluctuating nitrate environments have not yet been identified.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Here we use time-series transcriptome data to decipher gene relationships and consequently to build core regulatory networks involved in <it>Arabidopsis </it>root adaptation to nitrate provision. The experimental approach has been to monitor genome-wide responses to nitrate at 3, 6, 9, 12, 15 and 20 minutes using Affymetrix ATH1 gene chips. This high-resolution time course analysis demonstrated that the previously known primary nitrate response is actually preceded by a very fast gene expression modulation, involving genes and functions needed to prepare plants to use or reduce nitrate. A state-space model inferred from this microarray time-series data successfully predicts gene behavior in unlearnt conditions.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The experiments and methods allow us to propose a temporal working model for nitrate-driven gene networks. This network model is tested both <it>in silico </it>and experimentally. For example, the over-expression of a predicted gene hub encoding a transcription factor induced early in the cascade indeed leads to the modification of the kinetic nitrate response of sentinel genes such as <it>NIR</it>, <it>NIA2</it>, and <it>NRT1.1</it>, and several other transcription factors. The potential nitrate/hormone connections implicated by this time-series data are also evaluated.</p>
</sec>
</sec>
</abs>
</fm>
<meta>
<classifications>
<classification id="30010005" subtype="man_spc_id" type="BMC">Development</classification>
<classification id="30010010" subtype="man_spc_id" type="BMC">Genome studies</classification>
<classification id="30010015" subtype="man_spc_id" type="BMC">Model organisms</classification>
<classification id="30010019" subtype="man_spc_id" type="BMC">Plant biology</classification>
<classification id="30010020" subtype="man_spc_id" type="BMC">Virology</classification>
</classifications>
</meta>
<bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Higher plants, which constitute a main entry of nitrogen in to the food chain, acquire nitrogen mainly as nitrate (NO<sub>3</sub>
<sup>-</sup>). Soil concentrations of this mineral ion can fluctuate dramatically in the rhizosphere, often resulting in limited growth and yield <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. Thus, understanding plant adaptation to fluctuating nitrogen levels in the soil is a challenging task with potential consequences for health, the environment, and economies <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
</abbrgrp>.</p>
<p>The first genomic studies on NO<sub>3</sub>
<sup>- </sup>responses in plants were published 10 years ago <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>. To date, data monitoring gene expression in response to NO<sub>3</sub>
<sup>- </sup>provision from more than 100 Affymetrix ATH1 chips have been published <abbrgrp>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
</abbrgrp>. Meta-analysis of microarray data sets from several different labs demonstrated that at least a tenth of the genome can potentially be regulated by nitrogen provision, depending on the context <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B9">9</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
</abbrgrp>. Despite these extensive efforts of characterization, only a limited number of molecular actors that alter NO<sub>3</sub>
<sup>-</sup>-induced gene regulation have been identified so far. The first molecular actor identified is NRT1.1, a dual affinity NO<sub>3</sub>
<sup>- </sup>transporter that has recently been proposed to also participate in a NO<sub>3</sub>
<sup>-</sup>-sensing system by several studies from different laboratories. A mutation in the <it>NRT1.1 </it>gene has been shown to alter plant responses to NO<sub>3</sub>
<sup>- </sup>provision by changing lateral root development in NO<sub>3</sub>
<sup>-</sup>-rich patches of soil <abbrgrp>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp> and to affect control of gene expression <abbrgrp>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp>. Additionally, mutations in the genes <it>CIPK8 </it>and <it>CIPK23</it>, encoding kinases, the NIN-like protein gene <it>NLP7</it>, and the <it>LBD37</it>/<it>38</it>/<it>39 </it>genes have been shown to alter induction of downstream genes by NO<sub>3</sub>
<sup>- </sup>
<abbrgrp>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>. Other regulatory proteins have been shown to control plant development in response to NO<sub>3</sub>
<sup>- </sup>provision (such as ANR1 for lateral root development), but no evidence has so far demonstrated their role in the control of gene expression in response to NO<sub>3</sub>
<sup>- </sup>provision <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>. Importantly, the downstream networks of genes affected by such regulatory proteins have not been identified.</p>
<p>In this study, our aim is to provide a systems-wide view of NO<sub>3</sub>
<sup>- </sup>signal propagation through dynamic regulatory gene networks. To do so, we generated a high-resolution dynamic NO<sub>3</sub>
<sup>- </sup>transcriptome from plants treated with nitrate from 0 to 20 minutes, and modeled the resulting sequence using a dynamical model. Instead of learning the dynamics directly from the gene expression sequence, we took into account uncertainty and acquisition errors, and used a state-space model (SSM). The latter defined the observed gene expression time series (denoted as <b>y</b>(<it>t</it>)) as being generated by a hidden 'true' sequence of gene expressions <b>z</b>(<it>t</it>). This approach enabled us to both incorporate uncertainty about the measured mRNA and model the gene regulation network by simple linear dynamics on the hidden variables <b>x</b>(<it>t</it>) (so-called 'states'), thus reducing the number of (unknown) free parameters and the associated risk of over-fitting the observed data. We used a specific machine learning algorithm known as 'dynamical factor graphs' <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> with an additional sparsity constraint on the gene regulation network. Interestingly, the coherence of the generated regulatory model is good enough that it is able to predict the direction of gene change (up-regulation or down-regulation) on future data points. This coherence allows us to propose a gene influence network involving transcription factors and 'sentinel genes' involved in the primary NO<sub>3</sub>
<sup>- </sup>response (such as NO<sub>3</sub>
<sup>- </sup>transporters or NO<sub>3</sub>
<sup>- </sup>assimilation genes). The role of a predicted hub in this network is evaluated by over-expressing it, and indeed leads to changes in the NO<sub>3</sub>
<sup>-</sup>-driven gene expression of sentinel genes. The initial gene response to NO<sub>3</sub>
<sup>- </sup>is also analyzed and discussed for its insights into molecular physiology.</p>
</sec>
<sec>
<st>
<p>Results and discussion</p>
</st>
<sec>
<st>
<p>Molecular physiology: assessing molecular reprogramming preceding the 'primary' nitrate response</p>
</st>
<p>To investigate genomic responses that precede the response of sentinel 'primary NO<sub>3</sub>
<sup>- </sup>response' genes (<it>NIR</it>, <it>NRT2.1</it>, <it>NIA1</it>, <it>NIR1</it>) to nitrate application, we first generated several time-series experiments (data not shown). These allowed us to identify the earliest time at which we were able to detect unambiguous NO<sub>3</sub>
<sup>- </sup>induction of these sentinel response genes using real time quantitative PCR (RT-QPCR). Figure <figr fid="F1">1a</figr> shows the expression of selected sentinel genes over time (0, 3, 6, 9, 12, 15, 20, 25, 35, 45, 60 minutes) in response to treatment with 1 mM KNO<sub>3 </sub>or controls of 1 mM KCl. These results (Figure <figr fid="F1">1a</figr>) demonstrate that a sentinel gene such as <it>NRT1.1 </it>is induced at 20 minutes (compared to KCl controls, and in comparison to gene expression at time 0 minutes). The timing of induction of other sentinel genes involved in the 'primary NO<sub>3</sub>
<sup>- </sup>response' are <it>NIR1 </it>at 12 minutes and <it>NRT2.1 </it>and <it>NIA1 </it>at 15 minutes. Following these preliminary experiments, we next ran Affymetrix ATH1 chips on biological replicates corresponding to the beginning of sentinel gene induction and their preceding time points (0, 3, 6, 9, 12, 15, 20 minutes). Note that we kept the 20-minute time point as a reference, since it was the earliest time point that had previously been studied <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>High-resolution kinetics of transcriptome responses to NO<sub>3</sub><sup>- </sup>treatment</p></caption><text>
   <p><b>High-resolution kinetics of transcriptome responses to NO</b><sub><b>3</b></sub><sup>- </sup><b>treatment</b>. <b>(a) </b>Levels of mRNA for nitrogen-responsive sentinel genes in <it>Arabidopsis </it>roots in response to NO<sub>3</sub><sup>- </sup>treatment. Fourteen-day-old plants grown in the presence of ammonium succinate were treated with 1 mM KNO<sub>3 </sub>or KCL (as a mock treatment). Plants were collected at 0 minutes (before treatment) and 3, 6, 9, 12, 15, 20, 25, 35, 45, and 60 minutes after treatment. Sentinel transcripts were measured in RNA from roots using RT-QPCR and normalized to two housekeeping genes (see Materials and methods). The insets show the Affymetrix MAS5 normalized signal for the sentinel genes on the 0- to 20-minute samples. The data represent the mean &#177; standard error of three and two biological replicates for QPCR and Affymetrix measurements, respectively. <b>(b) </b>Percentage of genes not detected as NO<sub>3</sub><sup>- </sup>regulated in Wang <it>et al</it>. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. <b>(c) </b>Overall behavior (relative expression) of 550 regulated genes (Log base 2(Signal KNO<sub>3</sub>/Signal KCl)) between 0 and 20 minutes. These data correspond to ATH1 measurement of the samples collected for the RT-PCR presented in (a) (grey shades; see also Materials and methods for further details).</p>
</text><graphic file="gb-2010-11-12-r123-1" hint_layout="double"/></fig>
<p>The resulting nitrate-responsive transcriptome kinetic dataset corresponded to 26 ATH1 chips with 22,810 probes each. A sequential analysis involving linear modeling (detailed in Materials and methods) was carried out to identify genes regulated at each particular time point with highly stringent criteria (including control of the false discovery rate (FDR)). We detected 83, 192, 55, 149, 190, and 229 genes significantly regulated by nitrate treatment at the 3, 6, 9, 12, 15 and 20 minute time points, respectively (Additional file <supplr sid="S1">1</supplr>). The union of these gene lists corresponds to 550 distinct nitrate-responsive genes. We demonstrate that a large majority of the newly identified NO<sub>3</sub>
<sup>-</sup>-regulated genes are controlled at the earliest time points (3 and 6 minutes), which have never before been assayed (Figure <figr fid="F1">1b</figr>). In order to support these new findings, 15 genes have been validated by QPCR (Additional file <supplr sid="S2">2</supplr>) on three replicates (two were used for the microarray chips and one for QPCR only). The predicted behaviors of these genes were validated by the QPCR approach, as follows. One set of genes is shown to have a transient response to NO<sub>3</sub>
<sup>- </sup>(for example, <it>At1g55120</it>, <it>At3g50750</it>, <it>At1g64370</it>, <it>At4g16780</it>, <it>At1g27900</it>, <it>At1g22640</it>, <it>At1g52060</it>, and <it>At2g42200</it>). While a second gene set is validated to be very early responsive genes (for example, <it>At1g13300</it>, <it>At1g49000</it>, <it>At4g31910</it>, <it>At5g15830</it>, <it>At2g27830</it>, <it>At3g25790</it>, and <it>At5g65210</it>). Quantitatively, the correlation between the NO<sub>3</sub>
<sup>- </sup>induction (KNO<sub>3</sub>/KCl ratio) detected by both approaches (ATH1 chip and QPCR) is R<sup>2 </sup>&gt; 0.5 for 8 genes, 0.5 &gt; R<sup>2 </sup>&gt; 0.4 for 3 genes, R<sup>2 </sup>&lt; 0.4 for 4 genes. It is noteworthy that for the genes having a low correlation, their overall behavior is validated by QPCR (for example, constant versus transient induction by NO<sub>3</sub>
<sup>-</sup>; Figure <figr fid="F2">2b</figr>; Additional file <supplr sid="S2">2</supplr>).</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Description of significantly regulated genes and their cluster assignment</b>.</p>
</text>
<file name="gb-2010-11-12-r123-S1.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>High-resolution kinetics of response of genes to NO</b>
<sub>
<b>3</b>
</sub>
<sup>- </sup>
<b>treatment</b>. Levels of mRNA for nitrogen-responsive genes in <it>Arabidopsis </it>roots in response to NO<sub>3</sub>
<sup>- </sup>treatment. Fourteen-day-old plants grown in the presence of ammonium succinate were treated with 1 mM KNO<sub>3 </sub>or KCL (as a mock treatment). Plants were collected at 0 minutes (before treatment) and 3, 6, 9, 12, 15, and 20 minutes after treatment. Transcripts were measured in RNA from roots using RT-QPCR and Affy ATH1 chips and normalized to two housekeeping genes (see Materials and methods). The data represent the mean &#177; standard error of three and two biological replicates for QPCR and Affymetrix measurements, respectively. Correlations between QPCR- and Affy-detected fold change results are provided in the third column.</p>
</text>
<file name="gb-2010-11-12-r123-S2.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Clustering analysis and QPCR reveals different patterns of expression in response to short-term NO<sub>3</sub><sup>- </sup>treatment</p></caption><text>
   <p><b>Clustering analysis and QPCR reveals different patterns of expression in response to short-term NO</b><sub><b>3</b></sub><sup>- </sup><b>treatment</b>. <b>(a) </b>Cluster analysis of the relative expression of 550 regulated genes (Log base 2(Signal KNO<sub>3</sub>/Signal KCl)) between 0 and 20 minutes. These data correspond to ATH1 measurement of the samples collected for the RT-PCR shown in Figure 1 (see Materials and methods for further details). For clusters including genes with a significant over-representation of biological functions see Additional file <supplr sid="S4">4</supplr>. <b>(b) </b>Examples of three different gene behaviors (transitory, early, late responses) after NO<sub>3</sub><sup>- </sup>provision.</p>
</text><graphic file="gb-2010-11-12-r123-2" hint_layout="double"/></fig>
<p>To probe the biological significance of these kinetic patterns of nitrate regulation of gene expression, we determined the functional categories that are over-represented in the lists of nitrate-regulated genes at each time point, separating the induced and repressed gene lists (Additional file <supplr sid="S2">2</supplr>). Interestingly, the biological functions induced earliest after nitrate addition do not concern nitrogen directly. Instead, within 3 minutes, the very first statistically significant over-represented functional category is ribosomal proteins (<it>P</it>-value 6.58e<sup>-6</sup>). This finding generates the hypothesis that nitrogen could trigger a transient and very rapid reprogramming of key elements of the translation machinery needed to synthesize new proteins required for nitrogen acquisition. This idea might be further supported by the fact that many more genes are induced by the addition of nitrate than are repressed (see below). Moreover, later on in the time-course (as early as 9 minutes), the next biological function to be significantly induced is the oxidative pentose-phosphate-pathway, a function that is known to be a critical step providing reductants needed to assimilate NO<sub>3</sub>
<sup>- </sup>
<abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. The oxidative pentose-phosphate-pathway has also been shown to generate a signal controlling key effectors of the NO<sub>3</sub>
<sup>- </sup>response, such as <it>NRT2.1</it>, <it>NRT2.4</it>, <it>NRT1.1</it>, <it>NRT1.5</it>, and <it>AMT1.3 </it>
<abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>. Taken together, these observations suggest that the early nitrate response involves mechanisms needed to prepare the plant to respond to nitrate rather than mechanisms that relate directly to nitrogen. Such mechanisms - for example, nitrate transport and amino acid metabolism - are regulated later on in the time series (Additional file <supplr sid="S3">3</supplr>).</p>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>Gene Ontology functions over-represented in NO</b>
<sub>
<b>3</b>
</sub>
<sup>-</sup>
<b>-regulated gene lists</b>.</p>
</text>
<file name="gb-2010-11-12-r123-S3.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<p>To begin to decipher the pattern of nitrate-regulated gene expression over the entire time series, we first clustered the gene expression ratio (Log<sub>2</sub>(Signal KNO<sub>3</sub>/Signal KCl) of the 550 significantly regulated genes) in order to gain insight into the genomic reprogramming during the first 20 minutes of KNO<sub>3 </sub>treatment (Figure <figr fid="F1">1c</figr>). The vast majority of the reprogramming is an induction of gene expression by NO<sub>3</sub>
<sup>-</sup>, rather than a repression. To quantify this observation, the numbers of genes that are detected as significantly induced by NO<sub>3</sub>
<sup>- </sup>at 3, 6, 9, 12, 15, and 20 minutes are 63 (76% of regulated genes), 146 (76% of regulated genes), 54 (98% of regulated genes), 123 (82% of regulated genes), 164 (87% of regulated genes), and 209 (92% of regulated genes), respectively. One interpretation is that NO<sub>3</sub>
<sup>- </sup>induces an adaptation program that is on 'stand-by' in NO<sub>3</sub>
<sup>-</sup>-free conditions, rather than a shut-down of a putative 'N-free-condition' program. Clustering analysis also allowed us to sort gene responses according to their overall behavior. This analysis demonstrated that rapid gene expression responses to nitrate could be classified into up to 20 clusters (according to figure of merit (FOM) analysis; see Materials and methods; Figure <figr fid="F2">2</figr>). Considering each cluster independently, we were able to identify over-represented biological functions for eight clusters, including chloroplast, the oxidative pentose-phosphate-pathway, and ribosomal proteins (Figure <figr fid="F2">2</figr>; see Additional file <supplr sid="S4">4</supplr> for details).</p>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>Gene Ontology functions over-represented in NO</b>
<sub>
<b>3</b>
</sub>
<sup>- </sup>
<b>clusters</b>.</p>
</text>
<file name="gb-2010-11-12-r123-S4.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<p>Moreover, we identified and analyzed 146 genes that were consistently induced over the 20 minutes of nitrate treatment (corresponding to clusters 1, 9, 11, 13, and 14). This group of consistently nitrate-induced genes includes over-represented biological functions such as oxidoreduction coenzyme process (<it>P</it>-value = 0.00027), nicotinamide metabolic process (<it>P</it>-value = 6.50e-05), regulation of transcription (<it>P</it>-value = 0.00167), pentose phosphate shunt (<it>P</it>-value = 0.00073). We also identified 219 genes showing responses to nitrate that seem to represent a general pattern of transient regulation (clusters 2, 3, 4, 6, 7, 8, 10, 12, 16, 17, and 18). Interestingly, the oxygen and redox state of the cell seems to be a general function that is transiently adapted by KNO<sub>3 </sub>treatment. Indeed, Munich Information Center for Protein Sequence (MIPS) functions such as oxygen radical detoxification (<it>P</it>-value = 0.00018), peroxidase reaction (<it>P</it>-value = 0.01479), and superoxide metabolism (<it>P</it>-value = 0.02472) are over-represented gene ontology terms in this group. This observation might indicate the effect of NO<sub>3</sub>
<sup>- </sup>on the redox state of the cell. Finally, we show that 124 genes are repressed by NO<sub>3</sub>
<sup>- </sup>treatment, transiently or otherwise (corresponding to clusters 5, 19, and 20). The common function overrepresented in this group is transcription (<it>P</it>-value = 0.00312). This could result from the extinction of the pre-existing transcriptome program preceding the NO<sub>3</sub>
<sup>- </sup>treatment. Since the plants had been nitrogen starved for 24 hours before NO<sub>3</sub>
<sup>- </sup>treatment, this might correspond to genes that are up-regulated by the pre-treatment (nitrogen starvation) and down-regulated by NO<sub>3</sub>
<sup>- </sup>provision. To statistically test this hypothesis, we set up a randomization test (see Materials and methods) to quantify whether the genes that are down-regulated in our conditions correspond to genes that were up-regulated by nitrogen starvation in Peng <it>et al</it>. <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>; this occurred with a <it>P</it>-value of 0.0089. Conversely, no significant overlap was detected for clusters induced bi NO<sub>3</sub>
<sup>- </sup>(clusters 1, 2, 4, 9, 10, 11, 13, 14). This finding validates the idea that NO<sub>3</sub>
<sup>-</sup>-down-regulated clusters correspond to genes involved in the response of plants to the pre-treatment conditions. In summary, a large part of the NO<sub>3</sub>
<sup>- </sup>gene expression reprogramming has been missed by previous genomic studies. The time-varying expression modulation newly identified here involves physiological functions that could be components of the nitrate signaling system itself.</p>
<p>In order to further document the potential of this dynamic transcriptome response to mediate cross-talk between nitrate signaling and other well-studied signaling pathways in plants, we evaluated if the gene sets regulated by NO<sub>3</sub>
<sup>- </sup>at the different time points in our analysis overlap more than expected by chance with genes regulated by hormones using data generated by the Chory lab <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. To do this, we compared the nitrate-regulated gene lists (over six time points) with the lists of hormone-regulated genes <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp> and generated a matrix that assembled the randomization test <it>P</it>-values (see Materials and methods) between each pair of gene lists. The lists included genes regulated by NO<sub>3</sub>
<sup>- </sup>across each of the six time points (our study), and lists of genes regulated by seven different hormones by the Chory lab (abscisic acid, cytokinins, auxin (IAA), methyl jasmonate, brassinolides, gibberellic acid, ethylene)] <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. These results (Figure <figr fid="F3">3</figr>) lead to three main conclusions supporting the existence of gene modules responding to nitrate and hormone signaling.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Identification of NO<sub>3</sub><sup>- </sup>response and hormonal cross-talk modules</p></caption><text>
   <p><b>Identification of NO</b><sub><b>3</b></sub><sup>- </sup><b>response and hormonal cross-talk modules</b>. <b>(a) </b>For each pair of gene lists (NO<sub>3</sub><sup>- </sup>responsive (this work) or hormone responsive <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>), a <it>P</it>-value (randomization test; see Materials and methods) was computed and is shown in the table below the blue diagonal. Entries in the blue diagonal give the gene list size in number of genes. Above the diagonal the size of the intersection of each pair of studied gene lists is given. Note that <it>P</it>-value = 0 means <it>P</it>-value &lt; 0.001. Analysis of the <it>P</it>-values included within the yellow outline led to the building of gene modules depicted in the conceptual model provided in <b>(b)</b>. ABA, absisic acid; ACC, 1-aminocyclopropane-1-carboxylic-acid (ethylene precursor); BL, brassinolides; CK, cytokinins; GA, gibberellic acid; IAA, indole acetic acid (auxin); MJ, methyl jasmonate.</p>
</text><graphic file="gb-2010-11-12-r123-3" hint_layout="double"/></fig>
<p>First, we considered only the overlap between the NO<sub>3</sub>
<sup>-</sup>-responsive gene lists at different time points. We found evidence for two linked 'modules' of nitrate-regulated gene expression (modules 1 and 2 in Figure <figr fid="F3">3b</figr>). The first nitrate-regulated module consists of the nitrate-regulated genes in the union of the 3- and 6-minute gene lists. The overlap between these two lists is far beyond what we would expect by chance (<it>P</it>-value &lt; 0.001). However, the 3-minute gene list overlaps very little with the rest of the nitrate-regulated genes in the time-course study. As such, the second nitrate-regulated module is made up of the union of the 6-, 9-, 12-, 15-, and 20-minute gene lists (these gene lists overlap significantly more than random). The 6-minute gene list acts as the link between the very early nitrate-response genes (before 6 minutes) and the more delayed ones (after 6 minutes).</p>
<p>Second, the overlap of the nitrate-regulated genes with the hormone-regulated genes (modules 3 and 4 in Figure <figr fid="F3">3b</figr>) is significantly higher than expected at the 9-minute nitrate time point for abscisic acid-, indole acetic acid-, brassinolide- and methyl jasmonate-regulated genes, while the 12-minute nitrate time point overlaps significantly with cytokinin-regulated genes. This suggests that the interaction of nitrate signaling with other hormone signals is likely to involve the genes regulated by nitrate after 9 minutes. This leads to the hypothesis that, from 0 to 6 minutes, the genomic reprogramming concerns a pure NO<sub>3</sub>
<sup>- </sup>signaling pathway, and thereafter (for example, 9 minutes after nitrate treatment) interactions with developmental signals such as hormones occur (Figure <figr fid="F3">3</figr>). This enables us to derive the hypothesis that the early nitrate controllers (for example, transcription factors, kinases, and so on) regulated at 3, 6, and 9 minutes are involved in the control of the nitrate signaling itself, rather than in the interaction between NO<sub>3</sub>
<sup>- </sup>and other signals such as hormones.</p>
<p>Third, this analysis shows that the different hormonal treatments control largely overlapping gene modules, as has been described previously <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>.</p>
<p>In conclusion, connections between NO<sub>3</sub>
<sup>- </sup>and hormone-related signaling are common features of plant molecular networks at several layers of integration (for a review, see <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>). For instance, transcriptional connections have been identified where genes involved in a NO<sub>3</sub>
<sup>-</sup>-responsive 'biomodule' have been shown to be more responsive to NO<sub>3</sub>
<sup>- </sup>if they are also strongly regulated by hormones <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. More recently, we provided a mechanistic hypothesis to explain the role of NRT1.1 as a NO<sub>3</sub>
<sup>- </sup>sensor controlling lateral root development. Indeed, NRT1.1 is a transceptor able to transport both auxin and nitrate. The sensing mechanism results from the ability of nitrate to inhibit auxin transport by NRT1.1, leading to low lateral root development at low nitrate concentrations <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>. To determine whether this mechanism is also involved in the transcriptional induction studied in the present work will require further investigation. However, the fact that hormones can be involved at the beginning of NO<sub>3</sub>
<sup>- </sup>sensing mechanisms <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp> and downstream of NO<sub>3</sub>
<sup>- </sup>transcriptional activation (this analysis) is an intriguing observation that deserves further investigation to understand what is the purpose of such signal entanglement.</p>
</sec>
<sec>
<st>
<p>Machine learning approach: modeling of regulatory gene influences through predictive models</p>
</st>
<sec>
<st>
<p>Dynamical predictive modeling of regulatory gene networks</p>
</st>
<p>Time-series datasets of gene expression levels, as measured by microarrays, can provide us with a detailed picture of the behavior of the genetic network over time, but they contain this information in a highly noisy form requiring reverse engineering <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. An additional challenge of systems biology is to be able to model systems precisely enough that they can predict untested conditions, especially given the paucity of data relative to the number of possible connections.</p>
<p>Among the several approaches to this modeling problem, dynamical models have gained prominence as they simultaneously encode the topology of the gene interaction graph and its functional evolution model. Such a model can in turn be used for predictive modeling of gene expression at later time points or upon perturbation. Such dynamical models essentially consist of a mathematical function that governs the transitions of the state of a gene regulatory network over time. Typically, dynamical models of mRNA concentrations consist of ordinary differential equations (ODEs) <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. For a given gene <it>i</it>, ODEs can, for instance, define the rate of change of mRNA concentration <it>y</it>
<sub>
<it>i</it>
</sub>(<it>t</it>) (with a kinetic constant <it>&#964;</it>), as a function <it>g</it>
<sub>
<it>i </it>
</sub>of the influences of transcription factors (which we assume in this article to consist of the vectors <b>y</b>(<it>t</it>) of all observed mRNA measures, because protein levels are unavailable to us), with an optional mRNA's degradation term, as in the equation below:</p>
<p>
<display-formula>
<m:math name="gb-2010-11-12-r123-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi>&#964;</m:mi>
   <m:mfrac>
      <m:mrow>
         <m:mtext>d</m:mtext>
         <m:msub>
            <m:mi>y</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mtext>d</m:mtext>
         <m:mi>t</m:mi>
      </m:mrow>
   </m:mfrac>
   <m:mo>=</m:mo>
   <m:msub>
      <m:mi>g</m:mi>
      <m:mi>i</m:mi>
   </m:msub>
   <m:mo stretchy="false">(</m:mo>
   <m:mstyle mathvariant="bold">
      <m:mtext>y</m:mtext>
   </m:mstyle>
   <m:mo stretchy="false">(</m:mo>
   <m:mi>t</m:mi>
   <m:mo stretchy="false">)</m:mo>
   <m:mo stretchy="false">)</m:mo>
   <m:mo>&#8722;</m:mo>
   <m:msub>
      <m:mi>y</m:mi>
      <m:mi>i</m:mi>
   </m:msub>
   <m:mo stretchy="false">(</m:mo>
   <m:mi>t</m:mi>
   <m:mo stretchy="false">)</m:mo>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>In our study, we have considered dynamics with the mRNA degradation term (the so-called 'kinetic' model <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>) and without it (the so-called 'Brownian motion' model <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>). Assuming degradation (kinetic ODE) worked better.</p>
<p>Since microarray data are discretely sampled over time, the above equation is linearized; hence, it explains how gene expressions at time <it>t </it>influence gene expressions at time <it>t </it>+ 1.</p>
<p>In our study, the sequence of microarrays contained seven full-genome mRNA measures (with two replicates) at 0, 3, 6, 9, 12, 15 and 20 minutes; in the cross-validation leave-out-last study, we used measures between 0 and 15 minutes to fit the model for each gene <it>i </it>(by tuning the parameters of associated dynamical functions), and tested the fitted model on the last time point (prediction of the mRNA level at 20 minutes).</p>
</sec>
<sec>
<st>
<p>Choosing the model</p>
</st>
<p>In a review article, Jaeger and Monk <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> pointed out that the inference of biological networks in the presence of few time-point measurements, many genes, measurement errors and random fluctuations in the environment is inherently difficult. Because of this limitation, methods for computational inference of gene regulation networks can be crudely divided into two approaches: non-linear or state-space based modeling of the complex interactions between a restricted number of genes (typically ten) with hidden protein transcription factors; or simpler, but linear, models of transcription factor-gene interactions <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
<abbr bid="B35">35</abbr>
</abbrgrp>, relying on larger (hundreds to thousands) numbers of microarray measurements.</p>
<p>State-space models (SSM) are a general category of machine learning algorithms that model the dynamics of a sequence of data by encoding the joint likelihood of observed and hidden variables. A popular probabilistic example of SSMs that have been applied to gene expression data are dynamical bayesian networks <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>, such as linear dynamical systems <abbrgrp>
<abbr bid="B37">37</abbr>
<abbr bid="B38">38</abbr>
</abbrgrp>. SSMs assume an observed sequence <b>y</b>(<it>t</it>) (in our case, gene expression data) to be generated from an underlying unknown sequence <b>z</b>(<it>t</it>), also called 'hidden states'. Consecutive hidden states form a Markov chain {<b>z</b>(0), <b>z</b>(1), ..., <b>z</b>(T-2), <b>z</b>(<it>T</it>-1)} (in our case, the sequence contains seven states at 0, 3, 6, 9, 12, 15 and 20 minutes); each transition in the chain corresponds to the same stationary (that is, time invariant) dynamical model <it>f</it>.</p>
<p>As a first example of complex SSMs, Zhang <it>et al</it>. used gaussian processes dynamical models with nonlinear dynamics to infer the profile of a single transcription factor (the tumor suppressor p53) and explained the activity of a large collection of genes using that transcription factor only (without any other transcription factor-gene interaction) <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. Another example is the linear dynamical system, which Beal <it>et al</it>. <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp> as well as Angus <it>et al</it>. <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp> used to infer the profiles of 14 hidden transcription factors for 10 observed genes only, either without predictive cross-validation <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>, or on synthetically generated data <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>.</p>
<p>Examples of first-order linear dynamical models for gene expression include the Inferelator by Bonneau <it>et al</it>. <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>. The Inferelator consists of a kinetic ODE that follows the Wahde and Hertz equation <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp> and where transcription factors contribute linearly. This ODE also includes an mRNA degradation term. Some instances of the Inferelator introduce nonlinear AND, OR and XOR relationships between pairs of genes, based on a previous bi-clustering of genes. One has to note that the Inferelator has been mostly applied to datasets with hundreds of data-points (for example, <it>Halobacterium</it>).</p>
<p>Other examples include the first-order vector autoregressive model VAR(1) <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp> and the 'Brownian motion' model (which is a VAR(1) model of changes in mRNA concentration) <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. Lozano <it>et al</it>. <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> suggested using a dynamic dependency on the past 2, 3, or 4 time points, but this was impractical in our case given the relatively small number of microarray measurements in our experiments.</p>
<p>Two microarray replicates were acquired in this study. Since each replicate is independent of all microarrays preceding and following in time, there were four possible transitions between any two time points <it>t </it>and <it>t </it>+ 1, and we therefore used four replicate sequences to train the machine learning algorithm.</p>
</sec>
<sec>
<st>
<p>A noise reduction approach to state-space modeling of regulatory gene networks</p>
</st>
<p>In a departure from previous SSM frameworks, our noise-reduction approach uses the hidden variables to represent an idealized, 'true' sequence of gene expressions <b>z</b>(<it>t</it>) that would be measured if there were no noise. The set of all genes at time <it>t </it>is modeled by a 'latent' (that is, hidden but correct) variable (denoted <b>z</b>(<it>t</it>)), about which noisy observations <b>y</b>(<it>t</it>) are made.</p>
<p>Specifically, we a) model the dynamics on hidden states <b>z</b>(<it>t</it>) instead of modeling them directly on the Affymetrix data <b>y</b>(<it>t</it>), as well as b) have the hidden sequence <b>z</b>(<it>t</it>) generate the actual observed sequence <b>y</b>(<it>t</it>) of mRNA, while incorporating measurement uncertainty. Such an approach has been used in robotics to cope with errors coming from sensors. Our proposed SSM is depicted in Figure <figr fid="F4">4a</figr>, where each node <b>y</b>(<it>t</it>) or <b>z</b>(<it>t</it>) represents a vector of all gene expressions at a particular time point, and where latent variables are represented by large red circles, and observed variables by large black circles.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>State space modeling predicts transcription factor influence</p></caption><text>
   <p><b>State space modeling predicts transcription factor influence</b>. <b>(a) </b>Conceptual scheme of the state space modeling. An unknown function f (red square) relates the values of latent variables Z(t) and Z(t + 1) (for all t) corresponding to consecutive time measurements. Learning algorithms iteratively optimize the function f mapping latent values of transcription factors to changes to target genes (and transcription factors themselves at time t + 1). <b>(b) </b>The whole dataset (from 0 to 20 minutes of KNO<sub>3 </sub>treatment) has been learnt by state space modeling (validated to be predictive in a leave-one-last approach; Table 2). The resulting <it>f </it>function has learnt possible connections and can be displayed as an influence matrix. SPL9 is a transcription factor predicted to be a potential bottleneck and is further experimentally studied.</p>
</text><graphic file="gb-2010-11-12-r123-4" hint_layout="double"/></fig>
<p>Our goal is to learn the function <it>f </it>that determines the change in expression of a target gene <it>z</it>
<sub>
<it>j</it>
</sub>, as a linear combination of the expression of a relatively small number of transcription factors, and that relates the values of latent variables <b>z</b>(<it>t</it>) and <b>z</b>(<it>t </it>+ 1) corresponding to consecutive time measurements (function <it>f </it>is represented by a red square in Figure <figr fid="F4">4a</figr>). The relationship between latent and observed variables is assumed to be the identity function <it>h </it>with added Gaussian noise (represented by a black square in Figure <figr fid="F4">4a</figr>).</p>
<p>The function <it>f </it>is modeled as a linear dynamical system (that is, a matrix <b>F</b>). This linear Markovian model, which represents a kinetic (RNA degrades) or Brownian motion (RNA does not degrade) ODE, is the simplest and requires the fewest parameters (there is one parameter per transcription factor-gene interaction, and an additional offset for each target gene). This model thus helps to avoid over-fitting scarce gene data. The linear model operates on hidden variables, which become a smoothed version of the observed gene expression data.</p>
<p>Because our noise reduction state-space modeling algorithm is efficient, simple and tractable, as explained in the Materials and methods section, it can handle larger numbers of genes (we focused on 76 genes) than other SSM approaches, given enough genes <abbrgrp>
<abbr bid="B37">37</abbr>
<abbr bid="B38">38</abbr>
<abbr bid="B39">39</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Comparative study of state-space model optimization</p>
</st>
<p>Out of the 550 nitrogen-regulated genes, we extracted 67 genes that correspond to all the predicted transcription factors and 9 N-regulated target genes that belong to the primary nitrogen assimilation pathway. The transcription factors have been used as explanatory variables (inputs to <it>f</it>) as well as explained values (output from <it>f</it>) (Figure <figr fid="F4">4b</figr>), whereas the nitrogen assimilation target genes are only explained values. We then optimized our SSM, using different algorithms, in order to fit it to the observed data matrix, and compare all our results in Table <tblr tid="T1">1</tblr>. We also compared our SSM approach to non-SSM approaches <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
<abbr bid="B35">35</abbr>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
</abbrgrp> (Table <tblr tid="T2">2</tblr>).</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>The kinetic ODE and both the conjugate gradient and LARS optimization algorithms obtain the best fit to the 0 to 15 minutes data, with good leave-out-last predictions</p></caption><tblbdy cols="8">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="3" ca="center">
            <p>
               <b>Best hyperparameters (with respect to SNR on leave-1 training dataset)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Performed on training set:</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Performed on test set:</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Dynamics</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Normalization</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Optimization</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Gamma (state-space coefficient)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Tau (kinetic time constant)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Lambda (regularization parameter)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SNR (in dB) on leave-1 training dataset</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>percentage of correct signs on leave-1 test dataset</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Gradient</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.0001</p>
         </c>
         <c ca="center">
            <p>32.4</p>
         </c>
         <c ca="center">
            <p>68%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>LARS</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>32.4</p>
         </c>
         <c ca="center">
            <p>74%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Elastic Nets</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>7</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.2</p>
         </c>
         <c ca="center">
            <p>71%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Gradient</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.0001</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>65%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>LARS</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>63%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Elastic Nets</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>63%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Na&#239;ve trend prediction</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>52%</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Each line in the table represents the type of ODE for the dynamical model of transcription factor-gene regulation (either kinetic, with mRNA degradation, or 'Brownian motion', without mRNA degradation), the type of microarray data normalization, and the optimization algorithm for learning the parameters of the dynamical model. For each of these, we selected the best hyperparameters, namely the state-space coefficient gamma, the kinetic time constant (in minutes) and the parameter regularization coefficient lambda, based on the quality of fit to the training data (from 0 to 15 minutes), as measured by the signal-to-noise ratio (SNR), in dB. We then performed a leave-out-last (leave-1) prediction and counted the number of times the sign of the mRNA change between 15 minutes and 20 minutes was correct. We compared these results to a na&#239;ve extrapolation (based on the trend between 12 and 15 minutes) and obtained statistically significant results at <it>P </it>= 0.0145.</p>
   </tblfn></tbl>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>The quality of fit of our state-space model approach slightly outperforms the non-SSM approaches</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="3" ca="center">
            <p>
               <b>Best hyper parameters (with respect to SNR on leave-1 training dataset)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Performed on training set:</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Performed on test set:</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Dynamics</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Normalization</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Optimization</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Gamma (state-space coefficient)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Tau (kinetic time constant)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Lambda (regularization parameter)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SNR (in dB) on leave-1 training dataset</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>percentage of correct signs on leave-1 test dataset</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Reference</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Gradient</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.0001</p>
         </c>
         <c ca="center">
            <p>32.4</p>
         </c>
         <c ca="center">
            <p>68%</p>
         </c>
         <c ca="center">
            <p>This work</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>LARS</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>32.4</p>
         </c>
         <c ca="center">
            <p>74%</p>
         </c>
         <c ca="center">
            <p>This work</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>LARS</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>74%</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B33">33</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>kinetic</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Elastic Nets</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>74%</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B35">35</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Gradient</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.005</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>66%</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B34">34</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>LARS</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>63%</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B34">34</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Brownian</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c ca="left">
            <p>Elastic Nets</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>32.1</p>
         </c>
         <c ca="center">
            <p>63%</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B34">34</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Na&#239;ve trend prediction</p>
         </c>
         <c ca="left">
            <p>MAS5</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>NA</p>
         </c>
         <c ca="center">
            <p>52%</p>
         </c>
         <c>
            <p/>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>We compared our SSM-based technique (with a non-zero SSM parameter gamma) to previously published algorithms for learning gene regulation networks by enforcing gamma = 0 (see Materials and methods). We notice that the LARS algorithm <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, used in the Inferelator by Bonneau <it>et al</it>. <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, as well as Elastic Nets <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B43">43</abbr></abbrgrp>, obtain a slightly worse quality of fit (signal-to-noise ratio (SNR), in dB) than when combined with our state-space modeling for the same leave-out-last (leave-1) performance as our SSM plus LARS. Not using an mRNA degradation term, as in Wang <it>et al</it>. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, degrades the leave-out-last performance.</p>
   </tblfn></tbl>
<p>Iterative learning algorithms, described in this study, alternate between two steps: learning the function <it>f </it>mapping latent values of transcription factors at time <it>t </it>to changes to target genes (and transcription factors themselves) at time <it>t </it>+ 1; and recomputing (inferring) the values of the latent variables. In the first step, learning the function <it>f </it>corresponds to finding parameters of <b>F </b>that minimize the prediction error and that involve few transcription factors, thanks to a sparsity constraint on <b>F</b>. In the second step, the sum of quadratic errors on functions <it>f </it>and <it>g </it>is minimized with respect to latent variables <b>z</b>(<it>t</it>) by gradient descent in the hidden variable space <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>. The learning procedure is repeated (learning model parameters, inferring latent variables) on training data until <b>F </b>stabilizes (see Materials and methods). Using a bootstrapping approach based on random initialization of latent variables <b>z</b>(<it>t</it>), we further repeat the SSM iterative procedure 20 times and take the final average network <b>F </b>(see Materials and methods).</p>
<p>Three hyper-parameters were explored in our learning experiments: the kinetic time constant <it>&#964; </it>(unless the ODE was 'Brownian motion'), the amount of L1-norm regularization <it>&#955; </it>(explained in Materials and methods), and a variable <it>&#947; </it>linked to the SSM.</p>
<p>When the state-space coefficient is <it>&#947; </it>= 0, we can recover non-SSM algorithms: LARS (least-angle regression and shrinkage) <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>, as used for instance by Bonneau <it>et al</it>. <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp> and Elastic Nets <abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp>, as used, for instance, by Shimamura <it>et al</it>. <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. LARS is a fast implementation of Tibshirani's popular LASSO (least absolute shrinkage and selection operator) regression with L1-norm regularization <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>. Elastic Nets are an improvement over LARS and LASSO, and their main advantage is to group variables (in our case genes) as opposed to choosing one gene and leaving out correlated ones. Moreover, if we do not use the mRNA degradation term in the kinetic ODE, and use instead 'Brownian motion' dynamics, and if we set the state-space coefficient to <it>&#947; </it>= 0, we recover an approach comparable to the one published by Wang <it>et al</it>. <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp> (although their optimization algorithm was based on the singular value decomposition (SVD) of the microarray data).</p>
<p>For each type of ODE (kinetic or 'Brownian motion') and type of optimization algorithm, we exhaustively explored the space of hyper-parameters (<it>&#964;</it>, <it>&#947;</it>, <it>&#955;</it>) in order to optimize the quality of fit of each model to the first six time points (0, 3, 6, 9, 12 and 15 minutes). We repeated the experiment using two different ways of normalizing microarray gene expression data: MAS5 and RMA (Robust Multi-array Average). Interestingly, it appears that machine learning (ML) approaches better fit MAS5 data (32.4 db) compared to RMA data (30.8 db). Thus, the study was continued on MAS5 normalized data, as it was for the vast majority of the studies that we reviewed in this work. As can be seen in Table <tblr tid="T1">1</tblr>, we identified the SSM relying on the kinetic ODE, and with either LARS or conjugate gradient optimizations, as the two best (having the highest signal-to-noise ratio) optimization algorithms on the MAS5 training datasets. The signal-to-noise ratio is a monotonic function of the normalized mean square error on the predicted values of mRNA; all algorithms used in this article aim at minimizing the normalized mean square error, that is, at maximizing the signal-to-noise ratio.</p>
<p>Having chosen the two best algorithms using all time points up to and including 15 minutes as training data, we performed a 'leave-out-last' test, consisting of predicting both the direction and magnitude of the change of the gene expression states between 15 and 20 minutes. Using those algorithms with those parameter settings, we made predictions about whether gene expression levels would be increased (positive sign) or decreased (negative sign) at 20 minutes compared with at 15 minutes.</p>
<p>As Table <tblr tid="T1">1</tblr> shows, a SSM relying on the kinetic ODE and with LARS optimization (kinetic LARS) gives correct results 74% of the time on a set of 53 genes (47 transcription factors and 6 nitrogen assimilation genes) that are 'consistent' among the two biological replicates in their behavior (consistently up- or down-regulated in both replicates) for the transition from 15 minutes to 20 minutes. When we considered all 76 genes, regardless of their 'consistency' across replicates, kinetic LARS still gave correct results 71% of the time. The other chosen algorithm (kinetic ODE with conjugate gradient optimization), yielded 68% correct results on both the 53 consistent genes and on all 76 genes. By contrast, a na&#239;ve 'trend forecast' algorithm to extrapolate the trend between 12 minutes and 15 minutes was correct for only 52% of the consistent genes, just slightly better than random (this result implies that 48% of the consistent genes changed 'direction' at 15 minutes). Thus, our SSM does significantly better (<it>P</it>-value = 0.0145) than the na&#239;ve trend forecast based on a binomial test (see Materials and methods) on a coin that is biased to be correct 52% of the time.</p>
<p>Using the hyper-parameters (<it>&#964;</it>, <it>&#947;</it>, <it>&#955;</it>) corresponding to the two best solutions (kinetic LARS and kinetic conjugate gradient), we retrained two SSMs on all the available data (0 to 20 minutes) to obtain corresponding gene regulatory networks. Finally, we performed a statistical analysis of the bootstrap networks in order to retain transcription factor-gene links that were statistically significant at <it>P </it>= 0.001 (see Materials and methods). We ultimately selected the conjugate gradient-optimized network as it gave a less sparse solution (394 links) than the LARS-optimized gene regulatory network (GRN) (22 links). We used this network (next section) to analyze the NO<sub>3</sub>
<sup>- </sup>response of sentinel genes to transcription factors.</p>
<p>We are confident in dynamical modeling, and in our SSM in particular, because in the leave-out-last tests, we were able to learn the system well enough to predict the direction of changes to gene expression. This suggests that we might have learnt some consistent and biologically meaningful networks involved in NO<sub>3</sub>
<sup>- </sup>response pathway. Since function <it>f </it>models the gene regulation network learned during the leave-out-last test, we conclude by presenting the function <it>f </it>obtained from the full time sequence 0 to 20 minutes. This function <it>f </it>can be displayed as an influence matrix (Figure <figr fid="F4">4b</figr>). The study of this network/matrix as a whole system is discussed below.</p>
</sec>
</sec>
<sec>
<st>
<p>Over-expression of a potential network hub (SPL9) modifies the NO<sub>3</sub>
<sup>- </sup>response of sentinel and transcription factor genes</p>
</st>
<p>In order to probe the role of a transcription factor/hub in the predicted regulatory network presented in Figure <figr fid="F4">4b</figr>, transgenic plants (pSLP9:rSPL9) expressing an altered version of the mRNA for the SPL9 transcription factor were compared to wild-type plants for their response to NO<sub>3</sub>
<sup>- </sup>provision. This gene has been selected for several reasons: (i) it is induced at very early time points (3 and 6 minutes); (ii) the inferred network predicts that SPL9 potentially controls at least six genes, including two nitrogen assimilation sentinel genes - this places it as the third-most influential transcription factor on the nitrogen assimilation gene sentinels; (iii) it is also the most strongly influenced gene in both its number of 'in' connections, and by the magnitude of the regulations controlling it (Figure <figr fid="F4">4b</figr>); and (iv) SPL9 displays some strong correlations with key regulators across the NASC (Nottingham Arabidopsis Stock Centre) array dataset (see below). As such, SPL9 constitutes a potential crucial bottleneck in the flux of information mediated by the proposed nitrogen regulatory network (Figure <figr fid="F4">4</figr>). We first considered SPL9 mutants and monitored sentinel expression in this genetic background. However, even if some defects have been observed, no consistent phenotype can be reported (data not shown). This may be readily explained by the topological redundancy of the network (Figure <figr fid="F4">4</figr>). Thus, one could expect that over-expression of <it>SPL9 </it>mRNA would trigger a detectable effect on the predicted sentinel targets and on the network behavior. SPL9 is a miR156-targeted SBP-box transcription factor identified to control shoot development <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp> and flowering transition <abbrgrp>
<abbr bid="B46">46</abbr>
<abbr bid="B47">47</abbr>
</abbrgrp>, and it also appears as a potential central regulator in the network derived from the SSM (Figure <figr fid="F4">4</figr>). SBP-box transcription factors are potentially redundant, but the use of miR156-resistant SPL9 transgenic plants (resulting in over-expression/gain of function plants) has been extensively used to decipher their role. In our experimental set-up, transgenic <it>SPL9 </it>mRNA is over-expressed 4- to 20-fold in the miR156-resistant plants compared to wild type (Figure <figr fid="F5">5</figr>). Moreover, although the <it>rSPL9 </it>mRNA resulting from the modified gene is resistant to degradation by miR156 (explaining the over-expression) the transgene is still under the control of its native promoter. Thus, this over-expression reflects, at least in part, the promoter activity or the effect of other post-transcriptional controls (independent of miR156). This demonstrates that the <it>SPL9 </it>promoter activity is potentially also under the effect of NO<sub>3</sub>
<sup>-</sup>, since <it>rSPL9 </it>mRNA exhibits a transitory depression in the transgenic plants (Figure <figr fid="F5">5</figr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>rSPL9 over-expression modifies NO<sub>3</sub><sup>- </sup>kinetic responses</p></caption><text>
   <p><b>rSPL9 over-expression modifies NO</b><sub><b>3</b></sub><sup>- </sup><b>kinetic responses</b>. Sentinel gene mRNA levels in roots of wild type (WT) and transgenic pSPL9:rSPL9 plants in response to NO<sub>3</sub><sup>- </sup>treatment. Fourteen-day-old plants grown in ammonium succinate were treated with 1 mM KNO<sub>3 </sub>versus KCl. Roots were collected at 0 minutes (before treatment) and 10, 20, 40, and 60 minutes after the treatment. Sentinel transcripts were measured in roots using RT-QPCR and normalized to two housekeeping genes (see Materials and methods). The data represent the mean &#177; standard error of three biological replicates (three independent experiments). Differences between the two genotypes are statistically significant at *<it>P </it>&lt; 0.05; **<it>P </it>&lt; 0.01 and ***<it>P </it>&lt; 0.001 (<it>t</it>-test). When a gene presents significant differences, the NO<sub>3</sub><sup>- </sup>induction ratio compared to time 0 is indicated by the numbers close to the plots. The inserts display a zoom of the early time points. More genes (transcription factors) are displayed in Additional file <supplr sid="S5">5</supplr>.</p>
</text><graphic file="gb-2010-11-12-r123-5" hint_layout="double"/></fig>
<p>mRNA transcription levels of several sentinel genes have been followed in this pSPL9:rSPL9 transgenic line. The most dramatic effect recorded is for the <it>NIR </it>target sentinel gene. Interestingly, the <it>NIR </it>gene has previously been demonstrated to be one of the most robustly NO<sub>3</sub>
<sup>-</sup>-regulated genes based on a meta-analysis of microarray data from nitrogen treated plants <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>. In support of this, over-expression of the <it>SPL9 </it>gene leads to a significant advance in the <it>NIR </it>NO<sub>3</sub>
<sup>- </sup>response by about 10 minutes, and attenuates its magnitude of regulation at later time points (60 minutes). Less dramatic but still significant effects (over three independent experiments) have been recorded for <it>NRT1.1</it>/<it>CIPK23 </it>genes, belonging to the NO<sub>3</sub>
<sup>- </sup>sensing module <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, and for the <it>NIA2 </it>gene. These results demonstrate a role of the SPL9 transcription factor in the control of genes involved in the NO<sub>3</sub>
<sup>- </sup>primary response. We also further investigated the role of SPL9 over-expression on the transcription levels of genes in the network over time (Figure <figr fid="F4">4b</figr>). Interestingly, SPL9 seems to have an effect on the vast majority of the genes in the regulatory network that we have tested (Additional file <supplr sid="S5">5</supplr>). The diversity of the misregulation of gene expression is high. For instance, 4 out of the 14 genes tested display an early effect (between 0 and 20 minutes) after the SPL9 over-expression. However, 11 genes display modified gene expression at later time points (40 and 60 minutes).</p>
<suppl id="S5">
<title>
<p>Additional file 5</p>
</title>
<text>
<p>
<b>SPL9 over-expression modifies NO</b>
<sub>
<b>3</b>
</sub>
<sup>- </sup>
<b>kinetic responses of transcription factors</b>. Transcription factor mRNA levels in roots of wild type and transgenic pSPL9:rSPL9 in response to NO<sub>3</sub>
<sup>- </sup>treatment. Fourteen-day-old plants grown in ammonium succinate were treated with 1 mM KNO<sub>3 </sub>versus KCl. Roots were collected at 0 minutes (before treatment) and 10, 20, 40, and 60 minutes after the treatment. Sentinel transcripts were measured in roots using RT-QPCR and normalized to two housekeeping genes (see Materials and methods). The data represent the mean &#177; standard error of three biological replicates (three independent experiments). Differences between the two genotypes are statistically significant at *<it>P </it>&lt; 0.05, **<it>P </it>&lt; 0.01, and ***<it>P </it>&lt; 0.001 (<it>t</it>-test).</p>
</text>
<file name="gb-2010-11-12-r123-S5.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<p>An observation needs to be made at this point. We compared the predicted effect of SPL9 on its putative targets (inferred by ML), with the actual effect of over-expression of SPL9 on these genes measured by Q-PCR. The only systematic effect that we found is that genes predicted to be negatively regulated by SPL9 display an early induction in the pSPL9:rSPL9 line and are indeed less induced at later time points (this is an interesting feature found for four genes, including <it>NIR</it>, At1g13300, At5g10030, and At5g65210). However, the opposite is not true, since the genes predicted to be up-regulated by SPL9 also display a 'down-regulation' at later time points (by 60 minutes), but no 'up-regulation' at early time points (10 to 20 minutes), in response to rSPL9 over-expression. This relative absence of logic can be very easily explained by the predicted functional redundancy found in the network (also discussed below). The question about predicting the over-expression of a network hub is intriguing and will need further investigation, including the generation of transcriptome data to probe and learn how the whole system is perturbed by gene over-expression.</p>
<p>On the other hand, it is noteworthy that with its intrinsically highly connected topology, the network is able to amplify effects (such as SPL9 over-expression) across steps/time. Indeed, consider a simple positive feedback loop where A&#8594;B and also B&#8594;A; if the coefficient of A&#8594;B and B&#8594;A is only 10%, this is enough that when the expression of A and B is 100 at time 0, it will reach 160 after 5 steps. Here, we hypothesize that this is what happens for SPL9; that it takes time (more than 20 minutes) to amplify the differences recorded for the 11 genes with modified gene expression at later time points.</p>
<p>This analysis defined SPL9 as a potential hub for short time scale regulatory behavior, with small-amplitude regulatory effects (36% maximum). Those effects are going to be amplified across consecutive time steps and have longer-term (beyond 40 minutes) effects on additional genes. Thus, the interesting part of machine learning approach is that it can be used to predict the eventual behavior at time points that were not used in the ML process.</p>
<p>Interestingly, in our experimental setup, pSPL9:rSPL9 does not display any obvious developmental phenotype, contrary to what was described by Wang <it>et al</it>. <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp>. These diverging results may be explained by the different plant growth conditions used in the two studies. In particular, in our pre-treatment conditions, plants were grown for 14 days in ammonium succinate without any NO<sub>3</sub>
<sup>-</sup>. By contrast, in the Wang <it>et al</it>. studies the phenotypes were observed in plants grown on nitrate as a nitrogen source. The hypothesis that the phenotypes are nitrate-dependent is supported by the fact that the majority of the pSPL9:rSPL9 gene regulation phenotypes (Figure <figr fid="F5">5</figr>; Additional file <supplr sid="S5">5</supplr>) are triggered by NO<sub>3</sub>
<sup>- </sup>provision. Out of the 18 genes displaying a mis-regulation in the transgenic plants, only the <it>CIPK23 </it>gene displays a phenotype before nitrate treatment. For the 17 other genes, NO<sub>3</sub>
<sup>- </sup>is necessary to promote the rSPL9 over-expression effect.</p>
<p>In order to test the hypothesis that nitrate induces the phenotypes in transgenic plants, plants (wild type and pSPL9:rSPL9) were plated on the same background media used for the kinetic experiments (see Materials and methods) and complemented with 0.5 mM (NH<sub>4</sub>)<sub>2</sub>-succinate or 1 mM KNO<sub>3</sub>. Following 8 days after germination, different root traits were scored. These results show that indeed the presence of NO<sub>3</sub>
<sup>- </sup>enhances pSPL9:rSPL9 phenotypes (Additional file <supplr sid="S6">6</supplr>). Overall, only minor developmental differences were recorded on the (NH<sub>4</sub>)<sub>2</sub>-succinate media that cannot explain the distinct molecular phenotypes shown in Figure <figr fid="F5">5</figr> and Additional file <supplr sid="S5">5</supplr>.</p>
<suppl id="S6">
<title>
<p>Additional file 6</p>
</title>
<text>
<p>
<b>NO</b>
<sub>
<b>3</b>
</sub>
<sup>- </sup>
<b>presence enhances pSPL9:rSPL9 phenotypes</b>. Wild type and pSPL9:rSPL9 have been plated on the same background media as used for kinetic experiments (see Materials and methods) and complemented with either 0.5 mM (NH<sub>4</sub>)<sub>2</sub>-succinate or 1 mM KNO<sub>3</sub>. Eight days after germination, different root traits were scored (n = 13 to 24 plants). ANOVA was used to evaluate the genotype effect (Geno: wild type versus pSPL9:rSPL9); the treatment effect (Treat: nitrate versus ammonium) and the interaction of these two factors (G &#215; T). <it>P</it>-values are provided above each plot corresponding to different root parameters. ***<it>P </it>&lt; 0.001; **<it>P </it>&lt; 0.01; *<it>P </it>&lt; 0.05. NO<sub>3</sub>
<sup>- </sup>presence significantly enhances transgene effect for lateral root number, total lateral root length, lateral root density, and lateral root length density. The average length of lateral roots is not significantly controlled by the pSPL9:rSPL9 transgene.</p>
</text>
<file name="gb-2010-11-12-r123-S6.pdf">
   <p>Click here for file</p>
</file>
</suppl>
<p>From a biological point of view, the modification of the nitrogen status of the plant (induced by nitrogen deprivation) has been shown to increase pri-miR156 accumulation <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp> and mature miR156 in Hsieh <it>et al</it>. <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp>. However, it is noteworthy that miR156 induction is not restricted to nitrogen deprivation, as phosphorus deprivation can also induce miR156 accumulation <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp>. On the SPL9 side, we found that its expression across the Affymetrix NASC dataset is either positively or negatively correlated with key molecular actors in NO<sub>3</sub>
<sup>- </sup>sensing, metabolism and development (NRT1.1, -0.68; NRT1.2, -0.55; ARF8, 0.73). These results provide additional support for a role for the mir156/SPL9 partners in the control of the nitrogen response in plants and potentially in its coordination with other signals, such as phosphorus status.</p>
</sec>
<sec>
<st>
<p>A highly complex connected network: causes and consequences?</p>
</st>
<p>Our machine learning approach (state-space modeling) proposes a regulatory network learned from a high-resolution dynamic transcriptome analysis made in response to KNO<sub>3 </sub>provision. A first interesting feature of this regulatory network is that it predicts a high level of connectivity (Figure <figr fid="F4">4b</figr>). Indeed, for the 76 studied genes, 60 have 500 significant connections (<it>P</it>-value &lt; 0.001; see Materials and methods). This high level of connectivity (favoring functional redundancy) may explain why, to date, experimental analyses have uncovered only few molecular actors specifically involved in the control of NO<sub>3</sub>
<sup>-</sup>-induced gene expression (NRT1.1, CIPK8, CIPK23, NLP7, LBD37/LBD38/LBD39) <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>. Moreover, this level of potential transcription factor redundancy can be a cause of the variable and conditional NO<sub>3</sub>
<sup>- </sup>genomic responses registered over different laboratories and discussed in Gutierrez <it>et al</it>. <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> and Krouk <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Predicted regulatory network influences</p>
</st>
<p>The identification of regulatory networks is a major aim of systems biology. Relatively few studies have determined regulatory networks precisely enough so that the model can predict behavior in untested conditions; the successful studies concern unicellular organisms such as <it>Halobacterium salinarum </it>
<abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>. Our regulatory network model is far more simple; it includes less than a hundred genes and is much less predictive than the one developed for <it>Halobacterium </it>since our model is fed only with transcriptomic data and vastly fewer experiments. Interestingly, however, both approaches predict rather low transcription factor influences. Indeed, the maximum predicted influence of one transcription factor on one gene of our model ranges between 10% and 30% (positive or negative influences). Low influence rate might reinforce the notion that functional redundancy is a built-in feature of regulatory networks that helps organisms adapt themselves to the context of interacting environmental and/or evolutionary forces. Also, the fact that <it>Arabidopsis </it>is a multicellular organism suggests a potential for some 'hidden' regulatory networks, as cell-specific studies have suggested <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. This level of complexity, combined with the fact that genes in plants are not organized in functional clusters (as they are in bacteria), are likely to be some of the numerous reasons why our model is less predictive compared to that for <it>Halobacterium</it>. Thus, the next challenge of the plant systems approach will be to reach a level of understanding able to infer regulatory networks at several levels of integration <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Comparative study of state-space model optimization versus non-state-space methods</p>
</st>
<p>In order to measure the improvement of our SSM over the non-state-space methods, given our <it>Arabidopsis </it>dataset, we also ran our SSM with the state-space coefficient set to <it>&#947; </it>= 0 for the following methods: 1) kinetic ODE with LARS optimization, as in Bonneau <it>et al</it>. <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>; 2) first-order vector-autoregressive model with Elastic Nets optimization, as in Shimamura <it>et al</it>. <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>; and 3) 'Brownian motion' ODE, without mRNA degradation, similar in principle to the method used in Wang <it>et al</it>. <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>.</p>
<p>As Table <tblr tid="T2">2</tblr> shows, the quality of fit (signal-to-noise ratio) on the training data was slightly worse (32.1 dB) in all non-SSM methods than in our SSM (32.4 dB). The predictive performance on the training data set was 90% of correct signs for all methods. The leave-out-last performance of 3) was worse (below 65% correct signs on the test set), while it was the same for 1) and 2) as in the LARS-based SSM, that is, 74% correct signs on the test set.</p>
<p>The non-SSM approaches that we considered in Table <tblr tid="T2">2</tblr> would each give a unique solution, while our SSM would give slightly different solutions over consecutive runs, thus enabling a bootstrapping procedure. For this reason, our SSM offers a more principled way to deal with uncertainty and avoid over-fitting in microarray measurements than non-SSM methods. Second, our SSM is flexible because it enables adding unobserved variables as additional transcription factors.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>This systems biology study uses machine learning on time series of transcriptome data to generate testable hypotheses for the potential mechanisms underlying the NO<sub>3</sub>
<sup>- </sup>transduction signal. We demonstrate that a part of the NO<sub>3</sub>
<sup>- </sup>response (happening within minutes after NO<sub>3</sub>
<sup>- </sup>provision) has been missed by previous transcriptional approaches. This early response contains candidate transcription factors such as SPL9 that can modify the characteristics of NO<sub>3</sub>
<sup>- </sup>signal propagation in gene networks. Furthermore, we demonstrate that state-space modeling can infer putative regulatory networks on sparse datasets and thereby suggest hypotheses that will help to decipher poorly understood signaling pathways.</p>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>Plant material and treatments</p>
</st>
<p>
<it>Arabidopsis </it>plants, ecotype Columbia or transgenic plants (pSPL9:rSPL9) <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>, were grown in sterile hydroponics as adapted from <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. In detail, sterilized seeds were sown on Nitex 03-250/47 mesh (Sefar America, Bricarcliff Manor, NY, USA) supported by a plastic platform to allow roots to grow in hydroponics inside a sterile Phytatray (Sigma-Aldrich, St Louis, MO, USA). Hydroponic media consisted of 1X Murashige and Skoog basal medium containing no nitrogen or sucrose (custom-ordered, GibcoBRL, Gaithersburg, MD, USA) supplemented with 3 mM sucrose, 0.5 mM ammonium succinate, MES buffered at pH 5.7 (0.5 g.l<sup>-1</sup>). Plants were grown for 14 to 16 days in day/night cycles (16/8 h; 150 &#956;mol photons m<sup>-2</sup>.s<sup>-1</sup>; Percival Scientific Inc., Perry, IA, USA) at 22&#176;C. Twenty-four hours before the treatments, plants were transferred toward an equivalent fresh nitrogen-free medium in continuous light to avoid gene regulation induced by light. KNO<sub>3 </sub>was added to the media at 1 mM, or otherwise as stated in the figures. Control plants were mock-treated by adding the same concentration of KCl. The samples corresponding to time 0 were first harvested and then the treatment (KNO<sub>3 </sub>or KCl was applied by transferring the plants onto the refreshed nitrogen-free complemented media (with KNO<sub>3 </sub>and KCL). Then, the stopwatch was started. Roots were then harvested every 3 minutes and immediately frozen in liquid nitrogen. It takes an average of 20 to 30 seconds to harvest a sample. For growth experiments, plants were directly plated on petri dishes containing agar (1%) solidified media identical to the hydroponic one described above.</p>
</sec>
<sec>
<st>
<p>RNA preparation and RT-QPCR analysis</p>
</st>
<p>Total RNA extraction was performed using Trizol&#8482; Reagent according to the manufacturer's recommendations (Invitrogen, San Diego, CA, USA). Total RNA (1 to 2 &#956;g) was digested by DNase I (Sigma). RNA was then reverse transcribed to one-strand cDNA using Thermo&#8482; script RT (Invitrogen) according to the manufacturer's protocol. Gene expression was determined by RT-QPCR (LightCycler; Roche Diagnostics, Basel, Switzerland) using gene-specific primers (Additional file <supplr sid="S7">7</supplr>) and LightCycler FastStart DNA Master SYBR Green (Roche Diagnostics). Expression levels of tested genes were normalized to expression levels of the ACT2/8 and clathrin genes as described in <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>.</p>
<suppl id="S7">
<title>
<p>Additional file 7</p>
</title>
<text>
<p>
<b>QPCR primers used in this study</b>.</p>
</text>
<file name="gb-2010-11-12-r123-S7.pdf">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Microarray hybridization</p>
</st>
<p>cDNA was synthesized from 2 &#956;g total RNA using T7- Oligo(dT) promoter primer and reagents recommended by Affymetrix (Santa Clara, CA, USA). Biotin-labeled cRNA was synthesized according to the manufacturer's protocol. Labeled cRNA (8 &#956;g) was hybridized to <it>Arabidopsis </it>ATH1 Affymetrix gene chip for 16 h at 45&#176;C. Washing, staining and scanning were performed as recommended by Affymetrix. Image analysis and normalization to a target median intensity of 150 was performed with the Affymetrix MAS v5.0 set at default values. We analyzed the reproducibility of replicates using the correlation coefficient and visual inspection of scatter plots of pairs of replicates.</p>
</sec>
<sec>
<st>
<p>Modeling of gene expression patterns: clustering methods</p>
</st>
<p>The microarray data reported in this paper have been deposited in the Gene Expression Omnibus (GEO) database [GEO:GSE20044]. Data manipulations were performed in R <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp>. All filtering steps were carried out by controlling the FDR. The data set, corresponding to 26 ATH1 chips times 22,810 probes, were analyzed in two successive steps in order to increase the stringency and precision of the analysis. The intent of the first step was to narrow the focus to genes regulated by nitrate over the entire data set or in interaction with time. Thus, we ran an ANOVA (<it>aov() </it>function) over the data set where the signal of a probe i is Pi ~ &#956; + &#945;N + &#946;T + &#947;T*N + &#949;, where N is the effect of the nitrate treatment, T is the effect of time, and T*N is the effect of their interaction, &#956; is the mean signal over the data set, and &#949; the unexplained variance. We sorted out probes having a significant call (<it>P</it>-value &lt; 0.01; corresponding to 10% &lt; FDR &lt; 15%), for the effect of N and T*N. This first pass identified 2,159 probes corresponding to nitrogen-regulated genes. On this narrowed data set (26 ATH1 chips times 2,159 probes), we modeled the gene expression using a linear modeling approach (<it>lm()</it>) in order to determine for each gene the particular time point that showed the most marked effect of nitrate. Thus, the expression of a gene (signal of a probe) was modeled such as Pi = &#945;<sub>0 </sub>+ &#945;<sub>1</sub>T + &#945;<sub>2</sub>N + &#945;<sub>3</sub>T*N + &#949;, where &#945;<sub>0 </sub>represents the signal at 'time 0' (before the treatment). This modeling approach forces the model to take 'time 0' as a baseline for the possible effect of NO<sub>3</sub>
<sup>- </sup>or KCl, allowing us to eventually discriminate between the KNO<sub>3 </sub>and KCl effects. Thus, we called a probe regulated if it has a positive call (FDR &lt; 5%) for interaction with nitrate at a particular time point.</p>
<p>The clustering shown in Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr> was performed through MeV software <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>. The number of clusters was determined using the figure of merit (FOM) method <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>, followed by a clustering using Pearson correlation as the distance, and the average as method of aggregation. Additional file <supplr sid="S8">8</supplr> provides the KNO<sub>3</sub>/KCl ratio along the kinetic for the entire 550 regulated genes.</p>
<suppl id="S8">
<title>
<p>Additional file 8</p>
</title>
<text>
<p>
<b>KNO</b>
<sub>
<b>3</b>
</sub>
<b>/KCl gene expression ratio of significantly regulated genes (see also raw data [GEO:GSE20044])</b>.</p>
</text>
<file name="gb-2010-11-12-r123-S8.txt">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Randomization test</p>
</st>
<p>Data manipulations were performed in R <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp>. We set up a test to monitor whether the overlap between two gene lists is higher than expected by chance. The test consists of randomly selecting 10,000 gene lists of <it>Arabidopsis thaliana </it>Gene Index (AGI) numbers, out of the genome, having the same size as the observed lists. Thus, we counted the number (n) of times that the intersection size of the random lists is equal or higher than the intersection observed for the two tested gene lists. A <it>P</it>-value is thus generated equal to n/10,000. For the generation of the matrix in Figure <figr fid="F3">3</figr>, the randomization test was limited to 1,000 gene lists in order to speed up the analysis.</p>
</sec>
<sec>
<st>
<p>State-space model</p>
</st>
<p>Our SSM involves noisy but observed gene expression data <b>y</b>(<it>t</it>) (black circles in Figure <figr fid="F4">4a</figr>), as well as idealized but unobserved ('latent') gene expression values <b>z</b>(<it>t</it>) (red circles in Figure <figr fid="F4">4a</figr>). As shown in Figure <figr fid="F4">4a</figr>, the relationship between consecutive latent variables <b>z</b>(<it>t</it>
<sub>
<it>k</it>
</sub>) and <b>z</b>(<it>t</it>
<sub>
<it>k</it>
</sub>
<sub>+1</sub>) is a Markov chain: each latent gene's expression value at time <it>t</it>
<sub>
<it>k</it>
</sub>
<sub>+1 </sub>is assumed to depend only on the state of potentially all the latent gene expressions at the previous time point <it>t</it>. For each gene <it>i</it>, this relationship stems from the kinetic ODE involving the rate of mRNA change (with a kinetic time constant <it>&#964;</it>), mRNA degradation, and a linear function <it>fi </it>of transcription factor concentrations for that specific gene. So-called 'Brownian motion' dynamics correspond to kinetic dynamics without the mRNA degradation term. In linearized (discretized) form, the overall dynamical model <it>f </it>can be represented by an <it>n &#215; m </it>matrix <b>F </b>where <it>n </it>is the total number of genes and <it>m </it>the number of transcription factors (<it>m</it>&lt;<it>n</it>, and transcription factors are given indexes from 1 to <it>m</it>), plus a bias term <b>b </b>and a Gaussian error term with zero mean and fixed covariance:</p>
<p>
<display-formula>
<m:math name="gb-2010-11-12-r123-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mtable>
   <m:mtr>
      <m:mtd>
         <m:mi>&#964;</m:mi>
         <m:mfrac>
            <m:mrow>
               <m:mtext>d</m:mtext>
               <m:msub>
                  <m:mi>z</m:mi>
                  <m:mi>i</m:mi>
               </m:msub>
               <m:mo stretchy="false">(</m:mo>
               <m:mi>t</m:mi>
               <m:mo stretchy="false">)</m:mo>
            </m:mrow>
            <m:mrow>
               <m:mtext>d</m:mtext>
               <m:mi>t</m:mi>
            </m:mrow>
         </m:mfrac>
         <m:mo>+</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>=</m:mo>
         <m:msub>
            <m:mi>f</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mstyle mathvariant="bold">
            <m:mtext>z</m:mtext>
         </m:mstyle>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>+</m:mo>
         <m:msub>
            <m:mi>&#951;</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
      </m:mtd>
   </m:mtr>
   <m:mtr>
      <m:mtd>
         <m:mfrac>
            <m:mi>&#964;</m:mi>
            <m:mrow>
               <m:msub>
                  <m:mi>t</m:mi>
                  <m:mrow>
                     <m:mi>k</m:mi>
                     <m:mo>+</m:mo>
                     <m:mn>1</m:mn>
                  </m:mrow>
               </m:msub>
               <m:mo>&#8722;</m:mo>
               <m:msub>
                  <m:mi>t</m:mi>
                  <m:mi>k</m:mi>
               </m:msub>
            </m:mrow>
         </m:mfrac>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mrow>
               <m:mi>k</m:mi>
               <m:mo>+</m:mo>
               <m:mn>1</m:mn>
            </m:mrow>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>&#8722;</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mi>k</m:mi>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>+</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mi>k</m:mi>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>=</m:mo>
         <m:mstyle displaystyle="true">
            <m:munderover>
               <m:mo>&#8721;</m:mo>
               <m:mrow>
                  <m:mi>j</m:mi>
                  <m:mo>=</m:mo>
                  <m:mn>1</m:mn>
               </m:mrow>
               <m:mrow>
                  <m:msub>
                     <m:mi>N</m:mi>
                     <m:mi>i</m:mi>
                  </m:msub>
               </m:mrow>
            </m:munderover>
            <m:mrow>
               <m:msub>
                  <m:mi>F</m:mi>
                  <m:mrow>
                     <m:mi>i</m:mi>
                     <m:mo>,</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
               </m:msub>
               <m:msub>
                  <m:mi>z</m:mi>
                  <m:mi>j</m:mi>
               </m:msub>
               <m:mo stretchy="false">(</m:mo>
               <m:msub>
                  <m:mi>t</m:mi>
                  <m:mi>k</m:mi>
               </m:msub>
               <m:mo stretchy="false">)</m:mo>
               <m:mo>+</m:mo>
               <m:msub>
                  <m:mi>b</m:mi>
                  <m:mrow>
                     <m:mi>i</m:mi>
                     <m:mo>,</m:mo>
                     <m:mn>0</m:mn>
                  </m:mrow>
               </m:msub>
               <m:mo>+</m:mo>
               <m:msub>
                  <m:mi>&#951;</m:mi>
                  <m:mi>i</m:mi>
               </m:msub>
               <m:mo stretchy="false">(</m:mo>
               <m:msub>
                  <m:mi>t</m:mi>
                  <m:mi>k</m:mi>
               </m:msub>
               <m:mo stretchy="false">)</m:mo>
            </m:mrow>
         </m:mstyle>
      </m:mtd>
   </m:mtr>
</m:mtable>
</m:math>
</display-formula>
</p>
<p>The observation model <it>h </it>is essentially an <it>n &#215; n </it>identity matrix with a Gaussian error term:</p>
<p>
<display-formula>
<m:math name="gb-2010-11-12-r123-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mtable>
   <m:mtr>
      <m:mtd>
         <m:msub>
            <m:mi>y</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>=</m:mo>
         <m:mi>h</m:mi>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>+</m:mo>
         <m:msub>
            <m:mi>&#1013;</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:mi>t</m:mi>
         <m:mo stretchy="false">)</m:mo>
      </m:mtd>
   </m:mtr>
   <m:mtr>
      <m:mtd>
         <m:msub>
            <m:mi>y</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mi>k</m:mi>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>=</m:mo>
         <m:msub>
            <m:mi>z</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mi>k</m:mi>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
         <m:mo>+</m:mo>
         <m:msub>
            <m:mi>&#1013;</m:mi>
            <m:mi>i</m:mi>
         </m:msub>
         <m:mo stretchy="false">(</m:mo>
         <m:msub>
            <m:mi>t</m:mi>
            <m:mi>k</m:mi>
         </m:msub>
         <m:mo stretchy="false">)</m:mo>
      </m:mtd>
   </m:mtr>
</m:mtable>
</m:math>
</display-formula>
</p>
<p>An iterative procedure tries to learn the dynamical relationship between latent gene expression variables <b>z</b>(<it>t</it>) while maintaining the latent variables <b>z</b>(<it>t</it>) as close as possible to the observed Affimetrix measures <b>y</b>(<it>t</it>). The algorithm consists of a) minimizing the sum of quadratic errors of the dynamical and the observation models with respect to the latent variables <b>Z </b>by using gradient descent on the latent variables <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> (this is the inference step), and b) minimizing the sum of quadratic errors of the dynamical model using conjugate gradient, LARS <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp> or Elastic Nets <abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp> optimization on the parameters of <b>F </b>(this is the learning step). On a sequence <b>Y </b>of <it>T </it>microarray measurements (including replicate sequences) over <it>n </it>genes, corresponding latent variables <b>Z</b>, under a dynamic model parameterized by transcription factor-gene influence matrix <b>F </b>and bias term <b>b</b>, and for a given hyperparameter <it>&#947; </it>(which controls the weight of the dynamical and observation errors), the total error term for the inference step is:</p>
<p>
<display-formula>
<m:math name="gb-2010-11-12-r123-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:msub>
      <m:mi>e</m:mi>
      <m:mtext mathvariant="normal">&#947;</m:mtext>
   </m:msub>
   <m:mo stretchy="false">(</m:mo>
   <m:mstyle mathvariant="bold">
      <m:mtext>Y</m:mtext>
   </m:mstyle>
   <m:mo>,</m:mo>
   <m:mstyle mathvariant="bold">
      <m:mtext>Z</m:mtext>
   </m:mstyle>
   <m:mo>;</m:mo>
   <m:mstyle mathvariant="bold">
      <m:mtext>F</m:mtext>
   </m:mstyle>
   <m:mo>,</m:mo>
   <m:mstyle mathvariant="bold">
      <m:mtext>b</m:mtext>
   </m:mstyle>
   <m:mo stretchy="false">)</m:mo>
   <m:mo>=</m:mo>
   <m:mstyle displaystyle="true">
      <m:munderover>
         <m:mo>&#8721;</m:mo>
         <m:mrow>
            <m:mi>i</m:mi>
            <m:mo>=</m:mo>
            <m:mn>1</m:mn>
         </m:mrow>
         <m:mi>n</m:mi>
      </m:munderover>
      <m:mrow>
         <m:mstyle displaystyle="true">
            <m:munderover>
               <m:mo>&#8721;</m:mo>
               <m:mrow>
                  <m:mi>k</m:mi>
                  <m:mo>=</m:mo>
                  <m:mn>1</m:mn>
               </m:mrow>
               <m:mi>T</m:mi>
            </m:munderover>
            <m:mrow>
               <m:mrow>
                  <m:mo>(</m:mo>
                  <m:mrow>
                     <m:mtext mathvariant="normal">&#947;</m:mtext>
                     <m:mo>|</m:mo>
                     <m:mo>|</m:mo>
                     <m:msub>
                        <m:mi>&#951;</m:mi>
                        <m:mi>i</m:mi>
                     </m:msub>
                     <m:mo stretchy="false">(</m:mo>
                     <m:msub>
                        <m:mi>t</m:mi>
                        <m:mi>k</m:mi>
                     </m:msub>
                     <m:mo stretchy="false">)</m:mo>
                     <m:mo>|</m:mo>
                     <m:msubsup>
                        <m:mo>|</m:mo>
                        <m:mn>2</m:mn>
                        <m:mn>2</m:mn>
                     </m:msubsup>
                     <m:mo>+</m:mo>
                     <m:mo>|</m:mo>
                     <m:mo>|</m:mo>
                     <m:msub>
                        <m:mi>&#1013;</m:mi>
                        <m:mi>i</m:mi>
                     </m:msub>
                     <m:mo stretchy="false">(</m:mo>
                     <m:msub>
                        <m:mi>t</m:mi>
                        <m:mi>k</m:mi>
                     </m:msub>
                     <m:mo stretchy="false">)</m:mo>
                     <m:mo>|</m:mo>
                     <m:msubsup>
                        <m:mo>|</m:mo>
                        <m:mn>2</m:mn>
                        <m:mn>2</m:mn>
                     </m:msubsup>
                  </m:mrow>
                  <m:mo>)</m:mo>
               </m:mrow>
            </m:mrow>
         </m:mstyle>
      </m:mrow>
   </m:mstyle>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>The solution of a non-SSM can be recovered by setting <it>&#947; </it>= 0, which amounts to having exactly <b>Y </b>= <b>Z</b>.</p>
<p>During the learning step, sparse gene regulation networks are obtained by penalizing dense solutions using L1-norm regularization, which amounts to adding a <it>&#955;</it>-weighted penalty to the dynamical error term, as in the LASSO initially described by Tibshirani <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>. Employing regularization on parameters of <b>F </b>also helps to avoid local optima in the solutions.</p>
<p>The learning algorithm is run for 100 consecutive epochs over all the replicate sequences (four replicate sequences here). In order to retain the optimal set of parameters of <it>f</it>, one selects the epoch where the dynamic error on the training dataset is minimal. The cross-validation data are unseen during the optimization procedure. One run of the learning procedure yields a matrix <b>F </b>of signed (positive (excitatory) or negative (inhibitory)) interactions between transcription factors and genes. Each element F<sub>i,j </sub>represents the action of the j-th transcription factor on the i-th gene.</p>
</sec>
<sec>
<st>
<p>Selection of gene regulation network by bootstrapping</p>
</st>
<p>The algorithm described above for learning SSMs starts with random initial values for both the dynamical model (in other words, matrix <b>F</b>) and for the latent variables <b>Z</b>. We repeat the whole procedure 20 times in order to perform the following bootstrapping evaluation. Each run <it>k </it>of the algorithm might converge to a slightly different solution <b>F</b>
<sup>
<b>*(k)</b>
</sup>. We then take the average transcription factor-gene interaction weights obtained from all solutions <b>F</b>
<sup>
<b>*(k) </b>
</sup>and call it <b>F*</b>. Table <tblr tid="T1">1</tblr> reports comparative results on the average solutions.</p>
<p>In parallel, we also generate 1,000 random permutations <b>P</b>
<sup>
<b>*(k,1)</b>
</sup>, <b>P</b>
<sup>
<b>*(k,2)</b>
</sup>, ..., <b>P</b>
<sup>
<b>*(k,1000) </b>
</sup>of each matrix <b>F</b>*<sup>(k)</sup>, and then compute 1,000 average matrices <b>P</b>
<sup>
<b>*(1)</b>
</sup>, <b>P</b>
<sup>
<b>*(2)</b>
</sup>, ..., <b>P</b>
<sup>
<b>*(1000) </b>
</sup>of those 'scrambled' matrices (we take the averages over the 20 runs). We compare each average element <b>F</b>
<sub>i,j</sub>
<b>* </b>to the empirical distribution of the 1,000 permuted averages and thus obtain an empirical <it>P</it>-value. The final genetic regulation network consists of elements <b>F</b>
<sub>i,j</sub>
<b>* </b>that have a <it>P</it>-value &lt; 0.001.</p>
</sec>
<sec>
<st>
<p>Software package</p>
</st>
<p>We provide online <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp> our own SSM software, which is based on publication <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> and implements the algorithm described above. The software runs under Matlab (The Mathworks Inc., Natick, MA, USA) and requires the LARS library, designed by Karl Skoglund (IMM, DTU), which is available as part of the BoLASSO Matlab package.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>FDR: false discovery rate; LARS: least-angle regression and shrinkage; LASSO: least absolute shrinkage and selection operator; ODE: ordinary differential equation; RT-QPCR: real time quantitative PCR; SSM: state space model.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>GK, DS, and GC designed the study. GK carried out the experiments. GK performed transcriptome data analysis. GK and DS developed the randomization test to infer signal convergence. PM, DS, and YL developed the machine learning approach and the SSM algorithm. GK, PM, DS, and GC wrote the paper.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>The pSPL9:rSPL9 seeds were kindly provided by the Detlef Weigel lab. This work was supported by: NIH Grant GM032877 to GC and DS; a NSF <it>Arabidopsis </it>2010 grant MCB-0929338 to GC and DS. Modeling tools are developed under NSF DBI-0445666 to GC and DS. GK is supported by a European-FP7-International Outgoing Fellowship (Marie Curie) (AtSYSTM-BIOL; PIOF-GA-2008-220157). DS is also supported by IIS-0414763. YL is supported by a grant NSF ITR-0325463: ITR: New directions in predictive learning. We thank Dr Sandrine Ruffel for her useful comments on the manuscript.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Nitrate - nutrient and signal for plant-growth.</p></title><aug><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>Plant Cell</source><pubdate>1995</pubdate><volume>7</volume><fpage>859</fpage><lpage>868</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1105/tpc.7.7.859</pubid><pubid idtype="pmcid">160877</pubid><pubid idtype="pmpid" link="fulltext">7640524</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Nitrate signaling: adaptation to fluctuating environments.</p></title><aug><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au><au><snm>Tsay</snm><fnm>YF</fnm></au></aug><source>Curr Opin Plant Biol</source><pubdate>2010</pubdate><volume>13</volume><fpage>266</fpage><lpage>273</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.pbi.2009.12.003</pubid><pubid idtype="pmpid" link="fulltext">20093067</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>A systems view of responses to nutritional cues in <it>Arabidopsis</it>: toward a paradigm shift for predictive network modeling.</p></title><aug><au><snm>Ruffel</snm><fnm>S</fnm></au><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au></aug><source>Plant Physiol</source><pubdate>2010</pubdate><volume>152</volume><fpage>445</fpage><lpage>452</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.109.148502</pubid><pubid idtype="pmpid" link="fulltext">19939945</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>A systems view of nitrogen nutrient and metabolite responses in <it>Arabidopsis</it>.</p></title><aug><au><snm>Vidal</snm><fnm>EA</fnm></au><au><snm>Gutierrez</snm><fnm>RA</fnm></au></aug><source>Curr Opin Plant Biol</source><pubdate>2008</pubdate><volume>11</volume><fpage>521</fpage><lpage>529</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.pbi.2008.07.003</pubid><pubid idtype="pmpid" link="fulltext">18775665</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Genomic analysis of a nutrient response in <it>Arabidopsis </it>reveals diverse expression patterns and novel metabolic and potential regulatory genes induced by nitrate.</p></title><aug><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Guegler</snm><fnm>K</fnm></au><au><snm>LaBrie</snm><fnm>ST</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>Plant Cell</source><pubdate>2000</pubdate><volume>12</volume><fpage>1491</fpage><lpage>1510</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1105/tpc.12.8.1491</pubid><pubid idtype="pmcid">149118</pubid><pubid idtype="pmpid" link="fulltext">10948265</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Microarray analysis of the nitrate response in <it>Arabidopsis </it>roots and shoots reveals over 1,000 rapidly responding genes and new linkages to glucose, trehalose-6-phosphate, iron, and sulfate metabolism.</p></title><aug><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Okamoto</snm><fnm>M</fnm></au><au><snm>Xing</snm><fnm>X</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>Plant Physiol</source><pubdate>2003</pubdate><volume>132</volume><fpage>556</fpage><lpage>567</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.103.021253</pubid><pubid idtype="pmcid">166997</pubid><pubid idtype="pmpid" link="fulltext">12805587</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Genomic analysis of the nitrate response using a nitrate reductase-null mutant of <it>Arabidopsis</it>.</p></title><aug><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Tischner</snm><fnm>R</fnm></au><au><snm>Gutierrez</snm><fnm>RA</fnm></au><au><snm>Hoffman</snm><fnm>M</fnm></au><au><snm>Xing</snm><fnm>X</fnm></au><au><snm>Chen</snm><fnm>M</fnm></au><au><snm>Coruzzi</snm><fnm>G</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>Plant Physiol</source><pubdate>2004</pubdate><volume>136</volume><fpage>2512</fpage><lpage>2522</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.104.044610</pubid><pubid idtype="pmcid">523318</pubid><pubid idtype="pmpid" link="fulltext">15333754</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Genome-wide reprogramming of primary and secondary metabolism, protein synthesis, cellular growth processes, and the regulatory infrastructure of <it>Arabidopsis </it>in response to nitrogen.</p></title><aug><au><snm>Scheible</snm><fnm>WR</fnm></au><au><snm>Morcuende</snm><fnm>R</fnm></au><au><snm>Czechowski</snm><fnm>T</fnm></au><au><snm>Fritz</snm><fnm>C</fnm></au><au><snm>Osuna</snm><fnm>D</fnm></au><au><snm>Palacios-Rojas</snm><fnm>N</fnm></au><au><snm>Schindelasch</snm><fnm>D</fnm></au><au><snm>Thimm</snm><fnm>O</fnm></au><au><snm>Udvardi</snm><fnm>MK</fnm></au><au><snm>Stitt</snm><fnm>M</fnm></au></aug><source>Plant Physiol</source><pubdate>2004</pubdate><volume>136</volume><fpage>2483</fpage><lpage>2499</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.104.047019</pubid><pubid idtype="pmcid">523316</pubid><pubid idtype="pmpid" link="fulltext">15375205</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>A Systems approach uncovers restrictions for signal interactions regulating genome-wide responses to nutritional cues in <it>Arabidopsis</it>.</p></title><aug><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Tranchina</snm><fnm>D</fnm></au><au><snm>Lejay</snm><fnm>L</fnm></au><au><snm>Cruikshank</snm><fnm>AA</fnm></au><au><snm>Shasha</snm><fnm>D</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au><au><snm>Gutierrez</snm><fnm>RA</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000326</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1000326</pubid><pubid idtype="pmcid">2652106</pubid><pubid idtype="pmpid" link="fulltext">19300494</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Cell-specific nitrogen responses mediate developmental plasticity.</p></title><aug><au><snm>Gifford</snm><fnm>ML</fnm></au><au><snm>Dean</snm><fnm>A</fnm></au><au><snm>Gutierrez</snm><fnm>RA</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au><au><snm>Birnbaum</snm><fnm>KD</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>803</fpage><lpage>808</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0709559105</pubid><pubid idtype="pmcid">2206617</pubid><pubid idtype="pmpid" link="fulltext">18180456</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Qualitative network models and genome-wide expression data define carbon/nitrogen-responsive molecular machines in <it>Arabidopsis</it>.</p></title><aug><au><snm>Gutierrez</snm><fnm>RA LL</fnm></au><au><snm>Dean</snm><fnm>A</fnm></au><au><snm>Chiaromonte</snm><fnm>F</fnm></au><au><snm>Shasha</snm><fnm>DE</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><fpage>R7</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2007-8-1-r7</pubid><pubid idtype="pmcid">1839130</pubid><pubid idtype="pmpid" link="fulltext">17217541</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Systemic signaling of the plant nitrogen status triggers specific transcriptome responses depending on the nitrogen source in <it>Medicago truncatula</it>.</p></title><aug><au><snm>Ruffel</snm><fnm>S</fnm></au><au><snm>Freixes</snm><fnm>S</fnm></au><au><snm>Balzergue</snm><fnm>S</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Jeudy</snm><fnm>C</fnm></au><au><snm>Martin-Magniette</snm><fnm>ML</fnm></au><au><snm>van der Merwe</snm><fnm>MJ</fnm></au><au><snm>Kakar</snm><fnm>K</fnm></au><au><snm>Gouzy</snm><fnm>J</fnm></au><au><snm>Fernie</snm><fnm>AR</fnm></au><au><snm>Udvardi</snm><fnm>M</fnm></au><au><snm>Salon</snm><fnm>C</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au><au><snm>Lepetit</snm><fnm>M</fnm></au></aug><source>Plant Physiol</source><pubdate>2008</pubdate><volume>146</volume><fpage>2020</fpage><lpage>2035</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.107.115667</pubid><pubid idtype="pmcid">2287368</pubid><pubid idtype="pmpid" link="fulltext">18287487</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>A system biology approach highlights a hormonal enhancer effect on regulation of genes in a nitrate responsive "biomodule".</p></title><aug><au><snm>Nero</snm><fnm>D</fnm></au><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Tranchina</snm><fnm>D</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au></aug><source>BMC Syst Biol</source><pubdate>2009</pubdate><volume>3</volume><fpage>59</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-3-59</pubid><pubid idtype="pmcid">2702358</pubid><pubid idtype="pmpid" link="fulltext">19500399</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Insights into the genomic nitrate response using genetics and the Sungear Software System.</p></title><aug><au><snm>Gutierrez</snm><fnm>RA</fnm></au><au><snm>Gifford</snm><fnm>ML</fnm></au><au><snm>Poultney</snm><fnm>C</fnm></au><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Shasha</snm><fnm>DE</fnm></au><au><snm>Coruzzi</snm><fnm>GM</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>J Exp Bot</source><pubdate>2007</pubdate><volume>58</volume><fpage>2359</fpage><lpage>2367</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/jxb/erm079</pubid><pubid idtype="pmpid" link="fulltext">17470441</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>The <it>Arabidopsis </it>NRT1.1 transporter participates in the signaling pathway triggering root colonization of nitrate-rich patches.</p></title><aug><au><snm>Remans</snm><fnm>T</fnm></au><au><snm>Nacry</snm><fnm>P</fnm></au><au><snm>Pervent</snm><fnm>M</fnm></au><au><snm>Filleur</snm><fnm>S</fnm></au><au><snm>Diatloff</snm><fnm>E</fnm></au><au><snm>Mounier</snm><fnm>E</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Forde</snm><fnm>BG</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>19206</fpage><lpage>19211</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0605275103</pubid><pubid idtype="pmcid">1748200</pubid><pubid idtype="pmpid" link="fulltext">17148611</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Nitrate and auxin transport by NRT1.1 defines a mechanism for nutrient sensing in plants.</p></title><aug><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Lacombe</snm><fnm>B</fnm></au><au><snm>Bielach</snm><fnm>A</fnm></au><au><snm>Perrine-Walker</snm><fnm>F</fnm></au><au><snm>Malinska</snm><fnm>K</fnm></au><au><snm>Mounier</snm><fnm>E</fnm></au><au><snm>Hoyerova</snm><fnm>K</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Leon</snm><fnm>S</fnm></au><au><snm>Ljung</snm><fnm>K</fnm></au><au><snm>Zazimalova</snm><fnm>E</fnm></au><au><snm>Benkova</snm><fnm>E</fnm></au><au><snm>Nacry</snm><fnm>P</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au></aug><source>Dev Cell</source><pubdate>2010</pubdate><volume>18</volume><fpage>927</fpage><lpage>937</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.devcel.2010.05.008</pubid><pubid idtype="pmpid" link="fulltext">20627075</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Regulation of the high-affinity NO3- uptake system by NRT1.1-mediated NO3- demand signaling in <it>Arabidopsis</it>.</p></title><aug><au><snm>Krouk</snm><fnm>G</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au></aug><source>Plant Physiol</source><pubdate>2006</pubdate><volume>142</volume><fpage>1075</fpage><lpage>1086</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.106.087510</pubid><pubid idtype="pmcid">1630733</pubid><pubid idtype="pmpid" link="fulltext">16998085</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Transcript profiling in the chl1-5 mutant of <it>Arabidopsis </it>reveals a role of the nitrate transporter NRT1.1 in the regulation of another nitrate transporter, NRT2.1.</p></title><aug><au><snm>Mu&#241;os</snm><fnm>S</fnm></au><au><snm>Cazettes</snm><fnm>C</fnm></au><au><snm>Fizames</snm><fnm>C</fnm></au><au><snm>Gaymard</snm><fnm>F</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Lepetit</snm><fnm>M</fnm></au><au><snm>Lejay</snm><fnm>L</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au></aug><source>Plant Cell</source><pubdate>2004</pubdate><volume>16</volume><fpage>2433</fpage><lpage>2447</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1105/tpc.104.024380</pubid><pubid idtype="pmcid">520944</pubid><pubid idtype="pmpid" link="fulltext">15319483</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>A genetic screen for nitrate regulatory mutants captures the nitrate transporter gene NRT1.1.</p></title><aug><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Xing</snm><fnm>X</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Tran</snm><fnm>A</fnm></au><au><snm>Crawford</snm><fnm>NM</fnm></au></aug><source>Plant Physiol</source><pubdate>2009</pubdate><volume>151</volume><fpage>472</fpage><lpage>478</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.109.140434</pubid><pubid idtype="pmcid">2735993</pubid><pubid idtype="pmpid" link="fulltext">19633234</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>CHL1 functions as a nitrate sensor in plants.</p></title><aug><au><snm>Ho</snm><fnm>CH</fnm></au><au><snm>Lin</snm><fnm>SH</fnm></au><au><snm>Hu</snm><fnm>HC</fnm></au><au><snm>Tsay</snm><fnm>YF</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>138</volume><fpage>1184</fpage><lpage>1194</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.07.004</pubid><pubid idtype="pmpid" link="fulltext">19766570</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>AtCIPK8, a CBL-interacting protein kinase, regulates the low-affinity phase of the primary nitrate response.</p></title><aug><au><snm>Hu</snm><fnm>HC</fnm></au><au><snm>Wang</snm><fnm>YY</fnm></au><au><snm>Tsay</snm><fnm>YF</fnm></au></aug><source>Plant J</source><pubdate>2009</pubdate><volume>57</volume><fpage>264</fpage><lpage>278</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-313X.2008.03685.x</pubid><pubid idtype="pmpid" link="fulltext">18798873</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>The nodule inception-like protein 7 modulates nitrate sensing and metabolism in <it>Arabidopsis</it>.</p></title><aug><au><snm>Castaings</snm><fnm>L</fnm></au><au><snm>Camargo</snm><fnm>A</fnm></au><au><snm>Pocholle</snm><fnm>D</fnm></au><au><snm>Gaudon</snm><fnm>V</fnm></au><au><snm>Texier</snm><fnm>Y</fnm></au><au><snm>Boutet-Mercey</snm><fnm>S</fnm></au><au><snm>Taconnat</snm><fnm>L</fnm></au><au><snm>Renou</snm><fnm>JP</fnm></au><au><snm>Daniel-Vedele</snm><fnm>F</fnm></au><au><snm>Fernandez</snm><fnm>E</fnm></au><au><snm>Meyer</snm><fnm>C</fnm></au><au><snm>Krapp</snm><fnm>A</fnm></au></aug><source>Plant J</source><pubdate>2009</pubdate><volume>57</volume><fpage>426</fpage><lpage>435</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-313X.2008.03695.x</pubid><pubid idtype="pmpid" link="fulltext">18826430</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Members of the LBD family of transcription factors repress anthocyanin synthesis and affect additional nitrogen responses in <it>Arabidopsis</it>.</p></title><aug><au><snm>Rubin</snm><fnm>G</fnm></au><au><snm>Tohge</snm><fnm>T</fnm></au><au><snm>Matsuda</snm><fnm>F</fnm></au><au><snm>Saito</snm><fnm>K</fnm></au><au><snm>Scheible</snm><fnm>WR</fnm></au></aug><source>Plant Cell</source><pubdate>2009</pubdate><volume>21</volume><fpage>3567</fpage><lpage>3584</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1105/tpc.109.067041</pubid><pubid idtype="pmcid">2798321</pubid><pubid idtype="pmpid" link="fulltext">19933203</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>An <it>Arabidopsis </it>MADS box gene that controls nutrient-induced changes in root architecture.</p></title><aug><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Forde</snm><fnm>BG</fnm></au></aug><source>Science</source><pubdate>1998</pubdate><volume>279</volume><fpage>407</fpage><lpage>409</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.279.5349.407</pubid><pubid idtype="pmpid" link="fulltext">9430595</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Dynamical factor graphs for time series modeling.</p></title><aug><au><snm>Mirowski</snm><fnm>P</fnm></au><au><snm>LeCun</snm><fnm>Y</fnm></au></aug><source>Lecture Notes Artificial Intelligence</source><pubdate>2009</pubdate><volume>5782</volume><fpage>128</fpage><lpage>142</lpage></bibl><bibl id="B26"><title><p>The effect of Glc6P uptake and its subsequent oxidation within pea root plastids on nitrite reduction and glutamate synthesis.</p></title><aug><au><snm>Bowsher</snm><fnm>CG</fnm></au><au><snm>Lacey</snm><fnm>AE</fnm></au><au><snm>Hanke</snm><fnm>GT</fnm></au><au><snm>Clarkson</snm><fnm>DT</fnm></au><au><snm>Saker</snm><fnm>LR</fnm></au><au><snm>Stulen</snm><fnm>I</fnm></au><au><snm>Emes</snm><fnm>MJ</fnm></au></aug><source>J Exp Bot</source><pubdate>2007</pubdate><volume>58</volume><fpage>1109</fpage><lpage>1118</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/jxb/erl269</pubid><pubid idtype="pmpid" link="fulltext">17220512</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Oxidative pentose phosphate pathway-dependent sugar sensing as a mechanism for regulation of root ion transporters by photosynthesis.</p></title><aug><au><snm>Lejay</snm><fnm>L</fnm></au><au><snm>Wirth</snm><fnm>J</fnm></au><au><snm>Pervent</snm><fnm>M</fnm></au><au><snm>Cross</snm><fnm>JM</fnm></au><au><snm>Tillard</snm><fnm>P</fnm></au><au><snm>Gojon</snm><fnm>A</fnm></au></aug><source>Plant Physiol</source><pubdate>2008</pubdate><volume>146</volume><fpage>2036</fpage><lpage>2053</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.107.114710</pubid><pubid idtype="pmcid">2287369</pubid><pubid idtype="pmpid" link="fulltext">18305209</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Genome-wide analysis of <it>Arabidopsis </it>responsive transcriptome to nitrogen limitation and its regulation by the ubiquitin ligase gene NLA.</p></title><aug><au><snm>Peng</snm><fnm>M</fnm></au><au><snm>Bi</snm><fnm>YM</fnm></au><au><snm>Zhu</snm><fnm>T</fnm></au><au><snm>Rothstein</snm><fnm>SJ</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2007</pubdate><volume>65</volume><fpage>775</fpage><lpage>797</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11103-007-9241-0</pubid><pubid idtype="pmpid" link="fulltext">17885809</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Different plant hormones regulate similar processes through largely nonoverlapping transcriptional responses.</p></title><aug><au><snm>Nemhauser</snm><fnm>JL</fnm></au><au><snm>Hong</snm><fnm>F</fnm></au><au><snm>Chory</snm><fnm>J</fnm></au></aug><source>Cell</source><pubdate>2006</pubdate><volume>126</volume><fpage>467</fpage><lpage>475</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2006.05.050</pubid><pubid idtype="pmpid" link="fulltext">16901781</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Plant hormones and nutrient signaling.</p></title><aug><au><snm>Rubio</snm><fnm>V</fnm></au><au><snm>Bustos</snm><fnm>R</fnm></au><au><snm>Irigoyen</snm><fnm>ML</fnm></au><au><snm>Cardona-Lopez</snm><fnm>X</fnm></au><au><snm>Rojas-Triana</snm><fnm>M</fnm></au><au><snm>Paz-Ares</snm><fnm>J</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2009</pubdate><volume>69</volume><fpage>361</fpage><lpage>373</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11103-008-9380-y</pubid><pubid idtype="pmpid" link="fulltext">18688730</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Reverse engineering of gene regulatory networks.</p></title><aug><au><snm>Jaeger</snm><fnm>J</fnm></au><au><snm>Monk</snm><fnm>N</fnm></au></aug><source>Learning and Inference in Computational Systems Biology</source><publisher>Cambridge MA: MIT Press</publisher><editor>Lawrence N, Girolami M, Rattray M, Sanguinetti G</editor><pubdate>2010</pubdate></bibl><bibl id="B32"><title><p>A predictive model for transcriptional control of physiology in a free living cell.</p></title><aug><au><snm>Bonneau</snm><fnm>R</fnm></au><au><snm>Facciotti</snm><fnm>MT</fnm></au><au><snm>Reiss</snm><fnm>DJ</fnm></au><au><snm>Schmid</snm><fnm>AK</fnm></au><au><snm>Pan</snm><fnm>M</fnm></au><au><snm>Kaur</snm><fnm>A</fnm></au><au><snm>Thorsson</snm><fnm>V</fnm></au><au><snm>Shannon</snm><fnm>P</fnm></au><au><snm>Johnson</snm><fnm>MH</fnm></au><au><snm>Bare</snm><fnm>JC</fnm></au><au><snm>Longabaugh</snm><fnm>W</fnm></au><au><snm>Vuthoori</snm><fnm>M</fnm></au><au><snm>Whitehead</snm><fnm>K</fnm></au><au><snm>Madar</snm><fnm>A</fnm></au><au><snm>Suzuki</snm><fnm>L</fnm></au><au><snm>Mori</snm><fnm>T</fnm></au><au><snm>Chang</snm><fnm>DE</fnm></au><au><snm>Diruggiero</snm><fnm>J</fnm></au><au><snm>Johnson</snm><fnm>CH</fnm></au><au><snm>Hood</snm><fnm>L</fnm></au><au><snm>Baliga</snm><fnm>NS</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>131</volume><fpage>1354</fpage><lpage>1365</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.10.053</pubid><pubid idtype="pmpid" link="fulltext">18160043</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo.</p></title><aug><au><snm>Bonneau</snm><fnm>R</fnm></au><au><snm>Reiss</snm><fnm>DJ</fnm></au><au><snm>Shannon</snm><fnm>P</fnm></au><au><snm>Facciotti</snm><fnm>M</fnm></au><au><snm>Hood</snm><fnm>L</fnm></au><au><snm>Baliga</snm><fnm>NS</fnm></au><au><snm>Thorsson</snm><fnm>V</fnm></au></aug><source>Genome Biol</source><pubdate>2006</pubdate><volume>7</volume><fpage>R36</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2006-7-5-r36</pubid><pubid idtype="pmcid">1779511</pubid><pubid idtype="pmpid" link="fulltext">16686963</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Inferring gene regulatory networks from multiple microarray datasets.</p></title><aug><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Joshi</snm><fnm>T</fnm></au><au><snm>Zhang</snm><fnm>XS</fnm></au><au><snm>Xu</snm><fnm>D</fnm></au><au><snm>Chen</snm><fnm>L</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><fpage>2413</fpage><lpage>2420</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl396</pubid><pubid idtype="pmpid" link="fulltext">16864593</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Recursive regularization for inferring gene networks from time-course gene expression profiles.</p></title><aug><au><snm>Shimamura</snm><fnm>T</fnm></au><au><snm>Imoto</snm><fnm>S</fnm></au><au><snm>Yamaguchi</snm><fnm>R</fnm></au><au><snm>Fujita</snm><fnm>A</fnm></au><au><snm>Nagasaki</snm><fnm>M</fnm></au><au><snm>Miyano</snm><fnm>S</fnm></au></aug><source>BMC Syst Biol</source><pubdate>2009</pubdate><volume>3</volume><fpage>41</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-3-41</pubid><pubid idtype="pmcid">2686685</pubid><pubid idtype="pmpid" link="fulltext">19386091</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><aug><au><snm>Murphy</snm><fnm>K</fnm></au><au><snm>Mian</snm><fnm>S</fnm></au></aug><source>Modelling Gene Expression Data using Dynamic Bayesian Networks. Technical report</source><publisher>Computer Science Division, University of California and Life Sciences Division, Lawrence Berkeley National Laboratory</publisher><pubdate>1999</pubdate></bibl><bibl id="B37"><title><p>A Bayesian approach to reconstructing genetic regulatory networks with hidden factors.</p></title><aug><au><snm>Beal</snm><fnm>M</fnm></au><au><snm>Falciani</snm><fnm>F</fnm></au><au><snm>Ghahramani</snm><fnm>Z</fnm></au><au><snm>Rangel</snm><fnm>C</fnm></au><au><snm>Wild</snm><fnm>D</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>349</fpage><lpage>356</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti014</pubid><pubid idtype="pmpid" link="fulltext">15353451</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Inferring transcriptional networks using prior biological knowledge and constrained state-space models.</p></title><aug><au><snm>Angus</snm><fnm>J</fnm></au><au><snm>Beal</snm><fnm>M</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Rangel</snm><fnm>C</fnm></au><au><snm>Wild</snm><fnm>D</fnm></au></aug><source>Learning and Inference in Computational Systems Biology</source><publisher>Cambridge: MIT Press</publisher><editor>Lawrence ND, Girolami M, Rattray M, Sanguinetti G</editor><pubdate>2010</pubdate><fpage>117</fpage><lpage>152</lpage></bibl><bibl id="B39"><title><p>An integrated machine learning approach for predicting DosR-regulated genes in <it>Mycobacterium tuberculosis</it>.</p></title><aug><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Hatch</snm><fnm>KA</fnm></au><au><snm>Bacon</snm><fnm>J</fnm></au><au><snm>Wernisch</snm><fnm>L</fnm></au></aug><source>BMC Syst Biol</source><pubdate>2010</pubdate><volume>4</volume><fpage>37</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-4-37</pubid><pubid idtype="pmcid">2867773</pubid><pubid idtype="pmpid" link="fulltext">20356371</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Modeling genetic regulatory dynamic in neural development.</p></title><aug><au><snm>Wahde</snm><fnm>M</fnm></au><au><snm>Hertz</snm><fnm>J</fnm></au></aug><source>J Comput Biol</source><pubdate>2001</pubdate><volume>8</volume><fpage>429</fpage><lpage>442</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1089/106652701752236223</pubid><pubid idtype="pmpid" link="fulltext">11571076</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Grouped graphical Granger modeling for gene expression regulatory networks discovery.</p></title><aug><au><snm>Lozano</snm><fnm>AC</fnm></au><au><snm>Abe</snm><fnm>N</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Rosset</snm><fnm>S</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><fpage>i110</fpage><lpage>118</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp199</pubid><pubid idtype="pmcid">2687953</pubid><pubid idtype="pmpid" link="fulltext">19477976</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Least angle regression.</p></title><aug><au><snm>Efron</snm><fnm>B</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au><au><snm>Jonnstone</snm><fnm>I</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>Ann Stat</source><pubdate>2004</pubdate><volume>32</volume><fpage>407</fpage><lpage>499</lpage><xrefbib><pubid idtype="doi">10.1214/009053604000000067</pubid></xrefbib></bibl><bibl id="B43"><title><p>Regularization and variable selection via the elastic net.</p></title><aug><au><snm>Zhu</snm><fnm>H</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au></aug><source>J R Stat Soc</source><pubdate>2005</pubdate><volume>67</volume><fpage>301</fpage><lpage>320</lpage><xrefbib><pubid idtype="doi">10.1111/j.1467-9868.2005.00503.x</pubid></xrefbib></bibl><bibl id="B44"><title><p>Regression shrinkage and selection via the lasso.</p></title><aug><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>J R Stat Soc</source><pubdate>2006</pubdate><volume>58</volume><fpage>267</fpage><lpage>288</lpage></bibl><bibl id="B45"><title><p>The microRNA regulated SBP-box genes SPL9 and SPL15 control shoot maturation in <it>Arabidopsis</it>.</p></title><aug><au><snm>Schwarz</snm><fnm>S</fnm></au><au><snm>Grande</snm><fnm>AV</fnm></au><au><snm>Bujdoso</snm><fnm>N</fnm></au><au><snm>Saedler</snm><fnm>H</fnm></au><au><snm>Huijser</snm><fnm>P</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2008</pubdate><volume>67</volume><fpage>183</fpage><lpage>195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11103-008-9310-z</pubid><pubid idtype="pmcid">2295252</pubid><pubid idtype="pmpid" link="fulltext">18278578</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>miR156-regulated SPL transcription factors define an endogenous flowering pathway in <it>Arabidopsis thaliana</it>.</p></title><aug><au><snm>Wang</snm><fnm>JW</fnm></au><au><snm>Czech</snm><fnm>B</fnm></au><au><snm>Weigel</snm><fnm>D</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>138</volume><fpage>738</fpage><lpage>749</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.06.014</pubid><pubid idtype="pmpid" link="fulltext">19703399</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>The sequential action of miR156 and miR172 regulates developmental timing in <it>Arabidopsis</it>.</p></title><aug><au><snm>Wu</snm><fnm>G</fnm></au><au><snm>Park</snm><fnm>MY</fnm></au><au><snm>Conway</snm><fnm>SR</fnm></au><au><snm>Wang</snm><fnm>JW</fnm></au><au><snm>Weigel</snm><fnm>D</fnm></au><au><snm>Poethig</snm><fnm>RS</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>138</volume><fpage>750</fpage><lpage>759</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.06.031</pubid><pubid idtype="pmcid">2732587</pubid><pubid idtype="pmpid" link="fulltext">19703400</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Dual effects of miR156-targeted SPL genes and CYP78A5/KLUH on plastochron length and organ size in <it>Arabidopsis thaliana</it>.</p></title><aug><au><snm>Wang</snm><fnm>JW</fnm></au><au><snm>Schwab</snm><fnm>R</fnm></au><au><snm>Czech</snm><fnm>B</fnm></au><au><snm>Mica</snm><fnm>E</fnm></au><au><snm>Weigel</snm><fnm>D</fnm></au></aug><source>Plant Cell</source><pubdate>2008</pubdate><volume>20</volume><fpage>1231</fpage><lpage>1243</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1105/tpc.108.058180</pubid><pubid idtype="pmcid">2438454</pubid><pubid idtype="pmpid" link="fulltext">18492871</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Identification of nutrient-responsive <it>Arabidopsis </it>and rapeseed microRNAs by comprehensive real-time polymerase chain reaction profiling and small RNA sequencing.</p></title><aug><au><snm>Pant</snm><fnm>BD</fnm></au><au><snm>Musialak-Lange</snm><fnm>M</fnm></au><au><snm>Nuc</snm><fnm>P</fnm></au><au><snm>May</snm><fnm>P</fnm></au><au><snm>Buhtz</snm><fnm>A</fnm></au><au><snm>Kehr</snm><fnm>J</fnm></au><au><snm>Walther</snm><fnm>D</fnm></au><au><snm>Scheible</snm><fnm>WR</fnm></au></aug><source>Plant Physiol</source><pubdate>2009</pubdate><volume>150</volume><fpage>1541</fpage><lpage>1555</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.109.139139</pubid><pubid idtype="pmcid">2705054</pubid><pubid idtype="pmpid" link="fulltext">19465578</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Uncovering small RNA-mediated responses to phosphate-deficiency in <it>Arabidopsis </it>by deep sequencing.</p></title><aug><au><snm>Hsieh</snm><fnm>LC</fnm></au><au><snm>Lin</snm><fnm>SI</fnm></au><au><snm>Shih</snm><fnm>AC</fnm></au><au><snm>Chen</snm><fnm>JW</fnm></au><au><snm>Lin</snm><fnm>WY</fnm></au><au><snm>Tseng</snm><fnm>CY</fnm></au><au><snm>Li</snm><fnm>WH</fnm></au><au><snm>Chiou</snm><fnm>TJ</fnm></au></aug><source>Plant Physiol</source><pubdate>2009</pubdate><volume>151</volume><fpage>2120</fpage><lpage>2132</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.109.147280</pubid><pubid idtype="pmcid">2785986</pubid><pubid idtype="pmpid" link="fulltext">19854858</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>R project website.</p></title><url>http://www.r-project.org/</url></bibl><bibl id="B52"><title><p>MeV software.</p></title><url>http://www.tm4.org/mev.html</url></bibl><bibl id="B53"><title><p>Validating clustering for gene expression data.</p></title><aug><au><snm>Yeung</snm><fnm>KY</fnm></au><au><snm>Haynor</snm><fnm>DR</fnm></au><au><snm>Ruzzo</snm><fnm>WL</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>309</fpage><lpage>318</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.4.309</pubid><pubid idtype="pmpid" link="fulltext">11301299</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Source code for SSM.</p></title><url>http://cs.nyu.edu/~mirowski/Publications_en.html</url></bibl></refgrp>
</bm></art>