<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-12-research0076</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Prediction and overview of the RpoN-regulon in closely related species of the <it>Rhizobiales</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Dombrecht</snm>
               <fnm>Bruno</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Marchal</snm>
               <fnm>Kathleen</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Vanderleyden</snm>
               <fnm>Jos</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4" ca="yes">
               <snm>Michiels</snm>
               <fnm>Jan</fnm>
               <insr iid="I1"/>
               <email>jan.michiels@agr.kuleuven.ac.be</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium</p>
            </ins>
            <ins id="I2">
               <p>ESAT-SCD, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>12</issue>
         <fpage>research0076.1</fpage>
         <lpage>research0076.11</lpage>
         <url>http://genomebiology.com/2002/3/12/research/0076</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-12-research0076</pubid>
               <pubid idtype="pmpid">12537565</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>6</month>
               <year>2002</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>16</day>
               <month>9</month>
               <year>2002</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>18</day>
               <month>10</month>
               <year>2002</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>11</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>Dombrecht et al., licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>Prediction and overview of the RpoN-regulon in closely related species of the Rhizobiales</p>
      </shorttitle>
      <shortabs>
         <p>Two complete rhizobial genomes (<it>Mesorhizobium loti</it>, <it>Sinorhizobium meliloti</it>) and four symbiotic regions (<it>Rhizobium etli</it>, <it>Rhizobium</it> sp. NGR234, <it>Bradyrhizobium japonicum</it>, <it>M. loti</it>) were screened for the presence of highly conserved RpoN-binding sites. The methodology used to predict RpoN-binding sites proved highly effective as nearly all known RpoN-controlled genes were identified.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In the rhizobia, a group of symbiotic Gram-negative soil bacteria, RpoN (&#963;<sup>54</sup>, &#963;<sup>N</sup>, NtrA) is best known as the sigma factor enabling transcription of the nitrogen fixation genes. Recent reports, however, demonstrate the involvement of RpoN in other symbiotic functions, although no large-scale effort has yet been undertaken to unravel the RpoN-regulon in rhizobia. We screened two complete rhizobial genomes (<it>Mesorhizobium loti</it>, <it>Sinorhizobium meliloti</it>) and four symbiotic regions (<it>Rhizobium etli</it>, <it>Rhizobium</it> sp. NGR234, <it>Bradyrhizobium japonicum</it>, <it>M. loti</it>) for the presence of the highly conserved RpoN-binding sites. A comparison was also made with two closely related non-symbiotic members of the <it>Rhizobiales</it> (<it>Agrobacterium tumefaciens</it>, <it>Brucella melitensis</it>).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A highly specific weight-matrix-based screening method was applied to predict members of the RpoN-regulon, which were stored in a highly annotated and manually curated dataset. Possible enhancer-binding proteins (EBPs) controlling the expression of RpoN-dependent genes were predicted with a profile hidden Markov model.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The methodology used to predict RpoN-binding sites proved highly effective as nearly all known RpoN-controlled genes were identified. In addition, many new RpoN-dependent functions were found. The dependency of several of these diverse functions on RpoN seems species-specific. Around 30% of the identified genes are hypothetical. Rhizobia appear to have recruited RpoN for symbiotic processes, whereas the role of RpoN in <it>A. tumefaciens</it> and <it>B. melitensis</it> remains largely to be elucidated. All species screened possess at least one uncharacterized EBP as well as the usual ones. Lastly, RpoN could significantly broaden its working range by direct interfering with the binding of regulatory proteins to the promoter DNA.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The bacterial alternative sigma factor RpoN recognizes and binds a -24/-12-type promoter with the following consensus sequence: 5'-TG<b>G</b>CACG-N<sub>4</sub>-TTG<b>C</b>W-3' (the bold G and C are situated at position -24 and -12 relative to the transcription start site, respectively) [<abbr bid="B1">1</abbr>]. Subsequently, the core DNA-dependent RNA polymerase (core-RNAP) binds to the RpoN-DNA secondary complex to form a stable closed promoter complex. The closed promoter complex is unable to initiate transcription by itself. For this, melting of the double-stranded (ds) DNA within the closed complex is required [<abbr bid="B2">2</abbr>]. This is accomplished by the nucleotide hydrolysis activity of an activator or enhancer-binding protein (EBP). EBPs bind to enhancer sites situated 100 base-pairs (bp) or more upstream of the transcription initiation site. Each EBP is controlled by its own signal transduction pathway, thereby responding to different conditions [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>,<abbr bid="B5">5</abbr>]. As there is almost no leaky expression in the absence of EBPs, the expression of different RpoN-dependent genes is tightly regulated by the different EBPs.</p>
         <p>RpoN is also known as &#963;<sup>54</sup> (from the 54 kDa molecular weight of the <it>Escherichia coli</it> polypeptide), &#963;<sup>N</sup> (this sigma factor was initially discovered for its requirement for the expression of nitrogen metabolism genes) and NtrA or GlnF (names now not in common use). Because the members of this protein family vary considerably in molecular weight, the designation &#963;<sup>54</sup> cannot be correctly applied to all of them. Furthermore, we use the name RpoN instead of &#963;<sup>N</sup> as the link with the encoding gene <it>rpoN</it> is more obvious. As some bacteria (for example, <it>Bradyrhizobium japonicum</it>, <it>Rhizobium etli</it> and <it>Mesorhizobium loti</it>) have two copies of the gene, the respective proteins can be easily distinguished as RpoN<sub>1</sub> and RpoN<sub>2</sub>.</p>
         <p>RpoN helps initiating the transcription of genes encoding proteins for very diverse functions in a broad range of bacteria [<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>,<abbr bid="B7">7</abbr>]. The processes controlled by RpoN are not essential for cell survival and growth under favorable conditions, with the exception of <it>Myxococcus xanthus</it> [<abbr bid="B8">8</abbr>]. The most widespread RpoN-regulated function in bacteria is the assimilation of ammonia [<abbr bid="B6">6</abbr>,<abbr bid="B7">7</abbr>]. The expression of genes coding for glutamine synthetase, an ammonium transporter and the PII proteins is controlled by the NtrC EBP and RpoN. In the enteric nitrogen regulation (Ntr) paradigm, NtrC is activated via phosphorylation by NtrB. The PII protein, a signal transducer, stimulates dephosphorylation of NtrC-P under conditions of nitrogen excess (reflected by a high glutamine/2-ketoglutarate ratio), whereas it does not affect the NtrC phosphorylation status under nitrogen-limiting conditions [<abbr bid="B9">9</abbr>]. Consequently, under low nitrogen conditions, NtrC-P activates transcription of the NtrC-RpoN regulon.</p>
         <p>Species of the genera <it>Allorhizobium</it>, <it>Azorhizobium</it>, <it>Bradyrhizobium</it>, <it>Mesorhizobium</it>, <it>Rhizobium</it> and <it>Sinorhizobium</it> are generally referred to as rhizobia. Rhizobial <it>rpoN</it> mutants are deficient in symbiotic nitrogen fixation [<abbr bid="B10">10</abbr>,<abbr bid="B11">11</abbr>,<abbr bid="B12">12</abbr>,<abbr bid="B13">13</abbr>,<abbr bid="B14">14</abbr>]. RpoN ensures the transcription of most of the nitrogen fixation genes (<it>nif/fix</it>) whose gene products constitute the nitrogenase complex and accessory proteins [<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>]. However, several other symbiosis-related genes are reported to be RpoN-dependent [<abbr bid="B17">17</abbr>,<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>,<abbr bid="B21">21</abbr>,<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>,<abbr bid="B25">25</abbr>,<abbr bid="B26">26</abbr>,<abbr bid="B27">27</abbr>]. So far, no large-scale effort has been undertaken to unravel the RpoN-regulon in rhizobia. The recent publication of several rhizobial genomes and symbiotic regions is a good opportunity to identify RpoN-regulated genes in rhizobia. To obtain a good view on what (symbiotic) functions are controlled by RpoN, we carried out an <it>in silico</it> analysis on the presence of -24/-12-type promoters in the symbiotic regions of <it>R. etli</it> CFN42, <it>Rhizobium</it> sp. NGR234, <it>B. japonicum</it> USDA110 and <it>M. loti</it> R7A [<abbr bid="B28">28</abbr>,<abbr bid="B29">29</abbr>,<abbr bid="B30">30</abbr>] and in the genomes of <it>M. loti</it> MAFF303099 and <it>Sinorhizobium meliloti</it> 1021 [<abbr bid="B31">31</abbr>,<abbr bid="B32">32</abbr>]. Two closely related non-symbiotic species belonging to the <it>Rhizobiales</it> order of the &#945;-proteobacteria, namely <it>Agrobacterium tumefaciens</it> C58 and <it>Brucella melitensis</it> 16M [<abbr bid="B33">33</abbr>,<abbr bid="B34">34</abbr>], were also included. These are non-symbiotic plant and animal pathogens, respectively. To date, there is only one report on RpoN-dependent functions in <it>A. tumefaciens</it> [<abbr bid="B35">35</abbr>] whereas no information whatsoever is available on the RpoN-regulon of <it>B. melitensis</it>. The possible RpoN-dependent genes predicted by the screening were complemented with data from the literature and classified according to the function of the encoded proteins.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p><it>In silico</it> identification of potential RpoN-dependent promoters</p>
            </st>
            <p>The upstream intergenic sequences were extracted from the genomes of <it>M. loti</it> MAFF303099, <it>S. meliloti</it> 1021, <it>A. tumefaciens</it> C58 and <it>B. melitensis</it> M16 and the symbiotic regions of <it>R. etli</it> CFN42, <it>Rhizobium</it> sp. NGR234, <it>B. japonicum</it> USDA110 and <it>M. loti</it> R7A (see Materials and methods) [<abbr bid="B28">28</abbr>,<abbr bid="B29">29</abbr>,<abbr bid="B30">30</abbr>,<abbr bid="B31">31</abbr>,<abbr bid="B32">32</abbr>,<abbr bid="B33">33</abbr>,<abbr bid="B34">34</abbr>]. The upper strand of these sequences was scored against a weight matrix using PATSER (see Materials and methods). Positive matches (possible RpoN-binding sites or -24/-12-type promoters) were classified according to the functions of the encoded gene products (see Additional data files, pages 8,9). To ensure a sound functional description, the predicted proteins were individually screened against the protein databases of the National Center for Biotechnology Information (NCBI) using BLASTP.</p>
            <p>In our analysis, the number of false-positive matches is estimated to be very low. The use of PATSER in combination with a strong weight matrix is a preferred method to identify true binding sites [<abbr bid="B36">36</abbr>]. The weight matrix used here was based on a set of 186 characterized RpoN-binding sites of 44 different bacterial species (Table <tblr tid="T1">1</tblr>) [<abbr bid="B1">1</abbr>]. The lower threshold for the scores was chosen such that only matches strongly resembling the consensus were retained (see Materials and methods). This high stringency allows for a high specificity at the expense of the sensitivity. Indeed, some reports mention the presence of very poorly conserved -24/-12-type promoters that appear to be functional to some degree (see Additional data files, pages 8,9). Our stringent procedure most probably misses these sites. Therefore, the retained matches might represent an underestimation of the actual number of active -24/-12-type promoters. Second, only intergenic sequences were considered for the screening, as all known -24/-12-type promoters are situated in intergenic regions [<abbr bid="B7">7</abbr>]. Moreover, the matches all have the correct orientation, as the upper strand of the intergenic sequences was used (for matches on the lower strand, see 'Additional control mechanisms involving RpoN'). The median of the distances from the -12 (C) position - the cytidylate residue on position -12 relative to the transcription initiation site - of a match to the start codon of the downstream coding sequence (CDS) varies from 74 to 156 bp (Table <tblr tid="T2">2</tblr>). This is somewhat different from the situation in <it>E. coli</it>, where the average distance amounts to 50 bp [<abbr bid="B7">7</abbr>]. Although roughly 75% of the matches are within 200 bp upstream of the start codon, matches with distances over 1,000 bp were also retained (Table <tblr tid="T2">2</tblr>). A good example to justify this is the case of the mapped promoter of the <it>B. japonicum</it> gene <it>fixB</it>, which is situated 720 bp upstream of the coding region [<abbr bid="B37">37</abbr>]. As is the case with other <it>in silico</it> prediction methods, the results of our analysis may have been biased by the approach used.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Weight matrix based on 186 characterized -24/-12-type promoters [<abbr bid="B1">1</abbr>]</p>
               </caption>
               <tblbdy cols="17">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>G* -24</p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>C*</p>
                     </c>
                     <c ca="center">
                        <p>W</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="17">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c ca="right">
                        <p>12</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>12</p>
                     </c>
                     <c ca="right">
                        <p>139</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>55</p>
                     </c>
                     <c ca="right">
                        <p>51</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>44</p>
                     </c>
                     <c ca="right">
                        <p>38</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>76</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>147</p>
                     </c>
                     <c ca="right">
                        <p>23</p>
                     </c>
                     <c ca="right">
                        <p>122</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                     <c ca="right">
                        <p>48</p>
                     </c>
                     <c ca="right">
                        <p>64</p>
                     </c>
                     <c ca="right">
                        <p>42</p>
                     </c>
                     <c ca="right">
                        <p>62</p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>173</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>184</p>
                     </c>
                     <c ca="right">
                        <p>186</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>103</p>
                     </c>
                     <c ca="right">
                        <p>69</p>
                     </c>
                     <c ca="right">
                        <p>36</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                     <c ca="right">
                        <p>43</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>181</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T</p>
                     </c>
                     <c ca="right">
                        <p>150</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>21</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>43</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="right">
                        <p>40</p>
                     </c>
                     <c ca="right">
                        <p>65</p>
                     </c>
                     <c ca="right">
                        <p>43</p>
                     </c>
                     <c ca="right">
                        <p>136</p>
                     </c>
                     <c ca="right">
                        <p>154</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>88</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Position relative to transcription start site.</p>
               </tblfn>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Comparison of different members of the <it>Rhizobiales</it></p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Re</p>
                     </c>
                     <c ca="left">
                        <p>NGR</p>
                     </c>
                     <c ca="left">
                        <p>Bj</p>
                     </c>
                     <c ca="left">
                        <p>R7A</p>
                     </c>
                     <c ca="left">
                        <p>Ml</p>
                     </c>
                     <c ca="left">
                        <p>Sm</p>
                     </c>
                     <c ca="left">
                        <p>At</p>
                     </c>
                     <c ca="left">
                        <p>Bm</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genome</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Upper strand matches*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>82</p>
                     </c>
                     <c ca="left">
                        <p>43</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>RpoN controlled gene products</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>184</p>
                     </c>
                     <c ca="left">
                        <p>62</p>
                     </c>
                     <c ca="left">
                        <p>69</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Size (Mb)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>7.59</p>
                     </c>
                     <c ca="left">
                        <p>6.69</p>
                     </c>
                     <c ca="left">
                        <p>5.67</p>
                     </c>
                     <c ca="left">
                        <p>3.29</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Matches/Mb</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>10.80</p>
                     </c>
                     <c ca="left">
                        <p>6.43</p>
                     </c>
                     <c ca="left">
                        <p>5.47</p>
                     </c>
                     <c ca="left">
                        <p>3.65</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GC content (%)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>62.4</p>
                     </c>
                     <c ca="left">
                        <p>62.1</p>
                     </c>
                     <c ca="left">
                        <p>58.1</p>
                     </c>
                     <c ca="left">
                        <p>57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Symbiotic region<sup>&#8224;</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Upper strand matches*</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>RpoN controlled gene products</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>87</p>
                     </c>
                     <c ca="left">
                        <p>55</p>
                     </c>
                     <c ca="left">
                        <p>58</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Size (Mb)</p>
                     </c>
                     <c ca="left">
                        <p>0.37</p>
                     </c>
                     <c ca="left">
                        <p>0.54</p>
                     </c>
                     <c ca="left">
                        <p>0.41</p>
                     </c>
                     <c ca="left">
                        <p>0.50</p>
                     </c>
                     <c ca="left">
                        <p>0.61</p>
                     </c>
                     <c ca="left">
                        <p>1.35</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Matches/Mb</p>
                     </c>
                     <c ca="left">
                        <p>64.86</p>
                     </c>
                     <c ca="left">
                        <p>42.59</p>
                     </c>
                     <c ca="left">
                        <p>75.61</p>
                     </c>
                     <c ca="left">
                        <p>50.00</p>
                     </c>
                     <c ca="left">
                        <p>39.34</p>
                     </c>
                     <c ca="left">
                        <p>11.11</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>GC content (%)</p>
                     </c>
                     <c ca="left">
                        <p>57.3</p>
                     </c>
                     <c ca="left">
                        <p>58.5</p>
                     </c>
                     <c ca="left">
                        <p>59.1</p>
                     </c>
                     <c ca="left">
                        <p>59.3</p>
                     </c>
                     <c ca="left">
                        <p>59.7</p>
                     </c>
                     <c ca="left">
                        <p>60.4</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Intergenic GC content (%)<sup>&#8225;</sup></p>
                     </c>
                     <c ca="left">
                        <p>55.0</p>
                     </c>
                     <c ca="left">
                        <p>55.9</p>
                     </c>
                     <c ca="left">
                        <p>57.3</p>
                     </c>
                     <c ca="left">
                        <p>58.0</p>
                     </c>
                     <c ca="left">
                        <p>57.0</p>
                     </c>
                     <c ca="left">
                        <p>56.4</p>
                     </c>
                     <c ca="left">
                        <p>52.0</p>
                     </c>
                     <c ca="left">
                        <p>49.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lower strand matches*</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>67<sup>&#165;</sup></p>
                     </c>
                     <c ca="left">
                        <p>32<sup>&#165;</sup></p>
                     </c>
                     <c ca="left">
                        <p>23<sup>&#165;</sup></p>
                     </c>
                     <c ca="left">
                        <p>12<sup>&#165;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>-12 (C) distance to start codon (bp)<sup>&#167;</sup></p>
                     </c>
                     <c ca="left">
                        <p>91 (231)</p>
                     </c>
                     <c ca="left">
                        <p>88 (149)</p>
                     </c>
                     <c ca="left">
                        <p>156 (214)</p>
                     </c>
                     <c ca="left">
                        <p>156 (288)</p>
                     </c>
                     <c ca="left">
                        <p>102 (209)</p>
                     </c>
                     <c ca="left">
                        <p>97 (210)</p>
                     </c>
                     <c ca="left">
                        <p>74 142)</p>
                     </c>
                     <c ca="left">
                        <p>123 (345)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Possible EBPs<sup>&#182;</sup></p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Matches (possible RpoN-binding sites or -24/-12-type promoters) were predicted using PATSER (see Materials and methods). <sup>&#8224;</sup>p42d (Re); pNGR234<it>a</it> (NGR); 410 kb region on chromosome (Bj); 502 kb symbiotic island on chromosome (R7A); 610 kb symbiotic island on chromosome (Ml); pSymA (Sm). <sup>&#8225;</sup>The GC-content of the intergenic (non-coding) sequences was calculated with FramePlot 2.3.2 [<abbr bid="B68">68</abbr>]. <sup>&#167;</sup>The distribution of the -12 (C) position on the intergenic sequences is biased towards the corresponding downstream CDS (this distribution is visualized by means of a boxplot in the Additional data files, page 29). Therefore the median and third quartile (between brackets) were used as indicative parameters for these distances. <sup>&#182;</sup>Possible EBPs were predicted using HMMER 2.2 g (see Materials and methods) (see also Table <tblr tid="T3">3</tblr>). <sup>&#165;</sup>The upstream intergenic region of the <it>rpoN</it> gene is included. At, <it>Agrobacterium tumefaciens</it> C58; Bm, <it>Brucella melitensis</it> 16 M; Bj, <it>Bradyrhizobium japonicum</it> USDA110; Ml, <it>Mesorhizobium loti</it> MAFF303099; NGR, <it>Rhizobium</it> sp. NGR234; R7A, <it>M. loti</it> R7A; Re, <it>Rhizobium etli</it> CFN42; Sm, <it>Sinorhizobium meliloti</it> 1021.</p>
               </tblfn>
            </tbl>
            <p>A good test case to evaluate the reliability of our predictions is to compare the matches with experimental data. In our laboratory we confirmed the RpoN-dependent expression of several <it>R. etli</it> genes with predicted RpoN-binding sites (see Additional data files, pages 3,4,6,7,12,13). The products of these genes are involved in a wide variety of functions, such as nitrogen fixation (<it>nifH</it> and <it>iscN-nifUS</it>) [<abbr bid="B38">38</abbr>,<abbr bid="B39">39</abbr>], oxidative stress and gene regulation (<it>spxA-rpoN</it><sub>2</sub>) [<abbr bid="B13">13</abbr>,<abbr bid="B40">40</abbr>], and transport (<it>yp104-103</it> and <it>yp100</it>) [<abbr bid="B41">41</abbr>] (G. Dirix, M. Van Guyse and J.M., unpublished results). For the genes on plasmid pNGR234<it>a</it> of <it>Rhizobium</it> sp. NGR234, an extensive expression analysis is available as well as an annotation of RpoN-dependent promoters [<abbr bid="B42">42</abbr>]. Our screening confirmed the 15 previously annotated RpoN-dependent promoters on pNGR234<it>a</it> (controlling 50 CDSs). All these promoters are upregulated in bacteroids, resembling the NifA-RpoN-dependent symbiosis-specific expression pattern of the nitrogen fixation genes (see Additional data files). In addition to these promoters, eight other matches were found, controlling ten CDSs. Five out of these eight are highly expressed in bacteroids (see Additional data files), indicating that they are probably functional. In the symbiotic region of <it>B. japonicum</it>, 31 matches were found (controlling 87 CDSs) (Table <tblr tid="T2">2</tblr>), which were also previously annotated as RpoN-dependent promoters [<abbr bid="B29">29</abbr>]. Thirteen of these promoters were shown to be functional and controlled by NifA and RpoN (see Additional data files). Furthermore, the fact that most - if not all - common RpoN-controlled genes were picked up for all species screened, including those already characterized in rhizobia, gives additional support to the methodology used. Common RpoN-dependent functions include assimilation of ammonium, transport of C<sub>4</sub>-dicarboxylates and nitrogen fixation (see Additional data files, pages 1-7, 10).</p>
         </sec>
         <sec>
            <st>
               <p>Functional classification of RpoN-dependent genes in the <it>Rhizobiales</it></p>
            </st>
            <p>The matches (also referred to as possible RpoN-binding sites or possible -24/-12-type promoters) obtained from the <it>in silico</it> search were complemented with literature data on RpoN-regulated genes in rhizobia, and classified according to protein function (see Additional data files).</p>
            <p>For the four genomes screened, matches were found upstream of the genes whose products are involved in the uptake and assimilation of ammonia: <it>glnK-amtB</it>, <it>gInBA</it> and <it>glnII</it> (see Additional data files, pages 1,2). A similar regulation and gene organization is found in other &#945;-proteobacteria [<abbr bid="B9">9</abbr>]. <it>B. melitensis</it> differs slightly in that <it>amtB</it> constitutes a monocistronic unit and the <it>glnII</it> ortholog was not identified.</p>
            <p>Possible RpoN-binding sites were found upstream of many nitrogen fixation genes in the six symbiotic, nitrogen-fixing species (<it>R. etli</it>, <it>Rhizobium</it> sp. NGR234, <it>B. japonicum</it>, <it>M. loti</it> MAF303099 and R7A and <it>S. meliloti</it>). Most rhizobial nitrogen fixation genes are controlled by RpoN and the NifA EBP (see Additional data files, pages 2-7). The latter protein is activated under microaerobic conditions and is highly active in bacteroids [<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>].</p>
            <p>A possible RpoN-binding site is also present in the promoter region of <it>dctA</it>, encoding a C<sub>4</sub>-dicarboxylate transporter, in all species screened, except for <it>B. melitensis</it>, where no such gene is present, and <it>R. etli</it>, where it is not present on the symbiotic plasmid (see Additional data files, page 10). In <it>M. loti</it> MAFF303099, -24/-12-type promoters were identified in front of the two C<sub>4</sub>-dicarboxylate transporter genes. Rhizobial <it>dctA</it> mutants are deficient in nitrogen fixation. The expression of <it>dctA</it> is controlled under free-living conditions by RpoN-DctD, whereas in bacteroids, the DctD EBP is (partially) replaced by another unknown EBP. The DctD activator protein is activated by the DctB sensor protein that senses the presence of C<sub>4</sub>-dicarboxylates together with DctA [<abbr bid="B16">16</abbr>].</p>
            <p>During symbiotic nitrogen fixation, hydrogen is produced as an obligate byproduct of the nitrogenase reaction. Some rhizobia possess a hydrogen-uptake (Hup) system that reoxidizes the hydrogen, a process that is thought to increase the efficiency of nitrogen fixation. This is, for instance, the case in <it>B. japonicum</it>, where the -24/-12-type promoters controlling these genes were confirmed in the screening (see Additional data files, pages 7,8). In <it>B. japonicum</it>, HoxA-RpoN is required for <it>hupSL</it> expression under free-living conditions and FixK<sub>2</sub> under symbiotic conditions, whereas in <it>Rhizobium leguminosarum</it>, expression of <it>hup</it> genes is only induced in bacteroids and depends on NifA-RpoN [<abbr bid="B21">21</abbr>,<abbr bid="B43">43</abbr>,<abbr bid="B44">44</abbr>].</p>
            <p>As well as these previously well-characterized RpoN-dependent functions, several other groups of genes were found in the <it>in silico</it> screening. Potential target genes preceded by a possible -24/-12-type promoter belong to the following functional categories: nodulation; transport; detoxification; gene regulation; cell envelope; amino-acid biosynthesis; energy and DNA metabolism; translation; integration and recombination; and various others (see Additional data files, pages 8-28).</p>
            <p>An interesting observation is the large number of hypothetical CDSs preceded by a possible RpoN-binding site (see Additional data files, pages 21-28). Of the 593 gene products identified in the screening, 22% have no function attributed, but do have homologs in other species. Proteins encoded by orphan genes, unique to one species, make up 9% of the total. Some of these hypothetical CDSs having a possible -24/-12-type promoter were picked up in several genomes: <it>atu1754-smc01000-mll0186, smc00999-mll0185, mll9007-y4nG, mll9408-y4nH, orf196-y4vH, orf538-y4vI, orf354-y4vJ, id79-mlr5912-y4vQ-msi288-yp001, yp003-y4wP-yp021-yp099, mlr5819-y4wI, mll5852-msi351, mlr5886-msi324, mlr5914-msi286</it> and <it>msr5928-msi276</it>. The majority of these hypothetical genes in <it>Rhizobium</it> sp. NGR234 are highly expressed in bacteroids [<abbr bid="B42">42</abbr>], suggesting a possible involvement of the putative gene products in the symbiosis.</p>
            <p>Another interesting class of possible RpoN-regulated genes encodes proteins catalyzing detoxification reactions (see Additional data files, pages 12,13). During symbiosis, rhizobia are confronted with toxic compounds and oxidative stress in the form of reactive oxygen species [<abbr bid="B45">45</abbr>]. Effective protection against these harmful agents would thus enhance the symbiotic performance of the rhizobia. Two good candidate proteins to fulfill such a function were identified in this screening: a peroxiredoxin and a cytochrome P450. Peroxiredoxins reduce peroxides to the alcohol and water. The gene encoding the symbiosis-specific peroxiredoxin SpxA is highly expressed in bacteroids of several rhizobial species in a NifA-RpoN-dependent way [<abbr bid="B13">13</abbr>,<abbr bid="B40">40</abbr>,<abbr bid="B42">42</abbr>]. Cytochrome P450S catalyze the monoxygenation of a broad spectrum of substrates, including toxic compounds. Several genes encoding cytochrome P450S have a possible RpoN-binding site in their promoter region. One subclass of these P450S is conserved among several rhizobial species and is highly expressed in bacteroids [<abbr bid="B27">27</abbr>,<abbr bid="B42">42</abbr>]. Expression of the <it>Rhizobium</it> sp. BR816 homolog depends on NifA-RpoN and another, so far unidentified, factor [<abbr bid="B27">27</abbr>]. Furthermore, two matches were found upstream of <it>B. japonicum nrgA</it> and <it>nrgBC</it>. The expression of these genes, possibly involved in detoxification reactions, was shown to be controlled by NifA and RpoN [<abbr bid="B46">46</abbr>].</p>
            <p>Transport systems have an important role at different stages of the rhizobium-legume symbiosis [<abbr bid="B41">41</abbr>,<abbr bid="B47">47</abbr>,<abbr bid="B48">48</abbr>,<abbr bid="B49">49</abbr>]. Many transporter-encoding gene loci are preceded by a possible -24/-12-type promoter (see Additional data files, pages 9-12). The majority of these transporters are of the ABC type, which is consistent with the fact that this type is the most abundant transporter in <it>Rhizobiaceae</it> family [<abbr bid="B33">33</abbr>]. The RpoN-regulated transporters might be involved in symbiotic (as, for instance, DctA) or non-symbiotic (in <it>A. tumefaciens</it>) processes.</p>
            <p>Many regulatory genes have a possible RpoN-binding site in their promoter region (see Additional data files, pages 13,14). The RpoN regulon is known to expand its working range by controlling the expression of other regulatory genes [<abbr bid="B6">6</abbr>,<abbr bid="B7">7</abbr>]. The induction of the <it>y4qH</it> gene (preceded by a possible RpoN-binding site and encoding a LuxR-type transcriptional regulator) in bacteroids of <it>Rhizobium</it> sp. NGR234 illustrates how RpoN might indirectly control the expression of genes involved in symbiosis [<abbr bid="B42">42</abbr>]. Other examples include RpoN dependency of <it>R. leguminosarum fnrN</it>, or the autoregulation of <it>R. etli rpoN</it><sub>2</sub> [<abbr bid="B13">13</abbr>,<abbr bid="B14">14</abbr>]. The latter form of control also appears to be conserved in <it>M. loti</it> MAFF303099 and R7A, where a possible -24/-12-type promoter was identified upstream of the second copy of <it>rpoN</it> (see Additional data files, page 7).</p>
            <p>The gene products of the rhizobial nodulation genes (<it>nol, nod, noe</it>) synthesize and export a class of signal molecules, the so-called Nod factors, involved in the early steps of the symbiotic interaction with leguminous plants. Although the transcription of most nodulation genes depends on the NodD regulator protein in the presence of a suitable plant-root-derived flavonoid, several reports mention an RpoN dependency (to some extent) of some of the nodulation genes ([<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>,<abbr bid="B26">26</abbr>,<abbr bid="B50">50</abbr>] and see Additional data files, pages 8,9). However, the screening did not reveal any matches upstream of rhizobial nodulation genes. Because of the high stringency of the screening, several promoters, such as those of <it>Rhizobium</it> sp. NGR234 <it>nodD</it><sub>2</sub> and <it>S. meliloti nodD</it><sub>3</sub>, may have been missed. It thus seems that less conserved -24/-12-type promoters are functional to some degree. On the other hand, such poorly conserved promoters may be the exception rather than the rule. Nevertheless, the -24/-12-type promoter of the <it>B. japonicum nfeC</it> gene (involved in nodule formation) was confirmed in the screening.</p>
            <p>The other functional categories (see Additional data files, page 14-21) include, among others, cell envelope, amino-acid biosynthesis, energy metabolism, DNA metabolism, translation, and integration and recombination. Unlike the functions discussed above, these categories comprise many different proteins from different bacterial species. It appears that the RpoN dependency of the encoding genes in these categories is not conserved in the <it>Rhizobiales</it>. This might reflect a species-specific recruitment of RpoN for gene transcription.</p>
         </sec>
         <sec>
            <st>
               <p>Signals controlling RpoN-dependent gene expression in the <it>Rhizobiales</it></p>
            </st>
            <p>What signals control the activity of EBPs in species of the <it>Rhizobiales</it>? An <it>in silico</it> screening for EBPs in the proteomes of members of the <it>Rhizobiales</it> (see Materials and methods) revealed that the number of candidate EBPs is proportional to the genome size (Table <tblr tid="T2">2</tblr>). <it>M. loti, S. meliloti, A. tumefaciens</it> and <it>B. melitensis</it> all have roughly 1 EBP per million bases (Mb) (8, 7, 6 and 3, respectively), a number differing from a previous estimate (13, 12 and 9, respectively; the <it>B. melitensis</it> genome was not published at the time) [<abbr bid="B33">33</abbr>]. However, our predicted EBPs scored significantly better (E-value 10<sup>-25</sup> or less) with the Pfam Sigma54_activat domain HMM motif than the best of the rejected proteins (E-value 10<sup>-5</sup> or greater), giving confidence to our prediction. Each of the candidate EBPs was subjected to a BLASTP search (default settings) against the NCBI protein databases and classified according to homology (Table <tblr tid="T3">3</tblr>). A similar analysis of <it>Pseudomonas aeruginosa, Salmonella typhimurium</it> and <it>E. coli</it> proteomes revealed that these bacteria have a higher number of possible EBPs (22, 15 and 13, respectively) than the species of the <it>Rhizobiales</it> (data not shown).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Enhancer binding proteins (EBPs) of different members of the <it>Rhizobiales</it></p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Re</p>
                     </c>
                     <c ca="left">
                        <p>NGR</p>
                     </c>
                     <c ca="left">
                        <p>Bj</p>
                     </c>
                     <c ca="left">
                        <p>R7A</p>
                     </c>
                     <c ca="left">
                        <p>Ml</p>
                     </c>
                     <c ca="left">
                        <p>Sm</p>
                     </c>
                     <c ca="left">
                        <p>At</p>
                     </c>
                     <c ca="left">
                        <p>Bm</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NtrC</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mlr0389 (14021374)</p>
                     </c>
                     <c ca="left">
                        <p>SMc01043 (15074392)</p>
                     </c>
                     <c ca="left">
                        <p>Atu1446 (17739866)</p>
                     </c>
                     <c ca="left">
                        <p>BmeI0866 (17987149)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NtrX</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mlr0400 (14021376)</p>
                     </c>
                     <c ca="left">
                        <p>SMc01045 (15074394)</p>
                     </c>
                     <c ca="left">
                        <p>Atu1448 (17739869)</p>
                     </c>
                     <c ca="left">
                        <p>BmeI0868 (17987151)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DctD-1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Msi357 (20804186)</p>
                     </c>
                     <c ca="left">
                        <p>Mlr5842 (14025622)</p>
                     </c>
                     <c ca="left">
                        <p>SMb20613 (15141413)</p>
                     </c>
                     <c ca="left">
                        <p>Atu3296 (17741681)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DctD-2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mlr7239 (14026791)</p>
                     </c>
                     <c ca="left">
                        <p>SMb21200 (15140751)</p>
                     </c>
                     <c ca="left">
                        <p>Atu0114 (17738434)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NifA</p>
                     </c>
                     <c ca="left">
                        <p>RE1SP0000735 (21492735)</p>
                     </c>
                     <c ca="left">
                        <p>Y4uN (16519979)</p>
                     </c>
                     <c ca="left">
                        <p>ID736 (12620732)</p>
                     </c>
                     <c ca="left">
                        <p>Msi346 (20804175)</p>
                     </c>
                     <c ca="left">
                        <p>Mll5857 (14025634)</p>
                     </c>
                     <c ca="left">
                        <p>SMa0815 (14523538)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Msi361 (20804190)</p>
                     </c>
                     <c ca="left">
                        <p>Mll5837 (14025617)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>EBP-1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mll3001 (14023386)</p>
                     </c>
                     <c ca="left">
                        <p>SMc04011 (15075819)</p>
                     </c>
                     <c ca="left">
                        <p>Atu3764 (17742191)</p>
                     </c>
                     <c ca="left">
                        <p>BmeII0011 (17988355)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>EBP-2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>SMb20102 (15139974)</p>
                     </c>
                     <c ca="left">
                        <p>Atu4741 (17743247)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>EBP-3</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Y4pA (16519894)</p>
                     </c>
                     <c ca="left">
                        <p>ID407 (12620599)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mll9384 (14028438)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>EBPs were predicted using HMMER 2.2 g (see Materials and methods). The species acronyms are as in Table <tblr tid="T2">2</tblr>. Orthologs (having at least 30% amino-acid identity over the full length of the polypeptide) are situated on the same row (GenBank GI number). EBP-n: uncharacterized EBPs.</p>
               </tblfn>
            </tbl>
            <p>All four genomes screened have NtrC and NtrX orthologs (Table <tblr tid="T3">3</tblr>), which are both nitrogen-responsive EBPs [<abbr bid="B9">9</abbr>]. Rhizobial NtrC-RpoN-regulated genes are downregulated early in bacteroid differentiation [<abbr bid="B51">51</abbr>,<abbr bid="B52">52</abbr>]. This downregulation seems to be necessary for an effective symbiosis as ectopic expression of <it>R. etli amtB</it> alters the bacteroid differentiation process and the ability to enter the host cells [<abbr bid="B53">53</abbr>].</p>
            <p><it>M. loti, S. meliloti</it> and <it>A. tumefaciens</it> all have two DctD EBP paralogs (Table <tblr tid="T3">3</tblr>). DctD is activated by the DctB sensor protein, which senses C<sub>4</sub>-dicarboxylates in collaboration with the DctA C<sub>4</sub>-dicarboxylate transporter. The second <it>dctD</it> copy might encode the so-called alternative symbiotic activator, ensuring <it>dctA</it> expression in <it>S. meliloti</it> and <it>R. leguminosarum dctD</it> mutants during symbiosis [<abbr bid="B16">16</abbr>].</p>
            <p>All nitrogen-fixing species possess a NifA homolog - <it>M. loti</it> MAFF303099 and R7A even have two (Table <tblr tid="T3">3</tblr>). Rhizobial NifA proteins are activated by the low oxygen tension in the nodules, which is the key signal controlling expression of the nitrogen fixation genes and a prerequisite for nitrogen fixation [<abbr bid="B54">54</abbr>]. NifA activates RpoN-dependent transcription of many rhizobial genes, including nitrogen fixation genes (see Additional data files). For the NifA-RpoN-dependent expression of some genes, microaerobiosis alone is not sufficient to equal the expression levels observed in bacteroids (see Additional data files, pages 2,4,6,7,13,24) [<abbr bid="B27">27</abbr>,<abbr bid="B39">39</abbr>]. An additional signal appears to control the optimal functioning of NifA.</p>
            <p>The presence of additional uncharacterized EBPs suggests that other signals control RpoN-dependent gene expression in the <it>Rhizobiales</it> (Table <tblr tid="T3">3</tblr>). These signals could control expression of genes involved in symbiosis (<it>M. loti</it> and <it>S. meliloti</it>) or other, so far uncharacterized, functions (<it>A. tumefaciens</it> and <it>B. melitensis</it>). One common signal might control the activity of EBP-1, present in the four genomes (Table <tblr tid="T3">3</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Comparison between symbiotic and non-symbiotic <it>Rhizobiales</it></p>
            </st>
            <p>Globally, a higher number of possible RpoN-binding sites per megabases of genome is observed in symbiotic members of the <it>Rhizobiales</it> than in non-symbiotic ones (Table <tblr tid="T2">2</tblr>). This ratio is even higher in the symbiotic regions and that might reflect the recruitment of NifA-RpoN-dependent promoters, which are highly upregulated in bacteroids, for late symbiotic functions. RpoN is not essential for bacterial survival, but controls genes whose products have a wide range of different functions [<abbr bid="B6">6</abbr>]. Apparently, there has been a selection in bacteroids for the NifA-RpoN-type promoter, not only for nitrogen fixation but for other symbiotic functions as well. This is illustrated by the high number of non-<it>nif/fix</it> genes on pNGR234<it>a</it> preceded by a -24/-12-type promoter and preferentially expressed in bacteroids (see Additional data files) [<abbr bid="B42">42</abbr>]. The symbiotic island of <it>M. loti</it> R7A is about 109 kb smaller than that of MAFF303099 and the two areas share only a conserved backbone of 248 kb [<abbr bid="B30">30</abbr>]. This distinction seems not to be reflected by the RpoN-regulon, as the majority of possible RpoN-dependent loci (75%) on the two islands is conserved (see Additional data files).</p>
            <p>Although an <it>A. tumefaciens rpoN</it> mutant is not affected in tumorigenesis, expression of <it>vir</it> genes, chemotaxis or flagellum formation, it does not grow on nitrate as sole nitrogen source nor on C<sub>4</sub>-dicarboxylates as sole carbon source, and grows only very poorly on arginine as sole nitrogen source [<abbr bid="B35">35</abbr>]. We were able to identify possible RpoN-binding sites in the promoter regions of the <it>dctA</it> gene, the <it>gInK-amtB, glnBA, glnII</it> and <it>nrtABC-nirBD</it> loci (see Additional data files, pages 1,2,10,12,16), providing genetic evidence for the previously observed phenotypes. As could be expected, no virulence, tumorigenesis, chemotaxis or flagellar genes were picked up in screening, although a good match was found upstream of <it>motB</it>, which encodes a flagellar motor protein. Strikingly, in <it>Agrobacterium</it> the total number of gene products whose genes are preceded by a possible -24/-12-type promoter is approximately equal to that of <it>Sinorhizobium</it> (Table <tblr tid="T2">2</tblr>). <it>S. meliloti</it> and <it>A. tumefaciens</it> are closely related at the level of protein identity as well as nucleotide colinearity and conservation of gene order [<abbr bid="B33">33</abbr>]. This relatedness appears to be somewhat conserved at the regulatory level, as <it>S. meliloti</it> and <it>A. tumefaciens</it> share seven predicted RpoN-binding sites upstream of the same gene loci: <it>dctA, glnK-amtB, glnBA, glnll, nrtABC, atu1754-smc01000</it> and <it>atu4739-smb20103</it> (see Additional data files, pages 1,2,10,12,18,21,24). The relatively high number of positive matches in <it>A. tumefaciens</it> and the lack of tumorigenesis-related phenotype in the <it>rpoN</it> mutant hints to other, so far unidentified, RpoN-controlled functions. Indicative of this is the number (22) of possible RpoN-binding sites upstream of hypothetical genes in <it>A. tumefaciens</it> (see Additional data files, pages 21,26). Of the predicted proteins of <it>A. tumefaciens</it>, 16% are not present in <it>M. loti</it> or <it>S. meliloti</it> and 89% of these are hypothetical gene products [<abbr bid="B33">33</abbr>], leaving room for genetic diversity between rhizobia and <it>A. tumefaciens.</it></p>
            <p><it>B. melitensis</it> is an intracellular pathogen of humans and animals, and is closely related to rhizobia and <it>A. tumefaciens</it> [<abbr bid="B55">55</abbr>]. Only 12 matches resulted from the <it>in silico</it> analysis for possible RpoN-binding sites (Table <tblr tid="T2">2</tblr>), two of which were found in the upstream region of <it>amtB</it> and <it>glnBA</it> (see Additional data files, pages 1,2). At present, no report of a <it>B. melitensis rpoN</it> mutant is available. A good candidate function for RpoN to regulate in <it>B. melitensis</it> would be pathogenesis, as is the case for several other pathogens [<abbr bid="B6">6</abbr>]. Six possible -24/-12-type promoters were found in the <it>B. melitensis</it> genome upstream of hypothetical CDSs (see Additional data files, pages 21,22,26).</p>
            <p><it>E. coli</it> was estimated to have around 30 -24/-12-type promoters, controlling mainly functions related to nitrogen metabolism [<abbr bid="B7">7</abbr>]. Although there is some overlap of RpoN-controlled functions between <it>E. coli</it> and members of the <it>Rhizobiales</it> (for example, ammonium assimilation and hydrogen oxidation), RpoN seems to control a more diverse set of functions in the latter. This appears to somewhat contradict the observation that <it>E. coli</it> possesses a higher number of EBPs (see 'Signals controlling RpoN-dependent gene expression in the <it>Rhizobiales</it>').</p>
         </sec>
         <sec>
            <st>
               <p>Additional control mechanisms involving RpoN</p>
            </st>
            <p>Expression studies of <it>rpoN</it> in wild-type and <it>rpoN</it><sup>-</sup> backgrounds revealed that RpoN negatively controls its own expression in <it>B. japonicum, Acinetobacter calcoaceticus, R. etli</it> and <it>R. leguminosarum</it> bv. <it>viciae</it> [<abbr bid="B14">14</abbr>,<abbr bid="B56">56</abbr>,<abbr bid="B57">57</abbr>,<abbr bid="B58">58</abbr>]. The presence of a putative oppositely oriented RpoN-binding site on the template strand of <it>rpoN</it> promoter regions, overlapping with the -10 promoter region and the transcription start site, was proposed [<abbr bid="B58">58</abbr>]. It was stated that, as RpoN is able to bind to -24/-12-type promoters in the absence of core-RNAP, the negative autoregulation of the <it>rpoN</it> genes occurs by direct interference of RpoN with the binding of &#963;<sup>70</sup>-holo-RNAP complex to the -10 promoter region of the <it>rpoN</it> gene. Site-directed mutagenesis of the highly conserved GG to TT in the putative RpoN-binding site of the <it>rpoN</it> promoter relieved the negative autoregulation, giving strong support to the above hypothesis [<abbr bid="B14">14</abbr>].</p>
            <p>The <it>in silico</it> screening revealed the presence of oppositely oriented possible RpoN-binding sites upstream of the <it>rpoN</it> genes of <it>M. loti, S. meliloti, A. tumefaciens</it> and <it>B. melitensis</it> (Table <tblr tid="T2">2</tblr>). A comparison with the <it>rpoN</it> coding sequences of <it>R. etli, R. leguminosarum, Rhizobium</it> sp. NGR234, <it>B. japonicum, M. loti</it> and <it>S. meliloti</it> revealed that the <it>rpoN</it> genes of <it>A. tumefaciens</it> and <it>B. melitensis</it> were incorrectly annotated. Their coding sequences should be 63 bp longer and shorter, respectively. An alignment of the <it>rpoN</it> promoter regions of these species shows the strong conservation of these promoters (Figure <figr fid="F1">1</figr>). In addition, the screening of the lower strand of the intergenic sequences revealed a slightly lower number of matches than that of the correctly oriented RpoN-binding sites (Table <tblr tid="T2">2</tblr>). It is thus not unconceivable that RpoN could alter the expression of these genes in a way similar to its own autoregulation, that is, by interference with the binding of the holo-RNAP or a regulatory protein to the promoter. This would significantly broaden the working domain of RpoN.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Manual alignment of <it>rpoN</it> promoter sequences</p>
               </caption>
               <text>
                  <p>Manual alignment of <it>rpoN</it> promoter sequences. At (<it>Agrobacterium tumefaciens</it>, GI: 17738659); Bj (<it>Bradyrhizobium japonicum</it>, GI: 152137); Bm (<it>Brucella melitensis</it>, GI: 17983821); Ml (<it>Mesorhizobium loti</it>, GI: 14023393); NGR (<it>Rhizobium</it> sp. NGR234, GI: 152431); Re (<it>Rhizobium etli</it>, GI: 1046228), Rl (<it>Rhizobium leguminosarum</it> bv. <it>viciae</it>, GI: 5759116), Sm (<it>Sinorhizobium meliloti</it>, GI: 152389). Upper line, -35 and -10 consensus sequences of <it>Escherichia coli</it> and the transcription start site (*) as determined for <it>S. meliloti</it> [<abbr bid="B69">69</abbr>]. Lower line, consensus sequence of -24/-12-type promoter [<abbr bid="B1">1</abbr>]. Nucleotides are shaded in black (100% conserved) or gray (75% conserved). The numbers represent the distance (in bp) from the end of the alignment to the start codon of the downstream <it>rpoN</it> gene.</p>
               </text>
               <graphic file="gb-2002-3-12-research0076-1"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>A highly specific <it>in silico</it> screening method was applied to predict members of the RpoN-regulon in eight different species of the <it>Rhizobiales</it>. The matches obtained were individually checked and classified according to protein function, resulting in a highly annotated and manually curated dataset. This dataset was complemented with available literature data on members of the RpoN-regulon in <it>Rhizobiales</it>. In addition, a screening was carried out to identify possible EBPs controlling the expression of RpoN-dependent genes. Together, these data serve as a source of exhaustive information on the (possible) roles of RpoN in symbiotic and non-symbiotic processes.</p>
         <p>RpoN-binding sites were found upstream of genes involved in common RpoN-dependent functions, such as assimilation of ammonium and uptake of C<sub>4</sub>-dicarboxylic acids. The symbiotic members of the <it>Rhizobiales</it> seem to have recruited RpoN for the expression of nitrogen fixation and other symbiotic genes. This is illustrated by the high number of possible RpoN-binding sites in the symbiotic regions of these bacteria. Other RpoN-dependent symbiotic functions might include detoxification and transport or might be controlled indirectly through other regulatory proteins. Whereas an <it>A. tumefaciens rpoN</it> mutant only displays common RpoN-dependent phenotypes, the relatively high number of possible RpoN-binding sites present in its genome points to several other, yet unidentified, RpoN-dependent functions. So far, no reports are present on RpoN-dependent phenotypes in <it>B. melitensis</it>. This animal pathogen has a significantly lower number of possible RpoN-binding sites than the other members of the <it>Rhizobiales. B. melitensis</it> RpoN might be required for infection of the host organism, as is the case for other pathogens. Furthermore, the species screened seem to have recruited RpoN independently, in a species-specific manner, for the transcription of different gene sets. The high percentage of hypothetical conserved and non-conserved CDSs preceded by a possible RpoN-binding site opens up ample opportunities for future research. Several uncharacterized EBPs were identified besides the 'classic' EBPs such as NtrC, NtrX, NifA and DctD. This implies that signals other than nitrogen, oxygen and C<sub>4</sub>-dicarboxylates control the expression of RpoN-dependent genes in species of the <it>Rhizobiales.</it> Identification of these signals will give better insight into yet uncharacterized RpoN-dependent functions. Finally, a similar number of possible RpoN-binding sites were found on the lower strand of the upstream intergenic sequences. RpoN might thus significantly extend its working domain by blocking the binding of transcription regulatory factors. Such is the case, for instance, in the negative autoregulation of <it>rpoN</it>.</p>
         <p>Although much consideration was given in our analysis to the design and, to some extent, the experimental validation of the approach, experimental confirmation will ultimately be required to establish the biological meaning of the predicted -24/-12-type promoters, as is the case with all computer predictions.</p>
         <p>In conclusion, a highly efficient method was applied to predict the RpoN-regulon of different members of the <it>Rhizobiales</it> group. The same approach might be used for the prediction of RpoN-dependent genes in other bacterial species.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Retrieval of intergenic sequences</p>
            </st>
            <p>Complete DNA sequences from <it>A. tumefaciens</it> C58 (circular chromosome: NC_003304, linear chromosome NC_003305, plasmid AT: NC_003306, plasmid Ti: NC_003308), <it>B. japonicum</it> USDA110 (symbiotic chromosomal region: AF322012 and AF322013), <it>B. melitensis</it> 16 M (chromosome I: NC_003317, chromosome II: NC_003318), <it>M. loti</it> MAFF303099 (chromosome: NC_002678, plasmid pMla: NC_002679, plasmid pMlb: NC_002682), <it>M. loti</it> R7A (symbiotic island: AL672111), <it>Rhizobium</it> sp. NGR234 (plasmid pNGR234<it>a</it>: NC_000914), <it>R. etli</it> CFN42 (plasmid p42d: NC_004041) and <it>S. meliloti</it> 1021 (chromosome: NC_003047, plasmid pSymA: NC_003037, plasmid pSymB: NC_003078) were extracted from GenBank. Intergenic sequences were identified by automatically parsing the corresponding GenBank files using the modules of INCLUsive [<abbr bid="B59">59</abbr>,<abbr bid="B60">60</abbr>]. An intergenic region is defined as the non-coding region between two genes. Intergenic regions smaller than 10 nucleotides were discarded, as the corresponding genes are likely to belong to an operon.</p>
         </sec>
         <sec>
            <st>
               <p>Prediction of possible RpoN-binding sites</p>
            </st>
            <p>The intergenic regions were screened with the PATSER module of the Regulatory Sequence Analysis Tools (RSAT) [<abbr bid="B61">61</abbr>,<abbr bid="B62">62</abbr>,<abbr bid="B63">63</abbr>] for the presence of the -24/-12 promoter consensus sequence. PATSER scores N-mers (in this case 16-mers) from a sequence against a given weight matrix. A set of 186 RpoN-dependent promoters from different bacterial species [<abbr bid="B1">1</abbr>] was used to generate the weight matrix (Table <tblr tid="T1">1</tblr>). Initially, this matrix was trained against 67 -24/-12-type promoters with a known transcriptional start site. From the distribution of these scores (Figure <figr fid="F2">2</figr>), it was decided to retain all matches with a score higher than or equal to the fifth percentile (8.9). PATSER was run with the GC content of the intergenic sequences as a measure for the <it>a priori</it> probabilities of the nucleotides. The intergenic GC content differs markedly from the total GC content of the genomes (Table <tblr tid="T2">2</tblr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Distribution of PATSER scores (see Material and methods) of 67 -24/-12-type promoters with mapped transcriptional start sites.</p>
               </caption>
               <text>
                  <p>Distribution of PATSER scores (see Material and methods) of 67 -24/-12-type promoters with mapped transcriptional start sites. (GenBank GI number: 2979503, 141885, 141892, 38664, 38679, 1769418, 142336, 142378, 142326, 39977, 550310, 408911, 39532, 39516, 39526, 152106, 152283, 152315, 39548, 312974, 12620419, 152100, 152280, 152317, 2316081, 144194, 896457, 144223, 262651, 3493239, 7208421, 262651, 145911, 556890, 41774, 146158, 1004098, 41568, 42538, 149241, 43802, 149252, 149256, 149246, 43857, 43864, 149273, 149275, 150095, 950651, 150993, 490170, 6492432, 151643, 6636054, 46254, 152305, 152230, 340664, 46285, 46324, 46324, 550144, 550144, 664946, 453435, 1649033). Cumulative percentage: black line.</p>
               </text>
               <graphic file="gb-2002-3-12-research0076-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Prediction of possible EBPs</p>
            </st>
            <p>An estimate of the number of possible EBPs in the respective proteomes was obtained by looking for the presence of the Pfam Sigma54_activat domain [<abbr bid="B64">64</abbr>]. Sigma54_activat is a conserved domain present in EBPs that is involved in the ATP-dependent interaction with RpoN. The predicted proteome sequences of <it>A. tumefaciens, B. melitensis, S. meliloti</it> and <it>M. loti</it> MAFF303099 were downloaded from Proteome Analysis @ EBI [<abbr bid="B65">65</abbr>] and the protein sequences of the symbiotic regions of <it>B. japonicum, M. loti</it> R7A, <it>Rhizobium</it> sp. NGR234 and <it>R. etli</it> were obtained from the NCBI protein database [<abbr bid="B66">66</abbr>]. The protein sequences were subsequently queried with the PF00158 HMM motif using HMMER 2.2 g [<abbr bid="B67">67</abbr>]. Matches with an E-value less than or equal to 10<sup>-25</sup> were retained.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>An <supplr sid="s1">additional data file</supplr> listing all genes and proteins included in this analysis is available. Each protein is accompanied by a functional description, the species from which it comes, GenBank GI number and transcription unit; additional references for each protein are also included in the file. Information on the gene's regulation is provided where available. The data were obtained from an <it>in silico</it> analysis (see Materials and methods) and the literature [<abbr bid="B12">12</abbr>,<abbr bid="B13">13</abbr>,<abbr bid="B14">14</abbr>,<abbr bid="B17">17</abbr>,<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>,<abbr bid="B21">21</abbr>,<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>,<abbr bid="B25">25</abbr>,<abbr bid="B26">26</abbr>,<abbr bid="B27">27</abbr>,<abbr bid="B29">29</abbr>,<abbr bid="B30">30</abbr>,<abbr bid="B31">31</abbr>,<abbr bid="B32">32</abbr>,<abbr bid="B33">33</abbr><abbr bid="B34">34</abbr>,<abbr bid="B39">39</abbr>,<abbr bid="B40">40</abbr>,<abbr bid="B41">41</abbr>,<abbr bid="B42">42</abbr>,<abbr bid="B43">43</abbr>,<abbr bid="B44">44</abbr>,<abbr bid="B46">46</abbr>,<abbr bid="B50">50</abbr>,<abbr bid="B51">51</abbr>,<abbr bid="B56">56</abbr>,<abbr bid="B58">58</abbr>,<abbr bid="B70">70</abbr>,<abbr bid="B71">71</abbr>,<abbr bid="B72">72</abbr>,<abbr bid="B73">73</abbr>,<abbr bid="B74">74</abbr>,<abbr bid="B75">75</abbr>,<abbr bid="B76">76</abbr>,<abbr bid="B77">77</abbr>,<abbr bid="B78">78</abbr>,<abbr bid="B79">79</abbr>,<abbr bid="B80">80</abbr>,<abbr bid="B81">81</abbr>,<abbr bid="B82">82</abbr>,<abbr bid="B83">83</abbr>,<abbr bid="B84">84</abbr>,<abbr bid="B85">85</abbr>,<abbr bid="B86">86</abbr>,<abbr bid="B87">87</abbr>,<abbr bid="B88">88</abbr>].</p>
         <suppl id="s1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>An additional data file listing all genes and proteins included in this analysi </p>
            </caption>
            <text>
               <p>An additional data file listing all genes and proteins included in this analysis</p>
            </text>
            <file name="gb-2002-3-12-research0076-s1.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank I. Van Cauteren and G. Thijs for computational assistance. B.D. is a recipient of a postdoctoral fellowship of the Onderzoeksfonds Katholieke Universiteit Leuven. K.M. is a postdoctoral fellow of the Fund for Scientific Research (Flanders). We acknowledge financial support from the Flemish government (GOA98 to J.V.).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Compilation and analysis of sigma 54-dependent promoter sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Barrios</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Valderrama</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Morett</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>4305</fpage>
            <lpage>4313</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148710</pubid>
                  <pubid idtype="pmpid" link="fulltext">10536136</pubid>
                  <pubid idtype="doi">10.1093/nar/27.22.4305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The bacterial enhancer-dependent sigma(54) (sigma(N)) transcription factor.</p>
            </title>
            <aug>
               <au>
                  <snm>Buck</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gallegos</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Gralla</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2000</pubdate>
            <volume>182</volume>
            <fpage>4129</fpage>
            <lpage>4136</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">101881</pubid>
                  <pubid idtype="pmpid" link="fulltext">10894718</pubid>
                  <pubid idtype="doi">10.1128/JB.182.15.4129-4136.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Expression of sigma 54 (<it>ntrA</it>)-dependent genes is probably united by a common mechanism.</p>
            </title>
            <aug>
               <au>
                  <snm>Kustu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Santero</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Keener</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Popham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1989</pubdate>
            <volume>53</volume>
            <fpage>367</fpage>
            <lpage>376</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2677638</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The sigma 54 bacterial enhancer-binding protein family: mechanism of action and phylogenetic relationship of their functional domains.</p>
            </title>
            <aug>
               <au>
                  <snm>Morett</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Segovia</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1993</pubdate>
            <volume>175</volume>
            <fpage>6067</fpage>
            <lpage>6074</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">206698</pubid>
                  <pubid idtype="pmpid">8407777</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Signal sensing by sigma 54-dependent regulators: derepression as a control mechanism.</p>
            </title>
            <aug>
               <au>
                  <snm>Shingler</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1996</pubdate>
            <volume>19</volume>
            <fpage>409</fpage>
            <lpage>416</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1996.388920.x</pubid>
                  <pubid idtype="pmpid">8830233</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The biology of enhancer-dependent transcriptional regulation in bacteria: insights from genome sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Buck</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2000</pubdate>
            <volume>186</volume>
            <fpage>1</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1097(00)00082-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10779705</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Metabolic context and possible physiological themes of sigma(54)-dependent genes in <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Reitzer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>BL</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2001</pubdate>
            <volume>65</volume>
            <fpage>422</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99035</pubid>
                  <pubid idtype="pmpid" link="fulltext">11528004</pubid>
                  <pubid idtype="doi">10.1128/MMBR.65.3.422-444.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Sigma(54), a vital protein for <it>Myxococcus xanthus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Keseler</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Kaiser</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>1979</fpage>
            <lpage>1984</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20028</pubid>
                  <pubid idtype="pmpid" link="fulltext">9050890</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.5.1979</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>P-II signal transduction proteins, pivotal players in microbial nitrogen control.</p>
            </title>
            <aug>
               <au>
                  <snm>Arcond&#233;guy</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jack</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Merrick</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2001</pubdate>
            <volume>65</volume>
            <fpage>80</fpage>
            <lpage>105</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99019</pubid>
                  <pubid idtype="pmpid" link="fulltext">11238986</pubid>
                  <pubid idtype="doi">10.1128/MMBR.65.1.80-105.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p><it>Rhizobium meliloti ntrA</it> (<it>rpoN</it>) gene is required for diverse metabolic functions.</p>
            </title>
            <aug>
               <au>
                  <snm>Ronson</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Nixon</snm>
                  <fnm>BT</fnm>
               </au>
               <au>
                  <snm>Albright</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Ausubel</snm>
                  <fnm>FM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1987</pubdate>
            <volume>169</volume>
            <fpage>2424</fpage>
            <lpage>2431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">212082</pubid>
                  <pubid idtype="pmpid">3034856</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Sequence and analysis of the <it>rpoN</it> sigma factor gene of <it>Rhizobium</it> sp. strain NGR234, a primary coregulator of symbiosis.</p>
            </title>
            <aug>
               <au>
                  <snm>van Slooten</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Cervantes</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Broughton</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Stanley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1990</pubdate>
            <volume>172</volume>
            <fpage>5563</fpage>
            <lpage>5574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">213034</pubid>
                  <pubid idtype="pmpid">2211497</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p><it>Azorhizobium caulinodans</it> nitrogen fixation (<it>nif/fix</it>) gene regulation: mutagenesis of the <it>nifA</it> -24/-12 promoter element, characterization of a <it>ntrA</it>(<it>rpoN</it>) gene, and derivation of a model.</p>
            </title>
            <aug>
               <au>
                  <snm>Stigter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>de Bruijn</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Mol Plant-Microbe Interact</source>
            <pubdate>1993</pubdate>
            <volume>6</volume>
            <fpage>238</fpage>
            <lpage>252</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8471796</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Differential regulation of <it>Rhizobium etli rpoN2</it> gene expression during symbiosis and free-living growth.</p>
            </title>
            <aug>
               <au>
                  <snm>Michiels</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moris</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dombrecht</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Verreth</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Vanderleyden</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1998</pubdate>
            <volume>180</volume>
            <fpage>3620</fpage>
            <lpage>3628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">107331</pubid>
                  <pubid idtype="pmpid" link="fulltext">9658006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>RpoN of <it>Rhizobium leguminosarum</it> bv. <it>viciae</it> strain VF39SM plays a central role in FnrN-dependent microaerobic regulation of genes involved in nitrogen fixation.</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Oresnik</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Hynes</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>2001</pubdate>
            <volume>264</volume>
            <fpage>623</fpage>
            <lpage>633</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380000348</pubid>
                  <pubid idtype="pmpid">11212917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Genetic regulation of nitrogen fixation in rhizobia.</p>
            </title>
            <aug>
               <au>
                  <snm>Fischer</snm>
                  <fnm>HM</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1994</pubdate>
            <volume>58</volume>
            <fpage>352</fpage>
            <lpage>386</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7968919</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A survey of symbiotic nitrogen fixation by rhizobia.</p>
            </title>
            <aug>
               <au>
                  <snm>Kaminski</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Batut</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Boistard</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>In The Rhizobiaceae, Molecular Biology of Model Plant-Associated Bacteria</source>
            <publisher>Dordrecht: Kluwer</publisher>
            <editor>Spaink HP, Kondorosi A, Hooykaas PJJ</editor>
            <pubdate>1998</pubdate>
            <fpage>431</fpage>
            <lpage>460</lpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Transcription of a <it>Rhizobium leguminosarum</it> biovar <it>phaseoli</it> gene needed for melanin synthesis is activated by NifA of <it>Rhizobium</it> and <it>Klebsiella pneumoniae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hawkins</snm>
                  <fnm>FK</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>AW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1988</pubdate>
            <volume>2</volume>
            <fpage>331</fpage>
            <lpage>337</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3041240</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Nucleotide sequence and characterization of <it>Rhizobium meliloti</it> nodulation competitiveness genes <it>nfe</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Soto</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Zorzano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mercado-Blanco</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lepek</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Olivares</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Toro</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1993</pubdate>
            <volume>229</volume>
            <fpage>570</fpage>
            <lpage>576</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1993.1060</pubid>
                  <pubid idtype="pmpid" link="fulltext">8429568</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identification and characterization of a novel <it>Bradyrhizobium japonicum</it> gene involved in host-specific nitrogen fixation.</p>
            </title>
            <aug>
               <au>
                  <snm>Chun</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Sexton</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Stacey</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1994</pubdate>
            <volume>176</volume>
            <fpage>6717</fpage>
            <lpage>6729</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">197029</pubid>
                  <pubid idtype="pmpid">7961425</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A <it>Bradyrhizobium japonicum</it> gene essential for nodulation competitiveness is differentially regulated from two promoters.</p>
            </title>
            <aug>
               <au>
                  <snm>Chun</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Stacey</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Plant Microbe Interact</source>
            <pubdate>1994</pubdate>
            <volume>7</volume>
            <fpage>248</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8012043</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Hydrogenase genes from <it>Rhizobium leguminosarum</it> bv. <it>viciae</it> are controlled by the nitrogen fixation regulatory protein NifA.</p>
            </title>
            <aug>
               <au>
                  <snm>Brito</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fernandez</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rey</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cabrera</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Palacios</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Imperial</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ruiz-Argueso</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>6019</fpage>
            <lpage>6024</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20993</pubid>
                  <pubid idtype="pmpid" link="fulltext">9177161</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.12.6019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p><it>nodD2</it> of <it>Rhizobium</it> sp NGR234 is involved in the repression of the <it>nodABC</it> operon.</p>
            </title>
            <aug>
               <au>
                  <snm>Fellay</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hanin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Montorzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Frey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Freiberg</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Golinowski</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Staehelin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Broughton</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Jabbouri</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>27</volume>
            <fpage>1039</fpage>
            <lpage>1050</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1998.00761.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9535093</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The <it>Rhizobium</it> sp. BR816 <it>nodD3</it> gene is regulated by a transcriptional regulator of the AraC/XylS family.</p>
            </title>
            <aug>
               <au>
                  <snm>Vlassak</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>de Wilde</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Snoeck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Luyten</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>van Rhijn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vanderleyden</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1998</pubdate>
            <volume>258</volume>
            <fpage>558</fpage>
            <lpage>561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380050768</pubid>
                  <pubid idtype="pmpid">9669339</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The upstream region of the <it>nodD3</it> gene of <it>Sinorhizobium meliloti</it> carries enhancer sequences for the transcriptional activator NtrC.</p>
            </title>
            <aug>
               <au>
                  <snm>Dusha</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Austin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dixon</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1999</pubdate>
            <volume>179</volume>
            <fpage>491</fpage>
            <lpage>499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1097(99)00450-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10518756</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p><it>Sinorhizobium meliloti nfe</it> (nodulation formation efficiency) genes exhibit temporal and spatial expression patterns similar to those of genes involved in symbiotic nitrogen fixation.</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia-Rodr&#237;guez</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Toro</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Plant Microbe Interact</source>
            <pubdate>2000</pubdate>
            <volume>13</volume>
            <fpage>583</fpage>
            <lpage>591</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10830257</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Dual control of the <it>nodA</it> operon of <it>Azorhizobium caulinodans</it> ORS571 by a nod box and a NifA-sigma54-type promoter.</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>D'Haeze</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>De Rycke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Holsters</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2001</pubdate>
            <volume>265</volume>
            <fpage>1050</fpage>
            <lpage>1059</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380100500</pubid>
                  <pubid idtype="pmpid" link="fulltext">11523777</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Analysis of a symbiosis-specific cytochrome P450 homolog in <it>Rhizobium</it> sp. BR816.</p>
            </title>
            <aug>
               <au>
                  <snm>Luyten</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Swinnen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Vlassak</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Verreth</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dombrecht</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vanderleyden</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Plant Microbe Interact</source>
            <pubdate>2001</pubdate>
            <volume>14</volume>
            <fpage>918</fpage>
            <lpage>924</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11437267</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Molecular basis of symbiosis between <it>Rhizobium</it> and legumes.</p>
            </title>
            <aug>
               <au>
                  <snm>Freiberg</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fellay</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Broughton</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Rosenthal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Perret</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>394</fpage>
            <lpage>401</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/387394a0</pubid>
                  <pubid idtype="pmpid">9163424</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Potential symbiosis-specific genes uncovered by sequencing a 410-kilobase DNA region of the <it>Bradyrhizobium japonicum</it> chromosome.</p>
            </title>
            <aug>
               <au>
                  <snm>G&#246;ttfert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rothlisberger</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kundig</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marty</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hennecke</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>1405</fpage>
            <lpage>1412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">95015</pubid>
                  <pubid idtype="pmpid" link="fulltext">11157954</pubid>
                  <pubid idtype="doi">10.1128/JB.183.4.1405-1412.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Comparative sequence analysis of the symbiosis island of <it>Mesorhizobium loti</it> strain R7A.</p>
            </title>
            <aug>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Trzebiatowski</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Cruickshank</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Gouzy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Elliot</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Fleetwood</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>McCallum</snm>
                  <fnm>NG</fnm>
               </au>
               <au>
                  <snm>Rossbach</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>GS</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>3086</fpage>
            <lpage>3095</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">135072</pubid>
                  <pubid idtype="pmpid" link="fulltext">12003951</pubid>
                  <pubid idtype="doi">10.1128/JB.184.11.3086-3095.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Complete genome structure of the nitrogen-fixing symbiotic bacterium <it>Mesorhizobium loti</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Asamizu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Idesawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>331</fpage>
            <lpage>338</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11214968</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The composite genome of the legume symbiont <it>Sinorhizobium meliloti</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Finan</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Puhler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Abola</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ampe</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Barloy-Hubler</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Barnett</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Boistard</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <fpage>668</fpage>
            <lpage>672</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11474104</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The genome of the natural genetic engineer <it>Agrobacterium tumefaciens</it> C58.</p>
            </title>
            <aug>
               <au>
                  <snm>Wood</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Setubal</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Kaul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Monks</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kitajima</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Okura</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>NF</fnm>
                  <suf>Jr</suf>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2317</fpage>
            <lpage>2323</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1066804</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The genome sequence of the facultative intracellular pathogen <it>Brucella melitensis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>DelVecchio</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Kapatral</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Redkar</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Patra</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mujer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Los</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ivanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bhattacharyya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lykidis</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>443</fpage>
            <lpage>448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117579</pubid>
                  <pubid idtype="pmpid" link="fulltext">11756688</pubid>
                  <pubid idtype="doi">10.1073/pnas.221575398</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The <it>ntrA</it> gene of <it>Agrobacterium tumefaciens</it>: identification, cloning, and phenotype of a site-directed mutant.</p>
            </title>
            <a