<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>gb-2010-11-9-r97</ui><ji>GBJ</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>Premature terminator analysis sheds light on a hidden world of bacterial transcriptional attenuation</p>
</title>
<aug>
<au id="A1"><snm>Naville</snm><fnm>Magali</fnm><insr iid="I1"/><email>magali.naville@igmors.u-psud.fr</email></au>
<au ca="yes" id="A2"><snm>Gautheret</snm><fnm>Daniel</fnm><insr iid="I1"/><email>daniel.gautheret@u-psud.fr</email></au>
</aug>
<insg>
<ins id="I1"><p>Universit&#233; Paris-Sud, CNRS, UMR8621, Institut de G&#233;n&#233;tique et Microbiologie, B&#226;timent 400, F-91405 Orsay Cedex, France</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2010</pubdate>
<volume>11</volume>
<issue>9</issue>
<fpage>R97</fpage>
<url>http://genomebiology.com/2010/11/9/R97</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-9-r97</pubid><pubid idtype="pmpid">20920266</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>12</day><month>4</month><year>2010</year></date></rec><revrec><date><day>11</day><month>8</month><year>2010</year></date></revrec><acc><date><day>29</day><month>9</month><year>2010</year></date></acc><pub><date><day>29</day><month>9</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Naville and Gautheret; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Bacterial transcription attenuation occurs through a variety of <it>cis</it>-regulatory elements that control gene expression in response to a wide range of signals. The signal-sensing structures in attenuators are so diverse and rapidly evolving that only a small fraction have been properly annotated and characterized to date. Here we apply a broad-spectrum detection tool in order to achieve a more complete view of the transcriptional attenuation complement of key bacterial species.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Our protocol seeks gene families with an unusual frequency of 5' terminators found across multiple species. Many of the detected attenuators are part of annotated elements, such as riboswitches or T-boxes, which often operate through transcriptional attenuation. However, a significant fraction of candidates were not previously characterized in spite of their unmistakable footprint. We further characterized some of these new elements using sequence and secondary structure analysis. We also present elements that may control the expression of several non-homologous genes, suggesting co-transcription and response to common signals. An important class of such elements, which we called mobile attenuators, is provided by 3' terminators of insertion sequences or prophages that may be exapted as 5' regulators when inserted directly upstream of a cellular gene.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>We show here that attenuators involve a complex landscape of signal-detection structures spanning the entire bacterial domain. We discuss possible scenarios through which these diverse 5' regulatory structures may arise or evolve.</p>
</sec>
</sec>
</abs>
</fm><meta>
<classifications>
<classification id="300100010" subtype="man_spc_id" type="BMC">Genome studies</classification>
<classification id="300100016" subtype="man_spc_id" type="BMC">Molecular biology</classification>
<classification id="300100014" subtype="man_spc_id" type="BMC">Microbiology and parasitology</classification>
</classifications>
</meta><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Transcription of protein-coding genes does not always lead to the production of full length mRNAs. In both eukaryotes and bacteria, transcriptome analysis is revealing high levels of short transcripts that result from either unsuccessful initiation events or premature termination <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
</abbrgrp>. In eukaryotes, the functions of such events remain unelucidated, except for a few cases <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>, and abortive transcription is still largely considered as transcriptional 'noise'. In Bacteria however, a form of abortive transcription known as transcription attenuation has emerged as an important regulatory strategy. The basic principle of transcriptional attenuation is the folding of the RNA transcript into either of two alternative structures, one of them corresponding to a Rho-independent terminator. The expression/repression decision occurs through a sensing system located between the promoter and the first start codon of the operon, and depends on interactions modulated by a variety of signals. The type of signal detected is commonly used to classify attenuators into major families: riboswitches bind small metabolites <abbrgrp>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
</abbrgrp>, T-boxes bind tRNAs <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>, and other types of 5' leaders respond to protein factors <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp> or temperature <abbrgrp>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp>. The triggering signals, by reflecting the global physiological state of the cell, enable a continuous monitoring of operon expression requirements.</p>
<p>A number of computational strategies have been proposed for attenuator prediction. The most general approaches consist of the identification of mutually exclusive RNA secondary structures <abbrgrp>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
</abbrgrp>, with the limitation that they miss non-hairpin anti-terminators such as riboswitches whose anti-terminator corresponds to a much larger secondary structure. Other and more specific studies have been devoted to the screening of particular classes of attenuators, such as riboswitches <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
</abbrgrp>, T-boxes <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp> or leader peptide systems <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. These screening strategies use either covariance models <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp> or descriptors combining sequence and structure motifs <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>, which are designed to detect conspicuous, class-specific signatures. Signatures can be, for instance, an RNA fold, a conserved sequence, or the presence of a short ORF <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. Such screens based on sequence or structure recognition identify highly conserved members of a family and, eventually, closely related variants. For instance, five distinct riboswitch families have been described that all sense S-adenosyl methionine (SAM I <abbrgrp>
<abbr bid="B24">24</abbr>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp>, SAM II <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>, SAM III <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>, SAM IV <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp> and SAM V <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>).</p>
<p>In bacterial species, up to 10% of operons may be regulated by transcription attenuation <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>. In agreement with this assessment, we showed in a previous study <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> that a mean of 1.6% of all bacterial genes could be subject to attenuation, with a maximum of 2.6% in Firmicutes. However, knowledge on transcriptional attenuation is unevenly distributed: almost none of the predicted attenuators in phyla such as Chlamydiae or Acidobacteria have associated functions, whereas 16% are already annotated in Firmicutes. As previous attenuator screens were mostly based on similarity searches, known families often present a marked homogeneity and lack many evolutionarily isolated instances. Moreover, they exclude entire classes of elements that are either too short or too variable to produce signatures strong enough for a similarity search.</p>
<p>To fully explore the variety of attenuation systems, we need strategies that do not rely initially on sequence or structure homology. We developed a protocol that first screens all potential Rho-independent terminators in the 5' region of genes in multiple bacterial genomes and, in a second stage, extracts significant elements using two types of procedures: a synteny-based procedure that seeks gene families with unusually frequent 5' terminators; and a non-syntenic procedure that seeks sequences conserved among multiple putative terminators. Synteny analysis alone was able to pick up every class of known attenuation system, which are generally the most widespread, while allowing the prediction of numerous new instances. A major benefit of this strategy lies in the evolutionary insight it provides on attenuator families, as we illustrate below for five families of particular interest found throughout the 302 species surveyed. We further characterized new attenuators, in particular in <it>Escherichia coli </it>and <it>Bacillus subtilis</it>, using sequence-based analysis. Our results demonstrate that attenuator characterization can be largely improved even in widely analyzed model species.</p>
</sec>
<sec>
<st>
<p>Results and discussion</p>
</st>
<sec>
<st>
<p>5' terminators are less stable than 3' terminators and unevenly distributed among species</p>
</st>
<p>We combined two methods for Rho-independent terminator prediction using either position weight matrices <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> or descriptors <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp> to detect potential terminators in the 5' and 3' UTRs in 302 bacterial genomes (see Materials and methods for details). As already documented <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>, the overall usage of Rho-independent terminators fluctuates among species, with a maximum in Firmicutes (approximately 2,689 predictions in <it>Bacillus cereus</it>) and a minimum in Actinobacteria (30 predictions in <it>Nocardioides </it>sp.) or in certain atypical species such as the proteobacteria <it>Ehrlichia ruminantium </it>(8 predictions), an obligate intracellular pathogenic organism. Controls performed using randomized UTR sequences indicate a low false positive rate in both 5' and 3' UTRs (2.3% and 2.5%, respectively; see Materials and methods). Based on experimentally verified <it>B. subtilis </it>operons, we estimate that our protocol would retrieve about 88% of 3' terminators <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>.</p>
<p>To identify putative attenuators, we applied additional filters to the set of 5' terminators, based on orientation and distance relative to flanking genes. In our set of 302 bacterial genomes, this led to the identification of 15,930 putative attenuators, 1,004 of them overlapping sequences previously annotated as ORFs encoding short hypothetical proteins. In <it>B. subtilis</it>, this protocol detected 32 of 57 (56%) known attenuation systems (riboswitches, T-boxes and other elements).</p>
<p>Terminators found in 5' UTR regions are thermodynamically less stable than 3' terminators (average folding free energy of -16.5 kcal versus -20.2 kcal/mol, <it>P </it>&lt; 2e-16) and their average stem length is slightly shorter (7.0 versus 7.6 bp, <it>P </it>&lt; 2e-16). As seen above, this difference cannot be imputed to a higher false discovery rate in 5' UTR, although it is in agreement with expected properties of structures that must fold alternatively; all known 5' regulatory terminators allow an alternative readthrough, which is not the case for 3' terminators. Interestingly, 5' terminators do not seem to be evolutionarily more conserved in sequence than 3' terminators. Analyzing data from a recent screen for non-coding conserved elements in bacteria <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>, we observed that 58% of 5' terminators, versus 66% of 3' terminators, overlap a conserved region. This suggests that 5' terminators are not associated with conserved sequences more often than canonical terminators.</p>
</sec>
<sec>
<st>
<p>Synteny analysis reveals more than 50 gene families subject to frequent attenuation control</p>
</st>
<p>We classified 5' terminators based on homology relationships between downstream genes, an operation that amounts to seeking syntenic attenuator/gene pairs. This method is related to that of Merino and Yanofsky <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, who sought over-represented families of orthologous genes flanking putative attenuators. These authors defined attenuators based on mutually exclusive stems, while we look for single terminator motifs. In principle our approach should be more sensitive to transcriptional attenuators as some achieve their alternative state through contacts with external factors and do not require stable alternative base pairing. On the other hand, Merino and Yanofsky were able to detect translational attenuators, which we do not detect here.</p>
<p>To identify protein families showing a greater propensity for regulation by transcriptional attenuation, we used the Hogenom database of gene families <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp> and ranked families based on numbers of 5' terminators. In a first procedure, we scored gene families according to absolute numbers of predicted attenuators across all species without any consideration of gene family size, which favored large families of paralogs (Table S0 in Additional file <supplr sid="S1">1</supplr>). To avoid bias towards large paralog families, we used a second scoring procedure where gene families were ranked according to their frequencies and species distribution (Figure <figr fid="F1">1</figr>). The significance of attenuator enrichment was confirmed independently for each family. Scores, <it>P</it>-values and family information are shown in Table S1 in Additional file <supplr sid="S1">1</supplr>. Our synteny-based scoring eliminates over 90% of the false positives corresponding to terminators of independent small RNA genes. While 2.2% (343 elements) of the total 15,930 predicted 5' terminators map to annotated small RNAs, this fraction is reduced to 0.16% (3 out of 1,845) when considering only attenuator candidates from the 65 high-ranking families. Therefore, although some terminators of independent RNA genes are included in our initial screen, most of them are dismissed when we consider high-scoring gene families. Finally, we analyzed sequence conservation to detect possible functional elements associated with attenuators. To this aim, we performed pairwise sequence comparison of the terminator regions, followed by hierarchical clustering. Attenuator elements harboring a sequence conserved in at least two species are listed in Table S2 in Additional file <supplr sid="S1">1</supplr> for the first 30 attenuator families.</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Supplementary tables and figures</b>. Table S0: gene families showing the highest absolute numbers of attenuator candidates. Table S1: genes most frequently regulated by attenuation in bacteria (normalized by family size). Table S2: list of sequence clusters observed in the 30 gene families most often regulated by attenuation (tabulation-separated). Table S3: sequence clusters obtained among candidates upstream of ABC-transporter genes. Table S4: complete list of clusters obtained by analyzing all candidates from enterobacterial species listed in Table S6. Cluster classes: 'a', clusters including only orthologous genes. 'b', clusters including only non-orthologous genes, sometimes from a single species; 'c', 'super-clusters' containing several sets of orthologous genes. Table S5: complete list of clusters obtained by analyzing all the candidates of <it>Bacillus </it>species listed in Table S6. 'a', clusters including only orthologous genes; 'b', clusters including only non-orthologous genes, sometimes from a single species; 'c', 'super-clusters' containing several sets of orthologous genes. Table S6: list of species analyzed for the identification of attenuators 'regulons'. Table S7: complete list of analyzed species, along with GenBank identifiers of corresponding DNA molecules and clade. Table S8: complete list of attenuators predicted in 5' UTR of genes, using the protocol described in <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> (tab-delimited table). Supplementary data 1: list of <it>rimP</it>-leaders from Gammaproteobacteria; list of <it>rimP</it>-leaders from other species; list of intergenic regions where no terminator could be detected, but showing sequence similarity to putative attenuators. Supplementary data 2: Stockholm alignments of the five ABC-leaders shown in Figure 4. Supplementary data 3: lists and Stockholm alignments of attenuator 'regulons' (candidates present upstream of several non-homologous genes) in Firmicutes. Supplementary data 4: parameters, commands and descriptor files used for terminator prediction.</p>
</text>
<file name="gb-2010-11-9-r97-S1.ZIP">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Sixty-five families of genes most frequently controlled by attenuation</p></caption><text>
   <p><b>Sixty-five families of genes most frequently controlled by attenuation</b>. For each gene family described on the left, the histogram bar shows the fraction of candidates already described in the Rfam database or in the literature. The green curve corresponds to the cumulated portion of families with at least one described candidate, from the first family to the 65th. The red curve indicates the absolute number of candidates in each family.</p>
</text><graphic file="gb-2010-11-9-r97-1"/></fig>
<p>We assessed prior knowledge of attenuator regulation in each gene family through a systematic literature survey and comparison to the Rfam RNA family database <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp> and to the RegulonDB database of <it>E. coli </it>transcriptional networks <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. The high incidence of known terminator systems among high-ranking families in Figure <figr fid="F1">1</figr> underscores the specificity of our detection method. Forty-two out of 65 high-ranking families (65%) were already described as attenuator-regulated in at least one species, covering virtually all known classes of attenuator systems (Figure <figr fid="F1">1</figr>). This proportion reaches 100% for the first 20 families. Within known attenuator families, however, a large fraction of elements were not described previously: 67% of elements are unannotated in the top 30 families. Furthermore, several major families of attenuators are essentially uncharacterized: 27 out of 65 are completely uncharacterized or are less than 1% characterized. In the following sections we describe five previously uncharacterized attenuator families, selected either because they are particularly widespread or functionally interesting or because they display intriguing phylogenetic patterns.</p>
<sec>
<st>
<p>The <it>rimP</it>-leader: the most ubiquitous transcription attenuator</p>
</st>
<p>The <it>rimP </it>gene ranks first in our list of genes most often regulated by attenuation (Figure <figr fid="F1">1</figr>). <it>rimP</it>, previously known as <it>yhbC </it>in <it>E. coli </it>and <it>ylxS </it>in <it>B. subtilis</it>, encodes a protein recently shown to be involved in 30S ribosomal subunit maturation <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>. It is the first gene of an operon encompassing the <it>nusA </it>and <it>infB </it>genes, which are present in almost all bacteria. While <it>infB </it>encodes the translation initiation factor IF-2, the NusA protein is characterized as a transcriptional pausing, readthrough, termination and anti-termination factor, and is shown to participate in the Rho-dependent anti-termination complex <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>.</p>
<p>Analysis of predicted attenuators in <it>rimP</it>-<it>nusA</it>-<it>infB </it>operons (Figure <figr fid="F2">2b</figr>) revealed the presence of a short and highly conserved motif corresponding to the terminator, the '<it>rimP-</it>leader', but gave no evidence of larger conserved elements characteristic of riboswitches, T-boxes or ribosomal protein-dependent attenuators. The terminator stem contains an unusual highly conserved <monospace>GGGc(...)gCCC</monospace> motif. We were unable to detect such a conserved motif in any other 5' or 3' terminator, suggesting this sequence signature is specific to the <it>rimP-</it>leader. We could find, however, the same motif along with the downstream U-stretch in many 5' UTRs of <it>rimP</it>-<it>nusA</it>-<it>infB </it>operons where no terminator structure was detected (Supplementary data 1 in Additional file <supplr sid="S1">1</supplr>). These additional motifs were missed because the potential hairpin was too short for detection with our programs, consistent with our previous observation that regulatory terminators are less stable than regular terminators. The distance separating the terminator from the gene start varies between 4 and 130 nucleotides, and may consequently encompass the ribosome-binding site (RBS). In several Gammaproteobacteria (listed in Supplementary data 1 in Additional file <supplr sid="S1">1</supplr>), the <it>rimP</it>-leader appears more complex, with a second terminator found in tandem and upstream of the former (Figure <figr fid="F2">2a</figr>), and presenting a clear potential anti-terminator structure. Interestingly, this terminator presents a <monospace>CCCg(...)cGGG</monospace> motif, inverse to the downstream motif.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>The <it>rimP</it>-leader</p></caption><text>
   <p><b>The <it>rimP</it>-leader</b>. Highlighted boxes indicate putative ribosome binding sites. <b>(a) </b><it>rimP</it>-leader identified in several Gammaproteobacteria (listed in Supplementary data 1 in Additional file <supplr sid="S1">1</supplr>), composed of a putative termination/antitermination structure (shown by thick arrows under the sequence) followed by the general <it>rimP</it>-leader motif described in the text. <b>(b) </b><it>rimP</it>-leader found in the majority of species, including Firmicutes and Gammaproteobacteria. This leader sequence consists of a hairpin that is G-rich on its 5' arm and C-rich on its 3' arm, followed by the T-stretch characterizing Rho-independent terminators. The black arrow indicates the transcription start site recently detected by deep sequencing <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Sequence logos were produced using Weblogo <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>.</p>
</text><graphic file="gb-2010-11-9-r97-2"/></fig>
<p>High sequence conservation in the <it>rimP</it>-leader terminator stem and the absence of any visible antiterminator structure argue for regulation involving a termination or anti-termination protein that can specifically recognize a nucleic-acid motif <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp>. If feedback control by RimP, now known to interact with rRNA <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>, may be hypothesized, the best candidate for this direct interaction is probably the NusA protein, the expression of which was shown 25 years ago to repress expression of the operon <abbrgrp>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
</abbrgrp>. NusA contains an amino-terminal domain that interacts with RNA polymerase, an S1 domain frequent in RNA-associated proteins, and two RNA-binding K homology (KH) domains <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>. It was already shown to be involved in the attenuation of the Trp, His and S10 operons <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp> by interacting with the upstream arm of the terminator hairpin, but the RNA motif we describe here was not observed in these instances.</p>
<p>While this manuscript was under review, a deep sequencing study of 5' regulators in <it>B. subtilis </it>
<abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp> observed that transcripts encoding certain core transcription elongation subunits, including <it>ylxS </it>(that is, <it>rimP</it>), appear to contain a long 5' leader region. The authors suggested these regions may contain elements regulating the associated genes. The 180-nucleotide leader they observed by deep sequencing in <it>B. subtilis rimP </it>transcripts indeed covers the attenuator we predict for this gene (Figure <figr fid="F2">2</figr>).</p>
</sec>
<sec>
<st>
<p>The <it>rpsL</it>-leader: a multiform ribosomal protein leader</p>
</st>
<p>The <it>rpsL </it>gene also appears at the top of the list of genes frequently regulated by transcriptional attenuation (Figure <figr fid="F1">1</figr>). Like <it>rimP</it>, it belongs to an operon of largely conserved genes, <it>rpsL </it>and <it>rpsG</it>, encoding the ribosomal proteins S12 and S7 respectively, and <it>fusA </it>and <it>tufA</it>, encoding the elongation factors EF-G and EF-Tu, respectively. Sequence-based clustering allowed us to identify variants among the different '<it>rpsL-</it>leaders' (Table S2 in Additional file <supplr sid="S1">1</supplr>). We derived consensus structures for several of these elements and were able to find potential alternative antiterminator structures in each case (Figure <figr fid="F3">3</figr>).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>The <it>rpsL</it>-leader</p></caption><text>
   <p><b>The <it>rpsL</it>-leader</b>. Representative structures are shown for reference species, each corresponding to the consensus terminal structure of elements found in closely related species. Boxes indicate terminator T-stretches. Black arrows indicate putative anti-terminator structures. <b>(a) </b>Elements found upstream of the <it>rpsL </it>gene in <it>Listeria</it>, <it>Xanthomonas</it>, <it>Pseudomonas </it>and <it>Rickettsia </it>genera. <b>(b) </b><it>rpsL</it>-leader found upstream of the gene <it>fusA </it>in streptococci.</p>
</text><graphic file="gb-2010-11-9-r97-3"/></fig>
<p>The translation of <it>rpsL </it>and <it>rpsG </it>was already shown to be controlled by S7, the product of <it>rpsG</it>, in <it>E. coli </it>
<abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp>, and Merino and Yanofsky <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp> predicted putative transcription attenuators upstream of <it>rpsL </it>in 24 species. However, their results scantly overlap our own predictions: we have only four common candidates, while we scanned 22 of their 24 species. The occurrences we missed with our protocol involved non-canonical terminators (with a long, GC-poor and/or bulged hairpin), or terminators that were too far from the ATG to meet our distance filter (8 of 18 cases).</p>
<p>The persistence of an attenuator element upstream of <it>rpsL </it>in widely divergent species argues for a common origin. However, we found no globally conserved feature, neither in sequence nor in structure, associated with this attenuator. This probably explains why no common structure has been proposed for the <it>rpsL-</it>leader previously. In the <it>Streptococcus </it>genus, we found no attenuator upstream of <it>rpsL</it>, contrary to Merino's analysis, which found one (this terminator is too far from the ATG codon (189 nucleotides) to satisfy our distance filter). However, we found a candidate further downstream between <it>rpsG </it>and <it>fusA </it>in streptococci (Figure <figr fid="F3">3b</figr>). It is possible that two similar elements are present in this operon, since Meyer <it>et al. </it>
<abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp> found two similar RNA structures upstream of <it>rpsL </it>and <it>fusA </it>in the Proteobacteria <it>Candidatus Pelagibacter ubique</it>. This would be consistent with co-regulation of the two genes. The element identified in <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp>, however, does not meet our terminator criteria and does not present any common sequence feature with any of our predicted attenuators.</p>
<p>Is the structure and sequence diversity in <it>rpsL</it>-leaders compatible with an interaction with the same S7 protein partner in all species? The highly flexible RNA binding portion of S7 <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp> could tolerate some variation in RNA targets or, alternatively, the leader may bind different protein partners. In the current view of ribosomal protein leaders, each leader family displays a characteristic motif that mimics corresponding binding sites in ribosomal RNA. This view may be too restrictive and the example of <it>rpsL </it>suggests that the modalities of interaction may differ across distant phyla.</p>
</sec>
<sec>
<st>
<p>ABC-leaders: conferring specificity to regulatory elements</p>
</st>
<p>ATP-binding cassette (ABC) transporters constitute one of the largest and most ancient protein families, with hundreds of paralogs transporting a wide variety of substrates across the plasmic membrane, including ions, amino acids, lipids and drugs <abbrgrp>
<abbr bid="B50">50</abbr>
<abbr bid="B51">51</abbr>
</abbrgrp>. In Bacteria, these multiproteic complexes are encoded by operons comprising genes for ATPase, permease and periplasmic components. A number of them were shown to be regulated by transcriptional factors <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>, whereas very few are known to be subject to transcriptional attenuation <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>. ABC transporters do not appear in Figure <figr fid="F1">1</figr>, where scores are weighted for family size; however, this family presents the highest absolute number of genes regulated by attenuation (Table S0 in Additional file <supplr sid="S1">1</supplr>), with a total of 205 candidates in our study, and an enrichment <it>P</it>-value of 8.8e-05. Further scrutiny of these candidates is important because of the great diversity of transporters and potential variability of regulatory elements controlling them. We found different sequence motifs associated with these 'ABC-leaders' (listed in Table S3 in Additional file <supplr sid="S1">1</supplr>).</p>
<p>Figure <figr fid="F4">4</figr> shows five ABC-leaders associated with transporters of either known (Figure <figr fid="F4">4a-d</figr>) or unknown (Figure <figr fid="F4">4e</figr>) substrates. Each is able to form an antiterminator structure and the conserved sequence/structure motif (see alignments in Supplementary data 2 in Additional file <supplr sid="S1">1</supplr>) suggests that it responds to a unique substrate. The candidate shown in Figure <figr fid="F4">4c</figr> is a probable T-box, but the other candidates do not resemble any known <it>cis</it>-regulator. Their regulatory mechanisms thus remain to be determined. The size of the conserved structure is sufficient to form an aptamer that could directly detect a substrate, thus defining new classes of riboswitches; however, we cannot exclude an indirect regulation involving a protein factor. Indeed, a number a RNA-binding proteins target palindromic RNA <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp> that may also act as a terminator hairpin.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>ABC-leaders</p></caption><text>
   <p><b>ABC-leaders</b>. <b>(a) </b>The ABC-leader found upstream of the <it>potABCD </it>operon, encoding a spermidine/putrescine import system, in the Gammaproteobacteria <it>Haemophilus somnus </it>and <it>Pasteurella multocida</it>. <b>(b) </b>The ABC-leader found in the lactobacilli <it>Lactobacillus gasseri </it>and <it>Lactobacillus johnsonii</it>, upstream of a multidrug export system operon. <b>(c) </b>The T-box found in the Firmicutes <it>Enterococcus faecalis </it>and <it>Lactobacillus sakei</it>, upstream of genes for metal ion and methionine transporters, respectively. <b>(d) </b>The ABC-leader found in bacilli upstream of an alkanesulfonates transporter operon. <b>(e) </b>The ABC-leader found in bacilli 15-nucleotides upstream of a transporter of unknown specificity. Candidates are represented using the same conventions as in Figure 3.</p>
</text><graphic file="gb-2010-11-9-r97-4"/></fig>
<p>The analysis of such a multiple paralog family raises interesting evolutionary questions on the origin of associated attenuators, that is, whether they all derive from an ancient attenuator that would have regulated the ancestral ABC transporter, or if certain genes of this family tend to 'attract' attenuators for their regulation. The relatively low proportion of attenuated ABC transporter genes and the ability of attenuators to 'hop' from one gene to another (see below) argue for the second hypothesis.</p>
</sec>
<sec>
<st>
<p>Regulators of the <it>hisS </it>genes: switching between sensing systems</p>
</st>
<p>Syntenic attenuation systems do not necessarily use the same sensing system. There are well known examples of switches from a T-box or a riboswitch in certain species (for example, Firmicutes) to a leader peptide in others (for example, Proteobacteria) <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>. Our protocol, which does not require a conserved RNA sequence or structure, is well suited to detect such exchanges, and at least five are present in our list of frequently attenuated genes (Figure <figr fid="F1">1</figr>).</p>
<p>A particularly striking case of switch between sensing systems is provided by the <it>hisS </it>gene (Figure <figr fid="F5">5</figr>). In the <it>Bacillus </it>genus, <it>hisS</it>, encoding the histidyl-tRNA synthetase, has been long known to be regulated by a T-box, like many other tRNA synthetases <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. This gene, however, underwent two successive duplications: a recent one that appears specific to bacilli, and a more ancestral one that occurred before divergence of the Proteobacteria and Firmicutes. Interestingly, all three paralogs now found in bacilli are predicted to be regulated by a different type of attenuator, as shown in Figure <figr fid="F5">5</figr>. In addition to the known <it>hisS </it>T-box, we found novel attenuators upstream of <it>hisS* </it>(the <it>Bacillus </it>
<it>hisS </it>paralog) and <it>hisZ</it>, encoding an ATP phosphoribosyltransferase regulatory subunit.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Regulators of the <it>hisS </it>gene family</p></caption><text>
   <p><b>Regulators of the <it>hisS </it>gene family</b>. The dendrogram on the right represents a hierarchical clustering of candidate attenuator sequences. Clusters of conserved sequences defined by a threshold E-value of 10<sup>-4 </sup>are framed. Blue frames highlight three groups of paralogs found in bacilli. Dotted frames indicate candidates previously annotated as short hypothetical proteins ('sORF'). <b>(a) </b>A T-box found upstream of <it>hisS </it>in lactobacilli. <b>(b) </b>Histidine leader peptide identified in bacilli upstream of two paralogs of <it>hisS</it>. <b>(c) </b>A T-box found upstream of <it>hisS </it>in bacilli and other species, including isolated Gammaproteobacteria and Chlorobia.</p>
</text><graphic file="gb-2010-11-9-r97-5"/></fig>
<p>We performed a sequence-based clustering of the 5' UTR regions in the <it>hisS </it>family to identify sets of related motifs (Figure <figr fid="F5">5</figr>). The 5' UTRs of <it>hisZ </it>and of <it>hisS* </it>have highly similar sequences that include short ORFs encompassing a stretch of histidine codons. This strongly argues for the presence of a histidine leader peptide regulating both genes. To our knowledge, no leader peptide system had been shown to exist outside of Proteobacteria <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp> and Actinobacteria <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp>, if we exclude the atypical <it>ermC </it>leader <abbrgrp>
<abbr bid="B55">55</abbr>
</abbrgrp> that controls translation and has no amino acid specificity but senses a global slowdown in translation. This result thus strengthens the evolutionary relevance of leader peptide systems and expands the range of RNA-based regulation in Gram-positive bacteria.</p>
<p>Leader peptides may have spread to Firmicutes by horizontal transfer. However, we may also hypothesize that short reading frames may have emerged repeatedly from random sequences, especially in the favorable cellular environment of species such as those in the Firmicutes phylum. In support of convergent evolution, no similarity in the non-coding or leader peptide sequence is observed between the Firmicutes and Proteobacteria. That the two <it>hisS </it>gene duplications are observed only in certain Firmicutes species suggests a recent event: a gene resulting from the first duplication may have evolved or captured an attenuation system (for example, from a Gram-negative bacteria) before undergoing a second duplication.</p>
<p>Sequence analysis of <it>hisS </it>attenuators also reveals a T-box element in Lactobacillales (Figure <figr fid="F5">5a</figr>). Furthermore, we observed one possible horizontal transfer of attenuator elements from Firmicutes to Gammaproteobacteria: an unknown proteobacterial attenuator, which corresponds to a sequence encompassing a putative short ORF, clearly resembles Firmicute T-boxes (Figure <figr fid="F5">5c</figr>, red box). This illustrates the remarkable lability of attenuator elements, which can be acquired from other species and subsequently evolve to fit the preferred regulatory mechanisms of their new host.</p>
</sec>
<sec>
<st>
<p>The related <it>greA</it>- and <it>rnk</it>-leaders</p>
</st>
<p>The <it>greA</it>/<it>rnk </it>gene family ranks seventh in our list of genes frequently regulated by attenuation. Although the propensity of <it>greA</it>/<it>rnk </it>genes for transcriptional attenuation was detected previously <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, experimental evidence for an <it>E. coli </it>
<it>greA </it>attenuator is recent <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. The gene family includes two major paralogs, <it>greA</it>, which encodes a transcription elongation factor, and <it>rnk</it>, which encodes a regulator of nucleoside diphosphate kinase. We found attenuators upstream of these genes in species ranging from Proteobacteria to bacilli and Clostridia. In several species, we identified attenuators in both genes. Figure <figr fid="F6">6</figr> shows the result of a sequence-based clustering of <it>greA</it>/<it>rnk </it>attenuators and secondary structure models for different sequence clusters. In each case, we were able to detect a clear antiterminator structure; however, no common feature could be detected between the <it>greA</it>- and <it>rnk</it>-leaders.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>The <it>greA</it>- and <it>rnk</it>-leaders</p></caption><text>
   <p><b>The <it>greA</it>- and <it>rnk</it>-leaders</b>. <b>(a) </b>The <it>rnk</it>-leader identified upstream of <it>rnk </it>in <it>Pseudomonas</it>. <b>(b,c) </b>The <it>rnk</it>-leader and <it>greA</it>-leader identified in Enterobacteria, upstream of <it>rnk </it>and <it>greA</it>, respectively. <b>(d) </b>The <it>greA</it>-leader identified upstream of <it>greA </it>in bacilli. <b>(e) </b>The <it>rnk</it>-leader identified upstream of <it>rnk </it>in <it>Yersinia</it>. Candidates are represented using the same conventions as in Figure 3.</p>
</text><graphic file="gb-2010-11-9-r97-6"/></fig>
<p>Very interestingly, the limited experimental evidence available on these two putative <it>cis </it>non-coding RNAs argues for a second mechanism in <it>trans</it>. Potrykus <it>et al. </it>
<abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp> showed that overexpression of the short form of the <it>greA </it>transcript, released after attenuation has occurred, leads to repression of several genes. Furthermore, an intergenic region corresponding to the <it>rnk</it>-leader was found in a systematic screening <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> to co-immunoprecipitate with Hfq, a conserved bacterial protein known to facilitate interaction between small RNAs and their target mRNA. Although leaders doubling as <it>trans</it>-acting RNAs is a recent and, for the time being, rare finding <abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp>, we may have found here two such cases with the <it>greA</it>- and <it>rnk</it>-leader families.</p>
</sec>
</sec>
<sec>
<st>
<p>Identification of attenuator 'regulons'</p>
</st>
<p>The term 'regulon' or 'modulon' <abbrgrp>
<abbr bid="B58">58</abbr>
</abbrgrp> has been coined to describe a set of genes subject to a common regulatory element. We analyzed attenuator regulons involving members of one or more gene families. To this intent, we compared sequences surrounding predicted 5' terminators across all genes with no consideration for orthology in a set of related species. We then clustered similar 5' sequences based on pairwise distances in so-called 'terminator clusters'. We describe here results obtained in the Enterobacteria (13 species) and <it>Bacillus </it>(8 species) subfamilies. Surveyed species are listed in Table S6 in Additional file <supplr sid="S1">1</supplr>. We identified a total of 192 and 270 terminator clusters in Enterobacteria and bacilli, respectively. We distinguished clusters based on the nature of downstream genes. Clusters involving only orthologous genes overlap the previous analysis and are given in Tables S4 and S5 in Additional file <supplr sid="S1">1</supplr>. We focus below on clusters involving several non-orthologous genes or groups of genes present in a single or a few related species.</p>
<sec>
<st>
<p>'Mobile attenuators' associated with transposable elements</p>
</st>
<p>Forty-six terminator clusters in Enterobacteria and 96 clusters in bacilli are associated with transposable elements. We describe them as 'mobile attenuators'. Terminator sequences in these clusters show a high level of conservation, consistent with an origin from transposable elements of recent dissemination. They fall into two classes. The first class (Figure <figr fid="F7">7a</figr>) corresponds to terminators located upstream of transposases or other insertion sequence (IS)-related genes and is mainly species-specific. Of note, transposase or IS-related genes also rank high in the list of frequently attenuated gene families, when no normalization for family size is applied (Table S0 in Additional file <supplr sid="S1">1</supplr>). These genes belong to transposon families such as ISBma2 (mainly present in the Betaproteobacteria <it>Burkholderia mallei</it>, in <it>Bacillus thuringiensis </it>and in the Clostridia <it>Symbiobacterium thermophilum</it>, <it>Thermoanaerobacter tengcongensis </it>and <it>Clostridium novyi</it>), IS3/IS2/IS600/IS1329/IS407A (found sporadically in all phyla) and ISL3 (mainly present in different Firmicutes species). The second class of mobile attenuators (Figure <figr fid="F7">7b</figr>) represents families of related transposon-borne sequences located immediately upstream of different, unrelated cellular genes.</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Mobile elements and the emergence of new attenuators</p></caption><text>
   <p><b>Mobile elements and the emergence of new attenuators</b>. <b>(a) </b>5' terminators detected upstream of transposase or insertion sequence (IS)-related genes correspond to intrinsic IS 5' elements that prevent expression of the IS when it is inserted in a transcribed region. <b>(b) </b>ISs also contain intrinsic 3' terminators that terminate transcription of the 3'-most IS genes. When IS-related genes undergo pseudogenization, some 3' terminators may remain active as attenuators of downstream cellular genes (exaptation). As IS insertion may occur at nearly random genomic sites, similar attenuator candidates may be found upstream of totally unrelated genes.</p>
</text><graphic file="gb-2010-11-9-r97-7"/></fig>
<p>The emergence of mobile attenuators can be explained by the structure of the IS containing both 3' and 5' transcription terminators <abbrgrp>
<abbr bid="B59">59</abbr>
<abbr bid="B60">60</abbr>
</abbrgrp> (Figure <figr fid="F7">7</figr>). Terminators of the first class correspond to IS 5' terminators, whose function is to limit transposon proliferation when inserted in a coding region under control of an active promoter. Such transposition events would be deleterious for the host, and consequently for the element's own survival. Clusters of conserved attenuators located upstream of unrelated genes (Figure <figr fid="F7">7b</figr>) may correspond to 3' terminators of ISs. The significant proportion (25 to 35%) of conserved terminator clusters that result from IS transposition suggests they have a significant impact in genome evolution, particularly in terms of regulation. The possible pseudogenization of transposed genes may allow exaptation of their 5' and/or 3' terminators and give birth to new regulatory elements (Figure <figr fid="F7">7b</figr>). It should be noted that this exaptation of transposable elements for regulatory functions is probably a universal mechanism, as several instances are already described in eukaryotic genomes <abbrgrp>
<abbr bid="B61">61</abbr>
<abbr bid="B62">62</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Non-insertion sequence related regulons</p>
</st>
<p>Ten terminator clusters in Enterobacteria and 23 in bacilli are found in different species associated with the same set of non-homologous genes. They may therefore constitute attenuator regulons (Tables S4 and S5 in Additional file <supplr sid="S1">1</supplr>). Several known riboswitches fall in this class (15 out of 23 clusters in bacilli), which is consistent with the ability of riboswitches to occur upstream of unrelated genes from distant species. Analysis of these clusters thus seems promising for the identification of new regulons, although this class also contains sequences from repeated elements. For instance, the largest cluster detected in Enterobacteria (Figure <figr fid="F8">8</figr>) involves elements conserved upstream of the <it>gatY</it>, <it>rbfA</it>, <it>cvpA </it>and, in some cases, <it>mscS</it>, <it>dppB</it>, <it>yidB </it>and <it>sbp </it>genes. Several members of this cluster present significant similarities with the rtT transcript identified by B&#246;sl and Kersten <abbrgrp>
<abbr bid="B63">63</abbr>
</abbrgrp>, a by-product of the <it>tyrT </it>operon that could have a role in stringent response, supporting the biological significance of these candidate attenuators. Interestingly, we found these highly conserved elements to belong to bacterial interspersed mosaic elements (BIMEs) <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>. Even though this is a repeated element, it probably constitutes a <it>bona fide </it>attenuator as well. Indeed, BIMEs have been shown to act as bidirectional transcription terminators <abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp> and to induce transcriptional attenuation when placed upstream of a tRNA reporter gene <abbrgrp>
<abbr bid="B66">66</abbr>
</abbrgrp>. The high conservation of these repeats, both in sequence and physical location, among Enterobacteria supports an exaptation scenario, possibly as an attenuator system. A second large attenuator cluster involving the genes <it>artJ</it>, <it>ygdL</it>, <it>yhfZ </it>and <it>yijO </it>was also found to correspond to BIMEs (Table S4 in Additional file <supplr sid="S1">1</supplr>).</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>A possible attenuator based on bacterial interspersed mosaic elements (BIMEs)</p></caption><text>
   <p><b>A possible attenuator based on bacterial interspersed mosaic elements (BIMEs)</b>. A conserved element found before the <it>gatY</it>, <it>rbfA </it>and <it>cvpA </it>genes in Enterobacteria, corresponding to BIMEs. Blue frames indicate terminators, black frames indicate possible anti-terminators. Conservation of these repeats in sequence and position argues for their utilization as attenuator elements.</p>
</text><graphic file="gb-2010-11-9-r97-8"/></fig>
<p>Clusters of attenuators located upstream of different genes suggest coexpression, and therefore functional relationships between these genes. In bacilli, an interesting such element, which we called the '<it>vanR</it>-leader', was found upstream of a gene encoding a flavin oxidoreductase and of the <it>vanR </it>gene, encoding a TCS transcription factor involved in vanillate catabolism (Figure <figr fid="F9">9</figr>; alignment in Supplementary data 3 in Additional file <supplr sid="S1">1</supplr>), suggesting a concerted regulation of these two genes. Another common element is found upstream of a sulfonate ABC transporter operon and upstream of an unknown gene. Its involvement in sulfonate metabolism may consequently be hypothesized (alignment in Supplementary data 3 in Additional file <supplr sid="S1">1</supplr>). Finally, a cluster of related attenuators, which we named '<it>fabH</it>-leaders', involves elements upstream of the <it>fabH </it>gene, involved in fatty acid metabolism, and upstream of genes encoding a DNA binding protein, for which an association with the same pathway would be interesting to test (alignment in Supplementary data 3 in Additional file <supplr sid="S1">1</supplr>).</p>
<fig id="F9"><title><p>Figure 9</p></title><caption><p>The <it>vanR</it>-leader</p></caption><text>
   <p><b>The <it>vanR</it>-leader</b>. This element is conserved in bacilli, upstream of genes encoding a flavin oxidoreductase and a TCS DNA binding element (VanR). Candidates are represented using the same conventions as in Figure 3.</p>
</text><graphic file="gb-2010-11-9-r97-9"/></fig>
</sec>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<sec>
<st>
<p>Broad-spectrum detection techniques are required to capture a larger fraction of attenuator diversity</p>
</st>
<p>In this study we apply a detection protocol that is able to capture a large spectrum of 5' regulatory sequences. Analysis of associated genes allowed us to list genes most frequently controlled by attenuation, and to further describe their attenuators. While previous computational screens based on sequence/structure homology were successful at identifying many distinct elements such as T-boxes or riboswitches <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
</abbrgrp>, they tend to suggest a discrete occurrence of specific elements, while the reality looks more as a continuous landscape of evolving elements that explore all possible sensing systems offered by 5' UTR regulation. An illustration of this diversity is provided by the extensive analysis released by Weinberg <it>et al. </it>
<abbrgrp>
<abbr bid="B67">67</abbr>
</abbrgrp>, who screened all available bacterial genomes for structured RNA and unearthed more than a hundred new significant motifs, most of which are probable <it>cis</it>-regulators. Strikingly, none of these motifs overlaps our major attenuator families in Figure <figr fid="F1">1</figr>. Indeed, many of the sensing systems used by attenuators do not involve an RNA structure or evolve too rapidly to be caught by conservation analysis. Here we captured them using a simple terminator motif that, although it is not present in all <it>cis</it>-regulatory RNAs (some use translational control or cryptic terminators), was able to detect a variety of elements not represented in current databases or literature.</p>
<p>An obvious limitation of our protocol is that it misses cases of Rho-dependent termination. However, only one instance of a Rho-dependent terminator-based attenuator has been described to date, in <it>E. coli </it>
<abbrgrp>
<abbr bid="B68">68</abbr>
</abbrgrp>. We can furthermore hypothesize that regulation more likely uses Rho-independent structures, which do not need any additional protein to function and thus may be more responsive to external challenges. Another limitation of computational attenuator detection is that it cannot unambiguously discriminate attenuators from independent non-coding RNAs when the later meet all search criteria (that is, if they are terminated by a Rho-independent structure and located directly upstream of a gene in the same orientation). The main non-experimental clue to this question could lay in the synteny conservation between different strains: linkage of attenuator element and flanking gene strongly argues in favor of a <it>cis</it>-acting element. However, as some <it>trans</it>-acting small RNAs tend to retain their physical location for co-regulation reasons, no absolute rule can be drawn.</p>
<p>Our analysis should result in a better annotation of bacterial intergenic regions. It is likely, for instance, that many short ORFs currently annotated as hypothetical proteins and located upstream of operons, which we purposely considered as intergenic sequences, will turn out to be leader peptides or other attenuation systems. Indeed, among the top 30 families of Figure <figr fid="F1">1</figr>, 14 candidates from 8 different families were previously annotated as short hypothetical proteins, which was probably erroneous.</p>
</sec>
<sec>
<st>
<p>A harvest of mobile terminators and clues for possible exaptative scenarios</p>
</st>
<p>From an evolutionary viewpoint, it is interesting that unrelated organisms may use different sensing systems to regulate the same genes. We observed such interchanges in most of the families we studied. A recurrent scenario that was previously documented is the switch between a T-box in Firmicutes and a leader peptide in Proteobacteria <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>. One of our noticeable finding, however, is the presence of 'regular' leader peptides in Firmicutes, replacing T-boxes.</p>
<p>Interestingly, we noticed a tendency for RNA-binding protein genes to be more often regulated by attenuation than other genes: while about 5% of <it>B. subtilis </it>operons present a predicted attenuator, this proportion reaches 14% among operons encoding at least one RNA-binding protein (data not shown). As several RNA-binding proteins are now shown to regulate their own expression at the transcriptional level (for example, ribosomal proteins <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>, PyrR <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp> or <it>E. coli </it>threonyl-tRNA synthetase <abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp>), it is tempting to postulate that many other RNA-binding proteins may harbor such dual-functionality, as illustrated in Figure <figr fid="F10">10a</figr>. In the same way, DNA-binding factors tend to regulate their own expression at the DNA- level (Figure <figr fid="F10">10b</figr>) <abbrgrp>
<abbr bid="B70">70</abbr>
<abbr bid="B71">71</abbr>
</abbrgrp>. It is tempting to consider possible evolutionary shifts between these two models. A number of attenuator elements identified in this study were previously assigned to palindromes interacting with transcription factors at the DNA level (MarR- or LysR-binding sites for instance). The switch from one system to another is conceivable with the emergence of a second transcription start site upstream or downstream of the palindrome sequence. Such tandem transcription start sites are quite frequent <abbrgrp>
<abbr bid="B72">72</abbr>
<abbr bid="B73">73</abbr>
</abbrgrp> and may give rise to bifunctional elements acting both as terminator and transcription factor binding sites (Figure <figr fid="F10">10</figr>) and possibly interacting with two different protein partners. It would be interesting to test such a hypothesis. Other potentially bifunctional terminators would involve sequestration of a RBS by the terminator hairpin, inducing a concerted transcriptional and translational attenuation. While no such case has been experimentally demonstrated to date, it is noticeable that 18 candidate attenuators in <it>E. coli </it>are located under 15 nucleotides from the start codon, three of which are annotated as RBS-sequestrator in the RegulonDB database <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. A switch from one system to the other is therefore a realistic hypothesis.</p>
<fig id="F10"><title><p>Figure 10</p></title><caption><p>Models for the evolution of self-regulation by RNA- and DNA-binding proteins</p></caption><text>
   <p><b>Models for the evolution of self-regulation by RNA- and DNA-binding proteins</b>. <b>(a) </b>RNA-binding protein. Among other cellular functions involving interactions with RNA molecules, RNA-binding proteins often regulate their own expression by interacting with the 5' UTR of their mRNA. In the case of transcriptional attenuation, the transcription start site in the DNA is followed by a palindrome corresponding to the terminator. Emergence of an additional transcription start site downstream of the terminator palindrome could give birth to a new transcription factor binding site. <b>(b) </b>DNA-binding protein. These proteins (mainly transcriptional factors) often regulate their own expression through interaction with a palindromic DNA sequence in the promoter. Emergence of an additional transcription start site upstream of this transcription factor binding site palindrome could give birth to a new transcriptional attenuator.</p>
</text><graphic file="gb-2010-11-9-r97-10"/></fig>
</sec>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>Identification of putative elements involving a Rho-independent terminator</p>
</st>
<p>Genome sequences and ORF annotations were retrieved from the National Center for Biotechnology Information (NCBI) database. The list of analyzed species and corresponding GenBank IDs is presented in Table S7 in Additional file <supplr sid="S1">1</supplr>. In a first stage, we looked for Rho-independent terminators in intergenic regions, as well as in regions encompassing short ORFs (&lt;200 nucleotides) encoding hypothetical proteins as well as leader peptides (considered as a particular case of transcriptional attenuation), so as to potentially reannotate some of them. A terminator search was carried out using a combination of ERPIN <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> and RNAMOTIF <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. ERPIN is a profile detection algorithm that takes as input a structure-annotated alignment. Here we used a training set of 1,200 terminator sequences from <it>B. subtilis </it>and <it>E. coli</it>. For the RNAMOTIF search, we used the descriptor of Lesnik <it>et al. </it>
<abbrgrp>
<abbr bid="B74">74</abbr>
</abbrgrp>. This descriptor consists essentially of a 4- to 18-bp helix, a 0- to 2-nucleotide spacer and a 12-nucleotide T-rich region. Parameters and input files for each program are given in Supplementary data 4 in Additional file <supplr sid="S1">1</supplr>. In this study, terminators were first sought using ERPIN, and in a second stage using RNAMOTIF when ERPIN was not able to detect any motif. In tests conducted with randomized terminator sequences, the combination of these two programs produced, on average, 47 false positive hits per 1,000 intergenic regions of size 115 nucleotides <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. In this study, we further compared the specificity of terminator detections in 5' and 3' UTR sequences by generating, for each predicted terminator, two sets of 100 randomized sequences of the same di-nucleotide composition as 5' and 3' UTRs, respectively, and applying our prediction protocol to each set.</p>
<p>To be considered as candidate attenuators, terminators had to be in the same orientation as the downstream gene and at a suitable distance to avoid incorporation of canonical 3' terminators (from the upstream gene or from an undetected non-coding RNA present in the intergenic region). To this effect, terminators located at a distance ratio (that is, the ratio of the distance from the downstream gene to the distance from the upstream gene) greater than 1:1 or further than 300 nucleotides from the first codon of the downstream gene were discarded. Final predictions are listed in Table S8 in Additional file <supplr sid="S1">1</supplr>.</p>
</sec>
<sec>
<st>
<p>Annotation of attenuator candidates and genes</p>
</st>
<p>Attenuator candidates were defined as the sequences from the limit of the upstream gene - or a maximum of 200 nucleotides - to the position immediately 3' of the terminator. Candidates were assigned to Rfam 10.0 <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp> families by similarity search using Blast <abbrgrp>
<abbr bid="B75">75</abbr>
</abbrgrp> with the following parameters: -p blastn, -G 2, -E 1, -W 7, -e 0.0001. Hits were retained if the match covered more than 50% of the full length of the Rfam entry. Candidates that failed to match a Rfam family with Blast were further submitted to covariance search using the rfam_scan.pl script provided on the Rfam web site <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>. Attenuator candidates that could not be annotated using these methods were submitted to a literature search based on the function of the downstream gene, or searched in the RegulonDB <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp> for <it>E. coli </it>candidates. We assigned functions to bacterial genes using BlastP against the Hogenom database <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp> and retrieving the corresponding Hogenom IDs.</p>
</sec>
<sec>
<st>
<p>Hogenom family scoring</p>
</st>
<p>Each Hogenom family of homologous genes <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp> was scored in two ways. The absolute score was defined as the absolute number of predicted attenuators upstream of all genes in the family across all 302 species (Table S0 in Additional file <supplr sid="S1">1</supplr>). The normalized score was computed as follows: for a given Hogenom family, the number of candidates in each species was normalized by the total number of genes in this species for this family. These single-species scores were then summed across all species. Lastly, in order to favor evolutionarily parsimonious distributions, we arranged single-species scores in a phylogenetic order and weighted consecutive non-zero scores by a factor of +0.1 times the length of the non-zero stretch. We confirmed the statistical significance of the enrichment of 5' terminators in each Hogenom family using Fisher's exact tests with the following values: number of genes in the Hogenom family; total number of genes assessed; number of genes with 5' terminators in the Hogenom family; total number of genes with 5' terminators.</p>
</sec>
<sec>
<st>
<p>Clustering of attenuator sequences</p>
</st>
<p>For sequence analysis, we extracted regions around putative Rho-independent terminators, from the limit of the upstream gene (or a maximum of 200 nucleotides) to the position immediately 3' of the terminator. The distance between two sequences was defined as the inverse of the best hit score obtained by Blast/bl2seq <abbrgrp>
<abbr bid="B75">75</abbr>
</abbrgrp>. We then performed hierarchical clustering from the distance matrix using the 'hclust' procedure from the R package <abbrgrp>
<abbr bid="B76">76</abbr>
</abbrgrp>. Clusters were defined by pruning the tree at a height of 0.038 (corresponding to an E-value of approximately 10<sup>-4</sup>) determined empirically so as to cluster exclusively elements known to belong to the same family. For each cluster, we performed a multiple sequence alignment using ClustalW <abbrgrp>
<abbr bid="B77">77</abbr>
</abbrgrp> followed by consensus structure prediction with the RNAalifold program <abbrgrp>
<abbr bid="B78">78</abbr>
</abbrgrp>.</p>
<p>RNA alignments and structures were refined and analyzed using the Ralee Emacs extension <abbrgrp>
<abbr bid="B79">79</abbr>
</abbrgrp>. RNA structures were drawn using the Varna applet <abbrgrp>
<abbr bid="B80">80</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>ABC: ATP-binding cassette; BIME: bacterial interspersed mosaic element; bp: base pair; IS: insertion sequence; ORF: open reading frame; RBS: ribosome-binding site; TCS: two-component system; UTR: untranslated region.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>MN and DG both contributed to the experimental design, data analysis and writing of the manuscript. MN performed the experiments.</p>
</sec>
</bdy><bm>
<refgrp><bibl id="B1"><title><p>The transcriptional landscape of the yeast genome defined by RNA sequencing.</p></title><aug><au><snm>Nagalakshmi</snm><fnm>U</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Waern</snm><fnm>K</fnm></au><au><snm>Shou</snm><fnm>C</fnm></au><au><snm>Raha</snm><fnm>D</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>320</volume><fpage>1344</fpage><lpage>1349</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1158441</pubid><pubid idtype="pmcid">2951732</pubid><pubid idtype="pmpid">18451266</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Tiny RNAs associated with transcription start sites in animals.</p></title><aug><au><snm>Taft</snm><fnm>RJ</fnm></au><au><snm>Glazov</snm><fnm>EA</fnm></au><au><snm>Cloonan</snm><fnm>N</fnm></au><au><snm>Simons</snm><fnm>C</fnm></au><au><snm>Stephen</snm><fnm>S</fnm></au><au><snm>Faulkner</snm><fnm>GJ</fnm></au><au><snm>Lassmann</snm><fnm>T</fnm></au><au><snm>Forrest</snm><fnm>AR</fnm></au><au><snm>Grimmond</snm><fnm>SM</fnm></au><au><snm>Schroder</snm><fnm>K</fnm></au><au><snm>Irvine</snm><fnm>K</fnm></au><au><snm>Arakawa</snm><fnm>T</fnm></au><au><snm>Nakamura</snm><fnm>M</fnm></au><au><snm>Kubosaki</snm><fnm>A</fnm></au><au><snm>Hayashida</snm><fnm>K</fnm></au><au><snm>Kawazu</snm><fnm>C</fnm></au><au><snm>Murata</snm><fnm>M</fnm></au><au><snm>Nishiyori</snm><fnm>H</fnm></au><au><snm>Fukuda</snm><fnm>S</fnm></au><au><snm>Kawai</snm><fnm>J</fnm></au><au><snm>Daub</snm><fnm>CO</fnm></au><au><snm>Hume</snm><fnm>DA</fnm></au><au><snm>Suzuki</snm><fnm>H</fnm></au><au><snm>Orlando</snm><fnm>V</fnm></au><au><snm>Carninci</snm><fnm>P</fnm></au><au><snm>Hayashizaki</snm><fnm>Y</fnm></au><au><snm>Mattick</snm><fnm>JS</fnm></au></aug><source>Nat Genet</source><pubdate>2009</pubdate><volume>41</volume><fpage>572</fpage><lpage>578</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.312</pubid><pubid idtype="pmpid" link="fulltext">19377478</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs.</p></title><aug><au><snm>Jacquier</snm><fnm>A</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2009</pubdate><volume>10</volume><fpage>833</fpage><lpage>844</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2683</pubid><pubid idtype="pmpid" link="fulltext">19920851</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Imprecise transcription termination within <it>Escherichia coli greA </it>leader gives rise to an array of short transcripts, GraL.</p></title><aug><au><snm>Potrykus</snm><fnm>K</fnm></au><au><snm>Murphy</snm><fnm>H</fnm></au><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Epstein</snm><fnm>JA</fnm></au><au><snm>Cashel</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>1636</fpage><lpage>1651</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp1150</pubid><pubid idtype="pmcid">2836576</pubid><pubid idtype="pmpid">20008510</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Widespread bidirectional promoters are the major source of cryptic transcripts in yeast.</p></title><aug><au><snm>Neil</snm><fnm>H</fnm></au><au><snm>Malabat</snm><fnm>C</fnm></au><au><snm>d'Aubenton-Carafa</snm><fnm>Y</fnm></au><au><snm>Xu</snm><fnm>Z</fnm></au><au><snm>Steinmetz</snm><fnm>LM</fnm></au><au><snm>Jacquier</snm><fnm>A</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>457</volume><fpage>1038</fpage><lpage>1042</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07747</pubid><pubid idtype="pmpid" link="fulltext">19169244</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>The intricate world of riboswitches.</p></title><aug><au><snm>Coppins</snm><fnm>L</fnm></au><au><snm>Hall</snm><fnm>KB</fnm></au><au><snm>Groisman</snm><fnm>EA</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2007</pubdate><volume>10</volume><fpage>176</fpage><lpage>181</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.mib.2007.03.006</pubid><pubid idtype="pmcid">1894890</pubid><pubid idtype="pmpid">17383225</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Riboswitch RNAs: using RNA to sense cellular metabolism.</p></title><aug><au><snm>Henkin</snm><fnm>TM</fnm></au></aug><source>Genes Dev</source><pubdate>2008</pubdate><volume>22</volume><fpage>3383</fpage><lpage>3390</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1747308</pubid><pubid idtype="pmpid" link="fulltext">19141470</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Expanding roles for metabolite-sensing regulatory RNAs.</p></title><aug><au><snm>Dambach</snm><fnm>MD</fnm></au><au><snm>Winkler</snm><fnm>WC</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2009</pubdate><volume>12</volume><fpage>161</fpage><lpage>169</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.mib.2009.01.012</pubid><pubid idtype="pmcid">2846827</pubid><pubid idtype="pmpid">19250859</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Comparative genomic analysis of T-box regulatory systems in bacteria.</p></title><aug><au><snm>Vitreschak</snm><fnm>AG</fnm></au><au><snm>Mironov</snm><fnm>AA</fnm></au><au><snm>Lyubetsky</snm><fnm>VA</fnm></au><au><snm>Gelfand</snm><fnm>MS</fnm></au></aug><source>RNA</source><pubdate>2008</pubdate><volume>14</volume><fpage>717</fpage><lpage>735</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.819308</pubid><pubid idtype="pmcid">2271356</pubid><pubid idtype="pmpid">18359782</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Biochemical features and functional implications of the RNA-based T-box regulatory mechanism.</p></title><aug><au><snm>Guti&#233;rrez-Preciado</snm><fnm>A</fnm></au><au><snm>Henkin</snm><fnm>TM</fnm></au><au><snm>Grundy</snm><fnm>FJ</fnm></au><au><snm>Yanofsky</snm><fnm>C</fnm></au><au><snm>Merino</snm><fnm>E</fnm></au></aug><source>Microbiol Mol Biol Rev</source><pubdate>2009</pubdate><volume>73</volume><fpage>36</fpage><lpage>61</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MMBR.00026-08</pubid><pubid idtype="pmcid">2650889</pubid><pubid idtype="pmpid">19258532</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Feedback regulation of ribosomal protein gene expression in <it>Escherichia coli</it>: structural homology of ribosomal RNA and ribosomal protein mRNA.</p></title><aug><au><snm>Nomura</snm><fnm>M</fnm></au><au><snm>Yates</snm><fnm>JL</fnm></au><au><snm>Dean</snm><fnm>D</fnm></au><au><snm>Post</snm><fnm>LE</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1980</pubdate><volume>77</volume><fpage>7084</fpage><lpage>7088</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.77.12.7084</pubid><pubid idtype="pmcid">350445</pubid><pubid idtype="pmpid">7012833</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Molecular recognition of pyr mRNA by the <it>Bacillus subtilis </it>attenuation regulatory protein PyrR.</p></title><aug><au><snm>Bonner</snm><fnm>ER</fnm></au><au><snm>D'Elia</snm><fnm>JN</fnm></au><au><snm>Billips</snm><fnm>BK</fnm></au><au><snm>Switzer</snm><fnm>RL</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2001</pubdate><volume>29</volume><fpage>4851</fpage><lpage>4865</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/29.23.4851</pubid><pubid idtype="pmcid">96680</pubid><pubid idtype="pmpid">11726695</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>The bgl sensory system: a transmembrane signaling pathway controlling transcriptional antitermination.</p></title><aug><au><snm>Amster-Choder</snm><fnm>O</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2005</pubdate><volume>8</volume><fpage>127</fpage><lpage>134</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.mib.2005.02.014</pubid><pubid idtype="pmpid" link="fulltext">15802242</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>RNA thermometers.</p></title><aug><au><snm>Narberhaus</snm><fnm>F</fnm></au><au><snm>Waldminghaus</snm><fnm>T</fnm></au><au><snm>Chowdhury</snm><fnm>S</fnm></au></aug><source>FEMS Microbiol Rev</source><pubdate>2006</pubdate><volume>30</volume><fpage>3</fpage><lpage>16</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1574-6976.2005.004.x</pubid><pubid idtype="pmpid" link="fulltext">16438677</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Genomewide bioinformatics prediction and experimental &#233;valuation of potential RNA thermometers.</p></title><aug><au><snm>Waldminghaus</snm><fnm>T</fnm></au><au><snm>Gaubig</snm><fnm>LC</fnm></au><au><snm>Narberhaus</snm><fnm>F</fnm></au></aug><source>Mol Genet Genomics</source><pubdate>2007</pubdate><volume>278</volume><fpage>555</fpage><lpage>564</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00438-007-0272-7</pubid><pubid idtype="pmpid" link="fulltext">17647020</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Molecular basis for temperature sensing by an RNA thermometer.</p></title><aug><au><snm>Chowdhury</snm><fnm>S</fnm></au><au><snm>Maris</snm><fnm>C</fnm></au><au><snm>Allain</snm><fnm>FH</fnm></au><au><snm>Narberhaus</snm><fnm>F</fnm></au></aug><source>EMBO J</source><pubdate>2006</pubdate><volume>25</volume><fpage>2487</fpage><lpage>2497</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.emboj.7601128</pubid><pubid idtype="pmcid">1478195</pubid><pubid idtype="pmpid">16710302</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Transcription attenuation: a highly conserved regulatory strategy used by bacteria.</p></title><aug><au><snm>Merino</snm><fnm>E</fnm></au><au><snm>Yanofsky</snm><fnm>C</fnm></au></aug><source>Trends Genet</source><pubdate>2005</pubdate><volume>21</volume><fpage>260</fpage><lpage>264</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2005.03.002</pubid><pubid idtype="pmpid" link="fulltext">15851059</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>[Search for alternative RNA secondary structures regulating expression of bacterial genes]</p></title><aug><au><snm>Liubetskaia</snm><fnm>EV</fnm></au><au><snm>Leont'ev</snm><fnm>LA</fnm></au><au><snm>Gel'fand</snm><fnm>MS</fnm></au><au><snm>Liubetskii</snm><fnm>VA</fnm></au></aug><source>Mol Biol (Mosk)</source><pubdate>2003</pubdate><volume>37</volume><fpage>707</fpage><lpage>716</lpage><xrefbib><pubid idtype="doi">10.1023/A:1026084910427</pubid></xrefbib></bibl><bibl id="B19"><title><p>New RNA motifs suggest an expanded scope for riboswitches in bacterial genetic control.</p></title><aug><au><snm>Barrick</snm><fnm>JE</fnm></au><au><snm>Corbino</snm><fnm>KA</fnm></au><au><snm>Winkler</snm><fnm>WC</fnm></au><au><snm>Nahvi</snm><fnm>A</fnm></au><au><snm>Mandal</snm><fnm>M</fnm></au><au><snm>Collins</snm><fnm>J</fnm></au><au><snm>Lee</snm><fnm>M</fnm></au><au><snm>Roth</snm><fnm>A</fnm></au><au><snm>Sudarsan</snm><fnm>N</fnm></au><au><snm>Jona</snm><fnm>I</fnm></au><au><snm>Wickiser</snm><fnm>JK</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2004</pubdate><volume>101</volume><fpage>6421</fpage><lpage>6426</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0308014101</pubid><pubid idtype="pmcid">404060</pubid><pubid idtype="pmpid">15096624</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>The distributions, mechanisms, and structures of metabolite-binding riboswitches.</p></title><aug><au><snm>Barrick</snm><fnm>JE</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><fpage>R239</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2007-8-11-r239</pubid><pubid idtype="pmcid">2258182</pubid><pubid idtype="pmpid">17997835</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>A computational pipeline for high-throughput discovery of cis-regulatory noncoding RNA in prokaryotes.</p></title><aug><au><snm>Yao</snm><fnm>Z</fnm></au><au><snm>Barrick</snm><fnm>J</fnm></au><au><snm>Weinberg</snm><fnm>Z</fnm></au><au><snm>Neph</snm><fnm>S</fnm></au><au><snm>Breaker</snm><fnm>R</fnm></au><au><snm>Tompa</snm><fnm>M</fnm></au><au><snm>Ruzzo</snm><fnm>WL</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2007</pubdate><volume>3</volume><fpage>e126</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0030126</pubid><pubid idtype="pmcid">1913097</pubid><pubid idtype="pmpid">17616982</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Attenuation regulation of amino acid biosynthetic operons in proteobacteria: comparative genomics analysis.</p></title><aug><au><snm>Vitreschak</snm><fnm>AG</fnm></au><au><snm>Lyubetskaya</snm><fnm>EV</fnm></au><au><snm>Shirshin</snm><fnm>MA</fnm></au><au><snm>Gelfand</snm><fnm>MS</fnm></au><au><snm>Lyubetsky</snm><fnm>VA</fnm></au></aug><source>FEMS Microbiol Lett</source><pubdate>2004</pubdate><volume>234</volume><fpage>357</fpage><lpage>370</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1574-6968.2004.tb09555.x</pubid><pubid idtype="pmpid">15135544</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Computational identification of riboswitches based on RNA conserved functional sequences and conformations.</p></title><aug><au><snm>Chang</snm><fnm>TH</fnm></au><au><snm>Huang</snm><fnm>HD</fnm></au><au><snm>Wu</snm><fnm>LC</fnm></au><au><snm>Yeh</snm><fnm>CT</fnm></au><au><snm>Liu</snm><fnm>BJ</fnm></au><au><snm>Horng</snm><fnm>JT</fnm></au></aug><source>RNA</source><pubdate>2009</pubdate><volume>15</volume><fpage>1426</fpage><lpage>1430</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.1623809</pubid><pubid idtype="pmcid">2704089</pubid><pubid idtype="pmpid">19460868</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>The S box regulon: a new global transcription termination control system for methionine and cysteine biosynthesis genes in gram-positive bacteria.</p></title><aug><au><snm>Grundy</snm><fnm>FJ</fnm></au><au><snm>Henkin</snm><fnm>TM</fnm></au></aug><source>Mol Microbiol</source><pubdate>1998</pubdate><volume>30</volume><fpage>737</fpage><lpage>749</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2958.1998.01105.x</pubid><pubid idtype="pmpid" link="fulltext">10094622</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The riboswitch-mediated control of sulfur metabolism in bacteria.</p></title><aug><au><snm>Epshtein</snm><fnm>V</fnm></au><au><snm>Mironov</snm><fnm>AS</fnm></au><au><snm>Nudler</snm><fnm>E</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2003</pubdate><volume>100</volume><fpage>5052</fpage><lpage>5056</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0531307100</pubid><pubid idtype="pmcid">154296</pubid><pubid idtype="pmpid">12702767</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>An mRNA structure that controls gene expression by binding S-adenosylmethionine.</p></title><aug><au><snm>Winkler</snm><fnm>WC</fnm></au><au><snm>Nahvi</snm><fnm>A</fnm></au><au><snm>Sudarsan</snm><fnm>N</fnm></au><au><snm>Barrick</snm><fnm>JE</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>Nat Struct Biol</source><pubdate>2003</pubdate><volume>10</volume><fpage>701</fpage><lpage>707</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsb967</pubid><pubid idtype="pmpid" link="fulltext">12910260</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Evidence for a second class of S-adenosylmethionine riboswitches and other regulatory RNA motifs in alpha-proteobacteria.</p></title><aug><au><snm>Corbino</snm><fnm>KA</fnm></au><au><snm>Barrick</snm><fnm>JE</fnm></au><au><snm>Lim</snm><fnm>J</fnm></au><au><snm>Welz</snm><fnm>R</fnm></au><au><snm>Tucker</snm><fnm>BJ</fnm></au><au><snm>Puskarz</snm><fnm>I</fnm></au><au><snm>Mandal</snm><fnm>M</fnm></au><au><snm>Rudnick</snm><fnm>ND</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>Genome Biol</source><pubdate>2005</pubdate><volume>6</volume><fpage>R70</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2005-6-8-r70</pubid><pubid idtype="pmcid">1273637</pubid><pubid idtype="pmpid">16086852</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>The SMK box is a new SAM-binding RNA for translational regulation of SAM synthetase.</p></title><aug><au><snm>Fuchs</snm><fnm>RT</fnm></au><au><snm>Grundy</snm><fnm>FJ</fnm></au><au><snm>Henkin</snm><fnm>TM</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2006</pubdate><volume>13</volume><fpage>226</fpage><lpage>233</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb1059</pubid><pubid idtype="pmpid" link="fulltext">16491091</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>The aptamer core of SAM-IV riboswitches mimics the ligand-binding site of SAM-I riboswitches.</p></title><aug><au><snm>Weinberg</snm><fnm>Z</fnm></au><au><snm>Regulski</snm><fnm>EE</fnm></au><au><snm>Hammond</snm><fnm>MC</fnm></au><au><snm>Barrick</snm><fnm>JE</fnm></au><au><snm>Yao</snm><fnm>Z</fnm></au><au><snm>Ruzzo</snm><fnm>WL</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>RNA</source><pubdate>2008</pubdate><volume>14</volume><fpage>822</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.988608</pubid><pubid idtype="pmcid">2327355</pubid><pubid idtype="pmpid">18369181</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>A variant riboswitch aptamer class for S-adenosylmethionine common in marine bacteria.</p></title><aug><au><snm>Poiata</snm><fnm>E</fnm></au><au><snm>Meyer</snm><fnm>MM</fnm></au><au><snm>Ames</snm><fnm>TD</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>RNA</source><pubdate>2009</pubdate><volume>15</volume><fpage>2046</fpage><lpage>2056</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.1824209</pubid><pubid idtype="pmcid">2764483</pubid><pubid idtype="pmpid">19776155</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Transcription attenuation in bacteria: theme and variations.</p></title><aug><au><snm>Naville</snm><fnm>M</fnm></au><au><snm>Gautheret</snm><fnm>D</fnm></au></aug><source>Brief Funct Genomic Proteomic</source><pubdate>2009</pubdate><volume>8</volume><fpage>482</fpage><lpage>492</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bfgp/elp025</pubid><pubid idtype="pmpid" link="fulltext">19651704</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles.</p></title><aug><au><snm>Gautheret</snm><fnm>D</fnm></au><au><snm>Lambert</snm><fnm>A</fnm></au></aug><source>J Mol Biol</source><pubdate>2001</pubdate><volume>313</volume><fpage>1003</fpage><lpage>1011</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.2001.5102</pubid><pubid idtype="pmpid" link="fulltext">11700055</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>RNAmotif, an RNA secondary structure definition and search algorithm.</p></title><aug><au><snm>Macke</snm><fnm>TJ</fnm></au><au><snm>Ecker</snm><fnm>DJ</fnm></au><au><snm>Gutell</snm><fnm>RR</fnm></au><au><snm>Gautheret</snm><fnm>D</fnm></au><au><snm>Case</snm><fnm>DA</fnm></au><au><snm>Sampath</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2001</pubdate><volume>29</volume><fpage>4724</fpage><lpage>4735</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/29.22.4724</pubid><pubid idtype="pmcid">92549</pubid><pubid idtype="pmpid">11713323</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Prediction of transcriptional terminators in <it>Bacillus subtilis </it>and related species.</p></title><aug><au><snm>de Hoon</snm><fnm>MJ</fnm></au><au><snm>Makita</snm><fnm>Y</fnm></au><au><snm>Nakai</snm><fnm>K</fnm></au><au><snm>Miyano</snm><fnm>S</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2005</pubdate><volume>1</volume><fpage>e25</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0010025</pubid><pubid idtype="pmcid">1187862</pubid><pubid idtype="pmpid">16110342</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>ARNold: a web tool for the prediction of Rho-independent transcription terminators.</p></title><aug><au><snm>Naville</snm><fnm>M</fnm></au><au><snm>Ghuillot-Gaudeffroy</snm><fnm>A</fnm></au><au><snm>Gautheret</snm><fnm>D</fnm></au></aug><source>RNA Biol</source><pubdate>2011</pubdate><inpress/></bibl><bibl id="B36"><title><p>Single-pass classification of all noncoding sequences in a bacterial genome using phylogenetic profiles.</p></title><aug><au><snm>Marchais</snm><fnm>A</fnm></au><au><snm>Naville</snm><fnm>M</fnm></au><au><snm>Bohn</snm><fnm>C</fnm></au><au><snm>Bouloc</snm><fnm>P</fnm></au><au><snm>Gautheret</snm><fnm>D</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1084</fpage><lpage>1092</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.089714.108</pubid><pubid idtype="pmcid">2694484</pubid><pubid idtype="pmpid">19237465</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases.</p></title><aug><au><snm>Dufayard</snm><fnm>JF</fnm></au><au><snm>Duret</snm><fnm>L</fnm></au><au><snm>Penel</snm><fnm>S</fnm></au><au><snm>Gouy</snm><fnm>M</fnm></au><au><snm>Rechenmann</snm><fnm>F</fnm></au><au><snm>Perri&#232;re</snm><fnm>G</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>2596</fpage><lpage>2603</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti325</pubid><pubid idtype="pmpid" link="fulltext">15713731</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Rfam: updates to the RNA families database.</p></title><aug><au><snm>Gardner</snm><fnm>PP</fnm></au><au><snm>Daub</snm><fnm>J</fnm></au><au><snm>Tate</snm><fnm>JG</fnm></au><au><snm>Nawrocki</snm><fnm>EP</fnm></au><au><snm>Kolbe</snm><fnm>DL</fnm></au><au><snm>Lindgreen</snm><fnm>S</fnm></au><au><snm>Wilkinson</snm><fnm>AC</fnm></au><au><snm>Finn</snm><fnm>Rd</fnm></au><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au><au><snm>Eddy</snm><fnm>SR</fnm></au><au><snm>Bateman</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D136</fpage><lpage>140</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn766</pubid><pubid idtype="pmcid">2686503</pubid><pubid idtype="pmpid">18953034</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>RegulonDB (version 6.0): gene regulation model of <it>Escherichia coli </it>K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.</p></title><aug><au><snm>Gama-Castro</snm><fnm>S</fnm></au><au><snm>Jimenez-Jacinto</snm><fnm>V</fnm></au><au><snm>Peralta-Gil</snm><fnm>M</fnm></au><au><snm>Santos-Zavaleta</snm><fnm>A</fnm></au><au><snm>Pe&#241;aloza-Spindola</snm><fnm>MI</fnm></au><au><snm>Contreras-Moreira</snm><fnm>B</fnm></au><au><snm>Segura-Salazar</snm><fnm>J</fnm></au><au><snm>Mu&#241;iz-Rascado</snm><fnm>L</fnm></au><au><snm>Martinez-Flores</snm><fnm>I</fnm></au><au><snm>Salgado</snm><fnm>H</fnm></au><au><snm>Bonavides-Martinez</snm><fnm>C</fnm></au><au><snm>Abreu-Goodger</snm><fnm>C</fnm></au><au><snm>Rodriguez-Penagos</snm><fnm>C</fnm></au><au><snm>Miranda-Rios</snm><fnm>J</fnm></au><au><snm>Morett</snm><fnm>E</fnm></au><au><snm>Merino</snm><fnm>E</fnm></au><au><snm>Huerta</snm><fnm>AM</fnm></au><au><snm>Trevino-Quintanilla</snm><fnm>L</fnm></au><au><snm>Collado-Vides</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><issue>36 Database</issue><fpage>D120</fpage><lpage>124</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2238961</pubid><pubid idtype="pmpid">18158297</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>The RimP protein is important for maturation of the 30S ribosomal subunit.</p></title><aug><au><snm>Nord</snm><fnm>S</fnm></au><au><snm>Bylund</snm><fnm>GO</fnm></au><au><snm>L&#246;vgren</snm><fnm>JM</fnm></au><au><snm>Wikstr&#246;m</snm><fnm>PM</fnm></au></aug><source>J Mol Biol</source><pubdate>2009</pubdate><volume>386</volume><fpage>742</fpage><lpage>753</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jmb.2008.12.076</pubid><pubid idtype="pmpid" link="fulltext">19150615</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Structure of a <it>Mycobacterium tuberculosis </it>NusA-RNA complex.</p></title><aug><au><snm>Beuth</snm><fnm>B</fnm></au><au><snm>Pennell</snm><fnm>S</fnm></au><au><snm>Arnvig</snm><fnm>KB</fnm></au><au><snm>Martin</snm><fnm>SR</fnm></au><au><snm>Taylor</snm><fnm>IA</fnm></au></aug><source>EMBO J</source><pubdate>2005</pubdate><volume>24</volume><fpage>3576</fpage><lpage>3587</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.emboj.7600829</pubid><pubid idtype="pmcid">1276712</pubid><pubid idtype="pmpid">16193062</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Effect of NusA protein on expression of the nusA,infB operon in <it>E. coli</it>.</p></title><aug><au><snm>Plumbridge</snm><fnm>JA</fnm></au><au><snm>Dondon</snm><fnm>J</fnm></au><au><snm>Nakamura</snm><fnm>Y</fnm></au><au><snm>Grunberg-Manago</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1985</pubdate><volume>13</volume><fpage>3371</fpage><lpage>3388</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/13.9.3371</pubid><pubid idtype="pmcid">341241</pubid><pubid idtype="pmpid">2987884</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Evidence for autoregulation of the nusA-infB operon of <it>Escherichia coli</it>.</p></title><aug><au><snm>Nakamura</snm><fnm>Y</fnm></au><au><snm>Plumbridge</snm><fnm>J</fnm></au><au><snm>Dondon</snm><fnm>J</fnm></au><au><snm>Grunberg-Manago</snm><fnm>M</fnm></au></aug><source>Gene</source><pubdate>1985</pubdate><volume>36</volume><fpage>189</fpage><lpage>193</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0378-1119(85)90085-X</pubid><pubid idtype="pmpid">2998933</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Crystal structure and RNA-binding analysis of the archaeal transcription factor NusA.</p></title><aug><au><snm>Shibata</snm><fnm>R</fnm></au><au><snm>Bessho</snm><fnm>Y</fnm></au><au><snm>Shinkai</snm><fnm>A</fnm></au><au><snm>Nishimoto</snm><fnm>M</fnm></au><au><snm>Fusatomi</snm><fnm>E</fnm></au><au><snm>Terada</snm><fnm>T</fnm></au><au><snm>Shirouzu</snm><fnm>M</fnm></au><au><snm>Yokoyama</snm><fnm>S</fnm></au></aug><source>Biochem Biophys Res Commun</source><pubdate>2007</pubdate><volume>355</volume><fpage>122</fpage><lpage>128</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.bbrc.2007.01.119</pubid><pubid idtype="pmpid" link="fulltext">17288993</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>An extended RNA binding surface through arrayed S1 and KH domains in transcription factor NusA.</p></title><aug><au><snm>Worbs</snm><fnm>M</fnm></au><au><snm>Bourenkov</snm><fnm>GP</fnm></au><au><snm>Bartunik</snm><fnm>HD</fnm></au><au><snm>Huber</snm><fnm>R</fnm></au><au><snm>Wahl</snm><fnm>MC</fnm></au></aug><source>Mol Cell</source><pubdate>2001</pubdate><volume>7</volume><fpage>1177</fpage><lpage>1189</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(01)00262-3</pubid><pubid idtype="pmpid" link="fulltext">11430821</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Identification of regulatory RNAs in <it>Bacillus subtilis</it>.</p></title><aug><au><snm>Irnov</snm><fnm>I</fnm></au><au><snm>Sharma</snm><fnm>CM</fnm></au><au><snm>Vogel</snm><fnm>J</fnm></au><au><snm>Winkler</snm><fnm>WC</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><inpress/><xrefbib><pubid idtype="pmpid" link="fulltext">20525796</pubid></xrefbib></bibl><bibl id="B47"><title><p>Post-transcriptional regulation of the <it>str </it>operon in <it>Escherichia coli</it>: Ribosomal protein S7 inhibits coupled translation of S7 but not its independent translation.</p></title><aug><au><snm>Saito</snm><fnm>K</fnm></au><au><snm>Mattheakis</snm><fnm>LC</fnm></au><au><snm>Nomura</snm><fnm>M</fnm></au></aug><source>J Mol Biol</source><pubdate>1994</pubdate><volume>235</volume><fpage>111</fpage><lpage>124</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0022-2836(05)80020-8</pubid><pubid idtype="pmpid" link="fulltext">7507167</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Identification of candidate structured RNAs in the marine organism '<it>Candidatus Pelagibacter ubique</it>'.</p></title><aug><au><snm>Meyer</snm><fnm>MM</fnm></au><au><snm>Ames</snm><fnm>TD</fnm></au><au><snm>Smith</snm><fnm>DP</fnm></au><au><snm>Weinberg</snm><fnm>Z</fnm></au><au><snm>Schwalbach</snm><fnm>MS</fnm></au><au><snm>Giovannoni</snm><fnm>SJ</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>268</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-268</pubid><pubid idtype="pmcid">2704228</pubid><pubid idtype="pmpid">19531245</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Mapping of the RNA recognition site of <it>Escherichia coli </it>ribosomal protein S7.</p></title><aug><au><snm>Robert</snm><fnm>F</fnm></au><au><snm>Gagnon</snm><fnm>M</fnm></au><au><snm>Sans</snm><fnm>D</fnm></au><au><snm>Michnick</snm><fnm>S</fnm></au><au><snm>Brakier-Gingras</snm><fnm>L</fnm></au></aug><source>RNA</source><pubdate>2000</pubdate><volume>6</volume><fpage>1649</fpage><lpage>1659</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1017/S1355838200001199</pubid><pubid idtype="pmcid">1370033</pubid><pubid idtype="pmpid">11105763</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Structure, function, and evolution of bacterial ATP-binding cassette systems.</p></title><aug><au><snm>Davidson</snm><fnm>AL</fnm></au><au><snm>Dassa</snm><fnm>E</fnm></au><au><snm>Orelle</snm><fnm>C</fnm></au><au><snm>Chen</snm><fnm>J</fnm></au></aug><source>Microbiol Mol Biol Rev</source><pubdate>2008</pubdate><volume>72</volume><fpage>317</fpage><lpage>364</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MMBR.00031-07</pubid><pubid idtype="pmcid">2415747</pubid><pubid idtype="pmpid">18535149</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>ATP-binding cassette transporters in <it>Escherichia coli.</it>.</p></title><aug><au><snm>Moussatova</snm><fnm>A</fnm></au><au><snm>Kandt</snm><fnm>C</fnm></au><au><snm>O'Mara</snm><fnm>ML</fnm></au><au><snm>Tieleman</snm><fnm>DP</fnm></au></aug><source>Biochim Biophys Acta</source><pubdate>2008</pubdate><volume>1778</volume><fpage>1757</fpage><lpage>1771</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.bbamem.2008.06.009</pubid><pubid idtype="pmpid" link="fulltext">18634750</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Regulation of bacterial drug export systems.</p></title><aug><au><snm>Grkovic</snm><fnm>S</fnm></au><au><snm>Brown</snm><fnm>MH</fnm></au><au><snm>Skurray</snm><fnm>RA</fnm></au></aug><source>Microbiol Mol Biol Rev</source><pubdate>2002</pubdate><volume>66</volume><fpage>671</fpage><lpage>701</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MMBR.66.4.671-701.2002</pubid><pubid idtype="pmcid">134658</pubid><pubid idtype="pmpid">12456787</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Transcriptional termination control of a novel ABC transporter gene involved in antibiotic resistance in <it>Bacillus subtilis</it>.</p></title><aug><au><snm>Ohki</snm><fnm>R</fnm></au><au><snm>Tateno</snm><fnm>K</fnm></au><au><snm>Takizawa</snm><fnm>T</fnm></au><au><snm>Aiso</snm><fnm>T</fnm></au><au><snm>Murata</snm><fnm>M</fnm></au></aug><source>J Bacteriol</source><pubdate>2005</pubdate><volume>187</volume><fpage>5946</fpage><lpage>5954</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.187.17.5946-5954.2005</pubid><pubid idtype="pmcid">1196159</pubid><pubid idtype="pmpid">16109936</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Comparative analysis of RNA regulatory elements of amino acid metabolism genes in Actinobacteria.</p></title><aug><au><snm>Seliverstov</snm><fnm>AV</fnm></au><au><snm>Putzer</snm><fnm>H</fnm></au><au><snm>Gelfand</snm><fnm>MS</fnm></au><au><snm>Lyubetsky</snm><fnm>VA</fnm></au></aug><source>BMC Microbiol</source><pubdate>2005</pubdate><volume>5</volume><fpage>54</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2180-5-54</pubid><pubid idtype="pmcid">1262725</pubid><pubid idtype="pmpid">16202131</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Conformational alteration of mRNA structure and the posttranscriptional regulation of erythromycin-induced drug resistance.</p></title><aug><au><snm>Gryczan</snm><fnm>TJ</fnm></au><au><snm>Grandi</snm><fnm>G</fnm></au><au><snm>Hahn</snm><fnm>J</fnm></au><au><snm>Grandi</snm><fnm>R</fnm></au><au><snm>Dubnau</snm><fnm>D</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1980</pubdate><volume>8</volume><fpage>6081</fpage><lpage>6097</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/8.24.6081</pubid><pubid idtype="pmcid">328074</pubid><pubid idtype="pmpid">6162157</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Global analysis of small RNA and mRNA targets of Hfq.</p></title><aug><au><snm>Zhang</snm><fnm>A</fnm></au><au><snm>Wassarman</snm><fnm>KM</fnm></au><au><snm>Rosenow</snm><fnm>C</fnm></au><au><snm>Tjaden</snm><fnm>BC</fnm></au><au><snm>Storz</snm><fnm>G</fnm></au><au><snm>Gottesman</snm><fnm>S</fnm></au></aug><source>Mol Microbiol</source><pubdate>2003</pubdate><volume>50</volume><fpage>1111</fpage><lpage>1124</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2958.2003.03734.x</pubid><pubid idtype="pmpid" link="fulltext">14622403</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>A trans-acting riboswitch controls expression of the virulence regulator PrfA in <it>Listeria monocytogenes</it>.</p></title><aug><au><snm>Loh</snm><fnm>E</fnm></au><au><snm>Dussurget</snm><fnm>O</fnm></au><au><snm>Gripenland</snm><fnm>J</fnm></au><au><snm>Vaitkevicius</snm><fnm>K</fnm></au><au><snm>Tiensuu</snm><fnm>T</fnm></au><au><snm>Mandin</snm><fnm>P</fnm></au><au><snm>Repoila</snm><fnm>F</fnm></au><au><snm>Buchrieser</snm><fnm>C</fnm></au><au><snm>Cossart</snm><fnm>P</fnm></au><au><snm>Johansson</snm><fnm>J</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>139</volume><fpage>770</fpage><lpage>779</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.08.046</pubid><pubid idtype="pmpid" link="fulltext">19914169</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.</p></title><aug><au><snm>Rodionov</snm><fnm>DA</fnm></au></aug><source>Chem Rev</source><pubdate>2007</pubdate><volume>107</volume><fpage>3467</fpage><lpage>3497</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/cr068309+</pubid><pubid idtype="pmcid">2643304</pubid><pubid idtype="pmpid">17636889</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Tn10 protects itself at two levels from fortuitous activation by external promoters.</p></title><aug><au><snm>Davis</snm><fnm>MA</fnm></au><au><snm>Simons</snm><fnm>RW</fnm></au><au><snm>Kleckner</snm><fnm>N</fnm></au></aug><source>Cell</source><pubdate>1985</pubdate><volume>43</volume><fpage>379</fpage><lpage>387</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(85)90043-1</pubid><pubid idtype="pmpid" link="fulltext">2416461</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Repression of IS200 transposase synthesis by RNA secondary structures.</p></title><aug><au><snm>Beuzon</snm><fnm>CR</fnm></au><au><snm>Marques</snm><fnm>S</fnm></au><au><snm>Casadesus</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1999</pubdate><volume>27</volume><fpage>3690</fpage><lpage>3695</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/27.18.3690</pubid><pubid idtype="pmcid">148624</pubid><pubid idtype="pmpid">10471738</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>A distal enhancer and an ultraconserved exon are derived from a novel retroposon.</p></title><aug><au><snm>Bejerano</snm><fnm>G</fnm></au><au><snm>Lowe</snm><fnm>CB</fnm></au><au><snm>Ahituv</snm><fnm>N</fnm></au><au><snm>King</snm><fnm>B</fnm></au><au><snm>Siepel</snm><fnm>A</fnm></au><au><snm>Salama</snm><fnm>SR</fnm></au><au><snm>Rubin</snm><fnm>EM</fnm></au><au><snm>Kent</snm><fnm>WJ</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>441</volume><fpage>87</fpage><lpage>90</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04696</pubid><pubid idtype="pmpid" link="fulltext">16625209</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Thousands of human mobile element fragments undergo strong purifying selection near developmental genes.</p></title><aug><au><snm>Lowe</snm><fnm>CB</fnm></au><au><snm>Bejerano</snm><fnm>G</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>8005</fpage><lpage>8010</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0611223104</pubid><pubid idtype="pmcid">1876562</pubid><pubid idtype="pmpid">17463089</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>A novel RNA product of the <it>tyrT </it>operon of <it>Escherichia coli</it>.</p></title><aug><au><snm>B&#246;sl</snm><fnm>M</fnm></au><au><snm>Kersten</snm><fnm>H</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1991</pubdate><volume>19</volume><fpage>5863</fpage><lpage>5870</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/19.21.5863</pubid><pubid idtype="pmcid">329039</pubid><pubid idtype="pmpid">1840671</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>The BIME family of bacterial highly repetitive sequences.</p></title><aug><au><snm>Gilson</snm><fnm>E</fnm></au><au><snm>Saurin</snm><fnm>W</fnm></au><au><snm>Perrin</snm><fnm>D</fnm></au><au><snm>Bachellier</snm><fnm>S</fnm></au><au><snm>Hofnung</snm><fnm>M</fnm></au></aug><source>Res Microbiol</source><pubdate>1991</pubdate><volume>142</volume><fpage>217</fpage><lpage>222</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0923-2508(91)90033-7</pubid><pubid idtype="pmpid" link="fulltext">1656494</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>A subfamily of <it>E. coli </it>palindromic units implicated in transcription termination?</p></title><aug><au><snm>Gilson</snm><fnm>E</fnm></au><au><snm>Rousset</snm><fnm>JP</fnm></au><au><snm>Cl&#233;ment</snm><fnm>JM</fnm></au><au><snm>Hofnung</snm><fnm>M</fnm></au></aug><source>Ann Inst Pasteur Microbiol</source><pubdate>1986</pubdate><volume>137B</volume><fpage>259</fpage><lpage>270</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0769-2609(86)80116-8</pubid><pubid idtype="pmpid">3318868</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Transcription attenuation associated with bacterial repetitive extragenic BIME elements.</p></title><aug><au><snm>Esp&#233;li</snm><fnm>O</fnm></au><au><snm>Moulin</snm><fnm>L</fnm></au><au><snm>Boccard</snm><fnm>F</fnm></au></aug><source>J Mol Biol</source><pubdate>2001</pubdate><volume>314</volume><fpage>375</fpage><lpage>386</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.2001.5150</pubid><pubid idtype="pmpid" link="fulltext">11846552</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>Comparative genomics reveals 104 candidate structured RNAs from bacteria, archaea and their metagenomes.</p></title><aug><au><snm>Weinberg</snm><fnm>Z</fnm></au><au><snm>Wang</snm><fnm>JX</fnm></au><au><snm>Bogue</snm><fnm>J</fnm></au><au><snm>Yang</snm><fnm>J</fnm></au><au><snm>Corbino</snm><fnm>K</fnm></au><au><snm>Moy</snm><fnm>RH</fnm></au><au><snm>Breaker</snm><fnm>RR</fnm></au></aug><source>Genome Biol</source><pubdate>2010</pubdate><volume>11</volume><fpage>R31</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-3-r31</pubid><pubid idtype="pmcid">2864571</pubid><pubid idtype="pmpid">20230605</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Rho-dependent transcription termination in the tna operon of <it>Escherichia coli</it>: Roles of the boxA sequence and the rut site.</p></title><aug><au><snm>Konan</snm><fnm>KV</fnm></au><au><snm>Yanofsky</snm><fnm>C</fnm></au></aug><source>J Bacteriol</source><pubdate>2000</pubdate><volume>182</volume><fpage>3981</fpage><lpage>3988</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.182.14.3981-3988.2000</pubid><pubid idtype="pmcid">94583</pubid><pubid idtype="pmpid">10869076</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Bacterial translational control at atomic resolution.</p></title><aug><au><snm>Romby</snm><fnm>P</fnm></au><au><snm>Springer</snm><fnm>M</fnm></au></aug><source>Trends Genet</source><pubdate>2003</pubdate><volume>19</volume><fpage>155</fpage><lpage>161</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0168-9525(03)00020-9</pubid><pubid idtype="pmpid" link="fulltext">12615010</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Molecular biology of the LysR family of transcriptional regulators.</p></title><aug><au><snm>Schell</snm><fnm>MA</fnm></au></aug><source>Annu Rev Microbiol</source><pubdate>1993</pubdate><volume>47</volume><fpage>597</fpage><lpage>626</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.mi.47.100193.003121</pubid><pubid idtype="pmpid" link="fulltext">8257110</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Transcriptional and translational regulation of the marRAB multiple anibiotic resistance operon in <it>Escherichia coli</it>.</p></title><aug><au><snm>Martin</snm><fnm>RG</fnm></au><au><snm>Rosner</snm><fnm>JL</fnm></au></aug><source>Mol Microbiol</source><pubdate>2004</pubdate><volume>53</volume><fpage>183</fpage><lpage>191</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2958.2004.04080.x</pubid><pubid idtype="pmpid" link="fulltext">15225313</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>The <it>Listeria </it>transcriptional landscape from saprophytism to virulence.</p></title><aug><au><snm>Toledo-Arana</snm><fnm>A</fnm></au><au><snm>Dussurget</snm><fnm>O</fnm></au><au><snm>Nikitas</snm><fnm>G</fnm></au><au><snm>Sesto</snm><fnm>N</fnm></au><au><snm>Guet-Revillet</snm><fnm>H</fnm></au><au><snm>Balestrino</snm><fnm>D</fnm></au><au><snm>Loh</snm><fnm>E</fnm></au><au><snm>Gripenland</snm><fnm>J</fnm></au><au><snm>Tiensuu</snm><fnm>T</fnm></au><au><snm>Vaitkevicius</snm><fnm>K</fnm></au><au><snm>Barthelemy</snm><fnm>M</fnm></au><au><snm>Vergassola</snm><fnm>M</fnm></au><au><snm>Nahori</snm><fnm>MA</fnm></au><au><snm>Soubigou</snm><fnm>G</fnm></au><au><snm>R&#233;gnault</snm><fnm>B</fnm></au><au><snm>Copp&#233;e</snm><fnm>JY</fnm></au><au><snm>Lecuit</snm><fnm>M</fnm></au><au><snm>Johansson</snm><fnm>J</fnm></au><au><snm>Cossart</snm><fnm>P</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>459</volume><fpage>950</fpage><lpage>956</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08080</pubid><pubid idtype="pmpid" link="fulltext">19448609</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in <it>E. coli.</it>.</p></title><aug><au><snm>Mendoza-Vargas</snm><fnm>A</fnm></au><au><snm>Olvera</snm><fnm>L</fnm></au><au><snm>Olvera</snm><fnm>M</fnm></au><au><snm>Grande</snm><fnm>R</fnm></au><au><snm>Vega-Alvarado</snm><fnm>L</fnm></au><au><snm>Taboada</snm><fnm>B</fnm></au><au><snm>Jimenez-Jacinto</snm><fnm>V</fnm></au><au><snm>Salgado</snm><fnm>H</fnm></au><au><snm>Ju&#225;rez</snm><fnm>K</fnm></au><au><snm>Contreras-Moreira</snm><fnm>B</fnm></au><au><snm>Huerta</snm><fnm>AM</fnm></au><au><snm>Collado-Vides</snm><fnm>J</fnm></au><au><snm>Morett</snm><fnm>E</fnm></au></aug><source>PLoS One</source><pubdate>2009</pubdate><volume>4</volume><fpage>e7526</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0007526</pubid><pubid idtype="pmcid">2760140</pubid><pubid idtype="pmpid">19838305</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>Prediction of Rho-independent transcriptional terminators in <it>Escherichia coli</it>.</p></title><aug><au><snm>Lesnik</snm><fnm>EA</fnm></au><au><snm>Sampath</snm><fnm>R</fnm></au><au><snm>Levene</snm><fnm>HB</fnm></au><au><snm>Henderson</snm><fnm>TJ</fnm></au><au><snm>McNeil</snm><fnm>JA</fnm></au><au><snm>Ecker</snm><fnm>DJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2001</pubdate><volume>29</volume><fpage>3583</fpage><lpage>3594</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/29.17.3583</pubid><pubid idtype="pmcid">55870</pubid><pubid idtype="pmpid">11522828</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Basic local alignment search tool.</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Gish</snm><fnm>W</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Myers</snm><fnm>EW</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>J Mol Biol</source><pubdate>1990</pubdate><volume>215</volume><fpage>403</fpage><lpage>410</lpage><xrefbib><pubid idtype="pmpid">2231712</pubid></xrefbib></bibl><bibl id="B76"><title><p>R: A Language and Environment for Statistical Computing.</p></title><aug><au><cnm>R Development Core Team</cnm></au></aug><pubdate>2009</pubdate><url>http://www.R-project.org</url></bibl><bibl id="B77"><title><p>ClustalW and ClustalX version 2.</p></title><aug><au><snm>Larkin</snm><fnm>MA</fnm></au><au><snm>Blackshields</snm><fnm>G</fnm></au><au><snm>Brown</snm><fnm>NP</fnm></au><au><snm>Chenna</snm><fnm>R</fnm></au><au><snm>McGettigan</snm><fnm>PA</fnm></au><au><snm>McWilliam</snm><fnm>H</fnm></au><au><snm>Valentin</snm><fnm>F</fnm></au><au><snm>Wallace</snm><fnm>IM</fnm></au><au><snm>Wilm</snm><fnm>A</fnm></au><au><snm>Lopez</snm><fnm>R</fnm></au><au><snm>Thompson</snm><fnm>JD</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au></aug><source>Bioinformatics</source><pubdate>2007</pubdate><volume>23</volume><fpage>2947</fpage><lpage>2948</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btm404</pubid><pubid idtype="pmpid" link="fulltext">17846036</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>RNAalifold: improved consensus structure prediction for RNA alignments.</p></title><aug><au><snm>Bernhart</snm><fnm>SH</fnm></au><au><snm>Hofacker</snm><fnm>IL</fnm></au><au><snm>Will</snm><fnm>S</fnm></au><au><snm>Gruber</snm><fnm>AR</fnm></au><au><snm>Stadler</snm><fnm>PF</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>474</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-474</pubid><pubid idtype="pmcid">2621365</pubid><pubid idtype="pmpid">19014431</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>RALEE - RNA ALignment editor in Emacs.</p></title><aug><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>257</fpage><lpage>259</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth489</pubid><pubid idtype="pmpid" link="fulltext">15377506</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>VARNA: Interactive drawing and editing of the RNA secondary structure.</p></title><aug><au><snm>Darty</snm><fnm>K</fnm></au><au><snm>Denise</snm><fnm>A</fnm></au><au><snm>Ponty</snm><fnm>Y</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><fpage>1974</fpage><lpage>1975</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp250</pubid><pubid idtype="pmcid">2712331</pubid><pubid idtype="pmpid">19398448</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>WebLogo: A sequence logo generator.</p></title><aug><au><snm>Crooks</snm><fnm>GE</fnm></au><au><snm>Hon</snm><fnm>G</fnm></au><au><snm>Chandonia</snm><fnm>JM</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><fpage>1188</fpage><lpage>1190</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.849004</pubid><pubid idtype="pmcid">419797</pubid><pubid idtype="pmpid">15173120</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>