<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-8-r155</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>RASTA-Bacteria: a web-based tool for identifying toxin-antitoxin loci in prokaryotes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Sevin</snm>
               <mi>W</mi>
               <fnm>Emeric</fnm>
               <insr iid="I1"/>
               <email>esevin@univ-rennes1.fr</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Barloy-Hubler</snm>
               <fnm>Fr&#233;d&#233;rique</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>fhubler@univ-rennes1.fr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>CNRS UMR6061 G&#233;n&#233;tique et D&#233;veloppement, Universit&#233; de Rennes 1, IFR 140, Av. du Prof. L&#233;on Bernard, CS 34317, 35043 Rennes, France</p>
            </ins>
            <ins id="I2">
               <p>CNRS UMR6026 Interactions Cellulaires et Mol&#233;culaires, Groupe DUALS, Universit&#233; de Rennes 1, IFR140, Campus de Beaulieu, Av. du G&#233;n&#233;ral Leclerc, 35042 Rennes, France</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>8</issue>
         <fpage>R155</fpage>
         <url>http://genomebiology.com/2007/8/8/R155</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17678530</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-8-r155</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>29</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>14</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>1</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>01</day>
               <month>08</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Sevin and Barloy-Hubler; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>The RASTA-Bacteria tool</p>
      </shorttitle>
      <shortabs>
         <p>RASTA-Bacteria is an automated method that allows quick and reliable identification of toxin/antitoxin loci in sequenced prokaryotic genomes, whether they are annotated open reading frames or not.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Toxin/antitoxin (TA) systems, viewed as essential regulators of growth arrest and programmed cell death, are widespread among prokaryotes, but remain sparsely annotated. We present RASTA-Bacteria, an automated method allowing quick and reliable identification of TA loci in sequenced prokaryotic genomes, whether they are annotated open reading frames or not. The tool successfully confirmed all reported TA systems, and spotted new putative loci upon screening of sequenced genomes. RASTA-Bacteria is publicly available at <url>http://genoweb.univ-rennes1.fr/duals/RASTA-Bacteria</url>.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Rationale</p>
         </st>
         <p>More than 500 prokaryotic genomes have now been completely sequenced and annotated, and the number of sequencing projects underway (approximately 1,300) indicates that the amount of such data is going to rise very rapidly <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Large-scale comparative genomics based on these data constituted a giant leap forward in the process of gene identification. Nevertheless, substantial numbers of annotated open reading frames (ORFs) throughout the sequenced genomes remain hypothetical, most of which are 200 amino acids in length or shorter <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Luckily, interest in these small ORFs (sORFs) is growing <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, and recent work in <it>Saccharomyces cerevisiae </it>shows that they may be involved in key cellular functions <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>The toxin/antitoxin (TA) modules are a group of sORFs for which knowledge has been improving over the past two decades. Most TA modules consist of two adjacent, co-oriented but antagonistic genes: one encodes a stable toxin harmful to an essential cell process, and the second a labile antitoxin that blocks the toxin's activity by DNA- or protein-binding <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. TA pairs have been classified into two types. The first are those where the antitoxin is an antisense RNA. They have been linked to plasmid stabilization by means of a post-segregational killing (PSK) effect <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> (for a review, see <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>). The second type, on which we focus in this study, includes loci where the antitoxin is a fully translated protein. For consistency with previous studies, we shall refer to them throughout this paper as TA systems.</p>
         <p>For some time after their discovery in 1983 <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, TA systems were only found on plasmids. They were defined as plasmid inheritance guarantor systems, and called 'plasmid addiction systems'. Several years later, two homologous TA operons were discovered on the <it>Escherichia coli </it>chromosome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Interest in these chromosomal TA systems led to the discovery of further systems in various bacteria <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, and of their involvement in programmed cell death (PCD) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. It was suggested that under severe starvation conditions, the TA-mediated PCD of moribund subpopulations provides the remaining healthy cells with nutrients, thus benefiting the species. Proof was later established that some TA systems actually provoke a static state in certain adverse conditions, in which cells remain viable but do not proliferate, and that this state is fully reversible on cognate antitoxin induction <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. However, it was later shown that this reversal is only possible within a limited time frame, after which there is a 'point of no return' in the killing effect of the toxin <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>TA systems, widespread among both bacteria and archaea <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, are currently classified into eight families, depending on their structural features or modes of action <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Little is known about the only three-component family, whose founding member is the omega-epsilon-zeta (&#969;-&#949;-&#950;) system from plasmid pSM19035, except that the additional gene (&#969;) acts as a repressor regulating the transcription of the operon <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. &#969;-&#949;-&#950; systems are found only in Gram-positive bacteria. The remaining seven families all have two components. Four of them are the following: the ParDE system, which is found in Gram-negative and Gram-positive bacteria and in archaea, and targets DNA gyrase <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>; HigBA, unique in that its toxin is located upstream from its antitoxin <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, which is found in Gram-negative and Gram-positive bacteria and whose action involves mRNA cleavage <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>; the <it>phd</it>/<it>doc </it>locus, found in all types of prokaryotes, which is believed to inhibit translation <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>; and the <it>vapBC </it>locus, found both on plasmids and chromosomes, which seems to be the TA system with the highest copy number in the prokaryotes that bear it. No cellular target has yet been reported for <it>vapBC</it>, although VapC toxins contain a PIN domain (homologue of the pilT amino-terminal domain: ribonuclease involved in nonsense-mediated mRNA decay and RNA interference in eukaryotes), suggesting that the system may contribute to quality control of gene expression <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.
The other three families are the best characterized: the <it>ccdAB </it>locus, found only in some Gram-negative bacteria, stabilizes plasmids upon replication by targeting DNA gyrase <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>; members of the RelBE family, present in Gram-negatives, Gram-positives and archaea, inhibit cell growth by impairing translation due to mRNA cleavage through the A-site of the ribosome <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>; and finally, the toxins of the MazEF/PemIK family, sometimes referred to as 'RNA interferases' <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, are ribonucleases that cleave cellular mRNA, thus depriving the ribosomes of substrates to translate <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> - they have been found in Gram-negative and Gram-positive bacteria.</p>
         <p>The role of TA systems in programmed cell death opens promising possibilities for the design of a new class of antibiotics <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Moreover, chromosome-borne TA systems are activated by various extreme conditions, including the presence of antibiotics <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> or infecting phages <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, thymine starvation or other DNA damage <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, high temperatures, and oxidative stress <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Their involvement in the response to amino acid starvation <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> has also attracted considerable interest: TA modules are believed to provide a backup system to the stringent response by controlling superfluous macromolecular biosynthesis during stasis independently of ppGpp <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, the stringent response alarmone that elicits the cascade of protective reactions. A reduced rate of translation is associated with fewer translational errors, so TA loci may contribute to quality control of gene expression, helping the cells cope with nutritional stress <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Therefore, it remains a priority to exhaustively identify TA loci in prokaryotic organisms in order to improve our understanding of these systems and more broadly of the cellular mechanisms behind bacterial adaptation.</p>
         <p>In 2005, Pandey and co-workers <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> performed an exhaustive search in 126 completely sequenced genomes (archaea and eubacteria), using standard sequence alignment tools (BLASTP and TBLASTN). Their work highlighted a surprising diversity in the distribution of TA loci: some organisms have many (<it>Nitrosomonas europaea </it>has 45 potential TA systems), whereas more than half of the other species have between 1 and 5, and 31 have none. Nevertheless, the use of basic nucleic or amino acid sequence similarity limits these findings to toxins and antitoxins for which a clear homolog exists; there is, therefore, a possible bias in their results. In view of the aforementioned lack of annotation of the small ORFs, and to improve localization techniques for TA systems, we developed a simple method for identifying all potential TA systems in a given bacterial genome: Rapid Automated Scan for Toxins and Antitoxins in Bacteria (RASTA-Bacteria). This method is based on the genomic features associated with toxins and antitoxins and the existence of conserved functional domains. The results are sorted by a confidence score and no candidate is discarded, thus providing the user with an extensive overview of the data.</p>
      </sec>
      <sec>
         <st>
            <p>Process overview</p>
         </st>
         <p>The module-based pipeline of RASTA-Bacteria is described in Figure <figr fid="F1">1</figr>. The first step is to provide a genomic sequence. Even though it can be useful to test relatively short 'raw' nucleic sequences for the presence of a TA system, RASTA-Bacteria was designed to function with whole-replicon genomic sequences, regardless of their size (small plasmids or large chromosomes). The tool can thus take either simple (FASTA-formatted) nucleic sequences or fully annotated (GenBank) files as input data. They can either be selected from an extensive list of sequenced bacterial and archaeal genomes, or be provided by the user in the case of an unpublished genome.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic modular pipeline of RASTA-Bacteria</p>
            </caption>
            <text>
               <p>Schematic modular pipeline of RASTA-Bacteria. Step 1: provide a nucleic genome sequence in GenBank or raw Fasta format. Step 2: tune the search parameters (optional). Step 3: launch the search; each module calculates a local score, and possibly modifies the dataset (Sx = score at level x; Ny = number of ORFs in dataset; Lz = length distribution of dataset; b1 = bonus). Step 4: output in webpage and/or results files available for download.</p>
            </text>
            <graphic file="gb-2007-8-8-r155-1"/>
         </fig>
         <p>The second step enables the user to tune optional parameters for the search: depending on the origin of the input sequence, it is possible to choose the length-scoring model ('general', 'archaea', 'Gram+', or 'Gram-') on which the scoring function relies. The sensitivity of the tool can also be adjusted by modifying the bit-score threshold for the RPSBLAST alignments. However, we defined the default value from our experiments and believe it is the most appropriate. Similarly, a minimal ORF size for the ORF finder can be defined, as well as an annotated gene overlap percentage threshold used when verifying the annotation. These parameters limit the amount of data (and hence the computation time), and should be refined only in particular cases, such as genomes known to contain many overlapping genes. The third step is the run phase, performed as follows: first, screening of the nucleic sequence for open reading frames; second, screening of newly determined ORFs for the presence of TA domains; third, size-based scoring of the ORFs; and fourth, scoring based on the pairing possibility of an ORF with another. In the last step, the results are combined to calculate a global confidence score for each ORF. The ORFs are then ranked accordingly and displayed to the user in a tabular format, which ensures clear visualization of the results and allows easy verification by cross-linkage to the data files. For raw nucleic sequences and files below 500 kB, the table is directly viewable in the user's web browser (Figure <figr fid="F2">2</figr>). The results table and supporting files are then available for download as a tar archive. For fully annotated genomes and files over 500 kB, no interactive display is produced, and the user is notified by email when the job ends that the archive is ready for download.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Screenshot of the results displayed as a webpage</p>
            </caption>
            <text>
               <p>Screenshot of the results displayed as a webpage. This illustration shows the output results ranked by confidence score. The arrows represent internal links to additional supporting data. The amino acid sequence corresponding to an ORF as annotated by RASTA-Bacteria is shown (1). When a conserved TA domain was predicted, the alignment results can be seen in rpsblast output format (2). Anchor links between co-localized candidates allow checking for possible parity (3).</p>
            </text>
            <graphic file="gb-2007-8-8-r155-2"/>
         </fig>
         <p>The method developed was automated using Perl, with sequence processing relying on the BioPerl library <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The script is embedded in a PHP-based web-interface. RASTA-Bacteria is publicly available from the application website <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Description of the algorithm</p>
         </st>
         <sec>
            <st>
               <p>Genomic features used for discriminating TA systems</p>
            </st>
            <p>It should be noted here that <it>hipBA </it>loci (found to have a role in the production of 'persister cells' in <it>E. coli </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp>), as well as restriction-modification (type II) systems, can also be considered as TA systems. Nevertheless, the latter have been extensively identified and characterized elsewhere <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>, and have been excluded from our work. Because of its specific organization, the three-component TA family (&#969;-&#949;-&#950;) was also excluded from the present study.</p>
            <p>TA systems by definition consist of, at least, two genes: the 'dormant guard' role is fulfilled by the presence of a toxic and a protective protein together, although some orphan genes (for which conservation of functionality as such remains unclear) have been reported <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B45">45</abbr></abbrgrp>. Whether or not the TA pairs are encoded by genes forming an operon, the spacer seldom extends beyond 30 nucleotides, and a small overlap (1 to 20 nucleotides in general) is the most common structure. The order of the two cooperating genes is also well conserved, with the antitoxin being upstream (Figure <figr fid="F3">3</figr>), although there is an exception: in <it>higBA </it>loci the toxin is upstream of the antitoxin <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>General genetic context of a TA locus</p>
               </caption>
               <text>
                  <p>General genetic context of a TA locus. The typical organization of a TA locus is shown, with size and distance profiles.</p>
               </text>
               <graphic file="gb-2007-8-8-r155-3"/>
            </fig>
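            <p>The orientation and spacing constraints described above can be illustrated with a minimal sketch. It is not taken from the RASTA-Bacteria source code: the record type, the function names, and the use of hard cut-offs are assumptions made for illustration, and only the two limits (a spacer of at most 30 nucleotides and an overlap of at most 20 nucleotides) come from the text. The sketch takes no account of gene order, so it does not distinguish the usual antitoxin-first arrangement from the <it>higBA </it>exception.</p>

```python
from collections import namedtuple

# Hypothetical record: half-open coordinates on the forward strand.
Orf = namedtuple("Orf", "name start end strand")

MAX_SPACER = 30   # nucleotides; the spacer "seldom extends beyond 30"
MAX_OVERLAP = 20  # nucleotides; overlaps are "1 to 20 in general"


def signed_gap(first, second):
    """Distance from the end of the leftmost ORF to the start of the other.

    Positive values are spacers, negative values are overlaps.
    """
    left, right = sorted((first, second), key=lambda orf: orf.start)
    return right.start - left.end


def is_candidate_pair(first, second):
    """True if two ORFs are co-oriented and close enough to form a TA pair."""
    if first.strand != second.strand:
        return False
    gap = signed_gap(first, second)
    return MAX_SPACER >= gap >= -MAX_OVERLAP


if __name__ == "__main__":
    antitoxin = Orf("orfA", 100, 340, "+")
    toxin = Orf("orfB", 336, 700, "+")
    print(signed_gap(antitoxin, toxin), is_candidate_pair(antitoxin, toxin))
```

            <p>A real scoring scheme would more plausibly grade the distance than apply a strict yes/no test; the hard cut-offs are used here only to keep the example short.</p>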
            <p>TA genes in all prokaryotic species are small. According to Pandey <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, antitoxins are 41 to 206 amino acids long and toxins 31 to 204 amino acids long, antitoxins generally being shorter than their partner toxins (Figure <figr fid="F4">4</figr>). Here too there seems to be an exception: the toxin of the HipBA system is 440 amino acids in length (not shown).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Length distribution of bacterial toxins and antitoxins</p>
               </caption>
               <text>
                  <p>Length distribution of bacterial toxins and antitoxins. The graph represents the length distribution of antitoxins and toxins in 126 organisms (from [39]), depending on their classification (X-axis, length in amino acids; left Y-axis, number of sequences). The black curves represent the probability over the total population (1,378 TA) for a sequence of length X to constitute a TA (right Y-axis), and were used to determine the length-criterion scoring function as described in the text.</p>
               </text>
               <graphic file="gb-2007-8-8-r155-4"/>
            </fig>
            <p>These two features have been used with success as preliminary filters to a biological search for unidentified TA pairs in <it>E. coli </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, but this approach is too permissive to be accurate as an automatic predictor. By adding a third criterion, namely the presence of a conserved functional domain, the selectivity of the method over the input space can be improved. Furthermore, as the knowledge base of TA systems grows, sequence homology can provide further information.</p>
         </sec>
         <sec>
            <st>
               <p>ORF detection and filtering</p>
            </st>
            <p>To bypass the mis-annotation of TA genes, which, like many small ORFs, are easily omitted during the annotation process, the tool begins with a na&#239;ve ORF prediction. This first step is essential to ensure that the analysis leaves no possible ORF aside. RASTA-Bacteria thus starts by predicting the entire set of valid ORFs in the sequence, defined as the series of triplets occurring between one of the four accepted prokaryotic start codons (NTG), and one of the three stop codons (TGA, TAG, TAA), with no further assumption about the profile of the ORF. In the case of alternative start codons, redundancy is avoided by considering only the longest possible sequence. Although no possible ORFs should be overlooked, existing genomic information (in the case of an annotated genome, the preferred input) should not be ignored. Indeed, even if sometimes flawed, the original annotation can provide RASTA-Bacteria with valuable hints. Therefore, the tool recovers all the annotated features of the sequence, and compares the 'na&#239;ve' ORFs to the existing set of genes. If a na&#239;ve ORF overlaps an annotated gene (whose 'product' and 'confidence' fields do not display the terms 'unknown', 'putative', or 'hypothetical') by more than a threshold percentage (see parameters), then it is discarded as a spurious ORF. If the considered ORF corresponds to an annotated ORF, its score is rewarded to reflect the annotators' work, that is, the probability that this ORF actually encodes a protein. For reasons of consistency, this process also renames existing ORFs with their common designation.</p>
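            <p>The na&#239;ve ORF definition given above can be sketched as follows. This is an illustrative reimplementation in Python, not the Perl/BioPerl code of RASTA-Bacteria: the function names, the half-open coordinate convention, and the <it>min_codons </it>parameter (standing in for the minimal ORF size option) are assumptions. Only the definition itself is taken from the text: an ORF runs from one of the four NTG start codons to the next in-frame stop codon, and only the longest candidate is kept when several starts are possible.</p>

```python
# Illustrative sketch only; not the RASTA-Bacteria source code.
START_CODONS = {"ATG", "CTG", "GTG", "TTG"}  # the four NTG codons
STOP_CODONS = {"TGA", "TAG", "TAA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def naive_orfs_one_strand(seq, min_codons=0):
    """Yield (start, end) half-open coordinates of ORFs read on one strand.

    For each stop codon, only the most upstream in-frame start codon seen
    since the previous in-frame stop is kept, i.e. the longest candidate.
    """
    for frame in range(3):
        first_start = None  # most upstream start since the last stop
        for pos in range(frame, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if codon in STOP_CODONS:
                if first_start is not None:
                    n_codons = (pos - first_start) // 3  # stop excluded
                    if n_codons >= min_codons:
                        yield first_start, pos + 3
                first_start = None
            elif first_start is None and codon in START_CODONS:
                first_start = pos


def naive_orfs(seq, min_codons=0):
    """Return sorted (strand, start, end) ORFs found on both strands.

    Coordinates are always expressed on the forward strand.
    """
    seq = seq.upper()
    n = len(seq)
    found = [("+", s, e) for s, e in naive_orfs_one_strand(seq, min_codons)]
    for s, e in naive_orfs_one_strand(reverse_complement(seq), min_codons):
        found.append(("-", n - e, n - s))
    return sorted(found, key=lambda orf: (orf[1], orf[2], orf[0]))


if __name__ == "__main__":
    print(naive_orfs("CCATGAAATTGCCCTAGGG"))
```

            <p>In the example sequence the in-frame TTG is ignored because an ATG lies upstream of it in the same frame, which is how the longest-sequence rule removes redundancy between alternative start codons. The sketch does not handle ambiguous bases or ORFs that lack a stop codon before the end of the sequence.</p>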
         </sec>
         <sec>
            <st>
               <p>Conserved domain verification: a specific TA-dedicated database</p>
            </st>
            <p>Once the whole list of candidate ORFs is established, the ORFs undergo a conserved domain search. To achieve this, we use the Reverse PSI-BLAST program (RPSBLAST, part of the standalone blast archive, release 2.2.14 <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>), which searches a query sequence against a database of pre-computed lookup tables called PSSMs (position-specific scoring matrices), originating from the Pfam, Smart, COG, KOG and cd alignment collections (the complete archive of conserved domain PSSMs can be found at <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>). These profiles then need to be formatted as a usable database by the formatrpsdb tool <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. For our purposes, we thus built a dedicated TA conserved domains database (TAcddb), compiled from the existing profiles of domains known to belong to toxin and antitoxin genes (Table <tblr tid="T1">1</tblr>), against which all the amino acid sequences are searched. Consequently, TA systems whose conserved functional domains are not yet known are unfortunately liable to be penalized. However, combining different criteria tempers the risk of overlooking them, and the database can evolve, as it can be re-compiled with any new set of PSSMs.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>List of PSSM profiles selected in <it>TAcddb </it>to verify the presence of a conserved TA-related domain</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>PSSMid</p>
                     </c>
                     <c ca="left">
                        <p>CD accession name</p>
                     </c>
                     <c ca="left">
                        <p>Relation/involvement in TA world</p>
                     </c>
                     <c ca="center">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>28977</p>
                     </c>
                     <c ca="left">
                        <p>cd00093-HTH_XRE</p>
                     </c>
                     <c ca="left">
                        <p>XRE-like domain present in HigA and VapB antitoxins</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31586</p>
                     </c>
                     <c ca="left">
                        <p>COG1396-HipB</p>
                     </c>
                     <c ca="left">
                        <p>Involved in production of persister cells (antitoxin)</p>
                     </c>
                     <c ca="center">
                        <p>[20]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31676</p>
                     </c>
                     <c ca="left">
                        <p>COG1487-VapC</p>
                     </c>
                     <c ca="left">
                        <p>Quality control of gene expression</p>
                     </c>
                     <c ca="center">
                        <p>[57]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31786</p>
                     </c>
                     <c ca="left">
                        <p>COG1598</p>
                     </c>
                     <c ca="left">
                        <p>HicB of HicAB system (function undetermined)</p>
                     </c>
                     <c ca="center">
                        <p>[58]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31910</p>
                     </c>
                     <c ca="left">
                        <p>COG1724</p>
                     </c>
                     <c ca="left">
                        <p>HicA of HicAB system (function undetermined)</p>
                     </c>
                     <c ca="center">
                        <p>[58]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32033</p>
                     </c>
                     <c ca="left">
                        <p>COG1848</p>
                     </c>
                     <c ca="left">
                        <p>PIN domain, present in VapC toxins</p>
                     </c>
                     <c ca="center">
                        <p>[20,59,60]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32185</p>
                     </c>
                     <c ca="left">
                        <p>COG2002-AbrB</p>
                     </c>
                     <c ca="left">
                        <p>Domain present in MazE and VapB antitoxins</p>
                     </c>
                     <c ca="center">
                        <p>[20]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32209</p>
                     </c>
                     <c ca="left">
                        <p>COG2026-RelE</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of cytotoxic translational repressor system</p>
                     </c>
                     <c ca="center">
                        <p>[14,28,29]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32344</p>
                     </c>
                     <c ca="left">
                        <p>COG2161-StbD</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin of the RelBE family</p>
                     </c>
                     <c ca="center">
                        <p>[61]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32487</p>
                     </c>
                     <c ca="left">
                        <p>COG2336-MazE</p>
                     </c>
                     <c ca="left">
                        <p>Growth regulator (antitoxin)</p>
                     </c>
                     <c ca="center">
                        <p>[45]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32488</p>
                     </c>
                     <c ca="left">
                        <p>COG2337-MazF</p>
                     </c>
                     <c ca="left">
                        <p>Growth inhibitor (toxin)</p>
                     </c>
                     <c ca="center">
                        <p>[45]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32907</p>
                     </c>
                     <c ca="left">
                        <p>COG3093-VapI</p>
                     </c>
                     <c ca="left">
                        <p>Named from VapI region; corresponds to VapB antitoxins (Plasmid maintenance)</p>
                     </c>
                     <c ca="center">
                        <p>[62], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33351</p>
                     </c>
                     <c ca="left">
                        <p>COG3549-HigB</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of plasmid maintenance system</p>
                     </c>
                     <c ca="center">
                        <p>[23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33352</p>
                     </c>
                     <c ca="left">
                        <p>COG3550-HipA</p>
                     </c>
                     <c ca="left">
                        <p>Involved in production of persister cells (toxin)</p>
                     </c>
                     <c ca="center">
                        <p>[20]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33408</p>
                     </c>
                     <c ca="left">
                        <p>COG3609</p>
                     </c>
                     <c ca="left">
                        <p>CopG/Arc/MetJ DNA-binding domain, present in RelB, ParD, VapBC and CcdA antitoxins</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33452</p>
                     </c>
                     <c ca="left">
                        <p>COG3654-Doc</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of probable translational inhibitor system</p>
                     </c>
                     <c ca="center">
                        <p>[25,63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33466</p>
                     </c>
                     <c ca="left">
                        <p>COG3668-ParE</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of plasmid stabilization system</p>
                     </c>
                     <c ca="center">
                        <p>[22,64]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33870</p>
                     </c>
                     <c ca="left">
                        <p>COG4113</p>
                     </c>
                     <c ca="left">
                        <p>PIN domain, present in VapC toxins</p>
                     </c>
                     <c ca="center">
                        <p>[20,59,60]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33875</p>
                     </c>
                     <c ca="left">
                        <p>COG4118-Phd</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin to translational inhibitor Doc</p>
                     </c>
                     <c ca="center">
                        <p>[65]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33951</p>
                     </c>
                     <c ca="left">
                        <p>COG4226-HicB</p>
                     </c>
                     <c ca="left">
                        <p>HicB of HicAB system (predicted)</p>
                     </c>
                     <c ca="center">
                        <p>[58]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>34119</p>
                     </c>
                     <c ca="left">
                        <p>COG4423</p>
                     </c>
                     <c ca="left">
                        <p>Predicted antitoxin of PIN domain toxins (VapC)</p>
                     </c>
                     <c ca="center">
                        <p>[57,60]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>34135</p>
                     </c>
                     <c ca="left">
                        <p>COG4456-VagC</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin of plasmid maintenance system</p>
                     </c>
                     <c ca="center">
                        <p>[66]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>34307</p>
                     </c>
                     <c ca="left">
                        <p>COG4691-StbC</p>
                     </c>
                     <c ca="left">
                        <p>Plasmid stability proteins (HigBA family)</p>
                     </c>
                     <c ca="center">
                        <p>[67,68], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>34891</p>
                     </c>
                     <c ca="left">
                        <p>COG5302-CcdA</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin of plasmid stabilization system</p>
                     </c>
                     <c ca="center">
                        <p>[27,69]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>35058</p>
                     </c>
                     <c ca="left">
                        <p>COG5499</p>
                     </c>
                     <c ca="left">
                        <p>Predicted transcription regulators with HTH domain</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>41431</p>
                     </c>
                     <c ca="left">
                        <p>pfam01381-Hth_3</p>
                     </c>
                     <c ca="left">
                        <p>Present in antitoxins of HigBA and VapBC families</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>41452</p>
                     </c>
                     <c ca="left">
                        <p>pfam01402-Hth_4</p>
                     </c>
                     <c ca="left">
                        <p>Present in CopG repressors (RelBE, ParDE, VapBC, and CcdAB families)</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>41869</p>
                     </c>
                     <c ca="left">
                        <p>pfam01845-CcdB</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of plasmid stabilization system</p>
                     </c>
                     <c ca="center">
                        <p>[69]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>41874</p>
                     </c>
                     <c ca="left">
                        <p>pfam01850-PIN</p>
                     </c>
                     <c ca="left">
                        <p>DNA binding PIN domain, present in VapC toxins</p>
                     </c>
                     <c ca="center">
                        <p>[59,60]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>42429</p>
                     </c>
                     <c ca="left">
                        <p>pfam02452-PemK</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of the MazEF family</p>
                     </c>
                     <c ca="center">
                        <p>[70]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>43931</p>
                     </c>
                     <c ca="left">
                        <p>pfam04014-AbrB</p>
                     </c>
                     <c ca="left">
                        <p>Domain present in MazE and VapB antitoxins</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>44135</p>
                     </c>
                     <c ca="left">
                        <p>pfam04221-RelB</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin to translational repressor RelE</p>
                     </c>
                     <c ca="center">
                        <p>[14]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>44915</p>
                     </c>
                     <c ca="left">
                        <p>pfam05012-Doc</p>
                     </c>
                     <c ca="left">
                        <p>Toxin of probable translational inhibitor system</p>
                     </c>
                     <c ca="center">
                        <p>[63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>44918</p>
                     </c>
                     <c ca="left">
                        <p>pfam05015-Plasmid_killer</p>
                     </c>
                     <c ca="left">
                        <p>Toxins of the HigBA family</p>
                     </c>
                     <c ca="center">
                        <p>[23], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>44919</p>
                     </c>
                     <c ca="left">
                        <p>pfam05016-Plasmid_stabil</p>
                     </c>
                     <c ca="left">
                        <p>Toxins of the RelE family</p>
                     </c>
                     <c ca="center">
                        <p>[14], this study</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>45431</p>
                     </c>
                     <c ca="left">
                        <p>pfam05534-HicB</p>
                     </c>
                     <c ca="left">
                        <p>Member of the HicAB system</p>
                     </c>
                     <c ca="center">
                        <p>[58]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>47246</p>
                     </c>
                     <c ca="left">
                        <p>pfam07362-CcdA</p>
                     </c>
                     <c ca="left">
                        <p>Antitoxin of plasmid stabilization system</p>
                     </c>
                     <c ca="center">
                        <p>[27,69]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>47831</p>
                     </c>
                     <c ca="left">
                        <p>smart00530-Xre</p>
                     </c>
                     <c ca="left">
                        <p>XRE-like HTH domain present in HigA and VapB</p>
                     </c>
                     <c ca="center">
                        <p>[20], this study</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>References to 'this study' correspond to domains found in this study upon sequence analysis of described TA candidates. AbrB, AidB regulator; HTH, helix-turn-helix; PIN, homologues of the pilin biogenesis protein pilT amino-terminal domain; XRE, xenobiotic response element.</p>
               </tblfn>
            </tbl>
            <p>For each candidate, the hits are analyzed to select the most likely one in terms of both homology and sequence alignment length. If the candidate ORF exhibits clear homology, namely a high score and over 80% of a full product domain aligned, but is longer than the corresponding profile, it is scanned for alternative start codons to identify any other 5' end that gives a better profile fit. If one is found, the ORF is resized to its new coordinates. A short description of the possible domain is stored for subsequent display as a hint to the user for further classification, with an internal hyperlink to the alignment: again, no information is discarded and all the results can be visually assessed. Here, each reference domain used is weighted with a coefficient representing its specificity to TA systems: those that define a confirmed TA family have a higher coefficient than domains found in TAs but not exclusive to them (for example, PIN versus VapB domain). This coefficient is combined with the alignment data to yield the 'domain score'.</p>
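            <p>As an illustration only, the two decisions described above (whether an ORF should be rescanned for an alternative start, and how a domain weight might enter the score) can be sketched as follows. The text does not give the formula used to combine the coefficient with the alignment data, so the product used here, the function names and the example coefficient are all assumptions rather than the RASTA-Bacteria implementation.</p>

```python
def domain_coverage(aligned_length, profile_length):
    """Fraction of the full domain profile covered by the alignment."""
    return aligned_length / profile_length


def needs_start_rescan(orf_length, aligned_length, profile_length,
                       min_coverage=0.8):
    """True when over 80% of the profile is aligned but the ORF is longer
    than the profile, i.e. an alternative start codon may fit better.
    All lengths are in amino acids (an assumption of this sketch)."""
    covered = domain_coverage(aligned_length, profile_length) > min_coverage
    return covered and orf_length > profile_length


def domain_score(bit_score, aligned_length, profile_length, coefficient):
    """Hypothetical combination: weight * coverage * alignment score.
    The real combination is not specified in the text."""
    return coefficient * domain_coverage(aligned_length, profile_length) * bit_score


# Hypothetical weight: 1.0 for a domain that defines a TA family.
print(needs_start_rescan(orf_length=140, aligned_length=90, profile_length=100))
print(domain_score(bit_score=50.0, aligned_length=90, profile_length=100,
                   coefficient=1.0))
```

            <p>In this sketch a domain shared with non-TA proteins would simply be given a smaller coefficient, which lowers its contribution without discarding the hit.</p>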
         </sec>
         <sec>
            <st>
               <p>The length criterion</p>
            </st>
            <p>The candidates proceed to a size-scoring module. Based on the lengths of 1,378 TA sequences (Figure <figr fid="F4">4</figr>) described following the extensive search by Pandey's team <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, we calculated the probability for length l of a candidate to be that of a toxin or an antitoxin as follows:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2007-8-8-r155-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>L</m:mi>
                           <m:mo>=</m:mo>
                           <m:mi>l</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>n</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mi>N</m:mi>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="application/x-tex">P(L=l)=\frac{n_l}{N}</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where the numerator is the number of reference sequences of length <it>l </it>and <it>N </it>= 1,378. We then defined our scoring function by averaging the probability over <it>k </it>neighboring lengths before and after the considered length such that:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2007-8-8-r155-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>f</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>l</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mrow>
                                 <m:mn>2</m:mn>
                                 <m:mi>k</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:mfrac>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                                 <m:mi>k</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>L</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mi>l</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="application/x-tex">f(l)=\frac{1}{2k+1}\sum_{i=-k}^{k}P(L=l+i)</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>This smooths the probability curve to some extent, as it prevents accidentally high or low counts for a given length from being given undue weight with respect to surrounding lengths. Several datasets were created so that the scoring function reflects the different types of organisms: general, archaea, Gram-negative and Gram-positive. The user can thus choose which model to use depending on the species being considered. By contrast, although defining size functions for each of the seven TA families is at first sight appealing, it should be emphasized that automatic classification of TA loci is risky. This is due to diverging homologies: some toxin motifs pair with antitoxin motifs, or more simply toxins/antitoxins of a given family sometimes demonstrate similarity with those of another family <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Therefore, relying on such specific characteristics for the size criterion evaluation might lead to mis-scoring.</p>
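            <p>The two formulas above can be illustrated with a minimal sketch. The toy list of lengths below is invented for the example and is not the set of 1,378 reference sequences; the function names are ours, and lengths absent from the reference set are assumed to have probability zero.</p>

```python
from collections import Counter


def length_probabilities(lengths):
    """Map each observed length l to P(L = l) = n_l / N."""
    counts = Counter(lengths)
    total = len(lengths)
    return {length: n / total for length, n in counts.items()}


def smoothed_score(probs, length, k):
    """f(l): mean of P(L = l + i) for i = -k..k.
    Lengths never observed contribute 0 (assumption of this sketch)."""
    window = range(length - k, length + k + 1)
    return sum(probs.get(j, 0.0) for j in window) / (2 * k + 1)


# Toy reference lengths (amino acids), for illustration only.
toy_lengths = [70, 71, 71, 72, 90, 90, 90, 91, 95, 120]
probs = length_probabilities(toy_lengths)

# With k = 1 the score for length 71 averages P(70), P(71) and P(72).
print(smoothed_score(probs, 71, 1))
```

            <p>A separate table of probabilities would be built in the same way for each organism model (general, archaea, Gram-negative, Gram-positive).</p>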
         </sec>
         <sec>
            <st>
               <p>ORF 'pair organization' scoring criterion</p>
            </st>
            <p>Finally, the method verifies that the ORFs are paired on the strand considered. To do so, the module searches for close neighbors upstream and downstream of the ORF, in agreement with the distance parameter described above: a neighbor is considered close if it lies less than 30 base pairs away from either end of the ORF or, if the two overlap, if the overlap is shorter than 20 base pairs. In practice, both values can be somewhat enlarged, so as to avoid losing candidates when the span of an ORF has been extended by alternative start codons. If an ORF fits these criteria, its score is increased. Furthermore, if the neighbor exhibits a TA length and/or a TA domain, the score is given the corresponding bonus. This reduces the chances that fortuitous operons, or operons clearly characterized as non-TA, end up among the top candidates.</p>
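            <p>A minimal sketch of the neighbor test follows, assuming 1-based inclusive coordinates and reading the rule as "gap shorter than 30 base pairs, or overlap shorter than 20 base pairs". The <it>Orf </it>type and the function names are hypothetical, and the bonus applied to the score is not shown because its value is not given in the text.</p>

```python
from typing import NamedTuple


class Orf(NamedTuple):
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    strand: str  # "+" or "-"


MAX_GAP = 30      # base pairs, threshold quoted in the text
MAX_OVERLAP = 20  # base pairs, threshold quoted in the text


def signed_gap(a, b):
    """Base pairs separating two ORFs; negative values are overlaps."""
    first, second = sorted((a, b), key=lambda orf: orf.start)
    return second.start - first.end - 1


def is_close_neighbor(orf, other, max_gap=MAX_GAP, max_overlap=MAX_OVERLAP):
    """True if `other` lies on the same strand and within the distance limits."""
    if orf.strand != other.strand:
        return False
    gap = signed_gap(orf, other)
    if gap >= 0:
        return max_gap > gap
    return max_overlap > -gap


candidate = Orf(100, 400, "+")
print(is_close_neighbor(candidate, Orf(410, 700, "+")))  # 9 bp gap
print(is_close_neighbor(candidate, Orf(450, 700, "+")))  # 49 bp gap
```

            <p>Both thresholds are exposed as parameters because, as noted above, they can be enlarged in practice.</p>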
         </sec>
      </sec>
      <sec>
         <st>
            <p>RASTA-Bacteria in action</p>
         </st>
         <p>All tests reported in this section were carried out with annotated .gbk files downloaded on 1 September 2006 from the RefSeq repository <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, on a Mac PowerPC G5 with Mac OS X v.10.3.9. For multi-replicon organisms, all episomes were included in the analysis. Running times were between 40 s (for a 600 kb genome) and 33 minutes (for a 9 Mb genome).</p>
         <sec>
            <st>
               <p>Application to the alpha-proteobacteria model: <it>Sinorhizobium meliloti</it></p>
            </st>
            <p><it>S. meliloti </it>is a Gram-negative alpha-proteobacterium studied in our laboratory that is found both free-living in soil and in a symbiotic interaction with alfalfa, where it forms root nodules. Its genome is made up of a 3.65 Mb circular chromosome and two essential megaplasmids, pSymA (1.35 Mb) and pSymB (1.68 Mb), all of them being GC rich (62.2% global) <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. These features (a large, tripartite genome with a recently acquired plasmid, and both free-living and symbiotic lifestyles) make <it>S. meliloti </it>an interesting model for the validation of RASTA-Bacteria. In the 2005 search by Pandey <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, 12 TA systems (2 <it>relBE</it>-like, 3 <it>higBA</it>-like, and 7 <it>vapBC</it>-like) were identified, but only the chromosome was considered. We analyzed all three replicons with RASTA-Bacteria, as they are all constituents of the complete genome. Of the 12 systems identified by Pandey <it>et al</it>., 11 were positively discriminated by RASTA-Bacteria, including the <it>ntrPR </it>operon, which was recently shown to function as a TA system <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, demonstrating the good accuracy of our software. The 12th one (<it>higBA-2</it>, GI15965582-15965583) was only poorly rewarded by the method described here; indeed, none of the TA domain profiles corresponding to its described classification (nor others) were matched by the members of this TA pair, which furthermore do not fit the size and distance criteria. Further sequence analysis did reveal similarity with a putative addiction module killer protein for the amino-terminal half of gene 15965582, but a second conserved domain in its carboxy-terminal half, as well as the conserved domains ('ABC transporter') found in its reported partner, argue against this pair constituting a valid TA system. There is thus no concrete evidence that enables us to confirm this hypothesis.</p>
            <p>We found 14 additional putative TA loci on the chromosome (bringing the population to 25 for this replicon), 17 loci on pSymA and 11 on pSymB (Figure <figr fid="F5">5a</figr>). Hence, our approach predicts a total of 53 TA loci in the complete genome of <it>S. meliloti</it>, including 95 genes of which 18 are newly identified. Their distribution across the various replicons seems random, although there is an apparent alternation of rich and poor areas, in particular in the megaplasmids (Figure <figr fid="F6">6</figr>). Similarly, they are remarkably evenly distributed between lagging and leading strands (Figure <figr fid="F5">5c</figr>). Relative to the sizes of the replicons, pSymA, suspected to have been acquired more recently in the genome, contains twice as many TA loci as the other replicons (Figure <figr fid="F5">5b</figr>). Interestingly, the genetic organizations are diverse, although pairs remain the most frequent (71.6% of genes): 12 genes in 4 triplets, 68 genes in 34 pairs and 15 solitary genes (12 encode antitoxins and 3 encode toxins, one of them being the chromosomal <it>relE</it>; Figure <figr fid="F5">5d</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>TA loci features in individual replicons of <it>S. meliloti </it>strain 1021</p>
               </caption>
               <text>
                  <p>TA loci features in individual replicons of <it>S. meliloti </it>strain 1021. <b>(a) </b>Distribution of TA loci in the chromosome (new loci and loci confirming the findings of Pandey <it>et al</it>. [39]) and in the two megaplasmids. <b>(b) </b>Percentage of TA loci as a function of replicon size. <b>(c) </b>Distribution with respect to leading and lagging strands of replication. <b>(d) </b>Frequency of the three genomic organizations found for TA genes in the three replicons.</p>
               </text>
               <graphic file="gb-2007-8-8-r155-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Maps of TA loci in individual replicons of <it>S. meliloti </it>strain 1021</p>
               </caption>
               <text>
                  <p>Maps of TA loci in individual replicons of <it>S. meliloti </it>strain 1021. The maps were created using CGView [53,54]. Green labels represent newly annotated TA genes, and orange labels represent RASTA-Bacteria predicted TA genes previously reported by Pandey <it>et al</it>. [39]. On the chromosome, the grey SmeXXX regions correspond to genomic islands as described in the Islander database [55,56].</p>
               </text>
               <graphic file="gb-2007-8-8-r155-6"/>
            </fig>
            <p>The classification of candidates into families according to sequence homology alone is a tedious task. Nevertheless, it seems the two major families are <it>vapBC</it>, consistent with the findings of Pandey <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, and <it>relBE</it>. No <it>ccdAB </it>locus was found, but the results indicate there may be <it>parDE </it>and <it>phd/doc </it>members (distributed on all three replicons) among the candidates, as well as one <it>mazEF </it>pair, situated on pSymB.</p>
         </sec>
         <sec>
            <st>
               <p>RASTA-Bacteria results compared to those from previous studies</p>
            </st>
            <p>Our tool proved to be efficient and fast for the bacterium <it>S. meliloti</it>, which was used for its design. The effectiveness of RASTA-Bacteria for other sequences was first assessed using 14 prokaryotes previously studied by Pandey <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> (Table <tblr tid="T2">2</tblr>): three gamma-proteobacteria (<it>E. coli </it>as an AT-rich generic model, <it>Coxiella burnetii </it>as an obligate host-associated organism and <it>Pseudomonas aeruginosa </it>as a free-living, GC-rich bacterium); two alpha-proteobacteria (<it>Bradyrhizobium japonicum</it>, which has a large chromosome with significant horizontal rearrangements, and <it>Agrobacterium tumefaciens</it>, which has both circular and linear chromosomes); the genome with the largest predicted set of TA loci (<it>Nitrosomonas europaea </it><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>); two free-living firmicutes (<it>Lactococcus lactis</it>, <it>Bacillus anthracis</it>); one epsilon-proteobacterium (<it>Campylobacter jejuni</it>); three obligate host-associated organisms (<it>Rickettsia prowazekii</it>, <it>Buchnera aphidicola</it>, and <it>Mycobacterium leprae</it>, for which Pandey <it>et al</it>. did not find any TA loci); and members of the extremophilic Aquificae and Thermotogae phyla (<it>Thermotoga maritima</it>, <it>Aquifex aeolicus</it>). Also, to assess the range of applicability of our tool, we tested the archaeon <it>Sulfolobus tokodaii</it>. The result files for all these species as well as for <it>S. meliloti </it>are available in the 'Pre-computed Data' section of our website <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Results for 14 previously studied organisms</p>
               </caption>
               <tblbdy cols="11">
                  <r>
                     <c ca="left">
                        <p>Organism</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>ccdAB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>higBA</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>mazEF</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>parDE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>phd/doc</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>relBE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>vapBC</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>hipBA</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Unclass.</p>
                     </c>
                     <c ca="center">
                        <p>Total</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="11">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Aquifex aeolicus VF5</it>*</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>6 (2)</p>
                     </c>
                     <c ca="center">
                        <p>2 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>9 (2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>A. tumefaciens str. C58</it>*</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>3 (3)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>7 (7)</p>
                     </c>
                     <c ca="center">
                        <p>5 (3)</p>
                     </c>
                     <c ca="center">
                        <p>6 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>24 (14)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bacillus anthracis Ames</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bacillus subtilis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bradyrhizobium japonicum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>4 (4)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>6 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>12 (5)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Borrelia afzelii Pko</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Buchnera aphidicola str.</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Campylobacter jejuni</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C. pneumoniae CWL029</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Coxiella burnetii RSA 493</it>*</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>3 (3)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>2 (2)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>10 (7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Escherichia coli K12</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>2 (2)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>4 (3)</p>
                     </c>
                     <c ca="center">
                        <p>2 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>10 (6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Haemophilus ducreyi</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Lactococcus lactis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>7 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium leprae TN</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma genitalium</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nitrosomonas europaea</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>8 (7)</p>
                     </c>
                     <c ca="center">
                        <p>5 (5)</p>
                     </c>
                     <c ca="center">
                        <p>6 (6)</p>
                     </c>
                     <c ca="center">
                        <p>2 (2)</p>
                     </c>
                     <c ca="center">
                        <p>10 (10)</p>
                     </c>
                     <c ca="center">
                        <p>20 (14)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>57 (45)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochlorococcus marinus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pseudomonas aeruginosa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>5 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>2 (0)</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>13 (3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Sinorhizobium meliloti</it>*</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>6 (3)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>8 (0)</p>
                     </c>
                     <c ca="center">
                        <p>3 (0)</p>
                     </c>
                     <c ca="center">
                        <p>2 (2)</p>
                     </c>
                     <c ca="center">
                        <p>27 (7)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>53 (12)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sulfolobus tokodaii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>3 (3)</p>
                     </c>
                     <c ca="center">
                        <p>4 (4)</p>
                     </c>
                     <c ca="center">
                        <p>29 (25)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>37 (32)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (1)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2 (1)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Numbers are TA systems (singleton or doublet) predicted by RASTA-Bacteria; numbers in parentheses are those predicted by Pandey <it>et al</it>. [39]. *Plasmids were not included in the analysis by Pandey <it>et al</it>. [39]. Unclass., unclassified.</p>
               </tblfn>
            </tbl>
            <p>RASTA-Bacteria identified all TA loci previously predicted by Pandey <it>et al</it>. except for one locus in <it>S. meliloti </it>(see above) and one <it>higBA </it>system in <it>B. japonicum</it>, which was not retained because the confidence score was too low (although there are conserved domains, they are ambiguous and were not included in TAcddb). The absence of detectable TA genes from the three obligate host-associated organisms tested (<it>R. prowazekii</it>, <it>B. aphidicola</it>, <it>M. leprae</it>) was confirmed, as was the presence of a single TA locus in <it>Bacillus </it>sp. Our tool was more sensitive than the previously used method: in all other tested genomes, RASTA-Bacteria identified a large number of new candidate loci. This was largely due to detection of potential members of the <it>higBA</it>, <it>relBE </it>and <it>hipBA </it>families and especially the <it>vapBC </it>family. For example, even in the case of the well-documented model <it>E. coli</it>, RASTA-Bacteria predicts at least four new TA pairs with high confidence (<it>yfeD/yfeC</it>, <it>yafN/yafO</it>, <it>ygjN/ygjM </it>and <it>sohA/yahV</it>). In addition, the <it>ygiT/b3022</it>, <it>ydcQ/yncN </it>and <it>ydaS/ydaT </it>loci each have at least one member with a conserved domain commonly found in antitoxins, and ranked higher than published TA genes. Finally, YbaQ shows near-perfect identity with the profile corresponding to VapB antitoxins but has no physically close partner, so it is most likely a solitary antitoxin, the first such to be reported in <it>E. coli</it>.</p>
            <p>Ten previously undescribed TA systems were identified in the four replicons of <it>A. tumefaciens </it>(Table <tblr tid="T2">2</tblr>), although only the two chromosomes were previously studied. RASTA-Bacteria confirmed the 14 systems previously reported and identified 5 additional (orphan) loci on the circular chromosome, 1 full pair and 1 orphan gene on the linear chromosome, and 2 TA systems on plasmid AT. It revealed that plasmid Ti carries no plasmid addiction systems, although it does have a gene resembling <it>hipA </it>(Atu6158, GI|17939291). However, this candidate is substantially shorter than its reference, so it is unlikely to be functional, and it is almost 60 kb away from any possible <it>hipB </it>candidate.</p>
            <p>We also assessed the sensitivity of our tool by examining genomes containing many TA loci, including that of <it>N. europaea</it>, reported to have no fewer than 45 TA loci, representing 88 genes. The RASTA-Bacteria scan of the genome of <it>N. europaea </it>yielded high confidence scores for 76 of these previously identified genes (86%), a confidence score between 50% and 70% for 11 (12.5%) and an unranked score for 1. It identified 11 additional TA loci on the <it>N. europaea </it>chromosome, if the <it>hipBA </it>locus is taken into account. Three are clearly <it>vapBC </it>pairs, although one consists of two relatively short and possibly disrupted genes, raising doubt about whether this pair is functional. The NE2103/NE2104 pair gave an intermediate confidence score, but has characteristics consistent with it being a TA system. NE1375/NE1376 may well define a new MazEF-like system. Finally, three orphan <it>vapB </it>and two orphan <it>higA </it>genes were found: it would be interesting to determine whether they are silent relics of ancient systems or are still active and perform a function. Remarkably, all these newly identified loci map in the same regions as the previously discovered systems, reinforcing the observation that TA loci in <it>N. europaea </it>cluster in particular regions of the genome.</p>
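            <p>The breakdown of the 88 previously identified <it>N. europaea </it>genes can be re-derived from the counts quoted above. The following is only an illustrative sketch in Python, not part of RASTA-Bacteria; the band labels and the helper name share are hypothetical and chosen here for readability.</p>

```python
# Counts quoted in the text for the 88 previously identified N. europaea TA genes.
# The dictionary keys are illustrative labels, not RASTA-Bacteria output fields.
counts = {"high": 76, "intermediate": 11, "unranked": 1}
total = sum(counts.values())


def share(n, total):
    """Return n as a percentage of total, rounded to one decimal place."""
    return round(100 * n / total, 1)


# Percentage of previously identified genes falling in each score band.
shares = {band: share(n, total) for band, n in counts.items()}

for band, pct in shares.items():
    print(f"{band}: {counts[band]} of {total} genes ({pct}%)")
```

            <p>The three counts sum to the 88 genes reported for the 45 loci, and the rounded shares agree with the 86% and 12.5% given in the text.</p>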
            <p>We also applied our tool to organisms where no TA loci had been found previously, including <it>L. lactis</it>, in which we predict ten possible TA loci, eight of which consist of an orphan gene containing a region encoding the same HTH_DNA-binding (helix-turn-helix) profile.</p>
            <p>Finally, the archaeon with the most TA loci was <it>S. tokodaii</it>, with 32 TA loci <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. RASTA-Bacteria confirmed 52 of the 61 genes at these 32 TA loci (3 singletons): the STS188/ST1628 and ST2136/37 pairs gave low scores because of an extreme overlap or because of an alternative start codon causing a bias in the size scoring process. The results for five other genes cannot be interpreted with certainty, but observations in other organisms, where orphan TA genes do not seem uncommon, suggest that some loci predicted to be pairs might in fact belong to the single-gene loci category. Nevertheless, four additional TA loci were identified, two of them being standard pairs of the VapBC family. The TA loci are unevenly distributed along the chromosome: two regions of approximately 240 and 440 kb seem to be devoid of TA loci, and the loci appear to be clustered in particular regions (data not shown).</p>
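            <p>The gene counts quoted for <it>S. tokodaii </it>can be cross-checked with simple bookkeeping. The sketch below is illustrative Python only; the variable names are hypothetical, and it assumes that both genes of each of the two low-scoring pairs count among the unconfirmed genes.</p>

```python
# Figures quoted in the text for the previously predicted S. tokodaii loci.
loci = 32
singletons = 3
pairs = loci - singletons          # two-gene loci
genes = 2 * pairs + singletons     # total genes at the 32 loci

confirmed = 52
low_score_pairs = 2                # STS188/ST1628 and ST2136/37
uncertain_genes = 5                # results that cannot be interpreted with certainty

# Assumption: both genes of each low-scoring pair are counted as unconfirmed.
unconfirmed = 2 * low_score_pairs + uncertain_genes

print(f"genes at previously predicted loci: {genes}")
print(f"confirmed + unconfirmed: {confirmed + unconfirmed}")
```

            <p>Under that assumption, the 52 confirmed genes, the 4 genes of the two low-scoring pairs and the 5 uncertain genes together account for all 61 genes.</p>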
         </sec>
         <sec>
            <st>
               <p>Application to newly selected genomes</p>
            </st>
            <p>We performed a second round of analyses on newly selected, mostly recently published genomes (Table <tblr tid="T3">3</tblr>). At least three genomes from each phylogenetic branch and lifestyle were examined, except where fewer complete genome sequences were available.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>RASTA-Bacteria predictions for newly sequenced genomes</p>
               </caption>
               <tblbdy cols="11">
                  <r>
                     <c ca="left">
                        <p>Organism</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>ccdAB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>higBA</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>mazEF</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>parDE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>phd/doc</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>relBE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>vapBC</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>hipBA</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Unclassified</p>
                     </c>
                     <c ca="center">
                        <p>Total</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="11">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>A. phagocytophilum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Arthrobacter aurescens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Azoarcus </it>sp. BH72</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bartonella bacilliformis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Burkholderia xenovorans</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C. P. amoebophila</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Dehalococcoides </it>sp.</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Deinococcus geothermalis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ehrlichia chaffeensis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Frankia alni </it>ACN14a
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Gramella forsetii </it>KT0803
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Granulibacter bethesdensis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Lawsonia intracellularis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Magnetococcus </it>sp.</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanococcoides burtonii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanospirillum hungatei</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium bovis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Mycobacterium </it>sp. KMS</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Myxococcus xanthus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nanoarchaeum equitans</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. yellows phytoplasma</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>P. naphthalenivorans</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrobaculum islandicum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Shewanella </it>sp.</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermofilum pendens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Trichodesmium erythraeum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Wigglesworthia</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     