<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-4-r34</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Identification and functional characterization of <it>cis</it>-regulatory elements in the apicomplexan parasite <it>Toxoplasma gondii</it></p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Mullapudi</snm>
               <fnm>Nandita</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>mnandita@gmail.com</email>
            </au>
            <au id="A2">
               <snm>Joseph</snm>
               <mi>J</mi>
               <fnm>Sandeep</fnm>
               <insr iid="I1"/>
               <email>sandeepjosejoseph@gmail.com</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Kissinger</snm>
               <mi>C</mi>
               <fnm>Jessica</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>jkissing@uga.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Genetics, University of Georgia, East Green Street, Athens, Georgia, 30602, USA</p>
            </ins>
            <ins id="I2">
               <p>Center for Tropical and Emerging Global Diseases, University of Georgia, DW Brooks Drive, Athens, Georgia, 30602, USA</p>
            </ins>
            <ins id="I3">
               <p>Current address: Department of Pulmonary Medicine, Albert Einstein College of Medicine, Morris Park Ave, Bronx, New York, NY 10461, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>4</issue>
         <fpage>R34</fpage>
         <url>http://genomebiology.com/2009/10/4/R34</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19351398</pubid>
               <pubid idtype="doi">10.1186/gb-2009-10-4-r34</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>11</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>7</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>7</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Mullapudi et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Toxoplasma gondii regulatory elements</p>
      </shorttitle>
      <shortabs>
         <p>Mining of genomic sequence data of the apicomplexan parasite Toxoplasma gondii identifies putative cis-regulatory elements using a de novo approach.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p><it>Toxoplasma gondii </it>is a member of the phylum Apicomplexa, which consists entirely of parasitic organisms that cause several diseases of veterinary and human importance. Fundamental mechanisms of gene regulation in this group of protistan parasites remain largely uncharacterized. Owing to their medical and veterinary importance, genome sequences are available for several apicomplexan parasites. Their genome sequences reveal an apparent paucity of known transcription factors and the absence of canonical <it>cis</it>-regulatory elements. We have approached the question of gene regulation from a sequence perspective by mining the genomic sequence data to identify putative <it>cis</it>-regulatory elements using a <it>de novo </it>approach.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have identified putative <it>cis</it>-regulatory elements present upstream of functionally related groups of genes and subsequently characterized the function of some of these conserved elements using reporter assays in the parasite. We show a sequence-specific role in gene expression for seven out of eight identified elements.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>This work demonstrates the power of pure sequence analysis in the absence of expression data or <it>a priori </it>knowledge of regulatory elements in eukaryotic organisms with compact genomes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p><it>Toxoplasma gondii </it>is an obligate intracellular parasite belonging to the phylum Apicomplexa. The <it>T. gondii </it>genome is approximately 63 Mb, contains approximately 7,800 protein-encoding genes and has a GC content of 52%. Despite its reduced genome, the parasite exhibits a complex developmental life cycle wherein it is capable of switching between a rapidly dividing tachyzoite form and a quiescent bradyzoite form within the asexual stage of its life cycle <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. During its asexual stage, it exhibits a wide host range, capable of infecting a variety of warm-blooded animals. Infection is of greater concern in AIDS or immunosuppressed patients, where it can lead to neurological, mental and ocular defects. It is also responsible for human birth defects and spontaneous abortion as a result of trans-placental transmission in infected pregnant women <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Given its wide host-range and medical importance, understanding fundamental processes of gene regulation is important for developing methods aimed at controlling infection and disease.</p>
         <p>There are many levels at which organisms can control gene expression, including chromatin-mediated modifications, transcriptional and post-transcriptional regulation, and post-translational regulation <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Transcription factors that mediate transcriptional regulation can be sequence-specific DNA-binding proteins that are involved in gene-specific regulation, or more general RNA polymerase II components that are required for transcription initiation. Promoters in unicellular eukaryotes such as <it>Saccharomyces cerevisiae </it>have a bi-partite structure consisting of a core promoter located close to the start of transcription and upstream activator sequences, a few hundred base pairs away, that contain binding sites for sequence-specific transcription factors. In metazoans, additional, more distal elements, such as enhancers and insulator elements, provide for more specific fine-tuning of gene regulation <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Very little is known about how <it>T. gondii </it>and other apicomplexan parasites regulate their genes. A relatively small number of gene-specific studies in <it>T. gondii </it>have identified non-canonical <it>cis</it>-regulatory elements indicative of a bi-partite promoter organization that were found to play a role in downstream gene expression <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Preliminary surveys of the complete genome sequence have revealed a paucity of known specialized transcription factors encoded in the genome <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Recent studies have focused on dissecting the developmental signals responsible for inter-conversion between the tachyzoite and bradyzoite developmental stages and the preferential gene expression that characterizes these stages. To this end, the study of stage-specific genes and their promoters <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> has revealed the presence of <it>cis</it>-regulatory elements in the promoter region that are responsible for preferential gene expression in different life cycle stages. Large-scale analyses of gene expression from key developmental life cycle stages <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> point to the absence of chromosomal clustering of co-expressed genes, and the presence of unique stage-specific mRNAs in each developmental stage. However, promoter organization and the presence of specialized transcription factors for their recognition remain largely unexplored areas. The medical importance combined with the evolutionary divergence of the apicomplexan parasites relative to model organisms has motivated a rapidly growing collection of genome sequencing efforts for this group.</p>
         <p>Sequence information provides us with a starting point to identify <it>cis</it>-acting signals in the genome and to uncover underlying gene-regulatory mechanisms. Sequence analysis to identify conserved <it>cis</it>-regulatory signals is typically augmented by at least one of two types of information: the organization of regulons and known sequences of conserved transcription factor binding sites, or large-scale gene expression information (for example, from microarray studies) that provides data sets of co-regulated genes within which conserved transcription factor binding sites can be identified <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Known canonical eukaryotic <it>cis</it>-elements have not yet been reported in <it>T. gondii</it>. In the absence of this starting information, we have adopted a <it>de novo </it>approach to identify conserved sequence elements that could serve as putative <it>cis</it>-regulatory elements. We have then experimentally tested these candidate elements in the parasite, establishing their role in gene expression. Our study includes four different groups of genes that share parasite-specific or metabolic functions. We describe a computational framework for the identification of novel <it>cis</it>-regulatory elements in eukaryotic non-model systems, particularly those with reduced genomes and relatively small intergenic regions.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>We analyzed four different functional groups of genes for the presence of conserved, over-represented upstream sequence motifs within each group. The choice of seed genes was based on the hypothesis that genes that share a common function or operate in the same biochemical pathway should be co-regulated and possess common upstream regulatory elements. We used MEME (Multiple EM for Motif Elicitation) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, a <it>de novo </it>pattern-finding algorithm, to detect such motifs within each group of genes. We tested the functional significance of top candidate motifs by mutagenizing them in their native promoter context and measuring subsequent reporter gene expression (see Materials and methods). We find that different groups of genes share different over-represented motifs and no global motif emerges from our studies to be shared by all groups. The results of pattern finding and accompanying experimental evidence establish the biological role of the motifs considered in this study.</p>
         <sec>
            <st>
               <p>Genes involved in glycolysis</p>
            </st>
            <p><it>T. gondii</it>, like <it>Eimeria tenella </it>and <it>Cryptosporidium parvum</it>, uses glucose as its main source of energy in its rapidly dividing tachyzoite stage <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Phylogenetic analyses have shown that two of the glycolytic genes in <it>T. gondii</it>, enolase and glucose-6-phosphate isomerase, are closely related to their corresponding homologs in plants, suggesting that they were acquired and potentially suitable as drug targets due to their distinct evolutionary origin <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Glycolysis has also been actively studied with respect to stage differentiation in <it>T. gondii</it>. Three key glycolytic enzymes - glucose-6-phosphate isomerase [ToxoDB:76.m00001], lactate dehydrogenase (LDH) and enolase (ENO) [ToxoDB:59.m03410] - exhibit developmentally regulated expression <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Stage-specific cDNAs have been isolated that encode distinct isoforms of LDH: <it>LDH1 </it>(tachyzoite) and <it>LDH2 </it>(bradyzoite) <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Experimental evidence based on the detection of their respective mRNA and protein products indicates that <it>LDH1 </it>is post-translationally repressed while <it>LDH2 </it>is transcriptionally induced in bradyzoites <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Similarly, stage-specific cDNAs have also been isolated for distinct forms of ENO: <it>ENO1 </it>(bradyzoite) and <it>ENO2 </it>(tachyzoite) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Stage-specific expression of the two enolases is brought about by the presence of specific <it>cis</it>-regulatory elements in the promoter regions of these genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The regulation of the genes involved in glycolysis presents an intriguing case study from developmental, evolutionary and regulatory perspectives.</p>
            <p>We analyzed the upstream sequences of 10 genes involved in tachyzoite glycolysis to identify conserved, over-represented sequence motifs (Table <tblr tid="T1">1</tblr>). We report the analysis of two candidate motifs here: motif GLYCA, also found upstream of six orthologs in <it>E. tenella</it>, and motif GLYCB, found exclusively in <it>T. gondii</it>. These motifs were not reported in the aforementioned studies on stage-specific regulation of the enolase gene <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Motif GLYCA, represented by the consensus 5'GCTKCMTY (Figure <figr fid="F1">1a</figr>), is a well-conserved 8 bp sequence occurring at least once per sequence on the forward strand (Figure <figr fid="F1">1b</figr>). It does not show significant positional conservation, but the motifs found upstream of orthologs in <it>E. tenella </it>are 100% identical in sequence to their counterparts in <it>T. gondii</it>. Motif GLYCA is not found in the upstream regions of the bradyzoite isoforms of the stage-specific glycolytic genes (<it>ENO1 </it>and <it>LDH2</it>). Motif GLYCB is also an 8 bp motif, represented by the consensus sequence 5'TGCASTNT (Figure <figr fid="F1">1a</figr>), with 6 of 8 bases conserved in more than 90% of the occurrences. This motif is present once per sequence and can occur on either strand (Figure <figr fid="F1">1b</figr>). Motif GLYCB was also found in the upstream regions of the bradyzoite-specific copies of enolase and LDH (data not shown).</p>
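            <p>The consensus strings above use IUPAC ambiguity codes (for example, K = G or T, M = A or C, Y = C or T, S = C or G, N = any base). As an illustration only, and not the procedure used in this study (motifs were discovered with MEME), the following minimal Python sketch shows how such a consensus could be scanned against an upstream sequence on one or both strands. The function names and the toy sequences are hypothetical.</p>

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus string into a regex pattern string."""
    return "".join(IUPAC[base] for base in consensus.upper())


def find_motif(consensus, seq, both_strands=False):
    """List (position, strand, match) hits of a consensus in seq.

    Positions are 0-based start coordinates on the forward strand.
    A lookahead is used so that overlapping hits are also reported.
    """
    seq = seq.upper()
    pattern = re.compile("(?=(" + consensus_to_regex(consensus) + "))")
    hits = [(m.start(), "+", m.group(1)) for m in pattern.finditer(seq)]
    if both_strands:
        rc = reverse_complement(seq)
        n, k = len(seq), len(consensus)
        for m in pattern.finditer(rc):
            # Convert the reverse-strand offset to a forward coordinate.
            hits.append((n - m.start() - k, "-", m.group(1)))
    return sorted(hits)
```

            <p>On the toy sequence AAGCTGCATCAA, find_motif("GCTKCMTY", "AAGCTGCATCAA") reports a single forward-strand hit at position 2. Setting both_strands=True corresponds to a motif such as GLYCB, which can occur on either strand.</p>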
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>List of genes used in this study</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="left">
                        <p>Gene name</p>
                     </c>
                     <c ca="center">
                        <p>Ortholog</p>
                     </c>
                     <c ca="left">
                        <p>ToxoDB ID</p>
                     </c>
                     <c ca="center">
                        <p>Promoter length (bp)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Glycolysis</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>HK</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Hexokinase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>57.m00001</p>
                     </c>
                     <c ca="center">
                        <p>900</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>G6PI</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Glucose-6-phosphate-isomerase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>76.m00001</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>PFK</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Phosphofructokinase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>49.m03242</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>ALD</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Aldolase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>46.m00002</p>
                     </c>
                     <c ca="center">
                        <p>1,370</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>TPI</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Triose-phosphate-isomerase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>42.m00050</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>GAPDH</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Glyceraldehyde-3-phosphate dehydrogenase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>80.m00003</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>PGK</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Phosphoglycerate kinase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>641.m000193</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>PGM</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Phosphoglucomutase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>113.m00016</p>
                     </c>
                     <c ca="center">
                        <p>1,500</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>ENO</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p><it>Enolase</it>*</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>59.m03410</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>PyK</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pyruvate kinase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>55.m00007</p>
                     </c>
                     <c ca="center">
                        <p>1,500</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Nucleotide metabolism</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>AK</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Adenosine kinase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>50.m00018</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>CTPS</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Cytidine synthase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>129.m00261</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>DCDA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Deoxycytidine deaminase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>8.m00191</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>DHFR-TS</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Dihydrofolate reductase-thymidylate synthase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>50.m00016</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>GMPS</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Guanosine monophosphate synthase</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>44.m00023</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RDPR</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribonucleotide diphosphate reductase</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>83.m00003</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>UPRT</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p><it>Uracil phosphoribosyl transferase</it>*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>583.m00018</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>AT</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Adenosine transporter</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>49.m00004</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Micronemal proteins</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>80.m00012</p>
                     </c>
                     <c ca="center">
                        <p>1,500</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 2</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>20.m00002</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 3</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>641.m00002</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 4</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>25.m00006</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 5</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>65.m00002</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 6</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>38.m00003</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 7</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>55.m00014</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC8</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p><it>Microneme 8</it>*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>50.m00002</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 9</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>49.m03396</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC10</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 10</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>50.m00010</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>MIC11</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme 11</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>20.m05914</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>M2AP</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microneme-2-associated protein</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>33.m00006</p>
                     </c>
                     <c ca="center">
                        <p>2,000</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Ribosomal proteins</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS29</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S29</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>49.m03285</p>
                     </c>
                     <c ca="center">
                        <p>800</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS38</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S38</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>44.m04616</p>
                     </c>
                     <c ca="center">
                        <p>1,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S3</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>44.m04669</p>
                     </c>
                     <c ca="center">
                        <p>1,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS13</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S13</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>59.m03516</p>
                     </c>
                     <c ca="center">
                        <p>1,000</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPL9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p><it>Ribosomal protein L9</it>*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>76.m00009</p>
                     </c>
                     <c ca="center">
                        <p>1,200</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS25</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S25</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>44.m00003</p>
                     </c>
                     <c ca="center">
                        <p>1,300</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPS10</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein S10</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>64.m00338</p>
                     </c>
                     <c ca="center">
                        <p>700</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>RPL25</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ribosomal protein L25</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>55.m00189</p>
                     </c>
                     <c ca="center">
                        <p>1,000</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The list of genes and the lengths of their upstream regions that were used in the studies to identify regulatory motifs. A plus sign in the Ortholog column indicates that a corresponding ortholog in <it>E. tenella </it>was obtained and added to the search. Representative genes used in mutagenesis and expression analyses are denoted by an asterisk.</p>
               </tblfn>
            </tbl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Candidate motifs identified upstream of glycolytic genes, upstream location, site-directed mutagenesis and results of reporter assays</p>
               </caption>
               <text>
                  <p>Candidate motifs identified upstream of glycolytic genes, upstream location, site-directed mutagenesis and results of reporter assays. Motifs GLYCA and GLYCB act in concert to influence gene-expression from the <it>Eno2 </it>promoter. <b>(a) </b>Sequence logos represent the consensus sequence for each candidate motif. The y-axis represents information content at each position. <b>(b) </b>Occurrences and positions of the motifs in the promoter region relative to the translational start site of each gene. The gene names are abbreviated as shown in Table 1. The underlined gene name indicates the representative promoter used in reporter assays. Motif GLYCA, found in both <it>E. tenella </it>and <it>T. gondii</it>, is denoted by a circle and motif GLYCB, exclusive to <it>T. gondii</it>, is denoted by a square. Solid shapes denote motifs on the opposite strand. <b>(c) </b>The wild-type (WT) motifs and their mutagenized (MUT) versions in the representative promoter are represented. <b>(d) </b>The graphs depict luciferase activity as ratios of firefly:renilla activity in relative luciferase units (RLU) from the different constructs containing either WT or mutagenized versions of GLYCA, GLYCB, or both motifs. All luciferase readings are relative to an internal control (&#945;-tubulin-renilla). Error bars represent standard error calculated across the means of three independent electroporations. <it>p</it>-values describe the probability that the difference in expression between the WT and mutagenized promoters may be due to chance.</p>
               </text>
               <graphic file="gb-2009-10-4-r34-1"/>
            </fig>
            <p>Mutagenesis of GLYCA to the sequence 5'AACAAACA in the <it>ENO2 </it>promoter resulted in a small increase in promoter activity, whereas mutagenesis of GLYCB to the sequence 5'CAACACAC in the same promoter resulted in a small decrease (Figure <figr fid="F1">1c, d</figr>). When both motifs were mutagenized, however, a larger decrease in promoter activity was seen. These results are complex in comparison to the patterns seen with motifs for other groups of genes (see below). The changes in expression caused by mutagenizing each motif individually in the <it>ENO2 </it>promoter are small in magnitude but statistically significant. One interpretation is that the loss of either motif alone has only a mild effect, whereas the large decrease in reporter expression in the double mutant indicates that the two motifs act in concert to affect downstream gene expression. An alternative scenario is one in which mutagenesis of GLYCA gives rise to a chimeric motif that enhances downstream gene expression only in the presence of wild-type (WT) GLYCB. The strong evolutionary conservation of motif GLYCA in <it>E. tenella </it>and the significant decrease in reporter activity in the double mutant lend support to a role for these motifs in regulating gene expression. Further experiments are needed to fully resolve these intriguing results.</p>
         </sec>
         <sec>
            <st>
               <p>Genes involved in nucleotide biosynthesis and salvage</p>
            </st>
            <p>Purines and pyrimidines are the building blocks of nucleic acids in living cells. All protozoan parasites examined thus far are unable to synthesize purines <it>de novo </it>and depend upon salvage enzymes to obtain purines from the host <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Most protists, however, possess a full set of <it>de novo </it>pyrimidine biosynthesis enzymes, with one exception, <it>C. parvum</it>, which has lost the <it>de novo </it>pathway and evolved to also salvage pyrimidines from the host cell <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Enzymes involved in nucleotide metabolism in protozoan parasites can serve as promising drug targets because they are essential to the parasite's survival and are also evolutionarily distinct from host enzymes in some cases <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In <it>T. gondii</it>, it was found that <it>de novo </it>pyrimidine biosynthesis is essential for the virulence of the parasite <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We examined eight genes encoding enzymes involved in nucleotide biosynthesis and salvage in <it>T. gondii </it>and selected two conserved motifs found in their upstream regions as candidates for experimental validation. Motif NTBA is an A-rich 9 bp motif represented by the consensus 5'GCAAAMGRA (Figure <figr fid="F2">2a</figr>). It is very well conserved in four orthologs in <it>E. tenella</it>. Motif NTBA is present only once upstream of each gene and is always found on the positive strand. It is primarily located at 1,000-1,500 bp upstream of the translation start (Figure <figr fid="F2">2b</figr>). Motif NTBB is an 8 bp long T-rich motif and is exclusive to <it>T. gondii</it>. It is represented by the consensus sequence 5'TTTYTCGC (Figure <figr fid="F2">2a</figr>) and is also found only once upstream of each gene on the forward strand. The two motifs are typically present within 300-400 bp of each other (Figure <figr fid="F2">2b</figr>).</p>
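            <p>The consensus sequences above use IUPAC degenerate codes (for example, M stands for A or C, R for A or G, and Y for C or T). The following is a minimal sketch, not the procedure used in this study, of how such a consensus could be expanded into a pattern and located on the forward strand of an upstream sequence. The function names and the toy sequence are hypothetical, and plain pattern matching is a cruder stand-in for the matrix-based scoring performed by tools such as MEME and MAST.</p>

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Expand an IUPAC consensus string into a regular expression."""
    return "".join(IUPAC[base] for base in consensus.upper())


def find_sites(consensus, sequence):
    """Return 0-based start positions of all forward-strand matches.

    A zero-width lookahead is used so that overlapping matches are reported.
    """
    pattern = re.compile("(?=" + consensus_to_regex(consensus) + ")")
    return [match.start() for match in pattern.finditer(sequence.upper())]


# Hypothetical toy sequence, invented for illustration only.
toy_upstream = "TTGCAAACGGATTTTCTCGCAA"

ntba_sites = find_sites("GCAAAMGRA", toy_upstream)  # consensus of motif NTBA
ntbb_sites = find_sites("TTTYTCGC", toy_upstream)   # consensus of motif NTBB
print(ntba_sites, ntbb_sites)
```

            <p>In this toy sequence each consensus matches once. Converting the reported positions into distances from the translational start site would require the coordinates of the real upstream regions, which are not reproduced here.</p>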
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Candidate motifs identified upstream of the nucleotide biosynthetic genes, upstream location, site-directed mutagenesis and results of reporter assays</p>
               </caption>
               <text>
                  <p>Candidate motifs identified upstream of the nucleotide biosynthetic genes, upstream location, site-directed mutagenesis and results of reporter assays. Motifs NTBA and NTBB show redundancy in function by negatively affecting gene expression from the UPRT promoter among the nucleotide metabolism genes. <b>(a) </b>Sequence logos represent the consensus sequence for each candidate motif. The y-axis represents information content at each position. <b>(b) </b>Occurrences and positions of the motifs in the promoter region relative to the translational start site of each gene. The gene names are abbreviated as shown in Table 1. The underlined gene name indicates the representative promoter used in reporter assays. Motif NTBA, found in both <it>E. tenella </it>and <it>T. gondii</it>, is denoted by a circle and motif NTBB, exclusive to <it>T. gondii</it>, is denoted by a square. <b>(c) </b>The WT motifs and their mutagenized (MUT) versions in the representative promoter are represented. <b>(d) </b>The graphs depict luciferase activity as ratios of firefly:renilla activity in relative luciferase units (RLU) from the different constructs containing either WT or mutagenized versions of NTBA, NTBB, or both motifs. All luciferase readings are relative to an internal control (&#945;-tubulin-renilla). Error bars represent standard error calculated across the means of three independent electroporations. <it>p</it>-values describe the probability that the difference in expression between the WT and mutagenized promoters may be due to chance.</p>
               </text>
               <graphic file="gb-2009-10-4-r34-2"/>
            </fig>
            <p>To establish the biological significance of these motifs, we mutagenized NTBA to the sequence 5'AAGCGCAAG and NTBB to the sequence 5'GTGTGTG (Figure <figr fid="F2">2c</figr>). Mutagenesis of either of these motifs individually in the promoter of the gene encoding uracil phosphoribosyl transferase (UPRT) [ToxoDB:583.m00018] showed no significant change in promoter activity. Mutagenesis of both motifs within the UPRT promoter resulted in a seven-fold increase in reporter gene-expression, indicating that the two motifs function in repressing gene-expression and possibly possess redundancy in function (Figure <figr fid="F2">2d</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Genes encoding micronemal proteins</p>
            </st>
            <p>Micronemes are secretory organelles found in apicomplexan parasites and serve as compartments for the storage and trafficking of micronemal proteins, a family of proteins that function as ligands for host-cell receptors <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. These proteins play an important role in the active process of host-cell adhesion and invasion during the parasite life cycle. We analyzed the upstream sequences of 12 microneme protein-encoding genes in <it>T. gondii </it>and corresponding upstream sequences of four orthologs in <it>E. tenella</it>. We identified two well-conserved sequence motifs in this data set that we subsequently selected for further experimental characterization. Motif MICA is an 8 bp motif represented by the consensus sequence 5'GCGTCDCW (Figure <figr fid="F3">3a</figr>). It is found at least twice in the majority of the upstream regions, occurs on either strand, and does not show conservation of position relative to the translational start site (Figure <figr fid="F3">3b</figr>). This motif was also found upstream of <it>E. tenella </it>micronemal protein genes. In the reverse orientation, this motif closely resembles the 5'WGAGACG motif that has been identified in previous studies to function as a regulatory element in several promoters of <it>T. gondii </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Motif MICB is an 8 bp motif with the very well conserved sequence 5'SMTGCAGY (Figure <figr fid="F3">3a</figr>); the core 'TGCA' nucleotides are conserved in 100% of occurrences. This motif occurs once upstream in all 11 micronemal protein genes in <it>T. gondii</it>, but was not found in the corresponding orthologs in <it>E. tenella</it>. It does not show conservation of position relative to the translational start site, and is always found on the forward strand (Figure <figr fid="F3">3b</figr>).</p>
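            <p>The resemblance between motif MICA in the reverse orientation and the previously described 5'WGAGACG element can be checked by taking the reverse complement of the degenerate consensus. The sketch below is illustrative only; the function name is hypothetical and the mapping is the standard complement table for IUPAC nucleotide codes.</p>

```python
# Complement of each IUPAC nucleotide code
# (for example, D = A/G/T pairs with H = T/C/A).
COMPLEMENT = {
    "A": "T", "T": "A", "C": "G", "G": "C",
    "R": "Y", "Y": "R", "S": "S", "W": "W",
    "K": "M", "M": "K", "B": "V", "V": "B",
    "D": "H", "H": "D", "N": "N",
}


def reverse_complement(consensus):
    """Return the reverse complement of an IUPAC consensus string."""
    return "".join(COMPLEMENT[base] for base in reversed(consensus.upper()))


mica_rc = reverse_complement("GCGTCDCW")  # consensus of motif MICA
print(mica_rc)
```

            <p>The result is 5'WGHGACGC. Because H stands for A, C or T, its first seven positions are compatible with 5'WGAGACG.</p>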
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Candidate motifs identified upstream of the micronemal protein-encoding genes, upstream location, site-directed mutagenesis and results of reporter assays</p>
               </caption>
               <text>
                  <p>Candidate motifs identified upstream of the micronemal protein-encoding genes, upstream location, site-directed mutagenesis and results of reporter assays. Motifs MICA and MICB display an additive effect in the regulation of the gene encoding microneme 8. <b>(a) </b>Sequence logos represent the consensus sequence for each candidate motif. The y-axis represents information content at each position. <b>(b) </b>Occurrences and positions of the motifs in the promoter region relative to the translational start site of each gene. The gene names are abbreviated as shown in Table 1. The underlined gene name indicates the representative promoter used in reporter assays. Motif MICA, found in both <it>E. tenella </it>and <it>T. gondii</it>, is denoted by a circle and motif MICB, exclusive to <it>T. gondii</it>, is denoted by a square. <b>(c) </b>The WT motifs and their mutagenized (MUT) versions in the representative promoter are represented. <b>(d) </b>The graphs depict luciferase activity as ratios of firefly:renilla activity in relative luciferase units (RLU) from the different constructs containing either WT or mutagenized versions of MICA, MICB, or both motifs. All luciferase readings are relative to an internal control (&#945;-tubulin-renilla). Error bars represent standard error calculated across the means of three independent electroporations. <it>p</it>-values describe the probability that the difference in expression between the WT and mutagenized promoters may be due to chance.</p>
               </text>
               <graphic file="gb-2009-10-4-r34-3"/>
            </fig>
            <p>To characterize the functional significance of these conserved motifs, each was mutagenized to an 8 bp polyA sequence (5'AAAAAAAA; Figure <figr fid="F3">3c</figr>). Mutagenesis of motif MICA in the <it>Mic8 </it>(Micronemal protein 8) [ToxoDB: 50.m00002] promoter led to a tenfold reduction in reporter activity, and mutagenesis of motif MICB led to a threefold reduction in reporter expression. When both MICA and MICB were mutagenized in the same promoter, reporter activity fell to near-background levels: the raw value of firefly expression (440 units) was comparable to that of non-transfected cells (386 units) (Figure <figr fid="F3">3d</figr>). From these data, we infer that both MICA and MICB act positively to enhance gene expression from the <it>Mic8 </it>promoter, and that together they exert an additive effect on downstream gene expression, as indicated by the loss of expression when both motifs are mutagenized.</p>
         </sec>
         <sec>
            <st>
               <p>Ribosomal protein-encoding genes</p>
            </st>
            <p>Examination of stage-specific expressed sequence tag libraries in <it>E. tenella </it>and <it>T. gondii </it>indicates that the coccidia regulate <it>de novo </it>ribosome biosynthesis at the transcriptional level <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. In a recent study <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, the authors examined a large set of cytoplasmic ribosomal protein genes in <it>T. gondii </it>(79 genes in all) and described two well-conserved motifs, TRP-1 (motif RPA; 5'CGGCTTATATTCG) and TRP-2 (motif RPB; 5'YGCATGCR) (Figure <figr fid="F4">4a</figr>), identified by MEME in all promoters. The sequence of TRP-2 (RPB) is similar to the 8 bp element 5'TGCATGCA reported to be overrepresented in the non-coding regions of the apicomplexans <it>C. parvum</it>, <it>T. gondii </it>and <it>E. tenella </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. This sequence is also similar to one of the binding sites of the AP2 domain-containing transcription factors, as inferred from protein-based microarray studies conducted in <it>P. falciparum </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. In a study of the promoter strengths of eight of the ribosomal protein genes, no correlation could be found between multiple occurrences of one or both motifs and promoter strength <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, and the biological function of these motifs was not reported. We conducted analyses on a subset of these genes (eight promoters) and also recovered the motifs TRP-1 (RPA) and TRP-2 (RPB) as described by van Poppel <it>et al</it>. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> (Figure <figr fid="F4">4b</figr>). We mutagenized these motifs to ascertain whether they function in a sequence-specific manner to affect promoter activity.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Candidate motifs identified upstream of the ribosomal protein genes, upstream location, site-directed mutagenesis and results of reporter assays</p>
               </caption>
               <text>
                  <p>Candidate motifs identified upstream of the ribosomal protein genes, upstream location, site-directed mutagenesis and results of reporter assays. Motif RPA (TRP-1) does not influence reporter activity, and motif RPB (TRP-2) acts as an enhancer of gene-expression from the RPL9 promoter. <b>(a) </b>Sequence logos represent the consensus sequence for each candidate motif. The y-axis represents information content at each position. <b>(b) </b>Occurrences and positions of the motifs in the promoter region relative to the translational start site of each gene. The gene names are abbreviated as shown in Table 1. The underlined gene name indicates the representative promoter used in the reporter assays. <b>(c) </b>The WT motifs and their mutagenized (MUT) versions in the representative promoter are represented. <b>(d) </b>The graphs depict luciferase activity as ratios of firefly:renilla activity in relative luciferase units (RLU) from the different constructs containing either WT or mutagenized versions of RPA, RPB, both motifs or both copies of motif RPB. All luciferase readings are relative to an internal control (&#945;-tubulin-renilla). Error bars represent standard error calculated across the means of three independent electroporations. <it>p</it>-values describe the probability that the difference in expression between the WT and mutagenized promoters may be due to chance.</p>
               </text>
               <graphic file="gb-2009-10-4-r34-4"/>
            </fig>
            <p>Motif TRP-1 (RPA) in the <it>RPL9 </it>(Ribosomal protein L9) promoter [ToxoDB:76.m00009] was mutagenized to the sequence 5'CGAAGTATGCGAG (retaining the WT sequence at 3 of the 13 nucleotide positions due to mutagenesis challenges presented by the length of this motif). Motif TRP-2 (RPB), which occurs twice in the <it>RPL9 </it>promoter, was mutagenized at both sites (singly and jointly) to the sequence 5'TAAATAAA (Figure <figr fid="F4">4c</figr>). TRP-1 (RPA) did not affect reporter expression when mutagenized individually or in combination with TRP-2 (RPB). Several explanations are possible. Because not all of the bases in this motif were mutagenized, the three retained WT positions might be crucial and sufficient for its function. Alternatively, the motif may function during a different stage of development, or it may not serve a function related to gene expression. These results warrant further examination. Mutagenesis of one copy of motif RPB resulted in a 50% reduction in promoter activity, while mutagenesis of both copies of RPB caused a 75% reduction in gene expression relative to the WT promoter (Figure <figr fid="F4">4d</figr>). These data indicate that TRP-2 (RPB) enhances gene expression from the <it>RPL9 </it>promoter; the presence of additional copies of this motif likely confers additional strength to the promoter.</p>
         </sec>
         <sec>
            <st>
               <p>Genome-wide occurrences of candidate motifs</p>
            </st>
            <p>We examined the occurrences of each motif to determine whether it was over-represented within upstream regions relative to coding regions. Table <tblr tid="T2">2</tblr> lists the genome-wide occurrences of each of the candidate motifs within the upstream and the coding regions of the genome, respectively, as computed by MAST (Motif Analysis and Search Tool) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. To normalize for the different sizes of the two data sets, the motif count is expressed as the number of motifs per 10 kb (motif density). Of the eight candidate motifs selected in this study, the RPB (TRP-2) motif (5'YGCATGCR) has the highest occurrence within upstream regions: 4,030 occurrences upstream of 1,311 genes. When normalized to the total size of each database (upstream or coding), the candidate motifs (except GLYCA and MICB) were found to be significantly (two- to four-fold) over-represented (<it>p </it>&lt; 0.001) in the upstream regions relative to the coding regions (Table <tblr tid="T2">2</tblr>, Figure <figr fid="F5">5</figr>).</p>
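            <p>The normalization to motif density can be illustrated with a short Python sketch. It is a minimal illustration and not the pipeline used in this study: the function name 'motif_density' is hypothetical, whereas the database sizes and motif counts are those reported in Table <tblr tid="T2">2</tblr>.</p>

```python
# Database sizes in bp, as given in the Table 2 footnote.
UPSTREAM_BP = 11_685_162
CODING_BP = 16_862_741


def motif_density(count, database_bp, per_bp=10_000):
    """Return motif occurrences per `per_bp` bases of the searched database."""
    return count * per_bp / database_bp


# Motif counts (upstream, coding) copied from Table 2.
counts = {"GLYCA": (2608, 3618), "RPB": (4030, 2648)}

for name, (upstream, coding) in counts.items():
    up_density = motif_density(upstream, UPSTREAM_BP)
    cd_density = motif_density(coding, CODING_BP)
    # The ratio of the two densities is the fold over-representation upstream.
    fold = up_density / cd_density
    print(f"{name}: upstream {up_density:.2f}, coding {cd_density:.2f}, fold {fold:.1f}")
```

            <p>Dividing the upstream density by the coding density gives the fold over-representation of a motif in upstream regions.</p>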
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Genome-wide occurrences of candidate motifs</p>
               </caption>
               <text>
                  <p>Genome-wide occurrences of candidate motifs. Most of the candidate motifs with verified biological function are over-represented within upstream regions. Motif density is plotted as number of motifs per 10 kb for each data set - upstream sequences (red) and coding sequences (blue) (Table 2) - on the y-axis for each candidate motif on the x-axis.</p>
               </text>
               <graphic file="gb-2009-10-4-r34-5"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Genome-wide occurrences of each candidate motif within coding and upstream regions</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Upstream</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Coding</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Motif</p>
                     </c>
                     <c ca="center">
                        <p>Number of genes</p>
                     </c>
                     <c ca="center">
                        <p>Number of motifs</p>
                     </c>
                     <c ca="center">
                        <p>Number of motifs/10 kb</p>
                     </c>
                     <c ca="center">
                        <p>Number of genes</p>
                     </c>
                     <c ca="center">
                        <p>Number of motifs</p>
                     </c>
                     <c ca="center">
                        <p>Number of motifs/10 kb</p>
                     </c>
                     <c ca="center">
                        <p><it>p</it>-value</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GLYCA</p>
                     </c>
                     <c ca="center">
                        <p>885</p>
                     </c>
                     <c ca="center">
                        <p>2608</p>
                     </c>
                     <c ca="center">
                        <p>2.23</p>
                     </c>
                     <c ca="center">
                        <p>956</p>
                     </c>
                     <c ca="center">
                        <p>3618</p>
                     </c>
                     <c ca="center">
                        <p>2.14</p>
                     </c>
                     <c ca="center">
                        <p>0.0538</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GLYCB</p>
                     </c>
                     <c ca="center">
                        <p>418</p>
                     </c>
                     <c ca="center">
                        <p>982</p>
                     </c>
                     <c ca="center">
                        <p>0.84</p>
                     </c>
                     <c ca="center">
                        <p>201</p>
                     </c>
                     <c ca="center">
                        <p>531</p>
                     </c>
                     <c ca="center">
                        <p>0.31</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MICA</p>
                     </c>
                     <c ca="center">
                        <p>734</p>
                     </c>
                     <c ca="center">
                        <p>2010</p>
                     </c>
                     <c ca="center">
                        <p>1.72</p>
                     </c>
                     <c ca="center">
                        <p>435</p>
                     </c>
                     <c ca="center">
                        <p>1019</p>
                     </c>
                     <c ca="center">
                        <p>0.6</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MICB</p>
                     </c>
                     <c ca="center">
                        <p>223</p>
                     </c>
                     <c ca="center">
                        <p>637</p>
                     </c>
                     <c ca="center">
                        <p>0.54</p>
                     </c>
                     <c ca="center">
                        <p>290</p>
                     </c>
                     <c ca="center">
                        <p>769</p>
                     </c>
                     <c ca="center">
                        <p>0.46</p>
                     </c>
                     <c ca="center">
                        <p>0.0026</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBA</p>
                     </c>
                     <c ca="center">
                        <p>658</p>
                     </c>
                     <c ca="center">
                        <p>1959</p>
                     </c>
                     <c ca="center">
                        <p>1.67</p>
                     </c>
                     <c ca="center">
                        <p>418</p>
                     </c>
                     <c ca="center">
                        <p>1495</p>
                     </c>
                     <c ca="center">
                        <p>0.89</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="center">
                        <p>1100</p>
                     </c>
                     <c ca="center">
                        <p>2548</p>
                     </c>
                     <c ca="center">
                        <p>2.18</p>
                     </c>
                     <c ca="center">
                        <p>359</p>
                     </c>
                     <c ca="center">
                        <p>852</p>
                     </c>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="center">
                        <p>368</p>
                     </c>
                     <c ca="center">
                        <p>581</p>
                     </c>
                     <c ca="center">
                        <p>0.49</p>
                     </c>
                     <c ca="center">
                        <p>145</p>
                     </c>
                     <c ca="center">
                        <p>262</p>
                     </c>
                     <c ca="center">
                        <p>0.15</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="center">
                        <p>1311</p>
                     </c>
                     <c ca="center">
                        <p>4030</p>
                     </c>
                     <c ca="center">
                        <p>3.45</p>
                     </c>
                     <c ca="center">
                        <p>810</p>
                     </c>
                     <c ca="center">
                        <p>2648</p>
                     </c>
                     <c ca="center">
                        <p>1.57</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 0.001</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The number of occurrences of each motif, and of the genes containing it, in the whole genome. Motif density (number of motifs per 10 kb) was computed using MAST to search the position weight matrix profile of each motif against custom-built databases of upstream regions (11,685,162 bp) and coding regions (16,862,741 bp).</p>
               </tblfn>
            </tbl>
            <p>We calculated the expected frequency of motifs within the upstream and coding regions based on the motif length, degeneracy and the composition and size of the database (Materials and methods). The expected occurrences of most of the motifs are almost equal in both databases (upstream and coding) because of the similarity in size and nucleotide composition of the two databases. Most motifs do not occur at a significantly greater frequency than expected. The exceptions are NTBA, which is found at a higher frequency than expected (<it>p </it>&lt; 0.05) within both the upstream and coding regions, and motifs NTBB and RPA, which are found at frequencies higher than expected in the coding regions only (Table <tblr tid="T3">3</tblr> in Additional data file 1).</p>
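            <p>One simple way to obtain such an expectation is a zero-order background model in which bases are drawn independently. The Python sketch below is an assumption made for illustration and may differ from the procedure described in Materials and methods: the function names are hypothetical, the uniform base composition is a placeholder for the measured composition of each database, and only one strand of a single contiguous sequence is considered.</p>

```python
# IUPAC nucleotide codes mapped to the bases they permit.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def site_probability(motif, base_freq):
    """Probability that one random window matches the motif (independent bases)."""
    probability = 1.0
    for code in motif:
        # A degenerate position matches any of its permitted bases.
        probability *= sum(base_freq[base] for base in IUPAC[code])
    return probability


def expected_occurrences(motif, database_bp, base_freq):
    """Expected number of matches on one strand of a sequence of database_bp bases."""
    windows = max(database_bp - len(motif) + 1, 0)
    return windows * site_probability(motif, base_freq)


# Placeholder composition (uniform); substitute the measured base frequencies.
uniform = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
print(expected_occurrences("YGCATGCR", 11_685_162, uniform))
```

            <p>Under this model, longer and less degenerate motifs have a lower expected count, and two databases of similar size and composition yield similar expectations.</p>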
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Gene Ontology categories significantly enriched among motif-containing genes</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Motif</p>
                     </c>
                     <c ca="left">
                        <p>GO category</p>
                     </c>
                     <c ca="left">
                        <p>Description</p>
                     </c>
                     <c ca="center">
                        <p><it>p</it>-value</p>
                     </c>
                     <c ca="center">
                        <p>Adjusted <it>p</it>-value</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GLYCB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0016530</p>
                     </c>
                     <c ca="left">
                        <p>Metallochaperone activity</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-05</p>
                     </c>
                     <c ca="center">
                        <p>0.0004</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MICB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0016530</p>
                     </c>
                     <c ca="left">
                        <p>Metallochaperone activity</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-05</p>
                     </c>
                     <c ca="center">
                        <p>0.0004</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0022414</p>
                     </c>
                     <c ca="left">
                        <p>Reproductive process</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0016530</p>
                     </c>
                     <c ca="left">
                        <p>Metallochaperone activity</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-05</p>
                     </c>
                     <c ca="center">
                        <p>0.0004</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0002376</p>
                     </c>
                     <c ca="left">
                        <p>Immune system process</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0009987</p>
                     </c>
                     <c ca="left">
                        <p>Cellular process</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                     <c ca="center">
                        <p>0.0110</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0009055</p>
                     </c>
                     <c ca="left">
                        <p>Electron carrier activity</p>
                     </c>
                     <c ca="center">
                        <p>0.0025</p>
                     </c>
                     <c ca="center">
                        <p>0.0371</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0008152</p>
                     </c>
                     <c ca="left">
                        <p>Metabolic process</p>
                     </c>
                     <c ca="center">
                        <p>0.0026</p>
                     </c>
                     <c ca="center">
                        <p>0.0367</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NTBB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0030234</p>
                     </c>
                     <c ca="left">
                        <p>Enzyme regulator activity</p>
                     </c>
                     <c ca="center">
                        <p>0.0030</p>
                     </c>
                     <c ca="center">
                        <p>0.0400</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0045735</p>
                     </c>
                     <c ca="left">
                        <p>Nutrient reservoir activity</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0005840</p>
                     </c>
                     <c ca="left">
                        <p>Ribosome</p>
                     </c>
                     <c ca="center">
                        <p>1.45E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0005198</p>
                     </c>
                     <c ca="left">
                        <p>Structural molecule activity</p>
                     </c>
                     <c ca="center">
                        <p>3.65E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0006412</p>
                     </c>
                     <c ca="left">
                        <p>Translation</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                     <c ca="center">
                        <p>0.0039</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0009987</p>
                     </c>
                     <c ca="left">
                        <p>Cellular process</p>
                     </c>
                     <c ca="center">
                        <p>0.0004</p>
                     </c>
                     <c ca="center">
                        <p>0.0096</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0043226</p>
                     </c>
                     <c ca="left">
                        <p>Organelle</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                     <c ca="center">
                        <p>0.0110</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0032991</p>
                     </c>
                     <c ca="left">
                        <p>Macromolecular complex</p>
                     </c>
                     <c ca="center">
                        <p>0.0005</p>
                     </c>
                     <c ca="center">
                        <p>0.0107</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0051234</p>
                     </c>
                     <c ca="left">
                        <p>Establishment of localization</p>
                     </c>
                     <c ca="center">
                        <p>0.0008</p>
                     </c>
                     <c ca="center">
                        <p>0.0148</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPA</p>
                     </c>
                     <c ca="left">
                        <p>GO:0051179</p>
                     </c>
                     <c ca="left">
                        <p>Localization</p>
                     </c>
                     <c ca="center">
                        <p>0.0008</p>
                     </c>
                     <c ca="center">
                        <p>0.0156</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0044421</p>
                     </c>
                     <c ca="left">
                        <p>Extracellular region part</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-07</p>
                     </c>
                     <c ca="center">
                        <p>3.52E-05</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0022414</p>
                     </c>
                     <c ca="left">
                        <p>Reproductive process</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-06</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0016530</p>
                     </c>
                     <c ca="left">
                        <p>Metallochaperone activity</p>
                     </c>
                     <c ca="center">
                        <p>1.00E-05</p>
                     </c>
                     <c ca="center">
                        <p>0.0003</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0006412</p>
                     </c>
                     <c ca="left">
                        <p>Translation</p>
                     </c>
                     <c ca="center">
                        <p>0.0001</p>
                     </c>
                     <c ca="center">
                        <p>0.0020</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0005840</p>
                     </c>
                     <c ca="left">
                        <p>Ribosome</p>
                     </c>
                     <c ca="center">
                        <p>0.0003</p>
                     </c>
                     <c ca="center">
                        <p>0.0074</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0043226</p>
                     </c>
                     <c ca="left">
                        <p>Organelle</p>
                     </c>
                     <c ca="center">
                        <p>0.0010</p>
                     </c>
                     <c ca="center">
                        <p>0.0172</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0009987</p>
                     </c>
                     <c ca="left">
                        <p>Cellular process</p>
                     </c>
                     <c ca="center">
                        <p>0.0018</p>
                     </c>
                     <c ca="center">
                        <p>0.0285</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPB</p>
                     </c>
                     <c ca="left">
                        <p>GO:0005198</p>
                     </c>
                     <c ca="left">
                        <p>Structural molecule activity</p>
                     </c>
                     <c ca="center">
                        <p>0.0018</p>
                     </c>
                     <c ca="center">
                        <p>0.0282</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Significantly enriched GO categories (adjusted <it>p</it>-value &lt; 0.05) for motif-containing genes. The results are listed by each motif and ordered based on their Benjamini Hochberg adjusted <it>p</it>-value. A good correlation is observed between the functional pathways associated with the seed genes and the GO category that is over-represented in the corresponding motif-containing genome-wide gene sets in the case of motifs NTBB, RPA and RPB.</p>
               </tblfn>
            </tbl>
            <p>Thus, while most of the regulatory motifs are present at a slightly higher frequency in the upstream regions when compared to the coding regions, they do not occur at a higher frequency than expected in either upstream or coding regions. These analyses highlight the limitations of approaches that use statistical overrepresentation of motifs as a reliable and sufficient property to identify biologically relevant motifs. It is possible that a functional regulatory motif may not be detectable by sequence alone. The surrounding sequence context and other still elusive signals may be involved in enabling it to function as a regulatory motif.</p>
            <p>To examine enrichment of specific Gene Ontology (GO) categories among all genes containing any of the eight candidate upstream motifs, we retrieved first-level GO annotations for all of the motif-containing genes (Table <tblr tid="T2">2</tblr> in Additional data file 1) for each of the three main GO categories: 'cellular component', 'molecular function' and 'biological process'. We also included lower-level GO annotation IDs for the specific pathways/functional groups included in this study (Materials and methods). Table 4 in Additional data file 1 lists the GO categories that were significantly enriched within the motif-containing gene sets. Some of the motif-containing gene sets are also enriched in GO terms related to the corresponding function/pathway used to initially identify the motif, indicating that the regulatory motif may indeed be a subset-specific or pathway-specific motif. On the other hand, some motif-containing gene sets show enrichment not for a specific GO category but for a more general functional classification. For example, genes containing the motifs discovered in the analysis of ribosomal protein-coding genes (RPA and RPB) are enriched in annotated higher-level GO categories such as organelle and regulation of biological process. This suggests that a large number of genes that contain the RPA (TRP-1) and RPB (TRP-2) motifs can be assigned to ribosome-specific or translation-specific functions, pointing to a broad subset specificity for these motifs. Genes that contain the MICA or MICB motifs do not show any GO category enrichment, indicating a more general role for these upstream motifs. 
When deeper-level GO annotations for particular processes (such as 'ribosome' [GO:0005840]) are enumerated among the motif-containing genes, we find that the genome-wide lists of genes that contain RPA and RPB motifs are also enriched in corresponding GO categories ('ribosome' and 'translation'), indicating an even stronger specific association of these motifs with the corresponding processes (Table <tblr tid="T3">3</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>General discussion</p>
            </st>
            <p>Promoter organization in <it>T. gondii </it>has been studied in a few genes thus far <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. In these studies, it has been observed that a gene-proximal region is necessary for minimal gene expression and additional upstream sequence helps to enhance expression from the same promoter. However, very little is known about the mechanism of gene regulation and the prevalence and type of transcriptional signals and regulatory apparatus in this organism. Analyses of genome sequences and individual gene-specific experiments point out two deviations from what has been observed in other model eukaryotes. First, canonical eukaryotic promoter elements such as the TATA box have not been found in <it>T. gondii </it>promoter regions <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, although a highly divergent TATA binding protein has been reported <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Furthermore, there is a stark paucity of known specialized transcription factors encoded in the genome <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. A similar scenario is seen in two other apicomplexan parasites, <it>P. falciparum </it>and <it>C. parvum </it><abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. This paradox can be explained in two ways: these organisms do not employ a specialized transcriptional apparatus to regulate their genes; or a specialized transcriptional machinery exists but is so divergent from known eukaryotic counterparts that its components cannot be detected by simple similarity-based searches. Recent studies have shown that the <it>T. gondii </it>genome encodes a rich repertoire of histone-modifying enzymes, and epigenetic regulation has been purported to be responsible for stage-switching in the parasite <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. 
More recently, chromatin immunoprecipitation (ChIP)-on-chip experiments conducted on 1% of the <it>T. gondii </it>genome revealed a strong association between specific histone modification marks and active promoter regions <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. It is likely that histone-mediated mechanisms account for a sizeable share of gene regulation in <it>T. gondii</it>. Serial analysis of gene expression (SAGE) studies of genes expressed during key life-cycle stages <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> have shown that the mRNA pool of <it>T. gondii </it>is highly dynamic and gene expression is controlled in a time- and stage-dependent manner. These studies have also shown that co-expressed genes in <it>T. gondii </it>do not cluster in the genome with respect to chromosomal location. Searches of the <it>Plasmodium </it>genome sequence for transcription factors using secondary structure similarity have revealed the presence of putative transcription factors that were missed in simple sequence-based searches <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. A divergent, putative, specialized transcription factor family, ApiAP2, has also been reported in the Apicomplexa <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. A large percentage of predicted proteins in <it>T. gondii </it>are 'hypothetical proteins' with no known function, and some of them may carry out parasite-specific functions, including transcriptional regulation. It is plausible that such highly divergent regulatory proteins utilize very different <it>cis</it>-elements for their recruitment, which would explain the absence of canonical <it>cis</it>-elements in the promoters studied thus far.</p>
            <p>We have exploited the availability of genome sequence for <it>T. gondii </it>to identify conserved upstream motifs in diverse groups of functionally related genes. We identified over-represented motifs by <it>de novo </it>pattern finding and tested their function <it>in vitro</it>, in the parasite, by specifically mutagenizing them in their native promoter context and measuring reporter activity. For each group, two candidate motifs were selected and characterized for their function in their endogenous promoter. We find that seven out of eight motifs identified by <it>de novo </it>pattern finding have a statistically significant effect on promoter activity. We have shown that conserved over-represented motifs play a definite role in gene expression, and can affect promoter activity either positively or negatively (Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>). Notably, some of the motifs that affect gene expression also exhibit cross-species conservation, as shown by their presence upstream of orthologs in <it>E. tenella </it>(Table <tblr tid="T1">1</tblr>).</p>
            <p>Our studies have shown that in spite of the lack of <it>a priori </it>knowledge concerning the nature of regulatory sequences and/or expression profiles, it is possible to identify putative <it>cis</it>-regulatory elements. We have shown that elements identified purely on the basis of computational techniques can be functionally relevant for gene expression. We previously characterized putative <it>cis</it>-regulatory elements in the genome of <it>C. parvum </it>in a similar fashion where we established a correlation between over-represented elements and co-ordinate gene expression <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In a comparison of the genes common to both studies (genes of the glycolytic and nucleotide metabolism pathways), we do not detect identical motifs in <it>T. gondii </it>and <it>C. parvum</it>. Given the evolutionary divergence and difference in genome organization and content between these two parasites, it is not unexpected that they do not share some specialized components of the regulatory machinery, or these components may have evolved so rapidly as to be unrecognizable as orthologs.</p>
            <p>The use of asynchronous populations of parasites, as is the case here, is expected to dilute out expression effects to some extent, and this is reflected in the occasional small, but significant, change in promoter activity upon mutagenesis of a motif (Figure <figr fid="F1">1d</figr>, motif GLYCA; <it>p </it>&lt; 0.05). In spite of these limitations, our study has successfully identified six novel <it>cis</it>-regulatory elements and established the functional significance of one previously reported conserved upstream element. The study of stage-specific gene regulation in <it>T. gondii </it>has been an active area of investigation, and small-scale microarray studies have reported genes that are preferentially expressed in either developmental stage <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Recently, three <it>T. gondii </it>studies have reported the presence of novel <it>cis</it>-regulatory elements in promoters of genes regulated in a stage-specific manner <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. However, gene regulation within the tachyzoite stage has not been well studied. The lack of a synchronized population of tachyzoites (with respect to their cell cycle) in <it>in vitro </it>culture makes it difficult to address gene regulatory questions in the actively multiplying tachyzoite, as any given population of cells in culture consists of parasites at different points of their cell cycle.</p>
            <p>Our study reports the presence of different <it>cis</it>-regulatory elements controlling gene expression within the tachyzoite stage of the parasite and is among the first to show evidence for the presence of modular organization of promoters in <it>T. gondii</it>. Using site-specific mutagenesis of conserved upstream motifs and subsequent reporter assays, we have shown that the identified <it>cis</it>-elements can exhibit co-operative, additive or redundant effects within a promoter and operate in a sequence-specific manner to control gene expression (Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>). The putative transcription factors or repressors recruited by these elements in order to facilitate gene regulation remain to be determined. One of the limitations of investigating gene expression within the tachyzoite stage is the lack of a parasite population enriched in the production of a specific transcription factor that could be recruited by these <it>cis</it>-regulatory elements. Consequently, the detection of such putative transcription factors or repressor proteins from a mixed population of parasites by experiments such as electrophoretic mobility shift assays has proven challenging and has yielded inconsistent results (data not shown).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p><it>Cis</it>-regulatory elements play a significant role in gene regulation in <it>T. gondii </it>and can operate individually and in concert to influence gene expression. This study provides a glimpse of the extent and mechanisms by which <it>cis</it>-regulatory elements are involved in controlling gene expression within the tachyzoite stage of the <it>T. gondii </it>life cycle. We have presented evidence that upstream elements can act as both positive and negative regulators of gene expression and can exhibit redundancy within the same promoter. One of the eight motifs examined in this study, the TRP-2 (RPB) motif, is similar in sequence to the binding site for ApiAP2, a family of apicomplexan-specific DNA-binding proteins. <it>De novo </it>computational approaches possess great predictive power in compact genomes when sequence is available. We have shown here the applicability of computational techniques to identify gene regulatory signals in a system where little is known about gene regulation.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Computational analyses</p>
            </st>
            <p>Whole genome sequence (v.3.3) and gene predictions for <it>T. gondii </it>were obtained from ToxoDB (release 4.0) <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Scripts were written in Perl to extract the upstream sequence (2 kb or up to the nearest upstream gene, whichever was shorter) for every predicted gene to create an 'upstream sequence database'. This database was screened for possible missed protein-coding regions by performing a BLASTX search against all protein sequences in the non-redundant database at NCBI. Sequences with greater than 80% similarity to coding sequences were removed. Groups of genes (hereafter referred to as seed genes) used to identify common over-represented upstream motifs were selected based on the hypothesis that genes belonging to the same biochemical or functional pathway should be co-regulated and, hence, possess common upstream regulatory elements. Members from each of the four groups considered in this study (glycolysis, nucleotide metabolism, micronemal proteins and ribosomal proteins) were identified based on existing annotation from ToxoDB. Additionally, BLAST analyses using sequence information of orthologs from <it>P. falciparum </it>and <it>C. parvum </it>were employed to identify the corresponding genes in <it>T. gondii </it>when necessary. Given the limited annotation at the time of this study, it is possible that the gene lists under each functional group are not exhaustive. The pattern-finding algorithm MEME <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> was used to identify over-represented conserved motifs in the upstream regions of genes within each functional group. MEME was run using the parameters minw = 8, maxw = 20, in all three modes (tcm, oops and zoops) and the results were manually examined to pick candidate motifs. Upstream sequences of corresponding orthologs from <it>E. tenella </it>(a related coccidian parasite) were also used whenever possible to identify evolutionarily conserved motifs. To narrow down the results of MEME, a rule-based approach was adopted to pick candidate motifs for subsequent experimental validation. Candidate motifs included those that were over-represented in the upstream regions in comparison to the background set (entire upstream regions database), showed considerable conservation in sequence and/or position, and were present in all the sequences within each group.</p>
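            <p>The upstream-region rule described in this section (at most 2 kb, truncated at the neighbouring upstream gene) can be sketched as follows. This is an illustrative Python sketch and not the scripts used in the study; the function name upstream_interval, its arguments and the 1-based inclusive coordinate convention are assumptions made for the example.</p>

```python
def upstream_interval(start, end, strand, neighbor_edge, max_len=2000):
    """Return the 1-based inclusive (lo, hi) upstream interval, or None if empty.

    start, end    : gene coordinates with start not greater than end (assumed).
    strand        : "+" or "-".
    neighbor_edge : for "+" genes, the last base of the nearest gene on the
                    lower-coordinate side (0 if there is none); for "-" genes,
                    the first base of the nearest gene on the higher-coordinate
                    side (chromosome length + 1 if there is none).
    max_len       : maximum upstream length (2 kb in the text).
    """
    if strand == "+":
        # Upstream lies at lower coordinates; stop at the neighbouring gene.
        lo = max(neighbor_edge + 1, start - max_len)
        hi = start - 1
    elif strand == "-":
        # Upstream lies at higher coordinates for reverse-strand genes.
        lo = end + 1
        hi = min(neighbor_edge - 1, end + max_len)
    else:
        raise ValueError("strand must be '+' or '-'")
    return (lo, hi) if hi >= lo else None


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    print(upstream_interval(5000, 6000, "+", 1000))  # full 2 kb available
    print(upstream_interval(5000, 6000, "+", 4500))  # truncated by a neighbour
    print(upstream_interval(5000, 6000, "-", 6500))  # reverse-strand gene
```

            <p>Returning None for abutting or overlapping genes keeps empty regions out of the upstream sequence database; how such cases were handled in the original analysis is not stated in the text.</p>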
            <p>The program MAST <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> was used to search the most recently annotated version of the <it>T. gondii </it>genome (release 4.3) for the presence of each candidate motif in all 7,817 genes in the genome divided into coding regions (16,862,741 bp) and upstream regions (11,685,162 bp). To normalize for the different sizes of the coding regions database and the upstream regions database, the motif density was computed as the number of motifs per 10,000 bp. Chi-square analysis was performed to examine whether there was a significant difference in occurrence of each motif between upstream and coding regions.</p>
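            <p>The density normalization and the upstream-versus-coding comparison can be illustrated with a short Python sketch. The region sizes are those quoted above; the motif counts are hypothetical, and the test shown (a one-degree-of-freedom chi-square goodness-of-fit against the relative region sizes) is one plausible reading of the analysis, since the exact layout of the test is not specified in the text.</p>

```python
import math

# Region sizes quoted in the text (bp).
UPSTREAM_BP = 11_685_162
CODING_BP = 16_862_741


def motif_density(count, region_bp, per_bp=10_000):
    """Motif occurrences per `per_bp` bases of a region."""
    return count * per_bp / region_bp


def chi_square_two_regions(up_count, cds_count,
                           up_bp=UPSTREAM_BP, cds_bp=CODING_BP):
    """Goodness-of-fit chi-square (1 degree of freedom).

    Null hypothesis: motif occurrences are split between the two regions
    in proportion to region size. Returns (chi2, p_value).
    """
    total = up_count + cds_count
    if total == 0:
        raise ValueError("no motif occurrences to test")
    exp_up = total * up_bp / (up_bp + cds_bp)
    exp_cds = total - exp_up
    chi2 = ((up_count - exp_up) ** 2 / exp_up
            + (cds_count - exp_cds) ** 2 / exp_cds)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    up_hits, cds_hits = 1200, 1500  # hypothetical counts, illustration only
    print("upstream density:", motif_density(up_hits, UPSTREAM_BP))
    print("coding density:", motif_density(cds_hits, CODING_BP))
    print("chi2, p:", chi_square_two_regions(up_hits, cds_hits))
```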
            <p>The expected frequency of motifs within each set (the upstream regions database and the coding regions database) was calculated as follows. The upstream regions database has a base composition of 48% AT (A = 23%, T = 25%, G = 25% and C = 26%) and the coding regions database has a base composition of 42% AT (A = 22%, T = 20%, G = 30%, C = 28%). Given these statistics, the 8 bp motif GLYCA (GCTKCMTY) is expected to occur once every 1,780 bases in the upstream regions database and once every 2,406 bases in the coding regions database (after accounting for positional degeneracy). Taking the database sizes into account, the expected motif density per 10,000 bases was calculated for each motif in each database. Chi-square analysis was used to examine the statistical significance of the differences in the observed and expected frequencies of motifs in the upstream and coding regions.</p>
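            <p>As a rough illustration of how an expected frequency can be derived from base composition, the Python sketch below multiplies per-position probabilities for a degenerate IUPAC motif, assuming independent bases and a single strand. These modelling choices and the function names are assumptions made for the example; the exact procedure used in the study (for instance strand handling or the treatment of degeneracy) is not detailed in the text, so the output need not reproduce the figures quoted above.</p>

```python
# IUPAC nucleotide codes mapped to the bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Base composition of the upstream regions database as quoted in the text.
UPSTREAM_FREQ = {"A": 0.23, "T": 0.25, "G": 0.25, "C": 0.26}


def motif_probability(motif, base_freq):
    """Probability that a random position starts a match to `motif`,
    assuming independent bases drawn from `base_freq` (one strand only)."""
    prob = 1.0
    for symbol in motif.upper():
        prob *= sum(base_freq[base] for base in IUPAC[symbol])
    return prob


def expected_density(motif, base_freq, per_bp=10_000):
    """Expected number of matches per `per_bp` bases under the same model."""
    return motif_probability(motif, base_freq) * per_bp


if __name__ == "__main__":
    p = motif_probability("GCTKCMTY", UPSTREAM_FREQ)
    print("per-position probability:", p)
    print("expected matches per 10,000 bp:", expected_density("GCTKCMTY", UPSTREAM_FREQ))
```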
            <sec>
               <st>
                  <p>Assessment of enrichment of GO categories within genes containing a candidate motif</p>
               </st>
               <p>Of the 7,817 predicted genes in <it>T. gondii</it>, 2,437 genes have been assigned the subcategory cellular component, 3,902 genes have been assigned the subcategory molecular function, and 4,738 genes have been assigned the subcategory biological process. First-level GO annotations from each of these subcategories (totaling 44 subcategories) for all motif-containing genes were obtained from ToxoDB along with the total number of genes in the genome corresponding to each annotation. The hypergeometric probability distribution was used to determine the chance probability of observing the number of genes with a given GO annotation within each of the eight sets of candidate upstream regulatory motif-containing genes compared to the number of genes with that GO annotation in the whole genome. More specifically, the probability of observing at least <it>x </it>genes that contain an upstream motif with a given annotation in a random subset of <it>n </it>genes is given by <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>:</p>
               <p>
                  <display-formula>
                     <graphic file="gb-2009-10-4-r34-i1.gif"/>
                  </display-formula>
               </p>
               <p>where <it>A </it>is the total number of genes with a particular GO annotation and <it>N </it>is the total number of genes within the genome (7,817). In order to control error rates for multiple hypothesis testing, we applied two distinct methods, the Benjamini-Hochberg method <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and the Bonferroni method <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> (Table 4 in Additional data file 1). In the case of the Benjamini-Hochberg method, a false discovery rate criterion based on the number of GO terms searched (number of tests = 44 GO terms searched &#215; 8 motifs = 352) was implemented and a false discovery rate adjusted <it>p</it>-value &lt; 0.05 was considered significant.</p>
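               <p>The enrichment test and the two multiple-testing corrections can be sketched in Python as follows. The upper-tail hypergeometric sum uses the symbols defined above (<it>x</it>, <it>n</it>, <it>A</it>, <it>N</it>); the function names and the example numbers are hypothetical, and the code is a minimal sketch of the standard procedures rather than the implementation used in the study.</p>

```python
from math import comb


def hypergeom_tail(x, n, A, N):
    """P(X >= x): chance of drawing at least x annotated genes when n genes
    are sampled without replacement from N genes, A of which are annotated."""
    upper = min(n, A)
    favourable = sum(comb(A, i) * comb(N - A, n - i) for i in range(x, upper + 1))
    return favourable / comb(N, n)


def bonferroni(pvalues):
    """Bonferroni-adjusted p-values (capped at 1)."""
    m = len(pvalues)
    return [min(1.0, p * m) for p in pvalues]


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical example: 12 of 150 motif-containing genes carry an
    # annotation held by 200 of the 7,817 genes in the genome.
    print(hypergeom_tail(12, 150, 200, 7817))
    raw = [0.0003, 0.0010, 0.0018, 0.2]  # hypothetical raw p-values
    print(bonferroni(raw))
    print(benjamini_hochberg(raw))
```

               <p>In the setting described above, the list passed to either correction would hold all 352 raw <it>p</it>-values (44 GO terms for each of 8 motifs), so that the number of tests matches the one stated in the text.</p>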
               <p>Additionally, specific, lower-level GO categories that describe the functional groups chosen in this study (for example, glycolysis [GO:0006096], ATP-binding [GO:0005524], nucleoside metabolism [GO:0009116], calcium binding [GO:0005509], translation [GO:0006412] and ribosome structure [GO:0003735]) were picked to similarly test for their enrichment within the motif-containing gene sets. Significant enrichment of any of these within the corresponding motif-containing gene sets was determined using hypergeometric probability.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Molecular techniques</p>
            </st>
            <p>For each group of functionally related genes considered in this study, a promoter that contained a single occurrence of the candidate motif was chosen to study the role of the motif in driving gene expression. Promoter sequences were PCR-amplified from parasite genomic DNA and a two-step overlap-extension PCR technique was employed to carry out site-directed mutagenesis <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> to alter the candidate motif sequence (see Table <tblr tid="T1">1</tblr> in Additional data file 1 for primer sequences). All or a majority of the bases in each motif were substituted by base-specific transversions, thus destroying the original sequence of the candidate motif but maintaining the spacing within the promoter (Figures <figr fid="F1">1c</figr>, <figr fid="F2">2c</figr>, <figr fid="F3">3c</figr> and <figr fid="F4">4c</figr>). Successful mutagenesis was confirmed by sequencing or by restriction digest analysis. The Gateway&#8482; cloning system was used to clone the WT and mutagenized promoters individually upstream of a firefly luciferase-expressing vector (test-firefly). As an internal control, a constitutive promoter (<it>T. gondii </it>&#945;-tubulin promoter)-driven renilla luciferase-expressing construct (&#945;-tub-renilla) was co-transfected along with the experimental construct (Additional data file 2).</p>
         </sec>
         <sec>
            <st>
               <p>Parasite culture and transient transfections</p>
            </st>
            <p><it>T. gondii </it>RH tachyzoites were cultured in human foreskin fibroblasts (hTERT cells, BJ Biomedicals) as previously described <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Transient transfection was performed via electroporation, using freshly lysed parasites that were needle-passaged, filtered through a 3 &#956;m filter and resuspended in cytomix <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B45">45</abbr></abbrgrp>. Immediately prior to use, freshly prepared 2 mM ATP and 5 mM glutathione were added to the cytomix and sterile-filtered. For each co-transfection, 2 &#215; 10<sup>7 </sup>parasites were transfected via electroporation with a mixture of sterile circular plasmid DNA of test-firefly and &#945;-tub-renilla (control) in a ratio of 2.5:1 (40 &#956;g test + 16 &#956;g control). Electroporation was performed in a 2 mm gap cuvette using a BTX electroporator: 1.8 kV, 100 &#937;, 25 &#956;F. Post-electroporation, the parasites were allowed to rest for 15 minutes in the cuvette and then transferred to T25 tissue culture flasks. Then, 18-24 hours post-electroporation, the cells were scraped and lysed using passive lysis buffer (Promega, Madison, WI, USA) and a dual luciferase assay was performed with the extract using the Promega Dual Luciferase kit. Briefly, the different substrate requirements of the two enzymes, firefly luciferase and renilla luciferase, allowed us to assay reporter expression for each construct sequentially within the same extract. Reporter activity from the WT or mutagenized promoter was measured relative to the internal control, eliminating errors due to variation in parasite populations and individual transfections. Enzyme activity was measured using a dual luciferase-ready luminometer. Each electroporation experiment was performed in triplicate and luciferase assays were performed in duplicate for expression measurements. 
The unpaired Student's <it>t</it>-test was used to assess the difference in expression levels between WT and mutagenized promoters; <it>p </it>&lt; 0.05 was considered statistically significant.</p>
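            <p>The normalization to the internal control and the comparison of WT and mutagenized promoters can be sketched in Python as follows. The luminometer readings are hypothetical and the function names are illustrative; the sketch computes the pooled-variance (Student's) <it>t </it>statistic and its degrees of freedom, and leaves the conversion to a <it>p</it>-value to a statistics package or a <it>t </it>table.</p>

```python
from math import sqrt
from statistics import mean, variance


def relative_activity(firefly, renilla):
    """Firefly signal divided by the co-transfected renilla signal,
    computed replicate by replicate."""
    return [f / r for f, r in zip(firefly, renilla)]


def student_t_unpaired(a, b):
    """Unpaired Student's t statistic with pooled variance.

    Returns (t, degrees_of_freedom)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


if __name__ == "__main__":
    # Hypothetical luminometer readings for three replicate transfections.
    wt = relative_activity([5200.0, 4800.0, 5100.0], [1000.0, 950.0, 1020.0])
    mut = relative_activity([2100.0, 2500.0, 2300.0], [980.0, 1010.0, 990.0])
    print(student_t_unpaired(wt, mut))
```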
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>ENO: enolase; GO: Gene Ontology; LDH: lactate dehydrogenase; MAST: Motif Analysis and Search Tool; MEME: Multiple EM for Motif Elicitation; RPL9: Ribosomal protein L9; UPRT: uracil phospho-ribosyl transferase; WT: wild type.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>NM and JCK conceptualized the study, NM conducted the analyses and wrote the manuscript and JCK provided advice and revisions to the manuscript. SJ conducted statistical analyses to address motif over-representation and GO enrichment.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: an Excel spreadsheet with supplementary Tables 1-4 (Additional data file <supplr sid="S1">1</supplr>); a PDF figure explaining the dual transfection and luciferase assay experimental set up (Additional data file <supplr sid="S2">2</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <caption>
               <p>Supplementary Tables 1-4</p>
            </caption>
            <text>
               <p>Table 1: oligonucleotide sequences used in Gateway&#8482; cloning and site-directed mutagenesis. Table 2: genome-wide list of genes that contain a candidate regulatory motif in their upstream region. Table 3: comparison of expected and observed number of motifs genome-wide in upstream and coding regions. Table 4: significant GO category enrichments (raw <it>p</it>-value &lt; 0.05) within each motif-containing gene set.</p>
            </text>
            <file name="gb-2009-10-4-r34-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional File 2</p>
            </title>
            <caption>
               <p>The dual transfection and luciferase assay experimental set up</p>
            </caption>
            <text>
               <p>The dual transfection and luciferase assay experimental set up.</p>
            </text>
            <file name="gb-2009-10-4-r34-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Drs Michael White and Michael Behnke for the <it>T. gondii </it>adapted luciferase vectors and related protocols, Elizabeth Mathis and Karen Hermetz for technical assistance and Haiming Wang for help with statistical analyses. We also wish to thank Drs Boris Striepen, Jeff Bennetzen, and Jeremy DeBarry and the anonymous reviewers for their valuable comments on the manuscript. This study was supported in part by NIH R01 AI068908 to JCK and in part by resources and technical expertise from the University of Georgia Research Computing Center, a partnership between the Office of the Vice President for Research and the Office of the Chief Information Officer.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Lytic cycle of <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Black</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Boothroyd</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2000</pubdate>
            <volume>64</volume>
            <fpage>607</fpage>
            <lpage>623</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99006</pubid>
                  <pubid idtype="pmpid" link="fulltext">10974128</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Mechanisms of innate resistance to <it>Toxoplasma gondii </it>infection.</p>
            </title>
            <aug>
               <au>
                  <snm>Alexander</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Scharton-Kersten</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Yap</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Liew</snm>
                  <fnm>FY</fnm>
               </au>
               <au>
                  <snm>Sher</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>1997</pubdate>
            <volume>352</volume>
            <fpage>1355</fpage>
            <lpage>1359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1692026</pubid>
                  <pubid idtype="pmpid" link="fulltext">9355127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Effect of prenatal treatment on mother to child transmission of <it>Toxoplasma gondii</it>: retrospective cohort study of 554 mother-child pairs in Lyon, France.</p>
            </title>
            <aug>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Gras</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wallon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Peyron</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ades</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>Int J Epidemiol</source>
            <pubdate>2001</pubdate>
            <volume>30</volume>
            <fpage>1303</fpage>
            <lpage>1308</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11821334</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Fundamentally different logic of gene regulation in eukaryotes and prokaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1999</pubdate>
            <volume>98</volume>
            <fpage>1</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10412974</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>In control of its fate: gene regulation in <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mattsson</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Erhardt</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Soldati</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Behring Inst Mitt</source>
            <pubdate>1997</pubdate>
            <fpage>25</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9303199</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Transcription regulation and animal diversity.</p>
            </title>
            <aug>
               <au>
                  <snm>Levine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tjian</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>424</volume>
            <fpage>147</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12853946</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Common <it>cis</it>-acting elements critical for the expression of several genes of <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mercier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lefebvre-Van Hende</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Lecordier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Capron</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cesbron-Delauw</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1996</pubdate>
            <volume>21</volume>
            <fpage>421</fpage>
            <lpage>428</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8858595</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A selector of transcription initiation in the protozoan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Soldati</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Boothroyd</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1995</pubdate>
            <volume>15</volume>
            <fpage>87</fpage>
            <lpage>93</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">231911</pubid>
                  <pubid idtype="pmpid" link="fulltext">7799972</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The transcription machinery and the molecular toolbox to control gene expression in <it>Toxoplasma gondii </it>and other protozoan parasites.</p>
            </title>
            <aug>
               <au>
                  <snm>Meissner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Soldati</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Microbes Infect</source>
            <pubdate>2005</pubdate>
            <volume>7</volume>
            <fpage>1376</fpage>
            <lpage>1384</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16087378</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Transcriptional regulation of two stage-specifically expressed genes in the protozoan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kibe</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Coppin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dendouga</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oria</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Meurice</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mortuaire</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Madec</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tomavo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>1722</fpage>
            <lpage>1736</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1903550</pubid>
                  <pubid idtype="pmpid" link="fulltext">15784612</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Identification and characterisation of a regulatory region in the <it>Toxoplasma gondii </it>hsp70 genomic locus.</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>YF</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Int J Parasitol</source>
            <pubdate>2004</pubdate>
            <volume>34</volume>
            <fpage>333</fpage>
            <lpage>346</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15003494</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The transcription of bradyzoite genes in <it>Toxoplasma gondii </it>is controlled by autonomous promoter elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Behnke</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Radke</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>WJ</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2008</pubdate>
            <volume>68</volume>
            <fpage>1502</fpage>
            <lpage>1518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2440561</pubid>
                  <pubid idtype="pmpid" link="fulltext">18433450</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The transcriptome of <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Radke</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Behnke</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Mackey</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Radke</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>BMC Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>26</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1325263</pubid>
                  <pubid idtype="pmpid" link="fulltext">16324218</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Computational identification of <it>cis</it>-regulatory elements associated with groups of functionally related genes in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Estep</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>296</volume>
            <fpage>1205</fpage>
            <lpage>1214</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10698627</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Fitting a mixture model by expectation maximization to discover motifs in biopolymers.</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Elkan</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>2</volume>
            <fpage>28</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7584402</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Enzymes of energy metabolism in the bradyzoites and tachyzoites of <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Denton</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Alexander</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thong</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Coombs</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1996</pubdate>
            <volume>137</volume>
            <fpage>103</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8935663</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The protozoan parasite <it>Toxoplasma gondii </it>expresses two functional plant-like glycolytic enzymes. Implications for evolutionary origin of apicomplexans.</p>
            </title>
            <aug>
               <au>
                  <snm>Dzierszinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Popescu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Toursel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Slomianny</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yahiaoui</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tomavo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1999</pubdate>
            <volume>274</volume>
            <fpage>24888</fpage>
            <lpage>24895</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10455162</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Differential expression of two plant-like enolases with distinct enzymatic and antigenic properties during stage conversion of the protozoan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Dzierszinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mortuaire</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dendouga</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Popescu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Tomavo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>309</volume>
            <fpage>1017</fpage>
            <lpage>1027</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11399076</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A bradyzoite stage-specifically expressed gene of <it>Toxoplasma gondii </it>encodes a polypeptide homologous to lactate dehydrogenase.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Parmley</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>1995</pubdate>
            <volume>73</volume>
            <fpage>291</fpage>
            <lpage>294</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8577343</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Expressed sequence tag analysis of the bradyzoite stage of <it>Toxoplasma gondii</it>: identification of developmentally regulated genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Manger</snm>
                  <fnm>ID</fnm>
               </au>
               <au>
                  <snm>Hehl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Parmley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sibley</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boothroyd</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>1998</pubdate>
            <volume>66</volume>
            <fpage>1632</fpage>
            <lpage>1637</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9529091</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Purine and pyrimidine metabolism.</p>
            </title>
            <aug>
               <au>
                  <snm>Berens</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Krug</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Marr</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Biochemistry and Molecular Biology of Parasites</source>
            <publisher>London: Academic Press</publisher>
            <editor>Marr JJ, Muller M</editor>
            <pubdate>1995</pubdate>
            <fpage>89</fpage>
            <lpage>117</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Gene transfer in the evolution of parasite nucleotide biosynthesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Striepen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pruijssers</snm>
                  <fnm>AJP</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gubbels</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Umejiego</snm>
                  <fnm>NN</fnm>
               </au>
               <au>
                  <snm>Hedstrom</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kissinger</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>3154</fpage>
            <lpage>3159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">365759</pubid>
                  <pubid idtype="pmpid" link="fulltext">14973196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>De novo pyrimidine biosynthesis is required for virulence of <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Fox</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Bzik</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>926</fpage>
            <lpage>929</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11859373</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Microneme proteins: structural and functional requirements to promote adhesion and invasion by the apicomplexan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Soldati</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dubremetz</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Lebrun</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Int J Parasitol</source>
            <pubdate>2001</pubdate>
            <volume>31</volume>
            <fpage>1293</fpage>
            <lpage>1302</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11566297</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p><it>De novo </it>ribosome biosynthesis is transcriptionally regulated in <it>Eimeria tenella</it>, dependent on its life cycle stage.</p>
            </title>
            <aug>
               <au>
                  <snm>Schaap</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Arts</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>van Poppel</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Vermeulen</snm>
                  <fnm>AN</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>2005</pubdate>
            <volume>139</volume>
            <fpage>239</fpage>
            <lpage>248</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15664658</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The complete set of <it>Toxoplasma gondii </it>ribosomal protein genes contains two conserved promoter elements.</p>
            </title>
            <aug>
               <au>
                  <snm>van Poppel</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Welagen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vermeulen</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Schaap</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Parasitology</source>
            <pubdate>2006</pubdate>
            <volume>133</volume>
            <fpage>19</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16674839</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Integrated mapping, chromosomal sequencing and sequence analysis of <it>Cryptosporidium parvum</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bankier</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Spriggs</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Fartmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Konfortov</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Madera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Ivens</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dear</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1787</fpage>
            <lpage>1799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403770</pubid>
                  <pubid idtype="pmpid" link="fulltext">12869580</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Specific DNA-binding by Apicomplexan AP2 transcription factors.</p>
            </title>
            <aug>
               <au>
                  <snm>De Silva</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Gehrke</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Olszewski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>León</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Chahal</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bulyk</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Llinás</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <fpage>8393</fpage>
            <lpage>8398</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2423414</pubid>
                  <pubid idtype="pmpid" link="fulltext">18541913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Transcriptional regulation of ribosomal protein genes in <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>van Poppel</snm>
                  <fnm>NF</fnm>
               </au>
            </aug>
            <source>PhD thesis</source>
            <publisher>Utrecht University, Faculty of Veterinary Medicine</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Genome sequence of the human malaria parasite <it>Plasmodium falciparum</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fung</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hyman</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Craig</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kyes</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Nene</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Shallom</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Suh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Angiuoli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pertea</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mather</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Vaidya</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>DM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <fpage>498</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12368864</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Complete genome sequence of the apicomplexan, <it>Cryptosporidium parvum</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Abrahamsen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Templeton</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Enomoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Abrahante</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lancto</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Widmer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tzipori</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buck</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bankier</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Dear</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Konfortov</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Spriggs</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kapur</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>441</fpage>
            <lpage>445</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15044751</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Histone-modifying complexes regulate gene expression pertinent to the differentiation of the protozoan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Saksouk</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bhatti</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Kieffer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Musset</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Garin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>WJ</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Cesbron-Delauw</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Hakimi</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>25</volume>
            <fpage>10301</fpage>
            <lpage>10314</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1291236</pubid>
                  <pubid idtype="pmpid" link="fulltext">16287846</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Histone mediated gene activation in <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>WJ</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Hakimi</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>2006</pubdate>
            <volume>148</volume>
            <fpage>109</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16644030</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Epigenomic modifications predict active promoters and gene structure in <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gissot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Ajioka</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Greally</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e77</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1891328</pubid>
                  <pubid idtype="pmpid" link="fulltext">17559302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Prediction of the general transcription factors associated with RNA polymerase II in <it>Plasmodium falciparum</it>: conserved features and differences relative to other eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Callebaut</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Prat</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meurice</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mornon</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Tomavo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>100</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1199594</pubid>
                  <pubid idtype="pmpid" link="fulltext">16042788</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains.</p>
            </title>
            <aug>
               <au>
                  <snm>Balaji</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Babu</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>3994</fpage>
            <lpage>4006</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1178005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16040597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Identification of putative <it>cis</it>-regulatory elements in <it>Cryptosporidium parvum </it>by de novo pattern finding.</p>
            </title>
            <aug>
               <au>
                  <snm>Mullapudi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lancto</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Abrahamsen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Kissinger</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>13</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779779</pubid>
                  <pubid idtype="pmpid" link="fulltext">17212834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p><it>Toxoplasma gondii </it>asexual development: identification of developmentally regulated genes and distinct patterns of gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Cleary</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Blader</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Brewer</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Boothroyd</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Eukaryotic Cell</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>329</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">118016</pubid>
                  <pubid idtype="pmpid" link="fulltext">12455982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Identification and characterization of differentiation mutants in the protozoan parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Matrajt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Donald</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>44</volume>
            <fpage>735</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11994154</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>ToxoDB: accessing the <it>Toxoplasma gondii </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Kissinger</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Gajria</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>234</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165519</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519989</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Systematic determination of genetic network architecture.</p>
            </title>
            <aug>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>22</volume>
            <fpage>281</fpage>
            <lpage>285</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10391217</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Controlling the false discovery rate: a practical and powerful approach to multiple testing.</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Roy Stat Soc</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
         <bibl id="B43">
            <title>
               <p>A simple sequentially rejective multiple test procedure.</p>
            </title>
            <aug>
               <au>
                  <snm>Holm</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Scand J Stat</source>
            <pubdate>1979</pubdate>
            <volume>6</volume>
            <fpage>65</fpage>
            <lpage>70</lpage>
         </bibl>
         <bibl id="B44">
            <aug>
               <au>
                  <snm>Sambrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fritsch</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Maniatis</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Molecular Cloning: A Laboratory Manual</source>
            <publisher>Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press</publisher>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Transient transfection and expression in the obligate intracellular parasite <it>Toxoplasma gondii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Soldati</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Boothroyd</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1993</pubdate>
            <volume>260</volume>
            <fpage>349</fpage>
            <lpage>352</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8469986</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
