<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>gb-2010-11-4-114</ui><ji>GBJ</ji><fm>
<dochead>Research highlight</dochead>
<bibl>
<title>
<p>Two different genomes that produce the same result</p>
</title>
<aug>
<au ca="yes" id="A1"><snm>Kessin</snm><mi>H</mi><fnm>Richard</fnm><insr iid="I1"/><email>rhk2@columbia.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Pathology and Cell Biology, Columbia University, 630 W. 168<sup>th </sup>St. New York, New York, 10032, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2010</pubdate>
<volume>11</volume>
<issue>4</issue>
<fpage>114</fpage>
<url>http://genomebiology.com/2010/11/4/114</url>
<xrefbib><pubidlist><pubid idtype="pmpid">20423532</pubid><pubid idtype="doi">10.1186/gb-2010-11-4-114</pubid></pubidlist></xrefbib>
</bibl>
<history><pub><date><day>27</day><month>4</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab> BioMed Central Ltd.</collab></cpyrt>
<shorttitle>
<p>Two different genomes that produce the same result</p>
</shorttitle>
<shortabs>
<p>Despite differences in genomic sequence, the developmental program of two distantly related Dictyostelium species is remarkably similar.</p>
</shortabs>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<p>Despite considerable differences in genomic sequence, the developmental program of gene expression between two similar <it>Dictyostelium </it>species is remarkably similar.</p>
</sec>
</abs>
</fm><meta>
<classifications>
<classification id="30010005" subtype="man_spc_id" type="BMC">Development</classification>
<classification id="30010008" subtype="man_spc_id" type="BMC">Evolution</classification>
<classification id="30010010" subtype="man_spc_id" type="BMC">Genome studies</classification>
</classifications>
</meta>
<bdy>
<sec>
<st>
<p>Research highlight</p>
</st>
<p>Have you ever wondered who determined the first DNA sequence? Or how hard it was? Well, I can't say it was the very first, but nearly 45 years ago, George Streisinger and his colleagues mutated the lysozyme gene of phage T4 with acridine, which they knew caused frameshifts, and then they caused a second site suppression (a mutation at a second site that suppressed the effect of the first) with another round of mutagenesis, restoring lysozyme activity. The amino acids encoded by the DNA between the two mutations should, in theory, have been changed - and they were. Knowing the changed amino acids and the genetic code, the group determined the actual DNA sequence. It was 23 nucleotides long and the complete study must have taken five people a year <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. There was an extra prize, however. The work confirmed that there were no 'commas' between codons. Reading the paper as a graduate student, I thought it was wonderful. And it was.</p>
<p>Now we sequence genomes with such speed that our problem is to make use of the information and not be overwhelmed by it. For the past few decades we have been obsessed with sequences from various organisms and have mastered the art of building phylogenetic trees to reveal distant evolutionary relationships, but comparing the genomes and the transcriptomes of similar organisms can also be revealing. Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> have assembled a team of molecular biologists and informaticians to ask a number of interesting questions about the development of two outwardly very similar species, the slime molds <it>Dictyostelium discoideum </it>and <it>Dictyostelium purpureum</it>, now that the sequence of <it>D. purpureum </it>as well as that of <it>D. discoideum </it>is available (R Sucgang <it>et al</it>., unpublished, (see <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>), The two species are social amoebae, single-celled creatures that live in the soil and eat bacteria until they run out of food. Then they do an extraordinary thing - the amoebae aggregate in groups of 50,000 or so and undergo a synchronous development such that, after 24 hours, they have created a fruiting body composed of a ball of resistant spores on top of a stalk of dead cells: the spores can then be dispersed to a more favorable environment. For movies of these organisms undergoing synchronous aggregation and development go to <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp> - it's worth the trouble (Figure <figr fid="F1">1</figr>). </p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p><it>Dictyostelium discoideum </it>has a multicellular development, the latter stages of which are shown in this figure</p></caption><text>
   <p><b><it>Dictyostelium discoideum </it>has a multicellular development, the latter stages of which are shown in this figure</b>. After aggregating by chemotaxis, the cells form a mound, differentiate into two cell types and then, over the next 12 hours, construct a fruiting body consisting of 80,000 viable spores on a stalk created by 20,000 dead stalk cells. <it>D. purpureum </it>has a similar development, except for an earlier formation of the stalk and the synthesis of a purple dye in the spore mass. Both species aggregate by chemotaxis toward sources of cAMP. The high synchrony of development makes these experiments possible. Image reproduced from <abbrgrp><abbr bid="B12">12</abbr></abbrgrp></p>
</text><graphic file="gb-2010-11-4-114-1"/></fig>
<p>The two species are very similar in appearance and behavior, and the chemoattractant aggregation signal for both species is cyclic AMP (cAMP). <it>D. purpureum </it>makes the stalk of the fruiting body a little differently and the spore mass is purple (<it>D. discoideum </it>is light yellow) but that is about the extent of the obvious morphological differences. And yet the genome sequences are different - as different, according to Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>, as those of humans and bony fishes, despite the fact that <it>D. discoideum </it>and <it>D. purpureum </it>group within the same clade within the many species of social amoebae, according to phylogenies constructed from ribosomal RNA gene (<it>rrnA</it>) sequences <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>. The overall sequence homology of the orthologues is 61.8%. Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> find that the two genomes retain certain gross similarities - both are remarkably AT-rich - but the coding and intergenic sequences have diverged. The questions they then ask are: Do the two species retain the same programs of development despite the differences in genomes? Do the genes necessary to make spores or stalk cells turn on at the same time in each species? How many genes are orthologs; that is, similar by virtue of direct descent from the same ancestral gene? And how many genes are transcribed, and which genes are transcribed the most or the least?</p>
<p>To analyze and compare the transcriptomes of the two species, Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> have abandoned the difficulties of microarray analysis in favor of RNA-sequencing (RNA-seq) <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>. The latter method has a greater dynamic range and cross hybrididization is not the problem in RNAseq that it is in microarray analysis. Transcripts were collected at 4-hour intervals during the synchronous development of the fruiting body of each species and converted into cDNAs. Fragments of the cDNAs were sequenced in reads of 35 base pairs, and the reads mapped onto the genomes of <it>D. discoideum </it>or <it>D. purpureum</it>. Any transcript that did not map to a unique sequence was not counted, which will eliminate repetitive elements would be eliminated. This means that actin genes, of which there are a number, would not be counted, nor would the transcripts coding for the mysterious poly-asparagine tracts found in thousands of <it>Dictyostelium </it>proteins.</p>
<p>There is interesting data in the transcriptome analysis and the authors provide a nice tool, DictyExpress <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>, to explore them, even for those not well versed in computational biology. The important finding is that among the transcripts that are mapped back to the two genomes, there are many orthologs - 7,619 to be exact (out of a predicted total of 12410 genes for <it>D. purpureum </it>and 13992 for <it>D. discoideum</it>) - and to a great extent they are transcribed in the same groups and in the same temporal order during development in the two species. Almost all genes are regulated during development, either up or down. The synchrony of development and the improved quantitation of RNA-seq (compared with microarrays) make these comparisons possible. Despite the differences in genome sequence, the regulation of developmental gene expression is maintained. Transcripts that are induced during development are coordinated with the slight differences in timing - <it>D. purpureum </it>takes 4 hours longer than <it>D. discoideum </it>to reach a particular developmental stage, and the appearance of the relevant transcripts is delayed as well. Many previously characterized genes are regulated almost identically in the two species.</p>
<p>What is the value of this molecular comparative anatomy? Some essential detail is perhaps lost in the statement of Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> likening the difference between <it>D. discoideum </it>and <it>D. purpureum </it>genomes to the differences between the genomes of bony fish and humans. The differences in sequence between the two slime molds will surely not be spread evenly over the genomes. In structural genes, important functional elements of the protein sequence tend to be conserved, leaving other sequences to diverge. Occasionally, a lack of conservation can be telling - the cell-cell recognition proteins of different species, for example, might be expected to be species-specific and vary in discrete regions <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. Amazingly, the amoebae of these two species will co-aggregate because of their mutual chemotaxis towards higher levels of cAMP, but they subsequently sort out before forming a fruiting body, as Raper and Thom showed long ago <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>.</p>
<p>But there is a long standing problem with <it>Dictyostelium </it>development and that concerns the responsible transcription factors - or rather their paucity <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. It has been known for years that development in <it>Dictyostelium </it>is accompanied by shifts in the expression patterns of many genes. In fact, it seems as if the cells switch from expressing one set of genes to expressing another, exactly at the time they switch from being unicellular to being multicellular. Parikh <it>et al</it>. <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> now show that the cells alter the abundance of almost every mRNA in the transcriptome during development, so one might expect that transcription factors would be central to the regulation of <it>Dictyostelium </it>development, as they are in <it>Drosophila</it>, for example. But this may not the case - <it>Dictyostelium </it>researchers have looked for developmental mutants by mutagenesis screens with restriction-enzyme-mediated mutagenesis (REMI), a form of insertional mutagenesis, for the past 18 years, but only a handful of the hundreds of mutants found are in canonical transcription factors. Of such transcription factors, two Mybs, one GATA, two bZIPs, CRTF and a STAT have been found, but a close correlation of any of these with any developmental program or coordinated gene expression in <it>Dictyostelium </it>has been elusive (see <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp> for the roles of these factors and the phenotypes of their mutants). One exception is srfA, a trancriptional regulator similar in sequence to mammalian serum-response factor, whose loss by mutation results in the depression of transcripts involved in spore formation. <it>D. discoideum </it>and <it>D. purpureum </it>have the lowest known number of transcription factors relative to their genome size <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>.</p>
<p>There are a number of possible explanations for these findings. One is that transcription factor genes have been mutated and associated developmental defects have been observed, but the gene products were not recognized as gene regulatory proteins because they had no homology with known transcription factors. A mutation in the <it>D. discoideum </it>G-box binding factor (GBF), for example, blocks post-aggregation development, but it is a non-canonical transcription factor. Another possibility is that the extraordinary conserved temporal expression of many orthologous transcripts in prestalk and prespore cells in the two species could be controlled by some means in addition to traditional transcription factors and recognition sites.</p>
<p>The exceptional AT-richness of promoter regions - 95% in most cases - invites comparison with another organism with a similarly sized AT-rich genome - <it>Plasmodium falciparum</it>. In this case too, transcriptional regulation has been difficult to study in detail, although recently a family of AP2 (Apicomplexan apetala2) transcription factors have been shown to be linked to sporozoite specific genes<abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. These have weak homology with plant AP2 factors and, like GBF, bind sequences that have some GC content. Perhaps, with the exception of GBF and a few others, we are just not seeing the <it>Dictyostelium </it>transcription factors.</p>
<p>The extraordinary synchrony of development of <it>Dictyostelium </it>species and the quantitative advantages of RNA-seq are powerful partners, but such comparisons could be imagined in developing lineages within a particular species, such as different breeds of domesticated animals. How do the neural crest cells that make the snout of a greyhound differ from those of a bulldog? Is it just a few sequences that differ? Or a matter of transcript number? Is the transcript repertory the same but in one case there are more progenitors? These methods might be applied to find out. I am not suggesting sacrificing puppies (perhaps fish would be better subjects), but it is the kind of thing that Darwin would have liked to know.</p>
</sec>
</bdy><bm>
<refgrp><bibl id="B1"><title><p>Change of a sequence of amino acids in phage T4 lysozyme by acridine-induced mutations</p></title><aug><au><snm>Terzaghi</snm><fnm>E</fnm></au><au><snm>Okada</snm><fnm>Y</fnm></au><au><snm>Streisinger</snm><fnm>G</fnm></au><au><snm>Emrich</snm><fnm>E</fnm></au><au><snm>Inouye</snm><fnm>M</fnm></au><au><snm>Tsugita</snm><fnm>A</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1966</pubdate><volume>56</volume><fpage>500</fpage><lpage>507</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.56.2.500</pubid><pubid idtype="pmcid">224401</pubid><pubid idtype="pmpid">5229972</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Conserved developmental transcriptomes in evolutionarily divergent species.</p></title><aug><au><snm>Parikh</snm><fnm>A</fnm></au><au><snm>Miranda</snm><fnm>ER</fnm></au><au><snm>Katoh-Kurasawa</snm><fnm>M</fnm></au><au><snm>Fuller</snm><fnm>D</fnm></au><au><snm>Rot</snm><fnm>G</fnm></au><au><snm>Zagar</snm><fnm>L</fnm></au><au><snm>Curk</snm><fnm>T</fnm></au><au><snm>Sucgang</snm><fnm>R</fnm></au><au><snm>Chen</snm><fnm>R</fnm></au><au><snm>Zupan</snm><fnm>B</fnm></au><au><snm>Loomis</snm><fnm>WF</fnm></au><au><snm>Kuspa</snm><fnm>A</fnm></au><au><snm>Shaulsky</snm><fnm>G</fnm></au></aug><source>Genome Biol</source><pubdate>2010</pubdate><volume>11</volume><fpage>R35</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-3-r35</pubid><pubid idtype="pmpid" link="fulltext">20236529</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Dictyostelium purpureum welcome page</p></title><url>http://genomes.dictybase.org/purpureum</url></bibl><bibl id="B4"><title><p>dictyBase</p></title><url>http://www.dictybase.org</url></bibl><bibl id="B5"><title><p>Molecular phylogeny and evolution of morphology in the social amoebae.</p></title><aug><au><snm>Schaap</snm><fnm>P</fnm></au><au><snm>Winckler</snm><fnm>T</fnm></au><au><snm>Nelson</snm><fnm>M</fnm></au><au><snm>Alvarez-Curto</snm><fnm>E</fnm></au><au><snm>Elgie</snm><fnm>B</fnm></au><au><snm>Hagiwara</snm><fnm>H</fnm></au><au><snm>Cavender</snm><fnm>J</fnm></au><au><snm>Milano-Curto</snm><fnm>A</fnm></au><au><snm>Rozen</snm><fnm>DE</fnm></au><au><snm>Dingermann</snm><fnm>T</fnm></au><au><snm>Mutzel</snm><fnm>R</fnm></au><au><snm>Baldauf</snm><fnm>SL</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>314</volume><fpage>661</fpage><lpage>663</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1130670</pubid><pubid idtype="pmcid">2173941</pubid><pubid idtype="pmpid" link="fulltext">17068267</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>The beginning of the end for microarrays?</p></title><aug><au><snm>Shendure</snm><fnm>J</fnm></au></aug><source>Nat Methods</source><pubdate>2008</pubdate><volume>5</volume><fpage>585</fpage><lpage>587</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth0708-585</pubid><pubid idtype="pmpid" link="fulltext">18587314</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>DictyExpress</p></title><url>http://www.ailab.si/dictyexpress</url></bibl><bibl id="B8"><title><p>Polymorphic members of the lag gene family mediate kin discrimination in <it>Dictyostelium</it>.</p></title><aug><au><snm>Benabentos</snm><fnm>R</fnm></au><au><snm>Hirose</snm><fnm>S</fnm></au><au><snm>Sucgang</snm><fnm>R</fnm></au><au><snm>Curk</snm><fnm>T</fnm></au><au><snm>Katoh</snm><fnm>M</fnm></au><au><snm>Ostrowski</snm><fnm>EA</fnm></au><au><snm>Strassmann</snm><fnm>JE</fnm></au><au><snm>Queller</snm><fnm>DC</fnm></au><au><snm>Zupan</snm><fnm>B</fnm></au><au><snm>Shaulsky</snm><fnm>G</fnm></au><au><snm>Kuspa</snm><fnm>A</fnm></au></aug><source>Curr Biol</source><pubdate>2009</pubdate><volume>19</volume><fpage>567</fpage><lpage>572</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cub.2009.02.037</pubid><pubid idtype="pmcid">2694408</pubid><pubid idtype="pmpid" link="fulltext">19285397</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Interspecies mixtures of the Dictyosteliaceae.</p></title><aug><au><snm>Raper</snm><fnm>KB</fnm></au><au><snm>Thom</snm><fnm>C</fnm></au></aug><source>Am J Bot</source><pubdate>1941</pubdate><volume>28</volume><fpage>69</fpage><lpage>78</lpage><xrefbib><pubid idtype="doi">10.2307/2437063</pubid></xrefbib></bibl><bibl id="B10"><title><p>Components of the <it>Dictyostelium </it>gene expression regulatory machinery.</p></title><aug><au><snm>Shaulsky</snm><fnm>G</fnm></au><au><snm>Huang</snm><fnm>E</fnm></au></aug><source>Dictyostelium Genomics</source><publisher>Norwich, UK: Horizon Scientific Press</publisher><editor>Loomis WF, Kuspa A</editor><pubdate>2005</pubdate><fpage>83</fpage><lpage>102</lpage></bibl><bibl id="B11"><title><p>Structural determinants of DNA binding by a <it>P. falciparum </it>ApiAP2 transcriptional regulator.</p></title><aug><au><snm>Lindner</snm><fnm>SE</fnm></au><au><snm>De Silva</snm><fnm>EK</fnm></au><au><snm>Keck</snm><fnm>JL</fnm></au><au><snm>Llin&#225;s</snm><fnm>M</fnm></au></aug><source>J Mol Biol</source><pubdate>2010</pubdate><volume>395</volume><fpage>558</fpage><lpage>567</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jmb.2009.11.004</pubid><pubid idtype="pmpid" link="fulltext">19913037</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Learning to get along despite struggling to get by</p></title><aug><au><snm>Ostrowski</snm><fnm>EA</fnm></au><au><snm>Shaulsky</snm><fnm>G</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>218</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-5-218</pubid><pubid idtype="pmpid" link="fulltext">19519929</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>