<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2013-14-1-401</ui>
   <ji>1465-6906</ji>
   <fm>
      <dochead>Open letter</dochead>
      <bibl>
         <title>
            <p>Genome sequencing of the important oilseed crop <it>Sesamum indicum </it>L</p>
         </title>
         <aug>
            <au ca="yes" id="A1"><snm>Zhang</snm><fnm>Haiyang</fnm><insr iid="I1"/><email>zhy@hnagri.org.cn</email></au>
            <au id="A2"><snm>Miao</snm><fnm>Hongmei</fnm><insr iid="I1"/><email>miaohongmei@yahoo.com.cn</email></au>
            <au id="A3"><snm>Wang</snm><fnm>Lei</fnm><insr iid="I2"/><email>wanglei@nankai.edu.cn</email></au>
            <au id="A4"><snm>Qu</snm><fnm>Lingbo</fnm><insr iid="I3"/><email>qulingbo@zzu.edu.cn</email></au>
            <au id="A5"><snm>Liu</snm><fnm>Hongyan</fnm><insr iid="I4"/><email>liuhy1219@163.com</email></au>
            <au id="A6"><snm>Wang</snm><fnm>Qiang</fnm><insr iid="I5"/><email>wq161@yahoo.com.cn</email></au>
            <au id="A7"><snm>Yue</snm><fnm>Meiwang</fnm><insr iid="I6"/><email>mwyuecarl@163.com</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Henan Sesame Research Center, Henan Academy of Agricultural Sciences, Zhengzhou 450002, People's Republic of China</p></ins>
            <ins id="I2"><p>TEDA School of Biological Sciences and Biotechnology, Nankai University, Tianjin 300457, People's Republic of China</p></ins>
            <ins id="I3"><p>Department of Bioengineering, Henan Technology University, Zhengzhou 450001, People's Republic of China</p></ins>
            <ins id="I4"><p>Institute of Plant Protection Research, Henan Academy of Agricultural Sciences, Zhengzhou 450002, People's Republic of China</p></ins>
            <ins id="I5"><p>Crop Research Institute, Anhui Academy of Agricultural Sciences, Hefei 230031, People's Republic of China</p></ins>
            <ins id="I6"><p>Crops Research Institute, Jiangxi Academy of Agricultural Sciences, Nanchang 330200, People's Republic of China</p></ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2013</pubdate>
         <volume>14</volume>
         <issue>1</issue>
         <fpage>401</fpage>
         <url>http://genomebiology.com/2013/14/1/401</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">23369264</pubid><pubid idtype="doi">10.1186/gb-2013-14-1-401</pubid></pubidlist></xrefbib>
      </bibl>
      <history><pub><date><day>31</day><month>1</month><year>2013</year></date></pub></history>
      <cpyrt><year>2013</year><collab>BioMed Central Ltd.</collab></cpyrt>
      <kwdg>
         <kwd>genomics</kwd>
         <kwd>sequencing</kwd>
         <kwd>sesame</kwd>
      </kwdg>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>The Sesame Genome Working Group (SGWG) has been formed to sequence and assemble the sesame (<it>Sesamum indicum </it>L.) genome. The status of this project and our planned analyses are described.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="300100010" subtype="man_spc_id" type="BMC">Genome studies</classification>
         <classification id="300100019" subtype="man_spc_id" type="BMC">Plant biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>The importance of the sesame genome</p>
         </st>
         <p>Sesame (<it>Sesamum indicum </it>L., 2n = 26), which belongs to the <it>Sesamum </it>genus of the Pedaliaceae family, is one of the oldest oilseed crops and is cultivated in tropical and subtropical regions of Asia, Africa and South America <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Its cultivation history can be traced back to between 5,000 and 5,500 years ago in the Harappa Valley of the Indian subcontinent <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The total area of sesame harvested in the world is currently 7.8 million hectares, and annual production is 3.84 million tons (2010, UN Food and Agriculture Organization data). Being one of the four main sesame-producing countries, China has contributed 15.2 to 32.5% of the total world sesame production over the past 10 years (2001 to 2010, UN Food and Agriculture Organization data). Sesame has one of the highest oil contents: decorticated seeds contain 45 to 63% oil <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The seed is also rich in protein, vitamins, including niacin, minerals and lignans, such as sesamolin and sesamin <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, and it is a popular food and medicine <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Sequencing and analysis of the sesame genome is essential if we are to elucidate the evolutionary origins and characteristics of the sesame species.</p>
         <p><it>Sesamum </it>is the main genus in the family Pedaliaceae, which contains 17 genera and 80 species of annual and perennial herbs that are distributed in the Old World tropics and subtropics <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The taxonomy and cytogenetics of the <it>Sesamum </it>genus has been reviewed and debated for a long time <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, and many heterogeneous landraces present in various growing areas still need to be distinguished <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B18">18</abbr></abbrgrp>. <it>S. indicum </it>is the sole cultivar in the <it>Sesamum </it>genus and evolved from wild populations <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B19">19</abbr></abbrgrp>. However, the origin and evolution of cultivated sesame is still unclear and requires more detailed investigation <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B15">15</abbr></abbrgrp>. Evidence suggests that sesame may have originated in either India or Africa <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Bedigian reported that sesame was derived from the Indian subcontinent (the western Indian peninsula and parts of Pakistan) thousands of years ago, and believed that the progenitor of sesame is a taxon named <it>S. orientale </it>var. <it>malabaricum </it>Nar. <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, although most species of <it>Sesamum </it>and genera of the Pedaliaceae are native to Africa <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. We hope to clarify the origin and phylogeny of <it>S. indicum </it>by applying comparative genomics and morphological and cytological analyses.</p>
         <p>Sesame seed is commonly known as the 'Queen of the oil seeds', perhaps for its resistance to oxidation and rancidity <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. As it contains lignans, sesame oil also exerts anti-cancer properties both <it>in vitro </it>and in animal bioassays <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Compared with peanut (<it>Arachis hypogaea</it>), soybean (<it>Glycine max</it>), oilseed rape (<it>Brassica napus</it>), sunflower (<it>Helianthus annuus </it>L.) and other oilseed crops, sesame seed oil has an ideal nearly equal content of oleic acid (18:1) (39.6%) and linoleic acid (18:2) (46.0%), and has desirable physiological effects, including antioxidant activity, and blood pressure- and serum lipid-lowering potential <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Studies of the genome and functional genome of sesame are essential for elucidating the regulatory mechanisms underlying fatty acid and storage protein composition and content, and the secondary metabolism of antioxidant lignans <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>.</p>
         <p>Sesame grows well and gives good yields in both tropical and temperate climates. Its tolerance of drought and high temperatures make sesame well suited to land where few other crops can survive. However, compared with other oilseed crops, sesame seed production is not consistent, as it is susceptible to pathogens, waterlogging and low temperature conditions <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Sesame breeding objectives, like those for other seed-producing crops, especially oil crops, are to create new varieties with high quality and yield potential, and resistance to pathogens (including <it>Fusarium </it>wilt and Charcoal rot diseases), insect pests, waterlogging, drought and low temperature stress <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. However, identification of genes or gene families and marker loci associated with yield, quality, and resistance to disease and abiotic stresses has been hampered due to a lack of information on the sesame genome. Only a few functional genes, mainly involved in the formation and regulation of fatty acids, seed storage proteins and secondary metabolites, and salt stress response, have been investigated <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. With the exception of a sole amplified fragment length polymorphism (AFLP) marker associated with the indehiscent-capsule trait reported in 2003 <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, no quantitative trait loci have been found in the linkage map of sesame, let alone used for molecular-assisted selection (MAS) in sesame breeding programs. Integrating desirable qualities from the few available excellent germplasm resources, including wild species, will not be achievable rapidly unless considerably more genomic and functional genomic information is available. In addition, sequencing of the sesame genome will facilitate studies of other genera of the Pedaliaceae family by providing a closely related reference genome.</p>
         <p>We therefore plan to implement a Sesame Genome Project and sequence the <it>S. indicum </it>genome using the Chinese domestic cultivar, Yuzhi 11, which represents <it>S. indicum </it>cultivars with a simple stem, three flowers per axilla, oblong-quadrangular capsules, and white flower and seed-coat color. Yuzhi 11 is one of the most important Chinese cultivars due to its high oil content (56.66%), resistance to fungal pathogens such as <it>Fusarium </it>wilt, charcoal rot and <it>Alternaria </it>leaf spot, and waterlogging stress. It is cultivated in the main production regions of China <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Phylogenetic position of sesame</p>
         </st>
         <p><it>S. indicum </it>is located in the asterids clade of the core eudicotyledons of Angiosperm Phylogeny Group 2 (APG 2) <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Its phylogenetic position determined using sesame chloroplast genomic data indicates that <it>Sesamum </it>(Pedaliaceae family) is a sister genus to the <it>Olea </it>and <it>Jasminum </it>(Oleaceae family) clade and represents the core lineage of the Lamiales families <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Compared with the 19 families shown in Figure <figr fid="F1">1</figr> (adapted from the NCBI taxonomy database <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>), <it>Sesamum</it>, which has 36 available genomes, is closely related to the Solanaceae and Phrymaceae families, but distantly related to other oil crops such as soybean (<it>Glycine max</it>), castor (<it>Ricinus communis</it>) and rape (<it>Brassica rapa</it>). At present, genomic information on the Pedaliaceae family is quite limited, as genomes from this family have not previously been sequenced.</p>
         <fig id="F1"><title><p>Figure 1</p></title><caption><p>Phylogenetic positions of sesame and the 36 land plants with available genome sequences</p></caption><text>
   <p><b>Phylogenetic positions of sesame and the 36 land plants with available genome sequences</b>. <sup>a</sup>Refers to sesame (<it>S. indicum </it>L.), a member of the Pedaliaceae family, only 34 genera of which have been entered in the NCBI taxonomy database.</p>
</text><graphic file="gb-2013-14-1-401-1"/></fig>
      </sec>
      <sec>
         <st>
            <p>Overview of the Sesame Genome Project</p>
         </st>
         <p>The Sesame Genome Working Group (SGWG) comprises six major sesame research teams in China involved in investigating genetic diversity of germplasm resources, functional genomics, and biotic and abiotic resistance, in addition to sesame genome sequencing. All members of the SGWG work under the Toronto Statement for prepublication data release <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. The main goal of the Sesame Genome Project is to provide a fine map of <it>S. indicum </it>and facilitate global genomic and functional genomic studies. We have already released a preliminary draft assembly <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> of the sesame genome that can be used according to the conditions outlined in this letter. A detailed plan for the Sesame Genome Project has been made available on our website <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Properties of the <it>S. indicum </it>genome and available genomic resources</p>
         </st>
         <p>Natural sesame species can be divided into three types based on chromosome numbers, that is, 2n = 26 (for example, <it>S. indicum, S. alatum</it>), 2n = 32 (for example, <it>S. protratum, S. angolense</it>) and 2n = 64 (for example, <it>S. radiatum, S. schinzianum</it>) <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B37">37</abbr></abbrgrp>. The basic chromosome number in the <it>Sesamum </it>genus is X = 8 and 13, with X = 13 probably resulting from ancient polyploidy <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The size of a haploid genome of <it>S. indicum </it>(2n = 26) was reported to be about 0.95 Gb, with a mass of 0.97 pg <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>, which is out of proportion with the 0.51 Gb and 0.97 Gb of <it>Cerathoteca sesamoides </it>(2n = 32) and <it>S. radiatum </it>(2n = 64), respectively <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Before beginning this genome project, we examined the characteristics of sesame chromosomes using cv. Yuzhi 11. Results showed that its karyotype formula is 2n = 2x = 26 = 6m + 16sm + 4st, and chromosome length ranges from 1.21 to 2.48 &#956;m (H Zhang, unpublished data). We distinguished and numbered the chromosomes with 45S rRNA, simple sequence repeats (SSR) and bacterial artificial chromosome (BAC) sequence probes using fluorescent <it>in situ </it>hybridization (FISH) and BAC-FISH techniques to facilitate super-scaffold assembly in the sesame genome (H Zhang, unpublished data). Comparing genome size with that of <it>Arabidopsis thaliana </it><abbrgrp><abbr bid="B65">65</abbr></abbrgrp>, soybean (cv. William 82) <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> and rice (cv. Nipponbare) <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, the genome size of <it>S. indicum </it>cv. Yuzhi 11 is estimated by flow cytometry to be about 369 Mb (H Zhang, unpublished data). From our preliminary sequencing data, we estimate the genome size to be approximately 354 Mb, close to this result (see below).</p>
         <p>The sesame chloroplast genome was published recently <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Sequencing of the chloroplast genome of <it>S. indicum </it>cv. Yuzhi 11 has also been performed (H Zhang, unpublished data), and will be used for raw read filtering and genome assembly in our Sesame Genome Project. A total of 86,222 unigenes with an average length of 629 bp are available and 46,584 (54.03%) unigenes have a significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value &lt;10<sup>-5</sup>) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Before the beginning of this project, we sequenced sesame transcriptomes from 24 groups of <it>S. indicum </it>materials and treatments using Illumina paired-end sequencing technology to greatly enrich available information on the functional genome <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B68">68</abbr></abbrgrp>, obtaining a 40G dataset containing 42,566 unitranscript sequences. We also constructed a BIBAC (pCLD 04541) library of 80,000 clones with an insert size of 120 kb and a BAC (CopyControl&#8482; pCC1BAC&#8482;) library of 57,600 clones with an insert size of 85 kb. The genome coverage of both BAC libraries was 27- and 13-fold, respectively (H Zhang, unpublished data). There are 45,093 <it>S. indicum </it>expressed sequence tags (ESTs) available in the NCBI EST database. Prior to our work, only two other <it>S. indicum </it>seed-specific cDNA libraries, including one full-length cDNA library, had been constructed, some clones of which were chosen at random and sequenced <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B69">69</abbr></abbrgrp>. In order to explore more genes involved in sesame growth and development, we constructed a full-length cDNA library of <it>S. indicum </it>cv. Yuzhi 11 containing 300,000 clones, 1,200 clones of which were selected randomly and sequenced (H Zhang, unpublished data). The genomic and transcriptomic data from these studies should facilitate genome assembly and analysis. The first sesame linkage map, which contains 284 microsatellite polymorphic loci, was set up in 2009 and has been used as a landmark frame for assembly of the whole genome <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. We recently updated this high-density linkage map with 653 SSR, SNP, AFLP and random selective amplification of microsatellite polymorphic loci (RSAMPL) markers falling into 14 linkage groups to facilitate sesame genome assembly and anchoring of trait loci (H Zhang, unpublished data).</p>
      </sec>
      <sec>
         <st>
            <p>Sequencing strategy for the <it>S. indicum </it>genome</p>
         </st>
         <p>The Sesame Genome Project is divided into three phases. The first phase, which has already been completed, involves high coverage Illumina sequencing and draft genome assembly. We constructed five types of Illumina libraries, including two paired-end libraries with insert sizes of 300 and 500 bp, and three mate-pair libraries with insert sizes of 2, 3 and 5 kb. In order to avoid bias in library construction, at least two libraries for each insert length were constructed. Illumina technology was used to generate 98 Gb of reads, giving a 276&#215; coverage of the estimated genome (Table <tblr tid="T1">1</tblr>). Subsequently, the draft genome was assembled using ABySS (v 1.3.3) <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. Paired-end Illumina reads were first assembled into contigs. Mate-pair reads with insert sizes of 2, 3 and 5 kb were then aligned into the contigs, and the relationship between mate-pair reads was used to join contigs and construct scaffolds. As a result, a preliminary assembly of 293.7 Mb was generated (Table <tblr tid="T2">2</tblr>).</p>
         <tbl id="T1"><title><p>Table 1</p></title><caption><p>Summary of Illumina data for the <it>S. indicum </it>genome</p></caption><tblbdy cols="4">
      <r>
         <c ca="center">
            <p>
               <b>Sequencing platform</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Library type (n)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Insert size (bp)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Usable bases (Gb)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Illumina genome analyzer (Solexa)</p>
         </c>
         <c ca="center">
            <p>Paired-end (12)</p>
         </c>
         <c ca="center">
            <p>300</p>
         </c>
         <c ca="center">
            <p>28.12</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>500</p>
         </c>
         <c ca="center">
            <p>44.51</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Mate-pair (5)</p>
         </c>
         <c ca="center">
            <p>2,000</p>
         </c>
         <c ca="center">
            <p>7.23</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3,000</p>
         </c>
         <c ca="center">
            <p>7.74</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5,000</p>
         </c>
         <c ca="center">
            <p>10.65</p>
         </c>
      </r>
   </tblbdy></tbl>
         <tbl id="T2"><title><p>Table 2</p></title><caption><p>Overview of the current draft assembly of <it>S. indicum</it></p></caption><tblbdy cols="8">
      <r>
         <c ca="center">
            <p>
               <b>Estimatedgenome size (Mb)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Genome assembly length (Mb)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Estimatedcoverage (%)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>ContigsN50 (kb)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>ContigsN90 (kb)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>GC (%)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>ScaffoldsN50 (kb)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>ScaffoldsN90 (kb)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>354</p>
         </c>
         <c ca="center">
            <p>293.7</p>
         </c>
         <c ca="center">
            <p>82.9%</p>
         </c>
         <c ca="center">
            <p>19.0</p>
         </c>
         <c ca="center">
            <p>3.9</p>
         </c>
         <c ca="center">
            <p>34.6</p>
         </c>
         <c ca="center">
            <p>22.6</p>
         </c>
         <c ca="center">
            <p>4.3</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Note: these statistics assume a genome size of 354 Mb. GC, guanine (G) + cytosine (C).</p>
   </tblfn></tbl>
         <p>The second phase will involve Roche 454 pyrosequencing and BAC sequencing and fine map construction. We have constructed Roche 454 paired-end libraries with an insert size of 20 kb and will generate 3.5 Gb of data giving a 250&#215; coverage of the estimated genome. We also plan to end-sequence 40,000 sesame BAC clones using conventional Sanger sequencing, giving a 12&#215; coverage of the estimated genome. To ensure hybrid <it>de novo </it>assembly of the best possible quality, we will use a modified Celera Assembler pipeline <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Roche 454 paired-end reads and BAC-end reads are better for spanning longer repetitive elements and joining scaffolds into superscaffolds. We will use BAC-end information to retrieve and select 1,000 specific BAC clones, one end of which aligns well to the scaffold while the other end is located in a gap region, for full-length sequencing using the Illumina BAC polling method. The full-length BAC sequences will fill in the gaps within superscaffolds and greatly improve genome integrity. At this stage, we expect to obtain a fine map of Yuzhi 11 with 800 to 1,000 superscaffolds of a putative N50 length of 1 Mb and N90 length of 250 kb.</p>
         <p>In the final phase, the superscaffolds will be anchored to chromosomes. We will first anchor the BACs containing mapped SSR markers on the updated linkage map <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> (H Zhang, unpublished data). Physical distances between landmarks will then be determined. Furthermore, we will construct a physical chromosome map based on at least 1,000 BAC clones using information obtained from BAC-FISH and BAC-end. At least one BAC will be anchored on the chromosomes per superscaffold to ensure all superscaffolds are anchored onto the 13 chromosomes. In order to validate the accuracy and integrity of the sesame genome assembly, several quality control parameters, such as read depth of coverage, average quality values per contig, discordant read pairs and gene footprint coverage, will be examined. To check the accuracy of the assembly of scaffolds, we will also complete full-length sequencing of 15 BAC clones using conventional Sanger sequencing and align them to the scaffolds.</p>
      </sec>
      <sec>
         <st>
            <p>Timeline and goals of the Sesame Genome Project</p>
         </st>
         <p>The blueprint for the Sesame Genome Project was conceived and designed by the SGWG in 2009. We completed the goals of the first phase in March 2012. In the second phase, Roche 454 paired-ends reads will be sequenced by December 2012, and the double-ended sequencing of the 40,000 BAC clones and full-length sequencing of 1,000 BAC clones will be completed by June 2013. The final phase of scaffold anchoring will proceed in parallel with bioinformatics analysis. We expect to complete all the goals of Sesame Genome Project and submit a paper by December 2013. To make our data broadly available prior to publication, the completion of each goal of these phases will be publicly communicated via our website <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Updated versions of assembly data will be made available to any independent research groups performing non-genome-scale analyses. Sequence data and the preliminary assembly produced in the first phase are already available on the website.</p>
      </sec>
      <sec>
         <st>
            <p>Status of current preliminary genome assemblies</p>
         </st>
         <p>The current draft assembly of Yuzhi 11 is 293.7 Mb in length, with a GC content of 34.65%. The N50 and N90 sizes of the scaffolds are 22.6 kb and 4.3 kb, respectively (Table <tblr tid="T2">2</tblr>). Genome size was estimated to be 354 Mb using the well-established 17-mer method <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>, in line with flow cytometry data that suggest it is 369 Mb (H Zhang, unpublished data). The 17-mer distribution frequency in 16.77 Gb of trimmed Illumina PE reads was calculated using Jellyfish (v1.1.4) <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>. We identified a total of 13,931,658,332 unique k-mers, and 87,207,553 k-mers that had a frequency &lt;10. The frequency of peak k-mers was 39 (Figure <figr fid="F2">2</figr>).</p>
         <fig id="F2"><title><p>Figure 2</p></title><caption><p>K-mer (17mer) frequency analysis of the <it>S. indicum </it>genomic sequence</p></caption><text>
   <p><b>K-mer (17mer) frequency analysis of the <it>S. indicum </it>genomic sequence</b>. Data produced from 500 bp insert libraries. The peak k-mer frequency is 39 and its minimum point is 10. Genome size was estimated with the formula: Estimated genome size (bp) = total number of k-mers with a frequency &gt;10/peak k-mer frequency.</p>
</text><graphic file="gb-2013-14-1-401-2"/></fig>
         <p>In order to determine the frequency and complexity of repetitive elements in the draft assembly, we compared the assembly information with the <it>Arabidopsis </it>repetitive elements database from the RepeatMasker library (version 20120418) and the sesame <it>de novo </it>database constructed for the Yuzhi 11 draft assembly (RepeatModeler, version 1.0.5) using RepeatMasker (version open-3.2.9) <abbrgrp><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp>. Thirty-eight percent of the draft assembly was identified as repetitive elements (Table <tblr tid="T3">3</tblr>), only approximately 5.7% of which shared homology with the <it>Arabidopsis </it>database.</p>
         <tbl id="T3"><title><p>Table 3</p></title><caption><p>Repeats derived from <it>de novo </it>and homology-based predictions in <it>S. indicum</it></p></caption><tblbdy cols="4">
      <r>
         <c ca="center">
            <p>
               <b>Repeat type</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Repeat number </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Length occupied (bp)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Percentage of sequences</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Retroelements</p>
         </c>
         <c ca="center">
            <p>18,322</p>
         </c>
         <c ca="center">
            <p>5,811,328</p>
         </c>
         <c ca="center">
            <p>1.98</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>SINEs</p>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>328</p>
         </c>
         <c ca="center">
            <p>0.00</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>LINEs</p>
         </c>
         <c ca="center">
            <p>2,266</p>
         </c>
         <c ca="center">
            <p>374,709</p>
         </c>
         <c ca="center">
            <p>0.13</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>LTR elements</p>
         </c>
         <c ca="center">
            <p>16,048</p>
         </c>
         <c ca="center">
            <p>5,436,291</p>
         </c>
         <c ca="center">
            <p>1.85</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>DNA transposons</p>
         </c>
         <c ca="center">
            <p>3,349</p>
         </c>
         <c ca="center">
            <p>571,933</p>
         </c>
         <c ca="center">
            <p>0.19</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>hobo-Activator</p>
         </c>
         <c ca="center">
            <p>305</p>
         </c>
         <c ca="center">
            <p>43,075</p>
         </c>
         <c ca="center">
            <p>0.01</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Tc1-IS630-Pogo</p>
         </c>
         <c ca="center">
            <p>1,232</p>
         </c>
         <c ca="center">
            <p>155,117</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>En-Spm</p>
         </c>
         <c ca="center">
            <p>96</p>
         </c>
         <c ca="center">
            <p>55,227</p>
         </c>
         <c ca="center">
            <p>0.02</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>MuDR-IS905</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>347</p>
         </c>
         <c ca="center">
            <p>0.00</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Total bases masked</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>16,852,950</p>
         </c>
         <c ca="center">
            <p>5.74</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Unclassified repeats<sup>a</sup></p>
         </c>
         <c ca="center">
            <p>835,752</p>
         </c>
         <c ca="center">
            <p>92,380,494</p>
         </c>
         <c ca="center">
            <p>31.65</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Total interspersed repeats</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>92,380,494</p>
         </c>
         <c ca="center">
            <p>31.65</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Unclassified repeats refer to predicted repeats (sequences in the <it>de novo </it>repeats library) that cannot be classified by RepeatMasker.</p>
   </tblfn></tbl>
      </sec>
      <sec>
         <st>
            <p>Quality control the raw data and intermediate datasets</p>
         </st>
         <p>In order to control the quality of raw data, the SolexaQA package was used to verify the sequence data generated from each of the 17 Illumina-Solexa libraries <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>. The raw reads were trimmed by DynamicTrim (quality threshold Q &#8776; 20) and then filtered by LengthSort (the length cutoff set as 25). Unpaired reads would be screened and discarded in this system. Meanwhile, Roche 454 reads data, which are kept in Standard Flowgram Format (SFF), were converted into FastQ format and evaluated using the traditional quality metrics. As Sanger reads may contain vector sequences, the Lucy package was used to search and trim for cutting off the vector sequence contamination <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. Low-quality bases and chimeric reads would be tracked with trim modules of the Celera Assembler.</p>
         <p>We validated the coding region coverage of the draft assembly using two different gene footprint coverage methods. Using the Core Eukaryotic Genes Mapping Approach (CEGMA) <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>, 444 (96.9%) of the 458 core eukaryotic genes (CEGs) mapped against the draft assembly were identified. An RNA sequence based method employing Velvet <abbrgrp><abbr bid="B80">80</abbr></abbrgrp> and OASES <abbrgrp><abbr bid="B81">81</abbr></abbrgrp> allowed us to assemble 3.5 Gb of RNA-Seq reads (NCBI accession SRX061117) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> into 99,589 putative transcripts. Putative transcripts were then translated into 82,549 peptides using ESTScan (version 2.1) <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. These peptides were aligned against the SWISS-PROT <abbrgrp><abbr bid="B83">83</abbr></abbrgrp> database using BLAST (E-value 10<sup>-5</sup>) to obtain high-confidence peptides. Redundant peptides (such as alternative-splicing transcripts) were filtered according to BLAST scores and the names of the hits. More than 99.5% of the 3,584 peptides obtained could be aligned to the draft assembly using GMAP <abbrgrp><abbr bid="B84">84</abbr></abbrgrp>. The above results indicate that the draft assembly has a high coverage of the coding region.</p>
         <p>Gene prediction for the draft assembly was performed using InchWorm <abbrgrp><abbr bid="B85">85</abbr></abbrgrp>: 3.5 Gb of RNA-Seq reads [GenBank: SRX061117] were assembled into 472,257 contigs and mapped to the draft genome using GMAP. The GMAP mapping results were used as a training set for <it>ab initio </it>prediction using AUGUSTUS <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>. As a result, 23,713 gene models were obtained with a total length of 28 Mb (Table <tblr tid="T4">4</tblr>). Average coding sequence length was 1.2 kb and average GC content was 45%. We obtained functional annotations of all genes using InterProScan <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>, which also determines motifs and domains. Gene Ontology (GO) annotations were given to 10,656 genes using corresponding InterPro entries and the Pfam database <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>. Visualization of the functional categories of these 10,656 genes was performed using WEGO <abbrgrp><abbr bid="B89">89</abbr></abbrgrp> (Figure <figr fid="F3">3</figr>).</p>
         <tbl id="T4"><title><p>Table 4</p></title><caption><p>Predicted genes in <it>S. indicum</it></p></caption><tblbdy cols="7">
      <r>
         <c ca="left">
            <p>
               <b>Gene number</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Average gene length (kb)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Average number of introns per gene</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>CDS GC (%)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Average length of introns (bp)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Average length of exons</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Average length of CDS</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>23,713</p>
         </c>
         <c ca="left">
            <p>2.9</p>
         </c>
         <c ca="left">
            <p>4.3</p>
         </c>
         <c ca="left">
            <p>45</p>
         </c>
         <c ca="left">
            <p>399.4</p>
         </c>
         <c ca="left">
            <p>227.4</p>
         </c>
         <c ca="left">
            <p>1.2</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>CDS, coding sequence; GC, guanine (G) + cytosine (C).</p>
   </tblfn></tbl>
         <fig id="F3"><title><p>Figure 3</p></title><caption><p>Functional catalogues of sesame genes in the preliminary assembly</p></caption><text>
   <p><b>Functional catalogues of sesame genes in the preliminary assembly</b>. Results are summarized in three main categories: biological processes, cellular components and molecular functions. A total of 10,656 genes have been assigned with Gene Ontology terms.</p>
</text><graphic file="gb-2013-14-1-401-3"/></fig>
      </sec>
      <sec>
         <st>
            <p>Biological questions to be addressed</p>
         </st>
         <p>We plan to address several key biological questions specific to sesame using this new genome and transcriptome data. We will compare the sesame genome with the genomes of monocotyledonous and other dicotyledonous plants to elucidate the phylogeny of the <it>Sesamum </it>genus and the origin of <it>S. indicum</it>. We will also perform more detailed investigations on the formation and regulation of fatty acids, storage proteins and secondary metabolites (including sesamin) in sesame. We will apply the bio-information obtained in this genome project in sesame breeding programs, paying particular attention to the induction and regulation of resistance to the main sesame diseases, including <it>Fusarium </it>wilt and charcoal rot diseases, and the environmental stress of waterlogging. Other possible uses of the genomics dataset, such as determining the regulatory mechanisms of biological characteristics in <it>Sesamum</it>, including simple stem or branch, leaf shape, indeterminate growth habit, flower number per axilla, capsule carpel number, flower color and other species-specific traits, will not form part of our analysis. We believe that the main achievement of this project will be to markedly accelerate sesame genetic research and breeding. Members of the SGWG also hope to address additional questions about the relationship between sesame growth and environmental conditions, such as identifying which genes regulate low temperature responses and drought sensitivity.</p>
      </sec>
      <sec>
         <st>
            <p>Joining the SGWG and using our early release data</p>
         </st>
         <p>This project is being conducted by the SGWG. We invite other research groups to access and use the draft assembly and raw read data, which have already been released. Any group performing non-genome-scale analyses, or investigating the above biological questions, is welcome to use our data without restriction. As a matter of courtesy and to avoid duplication of effort, we request that competing genome-scale projects or studies that overlap with the above stated research areas disclose their status to the SGWG consortium. Formal inquiries and requests to join the working group should be made to HZ. Updated versions of the genome assembly, further project descriptions and a complete list of current SGWG members dedicated to this project can be accessed on our website <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>AFLP: amplified fragment length polymorphism; BAC: bacterial artificial chromosome; EST: expressed sequence tags; FISH: fluorescent <it>in situ </it>hybridization; RSAMPL: random selective amplification of microsatellite polymorphic loci; SGWG: Sesame Genome Working Project; SNP: single nucleotide polymorphism; SSR: simple sequence repeats.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the earmarked fund for the China Agriculture Research System (CARS-15), China National '973' Project (2011CB109304), and Henan Zhongyuan Scholar Fund (092101211100) to HZ. HM was supported by a grant from the China National Key Technology R &amp; D program (2009BADA8B04-03) and the earmarked fund for China Agriculture Research System (CARS-15). HL, QW and MY were individually supported by the earmarked fund for China Agriculture Research System (CARS-15). Special thanks to Dr Joy Fleming for helpful discussions and suggestions in the manuscript revision process.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>Sesame breeding.</p></title><aug><au><snm>Ashri</snm><fnm>A</fnm></au></aug><source>Plant Breeding Reviews</source><publisher>Oxford: Oxford UK</publisher><editor>Janick J</editor><pubdate>1998</pubdate><fpage>79</fpage><lpage>228</lpage></bibl><bibl id="B2"><title><p>Nutritional, medicinal and industrial uses of sesame (<it>Sesamum indicum </it>L.) seeds an overview.</p></title><aug><au><snm>Anilakumar</snm><fnm>KR</fnm></au><au><snm>Pal</snm><fnm>A</fnm></au><au><snm>Khanum</snm><fnm>F</fnm></au><au><snm>Bawa</snm><fnm>AS</fnm></au></aug><source>Agric Conspec Sci</source><pubdate>2010</pubdate><volume>75</volume><fpage>159</fpage><lpage>168</lpage></bibl><bibl id="B3"><title><p>Evidence for cultivation of sesame in the ancient world.</p></title><aug><au><snm>Bedigian</snm><fnm>D</fnm></au><au><snm>Harlan</snm><fnm>JR</fnm></au></aug><source>Econ Bot</source><pubdate>1986</pubdate><volume>40</volume><fpage>137</fpage><lpage>154</lpage><xrefbib><pubid idtype="doi">10.1007/BF02859136</pubid></xrefbib></bibl><bibl id="B4"><title><p>The chemical and physiological properties of sesame oil.</p></title><aug><au><snm>Budowski</snm><fnm>P</fnm></au><au><snm>Markley</snm><fnm>KS</fnm></au></aug><source>Chem Rev</source><pubdate>1951</pubdate><volume>48</volume><fpage>125</fpage><lpage>151</lpage><xrefbib><pubid idtype="doi">10.1021/cr60149a005</pubid></xrefbib></bibl><bibl id="B5"><title><p>Sesame seed is a rich source of dietary lignans.</p></title><aug><au><snm>Moazzami</snm><fnm>AA</fnm></au><au><snm>Kamal-Eldin</snm><fnm>A</fnm></au></aug><source>J Am Oil Chem Soc</source><pubdate>2006</pubdate><volume>8</volume><fpage>719</fpage><lpage>723</lpage></bibl><bibl id="B6"><title><p>The chemistry and physiological functions of sesame.</p></title><aug><au><snm>Nakimi</snm><fnm>M</fnm></au></aug><source>Food Rev Int</source><pubdate>1995</pubdate><volume>11</volume><fpage>281</fpage><lpage>329</lpage><xrefbib><pubid idtype="doi">10.1080/87559129509541043</pubid></xrefbib></bibl><bibl id="B7"><title><p>Sesame seed lignans and &#947;-tocopherol act synergistically to produce vitamin E activity in rats.</p></title><aug><au><snm>Yamashita</snm><fnm>K</fnm></au><au><snm>Nohara</snm><fnm>Y</fnm></au><au><snm>Katayama</snm><fnm>K</fnm></au><au><snm>Namiki</snm><fnm>M</fnm></au></aug><source>J Nutr</source><pubdate>1992</pubdate><volume>122</volume><fpage>2440</fpage><lpage>2246</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">1453229</pubid></xrefbib></bibl><bibl id="B8"><title><p>Effect of sesame lignans on TNF-alpha-induced expression of adhesion molecules in endothelial cells.</p></title><aug><au><snm>Mochizuki</snm><fnm>M</fnm></au><au><snm>Tsuchie</snm><fnm>Y</fnm></au><au><snm>Yamada</snm><fnm>N</fnm></au><au><snm>Miyake</snm><fnm>Y</fnm></au><au><snm>Osawa</snm><fnm>T</fnm></au></aug><source>Biosci Biotechnol Biochem</source><pubdate>2010</pubdate><volume>74</volume><fpage>1539</fpage><lpage>1544</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1271/bbb.100095</pubid><pubid idtype="pmpid" link="fulltext">20699585</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Differential tissue distribution of sesaminol triglucoside and its metabolites in rats fed with lignan glycosides from sesame meal with or without nano/submicrosizing.</p></title><aug><au><snm>Liao</snm><fnm>CD</fnm></au><au><snm>Hung</snm><fnm>WL</fnm></au><au><snm>Lu</snm><fnm>WC</fnm></au><au><snm>Jan</snm><fnm>KC</fnm></au><au><snm>Shih</snm><fnm>DY</fnm></au><au><snm>Yeh</snm><fnm>AI</fnm></au><au><snm>Ho</snm><fnm>CT</fnm></au><au><snm>Hwang</snm><fnm>LS</fnm></au></aug><source>J Agric Food Chem</source><pubdate>2010</pubdate><volume>58</volume><fpage>563</fpage><lpage>569</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/jf9028046</pubid><pubid idtype="pmpid" link="fulltext">20014846</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Biotransformation of sesaminol triglucoside to mammalian lignans by intestinal microbiota.</p></title><aug><au><snm>Jan</snm><fnm>KC</fnm></au><au><snm>Hwang</snm><fnm>LS</fnm></au><au><snm>Ho</snm><fnm>CT</fnm></au></aug><source>J Agric Food Chem</source><pubdate>2009</pubdate><volume>57</volume><fpage>6101</fpage><lpage>6106</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/jf901215j</pubid><pubid idtype="pmpid" link="fulltext">19537732</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Tissue distribution and elimination of estrogenic and anti-inflammatory catechol metabolites from sesaminol triglucoside in rats.</p></title><aug><au><snm>Jan</snm><fnm>KC</fnm></au><au><snm>Ku</snm><fnm>KL</fnm></au><au><snm>Chu</snm><fnm>YH</fnm></au><au><snm>Hwang</snm><fnm>LS</fnm></au><au><snm>Ho</snm><fnm>CT</fnm></au></aug><source>J Agric Food Chem</source><pubdate>2010</pubdate><volume>58</volume><fpage>7693</fpage><lpage>7700</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/jf1009632</pubid><pubid idtype="pmpid" link="fulltext">20524667</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Intestinal distribution and excretion of sesaminol and its tetrahydrofuranoid metabolites in rats.</p></title><aug><au><snm>Jan</snm><fnm>KC</fnm></au><au><snm>Ku</snm><fnm>KL</fnm></au><au><snm>Chu</snm><fnm>YH</fnm></au><au><snm>Hwang</snm><fnm>LS</fnm></au><au><snm>Ho</snm><fnm>CT</fnm></au></aug><source>J Agric Food Chem</source><pubdate>2011</pubdate><volume>59</volume><fpage>3078</fpage><lpage>3086</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/jf105012v</pubid><pubid idtype="pmpid" link="fulltext">21384919</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Whole sesame seed is as rich a source of mammalian lignan precursors as whole flaxseed.</p></title><aug><au><snm>Coulman</snm><fnm>KD</fnm></au><au><snm>Liu</snm><fnm>Z</fnm></au><au><snm>Hum</snm><fnm>WQ</fnm></au><au><snm>Michaelides</snm><fnm>J</fnm></au><au><snm>Thompson</snm><fnm>LU</fnm></au></aug><source>Nutr Cancer</source><pubdate>2009</pubdate><volume>52</volume><fpage>156</fpage><lpage>165</lpage></bibl><bibl id="B14"><title><p>Sesamum.</p></title><aug><au><snm>Nimmakayala</snm><fnm>P</fnm></au><au><snm>Perumal</snm><fnm>R</fnm></au><au><snm>Mulpuri</snm><fnm>S</fnm></au><au><snm>Reddy</snm><fnm>UK</fnm></au></aug><source>Wild Crop Relatives: Genomic and Breeding Resources Oilseeds</source><publisher>Berlin Heidelberg: Springer-Verlag</publisher><editor>Kole C</editor><pubdate>2011</pubdate><fpage>261</fpage><lpage>273</lpage></bibl><bibl id="B15"><aug><au><snm>Joshi</snm><fnm>AB</fnm></au></aug><source>Sesamum</source><publisher>Hyderabad: Examiner Press</publisher><pubdate>1961</pubdate></bibl><bibl id="B16"><title><p>Notes on African Pedaliaceae.</p></title><aug><au><snm>Bruce</snm><fnm>EA</fnm></au></aug><source>Kew Bull</source><pubdate>1953</pubdate><volume>67</volume><fpage>417</fpage><lpage>429</lpage></bibl><bibl id="B17"><title><p>The wild and cultivated species in the genus Sesamum.</p></title><aug><au><snm>Kobayashi</snm><fnm>T</fnm></au></aug><source>Sesame: Status and Improvement. Proceedings of Expert Consultation: 8-12 December 1980; Rome</source><publisher>Rome</publisher><editor>Amram A</editor><pubdate>1981</pubdate><fpage>157</fpage><lpage>163</lpage></bibl><bibl id="B18"><aug><au><snm>Weiss</snm><fnm>EA</fnm></au></aug><source>Castor, Sesame and Safflower</source><publisher>New York: Barnes &amp; Noble Press</publisher><pubdate>1971</pubdate></bibl><bibl id="B19"><title><p>Analysis of sesame karyotype and resemblance-near coefficient.</p></title><aug><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Miao</snm><fnm>H</fnm></au><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Wei</snm><fnm>L</fnm></au><au><snm>Ma</snm><fnm>Q</fnm></au></aug><source>Chinese Plant Bullet</source><pubdate>2012</pubdate><volume>47</volume><fpage>602</fpage><lpage>614</lpage></bibl><bibl id="B20"><title><p>Sesamin, sesamolin and the origin of sesame.</p></title><aug><au><snm>Bedigian</snm><fnm>D</fnm></au><au><snm>Seigler</snm><fnm>DS</fnm></au><au><snm>Harlan</snm><fnm>JR</fnm></au></aug><source>Biochem Systemat Ecol</source><pubdate>1985</pubdate><volume>13</volume><fpage>133</fpage><lpage>139</lpage><xrefbib><pubid idtype="doi">10.1016/0305-1978(85)90071-7</pubid></xrefbib></bibl><bibl id="B21"><title><p>Evolution of sesame revisited: domestication, diversity and prospects.</p></title><aug><au><snm>Bedigian</snm><fnm>D</fnm></au></aug><source>Genet Resour Crop Evol</source><pubdate>2003</pubdate><volume>50</volume><fpage>779</fpage><lpage>787</lpage><xrefbib><pubid idtype="doi">10.1023/A:1025029903549</pubid></xrefbib></bibl><bibl id="B22"><title><p>Slimy leaves and oily seeds: distribution and use of wild relatives of sesame in Africa.</p></title><aug><au><snm>Bedigian</snm><fnm>D</fnm></au></aug><source>Economic Botany</source><pubdate>2004</pubdate><volume>58</volume><issue>Suppl</issue><fpage>3</fpage><lpage>33</lpage></bibl><bibl id="B23"><title><p>Characterization of sesame (<it>Sesamum indicum </it>L.) germplasm: a critique.</p></title><aug><au><snm>Bedigian</snm><fnm>D</fnm></au></aug><source>Genet Resour Crop Evol</source><pubdate>2010</pubdate><volume>57</volume><fpage>641</fpage><lpage>647</lpage><xrefbib><pubid idtype="doi">10.1007/s10722-010-9552-x</pubid></xrefbib></bibl><bibl id="B24"><title><p>Relationships between cultivated sesame (<it>Sesamum </it>sp.) and the wild relatives based on morphological characters, isozymes and RAPD markers.</p></title><aug><au><snm>Nanthakumar</snm><fnm>G</fnm></au><au><snm>Singh</snm><fnm>KN</fnm></au><au><snm>Vaidyanathan</snm><fnm>P</fnm></au></aug><source>J Genet Breeding</source><pubdate>2000</pubdate><volume>54</volume><fpage>5</fpage><lpage>12</lpage></bibl><bibl id="B25"><title><p>Further evidence on the prehistory of sesame.</p></title><aug><au><snm>Fuller</snm><fnm>DQ</fnm></au></aug><source>Asian Agri-History</source><pubdate>2003</pubdate><volume>7</volume><fpage>127</fpage><lpage>137</lpage></bibl><bibl id="B26"><title><p>Cytological analysis of interspecific hybrid between <it>Sesamum indicum </it>L &#215; S. <it>Orientale </it>L. Var. <it>malabaricum</it>.</p></title><aug><au><snm>Kumar</snm><fnm>AKMS</fnm></au><au><snm>Hiremath</snm><fnm>SC</fnm></au></aug><source>Karnataka J Agric Sci</source><pubdate>2008</pubdate><volume>21</volume><fpage>498</fpage><lpage>502</lpage></bibl><bibl id="B27"><title><p>Sesame research overview: Current status, perspective and priorities.</p></title><aug><au><snm>Ashri</snm><fnm>A</fnm></au></aug><source>Proceeding of the 1st Australian Sesame Workshop: 21-23 March 1995; Darwin-Katherine</source><publisher>Wood IM: Darwin</publisher><editor>Bennett MR</editor><pubdate>1995</pubdate><fpage>1</fpage><lpage>17</lpage></bibl><bibl id="B28"><aug><au><snm>Grubben</snm><fnm>GJH</fnm></au><au><snm>Denton</snm><fnm>OA</fnm></au></aug><source>Plant Resources of Tropical Africa: Vegetables</source><publisher>Leiden: Backhuys Publishers</publisher><pubdate>2004</pubdate></bibl><bibl id="B29"><aug><au><snm>Purseglove</snm><fnm>JW</fnm></au></aug><source>Tropical Crops: Dicotyledons</source><publisher>London: Longmans Press</publisher><pubdate>1968</pubdate></bibl><bibl id="B30"><title><p>Antihypertensive effect of sesamin inhibition of vascular superoxide production by sesamin.</p></title><aug><au><snm>Nakano</snm><fnm>D</fnm></au><au><snm>Itoh</snm><fnm>C</fnm></au><au><snm>Takaoka</snm><fnm>M</fnm></au><au><snm>Kiso</snm><fnm>Y</fnm></au><au><snm>Tanaka</snm><fnm>T</fnm></au><au><snm>Matsumura</snm><fnm>Y</fnm></au></aug><source>Biol Pharm Bull</source><pubdate>2002</pubdate><volume>25</volume><fpage>1247</fpage><lpage>1249</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1248/bpb.25.1247</pubid><pubid idtype="pmpid" link="fulltext">12230131</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Nutraceutical importance of sesame seed and oil A review of the contribution of their lignans.</p></title><aug><au><snm>Kanu</snm><fnm>PJ</fnm></au><au><snm>Bahsoon</snm><fnm>JZ</fnm></au><au><snm>Kanu</snm><fnm>JB</fnm></au><au><snm>Kandeh</snm><fnm>JB</fnm></au></aug><source>Sierra Leone J Biomed Res</source><pubdate>2010</pubdate><volume>2</volume><fpage>4</fpage><lpage>16</lpage></bibl><bibl id="B32"><title><p>The use of sesame oil and other vegetable oils in the inhibition of human colon cancer growth <it>in vitro</it>.</p></title><aug><au><snm>Salerno</snm><fnm>JW</fnm></au><au><snm>Smith</snm><fnm>DE</fnm></au></aug><source>Anticancer Res</source><pubdate>1991</pubdate><volume>11</volume><fpage>209</fpage><lpage>215</lpage><xrefbib><pubid idtype="pmpid">2018355</pubid></xrefbib></bibl><bibl id="B33"><title><p>Chemopreventive effect of resveratrol, sesamol, sesame oil and sunflower oil in the Epstein-Barr Virus early antigen activation assay and the mouse two state carcinogenesis.</p></title><aug><au><snm>Kapadia</snm><fnm>GJ</fnm></au><au><snm>Azuine</snm><fnm>MA</fnm></au><au><snm>Tokuda</snm><fnm>H</fnm></au><au><snm>Takasaki</snm><fnm>M</fnm></au><au><snm>Mukainaka</snm><fnm>T</fnm></au><au><snm>Konoshima</snm><fnm>T</fnm></au><au><snm>Nishino</snm><fnm>H</fnm></au></aug><source>Pharmacol Res</source><pubdate>2002</pubdate><volume>45</volume><fpage>499</fpage><lpage>505</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/phrs.2002.0992</pubid><pubid idtype="pmpid" link="fulltext">12162952</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Chemical aspects of the antioxidative activity of roasted sesame seed oil and the effect of using the oil for frying.</p></title><aug><au><snm>Fukuda</snm><fnm>Y</fnm></au><au><snm>Nagata</snm><fnm>M</fnm></au><au><snm>Osawa</snm><fnm>T</fnm></au><au><snm>Namiki</snm><fnm>M</fnm></au></aug><source>Agric Biol Chem</source><pubdate>1986</pubdate><volume>50</volume><fpage>857</fpage><lpage>862</lpage><xrefbib><pubid idtype="doi">10.1271/bbb1961.50.857</pubid></xrefbib></bibl><bibl id="B35"><title><p>Oil content and composition of the seed in the world collection of sesame introductions.</p></title><aug><au><snm>Yermanos</snm><fnm>DM</fnm></au><au><snm>Hemstreet</snm><fnm>S</fnm></au><au><snm>Saleeb</snm><fnm>W</fnm></au><au><snm>Huszar</snm><fnm>CK</fnm></au></aug><source>J Am Oil Chem Soc</source><pubdate>1972</pubdate><volume>49</volume><fpage>20</fpage><lpage>23</lpage><xrefbib><pubid idtype="doi">10.1007/BF02545131</pubid></xrefbib></bibl><bibl id="B36"><title><p>Application of microsatellite polymorphisms to study the diversity in seed oil content and fatty acid composition in Nigerian sesame (<it>Sesamum indicum </it>L.) accessions.</p></title><aug><au><snm>Nweke</snm><fnm>FN</fnm></au><au><snm>Ubi</snm><fnm>BE</fnm></au><au><snm>Kunert</snm><fnm>K</fnm></au></aug><source>Afr J Biotech</source><pubdate>2012</pubdate><volume>11</volume><fpage>8820</fpage><lpage>8830</lpage></bibl><bibl id="B37"><title><p>Sesame (<it>Sesamum indicum </it>L.).</p></title><aug><au><snm>Ashi</snm><fnm>A</fnm></au></aug><source>Genetic Resources, Chromosome Engineering, and Crop Improvement</source><publisher>Boca Raton: CRC Press</publisher><editor>Signh RJ</editor><pubdate>2006</pubdate><fpage>231</fpage><lpage>280</lpage></bibl><bibl id="B38"><title><p>Comparative analysis of expressed sequence tags from <it>Sesamum indicum </it>and <it>Arabidopsis thaliana </it>developing seeds.</p></title><aug><au><snm>Suh</snm><fnm>MC</fnm></au><au><snm>Kim</snm><fnm>MJ</fnm></au><au><snm>Hur</snm><fnm>C-G</fnm></au><au><snm>Bae</snm><fnm>JM</fnm></au><au><snm>Park</snm><fnm>YI</fnm></au><au><snm>Chung</snm><fnm>C-H</fnm></au><au><snm>Kang</snm><fnm>C-W</fnm></au><au><snm>Ohlrogge</snm><fnm>JB</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2003</pubdate><volume>52</volume><fpage>1107</fpage><lpage>1123</lpage><xrefbib><pubid idtype="pmpid">14682612</pubid></xrefbib></bibl><bibl id="B39"><title><p>Characterization of the sesame (<it>Sesamum indicum </it>L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers.</p></title><aug><au><snm>Wei</snm><fnm>WL</fnm></au><au><snm>Qi</snm><fnm>XQ</fnm></au><au><snm>Wang</snm><fnm>LH</fnm></au><au><snm>Zhang</snm><fnm>YX</fnm></au><au><snm>Hua</snm><fnm>W</fnm></au><au><snm>Li</snm><fnm>DH</fnm></au><au><snm>Lv</snm><fnm>HX</fnm></au><au><snm>Zhang</snm><fnm>XR</fnm></au></aug><source>BMC Genomics</source><pubdate>2011</pubdate><volume>12</volume><fpage>451</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-12-451</pubid><pubid idtype="pmcid">3184296</pubid><pubid idtype="pmpid" link="fulltext">21929789</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>De novo transcriptome sequencing and analysis of sesame growth and development.</p></title><aug><au><snm>Wei</snm><fnm>L</fnm></au><au><snm>Miao</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au></aug><source>Scientia Agricultura Sinica</source><pubdate>2012</pubdate><volume>45</volume><fpage>1246</fpage><lpage>1256</lpage></bibl><bibl id="B41"><title><p>Fungal and baterial diseases of sesame.</p></title><aug><au><snm>Verma</snm><fnm>ML</fnm></au><au><snm>Mehta</snm><fnm>N</fnm></au><au><snm>Snagwan</snm><fnm>MS</fnm></au></aug><source>Diseases of Oilseed Crops</source><publisher>New Delhi; Sangwan Indus</publisher><editor>Saharan GS, Mehta N</editor><pubdate>2005</pubdate><fpage>269</fpage><lpage>301</lpage></bibl><bibl id="B42"><title><p>Breeding for the improvement of the ideal plant type of sesame.</p></title><aug><au><snm>Baydar</snm><fnm>H</fnm></au></aug><source>Plant Breed</source><pubdate>2005</pubdate><volume>124</volume><fpage>263</fpage><lpage>267</lpage><xrefbib><pubid idtype="doi">10.1111/j.1439-0523.2005.01080.x</pubid></xrefbib></bibl><bibl id="B43"><title><p>Nature of gene action for yield, yield components and major diseases resistance in sesame (<it>Sesamum indicum </it>L.).</p></title><aug><au><snm>El-Bramawy</snm><fnm>MAS</fnm></au><au><snm>Shaban</snm><fnm>WI</fnm></au></aug><source>Res J Agric Biol Sci</source><pubdate>2007</pubdate><volume>3</volume><fpage>821</fpage><lpage>826</lpage></bibl><bibl id="B44"><title><p>Simulated flooding and drought effects on germination, growth, and yield parameters of sesame (<it>Sesamum indicum </it>L.).</p></title><aug><au><snm>Mensah</snm><fnm>JK</fnm></au><au><snm>Obadoni</snm><fnm>BO</fnm></au><au><snm>Eruotor</snm><fnm>PG</fnm></au><au><snm>Onome-Irieguna</snm><fnm>F</fnm></au></aug><source>Afr J Biotechnol</source><pubdate>2006</pubdate><volume>5</volume><fpage>1249</fpage><lpage>1253</lpage></bibl><bibl id="B45"><aug><au><snm>Kuol</snm><fnm>BG</fnm></au></aug><source>Breeding for Drought Tolerance in Sesame (Sesamum indicum L.) in Sudan</source><publisher>G&#246;ttingen: Cuvillier Press</publisher><pubdate>2004</pubdate></bibl><bibl id="B46"><title><p>Structure and expression of two seed-specfic cDNA clones encoding stearoyl-acyl carrier protein desaturase from sesame, <it>Sesamum indicum </it>L.</p></title><aug><au><snm>Yukawa</snm><fnm>Y</fnm></au><au><snm>Takaiwa</snm><fnm>F</fnm></au><au><snm>Shoji</snm><fnm>K</fnm></au><au><snm>Masuda</snm><fnm>K</fnm></au><au><snm>Yamada</snm><fnm>K</fnm></au></aug><source>Plant Cell Physiol</source><pubdate>1996</pubdate><volume>37</volume><fpage>201</fpage><lpage>205</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/oxfordjournals.pcp.a028932</pubid><pubid idtype="pmpid" link="fulltext">8665096</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Cloning, expression and isoform classification of a minor oleosin in sesame oil bodies.</p></title><aug><au><snm>Chen</snm><fnm>JCF</fnm></au><au><snm>Lin</snm><fnm>RH</fnm></au><au><snm>Huang</snm><fnm>HC</fnm></au><au><snm>Tzen</snm><fnm>JTC</fnm></au></aug><source>J Biochem</source><pubdate>1997</pubdate><volume>122</volume><fpage>819</fpage><lpage>824</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/oxfordjournals.jbchem.a021828</pubid><pubid idtype="pmpid" link="fulltext">9399587</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Characterization and temporal expression of a &#969;-6 fatty acid desaturase cDNA from sesame (Sesamum indicum L.) seeds.</p></title><aug><au><snm>Jin</snm><fnm>UH</fnm></au><au><snm>Lee</snm><fnm>JW</fnm></au><au><snm>Chung</snm><fnm>YS</fnm></au><au><snm>Lee</snm><fnm>JH</fnm></au><au><snm>Yi</snm><fnm>YB</fnm></au><au><snm>Kim</snm><fnm>YK</fnm></au><au><snm>Hyung</snm><fnm>NI</fnm></au><au><snm>Pyee</snm><fnm>JH</fnm></au><au><snm>Chung</snm><fnm>CH</fnm></au></aug><source>Plant Sci</source><pubdate>2001</pubdate><volume>161</volume><fpage>935</fpage><lpage>941</lpage><xrefbib><pubid idtype="doi">10.1016/S0168-9452(01)00489-7</pubid></xrefbib></bibl><bibl id="B49"><title><p>Specific expression of a sesame storage protein in transgenic rice bran.</p></title><aug><au><snm>Lee</snm><fnm>TTT</fnm></au><au><snm>Chung</snm><fnm>MC</fnm></au><au><snm>Kao</snm><fnm>YW</fnm></au><au><snm>Wang</snm><fnm>CS</fnm></au><au><snm>Chen</snm><fnm>LJ</fnm></au><au><snm>Tzen</snm><fnm>JTC</fnm></au></aug><source>J Cereal Sci</source><pubdate>2005</pubdate><volume>41</volume><fpage>23</fpage><lpage>29</lpage><xrefbib><pubid idtype="doi">10.1016/j.jcs.2004.08.014</pubid></xrefbib></bibl><bibl id="B50"><title><p>Cloning and expression of a seed-specific mtallothione-like protein from sesame.</p></title><aug><au><snm>Chyan</snm><fnm>CL</fnm></au><au><snm>Lee</snm><fnm>TTT</fnm></au><au><snm>Liu</snm><fnm>CP</fnm></au><au><snm>Yang</snm><fnm>YC</fnm></au><au><snm>Tzen</snm><fnm>JTC</fnm></au><au><snm>Chou</snm><fnm>WM</fnm></au></aug><source>Biosci Biotech Biochem</source><pubdate>2005</pubdate><volume>69</volume><fpage>2319</fpage><lpage>2325</lpage><xrefbib><pubid idtype="doi">10.1271/bbb.69.2319</pubid></xrefbib></bibl><bibl id="B51"><title><p>Gene families encoding isoforms of two major sesame seed storage proteins, 11S globulin and 2S albumin.</p></title><aug><au><snm>Hsiao</snm><fnm>ESL</fnm></au><au><snm>Lin</snm><fnm>LJ</fnm></au><au><snm>Li</snm><fnm>FY</fnm></au><au><snm>Wang</snm><fnm>MMC</fnm></au><au><snm>Liao</snm><fnm>MY</fnm></au><au><snm>Tzen</snm><fnm>JTC</fnm></au></aug><source>J Agirc Food Chem</source><pubdate>2006</pubdate><volume>54</volume><fpage>9544</fpage><lpage>9550</lpage><xrefbib><pubid idtype="doi">10.1021/jf061505x</pubid></xrefbib></bibl><bibl id="B52"><title><p>The SebHLH transcription factor mediates trans-activation of the SeFAD2 gene promoter through binding to E-and G-box elements.</p></title><aug><au><snm>Kim</snm><fnm>MJ</fnm></au><au><snm>Kim</snm><fnm>JK</fnm></au><au><snm>Shin</snm><fnm>JS</fnm></au><au><snm>Suh</snm><fnm>MC</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2007</pubdate><volume>64</volume><fpage>453</fpage><lpage>466</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11103-007-9165-8</pubid><pubid idtype="pmpid" link="fulltext">17420955</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Seed-expressed casein kinase I acts as a positive regulator of the SeFAD2 promoter via phosphorylation of the SebHLH transcription factor.</p></title><aug><au><snm>Kim</snm><fnm>MJ</fnm></au><au><snm>Go</snm><fnm>YS</fnm></au><au><snm>Lee</snm><fnm>SB</fnm></au><au><snm>Kim</snm><fnm>YS</fnm></au><au><snm>Shin</snm><fnm>JS</fnm></au><au><snm>Min</snm><fnm>MK</fnm></au><au><snm>Hwang</snm><fnm>I</fnm></au><au><snm>Suh</snm><fnm>MC</fnm></au></aug><source>Plant Mol Biol</source><pubdate>2010</pubdate><volume>73</volume><fpage>425</fpage><lpage>437</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11103-010-9630-7</pubid><pubid idtype="pmpid" link="fulltext">20349267</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Comparison of sesamin contents and <it>CYP81Q1 </it>gene expressions in aboveground vegetative organs between two Japanese sesame (<it>Sesamum indicum </it>L.) varieties differing in seed sesamin contents.</p></title><aug><au><snm>Hata</snm><fnm>N</fnm></au><au><snm>Hayashi</snm><fnm>Y</fnm></au><au><snm>Okazawa</snm><fnm>A</fnm></au><au><snm>Ono</snm><fnm>E</fnm></au><au><snm>Satake</snm><fnm>H</fnm></au><au><snm>Kobayashi</snm><fnm>A</fnm></au></aug><source>Plant Sci</source><pubdate>2010</pubdate><volume>178</volume><fpage>510</fpage><lpage>516</lpage><xrefbib><pubid idtype="doi">10.1016/j.plantsci.2010.02.020</pubid></xrefbib></bibl><bibl id="B55"><title><p>Identification of a molecular marker linked to the closed capsule mutant trait in sesame using AFLP.</p></title><aug><au><snm>Uzun</snm><fnm>B</fnm></au><au><snm>Lee</snm><fnm>D</fnm></au><au><snm>Donini</snm><fnm>P</fnm></au><au><snm>Cagirgan</snm><fnm>MI</fnm></au></aug><source>Plant Breeding</source><pubdate>2003</pubdate><volume>122</volume><fpage>95</fpage><lpage>97</lpage><xrefbib><pubid idtype="doi">10.1046/j.1439-0523.2003.00787.x</pubid></xrefbib></bibl><bibl id="B56"><title><p>Breeding of a new sesame variety Yuzhi 11.</p></title><aug><au><snm>Wei</snm><fnm>W</fnm></au><au><snm>Wei</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Ding</snm><fnm>F</fnm></au><au><snm>Zhang</snm><fnm>T</fnm></au><au><snm>Lu</snm><fnm>F</fnm></au></aug><source>J Henan Agri Sci</source><pubdate>1999</pubdate><volume>7</volume><fpage>3</fpage><lpage>4</lpage></bibl><bibl id="B57"><title><p>Analysis of Integrated characteristics of Yuzhi 11.</p></title><aug><au><snm>Zhang</snm><fnm>T</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Wei</snm><fnm>S</fnm></au><au><snm>Zheng</snm><fnm>Y</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au></aug><source>Chinese Agri Sci Bullet</source><pubdate>2003</pubdate><volume>19</volume><fpage>44</fpage><lpage>46</lpage></bibl><bibl id="B58"><title><p>An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II.</p></title><aug><au><cnm>Angiosperm Phylogeny Group</cnm></au></aug><source>Bot J Linn Soc</source><pubdate>2003</pubdate><volume>141</volume><fpage>399</fpage><lpage>436</lpage></bibl><bibl id="B59"><title><p>Complete chloroplast genome sequences of important oilseed crop <it>Sesamum indicum </it>L.</p></title><aug><au><snm>Yi</snm><fnm>D-K</fnm></au><au><snm>Kim</snm><fnm>K-J</fnm></au></aug><source>PloS One</source><pubdate>2011</pubdate><volume>7</volume><fpage>e35872</fpage></bibl><bibl id="B60"><title><p>Database resources of the National Center for Biotechnology Information.</p></title><aug><au><snm>Sayers</snm><fnm>EW</fnm></au><au><snm>Barrett</snm><fnm>T</fnm></au><au><snm>Benson</snm><fnm>DA</fnm></au><au><snm>Bryant</snm><fnm>SH</fnm></au><au><snm>Canese</snm><fnm>K</fnm></au><au><snm>Chetvernin</snm><fnm>V</fnm></au><au><snm>Church</snm><fnm>DM</fnm></au><au><snm>DiCuccio</snm><fnm>M</fnm></au><au><snm>Edgar</snm><fnm>R</fnm></au><au><snm>Federhen</snm><fnm>S</fnm></au><au><snm>Feolo</snm><fnm>M</fnm></au><au><snm>Geer</snm><fnm>LY</fnm></au><au><snm>Helmberg</snm><fnm>W</fnm></au><au><snm>Kapustin</snm><fnm>Y</fnm></au><au><snm>Landsman</snm><fnm>D</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au><au><snm>Maglott</snm><fnm>DR</fnm></au><au><snm>Miller</snm><fnm>V</fnm></au><au><snm>Mizrachi</snm><fnm>I</fnm></au><au><snm>Ostell</snm><fnm>J</fnm></au><au><snm>Pruitt</snm><fnm>KD</fnm></au><au><snm>Schuler</snm><fnm>GD</fnm></au><au><snm>Sequeira</snm><fnm>E</fnm></au><au><snm>Sherry</snm><fnm>ST</fnm></au><au><snm>Shumway</snm><fnm>M</fnm></au><au><snm>Sirotkin</snm><fnm>K</fnm></au><au><snm>Souvorov</snm><fnm>A</fnm></au><au><snm>Starchenko</snm><fnm>G</fnm></au><au><snm>Tatusova</snm><fnm>TA</fnm></au><etal/></aug><source>Nucleic Acid Res</source><pubdate>2009</pubdate><volume>37</volume><issue>Database issue</issue><fpage>5</fpage><lpage>15</lpage></bibl><bibl id="B61"><title><p>Benefits and best practices of rapid re-publication data release.</p></title><aug><au><cnm>Toronto 2009 Data Release Workshop Authors</cnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>461</volume><fpage>168</fpage><lpage>170</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3073843</pubid><pubid idtype="pmpid" link="fulltext">19741685</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>The Sesame Genome Working Group.</p></title><url>http://www.sesamum.org</url></bibl><bibl id="B63"><title><p>Test system for the qualitative detection of sesame DNA in food products by PCR Real Time.</p></title><url>http://www.sacace.com/</url><note>I-007.9a E Rev.007. Code: A-02-1081</note></bibl><bibl id="B64"><title><p>Genetic characterization of two traditional leafy vegetables (<it>Sesamum radiatum </it>Thonn. ex Hornem and <it>Ceratotheca sesamoides </it>Endl.) of Benin, using flow cytometry and amplified fragment length polymorphism (AFLP) markers.</p></title><aug><au><snm>Ad&#233;oti</snm><fnm>K</fnm></au><au><snm>Rival</snm><fnm>A</fnm></au><au><snm>Dansi</snm><fnm>A</fnm></au><au><snm>Santoni</snm><fnm>S</fnm></au><au><snm>Brown</snm><fnm>S</fnm></au><au><snm>Beule</snm><fnm>T</fnm></au><au><snm>Nato</snm><fnm>A</fnm></au><au><snm>Henry</snm><fnm>Y</fnm></au><au><snm>Vodouhe</snm><fnm>R</fnm></au><au><snm>Loko</snm><fnm>LY</fnm></au><au><snm>Sanni</snm><fnm>A</fnm></au></aug><source>Afr J Biotechnol</source><pubdate>2011</pubdate><volume>10</volume><fpage>14264</fpage><lpage>14275</lpage></bibl><bibl id="B65"><title><p>Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.</p></title><aug><au><cnm>The Arabidopsis Genome Initiative</cnm></au></aug><source>Nature</source><pubdate>2000</pubdate><volume>408</volume><fpage>796</fpage><lpage>815</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35048692</pubid><pubid idtype="pmpid" link="fulltext">11130711</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Genome sequence of the palaeopolyploid soybean.</p></title><aug><au><snm>Schmutz</snm><fnm>J</fnm></au><au><snm>Cannon</snm><fnm>SB</fnm></au><au><snm>Schlueter</snm><fnm>J</fnm></au><au><snm>Ma</snm><fnm>J</fnm></au><au><snm>Mitros</snm><fnm>T</fnm></au><au><snm>Nelson</snm><fnm>W</fnm></au><au><snm>Hyten</snm><fnm>DL</fnm></au><au><snm>Song</snm><fnm>Q</fnm></au><au><snm>Thelen</snm><fnm>JJ</fnm></au><au><snm>Cheng</snm><fnm>J</fnm></au><au><snm>Xu</snm><fnm>D</fnm></au><au><snm>Hellsten</snm><fnm>U</fnm></au><au><snm>May</snm><fnm>GD</fnm></au><au><snm>Yu</snm><fnm>Y</fnm></au><au><snm>Sakurai</snm><fnm>T</fnm></au><au><snm>Umezawa</snm><fnm>T</fnm></au><au><snm>Bhattacharyya</snm><fnm>MK</fnm></au><au><snm>Sandhu</snm><fnm>D</fnm></au><au><snm>Valliyodan</snm><fnm>B</fnm></au><au><snm>Lindquist</snm><fnm>E</fnm></au><au><snm>Peto</snm><fnm>M</fnm></au><au><snm>Grant</snm><fnm>D</fnm></au><au><snm>Shu</snm><fnm>S</fnm></au><au><snm>Goodstein</snm><fnm>D</fnm></au><au><snm>Barry</snm><fnm>K</fnm></au><au><snm>Futrell-Griggs</snm><fnm>M</fnm></au><au><snm>Abernathy</snm><fnm>B</fnm></au><au><snm>Du</snm><fnm>J</fnm></au><au><snm>Tian</snm><fnm>Z</fnm></au><au><snm>Zhu</snm><fnm>L</fnm></au><au><snm>Gill</snm><fnm>N</fnm></au><etal/></aug><source>Nature</source><pubdate>2010</pubdate><volume>463</volume><fpage>178</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08670</pubid><pubid idtype="pmpid" link="fulltext">20075913</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>A draft sequence of the rice genome (<it>Oryza sativa </it>L. ssp. <it>japonica</it>).</p></title><aug><au><snm>Goff</snm><fnm>SA</fnm></au><au><snm>Ricke</snm><fnm>D</fnm></au><au><snm>Lan</snm><fnm>T-H</fnm></au><au><snm>Presting</snm><fnm>G</fnm></au><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Dunn</snm><fnm>M</fnm></au><au><snm>Glazebrook</snm><fnm>J</fnm></au><au><snm>Sessions</snm><fnm>A</fnm></au><au><snm>Oeller</snm><fnm>P</fnm></au><au><snm>Varma</snm><fnm>H</fnm></au><au><snm>Hadley</snm><fnm>D</fnm></au><au><snm>Hutchison</snm><fnm>D</fnm></au><au><snm>Martin</snm><fnm>C</fnm></au><au><snm>Katagiri</snm><fnm>F</fnm></au><au><snm>Lange</snm><fnm>BM</fnm></au><au><snm>Moughamer</snm><fnm>T</fnm></au><au><snm>Xia</snm><fnm>Y</fnm></au><au><snm>Budworth</snm><fnm>P</fnm></au><au><snm>Zhong</snm><fnm>J</fnm></au><au><snm>Miguel</snm><fnm>T</fnm></au><au><snm>Paszkowski</snm><fnm>U</fnm></au><au><snm>Zhang</snm><fnm>S</fnm></au><au><snm>Colbert</snm><fnm>M</fnm></au><au><snm>Sun</snm><fnm>W-L</fnm></au><au><snm>Chen</snm><fnm>L</fnm></au><au><snm>Cooper</snm><fnm>B</fnm></au><au><snm>Park</snm><fnm>S</fnm></au><au><snm>Wood</snm><fnm>TC</fnm></au><au><snm>Mao</snm><fnm>L</fnm></au><au><snm>Quail</snm><fnm>P</fnm></au><etal/></aug><source>Science</source><pubdate>2002</pubdate><volume>296</volume><fpage>92</fpage><lpage>100</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1068275</pubid><pubid idtype="pmpid" link="fulltext">11935018</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Development and validation of genic-SSR markers in sesame by RNA-seq.</p></title><aug><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Wei</snm><fnm>L</fnm></au><au><snm>Miao</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>T</fnm></au><au><snm>Wang</snm><fnm>C</fnm></au></aug><source>BMC Genomics</source><pubdate>2012</pubdate><volume>13</volume><fpage>316</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-13-316</pubid><pubid idtype="pmcid">3428654</pubid><pubid idtype="pmpid" link="fulltext">22800194</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Analysis of expression sequence tags from a full-length-enriched cDNA library of developing sesame seeds (<it>Sesamum indicum</it>).</p></title><aug><au><snm>Ke</snm><fnm>T</fnm></au><au><snm>Dong</snm><fnm>C</fnm></au><au><snm>Mao</snm><fnm>H</fnm></au><au><snm>Zhao</snm><fnm>Y</fnm></au><au><snm>Chen</snm><fnm>H</fnm></au><au><snm>Liu</snm><fnm>H</fnm></au><au><snm>Dong</snm><fnm>X</fnm></au><au><snm>Tong</snm><fnm>C</fnm></au><au><snm>Liu</snm><fnm>S</fnm></au></aug><source>BMC Plant Biol</source><pubdate>2011</pubdate><volume>11</volume><fpage>180</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2229-11-180</pubid><pubid idtype="pmcid">3311628</pubid><pubid idtype="pmpid" link="fulltext">22195973</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>A genetic linkage map construction for sesame (Sesamum indicum L.).</p></title><aug><au><snm>Wei</snm><fnm>L-B</fnm></au><au><snm>Zhang</snm><fnm>H-Y</fnm></au><au><snm>Zheng</snm><fnm>Y-Z</fnm></au><au><snm>Miao</snm><fnm>H-M</fnm></au><au><snm>Zhang</snm><fnm>T-Z</fnm></au><au><snm>Guo</snm><fnm>W-Z</fnm></au></aug><source>Genes Genomics</source><pubdate>2009</pubdate><volume>31</volume><fpage>199</fpage><lpage>208</lpage><xrefbib><pubid idtype="doi">10.1007/BF03191152</pubid></xrefbib></bibl><bibl id="B71"><title><p>ABySS: A parallel assembler for short read sequence data.</p></title><aug><au><snm>Simpson</snm><fnm>JT</fnm></au><au><snm>Wong</snm><fnm>K</fnm></au><au><snm>Jackman</snm><fnm>SD</fnm></au><au><snm>Schein</snm><fnm>JE</fnm></au><au><snm>Jones</snm><fnm>SJ</fnm></au><au><snm>Birol</snm><fnm>I</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1117</fpage><lpage>1123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.089532.108</pubid><pubid idtype="pmcid">2694472</pubid><pubid idtype="pmpid" link="fulltext">19251739</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Aggressive assembly of pyrosequencing reads with mates.</p></title><aug><au><snm>Miller</snm><fnm>JR</fnm></au><au><snm>Delcher</snm><fnm>AL</fnm></au><au><snm>Koren</snm><fnm>S</fnm></au><au><snm>Venter</snm><fnm>E</fnm></au><au><snm>Walenz</snm><fnm>BP</fnm></au><au><snm>Brownley</snm><fnm>A</fnm></au><au><snm>Johnson</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>K</fnm></au><au><snm>Mobarry</snm><fnm>C</fnm></au><au><snm>Sutton</snm><fnm>G</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><fpage>2818</fpage><lpage>2824</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn548</pubid><pubid idtype="pmcid">2639302</pubid><pubid idtype="pmpid" link="fulltext">18952627</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Genome sequence and analysis of the tuber crop potato.</p></title><aug><au><snm>Xu</snm><fnm>X</fnm></au><au><snm>Pan</snm><fnm>S</fnm></au><au><snm>Cheng</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Mu</snm><fnm>D</fnm></au><au><snm>Ni</snm><fnm>P</fnm></au><au><snm>Zhang</snm><fnm>G</fnm></au><au><snm>Yang</snm><fnm>S</fnm></au><au><snm>Li</snm><fnm>R</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Orjeda</snm><fnm>G</fnm></au><au><snm>Guzman</snm><fnm>F</fnm></au><au><snm>Torres</snm><fnm>M</fnm></au><au><snm>Lozano</snm><fnm>R</fnm></au><au><snm>Ponce</snm><fnm>O</fnm></au><au><snm>Martinez</snm><fnm>D</fnm></au><au><snm>De la Cruz</snm><fnm>G</fnm></au><au><snm>Chakrabarti</snm><fnm>SK</fnm></au><au><snm>Patil</snm><fnm>VU</fnm></au><au><snm>Skryabin</snm><fnm>KG</fnm></au><au><snm>Kuznetsov</snm><fnm>BB</fnm></au><au><snm>Ravin</snm><fnm>NV</fnm></au><au><snm>Kolganova</snm><fnm>TV</fnm></au><au><snm>Beletsky</snm><fnm>AV</fnm></au><au><snm>Mardanov</snm><fnm>AV</fnm></au><au><snm>Di Genova</snm><fnm>A</fnm></au><au><snm>Bolser</snm><fnm>DM</fnm></au><au><snm>Martin</snm><fnm>DM</fnm></au><au><snm>Li</snm><fnm>G</fnm></au><au><snm>Yang</snm><fnm>Y</fnm></au><etal/></aug><source>Nature</source><pubdate>2011</pubdate><volume>475</volume><fpage>189</fpage><lpage>195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature10158</pubid><pubid idtype="pmpid" link="fulltext">21743474</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.</p></title><aug><au><snm>Mar&#231;ais</snm><fnm>G</fnm></au><au><snm>Kingsford</snm><fnm>CE</fnm></au></aug><source>Bioinformatics</source><pubdate>2011</pubdate><volume>27</volume><fpage>764</fpage><lpage>70</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btr011</pubid><pubid idtype="pmcid">3051319</pubid><pubid idtype="pmpid" link="fulltext">21217122</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>RepeatMasker Open-3.0.</p></title><aug><au><snm>Smit</snm><fnm>AFA</fnm></au><au><snm>Green</snm><fnm>P</fnm></au></aug><url>http://www.repeatmasker.org</url></bibl><bibl id="B76"><title><p>RepeatModeler, version 1.0.5.</p></title><url>http://www.repeatmasker.org/RepeatModeler.html</url></bibl><bibl id="B77"><title><p>SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data.</p></title><aug><au><snm>Cox</snm><fnm>MP</fnm></au><au><snm>Peterson</snm><fnm>DA</fnm></au><au><snm>Biggs</snm><fnm>P</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2010</pubdate><volume>11</volume><fpage>485</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-11-485</pubid><pubid idtype="pmcid">2956736</pubid><pubid idtype="pmpid" link="fulltext">20875133</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>DNA sequence quality trimming and vector removal.</p></title><aug><au><snm>Chou</snm><fnm>HH</fnm></au><au><snm>Holmes</snm><fnm>MH</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>1093</fpage><lpage>1104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.12.1093</pubid><pubid idtype="pmpid" link="fulltext">11751217</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes.</p></title><aug><au><snm>Parral</snm><fnm>G</fnm></au><au><snm>Bradnaml</snm><fnm>K</fnm></au><au><snm>Korfl</snm><fnm>I</fnm></au></aug><source>Bioinformatics</source><pubdate>2007</pubdate><volume>23</volume><fpage>1061</fpage><lpage>1067</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btm071</pubid><pubid idtype="pmpid" link="fulltext">17332020</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>Velvet: Algorithms for de novo short read assembly using de Bruijn graphs.</p></title><aug><au><snm>Zerbino</snm><fnm>DR</fnm></au><au><snm>Birney</snm><fnm>E</fnm></au></aug><source>Genome Res</source><pubdate>2008</pubdate><volume>18</volume><fpage>821</fpage><lpage>829</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.074492.107</pubid><pubid idtype="pmcid">2336801</pubid><pubid idtype="pmpid" link="fulltext">18349386</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Oases: Robust de novo RNA-seq assembly across the dynamic range of expression levels.</p></title><aug><au><snm>Schulz1</snm><fnm>MH</fnm></au><au><snm>Zerbino</snm><fnm>DR</fnm></au><au><snm>Vingron</snm><fnm>M</fnm></au><au><snm>Birney</snm><fnm>E</fnm></au></aug><source>Bioinformatics</source><pubdate>2012</pubdate><volume>28</volume><fpage>1086</fpage><lpage>1092</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bts094</pubid><pubid idtype="pmcid">3324515</pubid><pubid idtype="pmpid" link="fulltext">22368243</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences.</p></title><aug><au><snm>Iseli</snm><fnm>C</fnm></au><au><snm>Jongeneel</snm><fnm>CV</fnm></au><au><snm>Bucher</snm><fnm>P</fnm></au></aug><source>Proc Int Conf Intell Syst Mol Biol</source><pubdate>1999</pubdate><volume>7</volume><fpage>138</fpage><lpage>148</lpage></bibl><bibl id="B83"><title><p>The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.</p></title><aug><au><snm>Boeckmann</snm><fnm>B</fnm></au><au><snm>Bairoch</snm><fnm>A</fnm></au><au><snm>Apweiler</snm><fnm>R</fnm></au><au><snm>Blatter</snm><fnm>M-C</fnm></au><au><snm>Estreicher</snm><fnm>A</fnm></au><au><snm>Gasteiger</snm><fnm>E</fnm></au><au><snm>Martin</snm><fnm>MJ</fnm></au><au><snm>Michoud</snm><fnm>K</fnm></au><au><snm>O'Donovan</snm><fnm>C</fnm></au><au><snm>Phan</snm><fnm>I</fnm></au><au><snm>Pilbout</snm><fnm>S</fnm></au><au><snm>Schneider</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>365</fpage><lpage>370</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg095</pubid><pubid idtype="pmcid">165542</pubid><pubid idtype="pmpid" link="fulltext">12520024</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>GMAP: a genomic mapping and alignment program for mRNA and EST sequences.</p></title><aug><au><snm>Wu</snm><fnm>TD</fnm></au><au><snm>Watanabe</snm><fnm>CK</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>1859</fpage><lpage>1875</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti310</pubid><pubid idtype="pmpid" link="fulltext">15728110</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>Full-length transcriptome assembly from RNA-Seq data without a reference genome.</p></title><aug><au><snm>Grabherr</snm><fnm>MG</fnm></au><au><snm>Haas</snm><fnm>BJ</fnm></au><au><snm>Yassour</snm><fnm>M</fnm></au><au><snm>Levin</snm><fnm>JZ</fnm></au><au><snm>Thompson</snm><fnm>DA</fnm></au><au><snm>Amit</snm><fnm>I</fnm></au><au><snm>Adiconis</snm><fnm>X</fnm></au><au><snm>Fan</snm><fnm>L</fnm></au><au><snm>Raychowdhury</snm><fnm>R</fnm></au><au><snm>Zeng</snm><fnm>Q</fnm></au><au><snm>Chen</snm><fnm>Z</fnm></au><au><snm>Mauceli</snm><fnm>E</fnm></au><au><snm>Hacohen</snm><fnm>N</fnm></au><au><snm>Gnirke</snm><fnm>A</fnm></au><au><snm>Rhind</snm><fnm>N</fnm></au><au><snm>Palma</snm><fnm>F</fnm></au><au><snm>Birren</snm><fnm>BW</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Friedman</snm><fnm>N</fnm></au><au><snm>Regev</snm><fnm>A</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2011</pubdate><volume>29</volume><fpage>644</fpage><lpage>652</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1883</pubid><pubid idtype="pmpid" link="fulltext">21572440</pubid></pubidlist></xrefbib></bibl><bibl id="B86"><title><p>AUGUSTUS: ab initio prediction of alternative transcripts.</p></title><aug><au><snm>Stanke</snm><fnm>M</fnm></au><au><snm>Keller</snm><fnm>O</fnm></au><au><snm>Gunduz</snm><fnm>I</fnm></au><au><snm>Hayes</snm><fnm>A</fnm></au><au><snm>Waack</snm><fnm>S</fnm></au><au><snm>Morgenstern</snm><fnm>B</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><fpage>W435</fpage><lpage>239</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl200</pubid><pubid idtype="pmcid">1538822</pubid><pubid idtype="pmpid" link="fulltext">16845043</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>InterProScan an integration platform for the signature- recognition methods in InterPro.</p></title><aug><au><snm>Zdobnov</snm><fnm>EM</fnm></au><au><snm>Apweiler</snm><fnm>R</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>847</fpage><lpage>848</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.9.847</pubid><pubid idtype="pmpid" link="fulltext">11590104</pubid></pubidlist></xrefbib></bibl><bibl id="B88"><title><p>The Pfam protein families database.</p></title><aug><au><snm>Bateman</snm><fnm>A</fnm></au><au><snm>Coin</snm><fnm>L</fnm></au><au><snm>Durbin</snm><fnm>R</fnm></au><au><snm>Finn</snm><fnm>RD</fnm></au><au><snm>Hollich</snm><fnm>V</fnm></au><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au><au><snm>Khanna</snm><fnm>A</fnm></au><au><snm>Marshall</snm><fnm>M</fnm></au><au><snm>Moxon</snm><fnm>S</fnm></au><au><snm>Sonnhammer</snm><fnm>ELL</fnm></au><au><snm>Studholme</snm><fnm>DJ</fnm></au><au><snm>Yeats</snm><fnm>C</fnm></au><au><snm>Eddy</snm><fnm>SR</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><issue>Suppl 1</issue><fpage>D138</fpage><lpage>141</lpage><xrefbib><pubidlist><pubid idtype="pmcid">308855</pubid><pubid idtype="pmpid" link="fulltext">14681378</pubid></pubidlist></xrefbib></bibl><bibl id="B89"><title><p>WEGO: a web tool for plotting GO annotations.</p></title><aug><au><snm>Ye</snm><fnm>J</fnm></au><au><snm>Fang</snm><fnm>L</fnm></au><au><snm>Zheng</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Chen</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>S</fnm></au><au><snm>Li</snm><fnm>R</fnm></au><au><snm>Bolund</snm><fnm>L</fnm></au><au><snm>Wan</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><issue>Suppl 2</issue><fpage>W293</fpage><lpage>297</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1538768</pubid><pubid idtype="pmpid" link="fulltext">16845012</pubid></pubidlist></xrefbib></bibl></refgrp>
   </bm>
</art>