Table 1

Summary of datasets for eight sequenced plant genomes included in this study

Species

Annotation version

Number of annotated genes


Arabidopsis thaliana (thale cress)

TAIR version 9

27,379

Carica papaya (papaya)

ASGPB release

25,536

Cucumis sativus (cucumber)

BGI release

21,635

Populus trichocarpa (black cottonwood)

JGI version 2.0

41,377

Glycine max (soybean)

Phytozome version 1.0

55,787

Vitis vinifera (grape vine)

Genoscope release

30,434

Oryza sativa (rice)

RGAP release 6.1

56,979

Sorghum bicolor

JGI version 1.4

34,496


These eight genome sequences were used to construct orthogroups, which were then populated with additional unigenes of asterids, basal eudicots, non-grass monocots, and basal angiosperms. The number of annotated genes in each genome is indicated. ASGPB, Advanced Studies of Genomics, Proteomics and Bioinformatics; JGI, Joint Genome Institute; RGAP, Rice Genome Annotation Project; TAIR, The Arabidopsis Information Resource.

Jiao et al. Genome Biology 2012 13:R3   doi:10.1186/gb-2012-13-1-r3

Open Data