Table 2

Genome-wide analysis of C. militaris gene sets.

Characteristics

C. militaris

CMM corea

CMA restrictedb

CMC restrictedc

CCM specific


Number of genes

9,684

7,981

217

158

1,328

Mean gene length (bp)

1,742

1,885

1,445

1,440

967

Mean number of introns per gene

1.99

2.05

1.77

1.69

1.71

Percentage genes without introns

21.3

20.1

22.6

28.5

27.5

Percentage GC content (excluding introns)

58.6

58.6

70.7

58.7

58.3

Number of InterproScan protein families

2,644

2,552

69

52

112

Number of secreted proteins

1,572

1,250

45

13

264

Number of PHI genesd

1,547

1,539

4

2

2

Number of TSA proteases

68

65

3

0

0

Number of MFS genes

245

242

0

2

1

Number of cytochrome P450s

57

56

0

1

0

Number of Pth11-like GPCRs

18

18

0

0

0

Number of protein kinases

167

167

0

0

0

Number of transcription factors

123

120

2

1

0

Number of glycoside hydrolases

105

103

2

0

0

Number of SM backbone genes

28

28

0

0

0

Number of horizontally transferred genes

49

30

5

1

12

Number of orthologs in M. anisopliae

6,863

6,705

158

NA

NA

Number of orthologs in M. acridum

6,762

6,644

NA

118

NA

Number of orthologs in F. graminearum

6,740

6,376

106

89

169

Number of orthologs in M. oryzae

6,219

5,937

90

80

112

Percentage identity to M. anisopliae orthologs

63.4

63.7

51.3

NA

NA

Percentage identity to M. acridum orthologs

63.4

63.6

NA

51.2

NA

Percentage identity to F. graminearum orthologs

61.6

62.3

51.9

52.8

46.7

Percentage identity to M. oryzae orthologs

56.0

56.4

48.3

48.0

45.3


aCMM core: C. militaris (CCM), M. anisopliae and M. acridum genes grouped with a cutoff E value of 1e-5 during reciprocal Blast analysis. bCMA restricted: C. militias and M. anisopliae restricted genes grouped with a cutoff E value of 1e-5. cCMC restricted: C. militaris and M. acridum restricted genes grouped at a cutoff E value of 1e-5. dPHI genes, pathogen-host interaction genes predicted by blast analysis against the PHI database [72]. Identity was estimated at the amino acid level. GPCR, G-protein coupled receptor; MFS, major facilitator superfamily; NA, not available; SM, secondary metabolite; TSA, trypsin, subtilisin and aspartyl protease.

Zheng et al. Genome Biology 2011 12:R116   doi:10.1186/gb-2011-12-11-r116

Open Data