Table 4

Protein domain enrichment found in recently duplicated mouse genes*

InterPro entry ID
Protein domain description
Number found in 608 duplicated genes
Number found in all 16,515 annotated genes in genome
Enrichment

IPR000276
Rhodopsin-like GPCR superfamily
135
1229
3.0
IPR000725
Olfactory receptor
103
861
3.3
IPR003006
Immunoglobulin/major histocompatibility complex
46
372
3.4
IPR004072
Vomeronasal receptor, type 1
31
108
7.8
IPR001909
KRAB box
23
103
6.1
IPR001254
Serine protease, trypsin family
21
117
4.9
IPR002401
E-class P450, group I
20
61
8.9
IPR001128
Cytochrome P450
20
68
8.0
IPR007086
Zn-finger, C2H2 subtype
20
139
3.9
IPR001314
Chymotrypsin serine protease, family S1
19
108
4.8
IPR002403
E-class P450, group IV
17
56
8.2
IPR002397
B-class P450
13
29
11.9
IPR001304
C-type lectin
13
96
3.7
IPR000215
Serpin
12
48
6.8
IPR002402
E-class P450, group II
9
14
18.5
IPR006046
Glycoside hydrolase family 13
7
8
23.0
IPR006047
Alpha amylase, catalytic domain
7
10
19.2
IPR001400
Somatotropin hormone
7
32
6.1
IPR006048
Alpha amylase, C-terminal all-beta domain
6
7
24.7
IPR002018
Carboxylesterase, type B
6
13
12.3
IPR004073
Vomeronasal receptor, type 2
6
13
12.3
IPR001039
Major histocompatibility complex protein, class I
6
17
9.9
IPR001828
Extracellular ligand-binding receptor
6
29
5.5
IPR002213
UDP-glucoronosyl/UDP-glucosyl transferase
5
12
11.8
IPR002448
Odour-binding protein
4
9
13.2
IPR000068
Extracellular calcium-sensing receptor
4
10
11.0

*Only Ensembl gene annotation (608 genes) was used in this analysis. All results shown are statistically significant with p-values < 10-5 (chi2 test).

Cheung et al. Genome Biology 2003 4:R47   doi:10.1186/gb-2003-4-8-r47

Open Data