Table 1

Interkingdom domain fusions and their probable origins

IKF gene
Best 'native' hit
Best 'alien' hit
Protein function
Stand-alone
Comment
(GI number and gene
(E-value, amino acid
(E-value, amino

paralog of the

name) and origin
residue range,
acid residue range,

alien domain

of domains
species)/domain
species)/domain




function
function




Archaea





     Aeropyrum pernix





          5106104_
2621953_Mth
2633525_Bs
Hydroxymethyl-
None
Pyrococci encode proteins with
          APE2400
5e-27;
4e-54;
pyrimidine

the same domain organization
          Archaeal-bacterial
282-445;
16-272;
phosphate kinase

andclosest similarity to A. pernix;

uncharacterized domain
hydroxymethyl-
involved in thiamine

M. jannaschii encodes a protein

conserved among
pyrimidine phosphate
biosynthesis

with the same domain

archaea (homolog
kinase
(additional function?)

organization but low similarity;

of the amino-terminal



Mt encodes a HMP-kinase with

domain of sialic acid



moderate similarity

synthase)




     Methanococcus jannaschii





          1591138_
2128140_Mj;
7270033_At;
Unknown;
None
The amino-terminal domain is
          MJ0434
1e-19;
0.003;
possible role

present in several stand-alone
          Archaeal-
2-94;
120-222;
in stress response

copies in M. jannaschii, but
          bacterial-eukaryotic
uncharacterized
AIG2-like


otherwise, is seen mostly in

domain
stress-related


bacteria; the possibility of


protein


acquisition of a bacterial gene





by the Methanococcus lineage





is conceivable
     Methanobacterium thermoautotrophicum





          2621249_
5103547_Ap;
1651798_Ssp;
Membrane-associated
None
In Ssp, the amino-terminal
          MTH204
1e-34;
0.002;
5-formyl-

domain is fused to another
          Archaeal-
137-326;
8-139;
tetrahydrofolate

uncharacterized domain. An
          eukaryotic/
5-formyl-
uncharacterized
cyclo-ligase(?);

ortholog with conserved
          bacterial
tetrahydrofolate
membrane-associated
exact function

domain organization is seen

cyclo-ligase
domain
unknown

in Mycobacterium, but many





other bacteria encode stand-





alone versions of this domain,





which could be the actual sources





of horizontal gene transfer
          2621673_
3256572_Ph;
2984130_Aa;
GTPase, possible
2621855

          MTH594
3e-10;
6e-19;
role in signal


          Archaeal-bacterial
5-137;
233-390;
transduction



inactivated RecA
GTPase




domain




          2622642_
5105992_Ap;
2569943_Axy;
Glucose-1-phosphate
None

          MTH1523
3e-36;
2e-05;
thymidylyl transferase/


          Archaeal-bacterial
5-226;
226-334;
glucose-6-phosphate



glucose-1-phosphate
mannose-6-
isomerase



thymidylyl transferase
phosphate isomerase



     Bacteria





          Aquifex aeolicus





          2983622_
2633696_Bs;
2650176_Af;
Signal
None

          aq_1151
5e-65;
0.005;
transduction


          Bacterial-archaeal
325-795;
116-279;
c-di-GMP



c-di-GMP phospho-
PAS/PAC
phospho-diesterase



diesterase
domain



          2984285_
586875_Bs;
3915955_Mj;
Molybdenum
None

          aq_2060
4e-63
3e-09;
cofactor


          Bacterial-archaeal
1-252;
270-441;
bisynthesis enzyme(?)



PHP superfamily
pyruvate




hydrolase
formate-lyase





activating enzyme





(Fe-S cluster





oxidoreductase)



     Bacillus subtilis





          2632283_yaaH,
4980914_Tm
399377_Rn
Chitinase
2635915
B. subtilis encodes two
          1945087_ydhD
1e-06
2e-11


paralogous proteins with the
          Bacterial-eukaryotic
2-92;
221-402;


same domain architecture

LysM repeat domain
chitinase



          2633242_yhcR
645819_Dr;
2622704_Mth;
Nuclease-nucleotidase
None

          Bacterial-archaeal
1e-64;
0.008
(probable repair



584-1068;
151-257;
enzyme)



5'-nucleotidase;
nucleic acid-binding




1175987_
domain (OB-fold)




ECR100;





2e-09;





377-521;





thermonuclease




          2632325_yabN
4981449_Tm;
3873806_Ce;
Methyl-transferase/
None
Other than in chlamydiae,
          Bacterial-eukaryotic
2e-62;
0.003;
pyro-phosphatase

the SWI domain is seen

223-483;
7-125;
(metabolic enzyme

in eukaryotic chromatin-

MazG (predicted pyro-
SAM-dependent
of an unknown

associated proteins, leading

phosphatase)
methyl-transferase
pathway?)

to the suggestion that





chlamydial topoisomerase





is involved in chromosome





condensation
     Chlamydophyla pneumoniae





          4377077_
730965_Bs;
3581917_Sp;
DNA topoisomerase I,
7189103
SWI is a typical eukaryotic
          CPn0769
e-148;
3e-10;
possibly involved in

domain not found in
          Bacterial-eukaryotic
1-727;
792-866;
chromatin

prokaryotes other than

DNA topoisomerase I
SWI domain
condensation

chlamydia (the ortholog





in Chlamydia trachomatis has the





same domain architecture)
     Deinococcus radiodurans





          6459294_
7248325_Sco;
6754878_Mm;
DNase
None
The G9a domain is not
          DR1533
0.001;
9e-28;


detectable in other prokaryotes.
          Bacterial-eukaryotic
171-265;
4-148;


In eukaryotes, this domain so

McrA family
G9a domain (DNA-


far has been found only as part

endonuclease
binding?)


of multidomain nuclear proteins,





including transcription factors
     Escherichia coli





          1787179_
94933_Ppu;
3747107_Rn;
Oxidoreductase
None
The eukaryotic domain is present
          b0947
3e-10;
3e-32;


(as a partial sequence) also in the
          Bacterial-eukaryotic
287-367;
4-261;


beta-proteobacterium Vogesella.

ferredoxin
uncharacterized


This domain contains a conserved


domain (thiol


pair of cysteines, which together


oxidoreductase?)


with the ferredoxin fusion, may





suggest a thiol oxidoreductase





activity. Most of the eukaryotic





proteins containing this domain





appear to be mitochondrial,





suggesting the possibility of an





alternative evolutionary scenario
          1787678_
487713_Sli;
5459012_Pab;
Methyl-transferase/
None

          b1410
3e-05;
1e-17;
Lipase (exact function


          Bacterial-archaeal
408-522;
33-274;
unclear)



SAM-dependent
lyso-phospholipase




methyl-transferase




          1787679_ynbD
1591375_Mj;
7160233_Sp;
Membrane-associated
None
An unusual case of fusion
          Archaeal-eukaryotic
4e-04;
1e-06;
bifunctional

between an apparently archaeal

50-218;
346-415;
phosphatase

and a typical eukaryotic domain

membrane-associated
tyrosine phosphatase


in a bacterium

acid phosphatase




          1788589_
5763950_Sco;
3860247_At;
Bifunctional enzyme;
None

          b2255
4e-35;
1e-55;
exact function unclear


          Bacterial-eukaryotic
1-259;
318-652;




methionyl-tRNA
dTDP-glucose 4-6-




formyl-transferase
dehydratase



          1788938_yfiQ
929735_Nsp;
2649370_Af;
acetyl-CoA synthetase/
None

          bacterial-Archaeal/
8e-32;
4e-85;
acetyl-transferase; exact


          eukaryotic
637-874;
6-689;
function unclear



acetyl-transferase
acetyl-CoA synthetase



     Mycobacterium tuberculosis





          2909507_
6469244_Sco;
4151109_Tbr;
Adenylate cyclase/
7476546,
M. tuberculosis encodes three
          Rv2488c,
5e-64;
6e-04;
ATPase; probable
7476738
paralogous proteins that consist
          2791528_Rv1358,
19-603;
6-167;
transcription regulator

of three domains, the eukaryotic-
          1419061_
4726088_Rer;
adenylate cyclase


type adenylate cyclase, AP
          Rv1358
2e-12;



(apoptotic) ATPase and DNA-
          Bacterial-eukaryotic
818-1073



binding response regulator, and





two stand-alone versions of





adenylate cyclase, which show the





closest similarity to the cyclase





domain of the multidomain





proteins
          1314025_
120037_Tt;
178213_Hs;
Ferredoxin/
2076681
D. radiodurans also encodes the
          Rv0886
1e-11;
4e-65;
ferredoxin reductase

eukaryotic-type ferredoxin
          Bacterial-eukaryotic
2-79;
93-543;


reductase, but the ferredoxin

ferredoxin
ferredoxin reductase


fusion is unique to mycobacteria
          3261732_
2661695_Sco;
279520_Dd;
cAMP-dependent
4455714

          Rv0998
3e-13;
7e-07;
acetyl-transferase(?)
(M. leprae)

          Bacterial-eukaryotic
148-328;
30-105;




acetyl-transferase
cAMP-binding domain



          2326726_
421331_Cvi;
2645721_Mm;
Bifunctional enzyme of
1929080

          Rv1683
1e-24;
6e-26;
poly (3-hydroxy-butyrate)


          Bacterial-eukaryotic
23-359;
456-972;
synthesis



poly (3-hydroxy-
very-long-chain




butyrate) synthase
acyl-CoA synthetase



          1403447_
6752338_Sco;
3892714_At;
Polyfunctional enzyme
2661651
In this protein, the domain of
          Rv2006
2e-27;
8e-27;
of trehalose metabolism

apparent eukaryotic origin
          Bacterial-eukaryotic
23-240;
264-521;


is flanked by bacterial domains

phosphatase;
trehalose-6-phosphate


from both sides

6448751_Sco;
phosphatase




0.0;





534-1320;





trehalose hydrolase




          2896788_
117648_Ec;
3073773_Mm;
Polyfunctional enzyme
2337823
The presence of the stand-alone
          Rv2051c
1e-16;
4e-31;
of lipid metabolism
(M. leprae);
version of the eukaryotic
          Bacterial-eukaryotic
94-514;
588-829;

6468712
domain in Streptomyces suggests

apolipoprotein
dolichol-phosphate-

(Streptomyces
an ancient horizontal transfer

N-acyltransferase
mannose synthase

coelicolor)

          2791523_
6225563_Scy;
1098605_Cnu;
Multifunctional enzyme
None

          Rv2483c
7e-16;
5e-22;
of phospholipid


          Bacterial-eukaryotic
36-253;
289-492;
metabolism



phosphoserine
1-acyl-sn-




phosphatase
glycerol-3-phosphate





acyltransferase



          2894233_
2633801_Bs;
4538974_At;
Molybdopterin synthase
2076687
The same domain organization
          Rv3323c
3e-19;
7e-06;


is seen in D. radiodurans, but in
          Bacterial-eukaryotic
89-208;
2-82;


this case, both components

molybdopterin
molybdopterin


appear to be of bacterial origin

synthase large subunit
synthase small subunit




(MoaE)
(MoaD)



          2960152_
4753872_Sco;
466119_Ce;
cAMP-regulated
2501688
M. tuberculosis encodes two
          Rv3728,
1e-35;
7e-20;
efflux pump(?)

strongly similar paralogs with
          7477551_
56-428;
549-964;


the same domain architecture
          Rv3239c
transmembrane
cAMP-binding domain-



          Bacterial-eukaryotic
efflux protein
phosphoesterase



          2960153_
4731342_Sl;
1591330_Mj;
Bifunctional enzyme
1806159
The amino-terminal domain
          Rv3729
3e-14;
3e-58;
of molybdenum

stand-alone paralog is more
          Bacterial-archaeal
510-776;
molybdenum
cofactor biosynthesis

similar to archaeal homologs

C5-O-methyl-
cofactor biosynthesis


than to the stand-alone paralog,

Transferase
protein MoaA


but nevertheless, the latter

(mitomycin
(Fe-S oxidoreductase)


appears to be of archaeal origin

biosynthesis)




          3261806_
40487_Cg;
7304009_Dm;
Secreted protein
7649504
The stand-alone version of the
          Rv3811
3e-12;
2e-12;

(S. coelicolor)
eukaryotic domain is present
          Bacterial-eukaryotic
404-494;
198-384;


only in Streptomyces

major secreted
peptidoglycan




protein
recognition protein



     Treponema pallidum





          3322964_
7225946_Nm;
320868_Sc;
Uridine kinase
None
A co-linear ortholog is present
          TP0667
9e-04;
2e-13;


in Thermotoga
          Bacterial-eukaryotic
10-154;
290-488;




threonyl-tRNA
uridine kinase




synthetase (TGS and





H3H domains)




     Thermotoga maritima





          4981276_
68516_Bs;
3218401_Sp;
Uridine kinase
None
A co-linear ortholog is present
          TM0751
3e-07;
2e-11;


in Treponema
          Bacterial-eukaryotic
11-200;
288-475;




threonyl-tRNA
uridine kinase




synthetase (TGS and





H3H domains)




Eukaryotes





     Saccharomyces cerevisiae





          536367_
586134_Bt;
7450047_Aa;
Bifunctional signal-
5249
SurE homologs are not
          Ybr094w
9e-10;
8e-09;
transduction protein
(Yarrowia
detectable in eukaryotes other
          Eukaryotic/
tubulin-tyrosine ligase
acid phosphatase

lipolytica)
than yeasts
          Bacterial-archaeal

(SurE)



          1431219_
577625_Hs;
3328426_Ct



          YDL141w
1e-39
5e-27;



          Eukaryotic-
Biotin-[propionyl-
biotin protein ligase
Bifunctional biotin-
None
An ortholog with an identical
          bacterial
CoA-carboxylase(ATP-

protein ligase

domain architecture is present

hydrolysing)] ligase



in S. pombe
          458922_
477096_Gg;
1653075_Ssp;
heat shock
NONE
An ortholog with an identical
          YHR206W
8e-18;
7e-17;
transcription

domain architecture is present
          Eukaryotic-bacterial
78-216
375-503;
factor

in S. pombe (3327019)

heat shock
CheY domain




transcription factor





domain
2983676_Aa;
Siroheme synthase
2330809
S. pombe also encodes a co-linear
          486539_
1146165_At;
1e-04;

(S. pombe)
ortholog (3581882); apparent
          YKR069w
3e-34;
22-188;


displacement of the bacterial
          Eukaryotic-bacterial
249-556;
precorrin-2 oxidase


precorrin-2 oxidase by a distinct

urophorphyrin III



Rossmann fold domain

methylase




          1302305_
4938476_At;
3212189_Hi;
Multifunctional enzyme
None
Co-linear orthologs in S. pombe
          YNL256w
5e-65;
5e-05;
of folate biosynthesis

(7490442) and Pneumocystis
          Eukaryotic-bacterial
324-861
62-148;


carinii (283062)

7,8-dihydro-6-
187-297;




hydroxymethylpterin-
dihydro-neopterin




pyro-phosphokinase+
aldolase




Dihydro-pteroate





synthase




          1419887_
7297709_Dm;
5918510_Sco;
Bifunctional RNA
2213559
The known bacterial homologs
          YOL066c
2e-72;
2e-10;
modification enzyme
(S. pombe)
have a two-domain organization;
          Eukaryotic-bacterial
42-408;
436-574;


the evolutionary scenario could

large ribosomal
pyrimidine deaminase


have included domain

subunit pseudoU



rearrangements

synthase




          1419865_
2462827_At;
1075360_Hi;
Transcriptional regulator None

Yeast encodes three strongly
          YOL055c,
1e-39;
6e-24;
of thiamine biosynthesis

similar paralogs with identical
          2132251_
22-390;
342-549;
genes(?)

domain organization; co-linear
          YPL258c,
phosphomethyl
transcriptional


orthologs are present in other
          2132289_
pyrimidinekinase
activator


ascomycetes
          YPR121w
(thiamine biosynthesis)




          Eukaryotic-bacterial





          1370444_ YPL214c
2746079_Bn;
2648451_Af;
Bifunctional thiamine
None
Except for the one from
          Eukaryotic-archaeal/
1e-27;
9e-27;
biosynthesis enzyme

A. fulgidus, all highly conserved
          Bacterial
9-233;
251-531;


homologs of the kinase domain

thiamin-phosphate
hydroxyethyl-thiazole


of this protein are bacterial; it

pyro-phosphorylase
kinase


appears likely that the A. fulgidus





gene is the result of horizontal





transfer

The following complete genomes were analyzed. Archaea: Aeropyrum pernix (Ap); Archaeoglobus fulgidus (Af); Methanococcus jannaschii (Mj); Methanobacterium thermoautotrophicum (Mth); Pyrococcus horikoshii (Ph); Bacteria: Aquifex aeolicus (Aa); Borrelia burgdorferi (Bb); Bacillus subtilis (Bs); Chlamydophila pneumoniae (Cp); Deinococcus radiodurans (Dr); Escherichia coli (Ec); Haemophilus influenzae (Hi); Helicobacter pylori (Hp); Mycobacterium tuberculosis (Mt); Mycoplasma pneumoniae (Mp); Rickettsia prowazekii (Rp); Synechocystis sp (Ssp); Thermotoga maritima (Tm); Treponema pallidum (Tp). No IKFs were detected in the genomes that are not shown in the table. Additional species name abbreviations: At, Arabidopsis thaliana; Axy, Acetobacter xylinus; Bn, Brassica napus; Ce, Caenorhabditis elegans; Cvi, Chromatium vinosum; Gg, Gallus gallus; Hs, Homo sapiens; Mm, Mus musculus; Rn, Rattus norvegicus; Sco, Streptomyces coelicolor; Sl, Streptomyces lavendulae.

Wolf et al. Genome Biology 2000 1:research0013.1   doi:10.1186/gb-2000-1-6-research0013