Table 1

Selenoprotein families identified in the Sargasso Sea database

Prokaryotic selenoprotein family

Unique sequences

COG/Pfam ID

COG/Pfam description


Known selenoproteins (209 sequences)

SelW-like protein

48

Pfam05169

Selenoprotein W-related

Peroxiredoxin (Prx)

43

COG1225

Peroxiredoxin

Proline reductase (PrdB)

42

-

Selenophosphate synthetase

28

COG0709

Selenophosphate synthetase

Prx-like protein

22

COG0450

Peroxiredoxin-like

Thioredoxin (Trx)

11

COG3118

Thioredoxin

Formate dehydrogenase alpha chain (fdhA)

8

COG0243

Anaerobic dehydrogenases

Glutathione peroxidase (GPx)

5

COG0386

Glutathione peroxidase

Glycine reductase selenoprotein A (grdA)

1

-

Glycine reductase selenoprotein B (grdB)

1

Pfam07355

Glycine reductase selenoprotein B

New selenoproteins (101 sequences)

AhpD-like protein

27

COG2128

Uncharacterized conserved protein

Arsenate reductase

14

COG1393

Arsenate reductase and related proteins

Molybdopterin biosynthesis MoeB protein

11

COG0476

Dinucleotide-utilizing enzymes, molybdopterin biosynthesis

Glutaredoxin (Grx)

10

COG0695

Glutaredoxin and related proteins

DsbA-like protein

9

COG2761

DsbA-like

Glutathione S-transferase

4

COG0625

Glutathione S-transferase

Deiodinase-like protein

4

Pfam00837

Iodothyronine deiodinase

Thiol-disulfide isomerase-like protein

4

-

CMD domain-containing protein

4

Pfam02627

Carboxymuconolactone decarboxylase

Hypothetical protein 1

4

-

Rhodanese-related sulfurtransferase

3

COG2897

Rhodanese-related sulfurtransferase

OsmC-like protein

3

COG1765

Predicted redox protein, OsmC-like

DsrE-like protein

2

Pfam02635

DsrE-like

DsbG-like protein

1

COG1651

DsbG, Protein-disulfide isomerase

NADH:ubiquinone oxidoreductase

1

COG2209

NADH:ubiquinone oxidoreductase

Total

310


Classification of selenoproteins (10 previously known and 15 new prokaryotic selenoprotein families) is supported by COG or Pfam sequence clusters (or both) as shown in this table. The number of individual selenoprotein sequences for each family is indicated.

Zhang et al. Genome Biology 2005 6:R37   doi:10.1186/gb-2005-6-4-r37

Open Data