

Open Access Method

A gold standard set of mechanistically diverse enzyme superfamilies

Shoshana D Brown1, John A Gerlt2, Jennifer L Seffernick3 and Patricia C Babbitt4

1Department of Biopharmaceutical Sciences, University of California, San Francisco, 1700 4th Street, San Francisco, CA 94143-2550, USA

2Department of Biochemistry, University of Illinois, Roger Adams Laboratory, 600 S Mathews Avenue, Urbana, IL 61801, USA

3Department of Biochemistry, Molecular Biology, and Biophysics, Biological Process Technology Institute, and Center for Microbial and Plant Genomics, University of Minnesota, St Paul, MN 55108, USA

4Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th Street, San Francisco, CA 94143-2550, USA


Genome Biology 2006, 7:R8 doi:10.1186/gb-2006-7-1-r8

Published: 31 January 2006

Subject areas: Biochemistry and structural biology, Bioinformatics, Molecular biology

Abstract

Superfamily and family analyses provide an effective tool for the functional classification of proteins, but must be automated for use on large datasets. We describe a 'gold standard' set of enzyme superfamilies, clustered according to specific sequence, structure, and functional criteria, for use in the validation of family and superfamily clustering methods. The gold standard set represents four fold classes and differing clustering difficulties, and includes five superfamilies, 91 families, 4,887 sequences and 282 structures.

