Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome
* Corresponding authors: Raymond J Mooney mooney@cs.utexas.edu - Edward M Marcotte marcotte@icmb.utexas.edu
1 Center for Systems and Synthetic Biology and Institute for Cellular and Molecular Biology, University of Texas, Austin, TX 78712, USA
2 Department of Computer Sciences, University of Texas, Austin, TX 78712, USA
3 Department of Chemistry and Biochemistry, University of Texas, Austin, TX 78712, USA
Genome Biology 2005, 6:R40 doi:10.1186/gb-2005-6-5-r40
Published: 15 April 2005
Additional files
Additional File 1:
Training set of 200 Medline abstracts with all occurrences of protein names tagged
Format: GZ Size: 111KB
Additional File 2:
Training set of 750 Medline abstracts with all occurrences of protein names tagged
Format: GZ Size: 434KB
Additional File 3:
Dictionary of human protein names and synonyms indexed to LocusLink identifiers
Format: TXT Size: 2.9MB
Additional File 4:
Final set of 31,609 protein interactions between 7,748 proteins derived from this analysis
Format: TXT Size: 943KB
Additional File 5:
Final set of co-citation/Bayesian classifier-derived interactions with the PubMed identifiers of co-citing abstracts
Format: TXT Size: 812KB
Additional File 6:
Benchmark training set of functional annotations
Format: TXT Size: 102KB
Additional File 7:
Benchmark test set of functional annotations
Format: TXT Size: 102KB
Additional File 9:
Discriminating word list used by the Bayesian classifier to estimate the likelihood of Medline abstracts to discuss protein interactions
Format: TXT Size: 2KB
