Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome

1 Center for Systems and Synthetic Biology and Institute for Cellular and Molecular Biology, University of Texas, Austin, TX 78712, USA
2 Department of Computer Sciences, University of Texas, Austin, TX 78712, USA
3 Department of Chemistry and Biochemistry, University of Texas, Austin, TX 78712, USA
Genome Biology 2005, 6:R40 doi:10.1186/gb-2005-6-5-r40
Subject areas: Bioinformatics, Genome studies, Molecular biology, Methods

Additional files

Additional File 1: Training set of 200 Medline abstracts with all occurrences of protein names tagged. Format: GZ. Size: 111KB.
Additional File 2: Training set of 750 Medline abstracts with all occurrences of protein names tagged. Format: GZ. Size: 434KB.
Additional File 3: Dictionary of human protein names and synonyms indexed to LocusLink identifiers. Format: TXT. Size: 2.9MB.
Additional File 4: Final set of 31,609 protein interactions between 7,748 proteins derived from this analysis. Format: TXT. Size: 943KB.
Additional File 5: Final set of co-citation/Bayesian classifier-derived interactions with the PubMed identifiers of co-citing abstracts. Format: TXT. Size: 812KB.
Additional File 6: Benchmark training set of functional annotations. Format: TXT. Size: 102KB.
Additional File 7: Benchmark test set of functional annotations. Format: TXT. Size: 102KB.
Additional File 9: Discriminating word list used by the Bayesian classifier to estimate the likelihood of Medline abstracts to discuss protein interactions. Format: TXT. Size: 2KB.








