Table 1

Results for 7-gram model using entire dataset

Location
Code
Precision
Sensitivity
FPR
Specificity
MCC

Cytoplasm
CYT
0.828
0.775
0.020
0.980
0.777
Cytoskeleton
CSK
0.882
0.452
0.001
0.999
0.629
Endoplasmic Reticulum
END
0.961
0.789
0.001
0.999
0.867
Extracellular
EXC
0.949
0.939
0.021
0.979
0.921
Golgi Apparatus
GOL
0.891
0.550
0.001
0.999
0.697
Lysosome
LYS
0.953
0.855
0.000
1.000
0.902
Mitochrondria
MIT
0.964
0.799
0.003
0.997
0.867
Nuclear
NUC
0.807
0.906
0.048
0.952
0.821
Plasma Membrane
PLA
0.883
0.958
0.043
0.957
0.892
Perixosome
POX
0.938
0.748
0.000
1.000
0.836

Single-localized % overall accuracy
89.03
Multi-localized % overall accuracy (at least 1 correct)
81.88
Multi-localized % overall accuracy (both correct)
59.70

The performance results of ngLOC on a tenfold cross-validation are displayed. The overall accuracy is also reported for multi-localized sequences, comparing at least one localization predicted correctly against both localizations predicted correctly. FPR, false positive rate; MCC, Matthews correlation coefficient.

King and Guda Genome Biology 2007 8:R68   doi:10.1186/gb-2007-8-5-r68