|
Resolution: standard / high Figure 1.
Measures of performance for the initial round of GO term predictions. (a) Mean area under the receiver operating characteristic curve (AUC) within each evaluation
category, evaluated using the held-out genes. Gene Ontology Biological process (GO-BP),
Cellular component (GO-CC), and Molecular function (GO-MF) branches are indicated
on the x-axis, grouped by specificity (indicated by the minimum number of genes in
the training set associated with each GO term in a given category). Upper case letters
associated with the color code correspond to submission identifier. (b) Mean AUC within each evaluation category, evaluated prospectively using newly annotated
genes. (c) For each pair of submissions X and Y, we test for difference in AUC value for every
GO term (evaluated using held-out genes). Color bars indicate fraction of pairwise
comparisons for which X's AUC is significantly higher (blue), not significantly different
(beige), and significantly lower (maroon). (d) As (c), except evaluated using the newly annotated genes. (e) The fraction of GO terms exceeding the indicated precision at 20% recall (P20R) value,
evaluated using held-out genes. The black line corresponds to the fraction of GO terms
for which the 'straw man' approach achieved the indicated precision. (f) As (e), except with P20R values derived prospectively from newly annotated genes.
Peña-Castillo et al. Genome Biology 2008 9(Suppl 1):S2 doi:10.1186/gb-2008-9-s1-s2 |