Figure 1.

The probability of a feature x occurring in irrelevant articles. The figure shows the three distributions of the leave-out dataset, remaining training dataset, and official test dataset. The probability of a feature x occurring in irrelevant articles (Pr(x|c-)) in different datasets are shown (only 40 features are listed here).

Huang et al. Genome Biology 2008 9(Suppl 2):S12   doi:10.1186/gb-2008-9-s2-s12