Word pairs in conserved word-pair templates are closely spaced in S. cerevisiae A comparison of the median of minimum distances is shown for three categories of word pairs. For each category, the distribution of median of minimum distances is represented by a box-and-whisker plot, which was generated using the statistical software package R ; the box extends from the 25th percentile to the 75th percentile, and the vertical line within the box denotes the median of the distribution. Dashed lines extend for 1.5 times the range of the box, and circles indicate extreme values. 'Selected template' denotes closely spaced and jointly conserved word pairs (χ2 > 31.1, spacing q < 0.05, N = 989). 'Conserved' denotes dependently conserved word pairs that occur in at least 10 intergenic regions (χ2 > 31.1, N = 3,726) and includes all of the word pairs in the 'selected template' category. 'Random' denotes a sample of randomly conserved word pairs that occur in at least 10 intergenic regions (χ2 < 1, N = 42,718).
Chiang et al. Genome Biology 2003 4:R43 doi:10.1186/gb-2003-4-7-r43