|
Resolution: standard / high Figure 1.
Method for comparative analysis of gene expression. Given expression matrices for
two species where rows correspond to genes and columns correspond to conditions, we
first find one-to-one ortholog matches between the two species and arrange the matrices
such that equivalent rows represent the expression patterns of orthologs. Note that
after this step the two matrices have the same number of rows, but not necessarily
the same number of conditions and the conditions are not comparable. Next, the Pearson
correlation coefficient (PCC) is calculated for each pair of genes over all conditions,
generating the correlation matrices
, . Each row in these matrices corresponds to the correlations between one gene and
all other genes (with orthologs) from the same genome. Equivalent rows in the two
matrices correspond to the correlation vectors of a pair of orthologs with all other
orthologs from the respective genomes. The correlation between these vectors of correlations
is defined as the initial estimation of expression conservation (EC0). EC scores are then iteratively refined by calculating weighted Pearson correlation
coefficients (PCCw) where EC scores from the previous iteration are used as weights
and genes with negative weights are excluded from the calculation. This procedure
is repeated until convergence of the EC scores (ECk ≈ ECk - 1). The iterative procedure can also be initiated from random weights to verify the
convergence to a global minimum (see Materials and methods).
Tirosh and Barkai Genome Biology 2007 8:R50 doi:10.1186/gb-2007-8-4-r50 |