Figure 2.
Highest percentage-identity match in 5% intervals for the E < 10 datasets of Drosophila and C. elegans compared to the human dataset. Baseline identity between typical C2H2 ZNF domains
is between 20 and 44%, and this is where most genes show their highest identity. Values
higher than this range are strongly suggestive of orthology. We also examined the
difference between this analysis and an analysis of more stringent datasets (E < 1).
All but one of the sequences detected at E < 10 but excluded from E < 1 had maximum
identity matches below 40%.
Knight and Shimeld Genome Biology 2001 2:research0016.1 doi:10.1186/gb-2001-2-5-research0016 |