The contribution of nearby sequences to the formation of de novo insertions. (A) Proportion of nucleotides in insertions and matching controls that match small stretches of DNA present in nearby sequences for different window sizes (30 bp, 60 bp, 90 bp, and 120 bp windows). P values refer to Wilcoxon rank sum tests. (B) Percentage of insertions and matching controls that have at least one small stretch of DNA sequence also found in flanking regions for different window sizes (30 bp, 60 bp, 90 bp, and 120 bp windows). P values refer to Fisher's exact tests.
Cardoso-Moreira et al. Genome Biology 2012 13:R119 doi:10.1186/gb-2012-13-12-r119