Additional data file 3.

Each module is defined by a set of pairwise alignments, and each reference sequence in these sets is represented as a single row in this table. The first column (module) contains an identifier for the particular copy of the module (duplicon) indicated in the next three columns. These columns (query sequence) list the subtelomeric location of the query sequence defining the module (see Materials and methods). The 'aligned sequences' column shows the locations of other duplicons in this module, matched by the query. The coordinates in this column refer either to our published subtelomeric assemblies (designated by chromosome and arm p or q) or the human genome build 35 (all other designations). The %IDeach is percent nucleotide sequence identity across the chained pairwise alignment, excluding masked sequence. The %IDavg is the average percent identity of all pairwise alignments in the module. This was the number used for %ID in charts and analyses in this paper. The final column shows a 1 if the module contains intrachromosomal non-subtelomeric sequence matches, and 0 if it does not.

Format: PDF Size: 772KB Download file

This file can be viewed with: Adobe Acrobat Reader

Ambrosini et al. Genome Biology 2007 8:R151   doi:10.1186/gb-2007-8-7-r151