<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-11-r128</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Method</dochead>
      <bibl>
         <title>
            <p>Genome Alteration Print (GAP): a tool to visualize and mine complex cancer genomic profiles obtained by SNP arrays</p>
         </title>
         <aug>
            <au ca="yes" id="A1"><snm>Popova</snm><fnm>Tatiana</fnm><insr iid="I1"/><insr iid="I2"/><email>tatiana.popova@curie.fr</email></au>
            <au id="A2"><snm>Mani&#233;</snm><fnm>Elodie</fnm><insr iid="I1"/><insr iid="I2"/><email>elodie.manie@curie.fr</email></au>
            <au id="A3"><snm>Stoppa-Lyonnet</snm><fnm>Dominique</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><insr iid="I4"/><email>dominique.stoppa-lyonnet@curie.net</email></au>
            <au id="A4"><snm>Rigaill</snm><fnm>Guillem</fnm><insr iid="I5"/><insr iid="I6"/><email>guillem.rigaill@curie.fr</email></au>
            <au id="A5"><snm>Barillot</snm><fnm>Emmanuel</fnm><insr iid="I1"/><insr iid="I7"/><insr iid="I8"/><email>emmanuel.barillot@curie.fr</email></au>
            <au id="A6"><snm>Stern</snm><mnm>Henri</mnm><fnm>Marc</fnm><insr iid="I1"/><insr iid="I2"/><email>marc-henri.stern@curie.fr</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Centre de Recherche, Institut Curie, 26 rue d'Ulm, Paris, 75248, France</p></ins>
            <ins id="I2"><p>INSERM U830, Institut Curie, 26 rue d'Ulm, Paris, 75248, France</p></ins>
            <ins id="I3"><p>Department of Tumor Biology, Institut Curie, 26 rue d'Ulm, Paris, 75248, France</p></ins>
            <ins id="I4"><p>University Paris Descartes, 12 rue de l'Ecole de M&#233;decine, Paris, 75270, France</p></ins>
            <ins id="I5"><p>Translational Research Department, Institut Curie, 1 avenue Claude Vellefaux, Paris, 75475, France</p></ins>
            <ins id="I6"><p>MIA 518, AgroParisTech/INRA, 16 rue Claude Bernard, Paris, 75231, France</p></ins>
            <ins id="I7"><p>INSERM U900, Institut Curie, 26 rue d'Ulm, Paris, 75248, France</p></ins>
            <ins id="I8"><p>Ecole des Mines ParisTech, 35 rue Saint Honor&#233;, Fontainebleau, 77305, France</p></ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>11</issue>
         <fpage>R128</fpage>
         <url>http://genomebiology.com/2009/10/11/R128</url>
         <xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-11-r128</pubid><pubid idtype="pmpid">19903341</pubid></pubidlist></xrefbib>
      </bibl>
      <history><rec><date><day>2</day><month>2</month><year>2009</year></date></rec><revrec><date><day>24</day><month>9</month><year>2009</year></date></revrec><acc><date><day>11</day><month>11</month><year>2009</year></date></acc><pub><date><day>11</day><month>11</month><year>2009</year></date></pub></history>
      <cpyrt><year>2009</year><collab>Popova et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
      <shorttitle>
         <p>Mining SNP arrays</p>
      </shorttitle>
      <shortabs>
         <p>GAP, a method for analyzing complex cancer genome profiles from SNP arrays, performs well even with poor quality data and rearranged genomes</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>We describe a method for automatic detection of absolute segmental copy numbers and genotype status in complex cancer genome profiles measured with single-nucleotide polymorphism (SNP) arrays. The method is based on pattern recognition of segmented and smoothed copy number and allelic imbalance profiles. Assignments were verified by DNA indexes of primary tumors and karyotypes of cell lines. The method performs well even for poor-quality data, low tumor content, and highly rearranged tumor genomes.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="30010003" subtype="man_spc_id" type="BMC">Cancer</classification>
         <classification id="30010013" subtype="man_spc_id" type="BMC">Methods</classification>
         <classification id="30010016" subtype="man_spc_id" type="BMC">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Alterations of genomic DNA are hallmarks of cancer <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. These genetic alterations include point mutations and small insertion/deletion events, translocations, copy-number changes, amplifications, and losses of heterozygosity. Chromosome copy-number alterations and homozygosities (uniparental disomies) acquired during cancer evolution are believed to be selected as the result of the loss of function of tumor-suppressor genes and the gain of function of oncogenes. Recurrent copy-number variations (CNVs) or loss of heterozygosity (LOH) are therefore critical indicators of possible localization of cancer-related genes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Both recurrent regions of alteration and patterns of genomic instability contribute to tumor classification <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Single-nucleotide polymorphism (SNP) arrays are presently one of the most efficient technologies for the identification of such alterations <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. SNP arrays simultaneously define copy-number changes and allelic imbalances (including LOH) occurring in a tumor, at high resolution and throughout the whole genome <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Genome-wide SNP arrays are available mainly for Affymetrix <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and Illumina <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> platforms. On both platforms, SNP genotypes are extracted from allele-specific signal intensities after array hybridization. Arbitrarily, the two alleles are designated as A and B, and the ratio of allele-specific signal intensities (A/B, A/(A+B), and so on, depending on the method used) provides an allelic-imbalance value. Chromosomal aberrations are identified by (a) relative copy-number changes and (b) allelic imbalances. Both platforms were originally designed for high-throughput genotyping of <it>normal </it>genomes, and they require specific normalization and data-mining strategies to study alterations in <it>cancer </it>genomes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Two characteristics of genetic alterations are essential to extract from SNP data: (a) breakpoints corresponding to the boundaries of the altered regions of genomic DNA, and (b) copy number and genotype status of each such alteration.</p>
         <p>Accurate determination of breakpoints has been addressed from many aspects, starting from reduction of nonrelevant variation to optimal breakpoint counts and positioning <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. As compromises between sensitivity and specificity, these methods will perform variably, depending on the specific setting used, the quality of the primary data set, and the complexity of the tumor genomes.</p>
         <p>Determination of copy numbers and genotype status of each alteration is more complicated, and no general solution has yet been proposed. Attempts to address this question include a manual interpretation of Affymetrix 500K SNP-array results for glioblastomas presented in <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and an automatic copy-number recognition method based on allelic imbalances for the Illumina platform, proposed in <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Other methods attribute relative gain, loss, or allelic-imbalance status without addressing the determination of absolute copy number and genotype (cnvPartition from Illumina, <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>).</p>
         <p>Three major sources of problems complicate the estimation of genome-wide copy number in cancer cells with SNP-array technology. The first concerns the determination of the reference point for copy-number variation (the level corresponding to the unaltered status of the tumor genome), which is not trivial for aneuploid cancer genomes with unknown underlying ploidies (diploid, tetraploid, and so on). Eventually, the reference point for a near-diploid cancer genome should correspond to normal genome status: a balanced genotype (AB status) and two copies. In the case of near-tetraploid tumors, a balanced genotype (AABB) and four copies could be proposed as the reference point. Setting the correct reference point thus depends on recognition of the underlying ploidy. This issue is considered in Attiyeh and colleagues <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, in which an aneuploidy correction factor was determined based on intensity-distribution modes in regions with balanced genotypes. Gardina and co-workers <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> directly estimated the chromosome copy-number status by using theoretic allelic ratios indicative of higher ploidy levels and then inferred tumor ploidy.</p>
         <p>The second problem arises from the frequent contamination of cancer samples by normal stromal cells. The presence of a significant proportion of normal DNA in a sample diminishes the amplitude of measured signal changes reflecting rearrangements in the tumor DNA. Any fixed threshold-based method of copy-number variation recognition may fail to distinguish the proper regions. A number of publications have addressed this issue <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B24">24</abbr></abbrgrp>. Staaf and colleagues <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> proposed a strategy for copy-number and LOH recognition based on adjusted thresholds, inferred from their study of dilution series. A model for estimation of normal DNA inclusion on the basis of measured allelic imbalances is considered in <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. These authors also mentioned that, in addition to negative effects, a small degree of contamination could help in distinguishing somatically acquired homozygosity from germline homozygous regions.</p>
         <p>The third problem in mining cancer SNP-array profiles is coming from intratumoral heterogeneity <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Although generally arising from a single cell (monoclonal proliferation), cancer progression leads to subpopulations bearing different genomic alterations (subclones) coexisting in most tumor samples. The tumor genomic profile is thus due to (a) genomic alterations shared by all tumor cells and producing few discrete steps of gains and losses, and (b) subclonal events shared by only certain subpopulations of tumor cells and producing a number of intermediate steps in the "main" copy-number profile. CNV and LOH status of an alteration specific for subclones is generally indefinable, as the measured signal reflects the sum of unknown subclonal signals in unknown proportions. An algorithm estimating the proportion of cancer cells harboring the particular alteration event was proposed in <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and confirmed on known genetic events from a serial dilution of cancer cells with normal matched cells.</p>
         <p>In this article, we propose a method for segmental copy-number and genotype detection from SNP arrays that takes advantage of previous findings and addresses the aforementioned issues. This method is based on SNP-array data formalization that we have called the Genome Alteration Print (GAP). The GAP of a tumor sample summarizes segmented CNV and allelic imbalance profiles into a list of segments, characterized by two corresponding averages. GAP visualization reveals the overall genomic ploidy of tumors, pinpoints the possible normal status (reference point for gain and loss), shows the level of contamination, indicates subclones, and generally characterizes the tumor genome. The <it>model </it>GAP built on theoretic distribution of CNV and allelic imbalances provides interpretation for a tumor GAP and serves as a basis for automatic recognition of the copy number and genotype of each segment.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Generation of complex cancer genome data sets</p>
            </st>
            <p>The 300K Illumina SNP-arrays (Human Hap300-Duo) were used to study breast cancer genomes in a series of primary breast carcinomas (40 cases) and two cell lines. This series includes basal-like carcinomas (BLCs) arising in the general population (sporadic BLCs) and in <it>BRCA1 </it>mutation carriers, who are especially predisposed to BLCs <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Both hereditary (in <it>BRCA1 </it>carriers) and sporadic BLCs are associated with inactivation of <it>BRCA1 </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, a key protein for DNA repair <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Analysis of breast carcinomas by SNP-arrays is complicated by the numerous genomic rearrangements associated with these tumors <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, their high stromal cell content <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and intratumoral heterogeneity <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
            <p>Figure <figr fid="F1">1a</figr> and <figr fid="F1">1b</figr> shows the whole genome profiles of the BLC_B1_T45 sample measured on a 300K Illumina SNP-array. The copy-number variation (CNV) profile is represented by the Log R ratios (LRRs), which are the log-transformed ratios of experimental and normal reference SNP intensities, centered at zero for each sample. Allelic imbalances are represented by the B-allele frequencies (BAFs), which are the normalized proportions of the B alleles in two allele mixtures. Complexity of the profile is characterized by (a) the number of breakpoints in both profiles, and (b) the number of levels in smoothed LRRs and BAFs corresponding to the alteration states in the genomic DNA. The amplitude of both LRR and BAF changes depends on the purity of the tumor sample <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B24">24</abbr></abbrgrp>. The main challenge is to interpret both segmental LRR and BAF values correctly in terms of <it>absolute </it>DNA copy number and LOH status, provided various amplitudes of changes, unknown underlying tumor genome ploidy, and disturbing subclonal intermediates. Specifically, DNA segments including at least 10 SNPs (~40 kb on average) were analyzed, which decreased resolution but minimized the effects of both experimental variations and short CNVs observed in population studies <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <fig id="F1"><title><p>Figure 1</p></title><caption><p>The whole-genome single-nucleotide polymorphism (SNP) array profile and genome alteration print (GAP)</p></caption><text>
   <p>The whole-genome single-nucleotide polymorphism (SNP) array profile and genome alteration print (GAP). The whole-genome profile of genomic rearrangements in the BLC_B1_T45 sample measured by 300K Illumina SNP-array and corresponding GAP. (a) Allelic imbalances are represented by B-allele frequency (BAF). (b) Copy-number variation profile is represented by log R ratio (LRR), centered at zero. (c) The GAP of the sample is a combined sideview projection of segmented LRR and BAF. Each region of the genome is represented by two symmetric circles in the case of allelic imbalance and by one circle centered at BAF = 0.5 in the case of a balanced genotype. Attribution of copy numbers and genotypes corresponds to a near-diploid model of rearrangements. (d) "Reading" GAP pattern: the degree of stromal contamination, acquired and germline homozygosities, and subclones are indicated. (e) The best-fitting model GAP allows interpretation of the cluster structure and estimates contamination by normal DNA and contraction of the pattern on the LRR scale. Clusters are designated by the ratio of copy number to B (or major allele) counts.</p>
</text><graphic file="gb-2009-10-11-r128-1"/></fig>
         </sec>
         <sec>
            <st>
               <p>Genome alteration print (GAP)</p>
            </st>
            <p>The method for segmental copy-number and genotypes attribution presented here is based on the structure denoted by GAP. To build the GAP, breakpoints in LRR and BAF profiles are determined separately by the circular binary segmentation (CBS) algorithm (see Materials and methods for details) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Any contiguous region in both LRR and BAF profiles (region between two consecutive breakpoints from LRR and BAF breakpoints mixture) is considered to be an alteration unit (possibly unaltered) and characterized by (a) the median of LRR, (b) the modes of BAF distribution, and (c) the length of the corresponding region (in SNP counts). The list and two-dimensional visualization of all alteration units of a measured sample is denoted the GAP.</p>
            <p>The GAP of the BLC_B1_T45 sample is shown in Figure <figr fid="F1">1c</figr>. Each alteration unit is represented by a circle, with the center coordinates equal to its BAF (x-axis) and LRR (y-axis) smoothed values. The circle radius is scaled to the relative size of the corresponding chromosome region. In other words, the structure in Figure <figr fid="F1">1c</figr> represents a combined side-view projection of segmented and smoothed profiles of LRR and BAF shown earlier in Figure <figr fid="F1">1ab</figr>. The pattern in Figure <figr fid="F1">1c</figr> has a regular structure: circles corresponding to genomic regions with similar alteration status are assembled in clusters, forming discrete steps in their projection on the LRR scale, and symmetrically disposed on the BAF scale. As "A" and "B" allele names are set arbitrarily, the BAF profile is symmetric relative to 0.5 axis, and one alteration unit is represented by two symmetric circles away from the 0.5 axis on the BAF scale. Clusters centered at BAF = 0.5 present the genome regions with balanced (heterozygous) genotype; that is, an equal representation of both (maternal and paternal) alleles.</p>
            <p>According to standard mining of SNP-array results, the GAP pattern shows (a) normal regions, which correspond to the balanced cluster; (b) losses, which are below the level of the balanced cluster; (c) gains, which are above this level; and (d) loss of heterozygosity without copy-number change (uniparental disomy), which are the side clusters of the reference balanced cluster (Figure <figr fid="F1">1d</figr>). The overall pattern of GAP corresponds to rearrangements in a near-diploid tumor.</p>
            <p>The balanced cluster representing the normal status is generally not centered at zero on the LRR scale, which is set by normalization. For example, in Figure <figr fid="F1">1</figr>, the functional center (the diploid balanced cluster that represents unaltered regions) is shifted up from the formal center of the LRR profile (zero on LRR scale) because of the prevailing losses versus gains observed in the tumor.</p>
            <p>Small germline homozygous regions, detected when more than 50 successive SNPs have a homozygous call (the 50-SNP length was set arbitrarily), form side clusters at the 0 and 1 boundaries of BAF scale. These germline homozygous regions can be easily distinguished from acquired LOH (see Figure <figr fid="F1">1d</figr>). Distances between germline and acquired homozygous clusters reflect the degree of tumor-sample purity <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Acquired and germline homozygosities cannot be distinguished in the case of pure tumor sample or (more often) cell line.</p>
            <p>It is worth mentioning that (a) allelic imbalance is often treated as LOH, whereas here only single allelic genotypes (A, AA, AAA...) were considered to have an LOH status; (b) although mirrored BAF (see Materials and methods) is used for all computational evaluations, the GAP structure is shown in a complete (symmetric) view for easier association with the initial SNP-array measurement (with symmetric BAF bands).</p>
         </sec>
         <sec>
            <st>
               <p>Influence of tumor dilution and heterogeneity on GAP pattern</p>
            </st>
            <p>Breast carcinomas frequently show a high degree of stromal contamination and heterogeneity seen on the GAP pattern (Figure <figr fid="F1">1d</figr>). The triangle-like figure formed by homozygous clusters has the following interpretation. <it>P</it>% of normal DNA adds some proportion of normal (AA, AB, or BB) signal to any measured value. However, (a) this proportion depends on the corresponding copy-number status of a region and, (b) germline homozygous regions would show a pure homozygous signal, whereas cancer homozygous regions (LOH) would show a shift caused by normal heterozygous signal addition. Cancer BAF is modeled depending on the proportion of normal DNA inclusion (<it>p</it>) as the weighted sum of B-allele counts in cancer and normal genotypes related to maximal possible B allele counts at current copy-number level (see Materials and methods). For example, the calculated level of normal stromal DNA in the BLC_B1_T45 sample is approximately 30%. Such BAF dynamics also were illustrated by Nancarrow and colleagues <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> by using computer simulations. The clear linear relation between the measured mirrored BAF (mBAF) and the level of contamination by normal tissue of the tumor sample was demonstrated in <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> in dilution series.</p>
            <p>The few isolated circles situated between one- and two-copy levels in Figure <figr fid="F1">1d</figr> could be attributed to losses occurring only in a fraction of the tumor cells (subclones). Following the logic of <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and using our model of BAF, the proportion of cancer cells harboring this event is approximately 26%. More complicated subclonal mixtures could produce various intermediates in LRR and BAF scales.</p>
            <p>The dynamics of change in LRR scale depend on numerous uncontrolled factors and show a high degree of variation from sample to sample. The significant dilution of a cancer sample by normal DNA clearly decreases the contrast (the amplitude of change in LRR corresponding to a copy-number change) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. However, universal linear dependence between LRR and contamination, similar to that for BAF, has not yet been described. The observed amplitude of LRR changes is usually smaller than expected by the initial model (log<sub>2</sub>(CN/2)), but the proportion between copy-number steps seems to be preserved for well-represented copy-number layers around the mean. LRR is therefore modeled by applying a simple coefficient of contraction <it>q </it>to the standard log ratio, which produces the sequence of LRR values: -<it>q</it>, 0, 0.58<it>q</it>, <it>q</it>, 1.32<it>q</it>, 2<it>q</it>,... for corresponding copy-number levels: 1, 2, 3, 4, 5,...</p>
            <p>Model GAP that follows theoretic values for BAF and LRR with estimated contamination <it>p </it>= 0.3 and coefficient of contraction <it>q </it>= 0.3 is superimposed onto the experimental GAP of BLC_B1_T45 sample in Figure <figr fid="F1">1e</figr>.</p>
         </sec>
         <sec>
            <st>
               <p>Diploid and tetraploid GAP patterns</p>
            </st>
            <p>The 40 SNP-array profiles of breast carcinomas and cell lines presented two main types of GAP pattern named "near-diploid" and "near-tetraploid" patterns (Figure <figr fid="F2">2a</figr> and <figr fid="F2">2b</figr>; for more examples, see Additional data file 1). The near-diploid pattern is characterized by a single balanced cluster with one layer of losses (Figures <figr fid="F1">1</figr> and <figr fid="F2">2a</figr>). The typical near-tetraploid pattern shown in Figure <figr fid="F2">2b</figr> has (a) two balanced modes representing a balanced heterozygous genotype on two- and four-copy levels (AB and AABB); (b) three-copy level (between balanced modes) with the full spectrum of allelic imbalances, including LOH (AAA, AAB, ABB, BBB); (c) few levels higher than four copies accounting for possible five, six, seven... copies.</p>
            <fig id="F2"><title><p>Figure 2</p></title><caption><p>Characteristic genome alteration print (GAP) patterns</p></caption><text>
   <p>Characteristic genome alteration print (GAP) patterns. Two characteristic <b>(a, b) </b>and one unique <b>(c) </b>GAP patterns obtained in the analysis of a breast carcinoma series: <b>(a) </b>near-diploid pattern, sample BLC_ T34; <b>(b) </b>near-tetraploid pattern, sample BLC_T09; and <b>(c) </b>possible near-triploid pattern, sample BLC_T10. Attribution of genotypes is based on the type of pattern; best-fitting models are shown.</p>
</text><graphic file="gb-2009-10-11-r128-2"/></fig>
            <p>The near-diploid pattern has genomic DNA mainly presented in one, two, and three copies, whereas the near-tetraploid pattern has the well-represented two-, three-, four-, and five-copy layers. These patterns appeared to be easily distinguished in the case of the high density of alteration events observed in the current series of the breast carcinomas. A unique type of GAP pattern in the series was observed in BLC_T10 (Figure <figr fid="F2">2c</figr>). This pattern is characterized by sparse balanced cluster (due to a single chromosome with balanced genotype) and very strong homozygous clusters on the 3-copy level. This may be interpreted as an almost pure triplication of a haploid genome, possibly similar to the triploid glioblastoma cases described in <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <p>DNA index and karyotype were used to verify correspondence between the interpretation of GAP pattern and the actual tumor genomic status. <it>In silico </it>DNA indexes inferred from SNP arrays were very close to actual tumor DNA indexes measured with flow cytometry (FCM) analysis for 16 of the 18 breast carcinoma samples tested (Table <tblr tid="T1">1</tblr>, Additional data file 1). The DNA index provided by FCM characterizes DNA content of tumor genome relative to normal diploid genome, which has a DNA index defined as 1. <it>In silico </it>DNA indexes were estimated by averaging segmental copy numbers (divided by 2), inferred from the GAP pattern. For 11 cases, the difference between actual and <it>in silico </it>DNA index was less than 0.1; for five cases, it was less than 0.3. With the exception of two outliers, this difference was always less than 0.5, which is the minimal absolute error in the case of wrong assignment of the overall copy-number scale (pattern shift on +1 or -1 copy). For the two outliers (BLC_B1_T22 and BLC_T34), GAP patterns were perfectly near-diploid with a clear contrast, making cluster misattribution unlikely. The discrepancy in DNA index estimation requires further biologic verification (for example, in the case of BLC_B1_T22, there might be a pure and possibly recent duplication of the diploid tumor cells as the <it>in silico </it>DNA index was equal to half of the experimental index).</p>
            <tbl id="T1"><title><p>Table 1</p></title><caption><p>Experimental and <it>in silico </it>DNA indexes and parameters of GAP model</p></caption><tblbdy cols="6">
      <r>
         <c ca="left">
            <p>
               <b>Sample ID</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>DNA index FCM</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>DNA index GAP</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>DNA index OverUnder</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Tumor content 1-<it>p</it><sub>BAF</sub></b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Contraction <it>q</it><sub>LRR</sub></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T14</p>
         </c>
         <c ca="center">
            <p>1.14</p>
         </c>
         <c ca="center">
            <p>0.85</p>
         </c>
         <c ca="center">
            <p>0.98</p>
         </c>
         <c ca="center">
            <p>0.85</p>
         </c>
         <c ca="center">
            <p>0.37</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T17</p>
         </c>
         <c ca="center">
            <p>0.84</p>
         </c>
         <c ca="center">
            <p>0.82</p>
         </c>
         <c ca="center">
            <p>0.97</p>
         </c>
         <c ca="center">
            <p>0.77</p>
         </c>
         <c ca="center">
            <p>0.17</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T19</p>
         </c>
         <c ca="center">
            <p>1.6</p>
         </c>
         <c ca="center">
            <p>1.63</p>
         </c>
         <c ca="center">
            <p>2.93</p>
         </c>
         <c ca="center">
            <p>0.4</p>
         </c>
         <c ca="center">
            <p>0.27</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T20</p>
         </c>
         <c ca="center">
            <p>1.41</p>
         </c>
         <c ca="center">
            <p>1.48</p>
         </c>
         <c ca="center">
            <p>3.06</p>
         </c>
         <c ca="center">
            <p>0.4</p>
         </c>
         <c ca="center">
            <p>0.2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T22<sup>a</sup></p>
         </c>
         <c ca="center">
            <p>1.98</p>
         </c>
         <c ca="center">
            <p>0.94</p>
         </c>
         <c ca="center">
            <p>1.02</p>
         </c>
         <c ca="center">
            <p>0.87</p>
         </c>
         <c ca="center">
            <p>0.44</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T07</p>
         </c>
         <c ca="center">
            <p>1.68</p>
         </c>
         <c ca="center">
            <p>1.49</p>
         </c>
         <c ca="center">
            <p>3.12</p>
         </c>
         <c ca="center">
            <p>0.44</p>
         </c>
         <c ca="center">
            <p>0.28</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T09</p>
         </c>
         <c ca="center">
            <p>2.02</p>
         </c>
         <c ca="center">
            <p>1.85</p>
         </c>
         <c ca="center">
            <p>1.89</p>
         </c>
         <c ca="center">
            <p>0.92</p>
         </c>
         <c ca="center">
            <p>0.47</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T10</p>
         </c>
         <c ca="center">
            <p>1.88</p>
         </c>
         <c ca="center">
            <p>1.9</p>
         </c>
         <c ca="center">
            <p>1.07</p>
         </c>
         <c ca="center">
            <p>0.95</p>
         </c>
         <c ca="center">
            <p>0.47</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T12</p>
         </c>
         <c ca="center">
            <p>1.51</p>
         </c>
         <c ca="center">
            <p>1.54</p>
         </c>
         <c ca="center">
            <p>2.56</p>
         </c>
         <c ca="center">
            <p>0.65</p>
         </c>
         <c ca="center">
            <p>0.35</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T15</p>
         </c>
         <c ca="center">
            <p>1.11</p>
         </c>
         <c ca="center">
            <p>0.89</p>
         </c>
         <c ca="center">
            <p>0.99</p>
         </c>
         <c ca="center">
            <p>0.74</p>
         </c>
         <c ca="center">
            <p>0.27</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T23</p>
         </c>
         <c ca="center">
            <p>1.32</p>
         </c>
         <c ca="center">
            <p>1.39</p>
         </c>
         <c ca="center">
            <p>2.72</p>
         </c>
         <c ca="center">
            <p>0.41</p>
         </c>
         <c ca="center">
            <p>0.21</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T31</p>
         </c>
         <c ca="center">
            <p>1.91</p>
         </c>
         <c ca="center">
            <p>1.84</p>
         </c>
         <c ca="center">
            <p>1.48</p>
         </c>
         <c ca="center">
            <p>0.84</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T34<sup>a</sup></p>
         </c>
         <c ca="center">
            <p>1.55</p>
         </c>
         <c ca="center">
            <p>0.99</p>
         </c>
         <c ca="center">
            <p>1.04</p>
         </c>
         <c ca="center">
            <p>0.87</p>
         </c>
         <c ca="center">
            <p>0.42</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T37</p>
         </c>
         <c ca="center">
            <p>1.51</p>
         </c>
         <c ca="center">
            <p>1.53</p>
         </c>
         <c ca="center">
            <p>1.44</p>
         </c>
         <c ca="center">
            <p>0.89</p>
         </c>
         <c ca="center">
            <p>0.44</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T24B</p>
         </c>
         <c ca="center">
            <p>1.84</p>
         </c>
         <c ca="center">
            <p>1.64</p>
         </c>
         <c ca="center">
            <p>2.61</p>
         </c>
         <c ca="center">
            <p>0.59</p>
         </c>
         <c ca="center">
            <p>0.29</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T25A</p>
         </c>
         <c ca="center">
            <p>1.00</p>
         </c>
         <c ca="center">
            <p>1.04</p>
         </c>
         <c ca="center">
            <p>3.03</p>
         </c>
         <c ca="center">
            <p>0.39</p>
         </c>
         <c ca="center">
            <p>0.17</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T30</p>
         </c>
         <c ca="center">
            <p>1.84</p>
         </c>
         <c ca="center">
            <p>1.83</p>
         </c>
         <c ca="center">
            <p>1.53</p>
         </c>
         <c ca="center">
            <p>0.78</p>
         </c>
         <c ca="center">
            <p>0.42</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T47</p>
         </c>
         <c ca="center">
            <p>1.00</p>
         </c>
         <c ca="center">
            <p>1.03</p>
         </c>
         <c ca="center">
            <p>1.47</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.17</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Two samples with clear near-diploid pattern of GAP and discordant experimental DNA indexes.</p>
   </tblfn></tbl>
            <p>Breast cancer cell lines with known karyotypes were used for another validation of GAP interpretation. The tetraploid breast cancer cell line MDA-MB-175-VII (MDA_175; <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>) has a clear near-tetraploid pattern of GAP (Figure <figr fid="F3">3a</figr>). The unique balanced cluster must be attributed to a four-copy level because two levels of losses visible below it could not account for 1 and 0 copies, but rather for 3 and 2 copies because of their positions and the absence of a normal contingent in the cell line. Circles on each side of the balanced cluster fit with AAAA and AAAB, and ABBB and BBBB genotypes, respectively, also implying a two-copy level. It is noteworthy that two-copy regions are represented exclusively by homozygous genotypes.</p>
            <fig id="F3"><title><p>Figure 3</p></title><caption><p>Genome alteration prints (GAPs) for breast cancer cell lines</p></caption><text>
   <p>Genome alteration prints (GAPs) for breast cancer cell lines. GAPs for breast cancer cell lines: <b>(a) </b>MDA_175; and <b>(b) </b>MDA_468. Both GAPs show a near-tetraploid pattern, and genotypes were assigned accordingly.</p>
</text><graphic file="gb-2009-10-11-r128-3"/></fig>
            <p>A near-tetraploid genome implies the number of chromosomes to be close to 92 (88 autosomes = two sets of diploid genomes). Copy-number summary for centromeric regions was considered a surrogate measure of chromosome number. As no SNP measurements can be performed at centromeres because of their highly repetitive DNA structure, pericentric regions were used to estimate the copy-number status of the chromosomes. The status of 39 pericentric regions (two for each of the 17 metacentric autosomes and one for each of the five acrocentric autosomes) was determined according to GAP. The number of autosomes in MDA_175 was estimated to be 86.5, which is close to the description in <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> (model number was 84 chromosomes; range, 82 to 89; verified on the cell line used for the SNP-array). Table <tblr tid="T2">2</tblr> shows the frequency of occurrence of the inferred copy number of pericentric regions (also for other tumor samples considered in this study, with more-detailed information presented in Additional data file 2). A similar analysis was performed with the MDA-MB-468 (MDA_468) cell line; this hypotetraploid breast cancer cell line (modal number, 64; range, 60 to 67) <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> showed a typical tetraploid GAP pattern (Figure <figr fid="F3">3b</figr>). Estimated autosome number (71.5) matched the description, and the slight overestimation was likely due to segmental amplification in one pericentric region (Table <tblr tid="T2">2</tblr> and Additional data file 2). Taken together, these results indicate correct local assignment with our approach.</p>
            <tbl id="T2"><title><p>Table 2</p></title><caption><p>Frequency of inferred copy numbers at pericentric regions and deduced autosome numbers</p></caption><tblbdy cols="11">
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8" ca="center">
            <p>
               <b>Copy number<sup>a</sup></b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Sample ID</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>1</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>2</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>3</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>4</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>5</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>6</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>7</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>8</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Autosome number</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Pattern<sup>b</sup></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="11">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>MDA_175</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>86.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>MDA_468<sup>c</sup></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>17</p>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>71.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T14</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>25</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>38</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T17</p>
         </c>
         <c ca="center">
            <p>14</p>
         </c>
         <c ca="center">
            <p>21</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>38.5</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T19</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>7</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>76.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T20</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>66.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_B1_T22</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>24</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>37.5</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T07</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>15</p>
         </c>
         <c ca="center">
            <p>13</p>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>70</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T09</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>13</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>85.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T10</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>21</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>87</p>
         </c>
         <c ca="center">
            <p>1.5</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T12</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>73.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T15</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>27</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>40</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T23</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>13</p>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>68.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T31</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>25</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>84.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T34</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>42</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>BLC_T37</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>70.5</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T24B</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>7</p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>72</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T25A</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>37</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T30</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>23</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>79</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>L_B1_T47</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>34</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Frequency of inferred copy numbers (1 to 8 are indicated) at pericentric regions. <sup>b</sup>1, 2, 1.5 indicates a near-diploid, near-tetraploid, and near-triploid patterns of Genome Alteration Print (GAP), respectively (Figure 2, Additional data file 1). <sup>c</sup>Estimated high chromosome copy number (= 8) is likely to result from a segmental amplification in one pericentric region, leading to overestimation of the autosome number in MDA_468.</p>
   </tblfn></tbl>
            <p>It should be noted that determination of the reference point for gain and loss attribution for complex highly rearranged cancer genomes is not always obvious, even with known patterns of rearrangements and absolute copy numbers. Samples displaying a near-diploid GAP pattern (as in Figure <figr fid="F2">2a</figr>) represent a simple situation, as their unique balanced cluster corresponding to 2-copy indicates the reference point. Near-tetraploid patterns with a unique balanced cluster at four copies (such as that of the cell line in Figure <figr fid="F3">3a</figr>) and inferred autosome numbers close to 88 indicate underlying tetraploidy, and it is logical to set the reference point to four copies in these cases. Underlying ploidy is less clear for intermediate DNA index or autosome number (between one and two, or 44 and 88, respectively), and the GAP shows a tetraploid pattern with two balanced clusters (as for BLC_B1_T19 and BLC_B1_T20 samples). Correct interpretation of gains and losses in such cases requires further biologic validation.</p>
         </sec>
         <sec>
            <st>
               <p>Automatic recognition of segmental copy numbers and genotypes</p>
            </st>
            <p>The GAP pattern can be easily mined by automatic procedures. This procedure includes (a) recognition of a GAP pattern and (b) assignment of segmental copy numbers and genotypes to a corresponding tumor genome based on this pattern. As described earlier, the GAP is characterized by two parameters: <it>p</it>, which is the proportion of tumor contamination by normal DNA affecting BAF values, and <it>q</it>, which is a coefficient of contraction of LRR values. The automatic recognition procedure searches for parameters and position of a model GAP that best fits to the experimental GAP. Quality of fitness is assessed by genome coverage in terms of number of SNPs that are explained by the model (see Material and methods for details). In other words, the model GAP template that most closely corresponds to the experimental GAP is selected. In the second round, the model GAP is used as the basis for interpretation of the experimental GAP, and segmental copy numbers and genotypes are assigned accordingly.</p>
            <p>The quality of pattern recognition was tested on 42 in-house samples, including the samples validated by DNA index. The procedure performed 41 correct and one erroneous recognitions, as compared with manual assessment. The problematic sample presented a high variance and low contrast, and the correct solution had a high but not the highest score. In general, the method tolerates contamination of tumor samples by normal DNA and experimental variations, as shown by correct recognition of our validated series with up to 60% of normal contamination and up to 0.17 contraction of LRR scale (see Table <tblr tid="T1">1</tblr>).</p>
            <p>We considered subclones as segments located essentially between designated clusters. They could be artefacts from incorrect segmentation, or true tumor heterogeneity. An interesting case is represented by sample BLC_T31. Its first interpretation was that of a near-tetraploid pattern, but its second interpretation with a very similar score was that of a near-diploid pattern because of poor representation of the three-copy level interpreted as subclones in the latter case. The DNA index determined by FCM indicated near-tetraploidy, supporting the first interpretation (see Additional data file 1).</p>
            <p>It should be stressed that (a) correct recognition requires good contrast between clusters and multiplicity of genetic events (for example, patterns consisting of AB and A&#8709; genotypes versus AABB and AA cannot be distinguished when no other evidence of a four-copy pattern exists); (b) the robustness of the quality criterion used in our method is not always satisfactory: the correct solution often differs from incorrect solutions by less than 1%; (c) the linear models used in the method diverge from experimental data in both the LRR and BAF scales when copy numbers were higher than 6-copy. However, no universal rule to correct this effect was identified on the basis of the 41 tumors examined.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative testing of GAP recognition</p>
            </st>
            <p>When characterizing rearrangements in tumor genome measured by SNP array, it is essential to extract from data (a) the degree of genomic instability displayed by the number and distribution of breakpoints, and (b) the type of each alteration. The GAP method is based on both LRR and BAF breakpoints and is therefore not directly suitable for breakpoint counting. To minimize double counting of a single breakpoint, LRR and BAF breakpoints separated by a small region (arbitrary defined as 10 SNPs) were simply merged. More complicated pooling of LRR and BAF breakpoints could allow more accurate breakpoint counting, and this would not be expected to influence the performance of the GAP method. Another way to address breakpoint detection in highly rearranged cancer genomes with possible low tumor content and noisy profiles is to use the GAP pattern as a source for secondary optimization.</p>
            <p>The GAP method is elaborated for determination of alteration events in <it>complex</it>, highly rearranged cancer genomes (in contrast, it would be of little help for interpretation of a stable genome with few amplifications). The methods specifically developed for analysis of cancer genomes include SOMATICS <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and BAFsegmentation <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, which reveal segments with allelic imbalances based on various models but do not produce copy numbers and genotypes. The OverUnder algorithm presented by Attiyeh and associates <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> estimates ploidy, as well as copy numbers and genotypes, and has been shown to outperform PennCNV, IlluminaCN Estimate, and CBS for the analysis of cancer genomes. We compared our automatic GAP fitting method with the OverUnder algorithm in terms of quality and consistency of recognition.</p>
            <p>The OverUnder algorithm (available as Illumina Beadstudio plug-in) was initially applied to our validated series of breast carcinomas to estimate the DNA indexes (Table <tblr tid="T1">1</tblr>). OverUnder results for seven samples clearly deviate from experimental data (Figure <figr fid="F4">4</figr>). These samples are characterized by high levels of normal DNA contamination, as estimated by the GAP model. The GAP method tolerated normal contamination, demonstrating better overall performance.</p>
            <fig id="F4"><title><p>Figure 4</p></title><caption><p>Comparison of genome alteration print (GAP) and OverUnder-based <it>in silico </it>DNA indexes with experimental DNA indexes</p></caption><text>
   <p>Comparison of genome alteration print (GAP) and OverUnder-based <it>in silico </it>DNA indexes with experimental DNA indexes. GAP indexes (blue circles) show excellent correspondence with experimental DNA indexes. OverUnder indexes (red triangles) show more outliers with overestimation of the DNA index. Both methods show consistent results, but not corresponding to the experimental DNA indexes (1.98 and 1.5) for two samples, designated by enlarged markers.</p>
</text><graphic file="gb-2009-10-11-r128-4"/></fig>
            <p>The self-consistency of the methods was tested on the basis of dilution series available in the GEO database (GEO:GSE11976) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The HCC1395/CRL2324 cell line <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> measured in this series is genetically complex and poorly defined. However, estimated copy numbers and LOH regions must be consistent for all CRL2324 samples with various proportions of tumor DNA. The results of the self-consistency test are presented in Table <tblr tid="T3">3</tblr> (more details in Additional data file 3). The better self-consistency of the GAP method is obvious in terms of copy numbers and LOH. Structural reproducibility of tumor GAP pattern with various proportions of normal DNA is illustrated in Additional data file 4.</p>
            <tbl id="T3"><title><p>Table 3</p></title><caption><p>Self-consistency of copy numbers and LOH in dilution series by using GAP and OverUnder analyses</p></caption><tblbdy cols="9">
      <r>
         <c ca="center">
            <p>
               <b>GAP</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>CN</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>LOH</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>DNA index</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Tumor DNA</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>1-<it>p </it>BAF</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b><it>q </it>LRR</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.45</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_79</p>
         </c>
         <c ca="center">
            <p>0.93</p>
         </c>
         <c ca="center">
            <p>0.98</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.46</p>
         </c>
         <c ca="center">
            <p>0.79</p>
         </c>
         <c ca="center">
            <p>0.8</p>
         </c>
         <c ca="center">
            <p>0.35</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_50</p>
         </c>
         <c ca="center">
            <p>0.9</p>
         </c>
         <c ca="center">
            <p>0.97</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.44</p>
         </c>
         <c ca="center">
            <p>0.5</p>
         </c>
         <c ca="center">
            <p>0.42</p>
         </c>
         <c ca="center">
            <p>0.22</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_47</p>
         </c>
         <c ca="center">
            <p>0.78</p>
         </c>
         <c ca="center">
            <p>0.96</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.49</p>
         </c>
         <c ca="center">
            <p>0.47</p>
         </c>
         <c ca="center">
            <p>0.42</p>
         </c>
         <c ca="center">
            <p>0.24</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_45</p>
         </c>
         <c ca="center">
            <p>0.81</p>
         </c>
         <c ca="center">
            <p>0.96</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.5</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.35</p>
         </c>
         <c ca="center">
            <p>0.18</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_34</p>
         </c>
         <c ca="center">
            <p>0.69</p>
         </c>
         <c ca="center">
            <p>0.93</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.53</p>
         </c>
         <c ca="center">
            <p>0.34</p>
         </c>
         <c ca="center">
            <p>0.27</p>
         </c>
         <c ca="center">
            <p>0.16</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_30</p>
         </c>
         <c ca="center">
            <p>0.73</p>
         </c>
         <c ca="center">
            <p>0.93</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.52</p>
         </c>
         <c ca="center">
            <p>0.3</p>
         </c>
         <c ca="center">
            <p>0.25</p>
         </c>
         <c ca="center">
            <p>0.12</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_23</p>
         </c>
         <c ca="center">
            <p>0.72</p>
         </c>
         <c ca="center">
            <p>0.93</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.59</p>
         </c>
         <c ca="center">
            <p>0.23</p>
         </c>
         <c ca="center">
            <p>0.26</p>
         </c>
         <c ca="center">
            <p>0.14</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_21</p>
         </c>
         <c ca="center">
            <p>0.7</p>
         </c>
         <c ca="center">
            <p>0.92</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1.64</p>
         </c>
         <c ca="center">
            <p>0.21</p>
         </c>
         <c ca="center">
            <p>0.14</p>
         </c>
         <c ca="center">
            <p>0.12</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>OverUnder</p>
         </c>
         <c ca="center">
            <p>CN</p>
         </c>
         <c ca="center">
            <p>LOH</p>
         </c>
         <c ca="center">
            <p>CN &#177; 1</p>
         </c>
         <c ca="center">
            <p>CN CBS</p>
         </c>
         <c ca="center">
            <p>DNA index</p>
         </c>
         <c ca="center">
            <p>Tumor DNA</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>2.48</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_79</p>
         </c>
         <c ca="center">
            <p>0.36</p>
         </c>
         <c ca="center">
            <p>0.94</p>
         </c>
         <c ca="center">
            <p>0.74</p>
         </c>
         <c ca="center">
            <p>0.46</p>
         </c>
         <c ca="center">
            <p>2.16</p>
         </c>
         <c ca="center">
            <p>0.79</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_50</p>
         </c>
         <c ca="center">
            <p>0.21</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.65</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>2.64</p>
         </c>
         <c ca="center">
            <p>0.5</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_47</p>
         </c>
         <c ca="center">
            <p>0.22</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.68</p>
         </c>
         <c ca="center">
            <p>0.1</p>
         </c>
         <c ca="center">
            <p>2.5</p>
         </c>
         <c ca="center">
            <p>0.47</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_45</p>
         </c>
         <c ca="center">
            <p>0.34</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.71</p>
         </c>
         <c ca="center">
            <p>0.11</p>
         </c>
         <c ca="center">
            <p>2.85</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_34</p>
         </c>
         <c ca="center">
            <p>0.24</p>
         </c>
         <c ca="center">
            <p>0.44</p>
         </c>
         <c ca="center">
            <p>0.56</p>
         </c>
         <c ca="center">
            <p>0.19</p>
         </c>
         <c ca="center">
            <p>2.57</p>
         </c>
         <c ca="center">
            <p>0.34</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_30</p>
         </c>
         <c ca="center">
            <p>0.14</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.48</p>
         </c>
         <c ca="center">
            <p>0.23</p>
         </c>
         <c ca="center">
            <p>2.54</p>
         </c>
         <c ca="center">
            <p>0.3</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_23</p>
         </c>
         <c ca="center">
            <p>0.31</p>
         </c>
         <c ca="center">
            <p>0.45</p>
         </c>
         <c ca="center">
            <p>0.68</p>
         </c>
         <c ca="center">
            <p>0.23</p>
         </c>
         <c ca="center">
            <p>2.51</p>
         </c>
         <c ca="center">
            <p>0.23</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRL2324_21</p>
         </c>
         <c ca="center">
            <p>0.07</p>
         </c>
         <c ca="center">
            <p>0.47</p>
         </c>
         <c ca="center">
            <p>0.12</p>
         </c>
         <c ca="center">
            <p>0.05</p>
         </c>
         <c ca="center">
            <p>1.11</p>
         </c>
         <c ca="center">
            <p>0.21</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>CN = copy number; LOH = loss of heterozygosity; tumor DNA = proportion of tumor DNA in the dilution; DNA index = <it>in silico </it>DNA index with each algorithm; 1-<it>p </it>BAF and <it>q </it>LRR are parameters of the model GAP; CN &#177; 1, copy numbers are considered to be consistent when the difference is less than or equal to 1; CN CBS, consistency is calculated on averaged (by median) and rounded copy-number assignments in CBS determined segments.</p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>GAP for Affymetrix SNP platform</p>
            </st>
            <p>Affymetrix GeneChip SNP 6.0 array was used to generate SNP profiles of the BLC_B1_T45 sample. The GAP was obtained according to the same strategy as for Illumina SNP data but by using the profile-recognition method described in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
            <p>Comparison of the data generated on these two platforms is shown in Figure <figr fid="F5">5</figr>. Affymetrix SNP measurements are represented by Log Copy Number Ratio and Allelic Differences as compared with Illumina LRR and BAF, respectively. Germline homozygous SNPs were omitted if fewer than 50 in a row, and are therefore represented by small clusters along two parallel lines at 0 and 1 limits of the BAF scale in an Illumina plot. Homozygous SNPs were always included in Affymetrix GAP and therefore formed large clusters represented along divergent diagonal lines (as allelic differences are dependent on copy-number levels) in the Affymetrix plot. Genome regions localized and attributed to a specific copy number in an Illumina-profiled genome were used to color code the regions in the Affymetrix SNP profile. Excellent concordance was observed between Affymetrix and Illumina patterns, as shown by relevant Illumina-derived color gradation on Affymetrix GAP. Visible differences in relative cluster sizes are due to different distributions of measured SNPs along the genome in Illumina and Affymetrix chips. The main conclusions from this comparison are (a) excellent correspondence between the two technologies in terms of copy-number variation; and (b) GAP can be used for analysis of complex cancer genomes on Affymetrix platforms.</p>
            <fig id="F5"><title><p>Figure 5</p></title><caption><p>Genome alteration print (GAP) for Affymetrix single-nucleotide polymorphism (SNP) GeneChip SNP 6.0 array</p></caption><text>
   <p>Genome alteration print (GAP) for Affymetrix single-nucleotide polymorphism (SNP) GeneChip SNP 6.0 array. BLC_B1_T45 tumor sample measured on two SNP-array platforms, analyzed by using GAP, and superimposed by color code: <b>(a) </b>GAP for Affymetrix; and <b>(b) </b>GAP for Illumina. Copy numbers obtained from the Illumina GAP were coded by colors indicated at the bottom of the Figure. Concordance between Affymetrix and Illumina patterns is illustrated by the relevant Illumina-derived color gradation on Affymetrix GAP. Germline homozygous regions are boxed. The main cluster patterns are indicated by hexagonal frames. The differences in relative cluster sizes are due to different distributions of SNPs measured along the genome in Illumina and Affymetrix chips.</p>
</text><graphic file="gb-2009-10-11-r128-5"/></fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>We present a method to mine complex genome alteration profiles measured with SNP-arrays. We introduce genome alteration print (GAP), a combined side-view projection of LRR and BAF segmented and smoothed profiles. The method, based on GAP pattern recognition, is fully automatic and provides segmental copy numbers and genotypes. It also estimates tumor-sample contamination by normal DNA. The method performs well, even for poor-quality data, low tumor content, and highly rearranged tumor genomes. Visualization of the GAP recognition pattern characterizes overall rearrangements in a tumor sample and can be used to verify the results. The GAP method is designed for Illumina SNP-array, but can be easily applied to Affymetrix SNP-arrays. This method could be a valuable tool to identify recurrent alterations in complex tumor-genome profiles.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Illumina arrays</p>
            </st>
            <p>A series of 40 breast carcinomas, including cases described in <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, was analyzed, as well as the breast cancer cell lines MDA-MB-175-VII (MDA_175) and MDA-MB-468 (MDA_468) <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. DNA was extracted from samples, and genomic profiling of the tumor samples was performed at Integragen <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> on 300K Illumina SNP-arrays (Human Hap300-Duo). SNP-array data are available through Gene Expression Omnibus <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> [GEO:GSE18799].</p>
         </sec>
         <sec>
            <st>
               <p>Data processing</p>
            </st>
            <p>Normalization of raw data was performed with Illumina Beadstudio software version 3.3 by using standard settings (all supporting files are provided by Illumina <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>). The normalization procedure tQN proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> also was used to make BAF symmetric.</p>
         </sec>
         <sec>
            <st>
               <p>LRR and BAF segmentation and construction of the GAP</p>
            </st>
            <p>The circular binary segmentation (CBS) algorithm (DNAcopy package, Bioconductor) <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B38">38</abbr></abbrgrp> was applied to LRR and filtered BAF data separately to define breakpoints (the minimal level of significance was defined as 10<sup>-2 </sup>for LRR and 10<sup>-3 </sup>for BAF profiles). Smoothing of outliers was performed in both cases. LRR was smoothed by the median between breakpoints. To obtain one banded BAF profile, (a) non-informative homozygous SNPs were filtered out, based on the threshold (mBAF &gt; 0.97), as suggested in <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>; (b) tQN normalized and reflected relative to the 0.5 axis version of BAF, named mirrored BAF (mBAF) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, was segmented. In addition, the boundaries of germline homozygous regions, detected when more than 50 successive SNPs had a homozygous call (the number of SNPs was set arbitrarily), were included into the set of breakpoints. The mode estimation (dip-test package, <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>) was used for smoothing of the mBAF profile to maintain the contrast between balanced and slightly shifted imbalances.</p>
            <p>Any region between two consecutive breakpoints from the LRR and BAF breakpoint mixture was considered to be an alteration unit (possibly unaltered) and characterized by (a) the averaged LRR, (b) the mode of mBAF distribution, and (c) the length of the corresponding region (in SNP counts). The list and the two-dimensional visualization of all alteration units of a measured sample were denoted the genome alteration print (GAP). For GAP visualization, each alteration unit was represented by a circle centered on BAF (x-axis) and LRR (y-axis) smoothed values, and the radius was scaled to the relative size of the corresponding chromosome region.</p>
         </sec>
         <sec>
            <st>
               <p>Comments on stability of GAP</p>
            </st>
            <p>The CBS algorithm was used to favor sensitivity over specificity in the breakpoint-detection process, as "false" breakpoints do not significantly change the overall GAP pattern. False alteration units often appeared as artefacts at joining LRR and BAF breakpoints, but were not visible, provided the true alteration units were significantly longer. More problems were observed when the robust profile estimators were applied to poor-quality data: the absence of true breakpoints could significantly alter the GAP pattern.</p>
         </sec>
         <sec>
            <st>
               <p>Model GAP</p>
            </st>
            <p>The model GAP was determined by the independent combination of BAF and LRR models. The BAF model was used to determine the position of clusters on the horizontal scale, and the LRR model was used to determine the <it>relative </it>position of clusters on the vertical scale.</p>
            <p>The cancer BAF was modeled as the weighted sum of B-allele counts in cancer and normal genotypes, as a ratio of the maximal possible B allele counts at the current copy-number level:</p>
            <p>
               <display-formula>
                  <graphic file="gb-2009-10-11-r128-i1.gif"/>
               </display-formula>
            </p>
            <p>where <it>p </it>is normal DNA proportion (and hence (1 - <it>p</it>) is tumor DNA proportion); <inline-formula><graphic file="gb-2009-10-11-r128-i2.gif"/></inline-formula> and <inline-formula><graphic file="gb-2009-10-11-r128-i3.gif"/></inline-formula> are the B and A allele counts in the tumor genotype; (<inline-formula><graphic file="gb-2009-10-11-r128-i2.gif"/></inline-formula> + <inline-formula><graphic file="gb-2009-10-11-r128-i3.gif"/></inline-formula>) is considered to be the copy-number level; and <inline-formula><graphic file="gb-2009-10-11-r128-i4.gif"/></inline-formula> is the B allele count in normal genome (<inline-formula><graphic file="gb-2009-10-11-r128-i4.gif"/></inline-formula> = 0, 1, 2). A similar model was described in <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B24">24</abbr></abbrgrp>; a model proposed in <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> could also be used for GAP-method settings.</p>
            <p>To estimate normal DNA contamination in a measured tumor sample, at least one cluster annotation (copy number and genotype) and its position on the BAF scale must be known. For example, projections of cluster centers in the experimental GAP pattern (Figure <figr fid="F1">1e</figr>) were assessed to be as follows: <it>BAF</it><sup><it>M </it></sup>= 0.765 for the B cluster, <it>BAF</it><sup><it>M </it></sup>= 0.845 for the BB cluster, <it>BAF</it><sup><it>M </it></sup>= 0.885 for the BBB cluster, and <it>BAF</it><sup><it>M </it></sup>= 0.628 for the ABB cluster. Substitution of <it>BAF</it><sup><it>M</it></sup>, B allele counts, and copy numbers in the model provides an estimation of the contamination coefficient <it>p </it>= 0.307, 0.31, 0.309, and 0.312, respectively. As expected, inferred coefficients were very close to each other and estimated the normal DNA contamination around 30% for this sample.</p>
            <p>The same method was used to estimate the proportion of tumor cells bearing a given rearrangement (subclone); in the case shown in Figure <figr fid="F1">1d</figr>: <it>BAF</it><sup><it>M </it></sup>= 0.575, <inline-formula><graphic file="gb-2009-10-11-r128-i2.gif"/></inline-formula> = 1, <inline-formula><graphic file="gb-2009-10-11-r128-i2.gif"/></inline-formula>+<inline-formula><graphic file="gb-2009-10-11-r128-i3.gif"/></inline-formula> = 1, <inline-formula><graphic file="gb-2009-10-11-r128-i4.gif"/></inline-formula> = 1 gave the normal content estimation <it>p </it>&#8776; 0.74 and hence the tumor content was 1 - <it>p </it>&#8776; 0.26.</p>
            <p>LRR was modeled by applying a simple coefficient of contraction <it>q </it>to the standard log ratio:<inline-formula><graphic file="gb-2009-10-11-r128-i5.gif"/></inline-formula>; <it>n </it>is the copy number, which produces the sequence of LRR values: -<it>q</it>, 0, 0.58<it>q</it>, <it>q</it>, 1.32<it>q</it>, 2<it>q</it>,... for corresponding copy number levels: 1, 2, 3, 4, 5... The LRR of zero copy (homozygous deletion) was arbitrarily set at -3<it>q </it>(log<sub>2</sub>0 = - &#8734;, variation in real LRR is usually very large and not followed by the model).</p>
         </sec>
         <sec>
            <st>
               <p>Fitting model GAP and copy number and genotype recognition</p>
            </st>
            <p>Automatic recognition of the tumor GAP pattern consisted of an exhaustive search for (a) the best centering of the model GAP on the LRR scale for each pair of contamination proportion (<it>p</it>) and coefficient of contraction (<it>q</it>), and (b) the best (<it>p, q</it>) couple satisfying a few necessary conditions. The genome coverage in terms of the number of SNPs explained by the model was used as the quality criterion. The necessary conditions were used to filter unusual interpretations.</p>
            <p>GAP pattern-recognition algorithm: 1) Initiation of a grid with 0.005 cell dimension on the BAF &#215; LRR plane and definition of the densities of alteration units in SNP counts on the grid; 2) Smoothing of the densities by averaging adjacent cells and filtering of low densities to enhance the contrast (densities were set to 0 in 95 to 98% of cells in the grid); 3) Choosing model parameters (<it>p, p </it>&#8712; {0,0.02, 0.04, ..., 0.86};<it>q</it>,<inline-formula><graphic file="gb-2009-10-11-r128-i6.gif"/></inline-formula>) and setting of the GAP template with one to five copies by determining the centers and sizes of model clusters on the grid; 4) For a given pair (<it>p, q</it>), searching for the best centering of the GAP template on the grid in terms of maximal density falling into designated clusters; 5) Checking all possible combinations of <it>p </it>and <it>q </it>and ranking templates; 6) Filtering of templates according to necessary conditions. This removes from further consideration redundant interpretations with many empty clusters; 7) Choosing the best interpretation, superimposing the model GAP to the experimental one, and ascribing copy number and genotype to each alteration unit after the annotation of its closest cluster on the template.</p>
            <p>In the case of low contrast between clusters, additional adjustments of recognition are necessary to attribute correctly the alteration units located between designated clusters. A confidence score is attributed to all alteration units (depending on the distance to the nearest model cluster(s)), and the <it>linear </it>copy number and genotype profiles are adjusted by keeping confident assignments and correcting less-confident assignments.</p>
         </sec>
         <sec>
            <st>
               <p>Experimental estimation of ploidy and karyotyping</p>
            </st>
            <p>The DNA content of tumor samples was obtained with flow cytometry (FCM) analysis after propidium iodine staining, as described in <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The DNA index is equal to 1 for normal diploid cells. A karyotype of MDA_175 was obtained by a routine procedure <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Estimation of DNA content and chromosome number based on SNP data</p>
            </st>
            <p>The inferred copy-number profile was averaged along the genome, providing the DNA content of the corresponding cancer sample.</p>
            <p>Chromosome copy numbers were characterized by the status of pericentric regions, defined as the alteration units directly before or after the centromeric part of the chromosome (which has no SNP measurement <it>per se</it>). The definition of pericentric region therefore depends on the SNP chip used for genotyping. Regions less than 10 SNPs were ignored. If the pericentric alteration unit is a small region (less than 100 SNPs), setting the chromosome copy number on the basis of this alteration unit could be erroneous and could therefore interfere with karyotype assessment.</p>
         </sec>
         <sec>
            <st>
               <p>Dilution series</p>
            </st>
            <p>The dilution series described in <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, measured by Illumina 370 K array and available in the GEO database <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> [GEO:GSE11976], were processed in a similar way to in-house tumor samples: (a) normalization by the method proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>; (b) segmentation of LRR and BAF profiles by CBS, in the same way as described in subsection 3 of the Materials and Methods; (c) construction of GAP, GAP pattern recognition, and copy-number assignment.</p>
         </sec>
         <sec>
            <st>
               <p>Results of the OverUnder <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> algorithm</p>
            </st>
            <p>The OverUnder plug-in was applied to the data normalized in BeadStudio 3.3, with window length equal to 51. OverUnder produces continuous copy-number values, which were rounded to discrete values and then summarized in comparison tables. As rounding can introduce artificial discrepancies, the procedure was slightly modified so that copy numbers were considered to be equal when they differed by no more than 1 unit (column CN &#177; 1, Table <tblr tid="T3">3</tblr>). CBS sections also were used to average (by median) and to round copy-number assignments (column CN CBS, Table <tblr tid="T3">3</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Affymetrix SNP data</p>
            </st>
            <p>One BLC sample also was analyzed on the Genome-Wide Human SNP-array 6.0, according to the manufacturer's instructions (Affymetrix Inc., Santa Clara, CA). Normalization was performed by using the Genotyping Console&#8482; (Affymetrix), and profile recognition was performed by using the method described in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Availability</p>
         </st>
         <p>An implementation of the proposed GAP pattern recognition and detection of copy numbers and genotypes based on segmented profiles is available, together with the supporting data <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. SNP array data for the 19 primary tumors and the two cell lines shown here are available through Gene Expression Omnibus <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> [GEO:GSE18799].</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>BAF: B allele frequency; BLC: basal-like breast carcinoma; CBS: circular binary segmentation; CN: copy number; CNV: copy-number variation; FCM: flow cytometry; LOH: loss of heterozygosity; LRR: Log R ratios; mBAF: mirrored BAF; MDA_175: MDA-MB-175-VII breast cancer cell line; MDA_468: MDA-MB-468 breast cancer cell line; SNP: single-nucleotide polymorphism.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are included with the online version of this article.</p>
         <p>A table of images of GAP patterns and copy-number recognition templates for a series of breast carcinomas with available DNA indexes (Additional data file <supplr sid="S1">1</supplr>), a table listing copy-number status of pericentric regions inferred on the basis of GAP pattern for a series of breast carcinomas and cell lines (Additional data file <supplr sid="S2">2</supplr>), two tables indicating self-consistency in copy-number attribution in dilution series calculated for two methods of recognition: GAP method and OverUnder algorithm (Additional data file <supplr sid="S3">3</supplr>), and GAP patterns and copy-number recognition templates for the dilution series of cell line CRL2324 (Additional data file <supplr sid="S4">4</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>GAP patterns and copy-number recognition templates</p>
            </caption>
            <text>
               <p>A table of images of GAP patterns and copy-number recognition templates for a series of breast carcinomas with available DNA indexes</p>
            </text>
            <file name="gb-2009-10-11-r128-S1.pdf">
   <p>Click here for file</p>
</file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Copy-number status of pericentric regions</p>
            </caption>
            <text>
               <p>A table listing copy-number status of pericentric regions inferred on the basis of GAP pattern for a series of breast carcinomas and cell lines</p>
            </text>
            <file name="gb-2009-10-11-r128-S2.xls">
   <p>Click here for file</p>
</file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Self-consistency in copy-number attribution in dilution series</p>
            </caption>
            <text>
               <p>two tables indicating self-consistency in copy-number attribution in dilution series calculated for two methods of recognition: GAP method and OverUnder algorithm</p>
            </text>
            <file name="gb-2009-10-11-r128-S3.doc">
   <p>Click here for file</p>
</file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Cell line CRL2324</p>
            </caption>
            <text>
               <p>GAP patterns and copy-number recognition templates for the dilution series of cell line CRL2324</p>
            </text>
            <file name="gb-2009-10-11-r128-S4.pdf">
   <p>Click here for file</p>
</file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>TP performed research, analyzed data, and wrote the paper; EM performed experiments and analyzed data; DSL designed the general project; GR provided the Affymetrix data and analyzed data; EB supervised bioinformatics analyses; and MHS designed the research and wrote the paper. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank X. Sastre-Garau, N. Gruel, and I. Lebigot for providing breast cancer DNAs, T. Dubois for giving us access to data, J. Couturier for karyotype data, A. Vincent-Salomon for tumor characterization, A. Zinovyev for helpful discussion and support, and C. Lucchesi and S. Pook for critical reading. This work is part of the "Canc&#233;rop&#244;le Ile-de-France-R&#233;gion Ile-de-France-Hereditary Breast Cancer" program coordinated by DSL and MHS. This work also was supported by the INSERM, the Institut Curie, and its Translational Research Department. TP is supported by a grant from the Canc&#233;rop&#244;le Ile-de-France-R&#233;gion Ile-de-France. GR is supported by a grant from the Institut National du Cancer (INCa).</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>Chromosome aberrations in solid tumors.</p></title><aug><au><snm>Albertson</snm><fnm>DG</fnm></au><au><snm>Collins</snm><fnm>C</fnm></au><au><snm>McCormick</snm><fnm>F</fnm></au><au><snm>Gray</snm><fnm>JW</fnm></au></aug><source>Nat Genet</source><pubdate>2003</pubdate><volume>34</volume><fpage>369</fpage><lpage>376</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1215</pubid><pubid idtype="pmpid" link="fulltext">12923544</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Genomic and transcriptional aberrations linked to breast cancer pathophysiologies.</p></title><aug><au><snm>Chin</snm><fnm>K</fnm></au><au><snm>DeVries</snm><fnm>S</fnm></au><au><snm>Fridlyand</snm><fnm>J</fnm></au><au><snm>Spellman</snm><fnm>PT</fnm></au><au><snm>Roydasgupta</snm><fnm>R</fnm></au><au><snm>Kuo</snm><fnm>WL</fnm></au><au><snm>Lapuk</snm><fnm>A</fnm></au><au><snm>Neve</snm><fnm>RM</fnm></au><au><snm>Qian</snm><fnm>Z</fnm></au><au><snm>Ryder</snm><fnm>T</fnm></au><au><snm>Chen</snm><fnm>F</fnm></au><au><snm>Feiler</snm><fnm>H</fnm></au><au><snm>Tokuyasu</snm><fnm>T</fnm></au><au><snm>Kingsley</snm><fnm>C</fnm></au><au><snm>Dairkee</snm><fnm>S</fnm></au><au><snm>Meng</snm><fnm>Z</fnm></au><au><snm>Chew</snm><fnm>K</fnm></au><au><snm>Pinkel</snm><fnm>D</fnm></au><au><snm>Jain</snm><fnm>A</fnm></au><au><snm>Ljung</snm><fnm>BM</fnm></au><au><snm>Esserman</snm><fnm>L</fnm></au><au><snm>Albertson</snm><fnm>DG</fnm></au><au><snm>Waldman</snm><fnm>FM</fnm></au><au><snm>Gray</snm><fnm>JW</fnm></au></aug><source>Cancer Cell</source><pubdate>2006</pubdate><volume>10</volume><fpage>529</fpage><lpage>541</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ccr.2006.10.009</pubid><pubid idtype="pmpid" link="fulltext">17157792</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Using high-throughput SNP technologies to study cancer.</p></title><aug><au><snm>Engle</snm><fnm>LJ</fnm></au><au><snm>Simpson</snm><fnm>CL</fnm></au><au><snm>Landers</snm><fnm>JE</fnm></au></aug><source>Oncogene</source><pubdate>2006</pubdate><volume>25</volume><fpage>1594</fpage><lpage>1601</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1209368</pubid><pubid idtype="pmpid" link="fulltext">16550159</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Emerging paradigms in cancer genetics: some important findings from high-density single nucleotide polymorphism array studies.</p></title><aug><au><snm>Bacolod</snm><fnm>MD</fnm></au><au><snm>Schemmann</snm><fnm>GS</fnm></au><au><snm>Giardina</snm><fnm>SF</fnm></au><au><snm>Paty</snm><fnm>P</fnm></au><au><snm>Notterman</snm><fnm>DA</fnm></au><au><snm>Barany</snm><fnm>F</fnm></au></aug><source>Cancer Res</source><pubdate>2009</pubdate><volume>69</volume><fpage>723</fpage><lpage>727</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-08-3543</pubid><pubid idtype="pmpid" link="fulltext">19155292</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Single nucleotide polymorphism array analysis of cancer.</p></title><aug><au><snm>Dutt</snm><fnm>A</fnm></au><au><snm>Beroukhim</snm><fnm>R</fnm></au></aug><source>Curr Opin Oncol</source><pubdate>2007</pubdate><volume>19</volume><fpage>43</fpage><lpage>49</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/CCO.0b013e328011a8c1</pubid><pubid idtype="pmpid" link="fulltext">17133111</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Affymetrix</p></title><url>http://www.affymetrix.com</url></bibl><bibl id="B7"><title><p>Illumina</p></title><url>http://www.illumina.com</url></bibl><bibl id="B8"><title><p>High-resolution copy number analysis of paraffin-embedded archival tissue using SNP BeadArrays.</p></title><aug><au><snm>Oosting</snm><fnm>J</fnm></au><au><snm>Lips</snm><fnm>EH</fnm></au><au><snm>van Eijk</snm><fnm>R</fnm></au><au><snm>Eilers</snm><fnm>PH</fnm></au><au><snm>Szuhai</snm><fnm>K</fnm></au><au><snm>Wijmenga</snm><fnm>C</fnm></au><au><snm>Morreau</snm><fnm>H</fnm></au><au><snm>van Wezel</snm><fnm>T</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><fpage>368</fpage><lpage>376</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.5686107</pubid><pubid idtype="pmcid">1800928</pubid><pubid idtype="pmpid">17267813</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios.</p></title><aug><au><snm>Staaf</snm><fnm>J</fnm></au><au><snm>Vallon-Christersson</snm><fnm>J</fnm></au><au><snm>Lindgren</snm><fnm>D</fnm></au><au><snm>Juliusson</snm><fnm>G</fnm></au><au><snm>Rosenquist</snm><fnm>R</fnm></au><au><snm>Hoglund</snm><fnm>M</fnm></au><au><snm>Borg</snm><fnm>A</fnm></au><au><snm>Ringner</snm><fnm>M</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>409</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-409</pubid><pubid idtype="pmcid">2572624</pubid><pubid idtype="pmpid">18831757</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays.</p></title><aug><au><snm>Rigaill</snm><fnm>G</fnm></au><au><snm>Hupe</snm><fnm>P</fnm></au><au><snm>Almeida</snm><fnm>A</fnm></au><au><snm>La Rosa</snm><fnm>P</fnm></au><au><snm>Meyniel</snm><fnm>JP</fnm></au><au><snm>Decraene</snm><fnm>C</fnm></au><au><snm>Barillot</snm><fnm>E</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><fpage>768</fpage><lpage>774</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn048</pubid><pubid idtype="pmpid" link="fulltext">18252739</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Smoothing waves in array CGH tumor profiles.</p></title><aug><au><snm>Wiel</snm><mnm>van de</mnm><fnm>MA</fnm></au><au><snm>Brosens</snm><fnm>R</fnm></au><au><snm>Eilers</snm><fnm>PH</fnm></au><au><snm>Kumps</snm><fnm>C</fnm></au><au><snm>Meijer</snm><fnm>GA</fnm></au><au><snm>Menten</snm><fnm>B</fnm></au><au><snm>Sistermans</snm><fnm>E</fnm></au><au><snm>Speleman</snm><fnm>F</fnm></au><au><snm>Timmerman</snm><fnm>ME</fnm></au><au><snm>Ylstra</snm><fnm>B</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><fpage>1099</fpage><lpage>1104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp132</pubid><pubid idtype="pmpid" link="fulltext">19276148</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>A faster circular binary segmentation algorithm for the analysis of array CGH data.</p></title><aug><au><snm>Venkatraman</snm><fnm>ES</fnm></au><au><snm>Olshen</snm><fnm>AB</fnm></au></aug><source>Bioinformatics</source><pubdate>2007</pubdate><volume>23</volume><fpage>657</fpage><lpage>663</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl646</pubid><pubid idtype="pmpid" link="fulltext">17234643</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Analysis of array CGH data: from signal ratio to gain and loss of DNA regions.</p></title><aug><au><snm>Hupe</snm><fnm>P</fnm></au><au><snm>Stransky</snm><fnm>N</fnm></au><au><snm>Thiery</snm><fnm>JP</fnm></au><au><snm>Radvanyi</snm><fnm>F</fnm></au><au><snm>Barillot</snm><fnm>E</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><fpage>3413</fpage><lpage>3422</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth418</pubid><pubid idtype="pmpid" link="fulltext">15381628</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>A statistical approach for array CGH data analysis.</p></title><aug><au><snm>Picard</snm><fnm>F</fnm></au><au><snm>Robin</snm><fnm>S</fnm></au><au><snm>Lavielle</snm><fnm>M</fnm></au><au><snm>Vaisse</snm><fnm>C</fnm></au><au><snm>Daudin</snm><fnm>JJ</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2005</pubdate><volume>6</volume><fpage>27</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-6-27</pubid><pubid idtype="pmcid">549559</pubid><pubid idtype="pmpid">15705208</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Ploidy status and copy number aberrations in primary glioblastomas defined by integrated analysis of allelic ratios, signal ratios and loss of heterozygosity using 500K SNP Mapping Arrays.</p></title><aug><au><snm>Gardina</snm><fnm>PJ</fnm></au><au><snm>Lo</snm><fnm>KC</fnm></au><au><snm>Lee</snm><fnm>W</fnm></au><au><snm>Cowell</snm><fnm>JK</fnm></au><au><snm>Turpaz</snm><fnm>Y</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>489</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-489</pubid><pubid idtype="pmcid">2576260</pubid><pubid idtype="pmpid">18928532</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Genomic copy number determination in cancer cells from single nucleotide polymorphism microarrays based on quantitative genotyping corrected for aneuploidy.</p></title><aug><au><snm>Attiyeh</snm><fnm>EF</fnm></au><au><snm>Diskin</snm><fnm>SJ</fnm></au><au><snm>Attiyeh</snm><fnm>MA</fnm></au><au><snm>Mosse</snm><fnm>YP</fnm></au><au><snm>Hou</snm><fnm>C</fnm></au><au><snm>Jackson</snm><fnm>EM</fnm></au><au><snm>Kim</snm><fnm>C</fnm></au><au><snm>Glessner</snm><fnm>J</fnm></au><au><snm>Hakonarson</snm><fnm>H</fnm></au><au><snm>Biegel</snm><fnm>JA</fnm></au><au><snm>Maris</snm><fnm>JM</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>276</fpage><lpage>283</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.075671.107</pubid><pubid idtype="pmcid">2652209</pubid><pubid idtype="pmpid">19141597</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Segmentation-based detection of allelic imbalance and loss-of-heterozygosity in cancer cells using whole genome SNP arrays.</p></title><aug><au><snm>Staaf</snm><fnm>J</fnm></au><au><snm>Lindgren</snm><fnm>D</fnm></au><au><snm>Vallon-Christersson</snm><fnm>J</fnm></au><au><snm>Isaksson</snm><fnm>A</fnm></au><au><snm>Goransson</snm><fnm>H</fnm></au><au><snm>Juliusson</snm><fnm>G</fnm></au><au><snm>Rosenquist</snm><fnm>R</fnm></au><au><snm>Hoglund</snm><fnm>M</fnm></au><au><snm>Borg</snm><fnm>A</fnm></au><au><snm>Ringner</snm><fnm>M</fnm></au></aug><source>Genome Biol</source><pubdate>2008</pubdate><volume>9</volume><fpage>R136</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2008-9-9-r136</pubid><pubid idtype="pmcid">2592714</pubid><pubid idtype="pmpid">18796136</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>SNP arrays in heterogeneous tissue: highly accurate collection of both germline and somatic genetic information from unpaired single tumor samples.</p></title><aug><au><snm>Assie</snm><fnm>G</fnm></au><au><snm>LaFramboise</snm><fnm>T</fnm></au><au><snm>Platzer</snm><fnm>P</fnm></au><au><snm>Bertherat</snm><fnm>J</fnm></au><au><snm>Stratakis</snm><fnm>CA</fnm></au><au><snm>Eng</snm><fnm>C</fnm></au></aug><source>Am J Hum Genet</source><pubdate>2008</pubdate><volume>82</volume><fpage>903</fpage><lpage>915</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ajhg.2008.01.012</pubid><pubid idtype="pmcid">2427213</pubid><pubid idtype="pmpid">18355774</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>CARAT: a novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays.</p></title><aug><au><snm>Huang</snm><fnm>J</fnm></au><au><snm>Wei</snm><fnm>W</fnm></au><au><snm>Chen</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Liu</snm><fnm>G</fnm></au><au><snm>Di</snm><fnm>X</fnm></au><au><snm>Mei</snm><fnm>R</fnm></au><au><snm>Ishikawa</snm><fnm>S</fnm></au><au><snm>Aburatani</snm><fnm>H</fnm></au><au><snm>Jones</snm><fnm>KW</fnm></au><au><snm>Shapero</snm><fnm>MH</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2006</pubdate><volume>7</volume><fpage>83</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-7-83</pubid><pubid idtype="pmcid">1402331</pubid><pubid idtype="pmpid">16504045</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>A hidden Markov model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.</p></title><aug><au><snm>Lamy</snm><fnm>P</fnm></au><au><snm>Andersen</snm><fnm>CL</fnm></au><au><snm>Dyrskjot</snm><fnm>L</fnm></au><au><snm>Torring</snm><fnm>N</fnm></au><au><snm>Wiuf</snm><fnm>C</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2007</pubdate><volume>8</volume><fpage>434</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-8-434</pubid><pubid idtype="pmcid">2206057</pubid><pubid idtype="pmpid">17996079</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Major copy proportion analysis of tumor samples using SNP arrays.</p></title><aug><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Beroukhim</snm><fnm>R</fnm></au><au><snm>Weir</snm><fnm>BA</fnm></au><au><snm>Winckler</snm><fnm>W</fnm></au><au><snm>Garraway</snm><fnm>LA</fnm></au><au><snm>Sellers</snm><fnm>WR</fnm></au><au><snm>Meyerson</snm><fnm>M</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>204</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-204</pubid><pubid idtype="pmcid">2375907</pubid><pubid idtype="pmpid">18426588</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.</p></title><aug><au><snm>Wang</snm><fnm>K</fnm></au><au><snm>Li</snm><fnm>M</fnm></au><au><snm>Hadley</snm><fnm>D</fnm></au><au><snm>Liu</snm><fnm>R</fnm></au><au><snm>Glessner</snm><fnm>J</fnm></au><au><snm>Grant</snm><fnm>SF</fnm></au><au><snm>Hakonarson</snm><fnm>H</fnm></au><au><snm>Bucan</snm><fnm>M</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><fpage>1665</fpage><lpage>1674</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.6861907</pubid><pubid idtype="pmcid">2045149</pubid><pubid idtype="pmpid">17921354</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Estimation and assessment of raw copy numbers at the single locus level.</p></title><aug><au><snm>Bengtsson</snm><fnm>H</fnm></au><au><snm>Irizarry</snm><fnm>R</fnm></au><au><snm>Carvalho</snm><fnm>B</fnm></au><au><snm>Speed</snm><fnm>TP</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><fpage>759</fpage><lpage>767</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn016</pubid><pubid idtype="pmpid" link="fulltext">18204055</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>SiDCoN: a tool to aid scoring of DNA copy number changes in SNP chip data.</p></title><aug><au><snm>Nancarrow</snm><fnm>DJ</fnm></au><au><snm>Handoko</snm><fnm>HY</fnm></au><au><snm>Stark</snm><fnm>MS</fnm></au><au><snm>Whiteman</snm><fnm>DC</fnm></au><au><snm>Hayward</snm><fnm>NK</fnm></au></aug><source>PLoS ONE</source><pubdate>2007</pubdate><volume>2</volume><fpage>e1093</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0001093</pubid><pubid idtype="pmcid">2034603</pubid><pubid idtype="pmpid">17971856</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Molecular definition of breast tumor heterogeneity.</p></title><aug><au><snm>Shipitsin</snm><fnm>M</fnm></au><au><snm>Campbell</snm><fnm>LL</fnm></au><au><snm>Argani</snm><fnm>P</fnm></au><au><snm>Weremowicz</snm><fnm>S</fnm></au><au><snm>Bloushtain-Qimron</snm><fnm>N</fnm></au><au><snm>Yao</snm><fnm>J</fnm></au><au><snm>Nikolskaya</snm><fnm>T</fnm></au><au><snm>Serebryiskaya</snm><fnm>T</fnm></au><au><snm>Beroukhim</snm><fnm>R</fnm></au><au><snm>Hu</snm><fnm>M</fnm></au><au><snm>Halushka</snm><fnm>MK</fnm></au><au><snm>Sukumar</snm><fnm>S</fnm></au><au><snm>Parker</snm><fnm>LM</fnm></au><au><snm>Anderson</snm><fnm>KS</fnm></au><au><snm>Harris</snm><fnm>LN</fnm></au><au><snm>Garber</snm><fnm>JE</fnm></au><au><snm>Richardson</snm><fnm>AL</fnm></au><au><snm>Schnitt</snm><fnm>SJ</fnm></au><au><snm>Nikolsky</snm><fnm>Y</fnm></au><au><snm>Gelman</snm><fnm>RS</fnm></au><au><snm>Polyak</snm><fnm>K</fnm></au></aug><source>Cancer Cell</source><pubdate>2007</pubdate><volume>11</volume><fpage>259</fpage><lpage>273</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ccr.2007.01.013</pubid><pubid idtype="pmpid" link="fulltext">17349583</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Germline BRCA1 mutations and a basal epithelial phenotype in breast cancer.</p></title><aug><au><snm>Foulkes</snm><fnm>WD</fnm></au><au><snm>Stefansson</snm><fnm>IM</fnm></au><au><snm>Chappuis</snm><fnm>PO</fnm></au><au><snm>Begin</snm><fnm>LR</fnm></au><au><snm>Goffin</snm><fnm>JR</fnm></au><au><snm>Wong</snm><fnm>N</fnm></au><au><snm>Trudel</snm><fnm>M</fnm></au><au><snm>Akslen</snm><fnm>LA</fnm></au></aug><source>J Natl Cancer Inst</source><pubdate>2003</pubdate><volume>95</volume><fpage>1482</fpage><lpage>1485</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">14519755</pubid></xrefbib></bibl><bibl id="B27"><title><p>BRCA1 dysfunction in sporadic basal-like breast cancer.</p></title><aug><au><snm>Turner</snm><fnm>NC</fnm></au><au><snm>Reis-Filho</snm><fnm>JS</fnm></au><au><snm>Russell</snm><fnm>AM</fnm></au><au><snm>Springall</snm><fnm>RJ</fnm></au><au><snm>Ryder</snm><fnm>K</fnm></au><au><snm>Steele</snm><fnm>D</fnm></au><au><snm>Savage</snm><fnm>K</fnm></au><au><snm>Gillett</snm><fnm>CE</fnm></au><au><snm>Schmitt</snm><fnm>FC</fnm></au><au><snm>Ashworth</snm><fnm>A</fnm></au><au><snm>Tutt</snm><fnm>AN</fnm></au></aug><source>Oncogene</source><pubdate>2007</pubdate><volume>26</volume><fpage>2126</fpage><lpage>2132</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1210014</pubid><pubid idtype="pmpid" link="fulltext">17016441</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>The roles of BRCA1 and BRCA2 and associated proteins in the maintenance of genomic stability.</p></title><aug><au><snm>Gudmundsdottir</snm><fnm>K</fnm></au><au><snm>Ashworth</snm><fnm>A</fnm></au></aug><source>Oncogene</source><pubdate>2006</pubdate><volume>25</volume><fpage>5864</fpage><lpage>5874</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1209874</pubid><pubid idtype="pmpid" link="fulltext">16998501</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Identification of typical medullary breast carcinoma as a genomic sub-group of basal-like carcinomas, a heterogeneous new molecular entity.</p></title><aug><au><snm>Vincent-Salomon</snm><fnm>A</fnm></au><au><snm>Gruel</snm><fnm>N</fnm></au><au><snm>Lucchesi</snm><fnm>C</fnm></au><au><snm>MacGrogan</snm><fnm>G</fnm></au><au><snm>Dendale</snm><fnm>R</fnm></au><au><snm>Sigal-Zafrani</snm><fnm>B</fnm></au><au><snm>Longy</snm><fnm>M</fnm></au><au><snm>Raynal</snm><fnm>V</fnm></au><au><snm>Pierron</snm><fnm>G</fnm></au><au><snm>de Mascarel</snm><fnm>I</fnm></au><au><snm>Taris</snm><fnm>C</fnm></au><au><snm>Stoppa-Lyonnet</snm><fnm>D</fnm></au><au><snm>Pierga</snm><fnm>JY</fnm></au><au><snm>Salmon</snm><fnm>R</fnm></au><au><snm>Sastre-Garau</snm><fnm>X</fnm></au><au><snm>Fourquet</snm><fnm>A</fnm></au><au><snm>Delattre</snm><fnm>O</fnm></au><au><snm>de Cremoux</snm><fnm>P</fnm></au><au><snm>Aurias</snm><fnm>A</fnm></au></aug><source>Breast Cancer Res</source><pubdate>2007</pubdate><volume>9</volume><fpage>R24</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/bcr1666</pubid><pubid idtype="pmcid">1868916</pubid><pubid idtype="pmpid">17417968</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Gene expression profiling and histopathological characterization of triple-negative/basal-like breast carcinomas.</p></title><aug><au><snm>Kreike</snm><fnm>B</fnm></au><au><snm>van Kouwenhove</snm><fnm>M</fnm></au><au><snm>Horlings</snm><fnm>H</fnm></au><au><snm>Weigelt</snm><fnm>B</fnm></au><au><snm>Peterse</snm><fnm>H</fnm></au><au><snm>Bartelink</snm><fnm>H</fnm></au><au><snm>Vijver</snm><mnm>van de</mnm><fnm>MJ</fnm></au></aug><source>Breast Cancer Res</source><pubdate>2007</pubdate><volume>9</volume><fpage>R65</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/bcr1771</pubid><pubid idtype="pmcid">2242660</pubid><pubid idtype="pmpid">17910759</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>X inactive-specific transcript RNA coating and genetic instability of the X chromosome in BRCA1 breast tumors.</p></title><aug><au><snm>Vincent-Salomon</snm><fnm>A</fnm></au><au><snm>Ganem-Elbaz</snm><fnm>C</fnm></au><au><snm>Manie</snm><fnm>E</fnm></au><au><snm>Raynal</snm><fnm>V</fnm></au><au><snm>Sastre-Garau</snm><fnm>X</fnm></au><au><snm>Stoppa-Lyonnet</snm><fnm>D</fnm></au><au><snm>Stern</snm><fnm>MH</fnm></au><au><snm>Heard</snm><fnm>E</fnm></au></aug><source>Cancer Res</source><pubdate>2007</pubdate><volume>67</volume><fpage>5134</fpage><lpage>5140</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-07-0465</pubid><pubid idtype="pmpid" link="fulltext">17545591</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Systematic assessment of copy number variant detection via genome-wide SNP genotyping.</p></title><aug><au><snm>Cooper</snm><fnm>GM</fnm></au><au><snm>Zerr</snm><fnm>T</fnm></au><au><snm>Kidd</snm><fnm>JM</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au><au><snm>Nickerson</snm><fnm>DA</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>1199</fpage><lpage>1203</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.236</pubid><pubid idtype="pmcid">2759751</pubid><pubid idtype="pmpid">18776910</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Common deletion polymorphisms in the human genome.</p></title><aug><au><snm>McCarroll</snm><fnm>SA</fnm></au><au><snm>Hadnott</snm><fnm>TN</fnm></au><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Sabeti</snm><fnm>PC</fnm></au><au><snm>Zody</snm><fnm>MC</fnm></au><au><snm>Barrett</snm><fnm>JC</fnm></au><au><snm>Dallaire</snm><fnm>S</fnm></au><au><snm>Gabriel</snm><fnm>SB</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><au><snm>Daly</snm><fnm>MJ</fnm></au><au><snm>Altshuler</snm><fnm>DM</fnm></au></aug><source>Nat Genet</source><pubdate>2006</pubdate><volume>38</volume><fpage>86</fpage><lpage>92</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1696</pubid><pubid idtype="pmpid">16468122</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>American Type Culture Collection (ATCC)</p></title><url>http://www.lgcstandards-atcc.org/</url></bibl><bibl id="B35"><title><p>High frequency of TP53 mutation in BRCA1 and sporadic basal-like carcinomas but not in BRCA1 luminal breast tumors.</p></title><aug><au><snm>Manie</snm><fnm>E</fnm></au><au><snm>Vincent-Salomon</snm><fnm>A</fnm></au><au><snm>Lehmann-Che</snm><fnm>J</fnm></au><au><snm>Pierron</snm><fnm>G</fnm></au><au><snm>Turpin</snm><fnm>E</fnm></au><au><snm>Warcoin</snm><fnm>M</fnm></au><au><snm>Gruel</snm><fnm>N</fnm></au><au><snm>Lebigot</snm><fnm>I</fnm></au><au><snm>Sastre-Garau</snm><fnm>X</fnm></au><au><snm>Lidereau</snm><fnm>R</fnm></au><au><snm>Remenieras</snm><fnm>A</fnm></au><au><snm>Feunteun</snm><fnm>J</fnm></au><au><snm>Delattre</snm><fnm>O</fnm></au><au><snm>de The</snm><fnm>H</fnm></au><au><snm>Stoppa-Lyonnet</snm><fnm>D</fnm></au><au><snm>Stern</snm><fnm>MH</fnm></au></aug><source>Cancer Res</source><pubdate>2009</pubdate><volume>69</volume><fpage>663</fpage><lpage>671</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-08-1560</pubid><pubid idtype="pmpid" link="fulltext">19147582</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Integragen</p></title><url>http://www.integragen.com</url></bibl><bibl id="B37"><title><p>Gene Expression Omnibus (GEO)</p></title><url>http://www.ncbi.nlm.nih.gov/geo/</url></bibl><bibl id="B38"><title><p><it>R</it>: A language and environment for statistical computing.</p></title><aug><au><cnm>R Development Core Team</cnm></au></aug><publisher>R Foundation for Statistical Computing. Vienna, Austria</publisher><pubdate>2009</pubdate></bibl><bibl id="B39"><title><p>Highly recurrent der(1;16)(q10;p10) and other 16q arm alterations in lobular breast cancer.</p></title><aug><au><snm>Flagiello</snm><fnm>D</fnm></au><au><snm>Gerbault-Seureau</snm><fnm>M</fnm></au><au><snm>Sastre-Garau</snm><fnm>X</fnm></au><au><snm>Padoy</snm><fnm>E</fnm></au><au><snm>Vielh</snm><fnm>P</fnm></au><au><snm>Dutrillaux</snm><fnm>B</fnm></au></aug><source>Genes Chromosomes Cancer</source><pubdate>1998</pubdate><volume>23</volume><fpage>300</fpage><lpage>306</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1098-2264(199812)23:4&lt;300::AID-GCC4&gt;3.0.CO;2-N</pubid><pubid idtype="pmpid" link="fulltext">9824202</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Metaphase harvest and cytogenetic analysis of solid tumor cultures.</p></title><aug><au><snm>Fletcher</snm><fnm>J</fnm></au></aug><source>Current Protocol in Human Genetics</source><editor>L HJ, Korf BR, Morton CC, Seidman CE, Seidman JG, Smith DR</editor><pubdate>2007</pubdate><volume>Chapter 10</volume><note>Unit 10.3</note></bibl><bibl id="B41"><title><p>Institut Curie Bioinformatics: GAP download Site</p></title><url>http://bioinfo.curie.fr/projects/snp_gap</url></bibl></refgrp>
   </bm>
</art>