<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>gb-2011-12-12-r122</ui>
<ji>1465-6906</ji>
<fm>
<dochead>Method</dochead>
<bibl>
<title><p>Accurate proteome-wide protein quantification from high-resolution <sup>15</sup>N mass spectra</p></title>
<aug>
<au id="A1"><snm>Khan</snm><fnm>Zia</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><email>zia@uchicago.edu</email></au>
<au id="A2" ce="yes"><snm>Amini</snm><fnm>Sasan</fnm><insr iid="I2"/><insr iid="I4"/><insr iid="I5"/><email>samini@illumina.com</email></au>
<au id="A3"><snm>Bloom</snm><mi>S</mi><fnm>Joshua</fnm><insr iid="I2"/><insr iid="I4"/><email>jbloom@princeton.edu</email></au>
<au id="A4"><snm>Ruse</snm><fnm>Cristian</fnm><insr iid="I6"/><email>ruse@cshl.edu</email></au>
<au id="A5"><snm>Caudy</snm><mi>A</mi><fnm>Amy</fnm><insr iid="I2"/><insr iid="I7"/><email>amy.caudy@utoronto.ca</email></au>
<au id="A6"><snm>Kruglyak</snm><fnm>Leonid</fnm><insr iid="I2"/><insr iid="I8"/><insr iid="I9"/><email>leonid@genomics.princeton.edu</email></au>
<au id="A7"><snm>Singh</snm><fnm>Mona</fnm><insr iid="I1"/><insr iid="I2"/><email>mona@cs.princeton.edu</email></au>
<au id="A8"><snm>Perlman</snm><mi>H</mi><fnm>David</fnm><insr iid="I2"/><insr iid="I4"/><insr iid="I10"/><email>perlmand@princeton.edu</email></au>
<au id="A9" ca="yes"><snm>Tavazoie</snm><fnm>Saeed</fnm><insr iid="I2"/><insr iid="I4"/><insr iid="I11"/><email>st2744@columbia.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Computer Science, Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I2"><p>Lewis-Sigler Institute for Integrative Genomics Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I3"><p>Current address: Department of Human Genetics, South Ellis Avenue, University of Chicago, Chicago, IL 60637, USA</p></ins>
<ins id="I4"><p>Department of Molecular Biology, Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I5"><p>Current address: Illumina Inc., Towne Centre Drive, San Diego, CA 92121, USA</p></ins>
<ins id="I6"><p>Cold Spring Harbor Laboratory, Bungtown Rd, Cold Spring Harbor, NY 11724, USA</p></ins>
<ins id="I7"><p>Current address: Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, King's College Circle, Toronto, Ontario, M5S 1A1, Canada</p></ins>
<ins id="I8"><p>Department of Ecology and Evolutionary Biology, Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I9"><p>Howard Hughes Medical Institute, Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I10"><p>Princeton Mass Spectrometry Center, Washington Rd, Princeton University, Princeton, NJ 08544, USA</p></ins>
<ins id="I11"><p>Current address: Department of Biochemistry and Molecular Biophysics, 116th Street and Broadway, Columbia University, New York, NY 10027, USA</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>12</issue>
<fpage>R122</fpage>
<url>http://genomebiology.com/2011/12/12/R122</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2011-12-12-r122</pubid><pubid idtype="pmpid">22182234</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>11</day><month>7</month><year>2011</year></date></rec><revrec><date><day>30</day><month>11</month><year>2011</year></date></revrec><acc><date><day>19</day><month>12</month><year>2011</year></date></acc><pub><date><day>19</day><month>12</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Khan et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<p>In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of <sup>15</sup>N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing <sup>15</sup>N spectra in which quantification is integrated with identification. Application of this method to an <it>Escherichia coli </it>growth transition reveals significant improvement in quantification accuracy over previous methods.</p>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>Experimental methods for proteome-wide quantification using liquid chromatography coupled mass spectrometry (LC-MS) rest heavily on computational methods that analyze mass spectra for peptide and protein quantification <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Computational analysis typically relies on two main components: (1) a peptide identification algorithm that assigns amino acid sequences to fragmentation spectra by searching a database of theoretical peptides obtained by an <it>in silico </it>digest of an organism's proteome; and (2) a quantification algorithm that collects peaks across mass spectra in the form of extracted ion chromatograms (XICs) over the duration of chromatographic elution of peptides.</p>
<p>These algorithms have been shown to produce robust and accurate peptide quantification results across a wide range of quantification strategies. They are an important part of the data analysis employed in a widely used experimental approach called stable isotope labeling by amino acids in cell culture (SILAC) <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. In most typical applications of this approach, a sample in which stable isotope-labeled amino acids are incorporated metabolically by an organism is compared to an unlabeled sample from a different experimental condition. Because both samples are combined prior to protein extraction, the samples are subjected to the same extraction, sample handling, digestion, chromatography, and ionization conditions. This eliminates much of the technical variation between individual samples. Consequently, the final relative abundance measurements, in the form of ratios of the areas of paired XICs from both the unlabeled and labeled peptides, are highly accurate. To ensure that only labeled amino acids are incorporated into proteins, amino acid auxotrophs are typically employed in conjunction with labeled amino acids, rendering this strategy unavailable for prototrophic microorganisms, such as wild-type bacteria and yeast.</p>
<p><sup>15</sup>N-labeling through the metabolic incorporation of a single source of labeled nitrogen provides an alternative strategy for labeling virtually all nitrogen in expressed proteins of prototrophs and auxotrophs <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. <sup>15</sup>N-labeling has three notable advantages over labeling with specific amino acids. First, all peptides are labeled, irrespective of their amino acid composition, and therefore any endogenous peptide or peptides produced by any chosen endoprotease can be used for quantification. The use of multiple endoproteases has been shown to significantly increase the number of distinct, non-overlapping peptides identified per protein <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Second, the label is not subject to the complications of metabolic conversion of amino acids <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Such label conversion is a particular problem in yeast, where arginine is efficiently converted into proline <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Third, labeled ammonium salts are much less expensive than labeled amino acids and are efficiently used as a nitrogen source by many microorganisms, including <it>Escherichia coli </it>and both budding and fission yeasts.</p>
<p>Despite these advantages, labeled amino acids have been used in favor of <sup>15</sup>N-labeling for proteome-wide quantification in microbes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Amino acid labels typically produce a small number of distinct mass differences between pairs of XICs, allowing all XICs and XIC pairs to be found prior to database search. In contrast, <sup>15</sup>N-labels produce mass differences that depend on the length and composition of a peptide, complicating the detection of such pairs. Existing methods for <sup>15</sup>N quantification rely on peptide database search of fragmentation spectra to locate paired peaks in parent mass spectra <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Statistically significant matches to a merged database of both unlabeled and <sup>15</sup>N-labeled peptides provide the labeling status of a fragmentation spectrum and an amino acid sequence from which nitrogen composition can be determined. During quantification, these methods find peaks within parent mass spectra that fall within an interval centered at the precursor retention time and precursor mass-to-charge ratio (<it>m/z</it>) of the peptide-assigned fragmentation spectrum. Then, a mass difference, computed using the nitrogen composition and precursor charge, is used to find peaks in parent mass spectra that correspond to the XIC with the opposite labeling status. By examining only peaks in parent mass spectra near a fragmentation spectrum and at the expected mass difference, existing methods use only a small fraction of the information in all the parent mass spectra to distinguish signal from noise, resulting in noisy peptide quantifications. Therefore, the lack of a robust quantification method significantly limits the wide use of <sup>15</sup>N-labeling for proteome-wide quantification in prototrophic microbes.</p>
<p>We have developed a novel approach for <sup>15</sup>N quantification that finds all XICs as a first step, reliably using all the available information in all the parent mass spectra. Once XICs are found, our method pairs these XICs by using two databases of labeled and unlabeled intact peptide fragment masses and peptide nitrogen counts. Then, pair membership allows the method to determine the label status of a fragmentation spectrum and search the appropriate peptide database. Additionally, the XIC pair provides a constraint on the nitrogen composition, allowing the search to limit the peptides scored against any given fragmentation spectrum. In an experiment to compare the proteome of stationary phase <it>E. coli </it>with that of exponentially growing cells, we show that our method significantly improves the quantitative accuracy of protein abundance ratios obtained from <sup>15</sup>N mass spectra.</p>
</sec>
<sec><st><p>Results and discussion</p></st>
<p>Our method finds XICs prior to database search of fragmentation spectra using techniques we have previously developed <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. From the accurate mass and charge information assigned to a monoisotopic XIC, our method searches entries in unlabeled and <sup>15</sup>N peptide databases and uses the returned nitrogen counts to examine a limited number of <sup>15</sup>N mass differences (Figure <figr fid="F1">1a</figr>). After unambiguously pairing XICs, the method designates each XIC as originating from either the unlabeled or the <sup>15</sup>N-labeled sample. This process is repeated for all monoisotopic XICs until no more XICs can be paired.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Approach for peptide quantification using <sup>15</sup>N mass spectra</p></caption><text>
   <p><b>Approach for peptide quantification using <sup>15</sup>N mass spectra</b>. <b>(a) </b>The method uses accurate mass and charge measurements to search a <sup>14</sup>N-unlabeled database and a <sup>15</sup>N database of intact peptide masses and nitrogen counts obtained by <it>in silico </it>digestion of an organism's proteome. The number of nitrogens in each of the returned peptides is used to examine a limited number of mass differences designated by arrows. On finding an unambiguous <sup>14</sup>N-<sup>15</sup>N pair, the method labels each member of the pair as originating from either the unlabeled or the <sup>15</sup>N-labeled sample. <b>(b) </b>During peptide identification, a fragmentation spectrum (1) associated with a member of the pair is searched against an unlabeled database or <sup>15</sup>N-labeled database of peptides based on the assigned label status. Note that only one member of the peptide pair needs to have an associated fragmentation spectrum. The monoisotopic masses of the peptides (2) and their nitrogen composition (3) are used to limit the search space of peptides scored against the spectrum. The intensity of each member of a <sup>14</sup>N-<sup>15</sup>N pair is used to derive a peptide ratio. <b>(c) </b>Top, a population of incompletely labeled peptides can generate a complex isotope distribution pattern for <sup>15</sup>N-labeled peptides. The peaks in purple, in order of decreasing <it>m/z</it>, correspond to peptides lacking one and two labeled nitrogens. Bottom, even in highly enriched samples, such peaks can be found in real mass spectra as illustrated for a +2 charge peptide, LGFFETVDTDTQR from <it>E. coli </it>protein yaeT (b0177). <b>(d) </b>For a range of charge states, monoisotopic XICs can be paired with an XIC with an additional <sup>13</sup>C, but lack any XICs detected at a negative <sup>13</sup>C shift (top). In contrast, <sup>15</sup>N XICs will lack both XICs at negative and positive <sup>13</sup>C shifts (bottom).</p>
</text><graphic file="gb-2011-12-12-r122-1"/></fig>
<p>During peptide identification, the method iterates through paired XICs that have an associated fragmentation spectrum. Based on the label assigned to an XIC, our method searches the fragmentation spectrum against an unlabeled database or a database of <sup>15</sup>N-labeled peptides. In addition to the mass of the peptide, this search uses the discovered constraint on the nitrogen composition of a peptide from the pairing phase to limit the search space of peptides scored for identification (Figure <figr fid="F1">1b</figr>). Last, the peptide expression ratios are obtained from the areas of paired XICs. Remaining, unpaired monoisotopic XICs that have associated fragmentation spectra are assigned a labeling status based on the highest scoring search result from the unlabeled or the labeled database.</p>
<p>A critical element of our approach is the identification of monoisotopic XICs. This process is complicated by the presence of incompletely labeled peptides. Even in highly enriched <sup>15</sup>N-labeled samples, a population of incompletely labeled peptides can generate a set of XICs that have a more complex isotope distribution pattern than the unlabeled sample (Figure <figr fid="F1">1c</figr>). To distinguish the monoisotopic XIC from this incompletely labeled population, we leverage the precision of modern high-resolution mass spectrometers. Our approach rests on the observation that the difference <it>d </it>= 0.00631994439 Da that exists between the 0.99703489341 Da shift between <sup>15</sup>N isotopes and the 1.00335483781 Da shift between <sup>13</sup>C isotopes for charge +1 peptides allows these incompletely labeled XICs to be distinguished from <sup>13</sup>C XICs. If the precision <it>p </it>of a high-resolution instrument approaches 1 part per million (ppm), this shift can be distinguished at up to (<it>d </it>/ <it>p</it>) &#215; 106 = 6,319 in <it>m/z</it>, above the maximum measured in a typical bottom-up proteomics experiment. At a charge state of +4, this shift can be distinguished at up to 1,579 in <it>m/z</it>. Examples of such shifts can be found in real spectra, as illustrated for a pair of peaks generated by unlabeled and labeled populations of a +2 charge peptide, LGFFETVDTDTQR from <it>E. coli </it>protein yaeT (b0177; Figure <figr fid="F1">1c</figr>, bottom). Here, the error between the theoretical shift and the observed shift between the peaks generated by the population of peptides lacking a single <sup>15</sup>N and the fully labeled population is 4.4 ppm and 0.3 ppm for <sup>13</sup>C and <sup>15</sup>N, respectively. An instrument with precision of 4 ppm would be sufficient to identify this shift as originating from a <sup>15</sup>N isotope, as opposed to a <sup>13</sup>C isotope. To leverage the precision of modern high-resolution instruments, our method labels XICs as monoisotopic if, for a range of charge states, no XIC is found at a negative <sup>13</sup>C shift and an XIC is found at a positive <sup>13</sup>C shift (Figure <figr fid="F1">1d</figr>, top). An XIC is labeled as undetermined or <sup>15</sup>N if no XICs are found at both negative and positive <sup>13</sup>C shifts for a range of charge states (Figure <figr fid="F1">1d</figr>, bottom). Additionally, if a <sup>15</sup>N or undetermined XIC was selected for fragmentation, its associated fragmentation spectrum can also be transferred to the nearest monoisotopic XIC at a positive <sup>15</sup>N shift to increase identification rates <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Also, in experimental samples where high enrichment levels cannot be obtained, due to limitations in labeling efficiency and impurities in the nitrogen source, the intensity information in <sup>13</sup>C and <sup>15</sup>N XICs found by our high-precision approach can be leveraged to generate ratios that are adjusted for significant populations of these partially labeled peptides <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
<p>In order to evaluate our approach, we performed an experiment to quantitatively compare the proteomes of stationary and exponential phase <it>E. coli </it>(see Materials and methods). We selected these two conditions because the transition to stationary phase is well-characterized, and the expression levels of a large number of proteins during this transition are known to differ over a wide range <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Stationary and exponential phase cells were harvested from <sup>15</sup>N-labeled and unlabeled media, respectively. Cell lysate was prepared, combined, and rendered into peptides by trypsin digestion, and then subjected to two-dimensional nano-flow LC-MS analysis.</p>
<p>We analyzed the resulting MS data, comparing our approach to the method employed by XPRESS and Mascot Distiller v.2.3.2. Both methods rely on finding XIC pairs by searching fragmentation spectra against a merged database of both unlabeled and <sup>15</sup>N-labeled peptides <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B16">16</abbr></abbrgrp>. The parameters for XPRESS and Distiller were selected to be appropriate for the precision and mass accuracy of the data. In order to evaluate the methods, we examined the consistency among ratios measured from distinct peptides belonging to the same protein. We expected that distinct peptides from the same protein would be determined to have approximately the same ratios of stationary phase to exponential measurements. For proteins with two or more distinct quantified peptides, we randomly assigned the peptides from each protein into one of two groups and computed the median log<sub>2 </sub>ratio measured for peptides in the two groups. The two ratios, from peptide group 1 and peptide group 2, were correlated for XPRESS and Distiller as well as our method (Figure <figr fid="F2">2</figr>). The significantly higher Spearman's correlation for our method (0.96 our method, 0.84 for XPRESS, and 0.67 for Distiller) indicated that our method produced significantly better quantitative measurements and peptide identifications.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Comparison with XPRESS and Mascot Distiller</p></caption><text>
   <p><b>Comparison with XPRESS and Mascot Distiller</b>. <b>(a-c) </b>Correlation of log<sub>2 </sub>ratios between distinct peptides from the same protein obtained by Mascot Distiller (a) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, XPRESS (b) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, and our method (c). Peptides from each protein were randomly assigned to two groups for this comparison of log<sub>2 </sub>ratios of peptides in group 1 versus log<sub>2 </sub>ratios of peptides in group 2. Multiple measurements (for example, charge states) of the same peptide, when present, were collated and given a median log<sub>2 </sub>ratio prior to forming the peptide groups. Black points designate proteins identified and quantified by both methods, and red points designate proteins identified and quantified that were specific to each method. The diagonal gray line designates perfect correlation.</p>
</text><graphic file="gb-2011-12-12-r122-2"/></fig>
<p>Both Distiller and XPRESS reported more proteins (<it>N </it>= 664 Distiller; <it>N </it>= 554 XPRESS) with two or more quantified peptides than our method (<it>N </it>= 424). This difference in total number of quantified proteins reported by the methods could either be due to differences in sensitivity of accurately determined proteins, or differences in accuracy of quantification. To investigate this, we observed that the Spearman's correlation for proteins (<it>N </it>= 385 Distiller; <it>N </it>= 373 XPRESS) quantified by two or more peptides by our method and each of Distiller and XPRESS independently was higher for our approach (0.98 our method; 0.79 Distiller; 0.89 XPRESS). Next, we examined the correlation for proteins specific to XPRESS (<it>N </it>= 181), Distiller (<it>N </it>= 226), and to our method (<it>N </it>= 51). For these proteins, we computed significantly higher correlations for our method (0.87 our method; 0.49 Distiller; 0.65 XPRESS), suggesting that erroneous pairing of unrelated peaks and subsequent mis-quantification of the assigned peptides contribute to the inferior performance of XPRESS and Distiller.</p>
<p>We additionally examined the raw mass spectra corresponding to the XICs of the one outlier protein yihX (b3385) reported by our method (Figure <figr fid="F2">2c</figr>). The outlier was a peptide with the sequence VLGAWSDLTR. From parent mass spectra, it appeared that an unknown peptide with nearly the same mass co-eluted with both the labeled and unlabeled versions of VLGAWSDLTR, leading to incorrect detection of XICs. However, the exact cause of the problem was difficult to discern by eye since the peaks belonging to the XICs were of very low intensity.</p>
<p>In order to evaluate the reproducibility of our method, we conducted a label-swap experiment. We independently cultured two identical sets of stationary phase and exponential phase cells differing only by which culture was grown in the <sup>15</sup>N labeled media. Analysis of both biological replicates by our method enabled quantification of 539 proteins and 1,985 peptides at a stringent 1% peptide-level false discovery rate (FDR). High correlations across protein and peptide ratios (Spearman's 0.96 and 0.94, respectively) from the two biological label-swap replicates indicated that the analysis was highly reproducible (Figure S1a,b in Additional file <supplr sid="S1">1</supplr>).</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Supplementary figures</b>. Additional figures referred to in the text.</p></text>
<file name="gb-2011-12-12-r122-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>The label swap experiment allowed us to investigate the apparent increase in variance from XPRESS for negative log<sub>2 </sub>peptide ratios (Figure <figr fid="F2">2b</figr>). When we conducted the same analysis using XPRESS, we found the reverse pattern of increasing variance towards positive log<sub>2 </sub>ratios (Figure S2a in Additional file <supplr sid="S1">1</supplr>). No such increasing variance was observed from our method (Figure S2b in Additional file <supplr sid="S1">1</supplr>). These results indicate the increasing variance in the negative range in XPRESS is linked to the growth condition of the cells and may reflect that XPRESS is sensitive to the differing expression levels of proteins in the two experimental conditions. Additionally, our method reported slightly more (<it>N </it>= 408) proteins with two or more quantified peptides than XPRESS (<it>N </it>= 399) for the label-swap. Correlations across peptide ratios were higher for our method overall (0.97 our method; 0.84 XPRESS), for proteins in common (0.97 our method; 0.85 XPRESS), and proteins specific to each method (0.94 our method; 0.79 XPRESS). This additional result provides further evidence that indicates our method provides more accurate quantitative measurements and peptide identifications than XPRESS.</p>
<p>The label-swap experiment also allowed us to examine the number of multiply coupled monoisotopic XICs and unpaired monoisotopic XICs present in <sup>15</sup>N mass spectra: 221 monoisotopic XICs in the forward experiment and 254 monoisotopic XICs in the label-swap experiment were multiply coupled. For comparison, 5,556 monoisotopic XIC pairs were found in the forward experiment and 5,224 monoisotopic XIC pairs were found in the label-swap experiment. Unpaired monoisotopic XICs could be generated by several experimental factors: a missed pairing, a multiply coupled XIC that was left unpaired, a protein that is expressed in only one of the conditions, an expression difference outside the detectable range, or contaminant compounds. As all of these factors contribute unpaired XICs, we found in the forward experiment 28,426 monoisotopic XICs of which 17,093 were left unpaired and in the label-swap experiment 31,256 monoisotopic XICs of which 20,554 were left unpaired.</p>
<p>We additionally used the identified XIC pairs from the label-swap experiment to estimate the precision of our data set. Using the nitrogen composition of the peptide sequence assigned to a pair, we calculated an expected theoretical shift between the two XICs. By computing the difference between the measured shift between the XICs and the expected theoretical shift, we created a histogram of precision measurements in parts per million. The mean precision was 0.151 ppm and the standard deviation was 0.987 (Figure S3 in Additional file <supplr sid="S1">1</supplr>). Our estimate of precision indicates that the difference between <sup>15</sup>N isotopes and <sup>13</sup>C isotopes can be detected for charge states +1 and +2 for about 90% of the XICs for the entire <it>m/z </it>range measured. For peptides with charge states +3 and +4, this <it>m/z </it>limit reaches approximately 1,301 and 976, respectively.</p>
<p>In order to deepen our quantitative analysis of the proteomes to include proteins of lower abundance, we subjected our original protein sample to additional fractionation steps. First, the sample was partitioned based on protein solubility into two fractions: high solubility and low solubility. Next, peptides derived from each of these two protein-level fractions were further fractionated using strong cation exchange (SCX) chromatography (see Materials and methods). These additional fractionation steps enabled quantification of 2,320 distinct proteins, 14,605 distinct peptides, and 29,658 quantified XIC pairs at a 1% peptide-level FDR (Additional file <supplr sid="S2">2</supplr> and Materials and methods).</p>
<suppl id="S2">
<title><p>Additional file 2</p></title>
<text><p><b>Peptide ratios for fractionated data set</b>. Raw peptide ratios reported by our method for the solubility and SCX fractionated <it>E. coli </it>protein samples.</p></text>
<file name="gb-2011-12-12-r122-S2.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>To evaluate our method on this deeper survey of the proteome, we examined the correlations between the ratios measured for different peptides from the same protein (Figure S4a in Additional file <supplr sid="S1">1</supplr>). High correlations (Spearman's 0.96) indicate that our method could accurately identify and quantify distinct peptides from the same protein to measure the fold-change of the whole protein. In addition, we correlated ratios between charge +2 and charge greater than +2 isoforms of the same peptide (Figure S4b in Additional file <supplr sid="S1">1</supplr>). Although the differing charge states produce very different fragmentation spectra, the peptide ratios from the two charge states were highly correlated (Spearman's 0.98), confirming the high degree of accuracy in both our quantifications and our assignments for each peptide. Next, we examined the intra-experiment variability of the ratios reported by the method. From the ratios of distinct peptides from each protein, we calculated a protein level median coefficient of variation (CV) of 18.4% (Figure S5a in Additional file <supplr sid="S1">1</supplr>). We also calculated a peptide level median CV of 11.9% from ratios of peptides observed in multiple fractions or with differing charge states (Figure S5b in Additional file <supplr sid="S1">1</supplr>). These low CVs at both the protein level and peptide level indicate high quantification accuracy. In summary, the depth and accuracy of this analysis illustrates the utility of our method for large-scale proteomics studies.</p>
<p>We further used the data from the additional fractions to evaluate the sensitivity and specificity gained by using the discovered constraint on the nitrogen composition of a peptide from the XIC pairing phase to limit the search space of peptides scored for tandem mass spectrometry (MS/MS) identification. We independently analyzed each of the 24 fractions in two ways: with our method's nitrogen composition constraint imposed and with it deactivated (Additional file <supplr sid="S3">3</supplr>). When our method was applied to the data, the MS/MS score cutoff required to maintain a constant 1% FDR was significantly less for all charge states, indicating statistically significant gain in sensitivity (paired <it>t</it>-test, <it>P </it>&lt; 3.69 &#215; 10<sup>-10 </sup>for charge states +1 and +2 and <it>P </it>&lt; 1.35 &#215; 10<sup>-11 </sup>for charge states +3 or greater). When our method was deactivated, the highest scoring search results above the 1% FDR threshold were free to have nitrogen compositions that did not match the composition discovered by XIC pairing. As these are likely incorrect peptide identifications, they can be removed by simple filtering. After filtering, the total number of search results and quantifications returned by our method on the same data was significantly higher (Wilcoxon signed rank, <it>P </it>&lt; 9.68 &#215; 10<sup>-6</sup>). This additional result indicates that our method provides a statistically significant gain in specificity.</p>
<suppl id="S3">
<title><p>Additional file 3</p></title>
<text><p><b>Sensitivity and specificity results</b>. False discovery rate (FDR) 1% score thresholds and total number of quantified peptides with and without our method for limiting peptides scored during MS/MS database search based on nitrogen composition applied to a data set.</p></text>
<file name="gb-2011-12-12-r122-S3.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>Next, we investigated whether the quantitative output of our method captured expression level changes of proteins known to differ between exponential and stationary phase. Globally decreased rates of translation and ribosome biosynthesis characterize stationary phase growth <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. In strong agreement with expectation, we observed that 49 of 51 quantified ribosomal proteins were down-regulated by a similar amount upon transition into stationary phase (Figure <figr fid="F3">3a</figr>). Another known feature of stationary phase is that cells adapt to higher cell density and nutrient limitation by inducing the expression of many stress proteins, including starvation-induced DNA protection protein, Dps, and osmotically inducible proteins, OsmC and OsmY <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. This is orchestrated by several regulators, including the RNA polymerase sigma factor, RpoS <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. We observed elevated expression levels of each of these proteins in stationary phase cells (Additional file <supplr sid="S2">2</supplr>), in agreement with previous results.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Ribosomal protein co-regulation and functional enrichment analysis of protein expression ratios</p></caption><text>
   <p><b>Ribosomal protein co-regulation and functional enrichment analysis of protein expression ratios</b>. <b>(a) </b>Box and whisker plot of the log<sub>2 </sub>ratios of stationary over exponential phase for 51 ribosomal proteins from the low solubility protein fraction. Boxes designate the quartiles. The thick centerline designates the median. Thin lines illustrate the range. <b>(b) </b>A graphical representation of the functional enrichment analysis method from <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> applied to protein expression ratios comparing stationary to exponential phase of growth in <it>E. coli </it>computed by our method. In this representation, rows correspond to significantly informative pathways and columns correspond to 10 equally populated expression bins of log<sub>2 </sub>ratios (stationary over exponential). Colors indicate pathway over or under-representation levels across the expression bins. Red designates (in log<sub>10</sub>) over-representation of a particular pathway in any expression bin, whereas, blue designates under-representation.</p>
</text><graphic file="gb-2011-12-12-r122-3"/></fig>
<p>Finally, to examine whether our measured protein expression differences on a global level accurately characterize the differences between exponential phase and stationary phase cells, we conducted a functional enrichment analysis that bins expression differences based on their magnitudes and directions and determines which biological processes are over- or under-represented in these bins (Figure <figr fid="F3">3b</figr>) <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Our results demonstrated that, in addition to cessation of ribosome biosynthesis <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, we captured many other adaptive physiological changes across major biological processes that are known features of stationary phase in <it>E. coli</it>. These include repression of flagellum biosynthesis and motility associated processes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and down-regulation of proteins involved in oxidative metabolism <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>We have introduced a new computational analysis method that significantly improves the accuracy of proteome-wide protein quantification from <sup>15</sup>N mass spectra. The method combines two ideas, one to use pairing information to constrain database search, earlier proposed in <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and another to pair XICs using nitrogen counts, earlier proposed in <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, to obtain an integrated approach for identification and quantification. Using protein samples from a well-characterized growth transition in <it>E. coli</it>, we show that the new method enables the collection of high-quality quantitative data for large-scale proteomics experiments, which is validated by numerous metrics, including its consistency with previously published observations. More broadly, the method allows researchers in the fields such as microbiology to leverage the robustness and accuracy of metabolic labeling without the experimental limitations and complications of amino acid labeling approaches. Because of this practical advantage, we anticipate that our method will provide a valuable tool for proteome-wide protein quantification.</p>
</sec>
<sec><st><p>Materials and methods</p></st>
<sec><st><p>Strain and media</p></st>
<p><it>E. coli </it>K-12 MG1655 was used for all bacterial experiments in this work. Bacterial cells were grown in M9 minimal media with 0.4% w/v glucose as carbon source. The M9 media contained 12.8 g Na<sub>2</sub>HPO<sub>4</sub>-7H<sub>2</sub>O, 3 g KH<sub>2</sub>PO<sub>4</sub>, 0.5 g NaCl, 11 mg CaCl<sub>2</sub>, 241 mg MgSO<sub>4</sub>, and 1 g NH<sub>4</sub>Cl (with either <sup>14</sup>N or <sup>15</sup>N) per liter. <sup>15</sup>N-ammonium chloride was purchased from Cambridge Isotope Laboratories (Andover, MA, USA). Growth conditions: overnight cultures of <it>E. coli </it>MG1655 were diluted into fresh M9 media (1:500) and shaken at 250 rpm at 37&#176;C. Cultures were harvested at OD600 values of 0.2 and 0.9 for exponential and early stationary phase samples, respectively.</p>
</sec>
<sec><st><p>Protein extraction</p></st>
<p>We harvested 100 ml of exponentially growing cells (in <sup>14</sup>N media) and 22 ml of stationary phase cells (in <sup>15</sup>N media) by centrifugation and these were frozen in an ethanol-dry ice bath. The pellets were re-suspended in 1 ml of B-PER bacterial protein extraction buffer (Pierce, Rockford, IL, USA), pooled together, and vortexed vigorously for 1 min. The mixture was centrifuged at 13,000 rpm for 5 min. The supernatant (high solubility fraction) was collected and frozen in an ethanol-dry ice bath. The pellet was re-suspended in 2 ml of 1:10 diluted B-PER reagent (PIERCE). The suspension was centrifuged and washed one more time with 1:10 diluted B-PER reagent. The pellet was re-suspended in 1 ml of Inclusion Body Solubilization Reagent. The suspension was vortexed for 1 min, shaken for 30 min, and sonicated to further break the cellular structures. Cell debris was removed from the suspension by centrifugation. The supernatant was frozen in an ethanol-dry ice bath (low solubility fraction).</p>
</sec>
<sec><st><p>Mass spectrometry sample preparation</p></st>
<p>The low solubility and high solubility protein fractions were subjected to buffer exchange, thiol reduction and alkylation, and tryptic digestion by the filter-aided sample preparation (FASP) procedure <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Peptides were desalted using a home-packed capillary reversed-phase column (500 &#956;m internal diameter &#215; 20 cm, POROS 10R2 C18 resin) using a Harvard syringe pump, and eluted directly onto a home-packed capillary strong cation exchange column (500 &#956;m internal diameter &#215; 45 cm, POROS SCX resin ), which was connected to the outlet of the reversed-phase column to minimize sample loss. SCX fractionation of peptides was conducted using a Dionex Ultimate NanoLC capillary HPLC system (Dionex, Sunnyvale, CA, USA), using a gradient from a 75%:25% mix of buffers A:B to 100% buffer B (buffer A, 7 mM KH<sub>2</sub>PO4, pH 2.65, 30% acetonitrile (ACN); buffer B, 7 mM KH<sub>2</sub>PO<sub>4</sub>, 350 mM KCl, pH 2.65, 30% ACN) over an 80-min period at flow rate of 10 &#956;l/min, followed by column stripping and reconditioning for 10 min in buffer C (50 mM K<sub>2</sub>HPO<sub>4</sub>, 500 mM NaCl, pH 7.5) and water. SCX fractions were collected every 5 min and were pooled into 12 fractions of roughly equivalent peptide abundance according to the integration of their UV absorbance (&#955; = 214 nm) values during the course of separation.</p>
</sec>
<sec><st><p>Mass spectrometry data acquisition</p></st>
<p>Fractions were desalted using StageTip micro-scale reversed-phase chromatography <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, then subjected to reversed-phase nano-LC-MS and MS/MS performed on a nano-flow capillary high pressure HPLC system (Eksigent, Dublin, CA, USA) coupled to an LTQ-Orbitrap hybrid mass spectrometer (ThermoFisher Scientific, San Jose, CA, USA). Sample concentration and desalting were performed online using a trapping capillary column (200 &#956;m &#215; approximately 30 mm, packed with 5 &#956;m, 100 &#197; Magic AQ C18 material; Michrom, Auburn, CA, USA) at a flow rate of 7 &#956;l/min for 3.5 min, while separation was achieved using an analytical capillary column (75 &#956;m &#215; approximately 20 cm, packed with 3 &#956;m, 100 &#197; Magic AQ C18 material; Michrom) terminating in a pulled sprayer tip, under a linear gradient of A and B buffers (buffer A, 3% ACN/0.1% formic acid; buffer B, 97% ACN/0.1% formic acid) over 180 min at a flow rate of approximately 0.5 &#956;l/min. Electrospray ionization was carried out at 2.5 kV, with the LTQ heated capillary set to 200&#176;C. Full-scan mass spectra were acquired in the Orbitrap in the positive-ion mode over the <it>m/z </it>range of 300 to 1,800 at a resolution of 60,000. MS/MS spectra were simultaneously acquired using the LTQ for the seven most abundant multiply charged species in the full-scan spectrum having signal intensities of &gt; 1,000 NL (Thermo). Dynamic exclusion was set such that MS/MS was acquired only once for each species over a period of 120 s.</p>
</sec>
<sec><st><p>Peptide identification and quantification</p></st>
<p>We used a database of peptide sequences for <it>E. coli </it>K-12 from the UniprotKB, which was built by <it>in silico </it>tryptic digest to obtain two databases of labeled and unlabeled peptide masses and nitrogen counts. XICs and their associated fragmentation spectra were detected using methods we previously developed <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Monoisotopic XICs were paired in order of decreasing intensity. During pairing, the mass determined from the mass-to-charge ratio (<it>m/z</it>) and charge z obtained from <sup>13</sup>C isotope spacing was used to search both databases. A positive or negative shift of 0.99703489341 &#215; N/z was examined for candidate XICs for each returned nitrogen count N for the unlabeled and labeled database, respectively. The <it>m/z </it>of the candidate XIC was allowed to vary between a &#177; 3 ppm window. Also, the pairing allowed the position of a candidate XIC to vary between the start and end time of the current XIC to compensate for differences in co-elution time known to occur due to <sup>15</sup>N-labeling. If only a single candidate XIC was found, the current XIC and the candidate XIC were paired, and a labeling status and nitrogen count were assigned to any associated fragmentation spectra. Based on labeling status, fragmentation spectra were searched against an unlabeled or labeled tryptic peptide database using the fragmentation model from the Open Mass Spectrometry Search Algorithm <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and scored according to (M/L) &#931; log<sub>2 </sub>(I<sub>m</sub>), where L is the number of peaks in the theoretical spectrum, M is the number of matched experimentally observed peaks, and I<sub>m </sub>is the intensity of the m-th peak that matched a theoretical spectrum peak. This modified score was used to leverage intensity information in spectra and assure near Gaussian distribution of the scores. Prior to scoring, peptides with matching precursor masses but differing nitrogen counts were filtered. FDR was estimated using a q-value approach and concatenated reverse decoy database <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. XICs that have fragmentation spectra without a corresponding XIC pair, or an ambiguous pair, can be assigned an identification and labeling status based on database search of a labeled and <sup>15</sup>N-labeled database.</p>
<p>We used the implementation of XPRESS in Trans Proteomic Pipeline (v.4.4.1) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The mass tolerance for XPRESS was set to 0.05 Da. All tryptic peptides were searched using a precursor mass window of &#177; 10 ppm at a stringent FDR of 1%. A single missed cleavage was allowed during peptide database construction.</p>
</sec>
<sec><st><p>Availability</p></st>
<p>We have implemented our method in an open source software system for LC-MS data analysis that provides an extremely fast and highly efficient implementation of this method <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B28">28</abbr></abbrgrp>. A tutorial and all the raw mass spectra analyzed in this manuscript are available for download at <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and at Proteome Commons <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> with the following Tranche Hash: 4b3VyisnIV6ZXYHRRebqqLxSpgy45DrDVqJojES07+m8JrGaL8RshZx6y/XFZNYSU1iL3VJY7mwhi3XvxhESYXOpBaYAAAAAAAACuQ = =</p>
</sec>
</sec>
<sec><st><p>Abbreviations</p></st>
<p>ACN: acetonitrile; CV: coefficient of variation; FDR: false discovery rate; LC-MS: liquid chromatography coupled mass spectrometry; MS/MS: tandem mass spectrometry; ppm: parts per million; XIC: extracted ion chromatogram; SCX: strong cation exchange; SILAC: stable isotope labeling by amino acids in cell culture.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>SA conducted the growth transition experiment and the Gene Ontology enrichment analysis, conceived the study in its design and coordination and helped to draft the manuscript. JB conducted the initial pilot experiments from which the method was designed, conceived the study in its design and coordination and helped to draft the manuscript. ZK conducted the LC-MS data analysis, designed and implemented the method with input from MS, conceived the study in its design and coordination and helped to draft the manuscript. CR contributed the idea of comparing to XPRESS. DP conducted the mass spectrometry experiments and data acquisition, conceived the study in its design and coordination and helped to draft the manuscript. AC, LK, MS and ST conceived the study in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript for publication.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>NIH grant P50 GM071508 (ZK, AAC, DHP, SA); James S McDonnell Centennial Fellowship (LK); Howard Hughes Medical Institute (LK); NIH grant 5R01AI077562 (ST). We would also like to thank Sean R Hackett and Ileana Cristea for their insightful comments on this manuscript.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>An assessment of software solutions for the analysis of mass spectrometry based quantitative proteomics data.</p></title><aug><au><snm>Mueller</snm><fnm>LN</fnm></au><au><snm>Brusniak</snm><fnm>M-Y</fnm></au><au><snm>Mani</snm><fnm>DR</fnm></au><au><snm>Aebersold</snm><fnm>R</fnm></au></aug><source>J Proteome Res</source><pubdate>2008</pubdate><volume>7</volume><fpage>51</fpage><lpage>61</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/pr700758r</pubid><pubid idtype="pmpid" link="fulltext">18173218</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.</p></title><aug><au><snm>Cox</snm><fnm>J</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2008</pubdate><volume>26</volume><fpage>1367</fpage><lpage>1372</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1511</pubid><pubid idtype="pmpid" link="fulltext">19029910</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Mass spectrometry-based proteomics turns quantitative.</p></title><aug><au><snm>Ong</snm><fnm>S-E</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Nat Chem Biol</source><pubdate>2005</pubdate><volume>1</volume><fpage>252</fpage><lpage>262</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nchembio736</pubid><pubid idtype="pmpid" link="fulltext">16408053</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Accurate quantitation of protein expression and site-specific phosphorylation.</p></title><aug><au><snm>Oda</snm><fnm>Y</fnm></au><au><snm>Huang</snm><fnm>K</fnm></au><au><snm>Cross</snm><fnm>FR</fnm></au><au><snm>Cowburn</snm><fnm>D</fnm></au><au><snm>Chait</snm><fnm>BT</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1999</pubdate><volume>96</volume><fpage>6591</fpage><lpage>6596</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.96.12.6591</pubid><pubid idtype="pmcid">21959</pubid><pubid idtype="pmpid" link="fulltext">10359756</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Value of using multiple proteases for large-scale mass spectrometry-based proteomics.</p></title><aug><au><snm>Swaney</snm><fnm>DL</fnm></au><au><snm>Wenger</snm><fnm>CD</fnm></au><au><snm>Coon</snm><fnm>JJ</fnm></au></aug><source>J Proteome Res</source><pubdate>2010</pubdate><volume>9</volume><fpage>1323</fpage><lpage>1329</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/pr900863u</pubid><pubid idtype="pmcid">2833215</pubid><pubid idtype="pmpid" link="fulltext">20113005</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>A computational approach to correct arginine-to-proline conversion in quantitative proteomics.</p></title><aug><au><snm>Park</snm><fnm>SK</fnm></au><au><snm>Liao</snm><fnm>L</fnm></au><au><snm>Kim</snm><fnm>JY</fnm></au><au><snm>Yates</snm><fnm>JR</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>184</fpage><lpage>185</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth0309-184</pubid><pubid idtype="pmcid">2946183</pubid><pubid idtype="pmpid" link="fulltext">19247291</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>A genetic engineering solution to the arginine conversion problem in stable isotope labeling by amino acids in cell culture (SILAC).</p></title><aug><au><snm>Bicho</snm><fnm>CC</fnm></au><au><snm>de Lima Alves</snm><fnm>F</fnm></au><au><snm>Chen</snm><fnm>ZA</fnm></au><au><snm>Rappsilber</snm><fnm>J</fnm></au><au><snm>Sawin</snm><fnm>KE</fnm></au></aug><source>Mol Cell Proteomics</source><pubdate>2010</pubdate><volume>9</volume><fpage>1567</fpage><lpage>1577</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/mcp.M110.000208</pubid><pubid idtype="pmcid">2896365,2896365</pubid><pubid idtype="pmpid" link="fulltext">20460254</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Comprehensive mass-spectrometry-based proteome quantification of haploid versus diploid yeast.</p></title><aug><au><snm>de Godoy</snm><fnm>LMF</fnm></au><au><snm>Olsen</snm><fnm>JV</fnm></au><au><snm>Cox</snm><fnm>J</fnm></au><au><snm>Nielsen</snm><fnm>ML</fnm></au><au><snm>Hubner</snm><fnm>NC</fnm></au><au><snm>Frohlich</snm><fnm>F</fnm></au><au><snm>Walther</snm><fnm>TC</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>455</volume><fpage>1251</fpage><lpage>1254</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07341</pubid><pubid idtype="pmpid" link="fulltext">18820680</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>MSQuant, an open source platform for mass spectrometry-based quantitative proteomics.</p></title><aug><au><snm>Mortensen</snm><fnm>P</fnm></au><au><snm>Gouw</snm><fnm>JW</fnm></au><au><snm>Olsen</snm><fnm>JV</fnm></au><au><snm>Ong</snm><fnm>SE</fnm></au><au><snm>Rigbolt</snm><fnm>KT</fnm></au><au><snm>Bunkenborg</snm><fnm>J</fnm></au><au><snm>Cox</snm><fnm>J</fnm></au><au><snm>Foster</snm><fnm>LJ</fnm></au><au><snm>Heck</snm><fnm>AJ</fnm></au><au><snm>Blagoev</snm><fnm>B</fnm></au><au><snm>Andersen</snm><fnm>JS</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>J Proteome Res</source><pubdate>2010</pubdate><volume>9</volume><fpage>393</fpage><lpage>403</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/pr900721e</pubid><pubid idtype="pmpid" link="fulltext">19888749</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry.</p></title><aug><au><snm>Han</snm><fnm>DK</fnm></au><au><snm>Eng</snm><fnm>J</fnm></au><au><snm>Zhou</snm><fnm>H</fnm></au><au><snm>Aebersold</snm><fnm>R</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2001</pubdate><volume>19</volume><fpage>946</fpage><lpage>951</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt1001-946</pubid><pubid idtype="pmcid">1444949</pubid><pubid idtype="pmpid" link="fulltext">11581660</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>A correlation algorithm for the automated quantitative analysis of shotgun proteomics data.</p></title><aug><au><snm>MacCoss</snm><fnm>MJ</fnm></au><au><snm>Wu</snm><fnm>CC</fnm></au><au><snm>Liu</snm><fnm>H</fnm></au><au><snm>Sadygov</snm><fnm>R</fnm></au><au><snm>Yates</snm><fnm>JR</fnm></au></aug><source>Anal Chem</source><pubdate>2003</pubdate><volume>75</volume><fpage>6912</fpage><lpage>6921</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/ac034790h</pubid><pubid idtype="pmpid" link="fulltext">14670053</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>A uniform proteomics MS/MS analysis platform utilizing open XML file formats.</p></title><aug><au><snm>Keller</snm><fnm>A</fnm></au><au><snm>Eng</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>N</fnm></au><au><snm>Li</snm><fnm>X-j</fnm></au><au><snm>Aebersold</snm><fnm>R</fnm></au></aug><source>Mol Systems Biol</source><pubdate>2005</pubdate><volume>1</volume><note>2005.0017</note></bibl><bibl id="B13"><title><p>Protein quantification across hundreds of experimental conditions.</p></title><aug><au><snm>Khan</snm><fnm>Z</fnm></au><au><snm>Bloom</snm><fnm>JS</fnm></au><au><snm>Garcia</snm><fnm>BA</fnm></au><au><snm>Singh</snm><fnm>M</fnm></au><au><snm>Kruglyak</snm><fnm>L</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>15544</fpage><lpage>15548</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0904100106</pubid><pubid idtype="pmcid">2732709</pubid><pubid idtype="pmpid" link="fulltext">19717460</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Optimizing identification and quantitation of 15N-labeled proteins in comparative proteomics.</p></title><aug><au><snm>Gouw</snm><fnm>JW</fnm></au><au><snm>Tops</snm><fnm>BBJ</fnm></au><au><snm>Mortensen</snm><fnm>P</fnm></au><au><snm>Heck</snm><fnm>AJR</fnm></au><au><snm>Krijgsveld</snm><fnm>J</fnm></au></aug><source>Anal Chem</source><pubdate>2008</pubdate><volume>80</volume><fpage>7796</fpage><lpage>7803</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/ac801249v</pubid><pubid idtype="pmpid" link="fulltext">18808151</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>The Escherichia coli proteome: past, present, and future prospects.</p></title><aug><au><snm>Han</snm><fnm>MJ</fnm></au><au><snm>Lee</snm><fnm>SY</fnm></au></aug><source>Microbiol Mol Biol Rev</source><pubdate>2006</pubdate><volume>70</volume><fpage>362</fpage><lpage>439</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MMBR.00036-05</pubid><pubid idtype="pmcid">1489533</pubid><pubid idtype="pmpid" link="fulltext">16760308</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Mascot Distiller.</p></title><url>http://www.matrixscience.com/distiller_support.html</url></bibl><bibl id="B17"><title><p>Analysis of the shut-off of ribosomal RNA promoters in Escherichia coli upon entering the stationary phase of growth.</p></title><aug><au><snm>Aviv</snm><fnm>M</fnm></au><au><snm>Giladi</snm><fnm>H</fnm></au><au><snm>Oppenheim</snm><fnm>AB</fnm></au><au><snm>Glaser</snm><fnm>G</fnm></au></aug><source>FEMS Microbiol Lett</source><pubdate>1996</pubdate><volume>140</volume><fpage>71</fpage><lpage>76</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1574-6968.1996.tb08317.x</pubid><pubid idtype="pmpid">8666203</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Identification of a central regulator of stationary-phase gene expression in Escherichia coli.</p></title><aug><au><snm>Lange</snm><fnm>R</fnm></au><au><snm>Hengge-Aronis</snm><fnm>R</fnm></au></aug><source>Mol Microbiol</source><pubdate>1991</pubdate><volume>5</volume><fpage>49</fpage><lpage>59</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2958.1991.tb01825.x</pubid><pubid idtype="pmpid" link="fulltext">1849609</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Revealing global regulatory perturbations across human cancers.</p></title><aug><au><snm>Goodarzi</snm><fnm>H</fnm></au><au><snm>Elemento</snm><fnm>O</fnm></au><au><snm>Tavazoie</snm><fnm>S</fnm></au></aug><source>Mol Cell</source><pubdate>2009</pubdate><volume>36</volume><fpage>900</fpage><lpage>911</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2009.11.016</pubid><pubid idtype="pmcid">2900319</pubid><pubid idtype="pmpid" link="fulltext">20005852</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Microarray analysis of RpoS-mediated gene expression in Escherichia coli K-12.</p></title><aug><au><snm>Patten</snm><fnm>CL</fnm></au><au><snm>Kirchhof</snm><fnm>MG</fnm></au><au><snm>Schertzberg</snm><fnm>MR</fnm></au><au><snm>Morton</snm><fnm>RA</fnm></au><au><snm>Schellhorn</snm><fnm>HE</fnm></au></aug><source>Mol Genet Genomics</source><pubdate>2004</pubdate><volume>272</volume><fpage>580</fpage><lpage>591</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00438-004-1089-2</pubid><pubid idtype="pmpid" link="fulltext">15558318</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Combined transcriptome and proteome analysis of Escherichia coli during high cell density culture.</p></title><aug><au><snm>Yoon</snm><fnm>SH</fnm></au><au><snm>Han</snm><fnm>MJ</fnm></au><au><snm>Lee</snm><fnm>SY</fnm></au><au><snm>Jeong</snm><fnm>KJ</fnm></au><au><snm>Yoo</snm><fnm>JS</fnm></au></aug><source>Biotechnol Bioeng</source><pubdate>2003</pubdate><volume>81</volume><fpage>753</fpage><lpage>767</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.10626</pubid><pubid idtype="pmpid" link="fulltext">12557308</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Novel approach for peptide quantitation and sequencing based on 15N and 13C metabolic labeling.</p></title><aug><au><snm>Snijders</snm><fnm>APL</fnm></au><au><snm>de Vos</snm><fnm>MGJ</fnm></au><au><snm>Wright</snm><fnm>PC</fnm></au></aug><source>J Proteome Res</source><pubdate>2005</pubdate><volume>4</volume><fpage>578</fpage><lpage>585</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/pr0497733</pubid><pubid idtype="pmpid" link="fulltext">15822937</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Universal sample preparation method for proteome analysis.</p></title><aug><au><snm>Wisniewski</snm><fnm>JR</fnm></au><au><snm>Zougman</snm><fnm>A</fnm></au><au><snm>Nagaraj</snm><fnm>N</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>359</fpage><lpage>362</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.1322</pubid><pubid idtype="pmpid" link="fulltext">19377485</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Stop and go extraction tips for matrix-assisted laser desorption/ionization, nanoelectrospray, and LC/MS sample pretreatment in proteomics.</p></title><aug><au><snm>Rappsilber</snm><fnm>J</fnm></au><au><snm>Ishihama</snm><fnm>Y</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Anal Chem</source><pubdate>2003</pubdate><volume>75</volume><fpage>663</fpage><lpage>670</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/ac026117i</pubid><pubid idtype="pmpid">12585499</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Open mass spectrometry search algorithm.</p></title><aug><au><snm>Geer</snm><fnm>LY</fnm></au><au><snm>Markey</snm><fnm>SP</fnm></au><au><snm>Kowalak</snm><fnm>JA</fnm></au><au><snm>Wagner</snm><fnm>L</fnm></au><au><snm>Xu</snm><fnm>M</fnm></au><au><snm>Maynard</snm><fnm>DM</fnm></au><au><snm>Yang</snm><fnm>X</fnm></au><au><snm>Shi</snm><fnm>W</fnm></au><au><snm>Bryant</snm><fnm>SH</fnm></au></aug><source>J Proteome Res</source><pubdate>2004</pubdate><volume>3</volume><fpage>958</fpage><lpage>964</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/pr0499491</pubid><pubid idtype="pmpid" link="fulltext">15473683</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>A direct approach to false discovery rates.</p></title><aug><au><snm>Storey</snm><fnm>JD</fnm></au></aug><source>J R Stat Soc B Stat Methodol</source><pubdate>2002</pubdate><volume>64</volume><fpage>479</fpage><lpage>498</lpage><xrefbib><pubid idtype="doi">10.1111/1467-9868.00346</pubid></xrefbib></bibl><bibl id="B27"><title><p>Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.</p></title><aug><au><snm>Elias</snm><fnm>JE</fnm></au><au><snm>Gygi</snm><fnm>SP</fnm></au></aug><source>Nat Methods</source><pubdate>2007</pubdate><volume>4</volume><fpage>207</fpage><lpage>214</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth1019</pubid><pubid idtype="pmpid" link="fulltext">17327847</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>PVIEW: Princeton LC-MS/MS Data Viewer and Analyzer.</p></title><url>http://compbio.cs.princeton.edu/pview</url></bibl><bibl id="B29"><title><p>Supplementary Data.</p></title><url>http://compbio.cs.princeton.edu/pview/protein15N</url></bibl><bibl id="B30"><title><p>Proteome Commons.</p></title><url>https://ProteomeCommons.org</url></bibl></refgrp>
</bm>
</art>