Nature Methods
- 4, 817 - 821 (2007)
Published online: 27 September 2007; | doi:10.1038/nmeth1097
Decoding protein modifications using top-down mass spectrometryNertila Siuti & Neil L KelleherNertila Siuti and Neil L. Kelleher are in the Departments of Chemistry and Biochemistry, and the Institute for Genomic Biology, University of Illinois at Urbana-Champaign, 53 Roger Adams Laboratory, 600 South Matthews Avenue, Urbana, Illinois 61801, USA. kelleher@scs.uiuc.edu Top-down mass spectrometry is an emerging technology which strives to preserve the post-translationally modified forms of proteins present in vivo by measuring them intact, rather than measuring peptides produced from them by proteolysis. The top-down technology is beginning to capture the interest of biologists and mass spectrometrists alike, with a main goal of deciphering interaction networks operative in cellular pathways. Here we outline recent approaches and applications of top-down mass spectrometry as well as an outlook for its future.Advances in the genome sciences have generated more specific hypotheses about the dynamic nature of the protein-level language used for signaling in eukaryotic cells. This has led to an increased focus on both targeted and large-scale approaches that can precisely characterize endogenous protein forms at the molecular level. The proteomes of eukaryotic cells are highly complex owing to related genes encoding similar protein sequences and to alternative splicing and post-translational modifications (PTMs). Mass spectrometry (MS) is maturing as a powerful technology for analyzing proteomes. The emerging top-down MS approach1,
2,
3,
4,
5 in particular can provide a 'bird's eye' view of the protein forms present and their relative abundances, before dissecting each for complete characterization of their primary structures. Since its early days1,
6,
7, top-down mass spectrometry has continued to evolve from focused studies of single protein targets to the study of complex mixtures or even proteomes8,
9,
10. Here we outline the methods, applications, advantages and challenges associated with top-down mass spectrometry both for single protein targets and for entire proteomes, as well as prospects for the future.
Top-down and bottom-up MS MS-based proteomics is typically carried out by first digesting a protein mixture into short peptides with a protease, then analyzing the peptide mixture by MS; there are many permutations on this general theme, which is conceptually known as 'bottom-up' proteomics. In contrast, the 'top-down' approach involves direct analysis of intact proteins, without previous proteolytic digestion. A new variant method, known to some as 'middle-down', analyzes larger peptide fragments (>3 kDa), thus combining some benefits of both bottom-up and top-down approaches (for example, generating peptides that contain multiple PTMs).
It is well established that protein PTMs are a key driving force behind cellular signaling. A distinct advantage of top-down over peptide-based approaches is that the abundance of the protein forms can be determined directly, as intact proteins are less susceptible to instrumental biases than are their small peptide counterparts11. The approach is depicted in Figure 1 for a hypothetical 11 kDa protein with three forms present from three different cell states. Initially, a 'basis set' of all the forms expressed is determined for a particular cell state (Fig. 1a). Once the different forms present are detected, each form is isolated and fragmented by tandem mass spectrometry (MS/MS; Fig. 1a, inset). This process can be repeated for samples from different cell states (Fig. 1b,c, insets), which allows detection of PTM dynamics caused by genetic manipulation, drug treatment, transcript knockdown or other perturbations.
 | |  | The MS/MS level of information is the top-down equivalent of the tryptic digest typically used in a bottom-up experiment. Top-down MS/MS also achieves protein identification and molecular characterization, but with a different information content. Notably, MS/MS of intact protein forms facilitates the determination of modifications as they occur in combination, as well as revealing information about PTM hierarchies (such as which PTMs occur first, second, third and so on)11,
12.
Top-down technology The three basic pillars of MS-based proteomics are (i) the 'front end' fractionation of complex mixtures, (ii) mass spectral data acquisition and (iii) protein identification and characterization by database searching. Over the past decade, these components have improved drastically for peptides, followed by slower progress for intact proteins. Here we discuss the available approaches and technologies for top-down MS as well as the needs for specific improvements.
Front-end fractionation of intact proteins. Despite efforts toward improving 'front end' separations, no top-down platform has yet emerged as the best option. Two-dimensional gels provide a nice 'bird's eye' view of the proteome through good separation; however, large-scale analyses of gel spots by top-down MS has been difficult to achieve by either MALDI- or electrospray ionization (ESI)-based approaches. Lack of sensitivity has thwarted the MALDI approach that uses a phenomenon called 'in-source decay'13 to achieve fragmentation and identification of intact proteins14. ESI is more promising for large protein MS/MS, because it generates multiply charged ions for more efficient dissociation. However, it has not yet proven possible to implement ESI in conjunction with protein electroelution from two-dimensional gels. Proteome fractionation by preparative gel electrophoresis in the first dimension using an acid-labile surfactant in place of sodium dodecylsulfate has been demonstrated in combination with ESI and Fourier-transform (FTMS)/MS8,
15.
Other common approaches involve fractionating proteins in the first dimension by anion exchange16, capillary isoelectric focusing17 or chromatofocusing18. These fractionation approaches have been implemented in work flows on diverse types of instrument, such as time-of-flight (TOF) MS19,
20 and FTMS21. As in bottom-up proteomics, the second dimension of separation is dominated by reversed-phase liquid chromatography (RLPC). With improved instrumentation and MS/MS methods, top-down MS should become more compatible with the timescale of chromatography used at present in bottom-up liquid chromatography-mass spectrometry (see Box 1).
MS instrumentation. Instruments become approximately fivefold more sensitive about every three years and come in basic forms such as time-of-flight (TOF), quadrupole, ion trap or FTMS, or as combinations of these. Continued refinements are needed for top-down because proteins and peptides above 5 kDa have more charge states and more isotopic peaks than the average tryptic peptide.
Historically, top-down MS has been most often performed on FTMS instruments, but new instruments are being developed that may facilitate tandem MS experiments for intact proteins of high mass. For example, the need for a superconducting magnet in FTMS was recently bypassed using a new approach to measuring the mass-to-charge ratio of ions22,
23. Similar to the original type of FTMS, this new Orbitrap instrument provides high resolving power for each detected peak. This translates to low-parts-per-million mass accuracy and higher confidence in protein identification; however, both FTMS and Orbitrap instruments are slower than the lower-resolution ion itrap. Even with only 103 resolving power, however, the utility of the stand-alone ion traps for top-down is improving, thanks in part to new methods for fragmenting larger peptides and proteins in the gas phase. Both low and high resolution mass spectrometers should continue to drive advancements in top-down proteomics (see Box 1).
MS/MS fragmentation. The classical method for breaking ions apart involves colliding them into a neutral gas such as helium or argon, a process known as collision-induced or collisionally activated dissociation (CID or CAD). Newer methods using electrons for MS/MS are now coming into more widespread use (Fig. 2), with electron capture dissociation (ECD) first reported in 1998 (ref. 24). Recently, this concept has evolved into electron transfer dissociation (ETD) with work by the Hunt group25,
26. Combining ETD with previous approaches for charge manipulation of ions from the McLuckey laboratory27,
28, top-down in a stand-alone ion trap is now a reality29,
30. The electron-based methods for ion fragmentation cleave proteins and large peptides at many more backbone positions than older approaches, which translates into an improved capacity to localize modifications to specific sites. Both ECD and ETD retain any PTM that is stable during the original ionization process (Fig. 2a)31, in contrast to collisional dissociation of tryptic peptides, which often ejects PTMs such as phosphorylation and glycosylation (Fig. 2b). However, when applied during top-down MS/MS the classical methods of ion fragmentation frequently retain labile PTMs (Fig. 2a)32,
33. This is likely to be due to the higher-order structure of gaseous ions larger than 8 kDa, which drives fragmentation at amide backbone sites preferentially over PTM ejection. Another key recent advance is the ability to obtain high-quality fragmentation spectra above 200 kDa by preventing ultralarge proteins from folding up like 'spaghetti' immediately after ESI34. ETD and ECD will propel both top-down and middle-down strategies to achieve better protein characterization35.
 | |  | Data analysis. The specificity of database searching of intact protein MS/MS spectra is often orders of magnitude higher than that for bottom-up fragments. This translates to a greater capability to consider PTMs and coding polymorphisms (cSNPs) during a primary database search32. In fact, the concept of putting PTMs and cSNPs in databases32,
36 is gaining more acceptance, with an ontology for PTMs now specified by the Proteomics Standard Initiative.
The only algorithm available at present for identifying protein forms from tandem mass spectra of intact proteins is ProSight37 (see Box 1). The standard search uses the experimental protein mass36 to select all protein candidates that lie within a specified mass range (for example, 2,000 Da). ProSight then uses a Poisson model to provide statistically significant matches32 between the fragment ion mass values in the experimental MS/MS spectrum and the theoretical masses predicted from each candidate. This type of search is error tolerant in ' m' mode36, which considers the precise mass difference between the experimental protein molecular mass and the database candidate. Searching in m mode facilitates the detection and localization of PTMs not present in the database. ProSight uses a candidate expansion method referred to as 'shotgun annotation'38 to consider combinations of diverse PTMs, cSNPs and alternative splicing events harbored in the database21. Shotgun annotation allows direct coupling of protein identification and characterization. Each year, more cSNPs and PTMs (especially phosphorylations, glycosylations and disulfide bonds) are added to public databases. The ProSight approach provides a means to access this information in new database searches.
Current applications Chromatin biology. Histone proteins and hypotheses of a 'histone code' have attracted increasing attention over the past several years. Both top-down and bottom-up MS approaches have been very successful in generating a comprehensive map of isolated histone PTMs11,
38,
39,
40,
41,
42,
43. However, in the context of understanding the dynamic interactions and functions of these proteins, a complete view of all the PTMs present at the intact level is necessary. This kind of biological information is best obtained by performing MS/MS on intact histones or large proteolytic fragments containing many modifications43,
44. Top-down MS reports on the combinations of methylations, phosphorylations and acetylations present on the same histone molecule in vivo. MS/MS fragmentation can be used to precisely identify highly similar histone variants and localize PTMs to specific sites. In addition to detecting combinations of PTMs on large fragments, top-down MS has the capability to distinguish positional isomers with the same molecular weight (such as substoichiometric acetylations on multiple lysine residues). Such 'PTM isomers' can be quantified by comparing the ratios of fragment ion abundances produced during MS/MS11. The overall percent occupancy of PTMs present on a particular histone can thus be estimated. Top-down MS can also precisely identify highly related histone family members—for example, the many variants of H2A39 and H2B40).
Protein-level variation. In addition to PTMs, there are other sources of protein-level variation in eukaryotic cells. These include families of highly related genes encoding protein sequences with high identity (Fig. 3a, top), polymorphisms and alternative splicing (Fig. 3a, middle and right). Expression of these variations leads to a mixture of protein forms with slightly different intact masses (Fig. 3a). In addition to resolving such mixtures, top-down MS can also determine expression ratios of intact protein forms21, unlike typical bottom-up approaches or standard RNA-level analysis. When such a mixture is analyzed by bottom-up MS, the peptides produced can either be common to all forms or isoform-specific (Fig. 3b), with the latter often difficult to detect and reassemble in the protein 'scaffold'.
 | |  | Membrane proteins. Top-down MS has also made strides in the analysis of membrane proteins. In past years, substantial progress has been made in adapting this approach to integral membrane proteins (recently reviewed in ref. 45). Fifty-eight thylakoid membrane proteins from a plant have been analyzed using these tailored chromatographic methods46.
The future of top-down MS Top-down mass spectrometry has made valuable contributions to our knowledge of combinations of protein PTMs. The initial contributions and measurement benefits are most clearly viewed through the lens of histone analysis in chromatin biology44. Continued work on histones with the top-down family of approaches will deepen our understanding of PTM dynamics in bulk chromatin and their functions in cancer and epigenetics. We anticipate that large-protein MS will continue to evolve such that sample utilization and overall speed of the measurement will approach the norms of bottom-up techniques. This will allow hundreds of top-down measurements in parallel to detect long range PTM interactions on the same protein and on separate proteins within cellular pathways.
As PTM-focused and general forms of bottom-up MS continue to mature, we project that site-specific modifications will increasingly be added to databases, streamlining the subsequent readout of complex PTM patterns and hierarchies. With improving technology, tandem mass spectrometry above 5 kDa will increasingly be applicable to cellular structures beyond chromatin, helping to advance a major goal of modern proteomics: to develop a deep sense of PTM function, how PTMs relate to each other and how they function as nodes in signaling networks.
Published online: 27 September 2007.
REFERENCES
- Kelleher, N.L. et al. Top down versus bottom up protein characterization by tandem high-resolution mass spectrometry. J. Am. Chem. Soc. 121, 806–812 (1999). | Article | ISI | ChemPort |
- McLafferty, F.W., Fridriksson, E.K., Horn, D.M., Lewis, M.A. & Zubarev, R.A. Biochemistry: biomolecule mass spectrometry. Science 284, 1289–1290 (1999). | Article | PubMed | ISI | ChemPort |
- Reid, G.E. & McLuckey, S.A. 'Top down' protein characterization via tandem mass spectrometry. J. Mass Spectrom. 37, 663–675 (2002). | Article | PubMed | ISI | ChemPort |
- Kelleher, N.L. Top down proteomics. Anal. Chem. 76, 197A–203A (2004). | PubMed |
- Bogdanov, B. & Smith, R.D. Proteomics by FTICR mass spectrometry: top down and bottom up. Mass Spectrom. Rev. 24, 168–200 (2005). | Article | PubMed | ISI | ChemPort |
- McLafferty, F.W. High resolution tandem FT mass spectrometry above 10 kDa. Acc. Chem. Res. 27, 379–386 (1994). | Article | ISI | ChemPort |
- Kelleher, N.L., Costello, C.A., Begley, T.P. & McLafferty, F.W. Thiaminase I (42 kDa) heterogeneity, sequence refinement, and active site location from high-resolution tandem mass spectrometry. J. Am. Soc. Mass Spectrom. 6, 981–984 (1995). | Article | ISI | ChemPort |
- Meng, F. et al. Processing complex mixtures of intact proteins for direct analysis by mass spectrometry. Anal. Chem. 74, 2923–2929 (2002). | Article | PubMed | ISI | ChemPort |
- Meng, F. et al. Molecular-level description of proteins from Saccharomyces cerevisiae using quadrupole FT hybrid mass spectrometry for top down proteomics. Anal. Chem. 76, 2852–2858 (2004). | Article | PubMed | ISI | ChemPort |
- Patrie, S.M. et al. Top down mass spectrometry of <60 kDa proteins from Methanosarcina acetivorans using quadrupole FTMS with automated octapole collisionally activated dissociation. Mol. Cell. Proteomics 5, 14–25 (2006). | PubMed | ISI | ChemPort |
- Pesavento, J.J., Mizzen, C.A. & Kelleher, N.L. Quantitative analysis of modified proteins and their positional isomers by tandem mass spectrometry: human histone H4. Anal. Chem. 78, 4271–4280 (2006). | Article | PubMed | ISI | ChemPort |
- Zabrouskov, V. et al. Stepwise deamidation of ribonuclease A at five sites determined by top down mass spectrometry. Biochemistry 45, 987–992 (2006). | Article | PubMed | ISI | ChemPort |
- Reiber, D.C., Grover, T.A. & Brown, R.S. Identifying proteins using matrix-assisted laser desorption/ionization in-source fragmentation data combined with database searching. Anal. Chem. 70, 673–683 (1998). | Article | PubMed | ISI | ChemPort |
- Ogorzalek Loo, R.R. et al. Virtual 2-D gel electrophoresis: visualization and analysis of the E. coli proteome by mass spectrometry. Anal. Chem. 73, 4063–4070 (2001). | Article | PubMed | ChemPort |
- Johnson, J.R., Meng, F., Forbes, A.J., Cargile, B.J. & Kelleher, N.L. Fourier-transform mass spectrometry for automated fragmentation and identification of 5–20 kDa proteins in mixtures. Electrophoresis 23, 3217–3223 (2002). | Article | PubMed | ISI | ChemPort |
- Sharma, S. et al. Proteomic profiling of intact protein using WAX-RPLC 2-D separations and FTICR mass spectrometry. J. Proteome Res. 6, 602–610 (2007). | Article | PubMed | ISI | ChemPort |
- Zhou, F. & Johnston, M.V. Protein profiling by capillary isoelectric focusing, reversed-phase liquid chromatography, and mass spectrometry. Electrophoresis 26, 1383–1388 (2005). | Article | PubMed | ISI | ChemPort |
- Chong, B.E., Yan, F., Lubman, D.M. & Miller, F.R. Chromatofocusing nonporous reversed-phase high-performance liquid chromatography/electrospray ionization time-of-flight mass spectrometry of proteins from human breast cancer whole cell lysates: a novel two-dimensional liquid chromatography/mass spectrometry method. Rapid Commun. Mass Spectrom. 15, 291–296 (2001). | Article | PubMed | ISI | ChemPort |
- Zhao, J. et al. Proteomic analysis of estrogen response of premalignant human breast cells using a 2-D liquid separation/mass mapping technique. Proteomics 6, 3847–3861 (2006). | Article | PubMed | ISI | ChemPort |
- Nemeth-Cawley, J.F. & Rouse, J.C. Identification and sequencing analysis of intact proteins via collision-induced dissociation and quadrupole time-of-flight mass spectrometry. J. Mass Spectrom. 37, 270–282 (2002). | Article | PubMed | ISI | ChemPort |
- Roth, M.J. et al. Precise and parallel characterization of coding polymorphisms, alternative splicing and modifications in human proteins by mass spectrometry. Mol. Cell. Proteomics 4, 1002–1008 (2005). | PubMed | ISI | ChemPort |
- Hu, Q. et al. The Orbitrap: a new mass spectrometer. J. Mass Spectrom. 40, 430–443 (2005). | Article | PubMed | ISI | ChemPort |
- Scigelova, M. & Makarov, A. Orbitrap mass analyzer - overview and applications in proteomics. Proteomics 6 (suppl. 2), 16–21 (2006).
- Zubarev, R., Kelleher, N. & McLafferty, F. Electron capture dissociation of multiply charged protein cations. A nonergodic process. J. Am. Chem. Soc. 120, 3265–3266 (1998). | Article | ISI | ChemPort |
- Syka, J.E., Coon, J.J., Schroeder, M.J., Shabanowitz, J. & Hunt, D.F. Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. Proc. Natl. Acad. Sci. USA 101, 9528–9533 (2004). | Article | PubMed | ChemPort |
- Coon, J.J. et al. Protein identification using sequential ion/ion reaction and tandem mass spectrometry. Proc. Natl. Acad. Sci. USA 102, 9463–9468 (2005). | Article | PubMed | ChemPort |
- Reid, G.E., Shang, H., Hogan, J.M., Lee, G.U. & McLuckey, S.A. Gas-phase concentration, purification, and identification of whole proteins from complex mixtures. J. Am. Chem. Soc. 124, 7353–7362 (2002). | Article | PubMed | ISI | ChemPort |
- Cargile, B.J., McLuckey, S.A. & Stephenson, J.L., Jr. Identification of bacteriophage MS2 coat protein from E. coli lysates via ion trap collisional activation of intact protein ions. Anal. Chem. 73, 1277–1285 (2001). | Article | PubMed | ISI | ChemPort |
- Chi, A., Bai, D.L., Geer, L.Y., Shabanowitz, J. & Hunt, D.F. Analysis of intact proteins on a chromatographic time scale by electron transfer dissociation tandem mass spectrometry. Int. J. Mass Spectrom. 259, 197–203 (2007). | PubMed | ISI | ChemPort |
- Amunugama, R., Hogan, J.M., Newton, K.A. & McLuckey, S.A. Whole protein dissociation in a quadrupole ion trap: identification of an a priori unknown modified protein. Anal. Chem. 76, 720–727 (2004). | Article | PubMed | ISI | ChemPort |
- Kelleher, N.L. et al. Localization of labile post-translational modifications by electron capture dissociation: the case of
-carboxyglutamic acid. Anal. Chem. 71, 4250–4253 (1999). | Article | PubMed | ISI | ChemPort | - Meng, F. et al. Informatics and multiplexing of intact protein identification in bacteria and the archaea. Nat. Biotechnol. 19, 952–957 (2001). | Article | PubMed | ISI | ChemPort |
- Reid, G.E., Stephenson, J.L., Jr. & McLuckey, S.A. Tandem mass spectrometry of ribonuclease A and B: N-linked glycosylation site analysis of whole protein ions. Anal. Chem. 74, 577–583 (2002). | Article | PubMed | ISI | ChemPort |
- Han, X., Jin, M., Breuker, K. & McLafferty, F.W. Extending top down mass spectrometry to proteins with masses greater than 200 kilodaltons. Science 314, 109–112 (2006). | Article | PubMed | ISI | ChemPort |
- Chi, A. et al. Analysis of phosphorylation sites on proteins from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry. Proc. Natl. Acad. Sci. USA 104, 2193–2198 (2007). | Article | PubMed | ChemPort |
- Taylor, G.K. et al. Web and database software for identification of intact proteins using top down mass spectrometry. Anal. Chem. 75, 4081–4086 (2003). | Article | PubMed | ISI | ChemPort |
- LeDuc, R.D. et al. ProSight PTM: an integrated environment for protein identification and characterization by top-down mass spectrometry. Nucleic Acids Res. [online] 32, W340–W345 (2004)
|