Computational Methods for Understanding Mass Spectrometry–Based Shotgun Proteomics Data

Pavel Sinitcyn, Jan Daniel Rudolph, Jürgen Cox*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Computational proteomics is the data science concerned with the identification and quantification of proteins from high-throughput data and the biological interpretation of their concentration changes, posttranslational modifications, interactions, and subcellular localizations. Today, these data most often originate from mass spectrometry–based shotgun proteomics experiments. In this review, we survey computational methods for the analysis of such proteomics data, focusing on the explanation of the key concepts. Starting with mass spectrometric feature detection, we then cover methods for the identification of peptides. Subsequently, protein inference and the control of false discovery rates are highly important topics covered. We then discuss methods for the quantification of peptides and proteins. A section on downstream data analysis covers exploratory statistics, network analysis, machine learning, and multiomics data integration. Finally, we discuss current developments and provide an outlook on what the near future of computational proteomics might bear.
Original languageEnglish
Pages (from-to)207-234
JournalAnnual Review of Biomedical Data Science
Volume1
DOIs
Publication statusPublished - 4 May 2018

Fingerprint

Dive into the research topics of 'Computational Methods for Understanding Mass Spectrometry–Based Shotgun Proteomics Data'. Together they form a unique fingerprint.

Cite this