Gene and protein sequence features augment HLA class I ligand predictions

Kaspar Bresser, Benoit P Nicolet, Anita Jeko, Wei Wu, Fabricio Loayza-Puch, Reuven Agami, Albert J R Heck, Monika C Wolkers, Ton N Schumacher*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The sensitivity of malignant tissues to T cell-based immunotherapies depends on the presence of targetable human leukocyte antigen (HLA) class I ligands. Peptide-intrinsic factors, such as HLA class I affinity and proteasomal processing, have been established as determinants of HLA ligand presentation. However, the role of gene and protein sequence features as determinants of epitope presentation has not been systematically evaluated. We perform HLA ligandome mass spectrometry to evaluate the contribution of 7,135 gene and protein sequence features to HLA sampling. This analysis reveals that a number of predicted modifiers of mRNA and protein abundance and turnover, including predicted mRNA methylation and protein ubiquitination sites, inform on the presence of HLA ligands. Importantly, integration of such "hard-coded" sequence features into a machine learning approach augments HLA ligand predictions to a comparable degree as experimental measures of gene expression. Our study highlights the value of gene and protein features for HLA ligand predictions.

Original languageEnglish
Article number114325
Number of pages23
JournalCell Reports
Volume43
Issue number6
Early online date11 Jun 2024
DOIs
Publication statusPublished - 11 Jun 2024

Bibliographical note

Publisher Copyright:
© 2024 The Authors

Funding

We would like to thank Mireille Toebes for helpful input during generation of the HLA class I ligandome dataset. We would like to thank Prof. Dr. Daniel Peeper for kindly providing the SK-MEL-95 and M026.X1 cell lines. This work was supported by Institute for Chemical Immunology (ICI) grant 003 (to T.N.S.). K.B. conceptualization, methodology, validation, formal analysis, investigation, data curation, writing \u2013 original draft, writing \u2013 review & editing, visualization; B.P.N. conceptualization, methodology, formal analysis, writing \u2013 review & editing; A.J. formal analysis, methodology, and investigation; W.D. data curation; F.L.-P. investigation, methodology, and resources; R.A. methodology, resources, and supervision; A.J.R.H. conceptualization, methodology, resources, supervision, and funding acquisition; M.C.W. methodology, writing \u2013 review & editing, and supervision; T.N.S. conceptualization, methodology, writing \u2013 review & editing, supervision, and funding acquisition. T.N.S. is an advisor for Allogene Therapeutics, Asher Bio, Merus, Neogene Therapeutics, and Scenic Biotech; a stockholder in Allogene Therapeutics, Asher Bio, Cell Control, Celsius, Merus, and Scenic Biotech; and a venture partner at Third Rock Ventures; all outside of the current work. We would like to thank Mireille Toebes for helpful input during generation of the HLA class I ligandome dataset. We would like to thank Prof. Dr. Daniel Peeper for kindly providing the SK-MEL-95 and M026.X1 cell lines. This work was supported by Institute for Chemical Immunology (ICI) grant 003 (to T.N.S.).

FundersFunder number
Scenic Biotech
Institute for Chemical Immunology003

    Keywords

    • CP: Immunology
    • HLA class I
    • HLA ligand predictions
    • HLA ligandome
    • XGBoost
    • antigen presentation
    • epitope prediction
    • epitopes
    • machine learning

    Fingerprint

    Dive into the research topics of 'Gene and protein sequence features augment HLA class I ligand predictions'. Together they form a unique fingerprint.

    Cite this