Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue

Zuzana Jandova, Attilio Vittorio Vargiu, Alexandre M.J.J. Bonvin*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Molecular docking excels at creating a plethora of potential models of protein-protein complexes. To correctly distinguish the favorable, native-like models from the remaining ones remains, however, a challenge. We assessed here if a protocol based on molecular dynamics (MD) simulations would allow distinguishing native from non-native models to complement scoring functions used in docking. To this end, the first models for 25 protein-protein complexes were generated using HADDOCK. Next, MD simulations complemented with machine learning were used to discriminate between native and non-native complexes based on a combination of metrics reporting on the stability of the initial models. Native models showed higher stability in almost all measured properties, including the key ones used for scoring in the Critical Assessment of PRedicted Interaction (CAPRI) competition, namely the positional root mean square deviations and fraction of native contacts from the initial docked model. A random forest classifier was trained, reaching a 0.85 accuracy in correctly distinguishing native from non-native complexes. Reasonably modest simulation lengths of the order of 50-100 ns are sufficient to reach this accuracy, which makes this approach applicable in practice.

Original languageEnglish
Pages (from-to)5944-5954
Number of pages11
JournalJournal of Chemical Theory and Computation
Volume17
Issue number9
DOIs
Publication statusPublished - 14 Sept 2021

Bibliographical note

Funding Information:
This work was financially supported by the European Union Horizon 2020 projects BioExcel (675728, 823830).

Funding Information:
The authors thank the entire computational structural biology group at the Utrecht University for fruitful discussions, and, in particular, Dr. Francesco Ambrosetti for his input on machine learning. This research used the Savio computational cluster resource provided by the Berkeley Research Computing program at the University of California, Berkeley (supported by the UC Berkeley Chancellor, Vice Chancellor for Research, and Chief Information Officer).

Publisher Copyright:
© 2021 The Authors. Published by American Chemical Society

Funding

This work was financially supported by the European Union Horizon 2020 projects BioExcel (675728, 823830). The authors thank the entire computational structural biology group at the Utrecht University for fruitful discussions, and, in particular, Dr. Francesco Ambrosetti for his input on machine learning. This research used the Savio computational cluster resource provided by the Berkeley Research Computing program at the University of California, Berkeley (supported by the UC Berkeley Chancellor, Vice Chancellor for Research, and Chief Information Officer).

Fingerprint

Dive into the research topics of 'Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue'. Together they form a unique fingerprint.

Cite this