Robust deep learning-based protein sequence design using ProteinMPNN

J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R.J. Ragotte, L.F. Milles, B.I.M. Wicky, A. Courbet, R.J. de Haas, N. Bethel, P.J.Y. Leung, T.F. Huddy, S. Pellock, D. Tischer, F. Chan, B. Koepnick, H. Nguyen, A. Kang, B. Sankaran, A.K. BeraN.P. King, D. Baker*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

414 Citations (Scopus)

Abstract

Although deep learning has revolutionized protein structure prediction, almost all experimentally characterized de novo protein designs have been generated using physically based approaches such as Rosetta. Here, we describe a deep learning-based protein sequence design method, ProteinMPNN, that has outstanding performance in both in silico and experimental tests. On native protein backbones, ProteinMPNN has a sequence recovery of 52.4% compared with 32.9% for Rosetta. The amino acid sequence at different positions can be coupled between single or multiple chains, enabling application to a wide range of current protein design challenges. We demonstrate the broad utility and high accuracy of ProteinMPNN using x-ray crystallography, cryo-electron microscopy, and functional studies by rescuing previously failed designs, which were made using Rosetta or AlphaFold, of protein monomers, cyclic homo-oligomers, tetrahedral nanoparticles, and target-binding proteins.

Original languageEnglish
Pages (from-to)49-56
JournalScience (New York, N.Y.)
Volume378
Issue number6615
DOIs
Publication statusPublished - 15 Sept 2022

Fingerprint

Dive into the research topics of 'Robust deep learning-based protein sequence design using ProteinMPNN'. Together they form a unique fingerprint.

Cite this