Click here to flash read.
<p>
The 68-kDa homodimeric 3C-like protease of SARS-CoV-2, M
<sup>
pro
</sup>
(3CLpro/Nsp5), is a promising antiviral drug target. We evaluate the concordance of models generated by the newly introduced AlphaFold2 structure prediction program with residual dipolar couplings (RDCs) measured in solution for
<sup>
15
</sup>
N-
<sup>
1
</sup>
H
<sup>
N
</sup>
and
<sup>
13
</sup>
C'-
<sup>
1
</sup>
H
<sup>
N
</sup>
atom pairs. The latter were measured using a new, highly precise TROSY-AntiTROSY Encoded RDC (TATER) experiment. Three sets of AlphaFold2 models were evaluated: (1) M
<sup>
pro
</sup>
<sub>
AF
</sub>
, generated using the standard AlphaFold2 input structural database; (2) M
<sup>
pro
</sup>
<sub>
AFD
</sub>
, where the AlphaFold2 implementation was modified to exclude all candidate template X-ray structures deposited after Jan 1, 2020; and (3) M
<sup>
pro
</sup>
<sub>
AFS
</sub>
, which excluded all structures homologous to coronaviral M
<sup>
pro
</sup>
. Close agreement between all three sets of AlphaFold models and experimental RDC data is found for most of the protein. For residues in well-defined secondary structure, the agreement decreases somewhat upon Amber relaxation. For these regions, M
<sup>
pro
</sup>
<sub>
AF
</sub>
agreement exceeds that of most high-resolution X-ray structures. Residues from domain 2 that comprise elements of both the active site and the homo-dimerization interface fit less well across all structures. These results indicate novel opportunities for combining experimentation with molecular dynamics simulations, where solution RDCs provide highly precise input for QM/MM simulations of substrate binding/reaction trajectories.
</p>