Covariant Fitness Clusters Reveal Structural Evolution of SARS-CoV-2 Polymerase Across the Human Population
Abstract
Understanding the fitness landscape of viral mutations is crucial for uncovering the evolutionary mechanisms contributing to pandemic behavior. Here, we apply a Gaussian process regression (GPR) based machine learning approach that generates spatial covariance (SCV) relationships to construct stability fitness landscapes for the RNA-dependent RNA polymerase (RdRp) of SARS- CoV-2. GPR generated fitness scores capture on a residue-by-residue basis a covariant fitness cluster centered at the C487-H642-C645-C646 Zn2+binding motif that iteratively evolves since the early phase pandemic. In the Alpha and Delta variant of concern (VOC), multi-residue SCV interactions in the NiRAN domain form a second fitness cluster contributing to spread. Strikingly, a novel third fitness cluster harboring a Delta VOC basal mutation G671S augments RdRp structural plasticity to potentially promote rapid spread through viral load. GPR principled SCV provides a generalizable tool to mechanistically understand evolution of viral genomes at atomic resolution contributing to fitness at the pathogen-host interface.
Related articles
Related articles are currently not available for this article.