Role of spatial patterning of N-protein interactions in SARS-CoV-2 genome packaging
Abstract
Viruses must efficiently and specifically package their genomes while excluding cellular nucleic acids and viral sub-genomic fragments. Some viruses use specific packaging signals, which are conserved sequence/structure motifs present only in the full-length genome. Recent work has shown that viral proteins important for packaging can undergo liquid-liquid phase separation (LLPS), where one or two viral nucleic acid binding proteins condense with the genome. The compositional simplicity of viral components lends itself well to theoretical modeling compared to more complex cellular organelles. Viral LLPS can be limited to one or two viral proteins and a single genome that is enriched in LLPS-promoting features. In our previous study, we observed that LLPS-promoting sequences of SARS-CoV-2 are located at the 5ʹ and 3ʹ ends of the genome, whereas the middle of the genome is predicted to consist mostly of solubilizing elements. Is this arrangement sufficient to drive single genome packaging, genome compaction, and genome cyclization? We addressed these questions using a coarse-grained polymer model, LASSI, to study the LLPS of nucleocapsid protein with RNA sequences that either promote LLPS or solubilization. With respect to genome cyclization, we find the most optimal arrangement restricts LLPS-promoting elements to the 5ʹ and 3ʹ ends of the genome, consistent with the native spatial patterning. Genome compaction is enhanced by clustered LLPS-promoting binding sites, while single genome packaging is most efficient when binding sites are distributed throughout the genome. These results suggest that many and variably positioned LLPS-promoting signals can support packaging in the absence of a singular packaging signal which argues against necessity of such a feature. We hypothesize that this model should be generalizable to multiple viruses as well as cellular organelles like paraspeckles, which enrich specific, long RNA sequences in a defined arrangement.
Statement of significance
The COVID-19 pandemic has motivated research of the basic mechanisms of coronavirus replication. A major challenge faced by viruses such as SARS-CoV-2 is the selective packaging of a large genome in a relatively small capsid while excluding host and sub-genomic nucleic acids. Genomic RNA of SARS-CoV-2 can condense with the Nucleocapsid (N-protein), a structural protein component critical for packaging of many viruses. Notably, certain regions of the genomic RNA drive condensation of N-protein while other regions solubilize it. Here, we explore how the spatial patterning of these opposing elements promotes single genome compaction, packaging, and cyclization. This model informs futurein silicoexperiments addressing spatial patterning of genomic features that are experimentally intractable because of the length of the genome.
Related articles
Related articles are currently not available for this article.