Proteins

S Glycoprotein

The emergence of SARS-CoV-2 has resulted in >90,000 infections and >3,000 deaths. Coronavirus spike (S) glycoproteins promote entry into cells and are the main target of antibodies. We show that SARS-CoV-2 S uses ACE2 to enter cells and that the receptor-binding domains of SARS-CoV-2 S and SARS-CoV S bind with similar affinities to human ACE2, correlating with the efficient spread of SARS-CoV-2 among humans. We found that the SARS-CoV-2 S glycoprotein harbors a furin cleavage site at the boundary between the S1/S2 subunits, which is processed during biogenesis and sets this virus apart from SARS-CoV and SARS-related CoVs. We determined cryo-EM structures of the SARS-CoV-2 S ectodomain trimer, providing a blueprint for the design of vaccines and inhibitors of viral entry. Finally, we demonstrate that SARS-CoV S murine polyclonal antibodies potently inhibited SARS-CoV-2 S mediated entry into cells, indicating that cross-neutralizing antibodies targeting conserved S epitopes can be elicited upon vaccination.

M Protein

The M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope. It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly. Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera. The M protein has three transmembrane domains, flanked by a short amino terminus inside the virion. Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV.

E Protein

The ongoing mutations in the structural proteins of SARS-CoV-2 are the major impediment for prevention and control of the COVID-19 disease. Presently we focused on evolution of the envelope (E) protein, one of the most enigmatic and less studied protein among the four structural proteins (S, E, M and N) associated with multitude of immunopathological functions of SARS-CoV-2. In the present study, we comprehensively analyzed 81,818 high quality E protein sequences of SARS-CoV-2 globally available in the GISAID database as of 20 August 2020. Compared to Wuhan reference strain, our mutational analysis explored only 1.2 % (982/81818) mutant strains undergoing a total of 115 unique amino acid (aa) substitutions in the E protein, highlighting the fact that most (98.8 %) of the E protein of SARS-CoV-2 strains are highly conserved. Moreover, we found 58.77 % (134 of 228) nucleotides (nt) positions of SARS-CoV-2 E gene encountering a total of 176 unique nt-level mutations globally, which may affect the efficacy of real time RT-PCR-based molecular detection of COVID-19. Importantly, higher aa variations observed in the C-terminal domain (CTD) of the E protein, particularly at Ser55-Phe56, Arg69 and the C-terminal end (DLLV: 72–75) may alter the binding of SARS-CoV-2 Envelope protein to tight junction-associated PALS1 and thus could play a key role in COVID-19 pathogenesis. Furthermore, this study revealed the V25A mutation in the transmembrane domain which is a key factor for the homopentameric conformation of E protein. Our analysis also observed a triple cysteine motif harboring mutation (L39M, A41S, A41V, C43F, C43R, C43S, C44Y, N45R) which may hinder the binding of E protein with spike glycoprotein. These results therefore suggest the continuous monitoring of the structural proteins including the envelope protein of SARS-CoV-2 since the number of genome sequences from across the world are continuously increasing.

N Protein

The nucleocapsid (N) protein is an important antigen for coronavirus, which participate in RNA package and virus particle release. In this study, we expressed the N protein of SARS-CoV-2 and characterized its biochemical properties. Static light scattering, size exclusive chromatography, and small-angle X-ray scattering (SAXS) showed that the purified N protein is largely a dimer in solution. CD spectra showed that it has a high percentage of disordered region at room temperature while it was best structured at 55 °C, suggesting its structural dynamics. Fluorescence polarization assay showed it has non-specific nucleic acid binding capability, which raised a concern in using it as a diagnostic marker. Immunoblot assays confirmed the presence of IgA, IgM and IgG antibodies against N antigen in COVID-19 infection patients' sera, proving the importance of this antigen in host immunity and diagnostics.