Functionality and translation fidelity characterization of mRNA vaccines using platform based mass spectrometry detection

Introduction
The success of the Pfizer-BioNTech and Moderna mRNA vaccines against severe acute respiratory syndrome coronavirus disease 2019 (SARS-CoV-2) has placed the spotlight on mRNA as an appealing platform for vaccinating against a wide range of diseases (e.g., influenza1, HIV2, and RSV3)4,5. mRNA vaccines often consist of lipid nanoparticles (LNP) encapsulating mRNA encoding the antigen sequence which, when translated by host cells, leads to antigen presentation and stimulation of immune response. Key advantages of mRNA vaccines include ease of manufacture, amenability to updating the antigen sequence to match rapidly mutating pathogens (e.g., SARS-CoV-2), and the ability to multiplex different mRNA constructs within the same vaccine product6. The use of modified ribonucleotides is common in mRNA-based therapeutics or vaccines because they decrease innate immunogenicity and increase mRNA stability, leading to the recognition of these advancements with the 2023 Nobel Prize in Physiology or Medicine7,8,9,10. As an example, the clinically approved SARS-CoV-2 mRNA vaccines incorporate N1-methylpseudouridine to achieve these attributes11,12. Using in vitro-transcribed mRNA as a therapeutic or vaccine is reliant on accurate translation of the encoded antigen protein following transfection into target cells, yet the use of modified ribonucleotides and the corresponding impact on mRNA translation fidelity were, until recently, not thoroughly investigated.
Recent work by Mulroney and coworkers investigated the impact that N1-methylpseudouridylation has on mRNA translation integrity and demonstrated that this modification results in +1 ribosomal frameshifting in vitro, albeit at significantly lower abundance than the in-frame translated protein13. This was observed along with cellular immunity in both mice and humans to the +1 frameshifted products from the BNT162b2 (SARS-CoV-2) vaccine (trade name: Comirnaty; generic name: tozinameran) mRNA following vaccination13. This +1 ribosomal frameshifting occurred in one of the regions containing a “slippery sequence” and the aforementioned work by Mulroney, et al. suggests that optimizing slippery sequences is a proactive and effective strategy for reducing the translation of frameshifted products. While programmed ribosomal frameshifting in either direction (+ or −) can be naturally occurring as an important part of virus replication and immune system evasion—for example, within influenza A14 and SARS-CoV-2 virus expression15—the finding of +1 ribosomal frameshifting in modified mRNA highlights an opportunity to proactively characterize this phenomenon during vaccine development. Despite no adverse outcomes being reported due to the incorrect translation of these vaccines associated with frameshifted products, this serves as a cautionary tale to optimize the mRNA sequence to prevent off-target effects for future mRNA therapeutics and vaccines.
To support the development, optimization, and production of mRNA vaccines, a wide range of analytical methods have been developed to better understand and monitor critical quality attributes (CQAs) of both the mRNA itself, the LNP, and the mRNA-LNP complex16,17. For mRNA, examples of these CQAs include sequence identity, 5′-capping efficiency, 3′ poly(A) tail length, and mRNA integrity (i.e., intactness)—all properties that ultimately affect the translation of antigen protein and, therefore, vaccine efficacy. To complement these methods for characterizing specific attributes of the mRNA, it is also important to have methods for evaluating the overall functionality of the mRNA, defined here as the successful translation of antigen by the ribosome18. To this end, we developed an analytical assay to assess mRNA functionality based on cell-free translation (CFT), also known as in vitro translation (IVT) or cell-free protein synthesis (CFPS) in the literature19,20. Unlike cell-based expression systems that require cell culture over multiple days, CFT is capable of rapidly assessing mRNA functionality as protein translation is observed within just a couple of hours. Moreover, CFT is performed by delivering mRNA directly to the ribosomal machinery, whereas in cell-based expression systems, measurement of mRNA functionality can be confounded by the quality of the LNP used to deliver the mRNA into the cells. Several CFT systems are commercially available, derived from bacteria, yeast, plant, insect, and mammalian cells19,20,21. Of these, wheat germ extract (WGE) is well established in the literature and is reported to have the highest yields of any eukaryotic CFT system22,23,24, thus making it the logical starting point for the development of a CFT assay to assess mRNA functionality. While CFT has a role in providing a rapid means for assessing mRNA functionality without the confounding effects of the LNP delivery system, cell-based translation (CBT) may be used for the assessment of mRNA drug product formulated in LNPs and often provides a more representative output of mRNA functionality in vivo. For these reasons, we also assessed the ability to characterize mRNA quality and functionality using CBT.
To quantitate the amount of antigen translated by WGE-CFT, an automated capillary Western blot method, Simple Western™ (SW), was chosen for initial method development. SW consists of protein separation by capillary electrophoresis (CE) followed by immobilization of the separated proteins to the capillary wall and immunodetection with a suitable primary antibody. SW measures both the molecular weight and relative signal of immunodetected proteins. However, it does not provide sequence-level information, thus limiting its specificity and ability to detect frame-shifting events during antigen translation from mRNA. A significant drawback of SW is its reliance on antigen-specific primary antibodies as a critical reagent. For multivalent vaccines, which may contain mRNA encoding variants of the same antigen with high levels of similarity, it may prove challenging to develop antibodies that are highly specific to each antigen. Additionally, if the mRNA antigen sequence is updated in response to newly emerged or seasonal serotypes, antibody redevelopment may be required which can be both time consuming and costly.
To overcome the potential limitations of using an immunodetection method, we sought to demonstrate the viability of using liquid chromatography–tandem mass spectrometry (LC–MS/MS, or “MS” as used throughout this manuscript) as an antibody-free alternative for the identification and relative quantification of antigen produced by CFT or CBT of mRNA. Bottom-up MS involves subjecting a mixture of proteins to enzymatic digestion and analyzing the resulting peptide mixture by MS. The resulting spectra can then be matched against a protein database that has been “digested” in silico using the enzyme of choice for identification of proteins in the sample. Bottom-up MS is a highly specific and sensitive technique, allowing for distinction between proteins even with high sequence homology without relying on specific antibodies. This enables an antibody-free platform-based approach to protein detection and characterization.
Diseases that mutate rapidly necessitate expedited vaccine development to provide relevant and often seasonal protection. Examples include COVID-19 (SARS-CoV-2) and influenza. Responding to such diseases requires analytical assays that are readily available and don’t require months of development lead time. Because of the rapid mutation of SARS-CoV-2, Delta (B.1.617.2) spike protein and Omicron (B.1.1.529) spike protein mRNA constructs serve as ideal examples to demonstrate the utility of such methods. Here, we demonstrate the utility of this platform-based MS method in detecting, characterizing, and providing relative quantification of proteins translated from CFT of mRNA. We also evaluated platform-based MS detection in a proof-of-concept experiment wherein MS was used to detect and provide relative quantification of proteins translated from a hexavalent mRNA drug product transfected in human cells using CBT. Furthermore, we demonstrate that using a CFT-MS approach, we can interrogate +1 ribosomal frameshifting occurring from N1-methylpsuedouridylation of the mRNA and enable identification of the frameshifting junction. The resulting workflow is a sensitive, antibody-free, platform-based approach that can be used to confirm identity and provide relative quantification of protein(s) translated from mRNA from either CFT or CBT for the assessment of mRNA quality, functionality, and translation fidelity.
Results
As a means of generating translated protein to assess mRNA functionality (i.e., translation of antigen protein), we utilized two approaches: cell-free translation (CFT) using wheat germ extract and cell-based translation (CBT) using Huh7 cells. Each mRNA translation approach has its own advantages as discussed previously. Detection of translated protein(s) following CFT is often accomplished using methods such as radiolabeling, immunoprobe (e.g., Western blotting, enzyme-linked immunosorbent assay (ELISA)), or functional assays like Luciferase activity readout25. Among these methods, we selected automated Western blotting by Simple Western™ (SW) due to its relatively high throughput, automation, reproducibility, and availability of antibodies for antigen detection26,27,28. However, the CFT-SW approach still relies on suitable antibodies for antigen detection, which can pose challenges including increased assay development time and cost for antibody screening, high detection limits and variability when using poor quality antibodies, the need for alternative detection assays if suitable antibodies cannot be found, and limited ability to multiplex detection for analysis of mRNA mixtures due to antibody cross-reactivity. Mass spectrometry, on the other hand, is a highly sensitive technique that lends itself well to a platform-based approach for protein detection and relative quantification, alleviating the need to perform antibody screening and re-development for each mRNA construct or antigen target. For this reason, we coupled MS to both CFT and CBT approaches. These workflows are outlined in Fig. 1.

Scheme outlining the cell-free and cell-based translation workflows coupled to Simple Western™ (CFT-SW) or reversed-phase liquid chromatography tandem mass spectrometry (CFT-MS and CBT-MS) workflows for detection of proteins translated from mRNA drug substances or drug products.
Assessment of mRNA functionality by detecting translated protein identity and sequence confirmation by CFT-MS
Another key advantage of mass spectrometry-based detection of proteins is the ability to characterize the primary structure of the translated protein(s). This enables confirmation of mRNA identity and quality as assessment of the translated protein sequence can ensure that the mRNA encodes for the correct protein sequence and has an appropriately located stop codon. Following the initial development of the CFT-MS workflow, assay performance was evaluated to demonstrate this capability. To accomplish this, mRNA encoding for the SARS-CoV-2 Delta (B.1.617.2) spike protein was analyzed using the CFT-MS workflow where protein digestion was accomplished with trypsin, chymotrypsin, or alpha lytic protease. While trypsin can be selected as a broadly applicable enzyme for a platform-based approach that allows for protein detection, relative abundance, and primary sequence confirmation, the depth of sequence coverage is dependent on the protein sequence of interest. Therefore, the use of additional enzymes suited to the protein sequence can help improve the sequence coverage, if desired. We analyzed the resulting data from CFT-MS analysis using a peptide mapping approach to confirm the sequence of the SARS-CoV-2 Delta (B.1.617.2) spike protein (Fig. 2). Results show that when using these three digestion enzymes, we obtained 94% sequence coverage of this spike protein using MS/MS matches. If mass-only matches are also considered, the sequence coverage increases to 99%. This demonstrates the advantage of using CFT-MS to ensure that the mRNA codes for the expected protein sequence. Additionally, coverage of the N-terminus and C-terminus advantageously enables confirmation that the entirety of the mRNA open reading frame is translated, and the stop codon is in the correct location. If high throughput and accurate quantitation of translated protein(s) without complete sequence confirmation is desired, CFT-MS could be performed with targeted MS analysis that tracks and quantifies selected surrogate peptides, such as with multiple reaction monitoring (MRM) based isotope dilution mass spectrometry (IDMS)29. The use of wheat germ extract for CFT simplifies MS analysis as it does not result in glycosylation of the translated protein due the lack of necessary cell machinery in the reaction mixture24. The use of alternative CFT systems, such as HeLa lysate, can be used to assess glycosylation patterns on the antigenic protein as the presence of host glycans can greatly impact immune response30.

Amino acids confirmed with MS/MS matches are represented in green and amino acids identified with mass only matches are represented in blue. Black amino acids indicate no mass only or MS/MS matches were identified. Overall, 94% sequence coverage was obtained from CFT-MS analysis with 13.1 µg/mL mRNA and considering only MS/MS matches. Both the N-terminus and C-terminus were covered with MS/MS matches, allowing verification of correct translation through the entire protein sequence. Peptide mapping results were obtained by combining results from CFT-MS analysis of Delta (B.1.617.2) spike protein using the enzymes trypsin, chymotrypsin, and alpha lytic protease. Additional details regarding which peptides were identified by each enzyme are included in Supplementary Fig. 1.
Assessment of mRNA functionality by detecting mRNA dose dependent protein translation by CFT-SW and CFT-MS
Next, we translated SARS-CoV-2 Delta (B.1.617.2) spike protein mRNA via CFT at varying concentrations of mRNA in order to evaluate the relationship between mRNA concentration and protein translation by CFT. The relative abundance of translated protein was measured by both SW and MS to compare the two workflows (Fig. 3). Results between the two workflows are remarkably similar, showing a strong linear response between 0 and 13.1 µg/mL mRNA, maximum signal at 52.4 µg/mL mRNA, and signal decrease at higher concentrations of mRNA. While initial analysis by CFT-SW was done with a highly sensitive antibody (“antibody 1”), Simple Western™ was repeated using a less sensitive antibody (“antibody 2”) to verify that the maximum observed at 52.4 µg/mL mRNA was not due to detector saturation (results included in Supplementary Fig. 2 and Supplementary Fig. 3). Because the observed mRNA concentration:antigen translation profile is the same regardless of antibody sensitivity or protein detection technique, these findings suggest that there is a diminishing return to increasing the mRNA concentration when it comes to cell-free translation. The slight decrease in translated protein when mRNA concentration exceeds 52.4 µg/mL is possibly due to mRNA aggregating at higher concentrations, as this loss of protein translation has been observed to varying degrees from other mRNA constructs (data not shown). Ultimately, this dose curve demonstrates the ability of both CFT workflows to generate accurate and repeatable relative abundance information about proteins translated from mRNA and aids in establishing a linear range for each assay.

Normalization to calculate relative abundance values as shown was performed using the sample with the maximum translated protein as 100%. Insets show the linear range for each workflow as highlighted by the gray shaded box, demonstrating strong linearity (R2 > 0.97) in the region of 0 to 13.1 µg/mL mRNA. Plots that show the actual (not normalized) peak area (CFT-SW) and MS1 area (CFT-MS) of translated Delta (B.1.617.2) spike protein are included in the Supplementary Information (Supplementary Fig. 3). CFT-MS data was corrected and normalized with the spiked-in ADH protein to account for potential sample loss during sample preparation and analysis. Results from each assay are shown as duplicate analyses (data points) and the corresponding average (solid line).
Assessment of mRNA quality using CFT-SW and CFT-MS as stability-indicating assays
Process development and stability assessment rely on assays that provide a functional readout of mRNA quality. To interrogate the ability of the CFT-SW and CFT-MS workflows to provide such information, we incubated SARS-CoV-2 Delta (B.1.617.2) mRNA at 50 °C and sampled at various time points up to 55 h. mRNA integrity at each time point was measured by microchip capillary electrophoresis (MCE) (Fig. 4)31. Additionally, we assessed the functionality of mRNA at each time point by wheat germ CFT combined with either SW or MS detection (Fig. 4). Results from both CFT-SW and CFT-MS demonstrate that the amount of translated protein (relative to the non-degraded time point t = 0) decreases accordingly as the mRNA is degraded. These results indicate that the relative abundance of translated protein from CFT-SW and CFT-MS workflows can indeed be used as a stability-indicating assay for the assessment of mRNA quality and functionality. If absolute quantification of the fully translated protein is required, a mass spectrometry approach that targets the C-terminus to ensure complete protein translation could be employed using single reaction monitoring (SRM) or multiple reaction monitoring (MRM).

Following incubation of SARS-CoV-2 Delta (B.1.617.2) mRNA at 50 °C for varying time points to degrade the mRNA, wheat germ CFT was performed on 2 µg/mL mRNA from each time point. Translated protein was detected by either Simple Western™ (“SW”, red squares) or LC–MS/MS (“MS”, blue circles). The mRNA itself was also analyzed by microchip capillary electrophoresis as a direct measurement of mRNA integrity (“MCE”, green triangles, electropherograms provided in Supplementary Fig. 4). The y-axis shows relative intensity in percent corresponding to: mRNA integrity normalized to t = 0 for MCE, chemiluminescence signal relative to t = 0 for SW, and summed MS1 area relative to t = 0 for MS. CFT-MS data was corrected and normalized with the spiked-in ADH protein. Results from each assay are shown as duplicate analyses (data points) and the corresponding average (solid line).
Assessment of mRNA functionality by evaluating multi-valent mRNA mixtures with high translated protein sequence homology by CFT-SW and CFT-MS
The adaptation of viruses to evade immune response creates variants that may require rapid revision of vaccine development to target the appropriate antigen. To complicate this further, several variants of the same virus may be in circulation at a given time. One such approach to tackle these challenges is to generate a multi-valent vaccine aimed at targeting several variants of the virus of interest. While multi-valent mRNA vaccines have already proven helpful in the fight against COVID-1932, an increase in valency can create analytical challenges, especially if the encoded antigens share high sequence similarity. For example, the spike proteins of the Delta (B.1.617.2) and Omicron (B.1.1.529) strains of SARS-CoV-2 share over 97% sequence homology (Supplementary Fig. 5). As such, selective immunodetection of antigens translated from a bivalent Delta+Omicron vaccine drug substance demonstrates a significant analytical challenge.
Even with the example shown here, extensive screening was needed to identify SARS-CoV-2 antibodies specific to Delta (B.1.617.2) and Omicron (B.1.1.529) spike protein (and even the Omicron (B.1.1.529)-specific antibody shows weak affinity to Delta (B.1.617.2) (Fig. 5A)). This challenge that arises when using CFT-SW is most likely to occur when analyzing samples with high sequence homology. Subjecting the Delta strain spike protein mRNA and the Omicron spike protein mRNA to CFT independently and then analyzing by SW shows that an antibody that should be specific to the Omicron (B.1.1.529) spike protein results in weak signal from the Delta (B.1.617.2) sample (Fig. 5A) due to antibody cross-reactivity. Similarly, CFT-SW of these same samples using an antibody that should be specific to the Delta (B.1.617.2) spike protein results in signal from the Omicron (B.1.1.529) sample in addition to non-specific binding observed in the blank control sample (Fig. 5B). While the mRNA constructs that encode for each of these proteins may be analyzed individually to avoid these challenges, there are instances where analysis of the mRNA mixture may be necessary to better understand the functionality of the mRNA constructs in the presence of one another, such as with co-formulated drug product characterization.

A Resulting profiles from CFT-SW of a mixture of SARS-CoV-2 spike protein Delta (B.1.617.2, blue) and Omicron (B.1.1.529, green) strain mRNA using an antibody that should be specific to Omicron (B.1.1.529) only (A) or specific to Delta (B.1.617.2) only (B), demonstrating the cross-reactivity of antibodies that can occur when proteins have high sequence homology (in this case, >97% sequence similarity). Alternatively, analyzing multiple mixtures of Delta (B.1.617.2) and Omicron (B.1.1.529) spike protein mRNA with CFT-MS demonstrates the utility in using mass spectrometry for such a scenario (C). Relative abundance (%) of detected Delta (B.1.617.2) and Omicron (B.1.1.529) spike protein following CFT of mRNA mixtures with varying ratios (solid data points) agrees well with the theoretical values based on mixing ratios (dashed lines).
To combat this challenge, the CFT-MS workflow may be employed. Because MS is a highly selective technique capable of identifying portions of the protein sequence that have unique amino acid combinations even in cases of high overall sequence homology, this antibody-free approach overcomes the multiplexing limitations of cross-reactivity in antibody-based analysis of mRNA mixtures. After mixing the Delta (B.1.617.2) strain spike protein mRNA and the Omicron (B.1.1.529) strain spike protein mRNA at varying ratios (100:0, 75:25, 50:50, 25:75, and 0:100 Delta:Omicron; experimental design illustrated in Supplementary Fig. 6), the bivalent mRNA mixtures were then subjected to CFT-MS. Following identification of the translated protein(s) through this experiment, the relative abundance of Delta (B.1.617.2) or Omicron (B.1.1.529) spike protein compared to total spike protein within the respective sample was calculated as outlined in the methods section. From this, we observe that the relative amount of each translated spike protein matches the theoretical mixing ratios of the respective mRNA constructs (Fig. 5C). For example, when the bivalent mRNA mixture consisted of 25% Delta mRNA and 75% Omicron mRNA, we observe approximately 25% Delta spike protein and 75% Omicron spike protein relative abundance among the two identified spike proteins. Representative MS/MS spectra showing confident and unambiguous assignment of tryptic peptides unique to each spike protein sequence along with examples of MS/MS spectra from of tryptic peptides that overlap between the two are included in Figs. S8–S10. Additionally, examples of the MS1 peak area of unique peptides from Delta and Omicron spike proteins are included in Supplementary Fig. S10, demonstrating how their abundances correlate with the mRNA mixing ratio. MS1 peak areas from peptides in common between the two proteins and observed across all mixing ratios are shown in Supplementary Fig. 11, showing that they are present at a relatively consistent abundance and is indicative of comparable translation efficiencies between the two mRNA constructs.
This highlights the ability of the platform-based MS detection approach to multiplex protein detection even in cases of high sequence homology, which is particularly advantageous in the analysis of multi-valent mRNA vaccines. It is important to note, however, that different mRNA constructs may have varying translation efficiencies33,34 which can result in translated protein relative abundances that do not necessarily reflect the ratio at which the mRNAs were mixed. Because the Delta (B.1.617.2) mRNA and Omicron (B.1.1.529) mRNA used in this work appear to have comparable translation efficiencies, they serve as a clear proof-of-concept for the value of using a platform-based mass spectrometry detection approach to generate relative abundance information of proteins translated from mRNAs, even in mixtures.
Assessment of mRNA functionality of formulated multi-valent mRNA drug product in human cells by CBT-MS
Using mass spectrometry to characterize proteins translated from individual or multi-valent mRNA mixtures following cell-free translation allows for rapid feedback about mRNA quality and functionality in early stages of development without the potentially confounding impact of lipid nanoparticle (LNP) delivery systems as mentioned previously. However, there remain advantages to using a cell-based translation assay such as the ability to interrogate the impact of selected LNP formulations on mRNA translation. Because of this, we tested the feasibility of characterizing proteins generated using a human cell line (Huh7). In this proof-of-concept study, the cells were transfected with two different doses (either 6.67 ng per mRNA or 4.44 ng per mRNA) of a hexavalent mRNA drug product (DP). Each of the six mRNAs in the DP used here encoded for a different spike protein, each with varying degrees of sequence homology (22–97%, Supplementary Fig. 12). Following incubation, the cells were lysed and the proteins analyzed using the platform-based MS detection workflow. Results show the clear confirmation of protein translation from each of the six mRNA constructs (Fig. 6). Representative MS/MS spectra used to show confident and unambiguous assignment of tryptic peptides unique to the translated protein sequence of each mRNA construct along with examples of MS/MS spectra from of tryptic peptides that overlap between some mRNA constructs are included in Figs. S14–S16.

Resulting protein translation from hexavalent mRNA vaccine incubated with Huh7 cells at a dose of 6.67 ng per mRNA construct (blue) or 4.44 ng per mRNA construct. Translated proteins were detected with the LC–MS/MS platform workflow as described here and protein translation is shown as relative abundance (%) compared to the most abundant protein derived from mRNA1. Results show that the proteins are translated in an mRNA construct and dose-dependent manner.
Comparing the abundance of each translated protein relative to the most abundant translated protein, results show that the cells transfected with 6.67 ng per mRNA produce a greater amount of each protein than those transfected with 4.44 ng per mRNA. This indicates a logical dependence of protein translation on the mRNA dose given to cells and agrees with the CFT-MS dose dependent experiments outlined earlier in this manuscript. Within a given mRNA dosage, e.g., comparing the six translated proteins within the 6.67 ng per mRNA dose, the relative amount of translated protein appear to have differences. Absolute quantification will be required to confirm the expression differences from multi-valent mRNA to compare mRNA translation efficiency between the constructs. Future work will interrogate the impact of multi-valency on mRNA translation efficiency compared with single valency in addition to establishing cell-based potency assays with high quality antibodies35,36. Understanding nuances in mRNA translation efficiency is necessary for proper balancing of multi-valent vaccines and methods such as CFT-MS and CBT-MS provide an opportunity to control vaccine production, increase product knowledge, and drive consistency. Ultimately, looking at the neutralized antibodies in mice or non-human primates would provide critical information regarding the impact of multi-valency in vivo before human clinical study.
Assessment of translation fidelity by identifying off-target protein translation and characterizing sites of +1 ribosomal frameshifting with CFT-MS
The use of modified ribonucleotides in therapeutic mRNAs is critical in decreasing innate immunogenicity and improving mRNA stability8,9,10. Yet despite the advantages, there is still much to be understood about the impact on mRNA translation fidelity. For example, a recent report demonstrated the observation of +1 ribosomal frameshifting at “slippery sites” in N1-methylpsuedouridine modified mRNAs. This unintended frameshifting can ultimately change the protein sequence that is translated by the ribosome following the frameshifting junction. Of note, the BNT162b2 SARS-CoV-2 mRNA vaccine that uses N1-methylpseudouridine was also shown to induce off-target immunity in both mice and humans to the “mistranslated” protein following vaccination. Despite no evidence to-date of off-target effects from production of the frameshifted protein following vaccination, this work represents a clear example to why it is important to enable analytical detection and characterization of these frameshifted products in an effort to identify such sites that may require further mRNA sequence optimization.
In the Delta (B.1.617.2) spike mRNA used throughout this manuscript, the codon sequence was optimized using an algorithm that did not consider removal of these slippery sequences. Therefore, there are sites that may result in unintended +1 ribosomal frameshifting (Fig. 7). To test whether these sites could be detected from workflows established here, the Delta (B.1.617.2) spike mRNA was subjected to CFT with wheat germ extract and the resulting sample analyzed by the platform-based, untargeted MS detection approach. Analyzing the resulting data by considering both the full in-frame translated protein and one of the possible +1 frameshifted (+1FS) proteins showed a positive identification of both the in-frame and +1FS species. In this instance, the +1FS product was observed at just under 7% relative abundance when comparing the MS1 peak area of the +1FS protein with the in-frame protein, which agrees well with the reported N1-methylpseudouridylated firefly luciferase +1FS observed at 8% of the corresponding in-frame protein13. Only one unique +1FS peptide, GYHLMSFPR, was observed for the +1FS Delta spike protein here. This peptide is indicative of the +1FS product and covers the frameshifting junction. While it would be ideal to have multiple unique +1FS peptides for confirmation, the likelihood of this is at least in part dependent on the mRNA sequence itself. Upon theoretical translation of the mRNA sequence following +1FS at either of the two slippery sites indicated in Fig. 7, only 10 amino acids extend beyond this identified junction peptide before a stop codon is created from the frameshift that puts a halt to protein translation.

Segment of SARS-CoV-2 Delta (B.1.617.2) spike protein mRNA sequence and corresponding translated protein amino acid sequence for in-frame and two possible +1 ribosomal frameshifted products that result in the same off-target amino acid sequence. “XXX” codon motifs are used to obscure proprietary mRNA sequence information. “+1 FS” corresponds to +1 ribosomal frameshift.
Because of the low relative abundance of the +1FS product, the sample was again analyzed by CFT-MS but rather than using the platform-based MS method, the peptide GYHLMSFPR was targeted for fragmentation with no retention time constraints. The resulting data showed high quality MS fragmentation spectra (Fig. 8) with clear confirmation of this peptide at the same retention time it was observed in the untargeted MS analysis. The fragments observed from targeted MS/MS analysis of this peptide unequivocally confirm the presence of the +1FS product and, in this case, the amino acid junction at which frameshifting occurs. In this example, a +1FS at the UUU-C motif or the CCC-C motif could result in the same amino acid sequence upon frameshifted translation. It is possible that one or both of these slippery sites are the root cause of observing the +1FS product, but point mutations of the mRNA sequence would be needed to confirm which is beyond the scope of this work. Of note, the previous report13 used LC–MS/MS to help narrow in on possible junction sites of +1FS and required immunoprecipitation of the translated protein(s) from cell-based translation followed by SDS-PAGE separation and in-gel digestion of the major +1FS species prior to MS analysis, all of which require lengthy sample preparation. Furthermore, that reported workflow in which the major +1FS species were isolated does not consider all possible sites of frameshifting. Advantageously, the CFT-MS workflow demonstrated here enables a more efficient detection approach for assessing +1FS product(s) and can help to identify the corresponding mRNA junction(s).

CFT-MS analysis confirms the presence of the GYHLMSFPR peptide that proves translation of the frameshifted protein via comparing the experimentally-observed MS/MS spectrum (black) with theoretical y- (red), b- (blue), and a-type (green) fragments. Additional confirmed fragments such as internal fragments are not labeled. The location of all identified peptide fragments from MS/MS is shown in the diagram on top of the mass spectrum, demonstrating thorough coverage of the C-terminus where the +1 frameshifted amino acid sequence begins.
Discussion
Assessment of protein translation from mRNA provides valuable insights into mRNA functionality and quality. Cell-free translation (CFT) offers rapid sample preparation and a simplified model for protein translation, free from the confounding effects of lipid nanoparticles (LNPs). On the other hand, cell-based translation (CBT) assesses mRNA formulated in LNPs and provides a more representative understanding of mRNA functionality in vivo. While each translation method has its unique applications, detecting translated proteins using antibody-based methods can pose challenges due to the expensive and time-consuming antibody screening process as well as the lack of antibody specificity. To address these limitations, a novel platform-based, antibody-free detection method using LC–MS/MS was developed. This approach enables fast and flexible detection of proteins translated from mRNA using either CFT or CBT inputs, making it particularly suitable for rapidly evolving viruses.
To evaluate the effectiveness of the CFT-MS workflow, we employed mRNA encoding for the SARS-CoV-2 Delta (B.1.617.2) spike protein. The results demonstrated that MS not only identified the translated protein, but also provided detailed insights into the primary sequence and relative abundance of the translated protein in an mRNA dose-dependent manner. Additionally, the CFT-SW and CFT-MS workflows were able to assess mRNA quality and integrity following thermal degradation. Further exploration of CFT-MS in the analysis of a bivalent mixture of mRNAs encoding for SARS-CoV-2 Delta (B.1.617.2) spike protein and SARS-CoV-2 Omicron (B.1.1.529) spike protein illustrated the ability of mass spectrometry to multiplex protein detection and highlights the specificity of this technique, even with significant protein sequence homology. We further established a CBT-MS workflow, with the MS component continuing to be platform-based, that demonstrated a dose-dependent response of translated proteins originating from a hexavalent mRNA vaccine drug product.
The utility of CFT-MS is exemplified in the ability to detect +1 ribosomal frameshifted Delta (B.1.617.2) spike protein caused by N1-methylpseudouridylation of the mRNA. This highlights the capability of this approach to assess mRNA translation fidelity and provides a framework for identifying regions of mRNA sequences that may require additional optimization to prevent unintended frameshifting at “slippery sequences.” There are numerous ways that a CFT-MS approach could be used in the context of +1 ribosomal frameshifting for vaccine development, including tracking the abundance of off-target +1FS protein to ensure consistency. For this purpose, a targeted approach would be most appropriate. Depending on the sequence of the +1FS protein it could be possible to design a targeted MS assay that quantifies a +1FS peptide not susceptible to sample preparation artifacts such as methionine oxidation. Alternatively, a more robust targeted MS approach could be to track the abundance of in-frame N-terminus and in-frame C-terminus peptides. In this case, quantitative values in agreement between the in-frame protein termini would be confirmation of satisfactory translation fidelity and could be used to track product consistency. If the values do not match, however, then frameshifting could be occurring and would need to be assessed further.
We envision that the platform-based, untargeted CFT-MS approach here could be utilized in the design of mRNA sequences and in rapid process development feedback for characterizing new mRNA constructs without the need for additional, targeted assay development or antibody screening. As an example, confirming the presence of +1FS protein and the corresponding junction site could be a first indicator to target specific regions of the mRNA sequence for in silico redesign to reduce or remove +1FS events. Additionally, CFT-MS could be used to interrogate mRNA constructs that use other ribonucleotide modifications besides N1-methylpseudouridylation.
Given that one of the mRNA constructs used in the CBT-MS experiments (“mRNA6” in Fig. 6) was the same N1-methylpseudouridylated Delta (B.1.617.2) spike mRNA analyzed by CFT-MS in Figs. 7 and 8, the +1 ribosomal frameshifted off-target product was considered when searching the CBT-MS data as well, but was not identified. This is likely due to the significantly lower abundance of the frameshifted product and could not be captured by the untargeted CBT-MS workflow using these mRNA doses as previously reported. Despite this, the CFT-MS approach described above can still provide +1 ribosomal frameshifting confirmation and localization. Furthermore, the current platform-based mass spectrometry detection approach outlined throughout this manuscript has additional applicability to interrogating mRNA translation efficiency in either CFT or CBT based approaches. For example, N1-methylpseudouridylation has been shown to impact the synthesis rates of proteins and the density of ribosomes on mRNAs which suggests a direct impact on mRNA translation37.
We believe that the advantages of MS-based detection and characterization of translated proteins from multi-valent mRNA mixtures using either CFT or CBT will continue to enable the rapid advancement of mRNA vaccines in early stages of development and the platform-based approach will be of particular use in the context of discovery and early clinical development. When there is a business need and urgency, a CFT-MS or CBT-MS workflow may be deployed in regulated GMP/QC labs. Advancements in automation allow for complex sample preparation procedures, including the S-trap™, to be automated for increased robustness and reproducibility. This, in combination with targeted MS analysis and automated data treatment, would enable CFT-MS or CBT-MS to be more amenable to moving beyond the research laboratory. Similarly, the use of mass spectrometry-based workflows in the GMP/QC space has already been evaluated with therapeutic proteins using multi-attribute method (MAM)38,39,40,41.
In summary, one of the biggest advantages of mRNA vaccine technology is the rapid development that can enable critical response to quickly evolving diseases, yet this attribute requires that analytical methodologies can keep up. For the first time in peer-reviewed literature, a platform-based, antibody-free mass spectrometry detection method that can be coupled to either cell-free or cell-based mRNA translation was established to provide key information about single- or multi-valent mRNA vaccines such as translated protein identity, mRNA quality, and mRNA translation efficiency and fidelity. These CFT-MS and CBT-MS techniques enable pharmaceutical development to quickly pivot in response to rapidly evolving viruses.
Methods
Preparation of mRNA
Prior to handling mRNA, all bench surfaces, pipettes, and gloves were wiped with RnaseZap™ (Invitrogen, cat. no. AM9786) followed by 70% isopropyl alcohol. mRNA was handled using only RNase-free consumables. mRNA was diluted in 1 mM sodium citrate pH 6.4 (Thermo Fisher Scientific). The concentration of mRNA was measured using a NanoDrop spectrophotometer (Thermo Fisher Scientific) blanked with mRNA diluent.
Cell-free translation (CFT) of mRNA
Cell-free translation (CFT) was performed using wheat germ extract (Promega) supplemented with Complete Amino Acid Mixture (Promega), potassium acetate (Promega), RNasin Plus RNase inhibitor (Promega), and nuclease-free water (Thermo Fisher Scientific). mRNA was added to the CFT mixture and incubated in a heat block at 25 °C for 2 h. After 2 h, the CFT mixture was aliquoted for subsequent analysis by Simple Western™ or mass spectrometry. Aliquots for Simple Western™ were diluted 10-fold in 1× Simple Western™ Sample Buffer prior to freezing at −70 °C, whereas aliquots for LC–MS/MS were stored neat at −70 °C.
Cell-based translation (CBT) of mRNA
Cell-based translation (CBT) was performed using Huh7 cells prepared in a 96-well plate at 85-95% confluency. Cells were transfected with a hexavalent mRNA drug product (DP) comprised of mRNAs 1 through 6 that encode for various proprietary spike protein sequences encapsulated in proprietary lipid nanoparticles (LNPs) for delivery. Each mRNA DP was transfected at a dose of 6.67 or 4.44 ng per mRNA construct and incubated at 37 °C for 18 h. After incubation, the supernatant was removed and the cell monolayer was gently washed three times with approximately 200 µL PBS. Following this, the cells were lysed with the addition of 100 µL of cell lysis buffer from the EasyPep™ MS Sample Prep kit (Thermo Fisher Scientific, Waltham, MA). After mixing vigorously (10–15×), the collected cell lysate containing the translated proteins was frozen at −70 °C prior to preparation for mass spectrometry analysis as outlined below.
Automated Western blotting
Automated capillary Western blotting (i.e., Simple Western™) of CFT samples were performed using either a Wes or Jess instrument (ProteinSimple) following the manufacturer’s procedure. Briefly, CFT samples were denatured and reduced by treatment with Master Mix solution prepared from the EZ Std kit Pack (ProteinSimple) followed by heating at 95 °C for 5 min. Simple Western was then performed using Antibody Diluent 2 (ProteinSimple) as the blocking reagent, anti-CoV spike S2 mouse monoclonal antibody (Sino Biological) as the primary antibody, and anti-mouse IgG HRP conjugate (ProteinSimple) as the secondary antibody. Immunoprobe detection was performed using a 1:1 mixture of Luminol-S (ProteinSimple) and Peroxide (ProteinSimple) and measuring chemiluminescence. All data analysis was performed in Compass software (ProteinSimple).
Microchip capillary electrophoresis
The integrity of thermally-degraded Delta mRNA was measured using microchip capillary electrophoresis (MCE) as described previously31. Briefly, mRNA samples were prepared in singlet by diluting to a final concentration of 10 µg/mL in 10% (w/v) Brij® 58 (Acros Organics) and 1× Sample Buffer (Perkin Elmer) in formamide (Sigma-Aldrich). The treated samples were then heated at 70 °C for 10 min followed by incubation on ice for at least 5 min. The samples were analyzed in duplicate by the RNA Standard Sensitivity Assay on a LabChip GXII Touch (Perkin Elmer) using a DNA 5 K/RNA/CZE LabChip. The mRNA integrity of each sample was calculated by dividing the peak area of intact mRNA by the total peak area of the sample.
Mass spectrometry enzymatic digestion
Cell-free translation (CFT) or cell-based translation (CBT) mRNA samples were prepared as described above and the resulting translated protein(s) were thawed then sonicated in a water bath sonicator for 5 min. Following this, the contents of each sample were spiked with 0.0125 µg of alcohol dehydrogenase (ADH) from Saccharomyces cerevisiae (Sigma Aldrich) and digested using a suspension trap (S-trapTM micro, Protifi, Fairport, NY) following manufacturer recommendations. Briefly, samples were combined with sodium dodecyl sulfate (SDS, EMD Millipore) to a final concentration of 5% SDS, proteins were reduced with dithiothreitol (Thermo Pierce) and then alkylated with iodoacetamide (Thermo Pierce). Next, samples were acidified with the addition of phosphoric acid then combined with S-trap binding buffer (100 mM triethylammonium bicarbonate buffer, 90% methanol, pH 7.1) and loaded onto the S-trap™ columns with centrifugation. After proteins were trapped on the S-trap™ column, porcine trypsin modified to prevent autolytic proteolysis (Trypsin Gold, Promega) was added (1 µg per column) and columns incubated at 37 °C overnight. Following digestion, peptides were eluted from the S-trap™ and subsequently dried using a SpeedVac followed by reconstitution in 0.1% formic acid in water. For the peptide mapping experiment only (Fig. 2), three separate sample preparations using trypsin (Trypsin Gold, porcine, Promega), chymotrypsin (bovine pancreas, Promega) or alpha lytic protease (bacterial, isolated from myxobacterium Lysobacter enzymogenes, Sigma Aldrich) were conducted using the same S-trap™ procedure detailed above but instead using the respective enzyme for digestion. In these experiments, starting material was generated from CFT using 2 µg/mL mRNA. These individual sample preparations were then analyzed separately from one another by LC–MS/MS in the same untargeted manner as other samples as outlined below.
LC–MS/MS sample analysis
Peptides eluted from the S-trap™ were cleaned up directly by loading onto EvoTips and analyzed by LC–MS/MS using an Evosep One LC and Thermo ExplorisTM 480. It is worth noting that the sonication and peptide cleanup steps were found to be critical for avoiding column clogging due to the viscous nature of the wheat germ extract. Alternatively, using a C18 cleanup tip such as ZipTip® or Pierce™ combined with separation using a nanoAcquity LC (Waters) was found to perform comparably (data not shown). Samples analyzed here with the Evosep One utilized an EV1106 column and the 15SPD (15 “samples per day”, corresponding to an 88 min method) predefined gradient. Mobile phases consisted of 0.1% formic acid in water (A) and 0.1% formic acid in acetonitrile (B). All MS/MS data was collected in data-dependent acquisition for all experiments unless noted otherwise. The spray voltage was set to 1.9 kV, with the ion transfer tube set to 275 °C. The full scan range was 275–1200 m/z, MS1 resolution = 60,000, MS2 = 15,000. The normalized AGC target for MS1 = 250% with a maximum injection time of 20 ms. MS2 = 50% with a maximum injection time set to auto. Number of dependent scans was 30. Following untargeted identification of the +1 frameshifted Delta (B.1.617.2) spike protein peptide GYHLMSFPR, a targeted MS analysis was used to confirm the observation where the 2+ and 3+ charge states were added to a targeted inclusion list (no retention time window was specified). For this targeted experiment, MS2 AGC target of 1e5 and maximum injection time of 120 ms were used.
Mass spectrometry data analysis
Resulting RAW data from untargeted experiments were analyzed using Proteome Discoverer 2.5 (Thermo Scientific) with Sequest HT search against the respective proteome (Triticum aestivum, UniProt proteome ID: UP000019116) and expected mRNA translated protein sequences. Proteome Discoverer was searched with a precursor mass tolerance of 10 ppm, fragment mass tolerance of 0.02 Da, up to 1 missed cleavage, and 1% FDR target. Oxidation of methionine was considered as a variable modification and carbamidomethylation of cysteine resulting from incubation with iodoacetamide during sample preparation was considered as a fixed modification. Results of this analysis provide protein abundance defined as the summed MS1 areas of unique identified peptides.
To address possible fluctuations in recovery across the sample preparation and mass spectrometry analysis, the ADH spike was used to verify consistent sample recovery and quantification normalization. This was accomplished by summing the abundances of endogenous and spiked Saccharomyces cerevisiae alcohol dehydrogenase (P00330) and dividing the observed Delta spike protein abundances by this value for normalization. Because experiments in this manuscript using mixtures of multiple mRNA constructs are represented relative to other proteins within the same sample, the ADH correction factor would therefore be negated and is only included within the monovalent experiments represented in Fig. 3 and Fig. 4. Additionally, a plot showing the ADH recovery across these samples is included in Supplementary Fig. 16 as an illustration of relatively consistent recovery using this sample preparation and MS analysis approach.
Relative abundances, expressed in percentage throughout, were obtained by dividing the protein abundance (summed MS1 area) by the highest abundance data point within the respective plot/experiment. For example, in the mRNA degradation experiment, all data points were calculated by dividing by the t = 0 time point in which the highest abundance of translated Delta spike protein was observed. This was accomplished in the same manner for CFT-SW and MCE comparisons using chemiluminescence signal or intensity, respectively. For Delta and Omicron spike protein mRNA mixing experiments, the relative abundances of the translated Delta and Omicron spike proteins obtained from CFT-MS analysis are expressed as a percentage of the total spike protein content. For example, mixing Delta and Omicron mRNA constructs in a 1:1 ratio (50% Delta, 50% Omicron) and performing CFT-MS resulted in summed MS1 area of 6.3E + 08 and 7.57E + 08 corresponding to 45.4% and 54.6%, respectively.
For peptide mapping, Genedata Expressionist 17.0 was used with a standard peptide mapping workflow including the expected mRNA translated protein sequence, the respective enzyme (trypsin, chymotrypsin, or alpha lytic protease), a mass accuracy of 15 ppm, and allowing for up to one missed cleavage. Following analysis of the three individual enzyme samples, the peptide mapping results were combined to obtain a total sequence coverage map.
Responses