Tuning aromatic contributions by site-specific encoding of fluorinated phenylalanine residues in bacterial and mammalian cells

  1. Department of Molecular Physiology and Biophysics, University of Iowa College of Medicine, Iowa City, IA, United States
  2. The GCE4All Research Center, Department of Biochemistry & Biophysics, Oregon State University, OR, United States
  3. Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign

Peer review process

Consolidated peer review report (12 May 2022)

GENERAL ASSESSMENT

This is an interesting preprint wherein the authors report the generation of pyrrolysine-based aminoacyl-tRNA synthetases capable of incorporating fluorinated phenylalanine non-canonical amino acids (ncAA) into proteins expressed in either bacteria or mammalian cells. Synthetase evolution was directed using para-methyl tetrafluorophenylalanine as an ncAA. The authors use several screens and assays to characterize individual synthetases using superfolder GFP to measure protein expression using fluorescence and then identify the incorporated ncAA using mass spectrometry. For the two most extensively characterized synthetases, a wide array of fluorinated phenylalanine derivatives can be successfully incorporated. Contaminating incorporate of phenylalanine can be detected when attempting to incorporate several ncAAs in E. coli, but this bleed through incorporation of phenylalanine is not observed for expression in mammalian cells. In the case of monofluorinated phenylalanine for expression in mammalian cells, incorporation in place of phenylalanine is observed in other regions of GFP outside the site directed using the amber codon, making it challenging to incorporate the less heavily fluorinated ncAAs. Finally, the authors demonstrate that GFP and two membrane proteins (CFTR and Nav1.5) can be expressed at levels adequate for biochemical studies with one of the synthetases in the presence of a trifluoro phenylalanine ncAA, suggesting that the approach should be feasible for combining with many structural and biophysical approaches. Overall, this is an interesting study that generates several new synthetases that have utility for incorporation of fluorinated phenylalanine derivatives that can be used for expression in both prokaryotic and eukaryotic expression systems and that would likely be feasible for use for structural and other mechanistic biophysical studies.

RECOMMENDATIONS

Revisions essential for endorsement:

  1. The results and methods sections could be improved throughout by taking the space to provide the reader with a clearer conceptualization of each step in the process implemented by the authors to evolve the synthetase to incorporate fluorinated phenylalanine derivatives. What was the starting synthetase (presumably from ref 39)? Which positions were subject to random mutagenesis? Is the library a new one or one used previously? Can the authors provide the sequences for the different evolved synthetases characterized here? In the methods it is stated that 17 unique sequences were identified, but why not report what they are? Might it be worthwhile discussing how the present results compare with earlier attempts to evolve the synthetase for other ncAAs? It was also not entirely clear to us how do the positive and negative selection screens work. How many rounds of positive-negative selection have been made? Which mutations have been identified in the variants? Why do the results shown in Fig. 2B seem to disagree with some of the results shown in Fig. 2C? C10 is mentioned in the methods section but not shown in Fig. 2C. More extensive citation of prior work would help but the work will be much more accessible to the general audience if the authors explain everything conceptually in the results section and add more details to the methods section. The screening protocol used to identify synthetase variants is explained in greater detail in the methods, however the corresponding Figure (Fig 2) is not very well explained, please provide more explanation in the legend or the text about the results presented. Round 1 and Round 2 correspond to 2 different rounds of positive-negative screening? Panel C: it’s not clear what round 1 and round 2 stands for and why some of the mutants are in round 1 and why some others in round 2. Were the UP50 plots done for the” top performing synthetases” in both rounds? Please specify. Please state what the crosses mean in Fig. 2B.

  2. The authors demonstrate that they can incorporate different fluorinated phenylalanine residues into GFP and that under similar conditions two membrane proteins can be expressed at reasonable levels. What is less clear is how well the ncAAs will be incorporated into different membrane proteins. Might the procedures employed for GFP work less effectively for other proteins? The claim that the technique is widely applicable to membrane proteins would be strengthened if the authors could provide evidence for robust incorporation of ncAAs into membrane proteins, but even if this is too challenging for the time being, the authors should openly discuss problems that might be encountered or what makes them optimistic that the synthetases developed here will be effective at incorporating ncAAs into proteins beyond what they have shown for GFP.

  3. One opportunity for demonstrating robust trifluorophenylalanine incorporation into Nav1.5 might be to include functional data demonstrating that the gating properties of the channel are altered compared to control. Is the F1486 position sufficiently sensitive for a functional readout to provide at least qualitative information about the extent of ncAA incorporation? This would also demonstrate trafficking of the protein to the membrane as a functional channel. Although it is difficult to measure the intact molecular weight of hCFTR and hNaV 1.5 proteins due to their size by MS, have the authors tried LC-ESI-MS/MS analysis following enzymatic digestion? This could conceivably help to validate not only the incorporation of fluoro-Phe ncAAs, but also the site specificity of incorporation.

Additional suggestions for the authors to consider:

  1. The sfGFP protein samples purified for intact LC-ESI-MS analysis can be used for MS/MS analysis. Most mass spectrometers have MS/MS capability. The protein sequences and the structures of F-Phe ncAAs are known. All these make the MS/MS validation applicable. Most importantly, the results would provide strong evidence of site-specific encoding of F-Phe in proteins.

  2. Although a soluble protein was used to test the synthetases, the presentation gives the impression that the ultimate goal is for use on membrane proteins. Although membrane proteins are of interest to many and to the authors, why not present it as useful for both soluble and membrane proteins? Are there any known example of cation-pi interactions mediated by Phe in soluble proteins that would be worth investigating? A more general point is the authors could provide better framing or context by discussing how important cation-pi interactions are in proteins and what we know about them. In that regard, the intro would benefit from a few more sentences giving examples of important cation-pi interactions, and/or summarizing briefly findings of the in silico studies that are mentioned.

  3. Could the introduction of the ncAA affect GFP fluorescence? Along these same lines, could the author explain why they select residue N150 for the introduction of the ncAA?

  4. Providing the specific sequences of sfGFP-His expressed in E coli and HEKT cells, and adding the expected Dmass for N150F would help readers to better understand the intact ESI-MS data presented in the paper and it's also hard to read the labels in Fig.4.

  5. The author might consider citing Last et al. as it features and interesting role of Phe residues in anion selectivity in the Fluc channel (Last et al. (2017) eLife 6:e31259).

  6. Table 1, shows DG in the binding energy measurements but we don’t recall seeing in the manuscript how DG was calculated. Also, we may be missing something, but the theoretical quantum calculations referenced in the text (ref 24) will give a result in DE as energy. We are also curious about the meaning of the PHE% (Table 1 as well). How was it calculated? What kind of information is it providing?

REVIEWING TEAM

Reviewed by:

Ana I. Fernández-Mariño, Research Fellow (K.J. Swartz lab, NINDS, USA): ion channel structure and mechanism, electrophysiology and molecular biophysics

Yan Li, Director Proteomics Core, NINDS, USA: protein mass spectrometry

Chloé Martens, Assistant Professor, Université Libre de Bruxelles: membrane protein structural biology, membrane transport

Kenton J. Swartz, Senior Investigator, NINDS, USA: ion channel structure and mechanisms, chemical biology and biophysics, electrophysiology and fluorescence spectroscopy

Curated by:

Kenton J. Swartz, Senior Investigator, NINDS, USA

(This consolidated report is a result of peer review conducted by Biophysics Colab on version 1 of this preprint. Minor corrections and presentational issues have been omitted for brevity.)