Activation mechanism of the human Smoothened receptor

Prateek D. Bansal; Soumajit Dutta; Diwakar Shukla

doi:10.63204/4757.2

Evaluation statement (22 August 2023)

Bansal et al. present an atomistic view of the transition cascade of the class F GPCR Smoothened (Smo). The extensive long-range molecular dynamics simulations together with stochastic modelling provide theoretical insight into Smo activation and how this is modulated by different ligands. The work identifies testable hypotheses for functional studies of Smo and other class F GPCRs. Future simulations of regions beyond the seven-transmembrane bundle, particularly the cysteine-rich domain, will afford a more complete understanding of receptor activation.

Biophysics Colab considers this to be a convincing computational study and recommends it to scientists interested in the conformational dynamics of class F GPCRs.

(This evaluation by Biophysics Colab refers to version 2 of this preprint, which has been revised in response to peer review of version 1.)

Abstract

Smoothened (SMO) is a membrane protein of the Class F subfamily of G-Protein Coupled Receptors (GPCRs) and maintains homeostasis of cellular differentiation. SMO undergoes conformational change during activation, transmitting the signal across the membrane, making it amenable to bind to its intracellular signaling partner. Receptor activation has been studied at length for Class A receptors, but the mechanism of Class F receptor activation remain unknown. Agonists and antagonists bound to SMO at sites in the Transmembrane Domain (TMD) and the Cysteine Rich Domain have been characterized, giving a static view of the various conformations SMO adopts. While the structures of the inactive and active SMO outline the residue-level transitions, a kinetic view of the overall activation process remains unexplored for Class F receptors. We describe SMO’s activation process in atomistic detail by performing 300 μs of molecular dynamics simulations and combining it with Markov state model theory. A molecular switch, conserved across Class F and analogous to the activation-mediating D-R-Y motif in Class A receptors, is observed to break during activation. We also show that this transition occurs in a stage-wise movement of the transmembrane helices - TM6 first, followed by TM5. To see how modulators affect SMO activity, we simulated agonist and antagonist-bound SMO. We observed that agonist-bound SMO has an expanded hydrophobic tunnel in SMO’s core TMD, while antagonist-bound SMO shrinks this tunnel, further supporting the hypothesis that cholesterol travels through a tunnel inside Smoothened to activate it. In summary, this study elucidates the distinct activation mechanism of Class F GPCRs and shows that SMO’s activation process rearranges the core transmembrane domain to open a hydrophobic conduit for cholesterol transport.

Introduction

G protein-coupled receptors (GPCRs) act as molecular telephones and transmit signals across the cellular membrane by associating with G proteins^1,2 or arrestins.³ The process of signal transduction generally involves GPCRs binding to agonists which aid the shift in conformational equilibrium, facilitating the receptors to transition to an active state. Activation allows the receptor to associate with intracellular binding partners, allowing the process of signal transduction.⁴ GPCR activation is an area of active research - with studies establishing conserved structural motifs like the E/DRY, NPxxY^4–8 in Class A and PxxG, HETx⁹ in Class B receptors acting as molecular switches that stabilize the inactive state. Unlike Class A and B GPCRs, activation of Class F receptors : Smoothened (SMO), Frizzleds1-10 (FZD_1–10) is still poorly understood. A primary reason for this elusiveness is that these receptors share none of the structural motifs seen in Class A/B, and have less than 10% sequence similarity to Class A receptors¹⁰ as well as Class B receptors. Since Class A and B GPCRs are involved in mediating virtually every physiological response : they are crucial drug targets, as 34% of all FDA approved drugs target one of these proteins.¹¹

Smoothened (SMO) is a transmembrane protein from the Class F of GPCRs. Class F consists of proteins that are involved in maintaining tissue homeostasis and regenerative responses in adults, and are crucial in embryonic development, as they regulate cellular differentiation by binding to sterol and Wnt ligands.^12–15 SMO is expressed in tissues throughout the body, particularly in cerebellar and pituitary tissue,¹⁶ and is a member of the Hedgehog (HH) signaling pathway. When the endogenous inhibitor of SMO, a membrane protein Patched (PTCH), is inhibited by Sonic Hedgehog (Shh) binding, SMO translocates to the ciliary membrane, and undergoes conformational transitions (activation) to bind to its intracellular signaling partner G_i.^17,18 How PTCH inhibits SMO is still unclear. However, multiple studies have described PTCH’s inhibition on SMO as acting through reducing SMO’s accessibility to membrane cholesterol.¹⁹ A recent study described the effect of PTCH on the cholesterol accessibility of the upper leaflet, suggesting that PTCH inhibits SMO by either transporting cholesterol to the inner leaflet, or to an extracellular acceptor.²⁰ HH signaling is critical to embryonic development, and any changes in signaling can lead to severe birth defects.²¹ Cyclopamine, a naturally occuring alkaloid in corn lily, has been identified as a teratogen (agents responsible for birth defects in infants),²² and was responsible for birth defects in lambs in Idaho in the 1950s.²³ It was identified later that cyclopamine’s mechanism of action involved inhibiting HH signaling by binding to SMO.^24–26 On the other hand, overstimulation of HH signaling via SMO has been linked to the pathogenesis of pediatric medulloblastoma and basal cell carcinoma.^27,28 Vismodegib²⁹ and Sonidegib³⁰ are two FDA approved drugs that target SMO, but are prone to chemoresistance. ³¹ Therefore, understanding activation mechanisms of Class F GPCRs is hence critical to design novel therapeutics.

Structures of SMO bound to agonists and antagonists outline the effects of allosteric and orthosteric modulators binding on SMO activity. These structures show the existence of two primary binding sites in SMO - the first in the Cysteine Rich Domain (CRD), which binds agonists cholesterol³² and cyclopamine. ³⁴ The second site is present in the TMD, which binds both antagonists LY2940680,¹⁰ SANT1 and AntaXV,³⁵ cyclopamine,³⁶ TC114,³⁷ Vismodegib³² and agonists SAG1.5,³⁵ SAG,³⁸ SAG21k,³⁹ 24, 25-epoxycholesterol,³³ and cholesterol. ³⁸ Mutagenesis studies have outlined the presence of an intracellular W^7.55f-R^6.32f π-cation lock⁴⁰ in Class F that is broken on activation (Fig. 1A), with mutations that disrupt this lock lead to increased agonist potency and pathway selection (superscripts refer to the modified Ballesteros-Weinstein numbering system used to denote Class F GPCR TM residues⁴¹ introduced by Wang et al.³⁵). On the extracellular end, for SAG-bound SMO, the D-R-E network is broken in active SMO³⁵ (Fig. 1A). The intracellular end of active SMO shows rearrangements in TM6 (outward), TM3 (outward) and TM5 (inward) (Fig. 1B). These studies paint a static picture of how SMO activity can be attributed to structural rearrangements, however, a dynamic understanding of the process of SMO activation still remains. Hence to provide a dynamic overview of activation, we simulated ~ 250 μs Apo-SMO (no ligand bound) to understand SMO’s activation process in atomistic detail. Moreover, it has been shown that PTCH modulates SMO activity by controlling its access to membrane cholesterol^19,42 which then travels through a hydrophobic tunnel inside SMO to access the primary ligand binding site in CRD, showing an expanded tunnel in active SMO (Fig. 1A). Hence, we simulated agonist bound (SAG-SMO) (~ 36 μs) and antagonist bound (SANT1-SMO) (~ 42 μs to explore the effects of bound modulators on SMO activity, and the mechanisms of action for these molecules. Using a highly parallel Adaptive sampling based approach and constructing a Markov state model (MSM),^43,44 we probe submillisecond dynamics of SMO, and show that SMO activation involves a intracellular structural motif that is conserved across Class F receptors. MSMs have been used to model membrane protein behavior at varied timescales, to probe activity of membrane transporters,^45–47 as well as to study conformational dynamics of signaling proteins.^8,48–55 In particular, Markov state models have been employed to investigate conformational dynamics of GPCRs, such as beta2-adrenergic receptor,^8,52,56 Dopamine D₃ receptor,⁴⁸ μ-opioid receptor, ^54,55 Chemokine receptor CCR2,⁴⁹ and Cannabinoid Receptors 1, 2.^50,51 Using MSMs, we outline the involvement of multiple CRD-TMD salt-bridges that are rearranged during SMO activation, establishing a role for the CRD in SMO activation. We show that the hydrophobic tunnel inside SMO expands in the presence of an agonist, and is occluded by the antagonist. These observations are amenable to experimental observations that bolster the cholesterol transport-like activity of SMO. We then use a mutual-information based approach to outline the allosteric mechanisms through which the agonist SAG operates, i.e. by changing the allosteric pathways in SMO to more active-like SMO. These observations provide a detailed and atomistic in-depth view of SMO activation, and may aid in design of antagonists for cancer therapy.

Major structural changes during SMO Activation. (A) Comparison of the broken D-R-E network and the W-R π-cation lock, and the expanded tunnel, in inactive (magenta, 5L7D³²) vs active (green, 6XBL³³) SMO (B) Comparison of inactive and active SMO, indicating the outward movement of the TM6 and TM3 and inward movement of TM5 in active SMO.

Results and Discussion

SMO activation involves a conserved molecular switch

To probe the transitions SMO undergoes during activation, SMO was simulated in a ligand-free form (Apo-SMO) from two starting points - inactive and active structures. Simulations were performed using a parallel approach - by clustering the existing data based on selected features (feature selection explained in Methods) and seeding the next round of simulations by randomly selecting starting points from the least populated clusters - a technique known as Adaptive sampling⁵⁷ (Fig. S8, Table S1, S2). The high dimensionality of the data was reduced by transforming it using time-Independent Component Analysis (tICA). ^58,59 tICA uses a linear combination of the supplied features to identify the slowest collective degress of freedom in the data by computing the time-lagged autocorrelation. The first two tICA components account for the two slowest processes associated with activation (Fig. S9, S10). The active and inactive structures were separated majorly in the first tICA component (tIC 1), indicating that activation was the slowest process observed in simulations. Hence, features that were highly correlated with tIC 1 (Fig. S11) were considered pivotal to activation. The convergence of the data, clusters and hence the free energies derived from it, were confirmed by the presence of a continuous density of data along tIC 1 (Fig. S9A). This shows that the simulations have indeed sampled the conformational landscape necessary to probe the activation pathway of SMO. The tICA transformed data was clustered - dividing the data into kinetically distinct microstates. A MSM was constructed on the clustered data to compute the transition rates between microstates, and to reweigh the data, eliminating the bias introduced by Adaptive sampling.

At the intracellular end, we observe that W339^3.50f shows a very dramatic reorientation on receptor activation, moving outwards from the center of the TM bundle, to accommodate the bound G_i. W339^3.50f is conserved across all Class F receptors (Fig. S12). Upon further analysis, we ob that this rearrangement extends to include M449^6.30f and G453^6.34f (outward movement), G422^5.65f (translation) (Fig. 2A), as well as W535^7.55f (inward rotation) - residues that are all conserved across the entire Class F family (Fig. S12).

Molecular metrics integral to SMO Activation. (A) Rearrangement of the WGM motif, a conserved molecular switch across class F GPCRs, undergoes rearrangement on SMO activation. (B) Relative free energies from MSM-weighted simulation data plotted on the TM3-TM6 distance vs TM3-TM5 distance measured at residues W339^3.50f, M449^6.30f and G422^5.65f. (C) Breaking of the D-R-E network on the extracellular end of the TMD. (D) Similar to (B), but for TM3-TM6 distance vs the D-E distance. (E) The π-cation lock breaks by the sidechain rotation of W535^7.55f. (F) Same as (B) but for TM3-TM6 distance vs χ₂ dihedral measured at W535^7.55f.

M449^6.30f’s outward movement is a proxy for the outward movement of TM6 a process associated with canonical GPCR activation.^33,38 However, instead of kinking outwards as observed in Class B receptors, TM6 in SMO undergoes translation, to accommodate G_i. This can be attributed to the absence of P^6.43f, a residue conserved across FZDs (Fig. S12). P^6.43f is replaced by F462^6.43f in SMO - thereby increasing its rigidity and resistance to developing kinks.⁶⁰ Recently published structure of active FZD7⁶¹ shows this kink at P^6.43f. Similar translation in TM6 is observed in Rhodopsin, a Class A receptor. ⁶² This particular feature is hence unique to the activation mechanism of SMO. TM5 on the other hand, shows slight inward translation. To capture these outlined movements, we projected the entire Apo-SMO data onto W339^3.50f – M449^6.30f (TM3-TM6 distance) v/s W339^3.50f – G422^5.65f (TM3-TM5 distance) and computed the free energy associated with each state(Fig. 2B, Fig. S13). The free energy plot shows that this TM3-5-6 rearrangement follows a stage-wise process - with TM6 moving outwards first by ~ 4 Å, (State 1 in Fig. 2B) followed by the rest of the TM3 outward movement after a slight outward rearrangement in TM5 (State 2 in Fig. 2B). The overall free energy barrier for this rearrangement is ~ 2.5 kcal/mol. The outward movement of TM6 is analogous to class A receptor activation (Fig. S14 A,B). A conserved molecular switch mediating SMO’s activation on the intracellular end is similar to the breakage of molecular switch E/DRY in Class A GPCRs-with W339^3.50f being the residue analogous to R^3.50 (Fig. S14 C,D). Hence, we posit that this conserved molecular motif (W-G-M) is integral to Class F receptor activation, and provides a basis for activation across the entire Class F receptors, while also showing the uniqueness of activation of Class F receptors.

The crystal structure of SMO bound to the synthetic agonist SAG1.5 gives clues about the activation-specific residue-level rearrangements that occur on the extracellular end of SMO. D473^6.54f has been established as a residue critical to SMO activity, as it forms a part of SMO’s core TMD ligand binding cavity, and is shown to interact with agonists SAG1.5, SAG, oxysterols and antagonists GDC-0449, AntaXV.^{33,35,38,63,64} Specifically, a network of salt bridges formed by the residues D473^6.54f, E518^7.38f and R400^5.43f is broken in SAG1.5-bound SMO (Fig. 2C).³⁵ Hence, we also projected the Apo-SMO data on the D473^6.54f – E518^7.38f distance v/s intracellular TM3-6 movement (Fig. 2D, Fig. S13). We observe that the TM6-TM3 outward movement (2 in Fig. 2D) is preceded by the breakage of the hydrogen bond between D473^6.54f-E518^7.38f (1 in Fig. 2D).

To outline the role of the π-cation lock W535^7.55f-R451^6.32f in activation, we projected this π-cation lock contact v/s the TM3-6 outward movement (Fig. 2F, Fig. S13) for Apo-SMO. Projecting the Apo-SMO data along the sidechain dihedral angle χ₂ of W535^7.55f, clearly showed the distinct inactive and active states. This shows that the mechanism of π-cation lock breaking involves the sidechain rotation of W535^7.55f. Additionally, we observe that the π-cation lock breaks around the same TM3-TM6 distance as the outward movement of TM3. Thus, the WGM motif and the π-cation lock at the intracellular end, and the D- R-E network at extracellular end are critical residue networks involved in SMO activation. These residues form a network of allosterically coupled residues, proving crucial for signal transduction across the membrane.

Residues at the CRD-TMD interface involve salt-bridge rearrangements in SMO activation

SMO, in addition to a heptahelical TM domain, possesses an extracellular domain called the Cysteine Rich Domain (CRD). The CRD consists of residues that are highly polar in comparison to the TMD, which is mostly hydrophobic (Fig. S15). This domain is critical for SMO activation, as SMOΔCRD mutants show a higher constitutive activity - suggesting that the CRD represses SMO’s basal activity.⁶⁵ The CRD also includes the primary sterol binding site in SMO³² - and it has been posited that PTCH inhibits SMO by reducing cholesterol access to this site. ¹⁸ Structures of active xenopus laevis SMO (xSMO) show a dramatic reorientation of the CRD on xSMO activation-suggesting that the CRD has a very dynamic range of motion. ⁶⁴ However, this reorientation is not observed in human SMO (hSMO). ^32,37 Thus to establish a role of the CRD in activation of hSMO, we sought residue pairs in Apo-SMO CRD-TMD interface that showed the highest variance along tIC1, the slowest process that captured Apo-SMO activation.

Fig. 3(A-F) show the residue pairs that have the highest change in contact frequency during activation - starting with the R485^6.66f – D209^CRD, salt-bridge, which breaks during activation (Fig. 3A) due to the outward movement of TM6. This indicates that the R485^6.66f–D209^CRD salt-bridge is involved in stabilizing the inward conformation of TM6 in the inactive state. This loss of the R485^6.66f–D209^CRD salt-bridge is however compensated by the formation of the nearby R161^CRD–D486^6.67f salt bridge, which is predominantly seen in the active conformation (Fig. 3E). Furthermore, the inactive state shows a salt-bridge E208^CRD–K395^ECL2 which breaks on activation, compensated by the formation of the nearby D201^CRD–R296^ECL1 (Fig. 3B, E). Additionally, activation strongly favors the formation of R159^CRD–D209^CRD (Fig. 3C) and D382^ECL2-K204^CRD (Fig. 3G) salt bridges. The inactive (green) and active (magenta) structures depicted in the figure were taken as representative structures from the inactive-like and active-like free energy wells in the tIC landscape.

Overall activation of SMO involves residues at CRD-TMD junction. (A)-(F) Snapshots and probability density plots outlining the salt-bridge rearrangements at the CRD-TMD interface during SMO activation.

The path along tIC1 from the inactive state to active state involves 3 intermediate states(I_1–3) (Fig. 4A), characterized by free energy barriers of atleast 1 kcal mol⁻¹ among them. Using Transition Path Theory on the constructed MSM, we calculated the fluxes of transitions between these states, to establish timescales for activation of SMO (Fig. 4B). The simulations show that the entire process of activation from inactive to active has a MFPT (mean first passage time) of ~ 72μs (Fig. 4B), while the reverse process is ~ 3X faster, with MFPT ~24 μs.

(A)Relative free energies from MSM-weighted simulation data of Apo-SMO plotted along tIC1 and tIC2, the 2 slowest components, with the intermediate states I_1–3 as shown. The intermediate states I_1–3 were defined based on metastable basins and free energy barriers associated with transitioning from an inactive to an active state. A cutoff of 1.8 kcal/mol was used to separate one basin from another. Residues shows as sticks include the π-cation lock, the WGM motif and the salt bridges involved in activation. (B) Overall transition pathway of SMO activation process. The inactive (PDB ID: 5L7D)³² and active (PDB ID: 6XBL)³⁸ structures are separated by the presence of 3 metastable conformations in between, I_1–3. Residues shown by sticks correspond to the salt bridges, the WGM motif, the DRE network and the π-cation lock, all residues critical for mediating SMO activation.

We observe that residue pair rearrangements that are associated with activation at the CRD-TMD junctions are salt-bridges, mostly between residues with one residue in CRD and the other one in TMD (Fig. S16). Almost none of these polar residues are conserved (Fig. S12, S17), indicating that these residues contribute to a unique activation process for SMO at the CRD-TMD interface. Additionally, we observe that the entire CRD motion can be accounted for by a slight outward rotational motion of the CRD (Fig. S18), thereby causing TM6 to move outwards and triggering activation on the intracellular end. Since the CRD has a cholesterol binding site, it is possible that cholesterol binding to the CRD triggers this outward rotation, inducing the signal that causes TM6 to move out. This potentially outlines a mechanism for the activation of SMO by cholesterol, its endogenous agonist.

SMO’s Activation is linked to opening of a hydrophobic tunnel

Endogenously, on PTCH’s inhibition by Shh, SMO is activated. SMO’s activation is mediated endogenously by cholesterol, suggesting that PTCH’s inhibition facilitates SMO’s activation by cholesterol. This suggests that cholesterol from the membrane travels to the extracellular sterol binding site. How this transfer of cholesterol occurs to the SMO CRD is still unknown. However, SMO does indeed present itself with a unique topology - the presence of a tunnel inside the protein. This tunnel has been hypothesized^33,38,39,64 to facilitate the transport of cholesterol from the membrane to the binding site in the CRD,³² making this tunnel a prime target for inhibitors. As noted by Qi et al., SMO antagonists (SANT1, AntaXV, LY2940680) bind deeper into a tunnel inside SMO, whereas SMO agonists (SAG) bind outside this tunnel. Adding a 4-aminomethyl moiety to the tail-end of SAG converts it to an antagonist, suggesting that this added moiety can hinder the tunnel. ⁶⁶ Mutations that introduced a bulky residue into the tunnel (V329F, V333F, V408F,I412F,T470Q), blocked SMO activity, ^32,39 suggesting that the tunnel conformation was linked to how small molecule and mutations modulated SMO activity.³⁸ This suggests that SMO antagonists like SANT1 act as steric antagonists by blocking the sterol tunnel inside SMO, while agonists like SAG allosterically activate SMO by breaking the D-R-E network, setting off receptor activation on the intracellular end. The mechanism and dynamics of the modulators acting on SMO’s activation is still unclear. Hence we simulated SMO bound to antagonist SANT1 (SANT1-SMO) and agonist SAG (SAG-SMO) to probe the effect of bound agonist and antagonist on SMO’s activation.

SMO’s tunnel is characterized by markedly hydrophobic residues (Fig. S19), pointing further towards the idea that a hydrophobic molecule may be transported through it. This tunnel runs through the core of the receptor, spans the entire TM domain, starting at the conserved residues W339^3.50f, spans ~ seven helical turns, and ends at the extracellular network of residues E518^7.38f, D473^6.54f and R400^5.43f. These three residues form the base of the space between the CRD and TMD. Moving outwards along the path defined along the tunnel directly leads to the binding site, with TM6, ECL2 and ECL1 forming the bridge between these sites (Fig. S20).

In SANT1-SMO simulations, the tunnel remains almost completely blocked (Fig. 5A, B), indicating that the mechanism by which SANT1 modulates SMO activity is by binding deeply into the SMO tunnel core, precluding the potential transport of cholesterol. SANT1’s piperazine moiety directly interacts with H470^6.51f and sidechain of M525^7.45f - forming hydrogen bond interactions. The pyrrolic head of the ligand remains buried deep inside, with minimal movement normal to the plane of the membrane, along the tunnel (Fig. S21). However, in Apo-SMO simulations, the tunnel remains relatively open(Fig. 5C, D). Interestingly, we observe a conformational dependence of the lipid organization in the membrane - Inactive SMO surrounds itself with a cholesterol in the upper leaflet, as opposed to other cases (Fig. S22). This suggests that cholesterol shows a propensity to accumulate outside inactive SMO to possibly transport itself in the hydrophobic tunnel, leading to SMO activation. Additionally, In SAG-SMO simulations, we observe that the tunnel radius has a sudden kink outward (z ~ −20 Å), suggesting that there is a relative expansion of the tunnel induced by SAG (Fig. 5 E, F). In the simulations, the membrane extends from z = 0 to z = −40. Since this expansion occurs between z = 0 and z ~ −20 Å, it suggests the opening is in the upper leaflet (Fig. S24A). On plotting the free energy difference between Apo-SMO and SAG-SMO, a marked difference in the free energy associated with the opening of the tunnel is observed (Fig. S23A-C). Recent studies suggest that active PTCH precludes SMO’s accessibility to cholesterol in the upper leaflet. ²⁰ To further probe into the exact position of this tunnel opening, we observed that a cluster of openings occured at x 16 Å and y ~ 22 Å - corresponding to the space between TM2 and TM3 (Fig. S24B). This is in agreement with a recent study that used coarse-grained simulations to observe a cholesterol binding site at the TM2-TM3 interface in the upper leaflet. ⁶⁸ Thus, SAG acts as an agonist by allosterically expanding the tunnel at the cholesterol interaction site - giving further evidence for the cholesterol-transport like activity of SMO. Thus we conclude that SANT1 functions as a steric antagonist by blocking the tunnel, whereas SAG functions by allosterically expanding the tunnel, thereby establishing design rules for SMO agonists and antagonists.

Tunnel radius plots for SMO. (A) Free energy plot of the tunnel diameter along the z-coordinate for SANT1-bound SMO. (C) same as (A), but for Apo-SMO. (E) same as (A), but for SAG-bound SMO. SAG-bound SMO clearly shows the expansion of the tunnel as compared to Apo-SMO and SANT1-SMO. (B), (D), (F) - representative figures for SANT-1 SMO, Apo-SMO and SAG-SMO. Tunnel radii were calculated using the HOLE program⁶⁷

Allosteric pathways between E518^7.38f and W339^3.50f. (A) Pathway in Apo-Inactive-SMO. Since the tunnel radius is decreased, TM6 outward movement is restricted, and therefore the entire allosteric communications occurs via TM6. (B) In SANT1-SMO, due to slight outward movement of TM6, the pathways switches from TM7 to TM6 to TM3. (C,D) SAG-SMO and Apo-Active SMO show the same allosteric pathway, which spans TM7-TM6-TM5-TM3.

SAG alters the allosteric pathways in SMO during the process of SMO activation

To further investigate the mechanism by which SAG allosterically modulates SMO’s activity resulting in the expansion of the tunnel, we computed the allosteric pathways that connected the intra- and extracellular ends of SMO, responsible for transmembrane signal transduction. Allosteric pathways contain a series of conformationally-coupled residues that link dynamically active and spatially distant residues. In Class A GPCRs, allosteric pathways are responsible for communicating conformational changes from the extracellular end to the intracellular end, completing the process of signal transduction.^69–71 Since SMO’s activation process involves allosteric communication between the extracellular ligand binding site (D-R-E network) and the G-protein coupling site (WGM motif), we sought to analyze the allosteric pathways that connect the two sites. We computed the dynamic pairwise mutual information of Inactive-Apo-SMO, Active-Apo-SMO, SANT1-SMO and SAG-SMO on a residue-level basis, and constructed a graphical network of residues that are allosterically linked. The dynamic mutual information takes into account the residue-level movements Based on this network, we present the allosteric pathway between the intra- and extracellular ends of TMD.

In our simulations, we observe that the allosteric pathway between the intra and extracellular ends in Apo-Inactive SMO completely passes through TM6, encompassing residues T466^6.47f, F460^6.41f and G456^6.37f (Fig. 5A). This establishes an integral role for TM6 in mediating the signals across the transmembrane domain in inactive-SMO. SANT1-SMO on the other hand, unexpectedly shows a distinct pathway, first going down intra-helically to A524^7.24f, crossing over to TM6 via A459^6.40f and finally to L335^3.46f. This however can be explained by the observation that the SANT1 causes a slight outward movement of TM6, to accommodate itself in the deep core TMD ligand binding cavity (Fig. S25). This outward movement of TM6 moves T466^4.67f away from E518^7.55f. This causes the network to rearrange itself, moving over to TM6 further downstream. On the other hand, SAG and Apo-Active SMO show the exact same networks, further indicating that SAG alters the allosteric networks in SMO to resemble Apo-SMO. These networks involve C469^6.50f, the most conserved residue in TM6, down L464^6.45f, and a flip over to TM5 as we move intracellularly, due to the outward intracellular movement of TM6, via L412^5.55f and F418^5.61f. Thus, we can establish a basis for the mechanisms through which SAG and SANT1 effectively modulate SMO activity, and establish an integral role for TMs 7,6,5,3 in signal transduction.

Conclusions

Our study reveals the activation mechanism for SMO, a Class F GPCR, in atomistic detail via molecular dynamics simulations. We characterized the residue level transitions that SMO undergoes during activation. We simulated SMO in Apo, SAG and SANT1 bound states to probe the activation mechanism of SMO, and computed the free energy landscape of the process. Our MSM weighted free energy landscapes show a barrier of max free energy barrier of ~ 3 kcal mol⁻¹ while transitioning from an inactive to active state, involving three intermediate states.

Class A and Class B receptors have been the subject of major interest involving GPCR activation. ^4,9 Receptor activation studies on Class F majorly focused on the start and end states of the receptor, without giving an overview of the dynamics of the process. Using computational methods, we show that SMO activation involves the rearrangement of a intracellular structural motif : the W-G-M motif, conserved across the entire Class F family. This lays the basis for a common activation mechanism for all Class F receptors on the intracellular end. Additionally, this motif involves W^3.50f, which is the residue equivalent to R^3.50 in class A receptors, establishing the integral role of TM3 in GPCR activation. On the extracellular end of TMD, we see that the D-R-E network of residues is pivotal to activation, as it engages the agonist and sets off the activation process at the intracellular end. We also show evidence of allosteric coupling between these two sites, showing that the rearrangement of the D-R-E network is necessary to ensure intracellular rearrangement of the WGM motif.

We also establish a role for the CRD in SMO activation, forming and breaking salt-bridges while transitioning to an active state, contacts that have not been discussed previously. This gives novelty to the methodology established, inferring that MD simulations can be used to discover contacts crucial to activation, previously unknown. We show that the agonist SAG expands an intra-TMD tunnel inside SMO, further supporting the hypothesis that SMO transports a cholesterol molecule through its hydrophobic tunnel to activate SMO. ^{19,20,33,38,64} We also show that SAG acts as an allosteric modulator, by modifying SMO’s allosteric pathways to be similar to Apo-SMO. On the other hand, SANT1 acts as a steric antagonist, by occluding the hydrophobic tunnel inside SMO, hence lowering the radius. Therefore, we establish the mechanisms of action of antagonists and agonists in modulating SMO activity. Additionally, experimental validation by mutagenesis of the role of various residues needs to be performed for further corroboration of this computational study. Mutation of residues of the WGM motif - (W339, G422, M449) the various salt bridges, the interface of upper leaflet and TM2-3, and the allosterically coupled residues, possibly through techniques like Alanine scans and Deep Mutagenesis can be performed as testable hypotheses, thereby delineating the role of these residues in modulating SMO activity. Additionally, how cholesterol, the endogenous agonist of SMO, modulates SMO activity in the presence of agonists, still needs to be explored. However, we propose that the overall mechanistic findings from this study can be used to design novel SMO antagonists, for chemotherapy.

Methods

Molecular Dynamics (MD) Simulations

Simulation setup

SMO structures in the bound inactive conformation (inactive-SMO) (PDB ID: 5L7D³²) and active conformation (active-SMO) (PDB ID: 6XBL³⁸) were used as starting points for the SMO-Apo simulations. For apo systems, the bound ligand and the stabilizing antibodies were removed. The missing residues in the proteins were modeled using MODELLER⁷² (Table S3). The inactivating mutation V329F in the inac-SMO was corrected back to wild-type. For SAG-SMO, the bound SAG was retained in the SMO-SAG complex. ³⁸ To check for the protonations in acidic residues under physiological conditions, the pKa was calculated using the H++ server.⁷³ Accordingly, E518 was protonated in all SMO systems. For SANT1-SMO, owing to the lack of the CRD in the SANT1-SMO complex (PDB ID: 4N4W³⁵), we sought to use the inactive orientation of 5L7D (inactive SMO, CRD present) instead. The SANT1-bound crystal structure (4N4W) was aligned to inac-SMO 5L7D (to maintain the same binding pose for SANT1), and the 5L7D-SANT1 starting point was used for simulations. The terminal residues were capped using neutral terminal caps Acetyl (ACE) for N-terminus and MethylAmide (NME) for the C-terminus. The proteins were embedded in a membrane bilayer using CHARMM-GUI. ^74,75 The atomic interactions were characterized using the CHARMM36 force field. ^76,77 The choice of CHARMM36 force field was based on studies that use CHARMM36 to simulate various G-Protein Coupled Receptors, specifically at the time of system setup.^78–81 Use of CHARMM36m force field made noted no significant difference to the overall observations (Fig. S1). The force field parameters for synthetic ligands SAG and SANT1 were generated using ParamChem, ⁸² an automated version of CGenFF.^83,84 Owing to presence of penalties greater than 10 assigned by CGenFF for various angles and dihedrals for both SAG and SANT1, optimization using the MP2/6-31G* QM calculations was performed. The python-based library Psi4 was used for this purpose. ⁸⁵ Input files were generated using the web-based input generator CHARMM-GUI. ⁸⁶ The composition of the membrane bilayer was based on lipid composition of the mice brain cerebellum⁸⁷ - (75% POPC, 21% Cholesterol, 4% Sphingomyelin) (Table S4), to mimic physiological cerebellar membrane composition. The system was solvated using TIP3P water⁸⁸ and 150 mM NaCl, to mimic physiological conditions. Overall the system sizes for inac-SMO, act-SMO, SAG-SMO and SANT1-SMO were 106,415, 105,971, 105,100 and 105,582 atoms with box sizes 86×86×153 Å³, 86×86×152 Å³ 86×86×152 Å³ and 85×85×153 Å³ respectively. The mass of non-protein hydrogens was repartitioned to 3.024 Da,⁸⁹ to enable simulations with a long timestep (4 fs). Parmed, a part of the AmberTools19 package, was used for this purpose.⁹⁰

Pre-Production MD

The systems were minimized for 1000 steps, using the steepest descent method, followed by minimization using the SHAKE algorithm⁹¹ for 14000 steps. Systems were then heated from 0-310 K using NVT conditions for 5 ns, constraining the backbone using a force constant of 10 kcal mol⁻¹ Å⁻². Systems were then equilibrated using the NPT conditions for 5ns, at 310 K and 1 bar, using similar backbone restraints. This was followed by an equilibration of 40 ns, without constraints. Apo-SMO and SANT-SMO simulations were performed using the AMBER18^90,92–95 biomolecular simulation package. SAG-SMO simulations were performed using NAMD 2.14.^96,97 NAMD was used in this case to aid the simulation of lone pairs associated with the Chlorine atom in SAG.

Production MD

Post equilibration, the GPU-accelerated pmemd.cuda package from AMBER18^90,95 was used for production MD. Integrator timestep was set to 4fs. Periodic boundary conditions were used, and the temperature was maintained using the Langevin Thermostat. ⁹⁸ Pressure of each of the systems was set 1 bar, and was maintained using the Monte Carlo Barostat. Particle Mesh Ewald⁹⁹ (PME) method was used for computing long-range electrostatic interactions. SHAKE⁹¹ algorithm was used to restrain the Hydrogen bonds. Cutoff for nonbonded interactions was set to 10 Å. Frames were saved every 25000 steps, giving a frame rate of 100 ps between each frame. Simulations were performed using the Blue Waters super-computer(NVIDIA Tesla K20X GPUs) or our in-house computing cluster(NVIDIA GeForce GTX 980 GPUs). Apo-SMO, SAG-SMO and SANT1-SMO were simulated for a total of ~ 250μs, ~36μs and ~ 42μs respectively.

Adaptive sampling, feature selection and clustering

Simulating biological systems using traditional long-MD simulations to observe submillisecond dynamics is unfeasible, hence we resorted to using a parallel approach to accelerate conformational sampling, called Adaptive sampling. The simulation data after every round of simulations was clustered (feature selection explained below), and the least populated clusters were used to seed simulations for the next round. Overall, for Apo-SMO, 7 rounds of simulations were performed, collecting 30-50μs per round. For SAG/SANT1-SMO, the data was collected in a similar fashion, for 3 rounds each, around 10-20 μs per round. The bias introduced in the system due to selectively starting simulations from least populated clusters was eliminated by constructing a Markov state model, that estimated the reverse transition probabilities from each microstate.

The progress of the transition from inactive to active was monitored by calculating features, each of which was selected based on maximum magnitude of Δ RRCS (RRCS - Residue Residue Contact Score). RRCS is a order-parameter identifying technique that uses a flat-linear-flat scoring scheme to assign a score to contact between every residue-pair in the system.⁶ Contacts that had |ΔRRCS| < 3.5 (58 such distances total) (Table S5) were used. K-means clustering was used to cluster the simulation data, based on these calculated features. Clustering was performed using the pyEMMA python library. ¹⁰⁰

Markov state model construction

The high dimensionality of the data was first reduced using tICA. The tICA lagtime was optimized by observing the plateauing of the implied timescales (−2/lnλ, λ being the largest eigenvalue of the first tICA eigenvector) vs the lag time, and was set to 30 ns for the 3 systems (Apo-SMO, SAG-SMO, SANT1-SMO). The tICA reduced-dimension data was then clustered using k-means clustering. The optimum number of clusters and no. of tICA components to be used was optimized by maximizing the VAMP2 score (sum of the squares of the highest eigenvalues of the transition matrix) for a particular the number of clusters, and the convergence of the implied timescales vs the MSM lag time. (Fig. S2,S3,S4). Accordingly, the number of clusters was set to 200 (Apo-SMO) and 100 (SANT1-SMO and SAG-SMO). The MSM lag time was set to 30 ns for the three systems (Apo-SMO, SAG-SMO, SANT1-SMO). The Chapman-Kolmogorov test, which tests the validity of the MSM on 5 macrostates, was performed using pyEMMA (Fig. S5,S6,S7).

Trajectory Analysis and Visualization

cpptraj¹⁰¹ was used for trajectory processing. VMD^102,103 and open-source PyMOL¹⁰⁴ were used to visualize and render images. MDTraj¹⁰⁵ was used for computing all order parameters. All plots were made using matplotlib ¹⁰⁶ and seaborn¹⁰⁷ python libraries. Numpy¹⁰⁸ was used for numerical computations. The salt-bridge based contacts were discovered by extracting probability-weighted 10000 frames from clusters the in Inactive, I^1–3 and Active states, using cluster probabilities from the MSM. They were analyzed for unique contacts using GetContacts. ¹⁰⁹ Tunnel radii for analysis of effect of SAG and SANT1 were calculated using HOLE. ⁶⁷

Mutual Information Calculations

Mutual Information for describing the allosteric pathways was computed using mdentropy,¹¹⁰ using the DihedralMutualInformation function. Analysis was performed on 10000 frames each extracted from Apo-SMO, SANT1-SMO and SAG-SMO data. The frames were chosen based on the predicted MSM probabilities, to represent the entire ensemble. A graph was constructed from the computed Mutual Information, and residues with C-α distances < 10 Å were considered to be connected by an edge. The weight of each edge was assigned as MI = MI_max - MI_ab, with the MI_max as the maximum mutual information computed among two residues in a protein, and MI_ab was the mutual information computed between residue pair ab. Edges with MI < MI_avg were not considered. This methodolody thus adapted has been discussed previously.^69–71 The caveats and limitations presented by the methodology – the presence of global dynamics independent of the local dynamics being explored by the limited simulation data¹¹¹ have been resolved by using long timescale simulations. Allosteric pathways were computed by calculating the shortest paths between 2 nodes, (in our case E518 and W339) using Dijkstra’s algorithm.¹¹² NetworkX,¹¹³ a python library was used for graph-construction, visualization and computing shortest paths.

Supporting information

Supplementary text, figures, methods and references

Data Availability

Stripped trajectories and corresponding parameter files have been uploaded to Box. Scripts used for MSM construction and trajectory analysis have been uploaded to github.

Author Contributions

D.S. designed the research. P.D.B. performed simulations. P.D.B and S.D. analyzed the data. P.D.B. wrote the manuscript with inputs from S.D. and D.S.

Acknowledgements

The authors thank The Blue Waters Petascale Computing Facility and National Center for Supercomputing Applications, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. D.S. acknowledges support from NIH grant R35GM142745 and Cancer Center at Illinois for their support. P.D.B thanks Austin Weigle and Jiming Chen of the Shukla Group at University of Illinois for the valuable insights throughout the course of this study.

References

(1)
1. Riobo N. A.
2. Saucy B.
3. DiLizio C.
4. Manning D. R.
2006Activation of heterotrimeric G proteins by SmoothenedProc. Natl. Acad. Sci. U.S.A. 103:12607–12612Google Scholar
(2)
1. Ogden S. K.
2. Fei D. L.
3. Schilling N. S.
4. Ahmed Y. F.
5. Hwa J.
6. Robbins D. J.
2008G protein Gαi functions immediately downstream of Smoothened in Hedgehog signallingNature 456:967–970Google Scholar
(3)
1. Chen W.
2. Ren X.-R.
3. Nelson C. D.
4. Barak L. S.
5. Chen J. K.
6. Beachy P. A.
7. de Sauvage F.
8. Lefkowitz R. J.
2004Activity-Dependent Internalization of Smoothened Mediated by ß-Arrestin 2 and GRK2Science 306:2257–2260Google Scholar
(4)
1. Weis W. I.
2. Kobilka B. K.
2018The Molecular Basis of G Protein-Coupled Receptor ActivationAnnu. Rev. Biochem. 87:897–919Google Scholar
(5)
1. Latorraca N. R.
2. Venkatakrishnan A. J.
3. Dror R. O.
2016GPCR Dynamics: Structures in MotionChem. Rev. 117:139–155Google Scholar
(6)
1. Zhou Q.
2. et al.
2019Common activation mechanism of class A GPCRseLife 8Google Scholar
(7)
1. Nygaard R.
2. et al.
2013The Dynamic Process of β2-Adrenergic Receptor ActivationCell 152:532–542Google Scholar
(8)
1. Kohlhoff K. J.
2. Shukla D.
3. Lawrenz M.
4. Bowman G. R.
5. Konerding D. E.
6. Belov D.
7. Altman R. B.
8. Pande V. S.
2013Cloud-based simulations on Google Exacycle reveal ligand modulation of GPCR activation pathwaysNat. Chem 6:15–21Google Scholar
(9)
1. Mattedi G.
2. Acosta-Gutiérrez S.
3. Clark T.
4. Gervasio F. L.
2020A combined activation mechanism for the glucagon receptorProc. Natl. Acad. Sci. U.S.A 117:15414–15422Google Scholar
(10)
1. Wang C.
2. Wu H.
3. Katritch V.
4. Han G. W.
5. Huang X.-P.
6. Liu W.
7. Siu F. Y.
8. Roth B. L.
9. Cherezov V.
10. Stevens R. C.
2013Structure of the human smoothened receptor bound to an antitumour agentNature 497:338–343Google Scholar
(11)
1. Sriram K.
2. Insel P. A.
2018G Protein-Coupled Receptors as Targets for Approved Drugs: How Many Targets and How Many Drugs?Mol. Pharmacol 93:251–258Google Scholar
(12)
1. Logan C. Y.
2. Nusse R.
2004The WNT signaling pathway in development and diseaseAnnu. Rev. Cell Dev. Biol. 20:781–810Google Scholar
(13)
1. Riddle R. D.
2. Johnson R. L.
3. Laufer E.
4. Tabin C.
1993Sonic hedgehog mediates the polarizing activity of the ZPACell 75:1401–1416Google Scholar
(14)
1. Briscoe J.
2. Thérond P. P.
2013The mechanisms of Hedgehog signalling and its roles in development and diseaseNat. Rev. Mol. Cell Biol 1f:416–429Google Scholar
(15)
1. Lee R. T. H.
2. Zhao Z.
3. Ingham P. W.
2016Hedgehog signallingDevelopment 143:367–372Google Scholar
(16)
2022GTEx Tissue Expression Project. 2021https://www.gtexportal.org/home/gene/SMO
(17)
1. Chen Y.
2. Struhl G.
1996Dual Roles for Patched in Sequestering and Transducing HedgehogCell 87:553–563Google Scholar
(18)
1. Kong J. H.
2. Siebold C.
3. Rohatgi R.
2019Biochemical mechanisms of vertebrate hedgehog signalingDevelopment 146Google Scholar
(19)
1. Kinnebrew M.
2. Iverson E. J.
3. Patel B. B.
4. Pusapati G. V.
5. Kong J. H.
6. Johnson K. A.
7. Luchetti G.
8. Eckert K. M.
9. McDonald J. G.
10. Covey D. F.
11. Siebold C.
12. Radhakrishnan A.
13. Rohatgi R.
2019Cholesterol accessibility at the ciliary membrane controls hedgehog signalingeLife 8Google Scholar
(20)
1. Kinnebrew M.
2. Luchetti G.
3. Sircar R.
4. Frigui S.
5. Viti L. V.
6. Naito T.
7. Beckert F.
8. Saheki Y.
9. Siebold C.
10. Radhakrishnan A.
11. Rohatgi R.
2021Patched 1 reduces the accessibility of cholesterol in the outer leaflet of membraneseLife 10Google Scholar
(21)
1. Nieuwenhuis E.
2. c Hui C.
2004Hedgehog signaling and congenital malformationsClin. Genet. 67:193–208Google Scholar
(22)
1. Keeler R. F.
1969Toxic and teratogenic alkaloids of western range plantsJ. Agric. Food Chem. 17:473–482Google Scholar
(23)
1. Heretsch P.
2. Tzagkaroulaki L.
3. Giannis A.
2010Cyclopamine and Hedgehog Signaling: Chemistry, Biology, Medical PerspectivesAngew. Chem. 49:3418–3427Google Scholar
(24)
1. Taipale J.
2. Chen J. K.
3. Cooper M. K.
4. Wang B.
5. Mann R. K.
6. Milenkovic L.
7. Scott M. P.
8. Beachy P. A.
2000Effects of oncogenic mutations in Smoothened and Patched can be reversed by cyclopamineNature 406:1005–1009Google Scholar
(25)
1. Chen J. K.
2. Taipale J.
3. Cooper M. K.
4. Beachy P. A.
2002Inhibition of Hedgehog signaling by direct binding of cyclopamine to SmoothenedGenes Dev. 16:2743–2748Google Scholar
(26)
1. Nachtergaele S.
2. Whalen D. M.
3. Mydock L. K.
4. Zhao Z.
5. Malinauskas T.
6. Krishnan K.
7. Ingham P. W.
8. Covey D. F.
9. Siebold C.
10. Rohatgi R.
2013Structure and function of the Smoothened extracellular domain in vertebrate Hedgehog signalingeLife 2Google Scholar
(27)
1. Corcoran R. B.
2. Scott M. P.
2006Oxysterols stimulate Sonic hedgehog signal transduction and proliferation of medulloblastoma cellsProc. Natl. Acad. Sci. U.S.A. 103:8408–8413Google Scholar
(28)
1. Raleigh D. R.
2. Reiter J. F.
2019Misactivation of Hedgehog signaling causes inherited and sporadic cancersJ. Clin. Investig. 129:465–475Google Scholar
(29)
1. Axelson M.
2. Liu K.
3. Jiang X.
4. He K.
5. Wang J.
6. Zhao H.
7. Kufrin D.
8. Palmby T.
9. Dong Z.
10. Russell A. M.
11. Miksinski S.
12. Keegan P.
13. Pazdur R.
2013US Food and Drug Administration approval: vismodegib for recurrent, locally advanced, or metastatic basal cell carcinomaClin. Cancer Res. 19:2289–2293Google Scholar
(30)
1. Jain S.
2. Song R.
3. Xie J.
2017Sonidegib: mechanism of action, pharmacology, and clinical utility for advanced basal cell carcinomasOncoTargets Ther. 10:1645–1653Google Scholar
(31)
1. Meani R. E.
2. Lim S.-W.
3. Chang A. L. S.
4. Kelly J. W.
2014Emergence of chemoresistance in a metastatic basal cell carcinoma patient after complete response to hedgehog pathway inhibitor vismodegib (GDC-0449)Australas. J. Dermatol. 55:218–221Google Scholar
(32)
1. Byrne E. F. X.
2. Sircar R.
3. Miller P. S.
4. Hedger G.
5. Luchetti G.
6. Nachtergaele S.
7. Tully M. D.
8. Mydock-McGrane L.
9. Covey D. F.
10. Rambo R. P.
11. Sansom M. S. P.
12. Newstead S.
13. Rohatgi R.
14. Siebold C.
2016Structural basis of Smoothened regulation by its extracellular domainsNature 535:517–522Google Scholar
(33)
1. Qi X.
2. Liu H.
3. Thompson B.
4. McDonald J.
5. Zhang C.
6. Li X.
2019Cryo-EM structure of oxysterol-bound human Smoothened coupled to a heterotrimeric GiNature 571:279–283Google Scholar
(34)
1. Huang P.
2. Nedelcu D.
3. Watanabe M.
4. Jao C.
5. Kim Y.
6. Liu J.
7. Salic A.
2016Cellular Cholesterol Directly Activates Smoothened in Hedgehog SignalingCell 166:1176–1187Google Scholar
(35)
1. Wang C.
2. Wu H.
3. Evron T.
4. Vardy E.
5. Han G. W.
6. Huang X.-P.
7. Hufeisen S. J.
8. Mangano T. J.
9. Urban D. J.
10. Katritch V.
11. Cherezov V.
12. Caron M. G.
13. Roth B. L.
14. Stevens R. C.
2014Structural basis for Smoothened receptor modulation and chemoresistance to anticancer drugsNat. Commun. 5:4355Google Scholar
(36)
1. Weierstall U.
2. et al.
2014Lipidic cubic phase injector facilitates membrane protein serial femtosecond crystallographyNat. Commun. 5:3309Google Scholar
(37)
1. Zhang X.
2. et al.
2017Crystal structure of a multi-domain human smoothened receptor in complex with a super stabilizing ligandNat. Commun. 8:15383Google Scholar
(38)
1. Qi X.
2. Friedberg L.
3. Bose-Boyd R. D.
4. Long T.
5. Li X.
2020Sterols in an intramolecular channel of Smoothened mediate Hedgehog signalingNat. Chem. Biol. 16:1368–1375Google Scholar
(39)
1. Deshpande I.
2. Liang J.
3. Hedeen D.
4. Roberts K. J.
5. Zhang Y.
6. Ha B.
7. Lator-raca N. R.
8. Faust B.
9. Dror R. O.
10. Beachy P. A.
11. Myers B. R.
12. Manglik A.
2019Smoothened stimulation by membrane sterols drives Hedgehog pathway activityNature 571:284–288Google Scholar
(40)
1. Wright S. C.
2. et al.
2019A conserved molecular switch in Class F receptors regulates receptor activation and pathway selectionNat. Commun. 10:667Google Scholar
(41)
1. Ballesteros J. A.
2. Weinstein H.
1995Methods in NeurosciencesElsevier pp. 366–428Google Scholar
(42)
1. Radhakrishnan A.
2. Rohatgi R.
3. Siebold C.
2020Cholesterol access in cellular membranes controls Hedgehog signalingNat. Chem. Biol. 16:1303–1313Google Scholar
(43)
1. Husic B. E.
2. Pande V. S.
2018Markov State Models: From an Art to a ScienceJ. Am. Chem. Soc. 140:2386–2396Google Scholar
(44)
1. Shukla D.
2. Hernández C. X.
3. Weber J. K.
4. Pande V. S.
2015Markov State Models Provide Insights into Dynamic Modulation of Protein FunctionAcc. Chem. Res. 48:414–422Google Scholar
(45)
1. Chan M. C.
2. Selvam B.
3. Young H. J.
4. Procko E.
5. Shukla D.
2022The substrate import mechanism of the human serotonin transporterBiophys. J. 121:715–730Google Scholar
(46)
1. Selvam B.
2. Yu Y.-C.
3. Chen L.-Q.
4. Shukla D.
2019Molecular Basis of the Glucose Transport Mechanism in PlantsACS Cent. Sci. 5:1085–1096Google Scholar
(47)
1. Selvam B.
2. Mittal S.
3. Shukla D.
2018Free Energy Landscape of the Complete Transport Cycle in a Key Bacterial TransporterACS Cent. Sci. 4:1146–1154Google Scholar
(48)
1. Ferruz N.
2. Doerr S.
3. Vanase-Frawley M. A.
4. Zou Y.
5. Chen X.
6. Marr E. S.
7. Nelson R. T.
8. Kormos B. L.
9. Wager T. T.
10. Hou X.
11. Villalobos A.
12. Sciabola S.
13. Fab-ritiis G. D.
2018Dopamine D3 receptor antagonist reveals a cryptic pocket in aminergic GPCRsSci. Rep. 8Google Scholar
(49)
1. Taylor B. C.
2. Lee C. T.
3. Amaro R. E.
2019Structural basis for ligand modulation of the CCR2 conformational landscapeProc. Natl. Acad. Sci. U.S.A. 116:8131–8136Google Scholar
(50)
1. Dutta S.
2. Selvam B.
3. Shukla D.
2022Distinct Binding Mechanisms for Allosteric Sodium Ion in Cannabinoid ReceptorsACS Chem. Neurosci. 13:379–389Google Scholar
(51)
1. Dutta S.
2. Selvam B.
3. Das A.
4. Shukla D.
2022Mechanistic origin of partial agonism of tetrahydrocannabinol for cannabinoid receptorsJ. Biol. Chem 298:101764Google Scholar
(52)
1. Chen Y.
2. Fleetwood O.
3. Pérez-Conesa S.
4. Delemotte L.
2021Allosteric Effect of Nanobody Binding on Ligand-Specific Active States of the β2 Adrenergic ReceptorJ. Chem. Inf. Model. 61:6024–6037Google Scholar
(53)
1. Selvam B.
2. Shamsi Z.
3. Shukla D.
2018Universality of the Sodium Ion Binding Mechanism in Class A G-Protein-Coupled ReceptorsAngew. Chem. 57:3048–3053Google Scholar
(54)
1. Kapoor A.
2. Martinez-Rosell G.
3. Provasi D.
4. de Fabritiis G.
5. Filizola M.
2017Dynamic and Kinetic Elements of μ-Opioid Receptor Functional SelectivitySci. Rep. 7Google Scholar
(55)
1. Kapoor A.
2. Provasi D.
3. Filizola M.
2020Atomic-Level Characterization of the Methadone-Stabilized Active Conformation of μ-Opioid ReceptorMol. Pharmacol. 98:475–486Google Scholar
(56)
1. Shukla D.
2. Lawrenz M.
3. Pande V. S.
2014Elucidating ligand-modulated conformational landscape of gpcrs using cloud-computing approachesMethods Enzymol. 557:551–572Google Scholar
(57)
1. Bowman G. R.
2. Ensign D. L.
3. Pande V. S.
2010Enhanced Modeling via Network Theory: Adaptive Sampling of Markov State ModelsJ. Chem. Theory Comput 6:787–794Google Scholar
(58)
1. Pérez-Hernéndez G.
2. Paul F.
3. Giorgino T.
4. Fabritiis G. D.
5. Noé F.
2013Identification of slow molecular order parameters for Markov model constructionJ. Chem. Phys. 139:015102Google Scholar
(59)
1. Schwantes C. R.
2. Pande V. S.
2013Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9J. Chem. Theory Comput. 9:2000–2009Google Scholar
(60)
1. Turku A.
2. Schihada H.
3. Kozielewicz P.
4. Bowin C.-F.
5. Schulte G.
2021Residue 6.43 defines receptor function in class F GPCRsNat. Commun. 12:3919Google Scholar
(61)
1. Xu L.
2. Chen B.
3. Schihada H.
4. Wright S. C.
5. Turku A.
6. Wu Y.
7. Han G.-W.
8. Kowalski-Jahn M.
9. Kozielewicz P.
10. Bowin C.-F.
11. Zhang X.
12. Li C.
13. Bouvier M.
14. Schulte G.
15. Xu F.
2021Cryo-EM structure of constitutively active human Frizzled 7 in complex with heterotrimeric GsCell. Res. 31:1311–1314Google Scholar
(62)
1. Hofmann K. P.
2. Scheerer P.
3. Hildebrand P. W.
4. Choe H.-W.
5. Park J. H.
6. Heck M.
7. Ernst O. P.
2009A G protein-coupled receptor at work: the rhodopsin modelTrends in Biochemical Sciences 34:540–552Google Scholar
(63)
1. Dijkgraaf G. J. P.
2. Alicke B.
3. Weinmann L.
4. Januario T.
5. West K.
6. Modrusan Z.
7. Burdick D.
8. Goldsmith R.
9. Robarge K.
10. Sutherlin D.
11. Scales S. J.
12. Gould S. E.
13. Yauch R. L.
14. de Sauvage F. J.
2010Small Molecule Inhibition of GDC-0449 Refractory Smoothened Mutants and Downstream Mechanisms of Drug ResistanceCancer Res. 71:435–444Google Scholar
(64)
1. Huang P.
2. Zheng S.
3. Wierbowski B. M.
4. Kim Y.
5. Nedelcu D.
6. Aravena L.
7. Liu J.
8. Kruse A. C.
9. Salic A.
2018Structural Basis of Smoothened Activation in Hedgehog SignalingCell 174:312–324Google Scholar
(65)
1. Myers B. R.
2. Sever N.
3. Chong Y. C.
4. Kim J.
5. Belani J. D.
6. Rychnovsky S.
7. Bazan J. F.
8. Beachy P. A.
2013Hedgehog Pathway Modulation by Multiple Lipid Binding Sites on the Smoothened Effector of Signal ResponseDev. Cell 26:346–357Google Scholar
(66)
1. Yang H.
2. Xiang J.
3. Wang N.
4. Zhao Y.
5. Hyman J.
6. Li S.
7. Jiang J.
8. Chen J. K.
9. Yang Z.
10. Lin S.
2009Converse Conformational Control of Smoothened Activity by Structurally Related Small MoleculesJ. Biol. Chem. 284:20876–20884Google Scholar
(67)
1. Smart O.
2. Goodfellow J.
3. Wallace B.
1993The pore dimensions of gramicidin ABiophys. J. 65:2455–2460Google Scholar
(68)
1. Hedger G.
2. Koldsø H.
3. Chavent M.
4. Siebold C.
5. Rohatgi R.
6. Sansom M. S.
2019Cholesterol Interaction Sites on the Transmembrane Domain of the Hedgehog Signal Transducer and Class F G Protein-Coupled Receptor SmoothenedStructure 27:549–559Google Scholar
(69)
1. Lee S.
2. Nivedha A. K.
3. Tate C. G.
4. Vaidehi N.
2019Dynamic Role of the G Protein in Stabilizing the Active State of the Adenosine A2A ReceptorStructure 27:703–712Google Scholar
(70)
1. Bhattacharya S.
2. Vaidehi N.
2014Differences in Allosteric Communication Pipelines in the Inactive and Active States of a GPCRBiophys. J. 107:422–434Google Scholar
(71)
1. Niesen M. J. M.
2. Bhattacharya S.
3. Grisshammer R.
4. Tate C. G.
5. Vaidehi N.
2013Thermostabilization of the β 1-adrenergic receptor correlates with increased entropy of the inactive stateJ. Phys. Chem. B 117:7283–7291Google Scholar
(72)
1. Eswar N.
2. Webb B.
3. Marti-Renom M. A.
4. Madhusudhan M.
5. Eramian D.
6. yi Shen M.
7. Pieper U.
8. Sali A.
2006Comparative Protein Structure Modeling Using ModellerCurr. Protoc. Bioinform. 15Google Scholar
(73)
1. Gordon J. C.
2. Myers J. B.
3. Folta T.
4. Shoja V.
5. Heath L. S.
6. Onufriev A.
2005H++: a server for estimating pKas and adding missing hydrogens to macromoleculesNucleic Acids Research 33:W368–W371Google Scholar
(74)
1. Jo S.
2. Kim T.
3. Iyer V. G.
4. Im W.
2008CHARMM-GUI: A web-based graphical user interface for CHARMMJ. Comput. Chem. 29:1859–1865Google Scholar
(75)
1. Lee J.
2. et al.
2018CHARMM-GUI Membrane Builder for Complex Biological Membrane Simulations with Glycolipids and LipoglycansJ. Chem. Theory Comput. 15:775–786Google Scholar
(76)
1. Klauda J. B.
2. Venable R. M.
3. Freites J. A.
4. O’Connor J. W.
5. Tobias D. J.
6. Mondragon-Ramirez C.
7. Vorobyov I.
8. MacKerell A. D.
9. Pastor R. W.
2010Update of the CHARMM All-Atom Additive Force Field for Lipids: Validation on Six Lipid TypesJ. Phys. Chem. B 114:7830–7843Google Scholar
(77)
1. Best R. B.
2. Zhu X.
3. Shim J.
4. Lopes P. E. M.
5. Mittal J.
6. Feig M.
7. MacKerell A. D.
2012Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone ϕ, ψ and side-chain χ1 and χ2 dihedral anglesJ. Chem. Theory Comput. 8:3257–3273Google Scholar
(78)
1. Marino K. A.
2. Filizola M.
2017Methods in Molecular Biologypp. 351–364Google Scholar
(79)
1. Ribeiro J. M. L.
2. Filizola M.
2019Insights From Molecular Dynamics Simulations of a Number of G-Protein Coupled Receptor Targets for the Treatment of Pain and Opioid Use DisordersFrontiers in Molecular Neuroscience :12Google Scholar
(80)
1. Lu S.
2. He X.
3. Yang Z.
4. Chai Z.
5. Zhou S.
6. Wang J.
7. Rehman A. U.
8. Ni D.
9. Pu J.
10. Sun J.
11. Zhang J.
2021Activation pathway of a G protein-coupled receptor uncovers conformational intermediates as targets for allosteric drug designNature Communications 12Google Scholar
(81)
1. Hedderich J. B.
2. Persechino M.
3. Becker K.
4. Heydenreich F. M.
5. Gutermuth T.
6. Bouvier M.
7. Bünemann M.
8. Kolb P.
2022The pocketome of G-protein-coupled receptors reveals previously untargeted allosteric sitesNature Communications :13Google Scholar
(82)
2021CGenFF interface at paramchem. 2021https://cgenff.umaryland.edu/
(83)
1. Vanommeslaeghe K.
2. Hatcher E.
3. Acharya C.
4. Kundu S.
5. Zhong S.
6. Shim J.
7. Darian E.
8. Guvench O.
9. Lopes P.
10. Vorobyov I.
11. Mackerell A. D.
2009CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fieldsJ. Comput. Chem. :NA–NAGoogle Scholar
(84)
1. Vanommeslaeghe K.
2. Raman E. P.
3. MacKerell A. D.
2012Automation of the CHARMM General Force Field (CGenFF) II: Assignment of Bonded Parameters and Partial Atomic ChargesJ. Chem. Inf. Model. 52:3155–3168Google Scholar
(85)
1. Turney J. M.
2. et al.
2011Psi4: an open-source ab-initio electronic structure programWiley Interdisciplinary Reviews: Computational Molecular Science 2:556–565Google Scholar
(86)
1. Lee J.
2. et al.
2015CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force FieldJ. Chem. Theory Comput. 12:405–413Google Scholar
(87)
1. Scandroglio F.
2. Venkata J. K.
3. Loberto N.
4. Prioni S.
5. Schuchman E. H.
6. Chig-orno V.
7. Prinetti A.
8. Sonnino S.
2008Lipid content of brain, brain membrane lipid domains, and neurons from acid sphingomyelinase deficient miceJ. Neurochem. 107:329–338Google Scholar
(88)
1. Jorgensen W. L.
2. Chandrasekhar J.
3. Madura J. D.
4. Impey R. W.
5. Klein M. L.
1983Comparison of simple potential functions for simulating liquid waterJ. Chem. Phys. 79:926–935Google Scholar
(89)
1. Hopkins C. W.
2. Grand S. L.
3. Walker R. C.
4. Roitberg A. E.
2015Long-Time-Step Molecular Dynamics through Hydrogen Mass RepartitioningJ. Chem. Theory Comput. 11:1864–1874Google Scholar
(90)
1. Case D.
2. Belfon K.
3. Ben-Shalom I.
4. Brozell S.
5. Cerutti D.
6. Cheatham T.
7. Cruzeiro V.
8. Darden T.
9. Duke R.
2018AMBERUniversity of California Google Scholar
(91)
1. Andersen H. C.
1983Rattle: A “velocity” version of the shake algorithm for molecular dynamics calculationsJ. Comput. Phys. 52:24–34Google Scholar
(92)
1. Salomon-Ferrer R.
2. Case D. A.
3. Walker R. C.
2012An overview of the Amber biomolecular simulation packageWiley Interdiscip. Rev. Comput. Mol. Sci. 3:198–210Google Scholar
(93)
1. Case D. A.
2. Cheatham T. E.
3. Darden T.
4. Gohlke H.
5. Luo R.
6. Merz K. M.
7. Onufriev A.
8. Simmerling C.
9. Wang B.
10. Woods R. J.
2005The Amber biomolecular simulation programsJ. Comput. Chem. 26:1668–1688Google Scholar
(94)
1. Götz A. W.
2. Williamson M. J.
3. Xu D.
4. Poole D.
5. Grand S. L.
6. Walker R. C.
2012Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized BornJ. Chem. Theory Comput. 8:1542–1555Google Scholar
(95)
1. Salomon-Ferrer R.
2. Götz A. W.
3. Poole D.
4. Grand S. L.
5. Walker R. C.
2013Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh EwaldJ. Chem. Theory Comput. 9:3878–3888Google Scholar
(96)
1. Phillips J. C.
2. Braun R.
3. Wang W.
4. Gumbart J.
5. Tajkhorshid E.
6. Villa E.
7. Chipot C.
8. Skeel R. D.
9. Kalé L.
10. Schulten K.
2005Scalable molecular dynamics with NAMDJ. Comput. Chem. 26:1781–1802Google Scholar
(97)
1. Phillips J. C.
2. et al.
2020Scalable molecular dynamics on CPU and GPU architectures with NAMDJ. Chem. Phys. 153:044130Google Scholar
(98)
1. Davidchack R. L.
2. Handel R.
3. Tretyakov M. V.
2009Langevin thermostat for rigid body dynamicsJ. Chem. Phys. 130:234101Google Scholar
(99)
1. Darden T.
2. York D.
3. Pedersen L.
1993Particle mesh Ewald: An N log (N) method for Ewald sums in large systemsJ. Chem. Phys. 98:10089–10092Google Scholar
(100)
1. Scherer M. K.
2. Trendelkamp-Schroer B.
3. Paul F.
4. Pérez-Hernández G.
5. Hoffmann M.
6. Plattner N.
7. Wehmeyer C.
8. Prinz J.-H.
9. Noé F.
2015PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov ModelsJ. Chem. Theory Comput. 11:5525–5542Google Scholar
(101)
1. Roe D. R.
2. Cheatham T. E.
2013PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory DataJ. Chem. Theory Comput. 9:3084–3095Google Scholar
(102)
1. Humphrey W.
2. Dalke A.
3. Schulten K.
1996VMD: Visual molecular dynamicsJ. Mol. Graph. 14:33–38Google Scholar
(103)
1. Stone J.
1998An Efficient Library for Parallel Ray Tracing and AnimationM.Sc. thesis, Computer Science Department, University of Missouri-Rolla Google Scholar
(104)
Schrodinger, LLCThe PyMOL Molecular Graphics System https://pymol.org/2/
(105)
1. McGibbon R. T.
2. Beauchamp K. A.
3. Harrigan M. P.
4. Klein C.
5. Swails J. M.
6. Hernández C. X.
7. Schwantes C. R.
8. Wang L.-P.
9. Lane T. J.
10. Pande V. S.
2015MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics TrajectoriesBiophys. J. 109:1528–1532Google Scholar
(106)
1. Hunter J. D.
2007Matplotlib: A 2D graphics environmentComput. Sci. Eng. 9:90–95Google Scholar
(107)
1. Waskom M. L.
2021seaborn: statistical data visualizationJ. Open Source Softw. 6:3021Google Scholar
(108)
1. Harris C. R.
2. et al.
2020Array programming with NumPyNature 585:357–362Google Scholar
(109)
2022Getcontacts. 2022https://getcontacts.github.io/
(110)
1. Hernández C. X.
2. Pande V. S.
2015mdentropy: v0.2https://doi.org/10.5281/zenodo.18859 Google Scholar
(111)
1. Pandini A.
2. Fornili A.
3. Fraternali F.
4. Kleinjung J.
2011Detection of allosteric signal transmission by information-theoretic analysis of protein dynamicsThe FASEB Journal 26:868–881Google Scholar
(112)
1. Dijkstra E. W.
1959A note on two problems in connexion with graphsNumer. Math 1:269–271Google Scholar
(113)
1. Hagberg A. A.
2. Schult D. A.
3. Swart P. J.
2008Exploring Network Structure, Dynamics, and Function using NetworkXIn: Proceedings of the 7th Python in Science Conference pp. 11–15Google Scholar

Article and author information

Author information

Prateek D. Bansal
Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
Soumajit Dutta
Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
Diwakar Shukla
Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States, Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States, Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
ORCID iD: 0000-0003-4079-5381
- E-mail: diwakar@illinois.edu

Version history

Preprint posted: June 8, 2022
Reviewed Preprint: September 9, 2022
Curated Preprint: August 24, 2023

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.