CRISPR technologies and the search for the PAM-free nuclease

Collias, Daphne; Beisel, Chase L.

doi:10.1038/s41467-020-20633-y

Review Article
Open access
Published: 22 January 2021

Nature Communications volume 12, Article number: 555 (2021)

Abstract

The ever-expanding set of CRISPR technologies and their programmable RNA-guided nucleases exhibit remarkable flexibility in DNA targeting. However, this flexibility comes with an ever-present constraint: the requirement for a protospacer adjacent motif (PAM) flanking each target. While PAMs play an essential role in self/nonself discrimination by CRISPR-Cas immune systems, this constraint has launched a far-reaching expedition for nucleases with relaxed PAM requirements. Here, we review ongoing efforts toward realizing PAM-free nucleases through natural ortholog mining and protein engineering. We also address potential consequences of fully eliminating PAM recognition and instead propose an alternative nuclease repertoire covering all possible PAM sequences.

Introduction

The world of biotechnology has undergone a seismic shift with the arrival of CRISPR technologies. These technologies rely on a CRISPR-associated (Cas) nuclease paired with a guide RNA (gRNA). The ~20–30-nt guide portion of the gRNA helps the nuclease find complementary nucleic-acid sequences, and the nuclease enzymatically cleaves these sequences. This programmable and sequence-specific capability has improved existing approaches or catalyzed the development of new approaches that have collectively led to the shift. As one example, genome editing can be performed by cleaving specific DNA sequences and guiding the repair process, whether for reversing genetic diseases, improving traits of crop plants, or studying the genetic basis of cellular functions. In addition, gene expression can be selectively activated or repressed at one or multiple loci to tune the level of gene expression and alter cellular behavior^1,2. CRISPR has also been used for a growing class of in vitro diagnostics that rapidly screen for specific nucleic acid sequences in a patient sample with single-base resolution³. Many other applications of CRISPR technologies also exist, such as high-throughput screens, gene drives, tailored-spectrum antimicrobials, recorders of transcriptional profiles and cellular fate, and more⁴.

The ever-expanding list of applications has come with a push to improve the overall utility and flexibility of CRISPR technologies. One restrictive barrier has been the targetable sequences for a given Cas nuclease. Successful targeting requires two factors: extensive complementarity between the gRNA guide and the nucleic acid target, and a short sequence flanking the target typically called a protospacer-adjacent motif (PAM) (Fig. 1a). While some factors influence which sequences can be selected as targets (e.g., the presence of similar off-target genomic sites, GC content, and internal secondary structure)⁵, generally a guide can be created for any target. The PAM requirement, however, is far less flexible (see Box 1). The nuclease scans available DNA for a PAM before probing guide-target complementarity (Fig. 1a). Consequently, a sequence with perfect complementarity to the guide but lacking a PAM will be ignored by the nuclease. The PAM requirement, therefore, serves as a gatekeeper for targeting by CRISPR–Cas.
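The gatekeeper logic can be illustrated with a minimal Python sketch. It assumes a SpyCas9-style arrangement (the PAM sits immediately 3′ of the protospacer on the nontarget strand), searches only the strand it is given, and demands a perfect match to the guide. The function names and toy sequences are hypothetical and are not taken from any published tool.

```python
# IUPAC degenerate nucleotide codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT", "S": "CG", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def matches_motif(site, motif):
    """True if a concrete DNA site satisfies a degenerate motif such as NGG."""
    return len(site) == len(motif) and all(
        base in IUPAC[code] for base, code in zip(site, motif)
    )


def pam_gated_hits(dna, protospacer, pam_motif="NGG"):
    """Return start indices of protospacers that pass both checkpoints.

    Simplifying assumptions: forward strand only, PAM directly 3' of the
    protospacer, and exact identity standing in for guide complementarity.
    """
    n, k = len(protospacer), len(pam_motif)
    hits = []
    for pam_start in range(n, len(dna) - k + 1):
        # Checkpoint 1: is a PAM present at this position?
        if not matches_motif(dna[pam_start:pam_start + k], pam_motif):
            continue
        # Checkpoint 2: does the adjacent sequence match the guide?
        if dna[pam_start - n:pam_start] == protospacer:
            hits.append(pam_start - n)
    return hits


# Toy example: the same 20-nt protospacer appears twice, but only the
# first copy is followed by a sequence that fits NGG.
protospacer = "GACGTTAGCCATAGCTTGCA"
dna = "TT" + protospacer + "AGG" + "CCCC" + protospacer + "ATT" + "CC"
print(pam_gated_hits(dna, protospacer))          # PAM-dependent search
print(pam_gated_hits(dna, protospacer, "NNN"))   # hypothetical PAM-free search
```

Passing `"NNN"` as the motif mimics a PAM-free nuclease in this toy model: both copies of the protospacer are then reported.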

**Fig. 1: The PAM in target recognition and self/nonself-discrimination for CRISPR–Cas systems.**

While a limitation to target selection, the PAM plays an essential role in the natural function of CRISPR–Cas systems, the source of CRISPR technologies (Box 1). The PAM allows these prokaryotic immune systems to differentiate between the DNA target in foreign genetic material (nonself) and the same DNA sequence encoded within CRISPR arrays (self) that produce the RNA guides (Fig. 1b). Without the PAM requirement, CRISPR–Cas systems would target their CRISPR arrays, leading to a potentially catastrophic autoimmune response. Virtually all CRISPR nucleases require a PAM in one form or another. However, the recognized PAM sequences are not shared by all Cas nucleases and instead vary widely, with different sequences, lengths, complexities, orientations, and distances from the target (Supplementary Data 1)^{6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50}. This requirement restricts our ability to target any sequence with CRISPR and has led to widespread efforts to relax the PAM requirement, even to the point that nearly any sequence would be recognized as a PAM.

Here, we review efforts to-date that have involved mining natural orthologs and engineering a few well-characterized nucleases for relaxed or altered PAM requirements. We also explore the ramifications of achieving a truly PAM-free nuclease and propose a competing approach based on assembling a repertoire of PAM-dependent nucleases that collectively recognize all possible sequences. PAM determination methods have also been critical to elucidate sequences recognized by each nuclease (Box 2) and have been reviewed previously⁵¹. Overall, this review addresses a rapidly developing sector of CRISPR technologies that could redefine our ability to target any sequence at will.

Box 1. PAM origins and mechanisms

As part of target recognition, Cas nucleases proceed through two checkpoints. First, the nuclease assesses the sequence flanking the intended target (Fig. 1a). For DNA-targeting nucleases, this sequence is often one or multiple sequences collectively called a protospacer adjacent motif, or PAM^51,60. In contrast, RNA-targeting nucleases (e.g., type III Csm/Cmr complex, type VI Cas13) have been shown to evaluate complementarity between the flanking sequence and a handle sequence encoded within the gRNA⁶⁹. As the second checkpoint, the nuclease assesses base pairing between the guide and the DNA target strand through R-loop formation (Fig. 1a). If both checkpoints are passed, then the nuclease cleaves the target through its specific mechanism-of-action. The PAM, therefore, serves as an essential gatekeeper preventing the nuclease from accessing certain DNA sequences, even if they harbor complete complementarity to the guide.

The gatekeeper function of the PAM is rooted in the natural source of CRISPR technologies and Cas nucleases: CRISPR–Cas systems. These adaptive immune systems native to bacteria and archaea encode their gRNAs within unique patterns of DNA called CRISPR arrays. The arrays comprise alternating conserved repeats and guide-encoding spacers, with each spacer acquired from a previously encountered bacteriophage or another mobile genetic element. By storing the invader-derived sequence that gives rise to a gRNA, CRISPR–Cas systems inherently face a potentially fatal predicament: the DNA encoding the guide would also yield extensive complementarity to the guide. Thus, there lies the potential for each spacer to be recognized as the original invader, leading to genome attack. However, the flanking repeat lacks the PAM recognized by the Cas nuclease (Fig. 1b), allowing the nuclease to effectively ignore this ever-present opportunity for autoimmunity. The PAM, therefore, allows the nuclease to discriminate between subsequent infection by the invader (nonself) and the invader-derived spacer sequence encoded in the CRISPR array (self). Accordingly, CRISPR–Cas systems would be under stringent selective pressure to evolve and maintain PAM recognition as an absolute requirement of immune function. Fortunately for PAM engineering, the presence of this selective pressure also implies that PAM recognition could be undone outside of the natural context of Cas nucleases.

The molecular details of PAM recognition have been revealed for some canonical Cas nucleases. Cas9 from Streptococcus pyogenes (SpyCas9) has been characterized the most extensively, where structural analyses and subsequent biochemical assays revealed a series of steps that drive PAM recognition¹⁰⁵. Briefly, two arginines within the PAM-interacting domain (PID) recognize adjacent guanines on the nontarget strand of the NGG consensus PAM. Recognition is further stabilized by nonspecific interactions with DNA adjacent to the PAM¹⁰⁶. Residues Ser1109 and Glu1108 within the PID form a phosphate lock with the phosphate on the target strand linking the N nucleotide of the PAM and the first nucleotide of the target sequence complementary to the guide. These events release binding energy that initiates strand separation and R-loop formation.

Characterization of PAM recognition by other Cas nucleases revealed variations on this theme. For example, Cas9 nucleases that are phylogenetically distinct from SpyCas9 rely on a phosphate lock to drive R-loop formation, but read out their consensus PAM using residues within the PI and WED domains^{48,49,107,108}. Some of these Cas9 nucleases also recognize specific bases on both DNA strands^48,108, while molecular-modeling efforts have suggested contributions from van der Waals interactions¹⁰⁸. Separately, Cas12a nucleases rely on three distinct domains (PI, REC1, and WED) to recognize the PAM. Recognition occurs not only through detecting specific bases but also the shape of the double-stranded DNA and actively rejecting non-PAM sequences. A separate interaction also occurs with the phosphate separating the PAM and target, akin to the phosphate lock for Cas9¹⁰⁹. Finally, the multiprotein subunit effector complex from Type I CRISPR–Cas systems relies on the recognition of specific bases and DNA shape within the major or minor groove of the PAM DNA. The characterized Cascade complexes also drive a protein wedge into the DNA to force the two strands apart and promote R-loop formation¹¹⁰, in contrast to the phosphate lock exhibited by Cas9 and Cas12a. More details concerning the location and composition of the PAM as well as the molecular mechanisms of PAM recognition can be found in multiple recent reviews^51,110,111. Overall, existing molecular insights into PAM recognition have inspired how PAM recognition could be altered—or even relieved.

Box 2. PAM determination methods

Elucidating the set of recognized PAM sequences has been a key step when mining natural Cas orthologs or engineering PAM recognition. As a result, a variety of determination methods have been developed and implemented. Each method can be classified as bioinformatic or experimental, with experimental methods further divided by whether the assay is performed in vitro or in vivo and whether it relies on target binding or cleavage. Bioinformatics methods align CRISPR spacers from a nuclease’s natural CRISPR–Cas system to matching sequences (e.g., in plasmids and bacteriophages) in available databases, with the flanking sequence representing a PAM. The drawbacks to this approach are that flanking sequences are specific to the acquisition rather than the nuclease, few (if any) matching sequences are often identified, and the flanking sequences could have been mutated as part of CRISPR avoidance. Instead, experimental methods based on next-generation sequencing (NGS) are commonly employed to elucidate the nuclease’s recognized PAMs. In vitro methods typically involve subjecting a library of potential PAM sequences to NGS after cleavage by purified nuclease with a gRNA^24,26 or by whole-cell lysate with expressed nuclease and an added gRNA transcribed in vitro^21,76. An adapter sequence is ligated onto the cleaved sequences to enrich recognized PAM sequences^21,24,26. Alternatively, PAMs can be determined by the extent of depletion compared to the original library or a nontargeted control^50,76. Aside from assaying for DNA cleavage events, base-editing events can also be assayed in vitro to determine PAM preferences from the depletion of sequences edited by a cytosine base editor⁷⁶.
In vivo methods based on target cleavage have relied on three approaches: clearance of a plasmid encoding the target and PAM library in bacteria²², selecting gRNAs that target along the genome of an infecting RNA phage in bacteria¹¹² or evaluating editing frequencies using constructs encoding both the guide RNA and target in human cells¹¹³. Separately, two different methods in bacteria link target binding by a catalytically dead nuclease to green fluorescent protein fluorescence or growth that both enrich for recognized PAM sequences^25,46. Finally, cell-free transcription–translation systems (TXTL) offer a more rapid and scalable means to determine PAM sequences by eliminating cell transformation and growth as well as protein and RNA purification⁵⁰.
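The depletion readout used by several of these assays can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: the read counts are invented, the pseudocount and the log2 ratio of read shares are one common normalization choice rather than the scoring scheme of any cited method, and `pam_depletion` is a hypothetical helper name.

```python
import math


def pam_depletion(library_counts, cleaved_counts, pseudocount=1.0):
    """Score each PAM by the log2 change in its share of reads.

    library_counts: reads per PAM in the starting (or nontargeted) library.
    cleaved_counts: reads per PAM surviving after nuclease treatment.
    More negative scores mean stronger depletion, i.e. better recognition.
    """
    n_pams = len(library_counts)
    total_before = sum(library_counts.values()) + pseudocount * n_pams
    total_after = sum(cleaved_counts.values()) + pseudocount * n_pams
    scores = {}
    for pam, before in library_counts.items():
        after = cleaved_counts.get(pam, 0)
        share_before = (before + pseudocount) / total_before
        share_after = (after + pseudocount) / total_after
        scores[pam] = math.log2(share_after / share_before)
    return scores


# Invented counts for a four-member library (illustration only).
library = {"AGG": 1000, "CGG": 1000, "AAG": 1000, "ATT": 1000}
after_cleavage = {"AGG": 50, "CGG": 60, "AAG": 600, "ATT": 990}

scores = pam_depletion(library, after_cleavage)
ranked = sorted(scores, key=scores.get)  # most depleted first
print(ranked)
```

Because the scores compare shares of the read pool, PAMs that escape cleavage come out with positive values even though their absolute counts did not rise; normalizing to a nontargeted control, as some of the methods above do, is an alternative.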

PAM recognition by a nuclease is a biophysical process that should remain the same whether operating in vitro, in TXTL, in bacteria, or in human cells. However, each of the available PAM determination methods has distinct properties that can yield differences in the elucidated PAMs. For example, binding appears to be more promiscuous than cleavage⁸⁹, while DNA cleavage does not necessarily yield a detectable edit. Separately, higher concentrations of nuclease–gRNA complexes can boost the recognition of less-preferred PAMs, as shown by varying the concentration as part of an in vitro DNA cleavage assay²⁴. While the consensus PAM is not expected to change, less-preferred PAMs can be given greater weight or be present or absent depending on the selected method. As general guidance, we recommend noting the method used to elucidate a given PAM as well as the selected conditions. When relying on the elucidated PAM for a given application, the PAM would be more reliable if the determination method closely parallels the application (e.g., methods based on target binding for applications in gene regulation).

There are also different approaches to convey the output of high-throughput PAM determination methods that trade off simplicity and information content. A consensus sequence or motif (e.g., NGG for SpyCas9) represents the simplest approach, which facilitates the search for potential target sites. However, relying on a single motif often leaves out less-preferred sequences. Sequence logos convey nucleotide bias within a given position, capturing some bases that would not be present in a consensus motif. However, extracting individual sequences and their extent of recognition as PAMs is difficult because a logo summarizes each position independently rather than listing complete sequences. Finally, PAM wheels capture the full diversity of sequences and their relative recognition as PAMs, although extracting a single-consensus sequence or motif is more challenging with PAM wheels than with sequence logos⁴⁶. These different means of conveying PAMs are discussed in detail in prior reviews⁵¹. Nevertheless, the method of conveying a PAM preference is important to mention here as it can impact how we understand the nuclease’s targeting requirements and therefore their application in downstream technologies.
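The trade-off between a consensus motif and per-position frequencies (the information a sequence logo displays) can be made concrete with a short Python sketch. The PAM list, the recognition weights, and the 20% inclusion threshold are all invented for illustration, and the helper names are hypothetical.

```python
from collections import Counter

# IUPAC degenerate codes and the reverse lookup from base set to code.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT", "S": "CG", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
CODE_FOR = {frozenset(bases): code for code, bases in IUPAC.items()}


def position_frequencies(pams, weights=None):
    """Weighted base frequencies at each PAM position."""
    weights = weights or [1.0] * len(pams)
    table = []
    for i in range(len(pams[0])):
        tally = Counter()
        for pam, weight in zip(pams, weights):
            tally[pam[i]] += weight
        total = sum(tally.values())
        table.append({base: tally[base] / total for base in "ACGT"})
    return table


def consensus(table, min_freq=0.2):
    """Collapse a frequency table into an IUPAC motif.

    A base is kept at a position only if its frequency reaches min_freq
    (an arbitrary threshold; it must not exceed 0.25 so that at least
    one base always qualifies).
    """
    return "".join(
        CODE_FOR[frozenset(b for b, f in column.items() if f >= min_freq)]
        for column in table
    )


# Invented data: four strongly recognized NGG PAMs plus a weak NAG PAM.
pams = ["AGG", "CGG", "GGG", "TGG", "AAG"]
weights = [1.0, 1.0, 1.0, 1.0, 0.2]

table = position_frequencies(pams, weights)
print(consensus(table))        # the single-motif summary
print(round(table[1]["A"], 3)) # the weak A at position 2 survives in the table
```

In this toy case the consensus collapses to NGG, while the frequency table still records the small contribution of A at the second position. Neither representation reveals that the A occurred specifically in AAG, which is the kind of sequence-level detail a PAM wheel retains.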

A growing need for flexible targeting with Cas nucleases

The need for relaxed PAM requirements did not immediately emerge from the first use of CRISPR technologies; instead, the need developed as the technologies advanced and expanded. The first CRISPR technology was used to introduce insertions or deletions (indels) through nonhomologous end joining that was intended to disrupt the functional expression of a gene^52,53. Disruptive indels could be introduced in many locations within a gene, placing few restrictions on potential targets. However, rules governing on-target activity or the propensity for off-targeting eliminate certain potential targets from consideration⁵⁴. Separately, dual nucleases have been used in different applications such as dual nicking with reduced off-target effects⁵⁵, where targeting activity is intimately dependent on the orientation and spacing of the two DNA targets. Some technologies are even more restrictive by requiring that a specific location be targeted, such as when introducing defined edits via homologous recombination or prime editing⁵⁶, activating gene expression in bacteria⁵⁷, or detecting single-nucleotide polymorphisms as part of in vitro diagnostics³. A poignant example involves base editors, which rely on a DNA-modifying domain that acts on a specific stretch of the target. The positioning of this editing window is principally determined by the PAM; possessing flexibility in the PAM is absolutely crucial given that the window can be as small as one or two nucleotides⁵⁸ or has to be precisely positioned to avoid editing adjacent bases. Therefore, there has been a more recent yet concerted push to expand recognized PAMs to accommodate the growing suite of CRISPR technologies. The push has come in two general forms: mining the natural world for new orthologs of Cas nucleases and employing protein engineering to alter PAM recognition by well-characterized nucleases.

Mining natural Cas orthologs for altered PAM recognition

Early efforts to co-opt Cas nucleases as technologies gave little consideration for the PAM, although these efforts hinted at the natural diversity of PAM recognition (Supplementary Data 1). In those days, multiple Cas9 nucleases from model bacteria were being characterized, with an eye toward harnessing these nucleases for some level of editing in different cellular contexts or finding active variants that could be packaged into viral delivery vectors^20,21,59. The Cas9 from the human pathogen Streptococcus pyogenes (SpyCas9) immediately jumped to the forefront, in part because of its simple NGG PAM (N = any base). At the same time, the characterization of other Cas9 nucleases revealed entirely distinct consensus PAMs. These other Cas9 nucleases included one from the CRISPR1 locus (Sth1Cas9) in the model lactic-acid bacterium Streptococcus thermophilus recognizing an NNATAAW (W = A, T) consensus PAM. The Cas9 from the pathogen Staphylococcus aureus (SauCas9), initially lauded for being shorter than SpyCas9 by 315 amino acids, recognizes an NNGRRT consensus PAM (R = A, G). The Cas9 from pathogen Neisseria meningitidis (NmeCas9) reflected a larger extreme with an NNNNGATT consensus PAM. These few examples hinted at the natural diversity of Cas9 nucleases.

The consensus motifs of the original Cas9 nucleases were primarily derived from analyzing phage sequences targeted by CRISPR spacers⁶⁰, which are skewed toward sequences recognized through adaptation rather than interference. In contrast, measuring DNA target binding or cleavage has offered a direct readout of PAM preferences (Box 2). Related efforts have revealed more flexibility than that of a simple consensus. For instance, the first high-throughput screen for PAMs recognized by SpyCas9 based on plasmid clearance in Escherichia coli identified NAG as a PAM, albeit with weaker recognition than NGG^22,23. Subsequent work from multiple groups has shown that SpyCas9 can also weakly recognize NGA, NNGG, and a selection of other sequences^{21,22,23,24,25}, reflecting a general preference for purines as well as some flexibility in the PAM gap—the distance between the target and first, defined base. While recognition can come from excess nuclease concentrations that can be readily avoided²⁴, many of these sequences were identified and validated under setups reflecting practical applications of CRISPR technologies, such as plasmid clearance in bacteria, DNA binding for gene regulation, or indel formation in mammalian cells^22,23,24,25. High-throughput screening indicated that virtually all of the originally characterized Cas9 nucleases also recognize less-preferred PAMs^{20,21,22,23,25}, representing a common theme for CRISPR nucleases. These studies underscore that PAMs are not solely a consensus sequence or a motif and instead represent a landscape of sequences with different extents of recognition. Furthermore, these studies have led to less-preferred PAMs being factored into off-target predictions^61,62,63 and serving as a starting point for boosting recognition of less-preferred sequences as part of PAM engineering.

Beyond deeper characterization of a handful of Cas9 nucleases, efforts shifted to exploring the full diversity of Cas9 nucleases found in the natural world (Fig. 2a). To date, over 900 distinct Cas9 homologs have been identified in sequenced genomes and metagenomes⁶⁴, and more homologs likely await discovery with further sequencing efforts. Exploring this expanded set has yielded a wide assortment of Cas9 nucleases with varying PAM profiles, protein sizes, and optimal activity temperatures. One approach to prioritize within this massive set has been screening phylogenetically diverse Cas9 orthologs to identify the ones with unique PAM preferences. As one tour-de-force, Gasiunas et al.²⁶ screened over 70 Cas9 orthologs taken from ten distinct clades they identified. These extensive efforts uncovered an assortment of PAM profiles, including variants recognizing C-rich (RspCas9), T-rich (Cca1/PspCas9), and A-rich (OrhCas9) PAMs. Separately, amino-acid identity analyses comparing the PIDs of SpyCas9 and other Streptococci Cas9 nucleases led to the identification of new orthologs with divergent PAM preferences^27,28. Most notably, these efforts identified the Streptococcus canis Cas9 (ScCas9), which shares extensive homology to SpyCas9 outside of the PID, but recognizes an NNG PAM with a slight preference for an A at the second position²⁷. This PAM profile represents one of the most relaxed profiles observed so far in nature. Furthermore, focusing on the two arginine residues in SpyCas9 that directly contact the PAM led to the identification of the Cas9 from Streptococcus macacae (SmacCas9) that has glutamines at the corresponding residues²⁸. Chatterjee et al.²⁸ hypothesized and experimentally demonstrated that SmacCas9 recognizes a consensus NAA PAM. When taken all together, the complete set of characterized Cas9 nucleases already covers ~65% of possible sequences when the consensus PAMs are aligned (Supplementary Fig. 1).
However, the complete set covers ~92% of possible sequences if the PAMs can fall anywhere within a four-base window (Supplementary Fig. 1). PAM diversity thus has represented a common theme as other Cas9 nucleases beyond SpyCas9 have been characterized (Supplementary Data 1)^{6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50}, potentially reflecting strong yet changing selective pressures on DNA-targeting requirements of CRISPR–Cas systems.
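The idea of measuring how much of sequence space a set of consensus PAMs covers can be sketched as follows. This minimal Python example handles only the aligned case (every motif anchored at the first PAM position) and uses just three consensus motifs named in the text, so it does not reproduce the ~65% or ~92% figures, which draw on the full set in Supplementary Data 1. The function names are hypothetical.

```python
from itertools import product

# IUPAC degenerate nucleotide codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT", "S": "CG", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(motif):
    """Enumerate every concrete sequence matched by a degenerate motif."""
    return {"".join(bases) for bases in product(*(IUPAC[c] for c in motif))}


def coverage(motifs, length):
    """Fraction of all 4**length sequences matched by at least one motif.

    Assumption: motifs are aligned at the first position, and motifs
    shorter than `length` are padded on the right with N.
    """
    covered = set()
    for motif in motifs:
        if len(motif) > length:
            raise ValueError(f"motif {motif!r} is longer than {length}")
        covered |= expand(motif.ljust(length, "N"))
    return len(covered) / 4 ** length


# Consensus PAMs mentioned above: SpyCas9 (NGG), ScCas9 (NNG), SmacCas9 (NAA).
print(coverage(["NGG"], 3))                # 4 of 64 three-base sequences
print(coverage(["NGG", "NNG", "NAA"], 3))  # NGG is a subset of NNG here
```

Taking the union of expanded sequences rather than summing motif sizes avoids double counting overlapping motifs, as with NGG and NNG in this example.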

**Fig. 2: Phylogenetic relationship of PAM-characterized Cas9 and Cas12a nucleases found in nature.**

Despite the original and ongoing interest in Cas9, nature boasts an abundance of other CRISPR-associated single-effector nucleases that are still being discovered and harnessed as CRISPR technologies. Many of these nucleases even offer unique properties that open applications otherwise currently unavailable to Cas9. For example, Cas12a from Type V-A CRISPR–Cas systems generates a 5′ overhang as part of DNA cleavage instead of the blunt ends left by Cas9, processes its own gRNA from transcribed CRISPR arrays whereas Cas9 requires accessory factors, and elicits collateral cleavage of single-stranded DNA upon target recognition that is not observed with Cas9⁶⁵. Cas12a was first reported only five years ago^29,30 but quickly became the most characterized class of nucleases second only to Cas9 (Supplementary Data 1). These efforts revealed that most of the characterized Cas12a nucleases recognize a T-rich PAM (Fig. 2b, Supplementary Data 1), e.g., TTTV (V = A, C, G) for the Acidaminococcus sp. Cas12a (AsCas12a)^29,31. Less-preferred sequences have been shown to partially deviate from the consensus motif, such as AsCas12a accommodating a G at some positions³². However, a few Cas12a nucleases have emerged as outliers. The Cas12a from Helcococcus kunzii (HkCas12a) recognizes two adjacent C’s at the second and third PAM positions in addition to the standard PAM, resulting in a consensus of YYV^33,66. Separately, the Cas12a from Prevotella ihumii (PiCas12a) exhibited the unique ability to recognize not only the TYV PAM but also guanine at the second, third, and/or fourth positions of the PAM (e.g., TTGC and GGCC)³³. While representing only two examples, the PAM profiles for HkCas12a and PiCas12a suggest that further ortholog mining of Cas12a has the potential to identify additional and highly diverse PAMs.

Other subtypes of single-effector nucleases from Type V systems are still being discovered and hold potential for expanding PAM recognition⁶⁴. To date, a handful of other Type V effectors have undergone PAM characterization (Supplementary Data 1), including Cas12b^34,35,36, Cas12c³⁷, Cas12d (formerly known as CasY)^38,39, Cas12e (formerly known as CasX)³⁹, Cas12f (previously known as Cas14 or from the subtype V-U3)⁴⁰, Cas12j (or Cas12Φ) that forms the smallest known ribonucleoprotein complex⁴¹, and a Cas12k associated with a Tn7-like transposon⁴². More Type V subtypes have been recently discovered that remain to be characterized experimentally, leaving the potential for the discovery of new PAM recognition mechanisms as well as other CRISPR-based functions and technological breakthroughs.

Multi-subunit effector complexes from Type I and III systems have also been shown to exhibit properties distinct from any known single-effector nuclease⁶⁴. For the abundant and phylogenetically diverse Type I systems, the Cascade complex responsible for target DNA binding generally recognizes flexible two- or three-base PAMs⁵¹, although PAMs have been determined for only a small number of these systems. Their ability to unidirectionally degrade DNA was recently exploited for extensive deletions in human cells⁶⁷. In contrast, systems that search for RNA targets (i.e., from Type III and VI) do not recognize PAMs and instead evaluate the extent of complementarity between the flanking portions of the gRNA and target (see Box 1)^68,69. The Cas13 single effectors from Type VI systems have been further exploited for programmable gene silencing equivalent to RNA interference⁶⁹. The discovery and characterization of these nucleases expanded our understanding of PAM requirements, and it provides a foundation on which to obtain PAM-free nucleases for other CRISPR-based applications.

Efforts to delve into each subtype have operated under an overarching assumption: only phylogenetically distinct nucleases can recognize distinct PAM profiles. However, observations from the growing collection of characterized nucleases have begun to challenge this assumption. One important observation is that PAM profiles do not fully track with nuclease phylogeny (Fig. 2). Instead, recognized PAMs vary widely, even between closely related homologs. Besides the previously discussed Streptococci Cas9’s, Edraki et al.⁴³ identified related Cas9 orthologs in N. meningitidis strains with high sequence similarity everywhere, except for the PAM-interacting domain (PID). They found that representative members from different PID-aligned clusters recognized variations of the standard NNNNGATT consensus PAM for NmeCas9, including NNNNCAA, NNNNCAAA, and NNNNCCA. Separately, our group made similarly striking observations when investigating PiCas12a and the Cas12a from Prevotella disiens (PdCas12a)³³. The two share >95% amino-acid identity (including 96% shared identity in the PID) yet recognize distinct PAM profiles, with PdCas12a recognizing a more traditional TTYV consensus PAM. Mutating a subset of residues within the REC and WED domains in PiCas12a to match those in PdCas12a steered the PAM profile into a new territory, resulting in better recognition of G-containing PAMs than either parent nuclease. These insights establish the importance of comparing PID identity in Cas9’s and PI, REC1, and WED identity in Cas12a’s when mining orthologs in search of PAM diversity (Box 1). The insights also suggest that PAM recognition and other properties such as nuclease activity or gRNA binding could be under different selective pressures in nature. Overall, the known diversity of Cas nucleases supports PAM recognition as a flexible feature that can be altered with few mutations.
This flexibility has been instrumental to the second means of obtaining Cas nucleases with more relaxed PAM recognition: protein engineering.

Applying protein engineering to alter PAM recognition

In contrast to ortholog mining, protein engineering has proven to be a powerful means to alter and broaden PAM recognition starting from individual CRISPR nucleases. Protein engineering offers the means to steer proteins that evolved under biological pressures toward more technology-relevant applications, such as for genome editing or diagnostic detection. However, protein engineering poses multiple challenges. Each residue could be replaced with one of the 19 other amino acids, resulting in an astronomical number of combinations to screen for large portions of the protein. Individual mutations can also impact not just one but many properties of the protein, and mutations can impact these properties when introduced individually or in combinations⁷⁰, requiring extensive downstream characterization. Accordingly, a range of approaches has been associated with protein engineering for altering PAM recognition, including random mutagenesis, structure-guided design, and chimera generation. We specifically focus on altering PAM recognition (Supplementary Data 2)^{23,28,31,33,43,44,45,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85}, although similar approaches have been applied to alter cleavage efficiency and the propensity for off-targeting⁸⁶.

Initial efforts to alter PAM recognition began with SpyCas9, owing in part to its early adoption, robust activity, simple PAM, and the extensive knowledge base built around this nuclease. Kleinstiver et al.²³ reported the first alteration of PAM recognition using SpyCas9 by combining random mutagenesis of the PID with a growth-based selection and subsequent counterselection. This approach yielded variants that shifted the consensus from NGG to NGA (VQR variant), NGAG (EQR variant), or NGCG (VRER variant)²³. Furthermore, combining the most frequent mutations yielded the VRQR variant recognizing an NGA consensus PAM⁷³. As most of these motifs were at least partially recognized by the WT SpyCas9, the end result was reshaping rather than recreating the PAM profile. The researchers also isolated a variant (D1135E) that exhibited reduced recognition of the less-preferred PAMs NGA and NAG, although a separate study showed that this variant still recognized other less-preferred PAMs like NNGG²⁵.

The next set of engineering efforts aimed to broaden PAM recognition with a less-stringent motif, using the consensus NGG as a starting point. Hu et al.⁷⁴ used a directed evolution approach called phage-assisted continuous evolution (PACE) to identify one variant dubbed xCas9(3.7) (or more simply xCas9)⁸⁷. The researchers demonstrated that xCas9 could recognize NG with some preferences at the third PAM position along with GAW, CAA, and some NNG sequences. In addition, xCas9 was observed to exhibit reduced cleavage activity and less off-targeting^74,88,89, paralleling some high-fidelity Cas9 nucleases that exhibit similar reductions. Correspondingly, the majority of the mutations in xCas9 were located within the REC domain, which is commonly mutated in high-fidelity Cas9 nucleases and undergoes a target-induced conformational change thought to precede DNA cleavage by the HNH and RuvC endonuclease domains⁹⁰. Despite the reduced cleavage activity and dependence on the identity of the third PAM position, xCas9 represented a major advance in increasing PAM flexibility. As immediate competition to xCas9, Nishimasu et al. applied structure-guided design and mutant screening to develop their own relaxed variant of SpyCas9 called SpCas9-NG. They first mutated a key arginine (R1335) that directly contacts the second G in the NGG PAM, and they screened for mutations that introduce base-independent interactions to compensate for the lost PAM interaction. The resulting variant recognizes an NG consensus PAM⁷⁵, with weaker recognition of NA PAMs. The resulting variant possessed seven mutations solely in the PID, one of which (E1219F) was also mutated in xCas9 (E1219V) (Fig. 3). Head-to-head comparisons between xCas9 and SpCas9-NG showed that the latter could more readily recognize sequences within the NG motif and exhibited greater indel formation and base editing in human cells⁷⁵.

**Fig. 3: Mutations in the PAM-engineered variants of SpyCas9.**

Although xCas9 and SpCas9-NG effectively required only a single G for the PAM, the most recent efforts further relaxed this requirement. Walton et al.⁷⁶ set out to evolve a PAM-free SpyCas9 through further structure-guided mutagenesis of the VRQR variant. They began by sequentially screening mutations to key residues that impact PAM recognition. By screening an extensive list of mutant combinations, the researchers obtained two new variants: SpG and SpRY. SpG recognizes a consensus NG PAM and was shown to outperform xCas9 for all NGNN sequences and SpCas9-NG for two distinct NGNN sequences⁷⁶. The SpRY variant recognizes a consensus NR (R = A or G) PAM, with less-preferred recognition of an NY (Y = C or T) PAM, thus demonstrating the most relaxed PAM preference to-date. As a result of the relaxed PAM preferences, the SpRY variant, in particular, demonstrated a higher tendency for off-targeting compared to SpyCas9, albeit based on a limited dataset⁷⁶. However, high-fidelity mutations reduced off-target activity, as has been observed with other PAM-engineered variants^76,77,78. Collectively, these variants represent the greatest progress to-date on engineering SpyCas9’s PAM preference, giving the sense that a PAM-free SpyCas9 (a version that could recognize any sequence as a PAM) is almost within reach.

Aside from directing SpyCas9’s PAM preference toward a less-preferred PAM or relaxing the consensus motif, recent strides have been made to engineer nonnatural PAM preferences for SpyCas9. Miller et al.⁷⁹ generated three SpyCas9 variants to attempt to guide the PAM preference toward NRRH, NRTH, and NRCH motifs (H = A, C, T), called SpCas9-NRRH, SpCas9-NRTH, and SpCas9-NRCH, respectively. Although the reported data indicate a more complicated PAM profile than the specified motifs, all three variants recognize PAM profiles that differ from the WT SpyCas9. There was also a bias for a G at the second PAM position and a clear preference for a T at the third PAM position for the NRTH variant (Supplementary Data 2). These variants were painstakingly generated using phage-assisted noncontinuous evolution, three separate PACE screens for each motif, followed by DNA shuffling and extensive characterization⁷⁹. The resulting variants contained mutations primarily in the PID but also in the REC and HNH domains. Perhaps for this reason, all three variants exhibited reduced off-targeting compared to SpyCas9⁷⁹. In total, these three variants represent the first attempt to engineer SpyCas9 to recognize novel PAM profiles rather than a more relaxed consensus PAM.
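The consensus motifs quoted above use IUPAC degenerate codes (N, R, Y, H, V, W). As a minimal sketch of how such a motif can be checked against the bases flanking a target, the following Python snippet expands each code into its allowed bases. The function name `matches_pam` and the example flank are illustrative assumptions, and a consensus motif is only an approximation of a nuclease's full PAM profile.

```python
# IUPAC nucleotide codes needed for the PAM motifs discussed in the text.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def matches_pam(flank: str, motif: str) -> bool:
    """Return True if the first bases of `flank` fit the degenerate `motif`.

    `flank` is the sequence immediately 3' of the protospacer (Cas9-style PAM).
    """
    flank = flank.upper()
    motif = motif.upper()
    if len(flank) < len(motif):
        return False
    return all(base in IUPAC[code] for base, code in zip(flank, motif))


if __name__ == "__main__":
    # Hypothetical flanking sequence, tested against consensus motifs from the text.
    for motif in ("NGG", "NGA", "NRRH", "NRTH", "NRCH"):
        print(motif, matches_pam("AGTC", motif))
```

A position-by-position lookup is used here instead of a regular expression so that the mapping from each IUPAC code to its bases stays explicit.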

In addition to the ongoing efforts to alter PAM recognition by SpyCas9, similar engineering approaches are also being applied to other Cas9 orthologs. For example, Hirano et al.⁴⁴ relied on a crystal structure of the large Francisella novicida Cas9 (FnCas9) to relax PAM recognition from the consensus NGG to YG (Y = C, T). Separately, two groups^80,81 relaxed PAM recognition by SauCas9 through random mutagenesis of the PID or structure-guided mutations (Supplementary Data 2). Finally, splicing divergent portions of the PID between otherwise similar homologs has allowed a distinct means of PAM engineering through the creation of protein chimeras. Chatterjee et al.⁷⁷ compared the PID of ScCas9 to closely related orthologs, resulting in the identification of a lysine residue from Streptococcus gordonii and a positively charged loop from Streptococcus anginosus predicted to enhance nonspecific interactions with DNA. Splicing these two features into ScCas9 yielded Sc++, which recognized an NNG consensus PAM with little dependence on the surrounding bases. In a separate study from the same group, the PID of the Cas9 from Streptococcus macacae (SmacCas9), which was predicted to recognize an NAA PAM, was spliced into SpyCas9. The resulting chimera, dubbed SpyMac, recognized NAA despite otherwise resembling SpyCas9²⁸. Using a similar approach, Ma et al.⁷⁸ created chimeric versions of SauCas9 (cCas9) by replacing its PID with those from different related Cas9 homologs. These variants generally exhibited relaxed recognition at some PAM positions, although recognition became more stringent at other positions. Similarly, other groups^43,45 have made chimeras from closely related orthologs from Neisseria and Geobacillus to swap PAM profiles (Supplementary Data 2). The natural diversity of PAM preferences can therefore be exploited to meld engineering approaches and create variants that recognize new profiles.
In total, the engineered SpyCas9 variants collectively cover ~56% of all possible sequences when the consensus PAMs are aligned and ~94% if the consensus PAMs can fall anywhere within a window of four bases. Furthermore, incorporating the NmeCas9 chimera recognizing an NNNNCC PAM raises this percentage for the four-base window to ~97%, covering the vast majority of potential sequences if some flexibility in the target location is acceptable (Supplementary Fig. 1).
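The kind of coverage estimate quoted here can be illustrated with a short enumeration. The sketch below counts the fraction of all four-base sequences matched by at least one consensus motif, either with every motif aligned to the first position or with each motif allowed to start at any offset inside the window. The motif set, the function names, and this particular definition of "aligned" versus "window" are assumptions for illustration; the snippet is not the calculation behind Supplementary Fig. 1 and is not expected to reproduce the ~56%, ~94%, or ~97% figures.

```python
from itertools import product

# IUPAC nucleotide codes needed for the motifs below.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "H": "ACT", "N": "ACGT",
}


def matches_at(seq: str, motif: str, offset: int) -> bool:
    """True if `motif` fits `seq` starting at `offset` (the motif must fit entirely)."""
    window = seq[offset:offset + len(motif)]
    if len(window) != len(motif):
        return False
    return all(base in IUPAC[code] for base, code in zip(window, motif))


def is_covered(seq: str, motifs, sliding: bool) -> bool:
    """True if any motif matches `seq`, aligned at offset 0 or at any offset."""
    for motif in motifs:
        last_offset = len(seq) - len(motif) if sliding else 0
        if any(matches_at(seq, motif, o) for o in range(last_offset + 1)):
            return True
    return False


def coverage(motifs, width: int = 4, sliding: bool = False) -> float:
    """Fraction of all `width`-base sequences matched by at least one motif."""
    hits = sum(
        is_covered("".join(bases), motifs, sliding)
        for bases in product("ACGT", repeat=width)
    )
    return hits / 4 ** width


if __name__ == "__main__":
    # Illustrative subset of consensus PAMs named in the text (an assumption,
    # not the full set used by the authors).
    motifs = ["NGG", "NGA", "NGCG", "NG", "NRRH", "NRTH", "NRCH"]
    print("aligned:", coverage(motifs))
    print("within four-base window:", coverage(motifs, sliding=True))
```

Exhaustive enumeration is practical because a four-base window has only 256 possible sequences; longer windows would call for a counting argument instead.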

PAM engineering is expanding beyond Cas9 to other CRISPR nucleases with unique properties. To date, multiple engineering efforts have altered the PAM profile of different Cas12a variants using some of the early approaches applied to SpyCas9. For example, Gao et al.³¹ altered PAM recognition by the widely used AsCas12a. Here, the researchers leveraged a crystal structure to identify and screen mutations in and around the PID, in turn identifying two variants (AsCas12a-RR and AsCas12a-RVR) that effectively shifted the consensus PAM from TTTV to TYCV and TATV, respectively. More recent work from Kleinstiver et al.⁸² applied targeted mutagenesis to AsCas12a based on its crystal structure. They identified an enhanced variant called enAsCas12a that exhibited a more relaxed PAM profile, although the recognized PAM sequences did not conform to a single consensus motif (Supplementary Data 2). Interestingly, AsCas12a-RVR and enAsCas12a shared two out of their three mutated residues, and transferring these and other equivalent mutations to the Cas12a orthologs FnCas12a, LbCas12a (from Lachnospiraceae bacterium), and MbCas12a (from Moraxella bovoculi) resulted in similar alterations to the PAM profile^31,83,84. Finally, a recent study from Liu et al.⁸⁵ generated the first chimeric Cas12a by replacing two domains (WED-I and REC1) implicated in PAM recognition in the Cas12a ortholog MAD7 with those from the Cas12a from Thiomicrospira sp. (TsCas12a). The chimera exhibited more stringent PAM recognition, although it demonstrated the principle of creating Cas12a chimeras to alter PAM profiles. Overall, the groundwork is laid to alter PAM recognition by Cas12a and the many other recently discovered Cas nucleases distinct from Cas9.

Anticipated trade-offs with a PAM-free nuclease

The field continues taking large strides toward a truly PAM-free nuclease. Engineering efforts applied to SpyCas9 have relaxed this nuclease’s PAM profile to roughly one of two bases at a single position—or 50% of possible DNA sequences. Following closely behind are efforts to engineer other Cas9 nucleases exhibiting distinct properties (e.g., smaller size and higher thermostability) as well as modified Cas12a nucleases. However, with PAM-free nucleases seemingly within reach, it is worth reflecting on what is gained and what is lost—and whether any change in course is warranted.

The major upside of a PAM-free nuclease is clear: the ability to, in theory, target any sequence (Fig. 4). This flexibility would greatly simplify the selection of sites with high on-target but low off-target activity, generating predictable disruptive indels⁹¹, or placing the base-editing window directly over the target nucleotide. Any of these benefits would be further magnified when multiplexing because only one nuclease is necessary to simultaneously target any set of sequences. However, there are serious downsides worth considering (Fig. 4c). For gRNAs expressed from DNA constructs, self-targeting of this DNA would be immediate, unavoidable, and likely disastrous—highlighting the entire reason why CRISPR–Cas systems evolved PAMs (Box 1). In bacteria, self-targeting with a catalytically active nuclease would lead to the clearance of the gRNA-encoding plasmid or, for genomically integrated constructs, cell death⁹². In eukaryotes, self-targeting by a catalytically active nuclease would lead to indel formation within the guide, resulting in a modified guide sequence that can continue self-targeting until a defective gRNA is expressed. While this self-targeting strategy has been instrumental for lineage tracking⁹³, it would quickly lead to inadvertent and potentially unpredictable targeting by the resulting progression of modified guides in other CRISPR-based applications. Even when using a catalytically dead nuclease for CRISPR interference or activation^1,94, a gRNA would block its own transcription.
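The self-targeting problem can be made concrete with a small check. In a DNA construct encoding a single-guide RNA, the guide sequence is immediately followed by the scaffold, so the first bases of the scaffold act as the candidate PAM for the guide's own template. The sketch below tests those bases against a few consensus motifs. The scaffold prefix `GTTTTAGAGC` is an assumption (the start of a commonly used SpyCas9 sgRNA scaffold), the PAM-free case is modeled simply as the motif `N`, and `can_self_target` is a hypothetical helper name.

```python
# IUPAC nucleotide codes needed for the motifs below.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "N": "ACGT",
}

# Assumption: first bases of a commonly used SpyCas9 sgRNA scaffold, which sit
# directly 3' of the guide sequence in the gRNA-encoding DNA.
SCAFFOLD_START = "GTTTTAGAGC"


def matches_pam(flank: str, motif: str) -> bool:
    """Return True if the first bases of `flank` fit the degenerate `motif`."""
    if len(flank) < len(motif):
        return False
    return all(base in IUPAC[code] for base, code in zip(flank.upper(), motif.upper()))


def can_self_target(pam_motif: str, flank: str = SCAFFOLD_START) -> bool:
    """True if the bases following the guide in its own construct fit the PAM."""
    return matches_pam(flank, pam_motif)


if __name__ == "__main__":
    # "N" stands in for a nuclease with no PAM requirement.
    for motif in ("NGG", "NG", "NNG", "N"):
        print(motif, can_self_target(motif))
```

Under these assumptions, the check only reports whether the consensus motif permits self-targeting; it says nothing about how efficiently a given nuclease would act on a weakly recognized PAM.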

**Fig. 4: Implications of PAM engineering.**

As a separate downside, a nuclease with no PAM requirements would also be expected to interrogate every sequence in the genome. Such thoroughness in target scanning could present two issues: extended timescales for the nuclease to find its target and an increased propensity for off-targeting (Fig. 4b). The extended timescale would arise from the need to interrogate every possible PAM-flanked site, as evidenced by the increased lifetime of Cas9 on DNA with higher PAM densities in vitro⁹⁵. The end effect would be reduced editing efficiency, even if binding and cleavage rates match those of a standard nuclease. Separately, interrogating every possible site would give the nuclease ample opportunities to cleave potential off-target sites. Accordingly, there is some evidence that the engineered variants SpG, SpRY, and enAsCas12a recognizing relaxed PAMs exhibited increased off-targeting compared to their parent proteins^76,82. Fortunately, adding mutations that reduce mismatch tolerance could counteract this effect and even improve the frequency of on-target editing, such as was done to generate high-fidelity versions of SpRY, Sc++, and enAsCas12a^76,77,82. Even without off-target cleavage, transient occupancy of nonspecific sites across the genome could instigate genomic instability and cytotoxicity, as observed when overexpressing the catalytically dead SpyCas9 in Escherichia coli⁹⁶. Finally, introducing a disruptive mutation to the PAM is generally the most dependable means of creating a defined edit no longer recognized by the guide, particularly for single-base edits. While this strategy would no longer be applicable for PAM-free nucleases, relying on a high-fidelity version of the nuclease and disruptive mutations in the target could achieve the same outcome. Therefore, we posit that a PAM-free nuclease may not be universally applicable for every CRISPR technology and instead comes with real trade-offs that could compromise some applications.
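The growth in the number of sites a nuclease must interrogate as its PAM relaxes can be sketched by counting candidate sites in a sequence. The snippet below counts, on one strand only, the positions at which a protospacer of fixed length is followed by bases fitting a consensus motif. The 20-nt guide length, the random test sequence, the use of `N` to model a PAM-free nuclease, and the function name `count_candidate_sites` are all illustrative assumptions.

```python
import random

# IUPAC nucleotide codes needed for the motifs below.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "N": "ACGT",
}


def count_candidate_sites(seq: str, motif: str, guide_len: int = 20) -> int:
    """Count positions where a `guide_len` protospacer is followed by a motif match.

    Only the given strand is scanned; the reverse complement is ignored here.
    """
    seq = seq.upper()
    count = 0
    for start in range(len(seq) - guide_len - len(motif) + 1):
        flank = seq[start + guide_len:start + guide_len + len(motif)]
        if all(base in IUPAC[code] for base, code in zip(flank, motif)):
            count += 1
    return count


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the toy sequence is reproducible
    toy_sequence = "".join(rng.choice("ACGT") for _ in range(10_000))
    for motif in ("NGG", "NG", "NR", "N"):
        print(motif, count_candidate_sites(toy_sequence, motif))
```

On a uniformly random sequence the counts scale roughly with the fraction of sequences each motif admits, which is why the motif `N` yields a candidate site at essentially every position.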

Future perspectives and outlook

Given the potential drawbacks of a PAM-free nuclease, how should the field proceed? First, while engineered SpyCas9 nucleases are almost PAM-free, the many other Cas9, Cas12a, and remaining Cas nucleases have ample room for relaxing PAM recognition before approaching PAM-free status. To accelerate developments with these nucleases, a combination of ortholog mining and PAM engineering offers a fruitful and expedient path, such as that followed to create Sc++⁷⁷. Within ortholog mining, the set of characterized Cas9 nucleases indicates that exceptional PAM diversity exists within nature and remains to be fully uncovered. Future work could delve into established but poorly characterized CRISPR–Cas types, such as Type I and V systems, that have been recently repurposed for different CRISPR technologies^42,67,97,98,99. Doing so could present convenient starting points for further engineering, such as Type V–C nucleases that recognize PAMs with as little as a single base³⁷. Solving the structure of nucleases naturally recognizing only one nucleotide could reveal distinct modes of PAM recognition that could motivate future structure-guided engineering of these nucleases. The generation of chimeras from two similar homologs also highlights the benefit of splicing existing nucleases, although structure-based approaches such as SCHEMA could more effectively guide splicing of nuclease domains and mediate the large-scale screening of chimeras¹⁰⁰. Incorporating screening approaches that alternate between protein stability and function could also open regions of sequence space otherwise considered inaccessible through single-point mutations¹⁰¹. Finally, through these combined efforts, we envision the accrued datasets laying the foundation for computer-aided design of nucleases with defined PAM profiles, whether through molecular modeling or machine learning.

Cas nucleases exhibit a wide range of properties beyond PAM recognition important for different applications. These properties include size, protein folding, gRNA recognition and processing, binding and cleavage rates, propensity for off-targeting, temperature dependence, host immune response^102,103, and performance in different cellular contexts. Current efforts to alter the PAM profile have endeavored not only to determine the full profile through a range of high-throughput techniques⁵¹ but also to investigate on-target efficiency and off-targeting. However, these evaluations have not always been fully conducted, and the other properties of CRISPR nucleases are often neglected. The field of directed evolution has a common phrase¹⁰⁴: you get what you screen for. In the case of Cas nucleases, neglecting the various properties as part of any screen can allow these properties to stray and likely become less optimal. For example, the generation of the engineered variants SpCas9-NRRH, SpCas9-NRTH, and SpCas9-NRCH relied on binding activity by a catalytically dead Cas9, and the early round variants exhibited reduced or abolished cleavage activity⁷⁹. In the future, incorporating assays for these other properties could become a benchmark for introducing engineered nucleases, and these assays could eventually be incorporated into high-throughput screens that become part of the testing pipeline. In turn, future engineering efforts could alter the entire length of the nuclease, generating versions bearing little resemblance to their natural counterparts.

As a final point, we put forward an alternative that the field could pursue besides PAM-free nucleases: a nuclease repertoire (Fig. 4b). Here, each nuclease retains recognition of a defined PAM, whether the PAM is a single base (e.g., NG) or a series of bases (e.g., NAAA). A collection of these nucleases could be explicitly assembled to cover all possible sequences, thereby achieving a collective PAM-free status. Because each nuclease would retain PAM recognition, it would avoid some of the drawbacks discussed above when PAMs are no longer a requirement. A researcher would then select from this repertoire based on the desired target, using the flanking sequence to determine which nuclease should be employed. This design approach would represent the converse of the current practice in which the target is selected based on the available nuclease. Clearly, some researchers are thinking along these lines, based on claims of the percentage of possible sequences covered by a set of engineered variants^74,75,76,79. However, achieving a true repertoire would require a different approach. For one, it would require settling on the right balance between PAM specificity and repertoire size. For another, it would require prioritizing efforts to complement existing nucleases and ensuring that, aside from PAM recognition, the nucleases behave as similarly as possible. For example, further engineering efforts could focus on the few C/T-containing sequences not extensively covered by engineered SpyCas9 variants (Supplementary Fig. 1), such as by incorporating structural insights from Cas9 nucleases recognizing T-rich PAMs or the NmeCas9 chimeras that recognize an NNNNCC PAM (Supplementary Data 1 and 2). For multiplexing applications, priorities could be centered around creating variants that recognize not only different PAMs but also different gRNA scaffolds^46,47. 
Expressing multiple nucleases would be challenging for many applications, although efforts to express domains as split proteins or relying on alternative splicing could reduce the DNA footprint of the resulting constructs. Overall, developing the nuclease repertoire could be even more within reach and bring us to a point where any sequence can be the target of CRISPR technologies.
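The selection step of a nuclease repertoire, in which the flanking sequence of a desired target determines which nuclease to use, can be sketched as a lookup. The snippet below maps a handful of nucleases to the consensus PAMs given for them in this review and returns those compatible with a given 3' flank, listing the most stringent PAM first. The dictionary contents are a small illustrative subset, consensus motifs simplify the real PAM profiles, and the names `REPERTOIRE`, `pam_fraction`, and `pick_nucleases` are hypothetical.

```python
# IUPAC nucleotide codes needed for the motifs below.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "N": "ACGT",
}

# Illustrative subset: consensus 3' PAMs as quoted in the text.
REPERTOIRE = {
    "SpyCas9": "NGG",
    "VQR": "NGA",
    "VRER": "NGCG",
    "SpyMac": "NAA",
    "SpCas9-NG": "NG",
    "Sc++": "NNG",
}


def matches_pam(flank: str, motif: str) -> bool:
    """Return True if the first bases of `flank` fit the degenerate `motif`."""
    if len(flank) < len(motif):
        return False
    return all(base in IUPAC[code] for base, code in zip(flank.upper(), motif))


def pam_fraction(motif: str) -> float:
    """Fraction of random sequences admitted by `motif` (smaller = more stringent)."""
    fraction = 1.0
    for code in motif:
        fraction *= len(IUPAC[code]) / 4
    return fraction


def pick_nucleases(flank: str) -> list:
    """Names of repertoire members whose PAM fits `flank`, most stringent first."""
    hits = [name for name, motif in REPERTOIRE.items() if matches_pam(flank, motif)]
    return sorted(hits, key=lambda name: pam_fraction(REPERTOIRE[name]))


if __name__ == "__main__":
    for flank in ("AGGT", "TGCG", "TAAC", "TTTT"):
        print(flank, pick_nucleases(flank))
```

Ranking by stringency is only one possible choice; a working repertoire would also weigh activity, off-targeting, and the other nuclease properties discussed above.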

References

Qi, L. S. et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173–1183 (2013).
Gilbert, L. A. et al. Genome-scale CRISPR-mediated control of gene repression and activation. Cell 159, 647–661 (2014).
Li, Y., Li, S., Wang, J. & Liu, G. CRISPR/Cas systems towards next-generation biosensing. Trends Biotechnol. 37, 730–743 (2019).
Barrangou, R. & Doudna, J. A. Applications of CRISPR technologies in research and beyond. Nat. Biotechnol. 34, 933–941 (2016).
Doench, J. G. et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9. Nat. Biotechnol. 34, 184–191 (2016).
Horvath, P. et al. Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus. J. Bacteriol. 190, 1401–1412 (2008).
Kim, E. et al. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni. Nat. Commun. 8, 14500 (2017).
Mougiakos, I. et al. Characterizing a thermostable Cas9 for bacterial genome editing and silencing. Nat. Commun. 8, 1647 (2017).
Tsui, T. K. M., Hand, T. H., Duboy, E. C. & Li, H. The impact of DNA topology and guide length on target selection by a cytosine-specific Cas9. ACS Synth. Biol. 6, 1103–1113 (2017).
Hou, Z. et al. Efficient genome engineering in human pluripotent stem cells using Cas9 from Neisseria meningitidis. Proc. Natl Acad. Sci. USA 110, 15644–15649 (2013).
Lee, C. M., Cradick, T. J. & Bao, G. The Neisseria meningitidis CRISPR-Cas9 system enables specific genome editing in mammalian cells. Mol. Ther. 24, 645–654 (2016).
Amrani, N. et al. NmeCas9 is an intrinsically high-fidelity genome-editing platform. Genome Biol. 19, 214 (2018).
Shields, R. C. et al. Repurposing the Streptococcus mutans CRISPR-Cas9 system to understand essential gene function. PLoS Pathog. 16, e1008344 (2020).
Mosterd, C. & Moineau, S. Characterization of a Type II-A CRISPR-Cas system in Streptococcus mutans. mSphere 5, e00235–20 (2020).
Schmidt, S. T., Yu, F. B., Blainey, P. C., May, A. P. & Quake, S. R. Nucleic acid cleavage with a hyperthermophilic Cas9 from an uncultured Ignavibacterium. Proc. Natl Acad. Sci. USA 116, 23100–23105 (2019).
Sapranauskas, R. et al. The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli. Nucleic Acids Res. 39, 9275–9282 (2011).
Gasiunas, G., Barrangou, R., Horvath, P. & Siksnys, V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl Acad. Sci. USA 109, E2579–E2586 (2012).
Brandt, K., Nethery, M. A., O’Flaherty, S. & Barrangou, R. Genomic characterization of Lactobacillus fermentum DSM 20052. BMC Genomics 21, 328 (2020).
Fedorova, I. et al. DNA targeting by Clostridium cellulolyticum CRISPR-Cas9 Type II-C system. Nucleic Acids Res. 48, 2026–2034 (2020).
Esvelt, K. M. et al. Orthogonal Cas9 proteins for RNA-guided gene regulation and editing. Nat. Methods 10, 1116–1121 (2013).
Ran, F. A. et al. In vivo genome editing using Staphylococcus aureus Cas9. Nature 520, 186–191 (2015).
Jiang, W., Bikard, D., Cox, D., Zhang, F. & Marraffini, L. A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 31, 233–239 (2013).
Kleinstiver, B. P. et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature 523, 481–485 (2015). This paper reports the first example of using protein engineering to alter PAM recognition by Cas9.
Karvelis, T. et al. Rapid characterization of CRISPR-Cas9 protospacer adjacent motif sequence elements. Genome Biol. 16, 253 (2015).
Collias, D. et al. A positive, growth-based PAM screen identifies noncanonical motifs recognized by the S. pyogenes Cas9. Sci. Adv. 6, eabb4054 (2020).
Gasiunas, G. et al. A catalogue of biochemically diverse CRISPR-Cas9 orthologs. Nat. Commun. 11, 5512 (2020). This paper reports the PAM preferences for 79 natural Cas9 orthologs, the largest single effort to-date to characterize Cas nucleases found in nature.
Chatterjee, P., Jakimo, N. & Jacobson, J. M. Minimal PAM specificity of a highly similar SpCas9 ortholog. Sci. Adv. 4, eaau0766 (2018). This paper reports a natural Cas9 variant, ScCas9, requiring only a single nucleotide for the PAM.
Chatterjee, P. et al. A Cas9 with PAM recognition for adenine dinucleotides. Nat. Commun. 11, 2474 (2020).
Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell 163, 759–771 (2015).
Zetsche, B. et al. A survey of genome editing activity for 16 Cas12a orthologs. Keio J. Med. https://doi.org/10.2302/kjm.2019-0009-OA (2019).
Gao, L. et al. Engineered Cpf1 variants with altered PAM specificities. Nat. Biotechnol. 35, 789–792 (2017). This paper reported the first engineered Cas12a variants with altered PAM recognition.
Jacobsen, T., Liao, C. & Beisel, C. L. The Acidaminococcus sp. Cas12a nuclease recognizes GTTV and GCTV as non-canonical PAMs. FEMS Microbiol. Lett. 366, fnz085 (2019).
Jacobsen, T. et al. Characterization of Cas12a nucleases reveals diverse PAM profiles between closely-related orthologs. Nucleic Acids Res. 48, 5624–5638 (2020).
Shmakov, S. et al. Discovery and functional characterization of diverse Class 2 CRISPR-Cas systems. Mol. Cell 60, 385–397 (2015).
Tian, Y. et al. A novel thermal Cas12b from a hot spring bacterium with high target mismatch tolerance and robust DNA cleavage efficiency. Int. J. Biol. Macromol. 147, 376–384 (2020).
Strecker, J. et al. Engineering of CRISPR-Cas12b for human genome editing. Nat. Commun. 10, 212 (2019).
Yan, W. X. et al. Functionally diverse type V CRISPR-Cas systems. Science 363, 88–91 (2019).
Harrington, L. B. et al. A scoutRNA Is required for some Type V CRISPR-Cas systems. Mol. Cell 79, 416–424.e5 (2020).
Burstein, D. et al. New CRISPR-Cas systems from uncultivated microbes. Nature 542, 237–241 (2017).
Karvelis, T. et al. PAM recognition by miniature CRISPR-Cas12f nucleases triggers programmable double-stranded DNA target cleavage. Nucleic Acids Res. 48, 5016–5023 (2020).
Pausch, P. et al. CRISPR-CasΦ from huge phages is a hypercompact genome editor. Science 369, 333–337 (2020).
Strecker, J. et al. RNA-guided DNA insertion with CRISPR-associated transposases. Science 365, 48–53 (2019).
Edraki, A. et al. A compact, high-accuracy Cas9 with a dinucleotide PAM for in vivo genome editing. Mol. Cell 73, 714–726.e4 (2019).
Hirano, H. et al. Structure and engineering of Francisella novicida Cas9. Cell 164, 950–961 (2016).
Harrington, L. B. et al. A thermostable Cas9 with increased lifetime in human plasma. Nat. Commun. 8, 1424 (2017).
Leenay, R. T. et al. Identifying and visualizing functional PAM diversity across CRISPR-Cas systems. Mol. Cell 62, 137–147 (2016).
Fonfara, I. et al. Phylogeny of Cas9 determines functional exchangeability of dual-RNA and Cas9 among orthologous type II CRISPR-Cas systems. Nucleic Acids Res. 42, 2577–2590 (2014).
Yamada, M. et al. Crystal structure of the minimal Cas9 from Campylobacter jejuni reveals the molecular diversity in the CRISPR-Cas9 systems. Mol. Cell 65, 1109–1121.e3 (2017).
Nishimasu, H. et al. Crystal structure of Staphylococcus aureus Cas9. Cell 162, 1113–1126 (2015).
Marshall, R. et al. Rapid and scalable characterization of CRISPR technologies using an E. coli cell-free transcription-translation system. Mol. Cell 69, 146–157.e3 (2018).
Leenay, R. T. & Beisel, C. L. Deciphering, communicating, and engineering the CRISPR PAM. J. Mol. Biol. 429, 177–191 (2017). This paper reviews our understanding of the PAM as well as different methods for PAM determination.
Mali, P. et al. RNA-guided human genome engineering via Cas9. Science 339, 823–826 (2013).
Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–823 (2013).
Lee, C. M., Cradick, T. J., Fine, E. J. & Bao, G. Nuclease target site selection for maximizing on-target activity and minimizing off-target effects in genome editing. Mol. Ther. 24, 475–487 (2016).
Ran, F. A. et al. Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity. Cell 154, 1380–1389 (2013).
Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019).
Vigouroux, A. & Bikard, D. CRISPR tools to control gene expression in bacteria. Microbiol. Mol. Biol. Rev. 84, e00077–19 (2020).
Kim, Y. B. et al. Increasing the genome-targeting scope and precision of base editing with engineered Cas9-cytidine deaminase fusions. Nat. Biotechnol. 35, 371–376 (2017).
Friedland, A. E. et al. Characterization of Staphylococcus aureus Cas9: a smaller Cas9 for all-in-one adeno-associated virus delivery and paired nickase applications. Genome Biol. 16, 257 (2015).
Mojica, F. J. M., Díez-Villaseñor, C., García-Martínez, J. & Almendros, C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology 155, 733–740 (2009).
Labuhn, M. et al. Refined sgRNA efficacy prediction improves large- and small-scale CRISPR-Cas9 applications. Nucleic Acids Res. 46, 1375–1385 (2018).
Bae, S., Park, J. & Kim, J.-S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics 30, 1473–1475 (2014).
Cradick, T. J., Qiu, P., Lee, C. M., Fine, E. J. & Bao, G. COSMID: a web-based tool for identifying and validating CRISPR/Cas off-target sites. Mol. Ther. Nucleic Acids 3, e214 (2014).
Makarova, K. S. et al. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol. 18, 67–83 (2020). This paper provides the most recent classification scheme for CRISPR-Cas systems.
Chen, J. S. et al. CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity. Science 360, 436–439 (2018).
Teng, F. et al. Enhanced mammalian genome editing by new Cas12a orthologs with optimized crRNA scaffolds. Genome Biol. 20, 15 (2019).
Dolan, A. E. et al. Introducing a spectrum of long-range genomic deletions in human embryonic stem cells using Type I CRISPR-Cas. Mol. Cell 74, 936–950.e5 (2019).
Elmore, J. R. et al. Bipartite recognition of target RNAs activates DNA cleavage by the type III-B CRISPR-Cas system. Genes Dev. 30, 447–459 (2016).
Abudayyeh, O. O. et al. RNA targeting with CRISPR-Cas13. Nature 550, 280–284 (2017).
Sarkisyan, K. S. et al. Local fitness landscape of the green fluorescent protein. Nature 533, 397–401 (2016).
Anders, C., Bargsten, K. & Jinek, M. Structural plasticity of PAM recognition by engineered variants of the RNA-guided endonuclease Cas9. Mol. Cell 61, 895–902 (2016).
Nishimasu, H. et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell 156, 935–949 (2014).
Kleinstiver, B. P. et al. High-fidelity CRISPR–Cas9 nucleases with no detectable genome-wide off-target effects. Nature 529, 490–495 (2016).
Hu, J. H. et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature 556, 57–63 (2018).
Nishimasu, H. et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262 (2018). This paper reports the engineered variant SpCas9-NG that recognizes a single nucleotide for the PAM.
Walton, R. T., Christie, K. A., Whittaker, M. N. & Kleinstiver, B. P. Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368, 290–296 (2020). This paper reports the engineered variants SpG and SpRY that offer the greatest flexibility in PAM recognition to-date.
Chatterjee, P. et al. An engineered ScCas9 with broad PAM range and high specificity and activity. Nat. Biotechnol. 38, 1154–1158 (2020).
Ma, D. et al. Engineer chimeric Cas9 to expand PAM recognition based on evolutionary information. Nat. Commun. 10, 560 (2019). This paper reports the generation of Cas9 chimeras for altering PAM preferences.
Miller, S. M. et al. Continuous evolution of SpCas9 variants compatible with non-G PAMs. Nat. Biotechnol. 38, 471–481 (2020). This paper reports a set of engineered SpyCas9 variants that collectively cover most possible PAM sequences, reflecting one example of a nuclease repertoire.
Kleinstiver, B. P. et al. Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition. Nat. Biotechnol. 33, 1293–1298 (2015).
Luan, B., Xu, G., Feng, M., Cong, L. & Zhou, R. Combined computational-experimental approach to explore the molecular mechanism of SaCas9 with a broadened DNA targeting range. J. Am. Chem. Soc. 141, 6545–6552 (2019).
Kleinstiver, B. P. et al. Engineered CRISPR-Cas12a variants with increased activities and improved targeting ranges for gene, epigenetic and base editing. Nat. Biotechnol. 37, 276–282 (2019).
Tóth, E. et al. Improved LbCas12a variants with altered PAM specificities further broaden the genome targeting range of Cas12a nucleases. Nucleic Acids Res. 48, 3722–3733 (2020).
Wang, L. et al. Improved CRISPR‐Cas12a‐assisted one‐pot DNA editing method enables seamless DNA editing. Biotechnol. Bioeng. 116, 1463–1474 (2019).
Liu, R. M. et al. Synthetic chimeric nucleases function for efficient genome editing. Nat. Commun. 10, 5524 (2019).
Liu, R., Liang, L., Freed, E. F. & Gill, R. T. Directed evolution of CRISPR/Cas systems for precise gene editing. Trends Biotechnol. https://doi.org/10.1016/j.tibtech.2020.07.005 (2020).
Esvelt, K. M., Carlson, J. C. & Liu, D. R. A system for the continuous directed evolution of biomolecules. Nature 472, 499–503 (2011).
Guo, M. et al. Structural insights into a high fidelity variant of SpCas9. Cell Res. 29, 183–192 (2019).
Jones, S. K., Jr. et al. Massively parallel kinetic profiling of natural and engineered CRISPR nucleases. Nat. Biotechnol. https://doi.org/10.1038/s41587-020-0646-5 (2020).
Chen, J. S. et al. Enhanced proofreading governs CRISPR-Cas9 targeting accuracy. Nature 550, 407–410 (2017).
Chakrabarti, A. M. et al. Target-specific precision of CRISPR-mediated genome editing. Mol. Cell 73, 699–713.e6 (2019).
Vercoe, R. B. et al. Cytotoxic chromosomal targeting by CRISPR/Cas systems can reshape bacterial genomes and expel or remodel pathogenicity islands. PLoS Genet. 9, e1003454 (2013).
Baron, C. S. & van Oudenaarden, A. Unravelling cellular relationships during development and regeneration using genetic lineage tracing. Nat. Rev. Mol. Cell Biol. 20, 753–765 (2019).
Bikard, D. et al. Programmable repression and activation of bacterial gene expression using an engineered CRISPR-Cas system. Nucleic Acids Res. 41, 7429–7437 (2013).
Sternberg, S. H., Redding, S., Jinek, M., Greene, E. C. & Doudna, J. A. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature 507, 62–67 (2014).
Cho, S. et al. High-Level dCas9 expression induces abnormal cell morphology in Escherichia coli. ACS Synth. Biol. 7, 1085–1094 (2018).
Vo, P. L. H. et al. CRISPR RNA-guided integrases for high-efficiency, multiplexed bacterial genome engineering. Nat. Biotechnol. https://doi.org/10.1038/s41587-020-00745-y (2020).
Klompe, S. E., Vo, P. L. H., Halpin-Healy, T. S. & Sternberg, S. H. Transposon-encoded CRISPR-Cas systems direct RNA-guided DNA integration. Nature 571, 219–225 (2019).
Hidalgo-Cantabrana, C. & Barrangou, R. Characterization and applications of Type I CRISPR-Cas systems. Biochem. Soc. Trans. 48, 15–23 (2020).
Voigt, C. A., Martinez, C., Wang, Z.-G., Mayo, S. L. & Arnold, F. H. Protein building blocks preserved by recombination. Nat. Struct. Biol. 9, 553–558 (2002).
Li, Y. et al. A diverse family of thermostable cytochrome P450s created by recombination of stabilizing fragments. Nat. Biotechnol. 25, 1051–1056 (2007).
Charlesworth, C. T. et al. Identification of preexisting adaptive immunity to Cas9 proteins in humans. Nat. Med. 25, 249–254 (2019).
Ferdosi, S. R. et al. Multifunctional CRISPR-Cas9 with engineered immunosilenced human T cell epitopes. Nat. Commun. 10, 1842 (2019).
Schmidt-Dannert, C. & Arnold, F. H. Directed evolution of industrial enzymes. Trends Biotechnol. 17, 135–136 (1999).
Jiang, F. & Doudna, J. A. CRISPR-Cas9 structures and mechanisms. Annu. Rev. Biophys. 46, 505–529 (2017).
Anders, C., Niewoehner, O., Duerst, A. & Jinek, M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature 513, 569–573 (2014).
Sun, W. et al. Structures of Neisseria meningitidis Cas9 complexes in catalytically poised and anti-CRISPR-inhibited states. Mol. Cell 76, 938–952.e5 (2019).
Hirano, S. et al. Structural basis for the promiscuous PAM recognition by Corynebacterium diphtheriae Cas9. Nat. Commun. 10, 1968 (2019).
Yamano, T. et al. Crystal structure of Cpf1 in complex with guide RNA and target DNA. Cell 165, 949–962 (2016).
Gleditzsch, D. et al. PAM identification by CRISPR-Cas effector complexes: diversified mechanisms and structures. RNA Biol. 16, 504–517 (2019).
Swarts, D. C. & Jinek, M. Cas9 versus Cas12a/Cpf1: structure-function comparisons and implications for genome editing. WIREs RNA 9, e1481 (2018).
Abudayyeh, O. O. et al. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science 353, aaf5573 (2016).
Kim, H. K. et al. High-throughput analysis of the activities of xCas9, SpCas9-NG and SpCas9 at matched and mismatched target sequences in human cells. Nat. Biomed. Eng. 4, 111–124 (2020).

Acknowledgements

We thank Benjamin Gray, Ryan Jackson, and Ryan Leenay for critical feedback. This work was supported through the National Institutes of Health (1R35GM119561 to C.L.B.).

Author information

Authors and Affiliations

Department of Chemical & Biomolecular Engineering, North Carolina State University, Raleigh, NC, 27695-7905, USA
Daphne Collias & Chase L. Beisel
Helmholtz Institute for RNA-based Infection Research (HIRI)/Helmholtz Centre for Infection Research (HZI), 97080, Würzburg, Germany
Chase L. Beisel
Medical Faculty, University of Würzburg, 97080, Würzburg, Germany
Chase L. Beisel

Authors

Daphne Collias
Chase L. Beisel

Contributions

D.C. and C.L.B. developed the concept for the review and contributed to the writing and editing of the paper.

Corresponding author

Correspondence to Chase L. Beisel.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Pranam Chatterjee and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Collias, D., Beisel, C.L. CRISPR technologies and the search for the PAM-free nuclease. Nat Commun 12, 555 (2021). https://doi.org/10.1038/s41467-020-20633-y

Received: 15 September 2020
Accepted: 03 December 2020
Published: 22 January 2021
DOI: https://doi.org/10.1038/s41467-020-20633-y
