[go: up one dir, main page]

GB2642401A - Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates - Google Patents

Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates

Info

Publication number
GB2642401A
GB2642401A GB2401559.6A GB202401559A GB2642401A GB 2642401 A GB2642401 A GB 2642401A GB 202401559 A GB202401559 A GB 202401559A GB 2642401 A GB2642401 A GB 2642401A
Authority
GB
United Kingdom
Prior art keywords
methyltransferase
amino acid
mutations
acid sequence
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
GB2401559.6A
Other versions
GB202401559D0 (en
GB2642401A8 (en
Inventor
Kazemi Masoud
Ikasalaitè Diana
Jauniakis Vykintas
Kantakevicius Pauli
Koplunaité Martyna
Meakys Rolandas
Zakrys Linas
Rokaitis Irmantas
KARPUS Laurynas
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Biomatter Designs UAB
Original Assignee
Biomatter Designs UAB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Biomatter Designs UAB filed Critical Biomatter Designs UAB
Priority to GB2401559.6A priority Critical patent/GB2642401A/en
Publication of GB202401559D0 publication Critical patent/GB202401559D0/en
Priority to PCT/EP2025/052875 priority patent/WO2025168590A2/en
Publication of GB2642401A publication Critical patent/GB2642401A/en
Publication of GB2642401A8 publication Critical patent/GB2642401A8/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/32Nucleotides having a condensed ring system containing a six-membered ring having two N-atoms in the same ring, e.g. purine nucleotides, nicotineamide-adenine dinucleotide
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1003Transferases (2.) transferring one-carbon groups (2.1)
    • C12N9/1007Methyltransferases (general) (2.1.1.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • C12N9/1077Pentosyltransferases (2.4.2)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/305Pyrimidine nucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/38Nucleosides
    • C12P19/40Nucleosides having a condensed ring system containing a six-membered ring having two nitrogen atoms in the same ring, e.g. purine nucleosides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y201/00Transferases transferring one-carbon groups (2.1)
    • C12Y201/01Methyltransferases (2.1.1)
    • C12Y201/01067Thiopurine S-methyltransferase (2.1.1.67)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y201/00Transferases transferring one-carbon groups (2.1)
    • C12Y201/01Methyltransferases (2.1.1)
    • C12Y201/011587-Methylxanthosine synthase (2.1.1.158)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y201/00Transferases transferring one-carbon groups (2.1)
    • C12Y201/01Methyltransferases (2.1.1)
    • C12Y201/01165Methyl halide transferase (2.1.1.165)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y201/00Transferases transferring one-carbon groups (2.1)
    • C12Y201/01Methyltransferases (2.1.1)
    • C12Y201/01257Methyltransferases (2.1.1) tRNA (pseudouridine54-N1)-methyltransferase (2.1.1.257)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y201/00Transferases transferring one-carbon groups (2.1)
    • C12Y201/01Methyltransferases (2.1.1)
    • C12Y201/0126Methyltransferases (2.1.1) rRNA small subunit pseudouridine methyltransferase Nep1 (2.1.1.260)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y204/00Glycosyltransferases (2.4)
    • C12Y204/02Pentosyltransferases (2.4.2)
    • C12Y204/02003Uridine phosphorylase (2.4.2.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y402/00Carbon-oxygen lyases (4.2)
    • C12Y402/01Hydro-lyases (4.2.1)
    • C12Y402/0107Pseudouridylate synthase (4.2.1.70)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y504/00Intramolecular transferases (5.4)
    • C12Y504/02Phosphotransferases (phosphomutases) (5.4.2)
    • C12Y504/02007Phosphopentomutase (5.4.2.7)

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

A method of synthesising a IN-alkyl derivative of pseudouridine comprises contacting a substrate with a methyltransferase in the presence of a cofactor to produce the I N-alkylderivative of pseudouridine. The substrate has a structure of: The IN-methyl derivative of pseudouridine has a structure of: RI is a hydroxyl group, a monphosphate group, a diphosphate group, or a triphosphate group R2 is H or a hydroxyl group. R3 is a methyl group or an ethyl group. The cofactor is a source of a methyl group or an ethyl group, The methyltransferase is configured to transferthe methyl group or ethyl group from the cofactor to the substrate. Enzymatic synthesis of I N-methyl derivatives of pseudouridine may be more efficient and produce less waste than chemical Also are methyltransferases and useful in the method; and polynucleotides, expression vectors. and microorganisms useful for producing the methyltransferases.

Description

[0001] Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates
[0002] Background
[0003] 1N-methylpseudouridine (1N-methyl-W) nucleoside and its various phosphorylated derivatives (such as 1N-methylpseudouridine-5'-monophosphate (1N-methyl-IPMP) and 1Nmethylpseudouridine-5'-triphosphate (1N-methyl-IPTP)) are used instead of uridine in mRNAbased vaccines in order to reduce immune response to the mRNA molecules and improve overall vaccine efficiency. For example, N1-methylpseudouridine is used instead of uridine in Pfizer-BioNTech's mRNA vaccine against COVID-19, BNT162b2.
[0004] The rapid adoption of mRNA-based vaccines and expansion of their application into different clinical areas creates high commercial demand for 1N-methylpseudouridine nucleoside and its derivatives.
[0005] One existing process for synthesising 1N-methylpseudouridine and its derivatives is illustrated in Fig. 1. Pseudouridine 101 is obtained from natural sources. Chemical methylation is performed to obtain 1N-methylpseudouridine 102. Chemical methylation involves protection and deprotection steps in addition to the use of a methylating agent such as methyl iodide.
[0006] 1N-methylpseudouridine monophosphate 103 and 1N-methylpseudouridine triphosphate 104 are obtainable by chemical phosphorylation of the 1N-methylpseudouridine 102.
[0007] An alternative existing process involves extracting 1N-methylpseudouridine from natural sources.
[0008] Both existing processes require multiple steps to complete, and produce unwanted waste. As the mRNA vaccine market grows, there is an increasing demand for cleaner, cheaper, and quicker ways to produce the modified nucleosides and their derivatives.
[0009] Summary
[0010] In one aspect, there is provided a method of synthesising a 1N-alkyl derivative of pseudouridine. The method comprises contacting a substrate with a methyltransferase in the presence of a cofactor to produce the 1N-alkyl derivative of pseudouridine.
[0011] The substrate has a structure of:
[0012] HO
[0013] and the 1N-methyl derivative of pseudouridine has a structure of: 0 NH 0
[0014] NH
[0015] HO
[0016] In the formulae above, R2 is a hydroxyl group, a monophosphate group, a diphosphate group, or a triphosphate group; R2 is H or a hydroxyl group; and R3 is a methyl group or an ethyl group.
[0017] The cofactor is a source of a methyl group or an ethyl group.
[0018] The methyltransferase is configured to transfer the methyl group or ethyl group from the cofactor to the substrate.
[0019] Further aspects provide methyltransferases useful in the method; polynucleotides encoding the methyltransferases; expression vectors carrying the polynucleotides; and microorganisms transduced with the expression vectors.
[0020] Still another aspect provides the use of an aqueous buffer as a reaction medium for the synthesis of the 1N-alkyl derivative of pseudouridine in accordance with the method defined above. The aqueous buffer includes Mg' at a concentration in the range 0.5 mM to 15 mM, and has a pH in the range 7.4 to 7.6.
[0021] This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Nor is the claimed subject matter limited to implementations that solve any or all of the disadvantages noted herein.
[0022] Brief Description of the Drawings
[0023] To assist understanding of embodiments of the present disclosure and to show how such embodiments may be put into effect, reference is made, by way of example only, to the accompanying drawings in which: Fig. 1 is a reaction scheme outlining a comparative process for synthesising 1Nmethylpseudouridine and phosphorylated derivatives thereof.
[0024] Fig. 2 is a bar chart showing the concentration of S-adenosylhomocysteine "SAH" detected when the assay described in Example 1 was performed using enzyme Ni in various different buffer systems.
[0025] Fig. 3 is a bar chart showing the concentration of SAH detected when the assay described in Example 1 was performed using enzyme Ni in the presence of (a) LP, (b) t-PMP, (c) IPTP, and (d) no substrate.
[0026] Figs 4A to 4C are mass chromatograms of reaction mixtures containing enzyme Ni and various different substrates, following treatment with an alkaline phosphatase, as described in Example 1. Fig. 4A is a mass chromatogram obtained when pseudouridine was used as the substrate. Fig. 4B is a mass chromatogram obtained when pseudouridine-5'-monophosphate was used as the substrate. Fig. 4C is a mass chromatogram obtained when pseudouridine-5'-triphosphate was used as the substrate.
[0027] Fig. 5 (a comparative example) shows a mass chromatogram of commercially-available 1N-methylpseudouridine standard.
[0028] Figs 6A to 6C are a series of mass chromatograms that confirm that the present method produced 1N-methylpseudouridine. The chromatograms are of a reaction mixture containing enzyme Ni and pseudouridine-5'-monophosphate. Fig. 6A is a mass chromatogram of the reaction mixture alone. Fig. 6B is a mass chromatogram of the reaction mixture spiked with 5 picomoles of 1N-methylpseudouridine standard. Fig. 6C is an overlayed view of the unspiked reaction mixture and the same reaction mixture spiked with 5 picomoles of 1N-methylpseudouridine standard.
[0029] Fig. 7 is a pairwise identity matrix of natural (N) and engineered (E) SPOUT methyltransferases. Each of these enzymes was found to be effective for methylating pseudouridine and its derivatives.
[0030] Fig. 8 is a bar chart showing concentrations of SAH detected when methylation of 2'-deoxypseudouridine was performed using enzymes Ni, E5 and E8, as described in Example 3.
[0031] Fig. 9A shows the concentration of SAH detected when ethylation of LP and LUMP was conducted using enzyme E5, as discussed in Example 4.
[0032] Fig. 9B shows the concentration of SAH detected when ethylation of IMP was conducted using enzyme E6, as discussed in Example 4.
[0033] Fig. 10 is a pairwise identity matrix for four mRNA cap guanine-N7 methyltransferases 51 to S4.
[0034] Fig. 11 is a reaction scheme outlining an example method of synthesising 1N-methyl psuedouridine from uridine.
[0035] Detailed Description
[0036] General definitions The verb to comprise' is used herein as shorthand for 'to include or to consist of'. In other words, although the verb 'to comprise' is intended to be an open term, the replacement of this term with the closed term 'to consist of' is explicitly contemplated, particularly where used in connection with chemical compositions.
[0037] The symbol "Iii" refers to pseudouridine. "IPMP" refers to pseudouridine monophosphate.
[0038] "t-PTP" refers to pseudouridine triphosphate.
[0039] HO HO OH
[0040] II
[0041] OH OH
[0042] HO HO
[0043] WMP Lu
[0044] OH r 0 HO 0 0 p
[0045] HO
[0046] 0 NH 0
[0047] OH
[0048] NH
[0049] HO
[0050] WTP
[0051] "A.)" refers to deoxypseudouridine. "clIPMP" refers to deoxypseudouridine monophosphate.
[0052] "clIPTP" refers to deoxypseudouridine triphosphate.
[0053] HO HO 1H it
[0054] HO HO dIP
[0055] OH 0 P 0
[0056] I
[0057] HO \ 0 0 p
[0058] HO
[0059] HO diPTP
[0060] "SAM" stands for S-adenosyl-L-methionine: NH2 NA^Nr
[0061] N HO
[0062] NH 0 CH3 -+ o NH2
[0063] OH
[0064] SAM
[0065] "SAH" stands for S-adenosylhomocysteine.
[0066] will OH As used herein, the expression "1N-alkyl derivative of pseudouridine" refers to a compound of Formula 2: 0 NH 0
[0067] HO
[0068] Formula 2 where: is a hydroxyl group, a monophosphate group, a diphosphate group, or a triphosphate group; R2 is H or a hydroxyl group; and R3 is a methyl group or an ethyl group.
[0069] The methods provided herein most typically produce 1N-methyl derivatives of pseudouridine, i.e. compounds of Formula 2 in which R3 is a methyl group.
[0070] "Phosphorylated" compounds are compounds in which R3 is a monophosphate group, a diphosphate group, or a triphosphate group. Monophosphates and triphosphates are typically preferred.
[0071] As used herein, the term "phosphate" encompasses monophosphates and polyphosphates unless context clearly dictates otherwise.
[0072] Mutations are described using the nomenclature of den Dunnen and Antonarakis: Hum Genet 109(1): 121-124 (also published online as https://web.archive.org/web/20221111075857/https://www.hgmd.cf.ac. uk/docs/mut nom html). For example, taking HQ ID NO: 1 as a parent sequence, "K100" means replace the K residue at position 10 with a Q residue; "C203de1" means delete residue C203; and E149 N150insN means insert an N residue between residues E149 and N150. Position numbering refers to the parent sequence. Insertions and deletions do not shift the numbering of subsequent residues.
[0073] The word "about" when used in connection with a numerical value encompasses values within ± 10% of the stated value.
[0074] As used herein, "positively charged residues" refers to histidine (H), lysine (IC), and arginine (R) residues.
[0075] Methods of synthesising 1N-alkyl derivatives of pseudouridine Provided herein is a method of synthesising a 1N-a lkyl derivative of pseudouridine. The method comprises contacting a substrate with a methyltransferase in the presence of a cofactor to produce the 1N-alkyl derivative of pseudouridine.
[0076] The method may be performed in vivo or ex vivo.
[0077] The substrate has a structure of Formula 1:
[0078] HO
[0079] Formula 1 R1 is a hydroxyl group, a monophosphate group, a diphosphate group, or a triphosphate group, and R2 is H or OH. In other words, the substrate is an optionally-phosphorylated pseudouridine or deoxypseudouridine.
[0080] Atom 04 of the substrate is bolded in the formula above. This atom may be involved in the binding of the substrate to the enzyme. More specifically, atom 04 may interact with a positively-charged residue in the active site of the methyltransferase.
[0081] The source of the substrate is not particularly limited. The substrate may be extracted from natural sources, formed in vivo, or chemically synthesised. Chemical phosphorylation of pseudouridine or deoxypseudouridine may be performed.
[0082] The substrate is monomeric, and is not part of an RNA, DNA, or the like.
[0083] The substrate may be produced by: i) contacting uridine with a uridine phosphorylase in the presence of a monophosphate ion source to obtain uracil and a-D-ribose-1-monophosphate; ii) contacting the a-D-ribose-1-phosphate with a pentomutase to obtain D-ribose-5-monophosphate; iii) contacting the D-ribose-5-monophosphate with pseudouridine monophosphate glycosidase to form pseudouridine 5'-monophosphate; and optionally iv) contacting the pseudouridine 5'-monophosphate with a phosphatase to obtain pseudouridine.
[0084] Steps i) to iii) and optionally iv) may be performed in vivo, particularly in implementations where the synthesis of the 1N-alkyl derivative of pseudouridine is performed in vivo.
[0085] The 1N-alkyl derivative of pseudouridine produced by the method has a structure of Formula 2: 0 NH 0
[0086] HO
[0087] Formula 2 Contacting the substrate with the methyltransferase does not modify Ft' or R2. R3 is a methyl group or an ethyl group, depending upon the cofactor chosen.
[0088] The nature of the cofactor is not particularly limited, provided that the cofactor acts as a source of the methyl group or ethyl group, and the methyltransferase is capable of transferring the methyl group or ethyl group from the cofactor to the substrate to produce the 1N-alkyl derivative of pseudouridine.
[0089] The methyltransferase is configured to transfer the methyl group or ethyl group from the cofactor to the substrate. Numerous examples of suitable methyltransferases are provided 15 hereinbelow.
[0090] As will be demonstrated in the Examples, the primary structures of methyltransferases vary widely. By way of illustration, Fig. 6 shows that, even with a specific family of methyltransferases (in this example, SPOUT methyltransferases) methyltransferases which are effective for transferring methyl groups to substrates of Formula 1 can share less than 20 % sequence identity.
[0091] Methyltransferases which methylate pseudouridine bases of RNA may be suitable, and can be identified by searching publicly-available databases. The activity of such methyltransferases may be improved by introducing one or more mutations, e.g. to allow improved binding of the substrate via a stacking interaction and/or to provide an active site having neutral or positive charge to stabilise the deprotonated form of the substrate.
[0092] Methyltransferases useful in the present method will typically have one, and preferably both, of the following properties: i) an active site which includes a positively-charged residue positioned to interact with atom 04 of the pseudouridine base, the positively-charged residue being selected from lysine and arginine; and/or ii) a stacking amino acid residue selected from a histidine residue, a tyrosine residue, and a tryptophan residue, arranged proximal to the active site and capable of forming a stacking interaction with the pseudouridine base.
[0093] In implementations where R1 is a monophosphate group, a diphosphate group, or a triphosphate group, the methyltransferase may also usefully include: iii) a salt bridging amino acid residue selected from lysine and arginine, the salt bridging amino acid residue being arranged proximal to the active site and capable of forming a salt bridge with the monophosphate group, diphosphate group, or triphosphate group.
[0094] Optionally, the cofactor is S-adenosyl-L-methionine or S-adenosyl-Lethionine.
[0095] Most typically, the methods provided herein produce a 1N-methyl derivative of pseudouridine (i.e., R3 is a methyl group). The cofactor is most typically S-adenosyl-Lmethionine.
[0096] Optionally, 113 is selected from a hydroxyl group, a monophosphate group, and a triphosphate group.
[0097] When 111 is a hydroxyl group or a monophosphate group, the method may further comprise, after the contacting, phosphorylating the 1N-alkyl derivative of pseudouridine The phosphorylation may be chemical phosphorylation.
[0098] R2 may be a hydroxyl group. In other words, the substrate may be an optionallyphosphorylated pseudouridine.
[0099] Alternatively, R2 may be H. In other words, the substrate may be an optionally-phosphorylated deoxypseudouridine.
[0100] The method may further comprise producing the methyltransferase by fermentation of a microorganism configured to express the methyltransferase.
[0101] The contacting may be performed in the presence of the microorganism. In such implementations, the substrate may be contacted with the methyltransferase inside or outside the microorganism.
[0102] The synthesis of the 1N-alkyl derivative of pseudouridine may take place in vivo. In such implementations, the 1N-alkyl derivative of pseudouridine may be purified from cell media or biomass. Performing the synthesis in vivo may avoid a need to extract and purify the methyltransferase, thereby improving efficiency.
[0103] The contacting may be performed in an aqueous buffer, such as a Tris buffer.
[0104] The aqueous buffer may include Mg' at a concentration in the range 0.5 mM to 15 mM, optionally 0.7 mM to 1.3 mM, further optionally 0.9 to 1.1 mM. The Mg2+ may be provided by any suitable magnesium salt, such as MgC12. Experiments reported in Example 1, below, demonstrated that including Mg' in the buffer increased the activity of the methyltransferase.
[0105] The buffer may have a pH in the range 7.4 to 7.6. Methyltransferases used in the Examples were found to be most active at approximately pH 7.5, though other pHs can be used.
[0106] The buffer may further include: NaCI at a concentration of 50 to 350 mM, optionally 270 to 230 mM, further optionally 290 to 310 mM; and/or KCI at a concentration of 70 to 130 mM, optionally 90 to 110 mM.
[0107] The concentration of the substrate, cofactor, and methyltransferase in the buffer are not particularly limited and may be selected as appropriate. For example, the cofactor may be present at a concentration in the range 20 to 30 LIM, and the substrate may be present at a concentration in the range 80 to 120 p.M The concentration of the methyltransferase is typically in the range 0.5 to 20 p.M.
[0108] In particular, the buffer may have a pH of 7.5 ± 0.1 and include: Tris-HCI at a concentration of about 25 mM; NaCI at a concentration of about 300 mM; MgC12 at a concentration of about 1 mM; KCI at a concentration of about 100 mM; the cofactor (e.g., SAM) at a concentration of about 25 RM; and the substrate at a concentration of about 100 RM.
[0109] One aspect provides a use of the buffers discussed above as a reaction medium for the enzymatic synthesis by a methyltransferase of a 1N-alkyl derivative of pseudouridine according to Formula 2 from a substrate of Formula 1 and a cofactor selected from 5-adenosyl-L-methionine and S-adenosyl-L-ethionine.
[0110] Example Class I Methyltransferases The methyltransferase used in the method may be a Class I methyltransferase engineered to transfer the methyl group or ethyl group from the cofactor to the substrate. For example, the methyltransferase may be a cap methyltransferase, in particular an mRNA cap guanine-N7 methyltransferase.
[0111] The methyltransferase may have an active site with a neutral charge or a positive charge. In 30 particular, the methyltransferase may be an engineered mRNA cap guanine-N7 methyltransferase which includes one or more mutations to provide an active site which is more positively-charged than the active site of the wild-type mRNA cap guanine-N7 methyltransferase.
[0112] For example, the mutations may include replacing a negatively-charged glutamic acid or aspartic acid residue in the active site (most typically, a glutamic acid residue) with a neutrally-charged residue selected from asparagine (N), serine (S), cysteine (C), a la nine (A), threonine (T), methionine (M), glycine (G), valine (V), leucine (L), or isoleucine (I).
[0113] Alternatively, the mutations may include replacing one or more negatively-charged glutamic acid or aspartic acid residue in the active site (most typically, a glutamic acid residue) with a positively-charged residue selected from arginine (R), lysine (K), and histidine (H). Of these, arginine is preferred.
[0114] The mutations may alternatively or additionally include replacing one or more neutrally-charged residue (such as an F, V. or 1 residue) with a positively-charged residue selected from arginine, lysine and histidine; optionally arginine or lysine; and preferably arginine.
[0115] Providing positive charge in the active site has been found to enhance activity. Particularly preferably, the positive charge is positioned to interact with atom 04 of the substrate.
[0116] Excessive positive charge may however reduce the stability of the enzyme. It is therefore preferable for the active site to have a net positive charge of one or two. Further positively-charged residues may optionally be present away from the active site, for instance to provide a salt bridge to the substrate, provided that acceptable stability is maintained.
[0117] Examples of engineered mRNA cap guanine-N7 methyltransferases modified to have positively charged active sites are methyltransferases of SEQ ID NO: 5 having one or more mutations selected from: i) E2120, E212K, E212R, E212N, E212S, E212C, E212A, E212T, E212M, E212H, E212G, E212V, E212L, and E2121; optionally E212K or E212R, and preferably E212R; H) F201R or F201K; Hi) I267R or 1267K; iv) Y271R or Y271K.
[0118] Typically, the mutations introduce at least one but no more than two positively charged residues. Any one of the mutations at position 212 that do not introduce a positive charge (E2120, E212N, E2125, E212C, E212A, E212T, E212M, E212G, E212V, E212L, and E2121) may be combined with one or two mutations selected from groups ii) to iv).
[0119] Further examples of engineered mRNA cap guanine-N7 methyltransferases modified to have positively charged active sites are methyltransferases of SEQ ID NO: 6 having one or more mutations selected from: i) E2260, E226K, E226R, E226N, E2265, E226C, E226A, E226T, E226M, E226H, E226G, E226V, E2261, or E2261; optionally E226K or E226R; and preferably E226R; H) F215R or F215K; iii) A286R or A286K; iv) Y290R or Y290K.
[0120] Typically, the mutations introduce at least one but no more than two positively-charged residues. Any of the mutations at position 226 that do not introduce a positive charge may be combined with one or two mutations selected from groups ii), iii), to iv).
[0121] Still further examples of engineered mRNA cap guanine-N7 methyltransferases modified to have positively charged active sites are methyltransferases of SEQ ID NO: 7 having one or more mutations selected from: i) E235Q, E235K, E235R, E235N, E2355, E235C, E235A, E235T, E235M, E235H, E235G, E235V, E2351, or E2351; optionally E235K or E235R; and preferably E226R; H) Y224R or Y224K; Hi) A299R or A299K; iv) Y304R or Y304K.
[0122] Typically, the mutations introduce at least one but no more than two positively-charged residues. Any of the mutations at position 235 that do not introduce a positive charge may be combined with one or two mutations selected from groups ii), iii), to iv).
[0123] Additional examples of engineered mRNA cap guanine-N7 methyltransferases modified to have positively charged active sites are methyltransferases of SEQ ID NO: 8 having one or more mutations selected from: i) E2100, E210K, E210R, E210N, E2105, E210C, E210A, E210T, [210M, E210H, E210G, E210V, E210L or E2101; optionally E210K or E210R; and preferably E210R; H) F199R or F199K; Hi) V265 R or V265K; iv) Y269R or Y269K.
[0124] Typically, the mutations introduce at least one but no more than two positively-charged residues. Any of the mutations at position 210 that do not introduce a positive charge may be combined with one or two mutations selected from groups ii), Hi), to iv).
[0125] Also contemplated are variants of the various engineered mRNA cap guanine-N7 methyltransferases described above, the variants having at least 80% total sequence identity to the corresponding wild-type sequence (SEQ ID NO: 5, 6, 7, or 8).
[0126] Additional mutations introduced in such variants are not particularly limited provided that the methyltransferase remains capable of transferring a methyl group from the cofactor to the substrate. The additional mutations may comprise additions, deletions, or substitutions.
[0127] Variants may include one or more conservative mutations.
[0128] Variants may include one or more mutations to improve binding of the substrate. For example, a histidine or tryptophan residue may be introduced at a position selected to provide a stacking interaction with the pseudouridine base. Alternatively or additionally, a residue such as K or R capable of forming a salt bridge to a phosphate group at position R1 of the substrate may be introduced. Alternatively or additionally, substitutions may be made to improve hydrogen bonding to the ribose or deoxyribose group of the substrate.
[0129] Although the above methyltransferases have been described in the context of the method of synthesising a 1N-alkyl derivative of pseudouridine, it should be appreciated that the methyltransferases as such represent an independent aspect of the present disclosure.
[0130] Related aspects provide an isolated polynucleotide having a sequence encoding a methyltransferase as defined herein; an expression vector carrying the polynucleotide; and a microorganism transduced with the expression vector. The microorganism may be E. coll.
[0131] Example Class IV (SPOUT) methyltransferases The methyltransferase used in the method may be a dimeric SPOUT methyltransferase comprising a SPOUT domain configured to transfer the methyl group from the cofactor to the substrate. As will be demonstrated in the Examples, various SPOUT methyltransferases have been found to be useful for methylating pseudouridine derivatives of Formula 1.
[0132] The methyltransferase may be a minimalist dimeric SPOUT methyltransferase, free of any domains other than the SPOUT domain. Such a methyltransferase may optionally include a short terminal extension, e.g. an N-terminal extension, including no more than 50 amino acid residues. Preferably, the methyltransferase may consist of the SPOUT domain.
[0133] Typically, the SPOUT domain includes an amino acid residue which forms a stacking interaction with the substrate when the SPOUT domain binds to the substrate. This residue is referred to herein as a "stacking residue". It has been found that wild-type SPOUT methyltransferases that provide a stacking interaction (e.g., proteins Ni and N2 as discussed in the Examples) are useful in the methods described herein. SPOUT methyltransferases N3 and N4 that did not provide the stacking interaction had activities below the lower limit of detection. However, modifying these methyltransferases to provide the stacking interaction resulted in enzymes with good activity.
[0134] The amino acid residue which forms the stacking interaction may be a histidine residue, a tyrosine residue, or tryptophan residue, located proximal to the catalytically-active residues of the protein. In this context, "proximal" describes the relationship between the stacking amino acid residue and the catalytically-active residues in the quaternary structure of the dimeric SPOUT methyltransferase. The stacking and catalytically-active resides are proximal when the stacking interaction between the stacking residue and the substrate facilitates catalysis of the alkylation reaction by the active residues.
[0135] The mechanism of action of methyltransferases is well-characterized in the literature (see Krishnamohan and Jackman, Biochemistry. 2019 February 05; 58(5): 336-345, in particular Fig. 3). The active site residues and nearby residues can be identified using routine techniques.
[0136] Alternatively or additionally, when group R1 of the substrate is a monophosphate group, a diphosphate group, or a triphosphate group, the SPOUT domain may include an amino acid residue which forms a salt bridge with a phosphate group of the substrate when the SPOUT domain binds to the substrate. The amino acid residue which forms the salt bridge may be a lysine or arginine residue.
[0137] The methyltransferase may be a tRNA (pseudouridine54-N1)-methyltransferase. Two examples of tRNA (pseudouridine54-N1)-methyltransferases were investigated, and both displayed a useful level of activity. It was further uncovered that activity could be improved by introducing various mutations.
[0138] One example of a useful tRNA (pseudouridine54-N1)-methyltransferase has an amino acid sequence of SEQ ID NO: 1. Variants of this methyltransferase which include at least one mutation and have at least 70% sequence identity to SEQ ID NO: 1 are also contemplated.
[0139] The amino acid sequence may include mutations K100" K75N, and either ICOD or K2ON. In such implementations, the amino acid sequence may further include one or more mutations selected from K74D, R81L, and K89N. Variants of SEQ ID NO: 1 containing these combinations of mutations were found to have improved activity.
[0140] Variants of SEQ ID NO: 1 may alternatively or additionally include one or more mutations selected from D385, C65A, I66L, Q107E, N145E, E149K, 1155L, R176K, 1193V, K199R, K200R, R201E, C203F.
[0141] Variants of SEQ ID NO: 1 may alternatively or additionally include one or more mutations selected from D104Y, R115K, N121D, V1231, M138K, N139D, 1155L, R167K, R176K, K200R, R201N, C203de1, E204de1,1205de1.
[0142] Variants of SEQ ID NO: 1 may alternatively or additionally include one or more mutations selected from: L32V, D385, Q59E, C65A, 571N, ElOON, 0101K, K103E, D104E, N106T, Q107E, R115S, R116K, L117K, N121D, V1231, L127K, E128K, N132K, M138K, N139D, V1431, D147K, E149 N150insN, N150 P1SlinsE, P151 V152insN, 1154V, 1155L, R167K, D170E, 1(173G, R176K, 1177V, N187D, 1193V, K200R, R201G, C203de1, E204de1, and 1205de1.
[0143] Variants of SEQ ID NO: 1 may alternatively or additionally include one or more mutations selected from K10Q, P23D, S77K, and D79N.
[0144] Other examples of useful mutations to SEQ ID NO: 1 include P23K, S77D, and F41N.
[0145] Specific examples of useful methyltransferases include the following: i) SEQ ID NO: 1 with mutations 1(10Q, K20D, and K75N (i.e., Enzyme El as set out in the Examples, having a sequence of SEQ ID NO: 9); ii)SEQ ID NO: 1 with mutations K100,1(20N,1(75N, and R81L; Hi) SEQ ID NO: 1 with mutations K100, K20D, K75N, K74D, and K89N; iv) SEQ ID NO: 1 with mutations D38S, C65A, I66L, Q107E, N145E, E149K,11551_, R176K, I193V, K199R, K200R, R201E, and C203F (Enzyme ES, SEQ ID NO: 15); v) SEQ ID NO: 1 with mutations K100, P23D, S77K, and D79N (i.e., Enzyme E2, SEQ ID NO: 10); vi) SEQ ID NO: 1 with mutations D104Y, 12115K, N121D, V1231, M138K, N139D, 1155L, R167K, R176K, K200R, R201N, C203de1, E204de1,and1205del (Enzyme E12, SEQ ID NO: 13); vii) SEQ ID NO: 1 with mutations L32V, D38S, 059E, C65A, 571N, ElOON, 0101K, K103E, D104E, N106T, Q107E, R1155, R116K, L117K, N121D, V1231, L127K, E128K, N132K, M138K, N139D, V1431, D147K, E149_N150insN, N150_P151insE, P151 V152insN, I154V, I155L, R167K, D170E, K173G, R176K, I177V, N187D, 1193V, K200R, R201G, C203de1, E204de1, and 1205del (Enzyme E7, SEQ ID NO: 14).
[0146] Another example of a useful tRNA (pseudouridine54-N1)-methyltransferase has an amino acid sequence of SEQ ID NO: 2. Variants of this methyltransferase which include at least one mutation and have at least 70 % sequence identity to SEQ ID NO: 2 are also contemplated.
[0147] The mutation(s) present in the variants are not particularly limited provided that the methyltransferase remains capable of transferring a methyl group from the cofactor to the substrate. The additional mutations may comprise additions, deletions, or substitutions. For instance, variants may include one or more conservative mutations.
[0148] The methyltransferase may alternatively be an rRNA small subunit pseudouridine methyltransferase Nep1 engineered to methylate the substrate.
[0149] The engineered rRNA small subunit pseudouridine methyltransferase Nep1 may comprise a mutation compared to a wild-type Nep1, the mutation introducing: i) a stacking amino acid residue selected from a histidine residue, a tyrosine residue, and a tryptophan residue, the stacking amino acid residue being arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate; and/or ii) a salt bridging amino acid residue configured to form a salt bridge to a phosphate group of the substrate, the salt bridging amino acid being selected from a lysine residue and an arginine residue, and the salt bridging amino acid being arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate.
[0150] Mutations i) and/or ii) may improve substrate binding and may increase activity of the 30 methyltransferase.
[0151] Alternatively or additionally, the engineered rRNA small subunit pseudouridine methyltransferase Nep1 may comprise iii) a mutation to the active site of the enzyme which replaces a negatively-charged amino acid residue (D or E) with a neutrally-charged amino acid residue selected from N, 5, C, A, T, M, G, V, L, and I or a positively-charged amino acid residue selected from R, K and H. The alkylation reaction proceeds via an intermediate in which H is removed from the Ni position of the pseudouridine base, and removing a negative charge or preferably replacing the negative charge with a positive charge at the active site may therefore improve enzyme activity.
[0152] Preferably, at least one but no more than two positively-charged residues are introduced at the active site.
[0153] The rRNA small subunit pseudouridine methyltransferase Nep1 may have an amino acid sequence having 70 to 99.6 % sequence identity to SEQ ID NO:3. The amino acid sequence has less than 100% identity to SEQ ID NO:3, and in other words includes at least one mutation compared to SEQ ID NO: 3.
[0154] The at least one mutation to SEQ ID NO: 3 may include V2_A27de1. This mutation corresponds to the omission from the N-terminus a strand of 26 amino acids. Omitting this strand improved the level of expression of the methyltransferase by a producer cell, as well as improving solubility.
[0155] The at least one mutation to SEQ ID NO: 3 may comprise at least one of K1060, R129N, and S233W.
[0156] The at least one mutation to HQ ID NO: 3 may comprise 5233W and optionally one or more of L97A, D101A, K106Q, R129N and R1325. The at least one mutation may comprise S233W, V2 A27del, and optionally one or more of L97A, D101A, K106Q, R129N and R132S. The at least one mutation may comprise 5223W; one or more of L97A, D101A, and R1325; and optionally V2_A27de1.
[0157] Further variants may include additional mutations, e.g. conservative mutations, provided that the methyltransferase remains capable of transferring a methyl group from the cofactor to the substrate.
[0158] Provided are the following methyltransferases: i) SEQ ID NO: 3 with mutations S223W, L97A, D101A, and R132S; and ii) SEQ ID NO: 3 with mutations V2_A27de1, 5223W, L97A, D101A, and R1325.
[0159] A further example of an rRNA small subunit pseudouridine methyltransferase Nep1 useful in the methods described herein is a methyltransferase having an amino acid sequence having to 99.6% sequence identity to SEQ ID NO: 4. The amino acid sequence has less than 100 % identity to SEQ ID NO: 4, and in other words includes at least one mutation.
[0160] The at least one mutation may comprise D210H, D210Y, or D210W. These mutations introduce stacking interactions with the pseudouridine base. The mutations may further comprise R107S, particularly in implementations which include D210H.
[0161] Alternatively or additionally, the at least one mutation may comprise W212A, W212H, or W212N.
[0162] The at least one mutation may comprise: a) D210H and W212H; or b) D210Y and W212N.
[0163] The mutations may further comprise 145K and optionally Y49E.
[0164] The mutations may further comprise 676A or G76N.
[0165] The mutations may comprise D210Y and 0186A.
[0166] The mutations may comprise one or more of K104N, R111Q, and K187H.
[0167] The mutations may include one or more of R1075 and Q186A.
[0168] In a more specific example, there is provided a methyltransferase having an amino acid sequence of SEQ ID NO: 4 with mutations I45K, Y49E, G76A, K104N, R107S, R1110,, Q186A, K187H, D210Y, and W212N.
[0169] Another example methyltransferase has an amino acid sequence of SEQ ID NO: 4 with mutations 145K, Y49E, G76A, K104N, R107S, R1110,, Q186A, K187H, D210W, and W212N.
[0170] Another example methyltransferase has an amino acid sequence of SEQ ID NO: 12.
[0171] Although the above methyltransferases have been described in the context of the method of synthesising a 1N-alkyl derivative of pseudouridine, it should be appreciated that the methyltransferases as such represent an independent aspect of the present disclosure.
[0172] Related aspects provide an isolated polynucleotide having a sequence encoding a methyltransferase as defined herein; an expression vector carrying the polynucleotide; and a microorganism transduced with the expression vector.
[0173] Preparing the substrate The methods provided herein may further comprise preparing the substrate from a precusor.
[0174] Pseudouridine 5'-monophosphate may be formed by contacting D-ribose-5-monophosphate with uracil and a pseudouridine monophosphate glycosidase. The pseudouridine 5'-monophosphate may be used directly as a substrate for the methyltransferase. Alternatively, the pseudouridine 5'-monophosphate may be converted to pseudouridine using a phosphatase.
[0175] The D-ribose-5-monophosphate may be formed by contacting a-D-ribose-1-monophosphate with a phosphopentomutase. The a-D-ribose-1-monophosphate may be formed by contacting uridine with a uridine phosphorylase and an inorganic phosphate.
[0176] The substrate may conveniently be prepared in vivo. Conveniently, both the preparation of the substrate and the synthesis of the 1N-alkyl derivative of pseudouridine may be performed in vivo using the same microorganism. The microorganism may be engineered to express the pseudouridine monophosphate glycosidase, phosphopentomutase, uridine phosphorylase, and the methyltransferase.
[0177] Pseudouridine-Sr -monophosphate may in particular be prepared by the method described in WO 2023/131727 Al.
[0178] Examples
[0179] Example 1. SPOUT methyltransferases Enzymes potentially capable of methylating pseudouridine substrates were identified by searching sequence databases for methyltransferase sequences similar to known methyltransferases that were shown to methylate pseudouridine bases in RNA sequence context. Structural models were predicted, and structural diversity evaluated. The sequences were then grouped into clusters. Representative members of each of the clusters were investigated by modelling interactions between the protein and the desired substrates.
[0180] Enzymes predicted to alkylate pseudouridine nucleosides or nucleotides were selected for testing.
[0181] Cloned genes of identified enzymes were synthesized (Twist Bioscience). Plasmids carrying the genes were transformed into E call using electroporation, in which electro-competent E. cull were subjected to 18,000 V/cm. A S'-methylthioadenosine/S-adenosylhomocysteine nucleosidase-deficient strain of E. call was used to avoid S-adenosylhomocysteine (SAH) degradation.
[0182] Expression was carried out in media consisting of 2% tryptone (Formedium), 1% yeast extract (Formedium), 2 % NaCI (Roth) and 100 jtg/mL ampicillin (Sigma-Aldrich) using T7 RNA polymerase/promoter system. The cells were grown at 37 °C, induced with 0.5 mM IPTG (Sigma-Aldrich) and 0.1 % L-rhamnose (Roth) at OD 600 = 0.8 and grown for 22 h at 16 °C afterwards. The proteins were expressed with 6 C-terminal histidine residues to facilitate purification. Multiple purified enzymes produced in this way were tested for their ability to methylate pseudouridine LP, pseudourdine monophosphate LUMP, and pseudouridine triphosphate IPTP.
[0183] The enzymes tested are discussed further below.
[0184] Optimised buffer conditions for enzymatic production of N1-methylpseudouridine and its derivatives were identified by performing reactions in various 25 mM Tris-HCI solutions having different pHs, NaCI concentrations, and MgC12 concentrations. Each solution included 100 mM KCI, 25 tIM S-adenosyl-L-methionine (SAM), 100 RIVI pseudouridine-5'-monophosphate or pseudouridine-5'-triphosphate as a substrate, and 0.7 to 9 RM methyltransferase in a total volume of 25 p.L (Table 1). Reaction mixtures were incubated at 37 °C for 24 h and monitored by detecting S-adenosylhomocysteine (SAH), a secondary product of methylation. SAH was detected using a commercial kit which quantifies SAH through an enzyme-coupled reaction with bioluminescent readout (MTase-GloTm Methyltransferase Assay, Promega). SAH concentrations were determined according to the calibration curve.
[0185] Buffer # pH [NaCI] / [MgC12] I mM [KCI] / [SAM] / Substrate / pM mM mM IIM 1 7.5 0 10 100 25 100 2 8.5 0 1 100 25 100 3 7.5 100 0 100 25 100 4 8.5 100 10 100 25 100 8.5 300 10 100 25 100 6 7.5 300 1 100 25 100 7 8.5 300 0 100 25 100 8 7.5 300 10 100 25 100 The table above identifies the buffer solutions used, and Fig. 2 shows the amount of SAH detected in each reaction mixture. Buffer 6, comprising 25 mM Tris-HCI at pH 7.5, 100 mM KCI, 300 mM NaCI, 1mM MgC12 gave the best performance These conditions with slight modifications were used in further experiments.
[0186] Methylation activity was tested in a reaction mixture containing 25 mM Tris-HCI (pH 7.5), 100 mM KCI, 300 mM NaCI, 1mM MgC12, 0.03 % NP-40, 25 p.M S-adenosyl-L-methionine (SAM), 1 mM of substrate LIMP or WTP) and 2.5 to 20 Li.M methyltransferase (depending on the protein used), in a total volume of 25 pl. The reaction mixture was incubated at 37 °C for 24 h, at which point the reaction was stopped.
[0187] Methylation activity was determined by measuring the amount of S-adenosylhomocysteine (SAH) present in the reaction mixture. SAH is a secondary product of methylation. SAH was detected using a commercially-available kit (MTase-GloT" Methyltransferase Assay, 15 Promega).
[0188] By way of illustration, Fig. 3 shows that SAH was detected in the above-described assay when using protein Ni as the methyltransferase with 4i, IPMP, and 11)TP as substrates. Only a trace amount of SAH was detected in the absence of any substrate.
[0189] HPLC-MS was used to confirm that the products of the reactions were 1N-methyl-41 or its phosphorylated derivatives. For the HPLC-MS (ESI) analysis, samples containing phosphorylated 4) nucleotides were first treated with alkaline phosphatase to remove the phosphate.
[0190] Two ion transitions were monitored to detect 4) in the analysed samples: 259.1 [M+H] 139.1 and 259.1 [M+H]+ 4 169.1.
[0191] Peaks with retention times in the range 3.4 to 3.629 min had the above-mentioned rniz values.
[0192] Figs. 4A to 4C are mass chromatograms of reaction mixtures prepared using protein Ni, after treatment with alkaline phosphatase. The chromatograms show ion transitions 259.1 [m+H] 4 139.1 and 259.1 [M+H] 4 169.1 with retention times of 3.4 to 3.629 min, which correspond to 1N-methylpseudouridine. Fig. 4A shows results obtained using pseudouridine as the substrate; Fig. 4B shows results obtained using pseudouridine-5'-monophosphate as a substrate; and Fig. 4C shows results obtained using pseudouridine-5'-triphosphate as a substrate. Analogous chromatograms were observed for others of the tested enzymes.
[0193] For comparison, Fig. 5 shows a mass chromatograph of a commercially-obtained standard sample of 1N-methyl-LP. As may be seen, the retention time and 258.0852 Da molar mass of the standard sample closely match those of the product of the present method. The slight difference in retention time is believed to be the result of the presence of additional components in the reaction mixture.
[0194] The identity of the produced substance was further confirmed by spiking reaction mixtures with commercially available standard and observing the area increase of target peaks for ion m/z 139.1 (ion transition 259.1 [M+H] 4 139.1) and 169.1 (ion transition 259.1 [M+H]4 4 169.1). The results of tests performed using enzyme Ni are shown in Figs. 6A to 6C, and analogous chromatograms were observed for other identified enzymes and other substrates.
[0195] Fig. 6A shows the mass chromatogram of the unspiked reaction mixture; Fig. 6B shows the mass chromatogram of the spiked reaction mixture; and Fig. 6C is an overlayed view of Figs. 6A and 6B.
[0196] The enzymes that were active with pseudouridine and/or its derivatives include both natural and engineered proteins with sequence identity levels as low as 16%. This is illustrated in Fig. 7, which is a pairwise identity matrix showing levels of sequence identity amongst 6 illustrative SPOUT methyltransferases that were found to be active using assays as described above.
[0197] In order to measure reaction yield, samples were treated with alkaline phosphatase and analysed with HPLC-MS. The amount of 1N-methyl-LP in the sample was calculated by comparing area of 1N-methyl-LP peak in the measured samples to the area of 1N-methyl-tP peak in a standard samples with known concentration. In the most efficient reaction of those performed in this study, the yields of 1N-methyl-W, 1N-methyl-t-PMP and 1N-methyl-LPTP were approximately 37 %; 36 % and 37 % respectively.
[0198] The table below shows the measured level of activity of selected enzymes. The list of enzymes is not exhaustive. At least 42 further enzymes which were active on some or all of the substrates were identified in the present study.
[0199] Protein (SEQ ID NO) Activity on Substrate 41 1411VIP IPTP Ni (SEQ ID NO: 1) +++ ++ ++ N2 (SEQ ID NO: 2) n d ++ +++ El (SEQ ID NO: 9) +++ + ++ E2 (SEQ ID NO: 10) +++ ++ ++ E3 (SEQ ID NO: 11) ++ ++ +++ E4 (SEQ ID NO: 12) +++ +++ +++ Key: N: natural enzyme E: engineered enzyme n.d: not determined +: low relative activity ++: medium relative activity +++: high relative activity Relative activity scales were different for each enzyme.
[0200] Various ones of the enzymes tested are discussed in detail below.
[0201] Protein NI Protein Ni is a tRNA (pseudouridine54-N1) methyltransferase derived from Methanocoldococcus jannoschii. It is a dimeric protein, belonging to the SPOUT superfamily of SAM-dependent methyltransferases. The dimer includes an aM knot structural domain that forms part of the SAM binding pocket.
[0202] Ni has an amino acid sequence of:
[0203] MREFIFKANKTITSSDINLKDLPGSCGRLDLLCRCVSDAFFLSHDIRRDVVFYAVLYGQPNPPVCIK
[0204] FVGSELKKVSPDERNIAIFIKKALKKFEELDEEQRKDWNQSTPGIYVRRLGFRNLVLEKLEEGKNIY
[0205] YLHMNGEDVENVDIENPVFIIGDHIGIGEEDERFLDEIKAKRISLSPLELHANHCITIIHNVLDKKRI CEI (SEQ ID NO: 1) Residues R28, D30, and R34 were identified as active site residues.
[0206] Residue H185 was identified as a substrate binding residue that forms a stacking interaction with the pseudouridine base.
[0207] The following mutations to SEQ ID NO: 1 were identified as potentially beneficial: Mutation Observations K10Q Improves activity when combined with K20D, K75N and optionally (i) R81L or (ii) K74D and K89N.
[0208] K100 alone did not result in a measurable change in activity.
[0209] It is believed that K10 pulls the substrate away from the catalytic site, and replacing K10 with another residue is therefore advantageous.
[0210] F41N Increases affinity for pseudouridine by hydrogen bonding to the ribose sugar.
[0211] S77K Increases affinity for phosphorylated pseudouridine derivatives by forming a salt bridge to the phosphate group.
[0212] 577R Increases affinity for phosphorylated pseudouridine derivatives by forming a salt bridge to the phosphate group.
[0213] 577K provides a larger improvement.
[0214] P23D Improves orientation of positive charge from position 77 but potentially reduces protein solubility.
[0215] P23K Increases affinity for phosphorylated pseudouridine derivatives by forming a salt bridge to the phosphate group.
[0216] S77D Useful in combination with P23K. May orient P231( in place.
[0217] The following mutants of protein Ni were investigated: ID Mutations Observations El K1OQ, K20D, K75N Improvement in activity compared to WT E2 Kl0Q, P23D, 577K, D79N Improvement in activity compared to WT.
[0218] More active than N2.
[0219] E9 K1OQ, K2ON, K75N, R81L Improvement in activity compared to WT El0 Kl0Q, K20D, K75N, K74D, K89N Improvement in activity compared to WT Ell KlOQ Approximately the same the activity as WT E5 D385, C65A, 166L, Q107E, N145[, Most active mutant of SEQ ID 1 out of those tested.
[0220] E149K, 1155L, R176K, I193V, K199R, K200R, R201E, C203F El2 D104Y, R115K, N121D, V1231, Second most active mutant of SEQ ID NO: 1 M138K, N139D, I155L, R167K, R176K, K200R, R201N, C203de1, E204de1,and 1205de I E7 L32V, D385, 059E, C65A, 571N, Third most active variant of SEQ ID NO: 1 ElOON, 0101K, K103E, D104E, N106T, Q107E, R1155, R116K, L117K, N121D, V1231, L127K, E128K, N132K, M138K, N139D, V143I, D147K, E149 N150insN, N150_Pl5linsE, P151_V152insN, I154V, I155L, R167K, D170E, K173G, R176K, I177V, N187D, 1193y, K200R, R2016, C203de1, E204de1, and 1205del El had the following sequence:
[0221] MREFIFKANQTITSSDINLDDLPGSCGRLDLLCRCVSDAFFLSHDIRRDVVFYAVLYGQPNPPVCI KFVGSELKNVSPDERNIAIFIKKALKKFEELDEEQRKDWNQSTPGIYVRRLGFRNLVLEKLEEGKNI
[0222] YYLFIMNGEDVENVDIENPVFIIGDHIGIGEEDERFLDEIKAKRISLSPLELHANFICITIIFINVLDKKR
[0223] ICEI (SEQ ID NO: 9) E2 had the following sequence:
[0224] MREFIFKANQTITSSDINLIOLDGSCGRLDLLCRCVSDAFFLSFIDIRRDVVFYAVLYGQPNPPVCI
[0225] KFVGSELKKVKPNERNIAIFIKKALKKFEELDEEQRKDWNQSTPGIYVRRLGFRNLVLEKLEEGKNI
[0226] YYLHMNGEDVENVDIENPVFIIGDHIGIGEEDERFLDEIKAKRISLSPLELHANHCITIIHNVLDKKR ICEI (SEQ ID NO: 10) E5 had the following sequence:
[0227] MREFIFKANKTITSSDINLKDLPGSCGRLDLLCRCVSSAFFLSHDIRRDVVFYAVLYGQPNIDPVALK
[0228] FVGSELKKVSPDERNIAIFIKKALKKFEELDEEQRKDWNESTPGIYVRRLGFRNLVLEKLEEGKNIY YLHMNGEDVEEVDIKNPVFILGDHIGIGEEDERFLDEIKAKKISLSPLELHANHCITIVHNVLDRREI FEI (SEQ ID NO: 15) E12 had the following sequence: MREFIFKANKTITSSDINLKDLPGSCGRLDLLCRCVSDAFFLSFIDIRRDVVFYAVLYGQPNPPVCIK FVGSELKKVSPDERNIAIFIKKALKKFEELDEEQRKYWNQSTPGIYVKRLGFRDLILEKLEEGKNIYY LHKDGEDVENVDIENPVFILGDHIGIGEEDEKFLDEIKAKKISLSPLELHANHCITIIFINVLDKRNI (SEQ ID NO: 13) E7 had the following sequence:
[0229] MREFIFKANKTITSSDINLKDLPGSCGRLDLVCRCVSSAFFLSHDIRRDVVEYAVLYGEPNPPVAIK FVGNELKKVSPDERNIAIFIKKALKKFEELDENKREEWTESTPGIYVSKKGERDLILEKKKEGKKIYYL
[0230] HKDGEDIENVKINENENPVFVLGDHIGIGEEDEKFLEEIGAKKVSLSPLELHADHCITIVHNVLDKR
[0231] GI (SEQ ID NO: 14) Protein N2 Protein N2 is a tRNA (pseudouridine54-N1)-methyltransferase derived from Thermococci archaeon. N2 is a dimeric protein, belonging to the SPOUT superfamily of SAM-dependent methyltransferases. The dimer includes an a/I3 knot structural domain that forms part of the SAM binding pocket.
[0232] Protein N2 has an amino acid sequence of: MVVFIVKSNTAKTKFLLKDLPGSGKRIDILCRCVNSAFCLSHDIRKDVILYLCFAGKTIKFVGKELKH LTPDERG 1AI LIRKALEGNPTPGVYVSEKSFQDTLLESGKEIIYLDERG EDISH LKLKKDMCFVLG DH LGFDREDQKILEKIGKKISISPKILHADHCIIVVHNFLDRS (SEQ ID NO: 2) Residues R26, D28, and R32 were identified as active site residues.
[0233] Residue H126 was identified as a substrate binding residue that forms a stacking interaction with the pseudouridine base.
[0234] Protein N3 Protein N3 is rRNA small subunit pseudouridine methyltransferase Nep1 from Saccharomyces cerevisiae. N3 is a dimeric protein belonging to the SPOUT superfamily of SAM-dependent methyltransferases. The dimer includes an a/I3 knot structural domain that forms part of the SAM binding pocket.
[0235] Protein N3 has an amino acid sequence of: MVEDSRVRDALKGGDQKALPASLVPQAPPVLTSKDKITKRMIVVLAMASLETHKISSNGPGGD KYVLLNCDDHQGLLKKMGRDISEARPDITHQCLLTLLDSPINKAGKLQVYIQTSRGILIEVNPTVRI PRTFKRFSGLMVOLLHKLSIRSVNSEEKLLKVIKNPITDHLPTKCRKVTL5FDAPVIRVQDYIEKLDD DESICVFVGAMARGI<DNFADEYVDEKVGLSNYPLSASVACSIKFCHGAEDAWNIL (SEQ ID NO: 3) Residues R88, D90, and R132 were identified as active site residues.
[0236] R136 -potentially forms a salt bridge with the psUMP/psUTP phosphate Removal of the 26 underlined amino acids from N-terminus was found to improve expression and solubility of the protein. This removal corresponds to a mutation of V2_A27de1.
[0237] The following mutations to HQ ID NO: 3 were identified as beneficial. Position numbering is based on wild-type, non-truncated, protein N3.
[0238] Mutation Observations 1(1060 Reduces non-productive binding of substrate.
[0239] R129N Forms a hydrogen bond to the ribose group of the substrate.
[0240] 5233W Improves substrate binding by forming a stacking interaction with the pseudouridine base.
[0241] R1325 Makes space for 5233W and helps to orient catalytic residue R88.
[0242] L97A Makes space for S233W.
[0243] D101A Makes space for S233W.
[0244] The most active mutant of SEQ ID NO: 3 which was tested, protein E3, contained the following four mutations: L97A, D101A, R1325, and 5233W. The sequence of E3 was: MVEDSRVRDALKGGDO.KALPASLVPQAPPVLTSKDKITKRMIVVLAMASLETHKISSNGPGGDK YVLLNCDDHQGLLI(KMGRDISEARPDITHQCLATLLASPINKAGIKLQVYIQTSRGILIEVNPTVRIP STFKRFSGLMVOLLHKISIRSVNSEEKLLKVIKNPITDHLPTKCRKVTLSFDAPVIRVQDYIEKLDDD ESICVFVGAMARGKDNFADEYVDEKVGLSNYPLWASVACSKFCHGAEDAWNIL (SEQ ID NO: 11) Protein N4 Protein N4 is an rRNA small subunit pseudouridine methyltransferase Nep1 from Candidatus Verstraetearchaeota archaeon. N4 is a di meric protein belonging to the SPOUT superfamily of SAM-dependent methyltransferases. The dimer includes an a/[3 knot structural domain that forms part of the SAM binding pocket.
[0245] Protein N4 has an amino acid sequence of:
[0246] MSALLNVILAESALELVPKSILDHPAVTKNAERRGKKPGDTLLDISLHYEAMKKLPNFEKRGRPDI HTTLLTILGSPANLEGLVRTYIHTINDQVVYIDPSVKI PRNYNRFVGLMEQLLKEGRVPPKGDLVL MYVKKQSLEGLLKEIKPTRTFMLSENGEKINISTLAKELCQENRPTVIIGGFQKGSFLKKHVELADK
[0247] VYAVYGSPLDTWVIASMLLHGYEIEKGII (SEQ ID NO: 4) Residues R63, D65, and R107 were identified as active site residues.
[0248] The following mutations to SEQ ID NO: 4 were identified as beneficial: Mutation Observations D210H Improves substrate affinity by forming a stacking interaction with the pseudouridine base when combined with a further mutation to provide space for this interaction (e.g., R107S and/or W212H).
[0249] D210W Improves substrate affinity by forming a stacking interaction with the pseudouridine base.
[0250] D210Y Improves substrate affinity by forming a stacking interaction with the pseudouridine base.
[0251] Provides a larger improvement in activity than D210H or D210W.
[0252] R107S Makes space for stacking from D210 position and helps to orient R63 by hydrogen bonding to D65.
[0253] W212A Makes space for stacking from D210 position.
[0254] W212H Makes space for stacking from D210 position.
[0255] Provided a greater improvement than W212A for mutants containing D210H.
[0256] W212N Makes space for stacking from D210 position.
[0257] Provided a greater improvement than W212A for mutants containing D210Y.
[0258] I45K May improve affinity for phosphorylated substrates by forming a salt bridge.
[0259] Provided little effect on activity when used alone.
[0260] A synergistic improvement in activity was observed when this mutation was combined with D210H, D210W, or D210Y.
[0261] Y49E Helps to orient I45K.
[0262] K104N May hydrogen bond to the ribose moiety of the substrate.
[0263] R111Q May reduce non-productive binding.
[0264] K187H May reduce non-productive binding.
[0265] 0186A Increases activity when combined with D210Y.
[0266] G76A/N A G76A or G76N mutation was present in all of the variants that included a mutation at position D210. However, the effect of G76A/N was not entirely clear, and this mutation is not believed to be essential.
[0267] Other possible mutations of SEQ ID NO: 4 include those selected from: L131, V132L, L133V, M134L, Y135M, V136Y, I(137V, 01391K, S1400, L141S, E142, G143L, L144E, L145G, K146L, E147L, I148K, K149E, P1501, T151K, R152P, F154R, M155T, L156F, 5157M, E158L, N1595, G160E, E161N, K162G, 1163E, N164K, 5166N, T1671, L1685, A1691, 1(170L, E171A, L172K, C173E, Q174L, E175C, N176Q, R177E, P178N, 1179R, V180P, 1181T, I182V, G183I, G1841, F1856, 0186G, K187F, G1880,, 5189H, F190G, L1915, K192F, K193L, H194K, V195K, E196H, 1197V, A198E, D199L, K200A, V201D, Y202K, A203V, V204Y, Y205A, G206V, 5207Y, P208G, L2095, D210P, T2111_, V213T, 1214A, A215V, 52161, M217A, 1218S, L219M, H2201, G2211_, Y222H, E223G, I224Y, K226I, G227E, I228K, and I229G.
[0268] Of the variants of protein N4 tested, E6 was found to have the highest activity. E6 had the following combination of mutations: K104N, R1110,, K187H, D210Y, W212N, R107S, G76A, I45K, Y49E, and Q186A. The sequence of E6 is:
[0269] MSALLNVILAESALELVPKSILDHPAVTKNAERRGKKPGDTLLDKSLHEEAMKKLPNFEKRGRPDI
[0270] IHTTLLTILASPANLEGLVRTYIHTINDQVVYIDPSVNIPSNYNQFVGLMEQLLKEGRVPPKGDLVL
[0271] MYVKKQSLEGLLKEIKPTRTFMLSENGEKINISTLAKELCQENRPTVIIGGFAHGSFLKKHVELADK
[0272] VYAVYGSPLYTNVIASMLLHGYEIEKGII (SEQ ID NO: 16) Another engineered methyltransferase with an amino acid sequence similar to that of N4 was E8, as set out hereinbelow (SEQ ID NO: 17). E8 can be expressed as SEQ ID NO: 4 with the following mutations: I45K, Y49E, G76A, K104N, R1075, R111Q, Q186A, 1(187H, D210W, and W212N.
[0273] Another example of an engineered methyltransferase based on N4 is E4. E4 had the following amino acid sequence:
[0274] MSALLNVILAESALELVPKSILDHPAVTKNAERRGKKPGDTLLDKSLHEEAMKKLPNFEKRGRPDI
[0275] IHTTLLTILASPANLEGLVRTYIHTINDQVVYIDPSVNIPSNYNQFVGLMEQLLKEGRVPPKGDLVL MYVICKQSLEGLLKEIKPTRTFMLSENGEKINISTLAKELCQENRPTVIIGGFQHGSFLKKHVELADK VYAVYGSPLH TAVIASMLLH GYEIEKGII(SEQ ID NO: 12) Compared to SEQ ID NO: 4, E4 includes the following mutations: I45K, Y49E, G76A, 1(104N, R107S, R111Q, L131, V132L, 1133V, M1341, Y135M, V136Y, K137V, Q1391<, S1400" L141S, E142, G143L, L144E, L145G, K146L, E147L, 1148K,I(149E, P1501, T1511(, R152P, F154R, M155T, L156F, S157M, E1581., N1595, G160E, E161N, K162G, 1163E, N164K, 5166N, T1671, L1685, A169T, 1(170L, E171A, L172K, C173E, Q174L, E175C, N176Q, R177E, P178N, T179R, V180P, I181T, I182V, G1831, G1841, F185G, 0186G, K187F, G188Q, S189H, F190G, L191S, 1(192F, K1931, H194K, V195K, E196H, 1197V, A198E, D199L, K200A, V201D, Y202K, A203V, V204Y, Y205A, G206V, 5207Y, P208G, L2095, D210P, T2111, W212H, V213T, 1214A, A215V, 5216I, M217A, L218S, L219M, H2201, G221L, Y222H, E223G,I224Y, K2261, G227E, I228K, and I229G.
[0276] Interestingly, E4 had a sequence similar to that of a wild-type SPOUT methyltransferase 5 derived from an unidentified microorganism believed to be in the phylum Verstraetearchaeota, having a sequence of: MSALLNVILAESALELVPKSILDHPAVTKNAERRGKKPGDTLLDISLHYEAMKKLPNFEKRGRPDI I HTTLLTILGSPANLEGLVRTYIHTINDQVVYIDPSVKI PRNYNRFVGLMEQLLKEGRVPPKGDLVL MYVKKOSLEGLLKEIKPTRTFMLSENGEKINISTLAKELCQENRPTVIIGGFQKGSFLKKHVELADK VYAVYGSPLDTWVIASMLLHGYEIEKGII (SEQ ID NO: 18) E4 has the following differences with respect to SEQ ID NO: 18: I45K, Y49E, G76A, K104N, R1075, R111Q9 K187H, V201, Y202V, A203Y, V204A, Y205V, G206Y, 5207G, P2085, L209P, D210L, T211H, W212, V213T, 1214A, A215V, 52161, M217A, L2185, L219M, H220L, 6221L, Y222H, E223, I224G, E225Y, K226E, G2271, 1228E, and I229K. Accordingly, there is provided a methyltransferase having an amino acid sequence of SEQ ID NO: 18 with at least one, and preferably all, of these mutations.
[0277] Example 2. Enzymatic production of N1-methyl pseudouridine in vivo E. coli cells were transformed by recombinant plasmids harbouring Ni, N2, E3 enzymes or catalytic Ni mutant (negative control) by electroporation. Transformants were grown overnight at 37 °C on LB-agar plates (50 jig/mL ampicillin), suspended in 20 mL liquid LB-medium (50 pig/mL ampicillin) and grown in a shaker at 37°C until OD600 reached 0.6 -0.8.
[0278] After reaching the desired 0D600, flasks were put on ice for 30 min. IPTG and L-rhamnose were added to final concentrations of 0.5 mM and 0.1 %, respectively. The flasks were shaken at 16 °C for approximately 20 h. 1.5 mL of each cell culture was collected by centrifugation, washed with 1 mL of 50 mM potassium phosphate buffer (pH 7.5) and resuspended into 100 RI_ of the same buffer that contained 5 mM of pseudouridine (II)).
[0279] Tubes containing reaction mixtures were incubated at 37 °C and 180 rpm for 3 days. The cells were separated from the buffer by centrifugation. The supernatant was mixed with acetonitrile (1:1) and analysed with HPLC-MS. The remaining cells were resuspended in 300 pi of 50 mM potassium phosphate buffer (pH 7.5) and lysed by sonication. The cell remnants were removed by centrifugation. 5 ML of FastAP were added to 50 ML of each reaction mixture, mixtures were incubated at 37 °C overnight. 50 jiL of acetonitrile was added and mixtures were centrifuged for 10 min at 4 °C at 16,000 rpm to precipitate the remaining proteins.
[0280] The reaction mixtures were analysed by HPLC-MS. N1-methyl pseudouridine concentration was determined according to the calibration curve prepared from the standard. Results are summarized in the table below.
[0281] Nil-methyl-LP yield, % W.-methyl-4),W Ni buffer 0.95 47.6 Nice!! content 0.083 4.16 N2 buffer 0.49 24.3 N2 cell content 0.01 0.48 E3 buffer 0.063 3.2 E3 cell content 0 0 Control buffer 0 0 Control cell content 0 0 The results demonstrate that N1-methyl pseudouridine can be produced in vivo using a microorganism that expresses a methyltransferase as provided herein.
[0282] Example 3. Enzymatic production of N1-methyl-2'-deoxypseudouridine and its derivatives Reaction mixtures containing 25 mM Tris-HCI (pH 7.5), 100 mM KCI, 300 mM NaCl, 1mM MgC12, 0.03 % NP-40, 25 pM S-adenosyl-L-methionine (SAM), 1 mM 2'-deoxypseudouridine substrate (dill and 7 p.M Ni methyltransferase in a total volume of 25 p.1_ were incubated at 37 °C. The reaction was stopped after 24 h and monitored by detecting 5-adenosylhomocysteine (SAH), a secondary product of methylation. SAH was detected using a commercial kit which monitors the formation of SAH through an enzyme-coupled reaction with bioluminescent readout (MTase-Glorm Methyltransferase Assay, Promega). The measured SAH concentrations are shown in Fig. 8.
[0283] The data shown in Fig. 8 demonstrate that enzymes Ni, E5, and E8 were all found to methylate c14).
[0284] In view of these results, active site similarities, structural modelling, and molecular docking data it is expected that SPOUT methyltransferases as described herein will methylate c14), dIPMP, and clIPTP.
[0285] E8 had a sequence of:
[0286] MSALLNVILAESALELVPKSILDHPAVTKNAERRGKKPGDTLLDKSLHEEAMKKLPNFEKRGRPDI
[0287] IHTTLLTILASPANLEGLVRTYIHTINDQVVYIDPSVNIPSNYNQFVGLMEQLLKEGRVPPKGDLVL
[0288] MYVKKQSLEGLLKEIKPTRTFMLSENGEKINISTLAKELCQENRPTVIIGGFAHGSFLKKHVELADK VYAVYGSPLWTNVIASMLLHGYEIEKGII (SEQ ID NO: 17) Example 4. Enzymatic production of N1-ethylpseudouridine and its derivatives Reaction mixtures containing 25 mM Tris-HCI (pH 7.5), 100 mM KCI, 300 mM NaCI, 1mM MgC12, 0.03 % NP-40, 20 p.M S-adenosyl-L-ethionine (SAE), 1 mM pseudouridine substrate (4) or IPMP) and 6-12 p.M methyltransferase (according to the protein) in a total volume of 25 RL was incubated in 37 °C. The reaction was stopped after 24 h and monitored by detecting S-adenosylhomocysteine (SAH), a secondary product of ethylation (SAH was detected using a commercial kit which monitors the formation of SAH through an enzyme-coupled reaction with bioluminescent readout (MTase-GloTm Methyltransferase Assay, Promega). SAH concentrations were determined according to the calibration curve, and are shown in Figs. 9A ([5) and 9B (E6).
[0289] Figs. 9A and 9B shows that ES and E6 respectively can ethylate pseudouridine and derivatives thereof. Based on these results, and due to active site similarities, structural modelling, and molecular docking data it is expected that other SPOUT methyltransferases as described herein will also ethylatel-P, luMP, WTP, dIP, ditIMP or clIPTP.
[0290] Example 5. Cap methyltransferases Methyltransferases that have no reported activity with pseudouridine, but whose active sites are potentially capable of methylating pseudouridine substrates were identified as follows.
[0291] Known and putative methyltransferase sequences were collected from public and private databases. Structural models were predicted and structural diversity evaluated. The sequences were then grouped into clusters. Representative members of the clusters were investigated by modelling possible interactions between the protein and the desired substrates.
[0292] Genes encoding the identified enzymes and mutants thereof were synthesized (Twist Bioscience). Plasmids carrying the genes were transformed into E. coli using an electroporation process, in which electro-competent E. coli was subjected to 18,000 V/cm. A 5i-methylthioadenosine/S-adenosylhomocysteine nucleosidase-deficient strain of E. coil was used to avoid S-adenosylhomocysteine (SAH) degradation.
[0293] Expression was carried out in media consisting of 2% tryptone (Formedium), 1% yeast extract (Formedium), 2 % NaCI (Roth) and 100 p.g/mL ampicillin (Sigma-Aldrich) using a T7 RNA polymerase/promoter system. The cells were grown at 37 °C, induced with 0.5 mM IPTG (Sigma-Aldrich) and 0.1 % L-rhamnose (Roth) at OD 600 = 0.8 and grown for 22 h at 16 °C afterwards. The proteins were expressed with 6 C-terminal histidine residues to facilitate purification.
[0294] Multiple purified enzymes produced in this way were tested for their ability to methylate natural substrate known from literature: guanosine-5T-triphosphate "GTP". For this, reaction mixtures containing 50 mM Tris-HCI (pH 7.5), 5 mM DTT, 0.03 % NP40, 50 pM SAM, 5 mM GTP and 0.3-12 pM enzyme (according to protein) were incubated at 37 °C. The reactions were stopped after 1 h and activity was assessed by detecting S-adenosylhomocystei ne (SAH), a secondary product of methylation. SAH was detected using a commercial kit (MTase-GloT" Methyltransferase Assay, Promega). The enzymes were found to be active on GTP native substrate.
[0295] Modelling has demonstrated that certain class I methyltransferases, specifically cap methyltransferases, would with small changes to their active site be capable of methylating pseudouridine and its derivatives. For example, modelling has shown that arginine residues correctly positioned in the active site will enable pseudouridine methylation. The methylation reaction involves deprotonation of the pseudouridine base at the Ni position. The positive charge of the arginine residues will promote such deprotonation and/or stabilize the deprotonated base.
[0296] Our experimental data with SPOUT methyltransferases and structural modelling of SPOUT and Cap methyltransferases shows that Cap methyltransferases will be able to methylate and ethylate LP, LUMP, LPTP, cILP, cILP MP, and diPTP.
[0297] Proteins Si to 54 as discussed below were investigated.
[0298] Fig. 10 is a pairwise sequence identity matrix of proteins Si to 54. The Fig. demonstrates that, despite all being mRNA cap guanine-N7 methyltransferases, Si to S4 have less than about 40 % sequence identity.
[0299] Protein.51 Protein Si is a Class 1 SAM dependent methyltransferase, more specifically an mRNA cap guanine-N7 methyltransferase from Nemotocido parisii having an amino acid sequence of: MEKSNNNVANHYNKIKSLGVQSREASKIIGVREANNFLKQKLIQKFIRENSVVLDLGCGKGGDLS KLKHHNIKHYYGCDIAKESLAEALKRSLTHKFKSDFLQADFINNKIIIQEKADLVMAQFSFHYAFA NENSVKKAVNNVCNNLKEGGVFILTIPDMQVITRRSARNIVD6SEGNSLYKVCPNKSFYKNELFG
[0300] RGYEFHLQEALTGCEEYLIDLNYLTSHFASKGIKKIFDIDFLSFLNHEMSADKETYSRMVRHPLTKE
[0301] ELPIIELYRAVAYKKNGS (SEQ ID NO: 5) The activity of wild-type protein Si was below the limit of detection of the assays used. However, analysis demonstrated that activity can be enhanced by introducing mutations.
[0302] The following useful mutations to SEQ ID NO: 5 have been identified: Mutation Effect E212Q, E212N, E212S, Removal of negative charge from position 212 should E212C, E212A, E212T, stabilize deprotonated pseudouridine base.
[0303] E212M, E212H, E212G, E212V, E2121_, or E2121.
[0304] E212K The introduction of a positive charge at position 212 should stabilize the deprotonated pseudouridine base more effectively than removing the negative charge. A K residue may downshift the pKa of pseudouridine base.
[0305] E212R Introducing an R residue at position 212 should be even more effective than introducing a K residue provided that the R residue is orientated correctly.
[0306] tRNA and rRNA SPOUT methyltransferases utilize catalytic arginine residues to downshift the pKa of pseudouridine base.
[0307] Y271R or Y217K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0308] F201R or F201K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0309] I267R or I267K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0310] Further mutations may be introduced, e.g. to improve the orientation of the R or K residues, thereby improving activity further.
[0311] Protein 52 Protein S2 is a Class 1 SAM dependent methyltransferase, more specifically an mRNA cap guanine-N7 methyltransferase from Catenaria anguillulae having an amino acid sequence of:
[0312] MGDSATDQSRRAAAQLVANHYNQRQSSTVDSRKDSPIFHLRAFNNWVKAVLMNRFLRSGLH
[0313] LLDIGCGKGGDLNKYNKARIAQLYAFDVASVSIDQATERYRQMHRPWFKAQFQALDCYNDSIE
[0314] PYMPRGASVGAVSMQFCAHYAFQSEKQVRIMLENVSRWLAPGGYYFGTVPDANVLVRKLRAS
[0315] PGLEYGNSIYKIRFVQKDAYPVYGHEYSFLLEDAIDDCPEYLIHWPSFVRLAAEYGLEQVEHTNFH PFYHEQADKFRDLLVRMKVVTEDRPELSMDEWEAAGLYSVFVFRKRGS (SEQ ID NO: 6) The activity of wild-type protein 52 was below the limit of detection of the assays used.
[0316] However, analysis demonstrated that activity can be enhanced by introducing mutations.
[0317] The following useful mutations to SEQ ID NO: 6 have been identified: Mutation Effect E226Q, E226N, F226S" Removal of negative charge from position 226 should E226C, E226A, E226T, stabilize deprotonated pseudouridine base.
[0318] E226M, E226H, E226G, E226V, E226L, or E2261.
[0319] E226K The introduction of a positive charge at position 226 should stabilize the deprotonated pseudouridine base more effectively than removing the negative charge. A K residue may downshift the pKa of pseudouridine base.
[0320] E226R Introducing an R residue at position 226 should be even more effective than introducing a K residue provided that the R residue is orientated correctly.
[0321] tRNA and rRNA SPOUT methyltransferases utilize catalytic arginine residues to downshift the pl(a of pseudouridine base.
[0322] Y290R or Y290K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0323] F215R or F215K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0324] A286R or A286K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0325] Further mutations may be introduced, e.g. to improve the orientation of the R or I( residues, thereby improving activity further.
[0326] Protein 53 Protein S3 is a Class 1 SAM dependent methyltransferase, more specifically a mRNA cap guanine-N7 methyltransferase from Lachance° thermotolerans having an amino acid sequence of:
[0327] MSVVNVDQIIRKHYNERTFVAKRRRRHLSPIIKLRNENNAIKYMLIDKFTFPGNVVLEMGCGKG
[0328] GDLRKYGAAGISC1FIGIDISNASIVEAQKRFSSMGNLDYQVILITGDCFGESLGVAVEPFPECRFPC DVVSAQFCLHYAFESEEKARRTLLNVTICSLKIGGYFIGTIPDSEFIRYKLNKITKDVDI(PSWGNAIY KVTFENSDYQKNNNEFTSPFGOMYTYWLEDAIDNVPEYVIPFETLRSLADEYGLELELQMPFNA FFVQEIPKWINKFSPKMQEGLQRSDGICYGVEGDEKEAASYFYTVFAFKKVICGS (SEQ ID NO: 7) The activity of wild-type protein 53 was below the limit of detection of the assays used. However, analysis demonstrated that activity can be enhanced by introducing mutations.
[0329] The following useful mutations to SEQ ID NO: 7 have been identified: Mutation Effect E235Q, E235N, E2355, Removal of negative charge from position 235 should E235C, E235A, E235T, stabilize deprotonated pseudouridine base.
[0330] E235M, E235H, E235G, E235V, E2351_, or E2351.
[0331] E235K The introduction of a positive charge at position 235 should stabilize the deprotonated pseudouridine base more effectively than removing the negative charge. A K residue may downshift the pKa of pseudouridine base.
[0332] E235R Introducing an R residue at position 235 should be even more effective than introducing a K residue provided that the R residue is orientated correctly.
[0333] tRNA and rRNA SPOUT methyltransferases utilize catalytic arginine residues to downshift the pKa of pseudouridine base.
[0334] Y304R or Y304K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0335] Y224R or Y224K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0336] A299R or A299K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0337] Further mutations may be introduced, e.g. to improve the orientation of the R or K residues, thereby improving activity further.
[0338] Protein 54 Protein S3 is a Class 1 SAM dependent methyltransferase, more specifically an mRNA cap guanine-N7 methyltransferase from Lachancea thermotolerans having an amino acid sequence of:
[0339] MEGKKEEIREHYNSIRERGRESRQRSKTINIRNANNFIKACLI RLYTKRGDSVLDLGCGKGGDLLK YERAGIGEYYGVDIAEVSINDARVRARNMKRREKVEFRAQDSYGRHMDLGKEEDVISSQESFHY AFSTSESLDIAQRNIARHLRPGGYFIMTVPSRDVILERYKQGRMSNDFYKIELEKMEDVPMESVR EYRFTLLDSVNNCIEYFVDFTRMVDGFKRLGLSLVERKGFIDFYEDEGRRNPELSKKMGLGCLTRE
[0340] ESEVVGIYEVVVERKLVPESDA (SEQ ID NO: 8) The activity of wild-type protein S4 was below the limit of detection of the assays used. However, analysis demonstrated that activity can be enhanced by introducing mutations.
[0341] The following useful mutations to SEQ ID NO: 8 have been identified: Mutation Effect E210Q, E210N, E210S, Removal of negative charge from position 210 should E210C, E210A, E210T, stabilize deprotonated pseudouridine base.
[0342] E210M, E210H, E210G, E210V, E210L, or E2101.
[0343] E210K The introduction of a positive charge at position 210 should stabilize the deprotonated pseudouridine base more effectively than removing the negative charge. A K residue may downshift the pKa of pseudouridine base.
[0344] E21OR Introducing an R residue at position 210 should be even more effective than introducing a K residue provided that the R residue is orientated correctly.
[0345] tRNA and rRNA SPOUT methyltransferases utilize catalytic arginine residues to downshift the pKa of pseudouridine base.
[0346] Y269R or Y269K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0347] F199R or F199K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0348] V265R or V265K Introducing a positive charge should stabilize the deprotonated pseudouridine base by downshifting the pKa of the pseudouridine base.
[0349] Further mutations may be introduced, e.g. to improve the orientation of the R or K residues, thereby improving activity further.
[0350] Example 6. in vivo synthesis of 1N-methylpseudoridine from a uridine feedstock The methods of synthesising 1N-alkyl derivatives of pseudouridine described herein may be implemented in vivo. As an example, Fig. 11 is a reaction scheme outlining the in vivo synthesis of 1N-methyl pseudouridine using uridine as a starting material.
[0351] In the reaction scheme of Fig 11, uracil is first cleaved into uracil and ribose-1-monophosphate using uridine phosphorylase (udp) and inorganic phosphate.
[0352] A phosphopentomutase (deoB) then isomerizes the ribose-1-monophosphate into ribose-5-monophosphate.
[0353] The ribose-5-monophosphate is then connected to uracil using pseudouridine monophosphate glycosidase (psuG), to obtain pseudouridine 5'-monophosphate.
[0354] In this example, since the reactions are done in vivo, cellular phosphatases dephosphorylate the pseudouridine monophosphate to form the corresponding nucleoside, pseudouridine.
[0355] The pseudouridine is then methylated in accordance with the method provided herein, using a suitable methyltransferases and S-adenosyl methionine (SAM) as the methyl group donor. This forms the 1N-methylpseudouridine.
[0356] In a specific implementation of the above method, E. coli BL21 (DE3) competent cells were transformed by electroporation with two recombinant plasmids. The plasmids contained (i) genes encoding enzymes for synthesising pseudouridine 5'-monophosphate synthesis from uridine (udp, deoB, and psuG), and (ii) a gene encoding an appropriate pseudouridine methyltransferase, respectively. Four such transformants having the following combinations of genes were investigated: i) pCDF-udp-deoB and pRSF-psuG-B2P3; H) pCDF-udp-deoB and pRSF-psuG-B2P23; Hi) pCDF-udp-deoB and pRSF-psuG-B5P8; and iv) pCDF-udp-deoB and pRSF-psuG-B11P9.
[0357] Transformants were grown overnight at 37°C on LB-agar plates (50 Rg/mLspectinomycin and 30 jig/mL kanamycin). Transformants were suspended in 20 mL liquid LB-medium (50 mg/mL spectinomycin and 30 jig/mL kanamycin) and grown in a shaker at 37°C until 0D600 reached 0.5-0.6. IPTG was added to a final concentration of 1 mM, and the flasks were shaken at 37 °C for 3 h. After induction was complete, 3 mL of each cell culture were collected by centrifugation, washed with 1 mL of 125 mM potassium phosphate buffer (pH 7.0), and resuspended into 100 RI_ of the same buffer that contained 5 mM uridine and 0.5 mM MnC12. Tubes containing these reaction mixtures were incubated at 37 °C and 300 rpm for 24 h. The cells were then separated from the buffer by centrifugation, resuspended in 300 RI_ of 125 mM potassium phosphate buffer (pH 7.0), and lysed by sonication. Cellular debris were removed by centrifugation. 5 [IL of FastAP were added to the tubes, that contained 50 p.L of buffer in which reactions were done, or 50 RI_ of supernatant after cell sonication. The resulting mixtures were incubated at 37 °C and 300 rpm overnight. 50 ML of acetonitrile were added to each tube, mixtures were centrifuged for 10 min at 4°C at 16 000 rpm to precipitate the remaining proteins.
[0358] The reaction mixtures were analyzed by FIPLC-MS. The 1N-methyl pseudouridine (mill)) concentration was determined according to the calibration curve prepared from the standard. N1-Methyl pseudouridine was detected in the reaction buffer where B2P3 methyltransferase was present, however no 1N-methyl pseudouridine was detected in analyzed cell content. The final yield of 1N-methyl pseudouridine was 0.162%, corresponding to a 1N-methylpseudouriding concentration of 8.1 p.M in the reaction mixture.
[0359] It will be appreciated that the above embodiments have been described by way of example only.
[0360] Other variants or use cases of the disclosed techniques may become apparent to the person skilled in the art once given the disclosure herein. The scope of the disclosure is not limited by the described embodiments but only by the accompanying claims.

Claims (85)

1. Claims 1. A method of synthesising a 1N-alkyl derivative of pseudouridine, which method comprises: contacting a substrate with a methyltransferase in the presence of a cofactor to produce the 1N-alkyl derivative of pseudouridine; wherein: the substrate has a structure of:HOthe 1N-methyl derivative of pseudouridine has a structure of: 0 NH 0NHHOR1 being a hydroxyl group, a monophosphate group, a diphosphate group, or a triphosphate group; R2 being H or a hydroxyl group; and R3 being a methyl group or an ethyl group; the cofactor is a source of a methyl group or an ethyl group; and the methyltransferase is configured to transfer the methyl group or ethyl group from the cofactor to the substrate.
2. The method according to claim 1, wherein the cofactor is S-adenosyl-L-methionine or S-adenosyl-L-ethionine.
3. The method according to claim 2, wherein Ft' is a methyl group and the cofactor is Sadenosyl-L-methionine.
4. The method according to any preceding claim, wherein Ft' is a hydroxyl group, a monophosphate group, or a triphosphate group.
5. The method according to claim 4, wherein R" is a hydroxyl group or a monophosphate group, and the method further comprises, after the contacting, phosphorylating the 1N-alkyl derivative of pseudouridine.
6. The method according to any preceding claim, wherein Ft2 is a hydroxyl group.
7. The method according to any preceding claim, further comprising preparing the substrate by contacting D-ribose-5-monophosphate with uracil and a pseudouridine monophosphate glycosidase to form pseudouridine 5T-monophosphate.
8. The method according to claim 7, further comprising contacting the pseudouridine 5'monophosphate with a phosphatase to form pseudouridine.
9. The method according to claim 7 or claim 8, further comprising forming the D-ribose5-monophosphate by contacting a-D-ribose-1-monophosphate with a phosphopentomutase.
10. The method according to claim 9, further comprising forming the D-ribose-5-monophosphate by contacting uridine with a uridine phosphorylase and an inorganic phosphate.
11. The method according to any preceding claim, further comprising producing the methyltransferase by fermentation of a microorganism configured to express the methyltransferase; optionally wherein the microorganism is further configured to express uridine phosphorylase, phosphopentomutase, and pseudouridine monophosphate glycosidase.
12. The method according to claim 11, wherein the contacting is performed in the presence of the microorganism; optionally wherein the method is performed in vivo.
13. The method according to any preceding claim, wherein the contacting is performed in an aqueous buffer, the aqueous buffer including Mg' at a concentration in the range 0.5 mM to 15 mM, optionally 0.7 mM to 1.3 mM, further optionally 0.9 to 1.1 mM.
14. The method according to claim 13, wherein the buffer has a pH in the range 7.4 to 7.6.
15. The method according to any preceding claim, wherein the methyltransferase has an active site with a neutral charge or a positive charge, optionally wherein the active site includes a lysine or arginine residue positioned to interact with atom 04 of the substrate and/or the active site has a net positive charge of one or two.
16. The method according to any preceding claim, wherein the methyltransferase is a Class 1 methyltransferase engineered to transfer the methyl group or ethyl group from the cofactor to the substrate.
17. The method according to claim 16, wherein the methyltransferase is a cap methyltransferase, optionally an mRNA cap guanine-N7 methyltransferase.
18. The method according to claim 17, wherein the methyltransferase has an amino acid sequence with at least 80 % sequence identity to HQ ID NO: 5, the amino acid sequence including at least one mutation selected from: i) F201R or F201K; ii) E212Q, E212K, E212R, E212N, E212S, E212C, E212A, E212T, E212M, E212H, E212G, E212V, E212L, or E2121; iii)1267R or 1267K; iv) Y271R or Y271K; optionally wherein the amino acid sequence includes no more than two mutations selected from i) to iv).
19. The method according to claim 17, wherein the methyltransferase has an amino acid sequence with at least 80 % sequence identity to SEQ ID NO: 6, the amino acid sequence including at least one mutation selected from: i) F215R or F215K; H) E226Q, E226K, E226R, E226N, E226S, E226C, E226A, E226T, E226M, E226H, E2266, E226V, E226L, or E2261; Hi) A286R or A286K; iv) Y290R or Y290K; optionally wherein the amino acid sequence includes no more than two mutations selected from i) to iv).
20. The method according to claim 17, wherein the methyltransferase has an amino acid sequence with at least 80 % sequence identity to SEQ ID NO: 7, the amino acid sequence including at least one mutation selected from: I) Y224R or Y224K; H) E235Q, E235K, E235R, E235N, E2355, E235C, E235A, E235T, E235M, E235H, E2356, E235V, E235L, or E2351; Hi) A299R or A299K; iv) Y304R or Y304K; optionally wherein the amino acid sequence includes no more than two mutations selected from i) to iv).
21. The method according to claim 17, wherein the methyltransferase has an amino acid sequence with at least 80 % sequence identity to SEQ ID NO: 8, the amino acid sequence including at least one mutation selected from: i) F199R or F199K; H) E210Q, E210K, E210R, E210N, E2105, E210C, E210A, E210T, E210M, E210H, E2106, E210V, E210L or E2101; iii) V265R or V265K; and iv) Y269R or Y269K; optionally wherein the amino acid sequence includes no more than two mutations selected from i) to iv).
22. The method according to any of claims 1 to 15, wherein the methyltransferase is a dimeric SPOUT methyltransferase comprising a SPOUT domain configured to transfer the methyl group from the cofactor to the substrate.
23. The method according to claim 22, wherein the methyltransferase is a minimalist dimeric SPOUT methyltransferase, free of any domains other than the SPOUT domain.
24. The method according to claim 23, wherein the methyltransferase consists of the SPOUT domain.
25. The method according to any preceding claim, wherein the methyltransferase has an active site which includes an amino acid residue which forms a stacking interaction with the substrate when the SPOUT domain binds to the substrate, optionally wherein the amino acid residue which forms the stacking interaction is arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate.
26. The method according to claim 25, wherein the amino acid residue which forms the stacking interaction is selected from a histidine residue, a tyrosine residue, and a tryptophan residue; optionally wherein the amino acid residue which forms the stacking interaction is a tyrosine residue.
27. The method according to any preceding claim, wherein W is a monophosphate group, a diphosphate group, or a triphosphate group, wherein the methyltransferase includes an amino acid residue which forms a salt bridge with a phosphate group of the substrate when the methyltransferase binds to the substrate; and optionally wherein the amino acid residue which forms the salt bridge is a lysine or arginine residue.
28. The method according to any preceding claim, wherein the methyltransferase has a neutrally-charged or positively-charged active site.
29. The method according to any of claims 22 to 28, wherein the methyltransferase is a tRNA (pseudouridine54-N1)-methyltransferase.
30. The method according to claim 29, wherein the methyltransferase is an engineered tRNA (pseudouridine54-N1)-methyltransferase which includes at least one of the following modifications compared to wild-type tRNA (pseudouridine54-N1)-methyltransferase: i) introduction of a stacking amino acid residue selected from a histidine residue, a tyrosine residue, and a tryptophan residue, the stacking amino acid residue being arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate; and/or ii) introduction of a salt bridging amino acid residue configured to form a salt bridge to a phosphate group of the substrate, the salt bridging amino acid being selected from a lysine residue and an arginine residue, and the salt bridging amino acid being arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate; and/or iii) replacement of a negatively-charged amino acid residue proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate with a neutrally-charged or positively-charged amino acid residue, preferably a positively-charged amino acid residue.
31. The method according to claim 30, wherein the methyltransferase has an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1.
32. The method according to claim 31, wherein the amino acid sequence includes mutations K100, K75N, and one of K2OD and K2ON.
33. The method according to claim 31 or claim 32, wherein the wherein the amino acid sequence includes one or more mutations selected from K74D, R81L, and K89N.
34. The method according to any of claims 31 to 33, wherein the amino acid sequence includes one or more mutations selected from: D385, C65A, I66L, 0107E, N145E, E149K,1155L, R176K, 1193V, K199R, K200R, R201E, and C203F.
35. The method according to any of claims 31 to 34, wherein the amino acid sequence includes one or more mutations selected from: D104Y, R115K, N121D, V1231, M138K, N139D, 1155L, R167K, R176K, K200R, R201N, C203de1, E204de1, and 1205de1.
36. The method according to any of claims 31 to 35, wherein the amino acid sequence includes one or more mutations selected from: L32V, D385, 059E, C65A, 571N, E100N, Q101K, K103E, D104E, N106T, Q107E, R1155, R116K, L117K, N121D, V1231, L127K, E128K, N132K, M138K, N139D, V1431, D147K, E149 N150insN, N150 P151insE, P151 V152insN, 1154V, 1155L, R167K, D170E, 1(173G, R176K, 1177V, N187D,1193V, K200R, R201G, C203de1, E204de1, and 1205de1.
37. The method according to any of claims 31 to 36, wherein the amino acid sequence includes a P23K mutation.
38. The method according to claim 37, wherein the amino acid sequence further includes an 577D mutation.
39. The method according to any of claims 31 to 38, wherein the amino acid sequence includes an F41N mutation.
40. The method according to any of claims 31 to 39, wherein the amino acid sequence includes one or more mutations selected from K100, P23D, S77K, and D79N.
41. The method of claim 31, wherein the methyltransferase has an amino acid sequence selected from: i) SEQ ID NO: 1 with mutations K100,, K20D, and K75N; ii) SEQ ID NO: 1 with mutations K100" K2ON, K75N, and R81L; iii) SEQ ID NO: 1 with mutations 1(10Q, K20D, 1(75N, K74D, and 1(89N; iv) SEQ ID NO: 1 with mutations D385, C65A,I66L, Q107E, N145E, E149K, I155L, R176K, I193V, K199R,I(200R, R201E, and C203F; v) SEQ ID NO: 1 with mutations K10Q, P23D, S77K, and D79N; vi) SEQ ID NO: 1 with mutations D104Y, R115K, N121D, V1231, M138K, N139D,1155L, R167K, R176K, K200R, R201N, C203de1, E204de1,and1205del; and vii) SEQ ID NO: 1 with mutations L32V, D385, Q59E, C65A, 571N, E100N, Q101K, K103E, D104E, N106T, Q107E, R115S, R116K, L117K, N121D, V1231, L1271<, E128K, N132K, M138K, N139D, V1431, D147K, E149_N150insN, N150_P151insE, P151_V152insN, 1154V, 1155L, R167K, D170E, K1736, R176K, 1177V, N187D, 1193V, 1(200R, R2016, C203de1, E204de1, and 1205de1.
42. The method according to claim 29 or claim 30, wherein the methyltransferase has an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 2.
43. The method according to any of claims 22 to 28, wherein the methyltransferase is an rRNA small subunit pseudouridine methyltransferase Nep1.
44. The method according to claim 43, wherein the methyltransferase is an engineered methyltransferase having: i) a stacking amino acid residue selected from a histidine residue, a tyrosine residue, and a tryptophan residue, the stacking amino acid residue being arranged proximal to one or more amino acid residues which catalyse transfer of the methyl group or ethyl group from the cofactor to the substrate; and ii) a neutrally-charged or positively charged active site for catalysing transfer of the methyl group or ethyl group from the cofactor to the substrate.
45. The method according to claim 43 or claim 44, wherein the methyltransferase has an amino acid sequence having 70 to 99.6% sequence identity to SEQ ID NO:3, the amino acid sequence including one or more mutations to SEQ ID NO:3.
46. The method according to claim 45, wherein the at least one mutation comprises V2 A27del.
47. The method according to claim 45 or claim 46, wherein the one or more mutations comprise at least one of K1060, R129N, and 5233W.
48. The method according to claim 47, wherein the one or more mutations comprise S223W and one or more of L97A, D101A, and R132S; preferably wherein the mutations are 5223W, L97A, D101A, and R1325.
49. The method according to claim 43 or claim 44, wherein the methyltransferase has an amino acid sequence having 70 to 99.6% sequence identity to SEQ ID NO: 4, the amino acid including one or more mutations.
50. The method according to claim 49, wherein the one or more mutations comprise D210H, D210Y, D210P, or D210W; and optionally one or more of: L131, V132L, L133V, M134L, Y135M, V136Y, K137V, 0139K, 51400, L141S, E142, G143L, L144E, L145G, K146L, E147L, I148K, I<149E, P1501, T151K, R152P, F154R, M155T, L156F, 5157M, E158L, N1595, G160E, E161N, K162G, 1163E, N164K, 5166N, T167I, L168S, A169T, K170L, E171A, L172K, C173E, 0174L, E175C, N176Q, R177E, P178N, T179R, VISOR, I181T, 1182V, G1831, G1841, F185G, Q186G, K187F, G188Q, S189H, F190G, L191S, K192F, K193L, H194K, V195K, E196H, L197V, A198E, D199L, K200A, V201D, Y202K, A203V, V204Y, Y205A, 6206V, 5207Y, P208G, L209S, D210P, T211L, V213T, 1214A, A215V, S2161, M217A, L218S, L219M, H220L, G221L, Y222H, E2236, 1224Y, K226I, G227E, I228K, I229G.
51. The method according to claim 50, wherein the mutations further comprise R107S.
52. The method according to claim 50 or claim 51, wherein the mutations further comprise W212A, W212H, or W212N.
53. The method according to claim 52, wherein the mutations comprise: a) D210H and W212H; or b) D210Y and W212N.
54. The method according to any of claims 50 to 53, wherein the mutations further comprise I45K.
55. The method according to claim 54, wherein the mutations further comprise Y49E.
56. The method according to any of claims 50 to 55, wherein the mutations further comprise G76A or G76N.
57. The method according to any of claims 50 to 56, wherein the mutations comprise D210Y and 0186A.
58. The method according to any of claims 49 to 57, wherein the mutations comprise one or more of: K104N, R111Q, and K187H.
59. The method according to claim 53, wherein the mutations are I45K, Y49E, G76A, 1(104N, R1075, R111Q 0186A, K187H, D210Y, and W212N.
60. A methyltransferase having an amino acid sequence of SEQ ID NO: 1 with one or more mutations, the one or more mutations being selected from: R116K, V1431, S77K, N106T, R201G, R81L, R115K, 571N, M138K,K20D,1(199R,1205de1, 0101K, 1154V, E128K, N139D, V1231, L117K, E204de1, N121D, R201E, D104Y, C203F, L32V, I177V, D147K, I66L, D385, D104E, N132K, P151_V152insN, N150_P151insE, D79N, 1(173G, 0107E, 1(200R, R201N, E149K, R115S, N187D, E149_N150insN, ICON, P23D, R176K, K100, N145E, K89N, 1155L, 1193V, Q59E, E100N, K103E, K75N, L127K, D170E, C65A, C203de1, K74D, and R167K.
61. The methyltransferase according to claim 60, wherein the mutations are: i) K100, K20D, and K75N; or H) K100,, K2ON, K75N, and R81L; or iii)1(100, K20D,1(75N, K74D, and K89N; or iv) D385, C65A, I66L, Q107E, N145E, E149K, I1551., R176K, I193V, K199R, K200R, R201E, and C203F; or v) K100, P23D, 577K, and D79N; or vi) D104Y, R115K, N121D, V1231, M138K, N139D, I155L, R167K, R176K, K200R, R201N, C203de1, E204de1, and 1205de1; or vii) L32V, D385, Q59E, C65A, 571N, E100N, Q101K, K103E, D104E, N106T, Q107E, R1155, R116K, L117K, N121D, V1231, L127K, E128K, N132K, M138K, N139D, V1431, D147K, E149_N150insN, N150_P151insE, P151_V152insN, I154V, I155L, R167K, D170E, 1(173G, R176K, I177V, N187D, I193V, K200R, R2016, C203de1, E204de1, and 1205del.
62. A methyltransferase having an amino acid sequence of SEQ ID NO: 3 with a 5233W mutation and optionally one or more further mutations selected from V2_A27de1, L97A, D101A, K1060, R129N and R1325.
63. The methyltransferase according to claim 62, wherein the further mutations are L97A, D101A, and R1325.
64. A methyltransferase having an amino acid sequence of SEQ ID NO: 4 with a mutation selected from D210H, D210Y, D210P, and D210W; and optionally one or more further mutations selected from 145K, Y49E, G76A, G76N, 1(104N, R1075, R111Q 0186A, 1(187H, W212A, W212H, W212N, L131, V132L, L133V, M134L, Y135M, V136Y, K137V, 0139K, 51400, L1415, E142, G143L, L144E, L145G, 1(146L, E147L, I148K, K149E, P1501, T151K, R152P, F154R, M155T, L156F, 5157M, E158L, N1595, 6160E, E161N, K162G, 1163E, N164K, 5166N, T1671, L1685, A169T, K170L, E171A, L172K, C173E, 01741., E175C, N176Q, R177E, P178N, T179R, V180P, 1181T, 1182V, 61831, 61841, F1856, 01866, 1(187F, 61880., 5189H, F190G, L1915, K192F, K193L, H194K, V195K, E196H, L197V, A198E, D199L, K200A, V201D, Y202K, A203V, V204Y, Y205A, 6206V, 5207Y, P2086, L2095, D210P, T211L, V213T, I214A, A215V, 52161, M217A, L2185, L219M, H220L, 6221L, Y222H, E223G, I224Y, K226I, 6227E, I228K, and I229G; or one or more further mutations selected from: I45K; Y49E; 676A or 676N; K104N; R107S; R111Q; K187H; Q186A; and W212A, W212H, or W212N.
65. The methyltransferase according to claim 64, wherein the mutations comprise: a) D210H and W212H; or b) D210Y and W212N.
66. The methyltransferase according to claim 64 or claim 65, wherein the mutations include I45K and Y49E.
67. The methyltransferase according to any of claims 64 to 66, wherein the mutations include 676A or 676N.
68. The methyltransferase according to claim 64, wherein the mutations are D210Y and Q186A.
69. The methyltransferase according to claim 64, wherein the mutations are I45K, Y49E, G76A, K104N, R1075, R111Q, Q186A, K187H, D210Y, and W212N.
70. A methyltransferase having an amino acid sequence of HQ ID NO: 18 with at least one of the following mutations: I45K, Y49E, 676A, 1(104N, R107S, R111Q, K187H, V201, Y202V, A203Y, V204A, Y205V, 6206Y, 52076, P208S, L209P, D210L, T211H, W212, V213T, I214A, A215V, S216I, M217A, L218S, L219M, H220L, 6221L, Y222H, E223, I2246, E225Y, K226E, 62271,1228E, and I229K.
71. The methyltransferase according to claim 70, having an amino acid sequence of SEQ ID NO: 12.
72. A methyltransferase having an amino acid sequence of SEQ ID NO: 5 with at least one mutation selected from: i) F201R or F201K; ii) E2120" E212K, E212R, E212N, E212S, E212C, E212A, E212T, E212M, E212H, E212G, E212V, E212L, or E2121; Hi) I267R or 1267K; iv) Y271R or Y271K.
73. A methyltransferase having an amino acid sequence of SEQ ID NO: 6 with at least one mutation selected from: i) F215R or F215K; H) E226Q, E226K, E226R, E226N, E226S, E226C, E226A, E226T, E226M, E226H, E226G, E226V, E226L, or E2261; Hi) A286R or A286K; iv) Y290R or Y290K.
74. A methyltransferase having an amino acid sequence of SEQ ID NO: 7 with at least one mutation selected from: i) Y224R or Y224K; H) E235Q, E235K, E235R, E235N, E2355, E235C, E235A, E235T, E235M, E235H, E235G, E235V, E235L, or E2351; Hi) A299R or A299K; iv) Y304R or Y304K.
75. A methyltransferase having an amino acid sequence of SEQ ID NO: 8 with at least one mutation selected from: i) F199R or F199K; H) E2100, E210K, E210R, E210N, E210S, E210C, E210A, E210T, E210M, E210H, E2106, E210V, E210L or E2101; iii) V265R or V265K; iv) Y269R or Y269K.
76. The methyltransferase of any of claims 72 to 75, wherein the amino acid sequence includes no more than two mutations.
77. A polynucleotide having a sequence encoding a methyltransferase as defined in any of claims 60 to 76.
78. An expression vector carrying the polynucleotide of claim 77.
79. A microorganism transduced with the expression vector of claim 78.
80. Use of an aqueous buffer as a reaction medium for a method of synthesising a 1N-alkyl derivative of pseudouridine as defined in any of claims 1 to 59, wherein the aqueous buffer includes the substrate, the cofactor, the methyltransferase, and Mg', the Mg' being present at a concentration in the range 0.5 mM to 15 mM; and the aqueous buffer has a pH in the range 7.4 to 7.6.
81. Use according to claim 80, wherein the aqueous buffer is a Tris buffer.
82. Use according to claim80 or claim 81, wherein the aqueous buffer includes NaCI at a concentration of 50 to 350 mM, and KCI at a concentration of 70 to 130 mM.
83. Use according to any of claims 80 to 82, wherein the Mg2+ is supplied by MgC12.
84. Use according to any of claims 80 to 83, wherein: the cofactor is present in the aqueous buffer at a concentration in the range 20 to 30 IJ.M; and/or the substrate is present in the aqueous buffer at a concentration in the range 80 to 120 p.M; and/or the methyltransferase is present in the aqueous buffer at a concentration in the range 0.5 to 20 RM.
85. Use according to claim 84, wherein the buffer has a pH of 7.5 ± 0.1 and includes: Tris-HCI at a concentration of about 25 mM; NaCI at a concentration of about 300 mM; MgC12 at a concentration of about 1 mM; (CI at a concentration of about 100 mM; the cofactor at a concentration of about 25 pM; and the substrate at a concentration of about 100 p.M.
GB2401559.6A 2024-02-06 2024-02-06 Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates Pending GB2642401A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB2401559.6A GB2642401A (en) 2024-02-06 2024-02-06 Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates
PCT/EP2025/052875 WO2025168590A2 (en) 2024-02-06 2025-02-05 Synthesis of 1n-methylpseudouridine and 1n-methylpseudouridine phosphates

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB2401559.6A GB2642401A (en) 2024-02-06 2024-02-06 Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates

Publications (3)

Publication Number Publication Date
GB202401559D0 GB202401559D0 (en) 2024-03-20
GB2642401A true GB2642401A (en) 2026-01-14
GB2642401A8 GB2642401A8 (en) 2026-01-21

Family

ID=90236532

Family Applications (1)

Application Number Title Priority Date Filing Date
GB2401559.6A Pending GB2642401A (en) 2024-02-06 2024-02-06 Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates

Country Status (2)

Country Link
GB (1) GB2642401A (en)
WO (1) WO2025168590A2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230383328A1 (en) * 2022-03-24 2023-11-30 Natures's Toolbox, Inc. Methods and compositions for the enzymatic production of pseudouridine triphosphate

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6503729B1 (en) * 1996-08-22 2003-01-07 The Board Of Trustees Of The University Of Illinois Selected polynucleotide and polypeptide sequences of the methanogenic archaeon, methanococcus jannashii
BR112012001351A2 (en) * 2009-07-22 2015-09-01 Univ California Method, genetically engineered yeast cell, and bacteria-yeast co-culture.
EP3223621A4 (en) * 2014-11-26 2018-10-17 Codexis, Inc. Biosynthetic production of caffeine
WO2021212034A1 (en) * 2020-04-16 2021-10-21 Nature's Toolbox, Inc. In vitro manufacturing and purification of therapeutic mrna
CN112301013B (en) * 2020-11-03 2022-11-08 深圳瑞德林生物技术有限公司 Complex enzyme and application thereof in preparation of ergothioneine
GB2614570A (en) 2022-01-10 2023-07-12 Uab Biomatter Designs C-nucleoside monophosphate synthesis

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230383328A1 (en) * 2022-03-24 2023-11-30 Natures's Toolbox, Inc. Methods and compositions for the enzymatic production of pseudouridine triphosphate

Also Published As

Publication number Publication date
GB202401559D0 (en) 2024-03-20
GB2642401A8 (en) 2026-01-21
WO2025168590A3 (en) 2025-11-20
WO2025168590A2 (en) 2025-08-14

Similar Documents

Publication Publication Date Title
US11859217B2 (en) Terminal deoxynucleotidyl transferase variants and uses thereof
Chou et al. A novel action of human apurinic/apyrimidinic endonuclease: excision of L-configuration deoxyribonucleoside analogs from the 3′ termini of DNA
US20190078065A1 (en) Modified dna polymerases
CN112105725A (en) Variants of terminal deoxynucleotidyl transferase and uses thereof
JP2023506760A (en) Chimeric Terminal Deoxynucleotide Transferases for Template-Free Enzymatic Synthesis of Polynucleotides
Lee et al. Pathways of thymidine hypermodification
JP2023522234A (en) Terminal deoxynucleotidyl transferase mutants and uses thereof
JP2021072813A (en) Method for producing nucleic acid molecule
WO2023240202A2 (en) Modified terminal deoxynucleotidyl transferase polymerases
KR20250128300A (en) Enzymatic synthesis of NTP and NQP
Li et al. Endogenous microenvironmental engineering through targeted alteration of salt bridge network can effectively regulate enzymatic pH adaptation
US8728789B2 (en) DNA fragment encoding a polyphosphate-driven nucleoside 5′-diphosphate kinase polypeptide
Weber et al. Identification and in vitro characterization of UDP-GlcNAc-RNA cap-modifying and decapping enzymes
Ooi et al. RNA lariat debranching enzyme
GB2642401A (en) Synthesis of 1N-methylpseudouridine and 1N-methylpseudouridine phosphates
Karkas et al. Stereochemical considerations in the enzymatic phosphorylation and antiviral activity of acyclonucleosides. I. Phosphorylation of 2′-nor-2′-deoxyguanosine
JP2025502552A (en) C-Nucleoside Monophosphate Synthesis
WO2022090323A1 (en) Enzymatic synthesis of polynucleotide probes
US20240309418A1 (en) Enzymatic methods and compositions for polymerization of phosphoramidate and thiophosphoramidate-linked dna
JP2023501651A (en) Highly efficient template-free enzymatic synthesis of polynucleotides
Sauerwald et al. N6-Acetyladenosine: a new modified nucleoside from Methanopyrus kandleri tRNA
KR20250128321A (en) Nucleic acid ligation method
WO2023197015A1 (en) Compositions and analysis of dephosphorylated oligoribonucleotides
Mitton-Fry et al. Structure-guided engineering of a polyphosphate kinase 2 class III from an Erysipelotrichaceae bacterium to produce base-modified purine nucleotides
Chang et al. Refolding and characterization of a yeast dehydrodolichyl diphosphate synthase overexpressed in Escherichia coli