WO2026017063A1

WO2026017063A1 - Vector and method for preparing circular rna free from non-target sequence

Info

Publication number: WO2026017063A1
Application number: PCT/CN2025/108778
Authority: WO
Inventors: 赵幼尚; 杨访问; 周奉雯; 孟庆伟; 陈康明
Original assignee: Nanjing Jinsirui Science and Technology Biology Corp
Current assignee: Nanjing Jinsirui Science and Technology Biology Corp
Priority date: 2024-07-16
Filing date: 2025-07-16
Publication date: 2026-01-22
Anticipated expiration: 2027-01-16

Abstract

The present application relates to an RNA molecule, which contains: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, with the elements being operably linked in sequence, wherein the 3' self-splicing intron fragment and the 5' self-splicing intron fragment can enable circularization of the RNA molecule and can be removed from the RNA molecule during the circularization process; the 3'-terminal sequence of the target sequence ends with a nucleotide having a U base; the 5' self-splicing intron fragment contains an internal guide sequence, and the 5'-terminal sequence of the internal guide sequence is reverse complementary to the 5'-terminal sequence of the target sequence; and the 3'-terminal sequence of the internal guide sequence starts with a nucleotide having a G base and is reverse complementary to the 3'-terminal sequence of the target sequence starting from the end thereof.

Description

Vectors and methods for preparing circular RNAs free of non-target sequences

相关申请的交叉引用Cross-reference to related applications

本申请要求2024年7月16日提交的申请号为202410954230.5的中国专利申请和2024年11月1日提交的申请号为202411556442.4的中国专利申请的优先权，其全部内容通过引入并入本文。This application claims priority to Chinese patent application No. 202410954230.5 filed on July 16, 2024 and Chinese patent application No. 202411556442.4 filed on November 1, 2024, the entire contents of which are incorporated herein by reference.

Invention Field

本申请涉及一种RNA分子，其包含：包含3'剪接位点的3'自剪接型内含子片段、目的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件按序可操作地连接，其中该3'自剪接型内含子片段和该5'自剪接型内含子片段能够使得该RNA分子环化并在环化过程中从该RNA分子中去除，其中该目的序列的3'端序列末尾是碱基为U的核苷酸，5'自剪接型内含子片段包含内部引导序列，该内部引导序列的5'端序列与目的序列的5'端序列反向互补，该内部引导序列的3’端序列以碱基为G的核苷酸为起始并与目的序列的3'端序列反向互补。本申请还涉及能够转录出上述RNA分子的DNA分子，以及本申请的RNA分子和DNA分子在制备环状RNA、特别是不含与3'自剪接型内含子片段或5'自剪接型内含子片段相邻的外显子的环状RNA分子中的用途。This application relates to an RNA molecule comprising: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked in sequence, wherein the 3' self-splicing intron fragment and the 5' self-splicing intron fragment enable the RNA molecule to be circularized and removed from the RNA molecule during the circularization process, wherein the 3' end of the target sequence is a U-terminal nucleotide, and the 5' self-splicing intron fragment includes an internal guide sequence whose 5' end is anticomplementary to the 5' end of the target sequence, and whose 3' end begins with a G-terminal nucleotide and is anticomplementary to the 3' end of the target sequence. This application also relates to DNA molecules capable of transcribing the aforementioned RNA molecules, and the use of the RNA and DNA molecules of this application in the preparation of circular RNA, particularly circular RNA molecules that do not contain exons adjacent to 3' self-splicing intron fragments or 5' self-splicing intron fragments.

Background Technology

随着技术的突破，核糖核酸(RNA)疗法，包括mRNA疫苗，已发展成了生物医药行业内具有强大潜力的新领域。然而，RNA，特别是mRNA，存在易降解的问题。如何提高mRNA的稳定性从而延长蛋白翻译持续时间，引起了人们极大的关注和兴趣。一种有效的途径是将mRNA转化为环状RNA(cirRNA)，即一类首尾共价连接的闭合环状单链RNA分子，其缺乏5'及3'末端，因而具有更强的抵御核酸外切酶降解的能力，从而拥有更长的半衰期。With technological breakthroughs, RNA therapy, including mRNA vaccines, has emerged as a promising new field within the biopharmaceutical industry. However, RNA, especially mRNA, is prone to degradation. Improving mRNA stability to prolong protein translation duration has attracted significant attention and interest. One effective approach is to convert mRNA into circular RNA (cirRNA), a type of covalently linked closed circular single-stranded RNA molecule lacking 5' and 3' ends, thus exhibiting stronger resistance to exonuclease degradation and a longer half-life.

基因片段中一般包含内含子和外显子，在RNA剪切后，外显子序列拼接在一起形成成熟mRNA。大多数内含子的剪切去除需要多种蛋白相互作用来完成，但也存在一些特殊的内含子，其通过自剪接的方式去除。这种具有自我剪接功能的内含子可以根据结构和剪接机制分为两类：I类和II类。I类内含子的加工是由外源鸟苷辅因子(GTP)介导的酯交换反应所启动的^[3,4]。II类内含子的剪接与真核生物mRNA前体的剪接相似，其中5'端剪接位点受到称为分支位点处凸起的核苷酸的-OH基团的攻击，先产生内含子套索结构，随后，5'外显子处释放的-OH攻击3'剪接位点，最终使得两端外显子拼接到一起^[4,5]。利用I类和II类内含子的这种自剪接功能，研究者通过序列设计，将自剪接内含子分成两半，分别装载到待环化RNA序列的两端，通过体外催化的方式可实现RNA的成环^[1,6-8]。Daniel G.Anderson等人^[1,6-8]利用I类自剪接型内含子如鱼腥藻(Anabaena)pre-tRNA-Leu基因或T4噬菌体Td基因(T4td)的内含子，实现了RNA的体外环化。由于环状RNA两端共价闭合，无裸露的5'端，因此没有帽子结构来启动RNA翻译。为了解决环状RNA翻译的问题，研究者在环状RNA中添加内部核糖体进入位点序列(IRES)，使其具有稳定的体外和体内翻译功能。Gene fragments generally contain introns and exons. After RNA splicing, the exon sequences are spliced together to form mature mRNA. Most intron splicing removal requires multiple protein interactions, but some special introns are removed through self-splicing. These introns with self-splicing function can be divided into two categories based on their structure and splicing mechanism: Class I and Class II. The processing of Class I introns is initiated by a transesterification reaction mediated by exogenous guanosine cofactor (GTP) ^[3,4] . The splicing of Class II introns is similar to that of eukaryotic mRNA precursors, where the 5' splice site is attacked by the -OH group of a nucleotide protruding at the branching site, first producing an intron lasso structure. Subsequently, the -OH released at the 5' exon attacks the 3' splice site, ultimately splicing the two exons together ^[4,5] . Utilizing the self-splicing function of class I and class II introns, researchers have designed sequences to split these introns in half and load them onto both ends of the RNA sequence to be circularized, achieving RNA circularization via in vitro catalysis ^[1,6-8] . Daniel G. Anderson et al. ^[1,6-8] used introns of class I self-splicing introns, such as the pre-tRNA-Leu gene from Anabaena or the Td gene (T4td) from T4 bacteriophage, to achieve in vitro RNA circularization. Because circular RNA is covalently closed at both ends and lacks a naked 5' end, it lacks a cap structure to initiate RNA translation. To address the translation problem of circular RNA, researchers added internal ribosome entry site sequences (IRES) to the circular RNA, giving it stable in vitro and in vivo translational functions.

I类自剪接型内含子的一级结构高度变化，但是具有相对保守的二级和三级结构。如图1所示，I类自剪接型内含子的二级结构通常由配对(P)元件P1-P10、以及单链的环区域构成。内含子在靠近5'端处包含一段内部引导序列(IGS)，该IGS的5'端序列能够与内含子3'端的相邻外显子互补配对，而IGS的3'端序列能够与内含子5'端的相邻外显子互补配对。在内含子和外显子互补配对后，在P1和P10的剪接位点即内含子-外显子交界处(图1中三角形指向的P1和P10区)发生剪切，使得内含子从整体结构中移除，P1和P10区的外显子连接^[9]。在将自剪接型内含子用于制备环状RNA时，需要加入内含子两侧相邻的外显子，以顺利完成剪接。在剪接完成后，内含子从最终的环状RNA中去除，而两侧的邻接外显子序列则留在环状RNA终产物中。The primary structure of class I self-splicing introns varies greatly, but they have relatively conserved secondary and tertiary structures. As shown in Figure 1, the secondary structure of class I self-splicing introns is usually composed of pairing (P) elements P1-P10 and a single-stranded loop region. The intron contains an internal guide sequence (IGS) near the 5' end. The 5' end sequence of the IGS can pair complementaryly with the adjacent exon at the 3' end of the intron, while the 3' end sequence of the IGS can pair complementaryly with the adjacent exon at the 5' end of the intron. After the intron and exon pair complementaryly, splicing occurs at the splice sites of P1 and P10, i.e., the intron-exon junction (the P1 and P10 regions pointed to by the triangle in Figure 1), so that the intron is removed from the overall structure and the exons in the P1 and P10 regions are connected ^[9] . When using self-splicing introns to prepare circular RNA, it is necessary to add the exons adjacent to both sides of the intron to complete the splicing smoothly. After splicing is complete, the introns are removed from the final circular RNA, while the adjacent exon sequences on both sides remain in the circular RNA final product.

此外，由于置换的内含子-外显子元件(PIE)的剪接位点位于IRES附近，并且两个序列都是高度结构化的，IRES序列可能会干扰剪接核酶的折叠。为了使这些结构能够独立折叠，于是在PIE剪接位点和IRES之间设计了一系列间隔序列，通过增加间隔序列，将剪接效率提高了将近一倍。Furthermore, since the splicing sites of the substituted intron-exon elements (PIEs) are located near the IRES, and both sequences are highly structured, the IRES sequence may interfere with the folding of splicing ribozymes. To enable these structures to fold independently, a series of spacer sequences were designed between the PIE splicing sites and the IRES. By increasing the number of spacer sequences, the splicing efficiency was nearly doubled.

然而，留在环状RNA终产物中的为自剪接所需的相邻外显子序列、以及人为添加的外源间隔序列，大大增加了免疫原性以及相关的治疗风险，可能导致环状RNA的应用范围受限。However, the adjacent exon sequences required for self-splicing and the artificially added exogenous spacer sequences remaining in the circular RNA end product greatly increase immunogenicity and related therapeutic risks, which may limit the application scope of circular RNA.

对于本申请中任何文件的引用，并不等同于承认这些文件是本申请的现有技术。Reference to any document in this application is not an admission that such document is prior art.

Summary of the Invention

本申请的发明人发现，在图2A所示的炭疽杆菌recA基因中自剪接型内含子中，内部引导序列(IGS)中用于与5'外显子序列(P1反义链)反向互补配对的3'端序列(P1)、和/或用于与3'外显子(P10反义链)反向互补配对的5'端序列(P10)，除P1中用于与5'外显子的最后一个U形成G:U配对剪接位点的G外，可以做改造(参见图2B)，从而使得可以使用不含5'相邻外显子、或3'相邻外显子、或以上两个外显子的自剪接型内含子来制备环状RNA，所制备的环状RNA中自然也不含上述5'相邻外显子、3'相邻外显子、或以上两个外显子。The inventors of this application have discovered that in the self-splicing introns of the Bacillus anthracis recA gene shown in Figure 2A, the 3' end sequence (P1) in the internal guide sequence (IGS) used for reverse complementary pairing with the 5' exon sequence (P1 antisense strand), and/or the 5' end sequence (P10) used for reverse complementary pairing with the 3' exon (P10 antisense strand), except for the G in P1 used to form a G:U pairing splicing site with the last U of the 5' exon, can be modified (see Figure 2B). This allows the use of self-splicing introns that do not contain a 5' adjacent exon, or a 3' adjacent exon, or both of the above exons to prepare circular RNA. Naturally, the prepared circular RNA also does not contain the aforementioned 5' adjacent exon, 3' adjacent exon, or both of the above exons.

具体地，将自剪接型内含子分成3'内含子片段和5'内含子片段两部分，将自剪接型内含子的3'内含子片段和5'内含子片段分别装载在待环化RNA序列的5'和3'侧，i)当待环化RNA的3'端序列末尾是碱基为U的核苷酸，且IGS的3'端序列设计成以碱基为G的核苷酸为起始，并与待环化RNA的3'端序列(作为P1反义链)，而非5'侧外显子，反向互补配对时，无需使用5'内含子片段的5'侧相邻外显子就可完成RNA环化，ii)当IGS的5'端序列设计成与待环化RNA的5'端序列(作为P10反义链)，而非3'侧外显子，反向互补配对时，无需使用3'内含子片段的3'侧相邻外显子就可完成RNA环化，当iii)当待环化RNA的3'端序列末尾是碱基为U的核苷酸，IGS的3'端序列设计成以碱基G的核苷酸为起始，并与待环化RNA的3'端序列(作为P1反义链)反向互补配对，且IGS的5'端序列设计成与待环化RNA的5'端序列(作为P10反义链)反向互补配对时，无需使用5'内含子片段的5'侧相邻外显子和3'内含子片段的3'侧相邻外显子就可完成RNA环化。所制备的环状RNA中不含上述5'侧相邻外显子、3'侧相邻外显子、或以上两个外显子。Specifically, the self-splicing intron is divided into a 3' intron fragment and a 5' intron fragment. The 3' and 5' intron fragments are loaded onto the 5' and 3' sides of the RNA sequence to be circularized, respectively. i) When the 3' end of the RNA to be circularized ends in a U nucleotide, and the 3' end of the IGS is designed to start with a G nucleotide and pair in reverse with the 3' end of the RNA to be circularized (as the P1 antisense strand), rather than the 5' exon, RNA circularization can be completed without using the adjacent 5' exon of the 5' intron fragment. ii) When the 5' end of the IGS is designed to pair with the 5' end of the RNA to be circularized (… When the 3' end of the RNA to be circularized is a U-terminal nucleotide, and the 3' end sequence of the IGS is designed to start with a G-terminal nucleotide and is reverse complementary to the 3' end sequence of the RNA to be circularized (as the P1 antisense strand), and the 5' end sequence of the IGS is designed to be reverse complementary to the 5' end sequence of the RNA to be circularized (as the P1 antisense strand), RNA circularization can be completed without using the 5' adjacent exons of the 5' intron and the 3' adjacent exons of the 3' intron. The prepared circular RNA does not contain the aforementioned 5' adjacent exons, 3' adjacent exons, or both of the above exons.

本申请的发明人对贝氏柯克斯体(Coxiella burnetii)23S核糖体基因中I类自剪接型内含子的IGS(如图5所示)做类似改造，发现可以实现相同的效果。因而，本申请的发明人认为，来自不同基因的I类自剪接型内含子，均可以进行上述的IGS改造，实现相同的效果。The inventors of this application made similar modifications to the IGS (as shown in Figure 5) of the class I self-splicing intron in the Coxiella burnetii 23S ribosome gene and found that the same effect could be achieved. Therefore, the inventors of this application believe that class I self-splicing introns from different genes can all undergo the above-mentioned IGS modification to achieve the same effect.

此外，本申请的发明人还发现，在做上述IGS改造、且内含子剪接位点与IRES之间未加人工间隔序列的情况下，RNA环化效率仍可以达到生产要求。Furthermore, the inventors of this application have discovered that even with the aforementioned IGS modification and without adding an artificial spacer sequence between the intron splicing site and the IRES, the RNA circularization efficiency can still meet production requirements.

因而，在第一个方面，本申请提供一种用于制备环状RNA的单链DNA分子，其可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、目的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接。Therefore, in a first aspect, this application provides a single-stranded DNA molecule for preparing circular RNA, which may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

3'自剪接型内含子片段和5'自剪接型内含子片段能够使得由该单链DNA分子的互补链所转录出的RNA分子环化。在由该单链DNA分子互补链转录出的RNA分子的环化过程中，由3'自剪接型内含子片段和5'自剪接型内含子片段转录得到的RNA片段从由该单链DNA分子互补链转录出的RNA分子中去除。3'自剪接型内含子片段和5'自剪接型内含子片段可以得自同一个自剪接型内含子，尤其是I类自剪接型内含子。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自炭疽杆菌recA基因中的自剪接型内含子、或贝氏柯克斯体23S核糖体基因中的自剪接型内含子。5'自剪接型内含子片段和3'自剪接型内含子片段，当以此顺序排布时，可以连接形成具有上述自剪接功能的自剪接型内含子，例如完整的自剪接型内含子。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自炭疽杆菌RecA基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:10所示的核苷酸序列。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:21所示的核苷酸序列。The 3' and 5' self-splicing intron fragments enable the circularization of RNA molecules transcribed from the complementary strand of the single-stranded DNA molecule. During the circularization of the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule, the RNA fragment transcribed from the 3' and 5' self-splicing intron fragments is removed from the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule. The 3' and 5' self-splicing intron fragments can be derived from the same self-splicing intron, particularly a type I self-splicing intron. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Bacillus anthracis* recA gene or the self-splicing intron in the *Coxiella belladonna* 23S ribosome gene. When arranged in this order, the 5' and 3' self-splicing intron fragments can be linked to form a self-splicing intron with the aforementioned self-splicing function, such as a complete self-splicing intron. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Bacillus anthracis* RecA gene, and the 3' self-splicing intron fragment can contain the nucleotide sequence shown in SEQ ID NO: 10. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Coxiella belladonna* 23S ribosome gene, and the 3' self-splicing intron fragment can contain the nucleotide sequence shown in SEQ ID NO: 21.

5'自剪接型内含子片段可以包含内部引导序列。5' self-splicing intron fragments can contain internal guide sequences.

内部引导序列的5'端序列可以设计成与目的序列的5'端序列反向互补。内部引导序列的5'端序列可以与目的序列的5'端的任意数量的核苷酸反向互补。例如，内部引导序列的5'端序列可以与目的序列的5'端的2-15nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14或15nt)，例如2-12nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的3-6nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，单链DNA分子还可以在5'自剪接型内含子片段的5'侧包含天然相邻外显子或对应于天然相邻外显子的序列。该天然相邻外显子可以包含部分或截短的序列。天然相邻外显子或对应于天然相邻外显子的序列可以包含长度3-20nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)的核苷酸，例如3-10nt。本申请的单链DNA分子可以至少不在3'自剪接型内含子片段的3'侧包含天然相邻外显子或对应于天然相邻外显子的序列。The 5' end sequence of the internal guide sequence can be designed to be anticomplementary to the 5' end sequence of the target sequence. The 5' end sequence of the internal guide sequence can be anticomplementary to any number of nucleotides at the 5' end of the target sequence. For example, the 5' end sequence of the internal guide sequence can be anticomplementary to 2-15 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 nt), such as 2-12 nt nucleotides, at the 5' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence can be anticomplementary to 3-6 nt nucleotides at the 5' end of the target sequence. In some embodiments, the single-stranded DNA molecule can also include a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 5' side of the 5' self-splicing intron fragment. This naturally adjacent exon can contain a partial or truncated sequence. Naturally adjacent exons or sequences corresponding to naturally adjacent exons may contain nucleotides of length 3-20 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt), such as 3-10 nt. The single-stranded DNA molecules of this application may at least not contain naturally adjacent exons or sequences corresponding to naturally adjacent exons on the 3' side of the 3' self-splicing intron fragment.

内部引导序列的3’端序列可以设计成与目的序列的3'端序列反向互补。内部引导序列的3’端序列可以以碱基为G的核苷酸为起始，并与目的序列的3'端序列反向互补。目的序列的3'端序列末尾可以是碱基为T的核苷酸。该单链DNA分子互补链转录出的RNA分子的内部引导序列3'端序列碱基为G的起始核苷酸能够与转录出的目的序列的3'端序列末尾的碱基为U的核苷酸形成G:U配对。内部引导序列的3'端序列可以与目的序列的3'端的任意数量的核苷酸反向互补。例如，内部引导序列的3'端序列可以与目的序列的3'端的3-20nt(例如3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)，例如3-15nt核苷酸反向互补。在一些实施方式中，内部引导序列的3'端序列可以与目的序列的3'端的3-10nt核苷酸反向互补。在另一些实施方式中，内部引导序列的3'端序列可以与目的序列的3'端的3-9nt核苷酸反向互补。在一些实施方式中，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列末尾开始反向互补，且目的序列在3'端序列末尾是碱基为T的核苷酸，单链DNA分子还可以在3'自剪接型内含子片段的3'侧包含天然相邻外显子或对应于天然相邻外显子的序列。该天然相邻外显子可以包含部分或截短的序列。天然相邻外显子或对应于天然相邻外显子的序列可以包含长度3-20nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)的核苷酸，例如3-10nt。本申请的单链DNA分子可以至少不在5'自剪接型内含子片段的5'侧包含天然相邻外显子或对应于天然相邻外显子的序列。The 3' end sequence of the internal guide sequence can be designed to be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the internal guide sequence can begin with a G-base nucleotide and be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the target sequence can end with a T-base nucleotide. The G-base initiating nucleotide of the internal guide sequence of the RNA molecule transcribed from the complementary strand of this single-stranded DNA molecule can form a G:U pair with the U-base at the end of the transcribed target sequence. The 3' end sequence of the internal guide sequence can be anticomplementary to any number of nucleotides at the 3' end of the target sequence. For example, the 3' end sequence of the internal guide sequence can be anticomplementary to 3-20 nt (e.g., 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt) of the 3' end of the target sequence, for example, 3-15 nt nucleotides. In some embodiments, the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-10 nt nucleotides at the 3' end of the target sequence. In other embodiments, the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-9 nt nucleotides at the 3' end of the target sequence. In some embodiments, the 3' end sequence of the internal guide sequence may begin with a G-base nucleotide and be anticomplementary to the 3' end of the target sequence, and the target sequence may end with a T-base nucleotide at the 3' end. The single-stranded DNA molecule may also contain a naturally adjacent exon or a sequence corresponding to the naturally adjacent exon on the 3' side of the 3' self-splicing intron fragment. The naturally adjacent exon may contain a partially or truncated sequence. The naturally adjacent exon or the sequence corresponding to the naturally adjacent exon may contain nucleotides of length 3-20 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt), for example, 3-10 nt. The single-stranded DNA molecule of this application may at least not contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 5' side of the 5' self-splicing intron fragment.

在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的2-15nt核苷酸反向互补，或内部引导序列的3'端序列可以与目的序列的3'端的3-20nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的2-15nt核苷酸反向互补，内部引导序列的3'端序列可以与目的序列的3'端的3-20nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的2-6nt核苷酸反向互补，或内部引导序列的3'端序列可以与目的序列的3'端的3-10nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的2-6nt核苷酸反向互补，内部引导序列的3'端序列可以与目的序列的3'端的3-10nt核苷酸反向互补。In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 2-15 nt nucleotides at the 5' end of the target sequence, or the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-20 nt nucleotides at the 3' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 2-15 nt nucleotides at the 5' end of the target sequence, and the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-20 nt nucleotides at the 3' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 2-6 nt nucleotides at the 5' end of the target sequence, or the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-10 nt nucleotides at the 3' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 2-6 nt nucleotides at the 5' end of the target sequence, and the 3' end sequence of the internal guide sequence may be anticomplementary to the 3-10 nt nucleotides at the 3' end of the target sequence.

在一些实施方式中，内部引导序列的5'端序列可以设计成与目的序列的5'端序列反向互补，内部引导序列的3’端序列可以设计成与目的序列的3'端序列反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列末尾开始反向互补，且目的序列在3'端序列末尾可以是碱基为T的核苷酸。在一些实施方式中，内部引导序列的3'端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列反向互补，其中该目的序列的3'端序列末尾可以是碱基为T的核苷酸，其中该单链DNA分子互补链转录出的RNA分子的内部引导序列3'端序列碱基为G的起始核苷酸能够与转录出的目的序列的3'端序列末尾的碱基为U的核苷酸形成G:U配对。本申请的单链DNA分子可以不在5'自剪接型内含子片段的5'侧包含天然相邻外显子或对应于天然相邻外显子的序列，且可以不在3'自剪接型内含子片段的3'侧包含天然相邻外显子或对应于天然相邻外显子的序列。In some embodiments, the 5' end sequence of the internal guide sequence can be designed to be anticomplementary to the 5' end sequence of the target sequence, and the 3' end sequence of the internal guide sequence can be designed to be anticomplementary to the 3' end sequence of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence can be anticomplementary to the 5' end sequence of the target sequence, and the 3' end sequence of the internal guide sequence can be initiated by a G-base nucleotide and anticomplementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence can be a T-base nucleotide. In some embodiments, the 3' end sequence of the internal guide sequence can be initiated by a G-base nucleotide and anticomplementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence can be a T-base nucleotide, wherein the G-base initiating nucleotide of the 3' end sequence of the internal guide sequence of the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule can form a G:U pair with the U-base nucleotide at the end of the transcribed 3' end sequence of the target sequence. The single-stranded DNA molecule of this application may not contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 5' side of the 5' self-splicing intron fragment, and may not contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 3' side of the 3' self-splicing intron fragment.

目的序列可以包含编码目的肽或蛋白的开放阅读框、翻译功能元件的序列、非编码DNA的序列、单克隆位点、或多克隆位点。翻译功能元件可以选自翻译起始元件、和翻译增强元件。翻译起始元件可以是开启RNA翻译的序列，例如内部核糖体进入位点(IRES)。翻译增强元件可以是增强RNA翻译的序列，例如poly(A)尾、poly(C)、非cap依赖性翻译增强子(CITE)、或EIF4配体家族识别序列等。在本申请单链DNA所制备的环状RNA分子中，翻译增强元件可以位于编码目的肽或蛋白的开放阅读框的5'侧或3'侧。在本申请单链DNA的互补链所制备的环状RNA分子中，翻译增强元件可以位于翻译起始元件的5'侧或3'侧。在一些实施方式中，翻译功能元件可以包含翻译起始元件。在一些实施方式中，翻译功能元件可以为翻译起始元件。在一些实施方式中，翻译起始元件可以为IRES。The target sequence may include an open reading frame encoding a target peptide or protein, a sequence of translational functional elements, a sequence of non-coding DNA, a single cloning site, or a multiple cloning site. The translational functional element may be selected from translation initiation elements and translation enhancement elements. The translation initiation element may be a sequence that initiates RNA translation, such as an internal ribosome entry site (IRES). The translation enhancement element may be a sequence that enhances RNA translation, such as a poly(A) tail, a poly(C) tail, a cap-independent translation enhancer (CITE), or an EIF4 ligand family recognition sequence. In the circular RNA molecule prepared from the single-stranded DNA of this application, the translation enhancement element may be located on the 5' or 3' side of the open reading frame encoding the target peptide or protein. In the circular RNA molecule prepared from the complementary strand of the single-stranded DNA of this application, the translation enhancement element may be located on the 5' or 3' side of the translation initiation element. In some embodiments, the translational functional element may include a translation initiation element. In some embodiments, the translational functional element may be a translation initiation element. In some embodiments, the translation initiation element may be an IRES.

目的序列可以包含编码目的肽或蛋白的开放阅读框、和翻译功能元件的序列。目的序列可以从5'端到3'端包含(a)该翻译功能元件(例如IRES)的序列、和该编码目的肽或蛋白的开放阅读框，(b)该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列，(c)该翻译功能元件(例如IRES)的序列的3'端序列、该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列的5'端序列，其中该翻译功能元件的5'端序列和该翻译功能元件的3'端序列在以此顺序排列时形成该翻译功能元件，或(d)该编码目的肽或蛋白的开放阅读框的3'端序列、该翻译功能元件(例如IRES)的序列、和该编码目的肽或蛋白的开放阅读框的5'端序列，其中该编码目的肽或蛋白的开放阅读框的5'端序列和该编码目的肽或蛋白的开放阅读框的3'端序列在以此顺序排列时形成该编码目的肽或蛋白的开放阅读框。在一些实施方式中，目的序列可以包含一个或多个编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)的序列。目的序列可以从5'端到3'端包含(a)该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列，(b)该翻译功能元件(例如IRES)的序列、和该一个或多个编码目的肽或蛋白的开放阅读框，(c)该翻译功能元件(例如IRES)的序列的3'端序列、该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列的5'端序列，或(d)其中一个编码目的肽或蛋白的开放阅读框的3'端序列、其他编码目的肽或蛋白的开放阅读框(可能存在的)、该翻译功能元件(例如IRES)的序列、和其中一个编码目的肽或蛋白的开放阅读框的5'端序列。The target sequence may contain an open reading frame encoding the target peptide or protein, and a sequence of translational functional elements. The target sequence may include, from the 5' end to the 3' end, (a) the sequence of the translational functional element (e.g., IRES) and the open reading frame encoding the target peptide or protein, (b) the open reading frame encoding the target peptide or protein and the sequence of the translational functional element (e.g., IRES), (c) the 3' end sequence of the translational functional element (e.g., IRES), the open reading frame encoding the target peptide or protein, and the 5' end sequence of the translational functional element (e.g., IRES), wherein the 5' end sequence and the 3' end sequence of the translational functional element form the translational functional element when arranged in this order, or (d) the 3' end sequence of the open reading frame encoding the target peptide or protein, the sequence of the translational functional element (e.g., IRES), and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence and the 3' end sequence of the open reading frame encoding the target peptide or protein form the open reading frame encoding the target peptide or protein when arranged in this order. In some embodiments, the target sequence may comprise one or more open reading frames encoding a target peptide or protein, and a sequence of a translational functional element (e.g., IRES). The target sequence may comprise, from its 5' end to its 3' end, (a) the open reading frame encoding the one or more target peptide or protein, and the sequence of the translational functional element (e.g., IRES); (b) the sequence of the translational functional element (e.g., IRES), and the one or more open reading frames encoding the target peptide or protein; (c) the 3' end sequence of the translational functional element (e.g., IRES), the one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of the translational functional element (e.g., IRES); or (d) the 3' end sequence of one of the open reading frames encoding the target peptide or protein, other open reading frames encoding the target peptide or protein (which may exist), the sequence of the translational functional element (e.g., IRES), and the 5' end sequence of one of the open reading frames encoding the target peptide or protein.

在一些实施方式中，该目的序列可以包含编码目的肽或蛋白的开放阅读框、和翻译功能元件的序列，其中该翻译功能元件选自翻译起始元件、和翻译增强元件，其中该目的序列从5'端到3'端可以包含翻译功能元件(例如IRES)的序列的3'端序列、该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列的5'端序列，其中该翻译功能元件的5'端序列和该翻译功能元件的3'端序列在以此顺序排列时形成该翻译功能元件。在一些实施方式中，该目的序列可以包含编码目的肽或蛋白的开放阅读框、和翻译功能元件的序列，其中该翻译功能元件选自翻译起始元件、和翻译增强元件，其中该目的序列从5'端到3'端可以包含该编码目的肽或蛋白的开放阅读框的3'端序列、该翻译功能元件的序列、和该编码目的肽或蛋白的开放阅读框的5'端序列，其中该编码目的肽或蛋白的开放阅读框的5'端序列和该编码目的肽或蛋白的开放阅读框的3'端序列在以此顺序排列时形成该编码目的肽或蛋白的开放阅读框。In some embodiments, the target sequence may include an open reading frame encoding a target peptide or protein and a sequence of a translational functional element, wherein the translational functional element is selected from translation initiation elements and translation enhancement elements, wherein the target sequence may include, from the 5' end to the 3' end, the 3' end sequence of a translational functional element (e.g., IRES), the open reading frame encoding the target peptide or protein, and the 5' end sequence of the translational functional element (e.g., IRES), wherein the 5' end sequence of the translational functional element and the 3' end sequence of the translational functional element are arranged in this order to form the translational functional element. In some embodiments, the target sequence may include an open reading frame encoding a target peptide or protein and a sequence of a translational functional element, wherein the translational functional element is selected from translation initiation elements and translation enhancement elements, wherein the target sequence may include, from the 5' end to the 3' end, the 3' end sequence of the open reading frame encoding the target peptide or protein, the sequence of the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence of the open reading frame encoding the target peptide or protein and the 3' end sequence of the open reading frame encoding the target peptide or protein are arranged in this order to form the open reading frame encoding the target peptide or protein.

目的序列可以包含非编码DNA的序列。目的序列可以(a)包含该非编码DNA的序列，或(b)从5'端到3'端包含该非编码DNA的序列的3'端序列、和该非编码DNA的序列的5'端序列，其中该非编码DNA序列的5'端序列和该非编码DNA序列的3'端序列在以此顺序排列时形成该非编码RNA序列。The target sequence may contain a sequence of non-coding DNA. The target sequence may (a) contain the sequence of the non-coding DNA, or (b) contain the 3' end sequence of the non-coding DNA sequence and the 5' end sequence of the non-coding DNA sequence from the 5' end to the 3' end, wherein the 5' end sequence of the non-coding DNA sequence and the 3' end sequence of the non-coding DNA sequence, when arranged in this order, form the non-coding RNA sequence.

目的序列可以包含编码目的肽或蛋白的开放阅读框。目的序列可以(a)包含该编码目的肽或蛋白的开放阅读框，或(b)从5'端到3'端包含该编码目的肽或蛋白的开放阅读框的3'端序列、和该编码目的肽或蛋白的开放阅读框的5'端序列，其中该编码目的肽或蛋白的开放阅读框的5'端序列和该编码目的肽或蛋白的开放阅读框的3'端序列在以此顺序排列时形成该编码目的肽或蛋白的开放阅读框。在一些实施方式中，目的序列可以包含一个或多个编码目的肽或蛋白的开放阅读框。目的序列可以从5'端到3'端包含(a)该一个或多个编码目的肽或蛋白的开放阅读框，或(b)其中一个编码目的肽或蛋白的开放阅读框的3'端序列、其他编码目的肽或蛋白的开放阅读框(如存在的情况下)、和其中一个编码目的肽或蛋白的开放阅读框的5'端序列。The target sequence may contain an open reading frame (OPF) encoding a target peptide or protein. The target sequence may (a) contain the OPF encoding the target peptide or protein, or (b) contain, from the 5' end to the 3' end, the 3' end sequence of the OPF encoding the target peptide or protein, and the 5' end sequence of the OPF encoding the target peptide or protein, wherein the 5' end sequence of the OPF encoding the target peptide or protein and the 3' end sequence of the OPF encoding the target peptide or protein, when arranged in this order, form the OPF encoding the target peptide or protein. In some embodiments, the target sequence may contain one or more OPF encodings of a target peptide or protein. The target sequence may contain, from the 5' end to the 3' end, (a) the one or more OPF encodings of a target peptide or protein, or (b) the 3' end sequence of one of the OPF encodings of a target peptide or protein, other OPF encodings of a target peptide or protein (if present), and the 5' end sequence of one of the OPF encodings of a target peptide or protein.

目的序列可以包含单克隆位点、或多克隆位点。可以通过单克隆位点、或多克隆位点在本申请的单链DNA分子中插入任何想要的序列，例如编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)的序列等。The target sequence may contain a single cloning site or a multiple cloning site. Any desired sequence, such as an open reading frame encoding a target peptide or protein, or a sequence of a translational functional element (e.g., IRES), can be inserted into the single-stranded DNA molecule of this application through a single cloning site or a multiple cloning site.

目的序列可以包含单克隆位点或多克隆位点、和翻译功能元件(例如IRES)的序列。目的序列可以从5'端到3'端包含该翻译功能元件(例如IRES)的序列的3'端序列、该单克隆位点或多克隆位点、和该翻译功能元件(例如IRES)的序列的5'端序列。可以通过单克隆位点、或多克隆位点在本申请的单链DNA分子中插入任何想要的序列，例如编码目的肽或蛋白的开放阅读框等。The target sequence may contain a single-cloning site or a multiple-cloning site, and a sequence of a translational functional element (e.g., IRES). The target sequence may include, from the 5' end to the 3' end, the 3' end sequence of the translational functional element (e.g., IRES), the single-cloning site or multiple-cloning site, and the 5' end sequence of the translational functional element (e.g., IRES). Any desired sequence, such as an open reading frame encoding a target peptide or protein, can be inserted into the single-stranded DNA molecule of this application through the single-cloning site or multiple-cloning site.

目的序列的长度可以为50-8000个核苷酸。The target sequence can be 50-8000 nucleotides in length.

目的肽或蛋白可以是真核或原核来源的肽或蛋白。目的肽或蛋白可以是人源或非人源的肽或蛋白。在一些实施方式中，目的肽或蛋白可以是抗原蛋白、抗体、或蛋白酶等。在一些实施方式中，目的肽或蛋白可以是萤火虫荧光素酶、长腹水蚤荧光素酶、高斯荧光素酶、绿色荧光蛋白等。The target peptide or protein can be of eukaryotic or prokaryotic origin. It can be human or non-human. In some embodiments, the target peptide or protein can be an antigen protein, antibody, or protease, etc. In some embodiments, the target peptide or protein can be firefly luciferase, long-bellied luciferase, Gaussian luciferase, green fluorescent protein, etc.

翻译功能元件可以为翻译起始元件，其中该翻译起始元件可以为内部核糖体进入位点(IRES)。The translation functional element can be a translation initiation element, wherein the translation initiation element can be an internal ribosome entry site (IRES).

内部核糖体进入位点(IRES)可以为源自柯萨奇病毒B3(CVB3)、肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)、肠病毒A(EV-A)、人鼻病毒B6(HRVB6)、柯萨奇病毒A(CVB1/2)、Taura综合征病毒、吸血猎蝽病毒、泰累尔氏脑脊髓炎病毒、猿猴病毒40、红火蚁病毒1、禾谷缢管蚜病毒、网状内皮增生症病毒、福曼脊髓灰质炎病毒1、大豆尺蠖病毒、克什米尔蜂病毒、人鼻病毒2、琉璃叶蝉病毒-1、人免疫缺陷病毒1型、琉璃叶蝉病毒-1、虱P病毒、丙型肝炎病毒、甲型肝炎病毒、GB型肝炎病毒、口蹄疫病毒、人肠道病毒71、马鼻病毒、茶尺蠖样病毒、脑心肌炎病毒(EMCV)、果蝇C病毒、十字花科烟草病毒、蟋蟀麻痹病毒、牛病毒性腹泻病毒1、黑皇后细胞病毒、蚜虫致死麻痹病毒、禽脑脊髓炎病毒、急性蜂麻痹病毒、芙蓉枯黄环斑病毒、猪瘟病毒、人类FGF2、人类SFTPA1、人类AMLl/RUNXl、果蝇触角足、人类AQP4、人类AT1R、人类BAG-1、人类BCL2、人类BiP、人类c-IAPl、人类c-myc、人类eIF4G、小鼠NDST4L、人类LEF1、小鼠HIF1α、人类n.myc、小鼠Gtx、人类p27kipl、人类PDGF2/c-sis、人类p53、人类Pim-1、小鼠Rbm3、果蝇reaper、犬Scamper、果蝇Ubx、唾液病毒、柯萨病毒、双埃柯病毒、人类UNR、小鼠UtrA、人类VEGF-A、人类XIAP、果蝇hairless、酿酒酵母TFIID、酿酒酵母YAP1、人类c-src、人类FGF-1、猿猴小核糖核酸病毒、芜菁绉缩病病毒、或eIF4G适体的IRES。在一些实施方式中，IRES可以选自柯萨奇病毒B3(CVB3)、肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)、肠病毒A(EV-A)或人鼻病毒B6(HRVB6)的IRES。在一些实施方式中，IRES可以为柯萨奇病毒B3(CVB3)的IRES。编码IRES的序列可以包含与SEQ ID NO:4具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为肠病毒B107(EVB107)的IRES。编码IRES的序列可以包含与SEQ ID NO:61具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为人鼻病毒B3(HRVB3)的IRES。编码IRES的序列可以包括与SEQ ID NO:74具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为肠病毒A(EV-A)的IRES。编码IRES的序列可以包括与SEQ ID NO:75或SEQ ID NO:76具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。Internal ribosome entry sites (IRES) can originate from Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human rhinovirus B3 (HRVB3), Enterovirus A (EV-A), Human rhinovirus B6 (HRVB6), Coxsackievirus A (CVB1/2), Taura syndrome virus, blood-sucking assassin bug virus, Tyrell's encephalomyelitis virus, simian virus 40, red imported fire ant virus 1, rice constrictor aphid virus, reticuloendotheliosis virus, and Forman poliovirus. Toxin 1, Soybean Looper Virus, Kashmir Bee Virus, Human Rhinovirus 2, Glass Leafhopper Virus-1, Human Immunodeficiency Virus Type 1, Glass Leafhopper Virus-1, Lice P Virus, Hepatitis C Virus, Hepatitis A Virus, GB Hepatitis Virus, Foot-and-Mouth Disease Virus, Human Enterovirus 71, Equine Rhinovirus, Tea Looper-like Virus, Encephalomyelitis Virus (EMCV), Fruit Fly C Virus, Cruciferous Tobacco Virus, Cricket Paralysis Virus, Bovine Viral Diarrhea Virus 1, Black Queen Cell Virus, Aphid Lethal Paralysis Virus, Avian Encephalomyelitis Virus Acute bee paralysis virus, Hibiscus rosa-spot virus, classical swine fever virus, human FGF2, human SFTPA1, human AML1/RUNX1, Drosophila antennae and legs, human AQP4, human AT1R, human BAG-1, human BCL2, human BiP, human c-IAPl, human c-myc, human eIF4G, mouse NDST4L, human LEF1, mouse HIF1α, human n.myc, mouse Gtx, human p27kipl, human PDGF2/ c-sis, human p53, human Pim-1, mouse Rbm3, fruit fly reaper, canine scamper, fruit fly UBX, salivary virus, Coxsackievirus, bi-echovirus, human UNR, mouse UtrA, human VEGF-A, human XIAP, fruit fly hairless, Saccharomyces cerevisiae TFIID, Saccharomyces cerevisiae YAP1, human c-src, human FGF-1, simian microRNA virus, turnip shrunkenness virus, or IRES of eIF4G aptamers. In some embodiments, the IRES may be selected from IRES of Coxsackievirus B3 (CVB3), enterovirus B107 (EVB107), human rhinovirus B3 (HRVB3), enterovirus A (EV-A), or human rhinovirus B6 (HRVB6). In some embodiments, the IRES may be an IRES of Coxsackievirus B3 (CVB3). The sequence encoding IRES may comprise a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:4. In some embodiments, IRES may be an IRES of enterovirus B107 (EVB107). The sequence encoding IRES may comprise a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:61. In some embodiments, IRES may be an IRES of human rhinovirus B3 (HRVB3). The sequence encoding IRES may include a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:74. In some embodiments, IRES may be an IRES of enterovirus A (EV-A). The sequence encoding IRES may include a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:75 or SEQ ID NO:76.

在一些实施方式中，目的序列可以从5'端到3'端包含(a)IRES的序列的3'端序列、一个或多个编码目的肽或蛋白的开放阅读框、和IRES的序列的5'端序列，或者(b)该IRES的序列的3'端序列、该单克隆位点或多克隆位点、和该IRES的序列的5'端序列，其中IRES为包含SEQ ID NO:4所示核苷酸序列的柯萨奇病毒B3(CVB3)的IRES，包含SEQ ID NO:61所示核苷酸序列的肠病毒B107(EVB107)的IRES,包含SEQ ID NO:74所示核苷酸序列的人鼻病毒B3(HRVB3)的IRES，或包含SEQ ID NO:75或SEQ ID NO:76所示核苷酸序列的肠病毒A(EV-A)的IRES。编码IRES的序列的3'端序列和编码IRES的序列的5'端序列分别包含：(1)SEQ ID NO:26和27所示的核苷酸序列，(2)SEQ ID NO:40和41所示的核苷酸序列，(3)SEQ ID NO:46和47所示的核苷酸序列，(4)SEQ ID NO:55和56所示的核苷酸序列，(5)SEQ ID NO:62和63所示的核苷酸序列，或(6)SEQ ID NO:65和66所示的核苷酸序列。In some embodiments, the target sequence may include, from the 5' end to the 3' end, (a) the 3' end sequence of the IRES sequence, one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of the IRES sequence, or (b) the 3' end sequence of the IRES sequence, the single cloning site or multiple cloning site, and the 5' end sequence of the IRES sequence, wherein the IRES is an IRES of Coxsackievirus B3 (CVB3) containing the nucleotide sequence shown in SEQ ID NO:4, an IRES of enterovirus B107 (EVB107) containing the nucleotide sequence shown in SEQ ID NO:61, an IRES of human rhinovirus B3 (HRVB3) containing the nucleotide sequence shown in SEQ ID NO:74, or an IRES of enterovirus A (EV-A) containing the nucleotide sequence shown in SEQ ID NO:75 or SEQ ID NO:76. The 3' end sequence encoding IRES and the 5' end sequence encoding IRES contain: (1) the nucleotide sequences shown in SEQ ID NO:26 and 27, (2) the nucleotide sequences shown in SEQ ID NO:40 and 41, (3) the nucleotide sequences shown in SEQ ID NO:46 and 47, (4) the nucleotide sequences shown in SEQ ID NO:55 and 56, (5) the nucleotide sequences shown in SEQ ID NO:62 and 63, or (6) the nucleotide sequences shown in SEQ ID NO:65 and 66.

在一些实施方式中，该编码目的肽或蛋白的开放阅读框为编码绿色荧光蛋白的开放阅读框，其中编码绿色荧光蛋白的开放阅读框包含SEQ ID NO:5所示的核苷酸序列。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、翻译功能元件(例如IRES)的序列、和编码目的肽或蛋白的开放阅读框的5'端序列；优选地，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:12所示的核苷酸。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、翻译功能元件(例如IRES)的序列、和编码目的肽或蛋白的开放阅读框的5'端序列，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:12所示的核苷酸。In some embodiments, the open reading frame encoding the target peptide or protein is an open reading frame encoding green fluorescent protein, wherein the open reading frame encoding green fluorescent protein comprises the nucleotide sequence shown in SEQ ID NO:5. In some embodiments, the target sequence may comprise, from the 5' end to the 3' end, the 3' end sequence encoding the target peptide or protein, the sequence of a translational functional element (e.g., IRES), and the 5' end sequence encoding the target peptide or protein; preferably, the 3' end sequence encoding the target peptide or protein and the 5' end sequence encoding the target peptide or protein may respectively comprise the nucleotides shown in SEQ ID NO:12. In some embodiments, the target sequence may include, from the 5' end to the 3' end, a 3' end sequence of an open reading frame encoding a target peptide or protein, a sequence of a translational functional element (e.g., IRES), and a 5' end sequence of an open reading frame encoding a target peptide or protein, wherein the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein may respectively contain nucleotides AA and SEQ ID NO:12.

在一些实施方式中，编码目的肽或蛋白的开放阅读框可以为编码荧光素酶的开放阅读框，其中编码荧光素酶的开放阅读框可以包含SEQ ID NO:28所示的核苷酸序列。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、翻译功能元件(例如IRES)的序列、和编码目的肽或蛋白的开放阅读框的5'端序列；优选地，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:29，或者SEQ ID NO:32和33所示的核苷酸序列。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、翻译功能元件(例如IRES)的序列、和编码目的肽或蛋白的开放阅读框的5'端序列，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:29，或者SEQ ID NO:32和33所示的核苷酸序列。In some embodiments, the open reading frame encoding the target peptide or protein may be an open reading frame encoding luciferase, wherein the open reading frame encoding luciferase may contain the nucleotide sequence shown in SEQ ID NO:28. In some embodiments, the target sequence may contain, from the 5' end to the 3' end, the 3' end sequence encoding the target peptide or protein, the sequence of a translational functional element (e.g., IRES), and the 5' end sequence encoding the target peptide or protein; preferably, the 3' end sequence encoding the target peptide or protein and the 5' end sequence encoding the target peptide or protein may respectively contain AA and the nucleotide sequences shown in SEQ ID NO:29, or SEQ ID NO:32 and 33. In some embodiments, the target sequence may include, from the 5' end to the 3' end, a 3' end sequence of an open reading frame encoding a target peptide or protein, a sequence of a translational functional element (e.g., IRES), and a 5' end sequence of an open reading frame encoding a target peptide or protein, wherein the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein may respectively include AA and SEQ ID NO:29, or the nucleotide sequences shown in SEQ ID NO:32 and 33.

在一些实施方式中，3'自剪接型内含子片段和该5'自剪接型内含子片段可以得自炭疽杆菌RecA基因中的自剪接型内含子、或得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子。In some embodiments, the 3' self-splicing intron fragment and the 5' self-splicing intron fragment can be derived from the self-splicing intron in the RecA gene of Bacillus anthracis, or from the self-splicing intron in the 23S ribosomal gene of Coxiella behnkeni.

本申请的单链DNA分子还可以在3'自剪接型内含子片段的上游或5'侧包含5'同源臂的序列，以及在5'自剪接型内含子片段的下游或3'侧包含3'同源臂的序列。在一些实施方式中，单链DNA分子可以从5'端到3'端依次包含5'同源臂的序列、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、和3'同源臂的序列。5'同源臂和3'同源臂可以互补配对，例如形成至少约15％、25％、35％、45％、55％、65％、75％、80％、85％、90％、95％、96％、97％、98％、99％或100％碱基配对。在一些实施方式中，5'同源臂和3'同源臂可以互补形成含有一个回文茎和一个茎-环的结构，其中回文茎与茎-环的茎部连接，其中5'同源臂的5'端和3'同源臂的3'端在回文茎部相邻。在一些实施方式中，5'同源臂和3'同源臂可以互补形成含有一个回文茎和两个茎-环的结构，其中回文茎与两个茎-环的茎部连接，两个茎-环的茎部连接，其中5'同源臂的5'端和3'同源臂的3'端在回文茎或其中一个茎-环的环部相邻。在一些实施方式中，5'同源臂的序列和3'同源臂的序列可以分别包含与SEQ ID NO:2和7具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。The single-stranded DNA molecule of this application may also include a sequence of 5' homologous arms upstream of or 5' side of the 3' self-splicing intron fragment, and a sequence of 3' homologous arms downstream of or 3' side of the 5' self-splicing intron fragment. In some embodiments, the single-stranded DNA molecule may sequentially include, from the 5' end to the 3' end, a sequence of 5' homologous arms, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, and a sequence of 3' homologous arms. The 5' homologous arms and 3' homologous arms may be complementary, for example, forming at least about 15%, 25%, 35%, 45%, 55%, 65%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% base pairing. In some embodiments, the 5' and 3' homologous arms can complementarily form a structure containing a palindromic stem and a stem-loop, wherein the palindromic stem is connected to the stem portion of the stem-loop, and the 5' end of the 5' homologous arm and the 3' end of the 3' homologous arm are adjacent at the palindromic stem portion. In some embodiments, the 5' and 3' homologous arms can complementarily form a structure containing a palindromic stem and two stem-loops, wherein the palindromic stem is connected to the stem portions of two stem-loops, and the stem portions of two stem-loops are connected, wherein the 5' end of the 5' homologous arm and the 3' end of the 3' homologous arm are adjacent at the loop portion of the palindromic stem or one of the stem-loops. In some embodiments, the sequences of the 5' and 3' homologous arms can respectively comprise nucleotide sequences having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO: 2 and 7.

本申请的单链DNA分子，还可以在5'端，例如3'自剪接型内含子片段或5'同源臂的序列的5'侧，包含RNA聚合酶启动子。RNA聚合酶启动子可以为源自T7病毒、T6病毒、SP6病毒、T3病毒、或T4病毒的RNA聚合酶启动子。在一些实施方式中，RNA聚合酶启动子可以为T7病毒启动子，其可以包含与SEQ ID NO:1具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。The single-stranded DNA molecule of this application may also contain an RNA polymerase promoter at its 5' end, for example, on the 5' side of a 3' self-splicing intron fragment or a 5' homologous arm sequence. The RNA polymerase promoter may be an RNA polymerase promoter derived from T7 virus, T6 virus, SP6 virus, T3 virus, or T4 virus. In some embodiments, the RNA polymerase promoter may be a T7 virus promoter, which may contain a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:1.

本申请的单链DNA分子，还可以在3'端，例如5'自剪接型内含子片段或3'同源臂的序列的3'侧，包含限制性内切酶位点。限制性内切酶切位点可以是例如EcoRI、EcoRV。The single-stranded DNA molecule of this application may also contain a restriction endonuclease site at its 3' end, for example, on the 3' side of a 5' self-splicing intron fragment or a 3' homologous arm sequence. The restriction endonuclease site may be, for example, EcoRI or EcoRV.

本申请的单链DNA分子还可以在3'自剪接型内含子片段与目的序列之间包含5'间隔序列，和/或在目的序列与5'自剪接型内含子片段之间包含3'间隔序列。特别地，本申请的单链DNA分子可以不在3'自剪接型内含子片段与目的序列之间包含5'间隔序列，和/或不在目的序列与5'自剪接型内含子片段之间包含3'间隔序列。The single-stranded DNA molecule of this application may also include a 5' spacer sequence between the 3' self-splicing intron fragment and the target sequence, and/or include a 3' spacer sequence between the target sequence and the 5' self-splicing intron fragment. In particular, the single-stranded DNA molecule of this application may not include a 5' spacer sequence between the 3' self-splicing intron fragment and the target sequence, and/or may not include a 3' spacer sequence between the target sequence and the 5' self-splicing intron fragment.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：RNA聚合酶启动子、5'同源臂的序列、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、和3'同源臂的序列，其中各元件可操作地连接。在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次由RNA聚合酶启动子、5'同源臂的序列、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、和3'同源臂的序列构成，其中各元件可操作地连接。在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：RNA聚合酶启动子、5'同源臂的序列、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、3'同源臂的序列、和限制性内切酶位点，其中各元件可操作地连接。在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端由RNA聚合酶启动子、5'同源臂的序列、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、3'同源臂的序列、和限制性内切酶位点构成，其中各元件可操作地连接。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise, from the 5' end to the 3' end: an RNA polymerase promoter, a sequence of a 5' homologous arm, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, and a sequence of a 3' homologous arm, wherein each element is operatively linked. In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise, from the 5' end to the 3' end, an RNA polymerase promoter, a sequence of a 5' homologous arm, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, and a sequence of a 3' homologous arm, wherein each element is operatively linked. In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise, from the 5' end to the 3' end: an RNA polymerase promoter, a sequence of a 5' homologous arm, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, a sequence of a 3' homologous arm, and a restriction endonuclease site, wherein each element is operatively linked. In some embodiments, the single-stranded DNA molecule of this application may be composed, from the 5' end to the 3' end, of an RNA polymerase promoter, a sequence of a 5' homologous arm, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, a sequence of a 3' homologous arm, and a restriction endonuclease site, wherein each element is operatively linked.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段和3’外显子、AA、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段和3’外显子、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:3、4、12、和9；或SEQ ID NO:18、4、12、和20所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site and a 3' exon, AA, a sequence encoding CVB3 IRES, a sequence encoding green fluorescent protein (EGFP), and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site and a 3' exon, the sequence encoding CVB3 IRES, the sequence encoding green fluorescent protein (EGFP), and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise the nucleotide sequences shown in SEQ ID NO: 3, 4, 12, and 9; or SEQ ID NO: 18, 4, 12, and 20.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、以及5’外显子和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、以及5’外显子和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:10、4、5、和11；或SEQ ID NO:21、4、5、和22所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a sequence encoding CVB3 IRES, a sequence encoding green fluorescent protein (EGFP), and a 5' exon and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked, wherein the 3' self-splicing intron fragment containing a 3' splice site, the sequence encoding CVB3 IRES, the sequence encoding green fluorescent protein (EGFP), and the 5' exon and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise the nucleotide sequences shown in SEQ ID NO: 10, 4, 5, and 11; or SEQ ID NO: 21, 4, 5, and 22.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、AA、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的序列、编码绿色荧光蛋白(EGFP)的序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:10、4、12、和13；或SEQ ID NO:21、4、12、和23所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, AA, a sequence encoding CVB3 IRES, a sequence encoding green fluorescent protein (EGFP), and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked, wherein the 3' self-splicing intron fragment containing a 3' splice site, the sequence encoding CVB3 IRES, the sequence encoding green fluorescent protein (EGFP), and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise the nucleotide sequences shown in SEQ ID NO: 10, 4, 12, and 13; or SEQ ID NO: 21, 4, 12, and 23.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、AA、编码CVB3 IRES的序列、编码荧光素酶(Fluc)的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的序列、编码荧光素酶(Fluc)的序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:10、4、29、和30所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, AA, a sequence encoding CVB3 IRES, a sequence encoding luciferase (Fluc), and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the sequence encoding CVB3 IRES, the sequence encoding luciferase (Fluc), and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 10, 4, 29, and 30.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、一个或多个编码目的肽或蛋白的开放阅读框、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段分别得自炭疽杆菌RecA基因中自剪接型内含子的3'序列和5'序列或得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子的3'序列和5'序列。在一些实施方案中，包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段可以包含如下核苷酸序列：(1)SEQ ID NO:10和SEQ ID NO:31；(2)SEQ ID NO:21和SEQ ID NO:39；或(3)SEQ ID NO:21和SEQ ID NO:48。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, one or more open reading frames encoding a target peptide or protein, a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the 3' self-splicing intron fragment containing the 3' splice site and the 5' self-splicing intron fragment containing the 5' splice site are respectively derived from the 3' and 5' sequences of the self-splicing intron in the RecA gene of Bacillus anthracis or from the 3' and 5' sequences of the self-splicing intron in the 23S ribosomal gene of Coxiella behnea. In some embodiments, the 3' self-splicing intron fragment containing a 3' splice site and the 5' self-splicing intron fragment containing a 5' splice site may comprise the following nucleotide sequences: (1) SEQ ID NO:10 and SEQ ID NO:31; (2) SEQ ID NO:21 and SEQ ID NO:39; or (3) SEQ ID NO:21 and SEQ ID NO:48.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EVB107 IRES的3’序列、一个或多个编码目的肽或蛋白的开放阅读框、编码EVB107 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段分别得自炭疽杆菌RecA基因中自剪接型内含子的3'序列和5'序列或得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子的3'序列和5'序列。在一些实施方案中，包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段可以分别包含SEQ ID NO:21和SEQ ID NO:57所示核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EVB107 IRES, one or more open reading frames encoding a target peptide or protein, a 5' sequence encoding EVB107 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the 3' self-splicing intron fragment containing the 3' splice site and the 5' self-splicing intron fragment containing the 5' splice site are respectively derived from the 3' and 5' sequences of the self-splicing intron in the RecA gene of Bacillus anthracis or from the 3' and 5' sequences of the self-splicing intron in the 23S ribosomal gene of Coxiella behnea. In some embodiments, the 3' self-splicing intron fragment containing a 3' splice site and the 5' self-splicing intron fragment containing a 5' splice site may contain the nucleotide sequences shown in SEQ ID NO:21 and SEQ ID NO:57, respectively.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码HRVB3 IRES的3’序列、一个或多个编码目的肽或蛋白的开放阅读框、编码HRVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段分别得自炭疽杆菌RecA基因中自剪接型内含子的3'序列和5'序列或得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子的3'序列和5'序列。在一些实施方案中，包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段可以分别包含SEQ ID NO:21和SEQ ID NO:64所示核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding HRVB3 IRES, one or more open reading frames encoding a target peptide or protein, a 5' sequence encoding HRVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the 3' self-splicing intron fragment containing the 3' splice site and the 5' self-splicing intron fragment containing the 5' splice site are respectively derived from the 3' and 5' sequences of the self-splicing intron in the RecA gene of Bacillus anthracis or from the 3' and 5' sequences of the self-splicing intron in the 23S ribosomal gene of Coxiella behnea. In some embodiments, the 3' self-splicing intron fragment containing a 3' splice site and the 5' self-splicing intron fragment containing a 5' splice site may contain the nucleotide sequences shown in SEQ ID NO:21 and SEQ ID NO:64, respectively.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EV-AIRES的3’序列、一个或多个编码目的肽或蛋白的开放阅读框、编码EV-AIRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段分别得自炭疽杆菌RecA基因中自剪接型内含子的3'序列和5'序列或得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子的3'序列和5'序列。在一些实施方案中，包含3'剪接位点的3'自剪接型内含子片段和包含5'剪接位点的5'自剪接型内含子片段可以分别包含SEQ ID NO:21和SEQ ID NO:67所示核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise, from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EV-AIRES, one or more open reading frames encoding a target peptide or protein, a 5' sequence encoding EV-AIRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the 3' self-splicing intron fragment containing the 3' splice site and the 5' self-splicing intron fragment containing the 5' splice site are derived from the 3' and 5' sequences of the self-splicing intron in the *Bacillus anthracis* RecA gene or from the 3' and 5' sequences of the self-splicing intron in the *Coxiella belladonna* 23S ribosome gene, respectively. In some embodiments, the 3' self-splicing intron fragment containing the 3' splice site and the 5' self-splicing intron fragment containing the 5' splice site may comprise the nucleotide sequences shown in SEQ ID NO:21 and SEQ ID NO:67, respectively.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码荧光素酶(Fluc)的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码荧光素酶(Fluc)的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:10、26、28、27、和31所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, a sequence encoding luciferase (Fluc), a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding CVB3 IRES, the sequence encoding luciferase (Fluc), the 5' sequence encoding CVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 10, 26, 28, 27, and 31.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码荧光素酶(Fluc)的3’序列、编码CVB3IRES的序列、编码荧光素酶(Fluc)的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码荧光素酶(Fluc)的3’序列、编码CVB3 IRES的序列、编码荧光素酶(Fluc)的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:10、32、4、33和34所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding luciferase (Fluc), a sequence encoding CVB3IRES, a 5' sequence encoding luciferase (Fluc), and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding luciferase (Fluc), the sequence encoding CVB3IRES, the 5' sequence encoding luciferase (Fluc), and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 10, 32, 4, 33, and 34.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、40、5、41、和39所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, a sequence encoding green fluorescent protein EGFP, a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding CVB3 IRES, the sequence encoding green fluorescent protein EGFP, the 5' sequence encoding CVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise the nucleotide sequences shown in SEQ ID NO: 21, 40, 5, 41, and 39.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、46、49、47、和48所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, a sequence encoding Gaussian luciferase Gluc, a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding CVB3 IRES, the sequence encoding Gaussian luciferase Gluc, the 5' sequence encoding CVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 46, 49, 47, and 48.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、46、5、47、和48所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, a sequence encoding green fluorescent protein EGFP, a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding CVB3 IRES, the sequence encoding green fluorescent protein EGFP, the 5' sequence encoding CVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 46, 5, 47, and 48.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码荧光素酶(Fluc)的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码CVB3 IRES的3’序列、编码荧光素酶(Fluc)的序列、编码CVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、46、28、47、和48所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding CVB3 IRES, a sequence encoding luciferase (Fluc), a 5' sequence encoding CVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding CVB3 IRES, the sequence encoding luciferase (Fluc), the 5' sequence encoding CVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 46, 28, 47, and 48.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EVB107 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码EVB107 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码EVB107 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码EVB107IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、55、5、56、和57所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EVB107 IRES, a sequence encoding green fluorescent protein EGFP, a 5' sequence encoding EVB107 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding EVB107 IRES, the sequence encoding green fluorescent protein EGFP, the 5' sequence encoding EVB107 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 55, 5, 56, and 57.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EVB107 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码EVB107 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码EVB107 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码EVB107 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、55、49、56、和57所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EVB107 IRES, a sequence encoding Gaussian luciferase Gluc, a 5' sequence encoding EVB107 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding EVB107 IRES, the sequence encoding Gaussian luciferase Gluc, the 5' sequence encoding EVB107 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 55, 49, 56, and 57.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码HRVB3 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码HRVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码HRVB3 IRES的3’序列、编码高斯荧光素酶Gluc的序列、编码HRVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、62、49、63、和64所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding HRVB3 IRES, a sequence encoding Gaussian luciferase Gluc, a 5' sequence encoding HRVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding HRVB3 IRES, the sequence encoding Gaussian luciferase Gluc, the 5' sequence encoding HRVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 62, 49, 63, and 64.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码HRVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码HRVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码HRVB3 IRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码HRVB3 IRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、62、5、63、和64所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding HRVB3 IRES, a sequence encoding green fluorescent protein EGFP, a 5' sequence encoding HRVB3 IRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding HRVB3 IRES, the sequence encoding green fluorescent protein EGFP, the 5' sequence encoding HRVB3 IRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise the nucleotide sequences shown in SEQ ID NO: 21, 62, 5, 63, and 64.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EV-AIRES的3’序列、编码高斯荧光素酶Gluc的序列、编码EV-AIRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码EV-AIRES的3’序列、编码高斯荧光素酶Gluc的序列、编码EV-AIRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、65、49、66、和67所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EV-AIRES, a sequence encoding Gaussian luciferase Gluc, a 5' sequence encoding EV-AIRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding EV-AIRES, the sequence encoding Gaussian luciferase Gluc, the 5' sequence encoding EV-AIRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 65, 49, 66, and 67.

在一些实施方式中，本申请的单链DNA分子可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、编码EV-AIRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码EV-AIRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接，其中包含3'剪接位点的3'自剪接型内含子片段、编码EV-AIRES的3’序列、编码绿色荧光蛋白EGFP的序列、编码EV-AIRES的5’序列、和包含5'剪接位点的5'自剪接型内含子片段分别包含如SEQ ID NO:21、65、5、66、和67所示的核苷酸序列。In some embodiments, the single-stranded DNA molecule of this application may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a 3' sequence encoding EV-AIRES, a sequence encoding green fluorescent protein EGFP, a 5' sequence encoding EV-AIRES, and a 5' self-splicing intron fragment containing a 5' splice site, wherein each element is operatively linked, and the 3' self-splicing intron fragment containing a 3' splice site, the 3' sequence encoding EV-AIRES, the sequence encoding green fluorescent protein EGFP, the 5' sequence encoding EV-AIRES, and the 5' self-splicing intron fragment containing a 5' splice site respectively comprise nucleotide sequences as shown in SEQ ID NO: 21, 65, 5, 66, and 67.

本申请还提供一种双链DNA分子，其可以包含i)本申请的单链DNA分子、以及ii)与该单链DNA分子互补的第二链。在一些实施方式中，本申请的双链DNA分子可以包含i)本申请的单链DNA分子、以及ii)与该单链DNA分子完全互补的第二链。在一些实施方式中，双链DNA分子为用于制备环状RNA的双链DNA分子。This application also provides a double-stranded DNA molecule, which may comprise i) the single-stranded DNA molecule of this application, and ii) a second strand complementary to the single-stranded DNA molecule. In some embodiments, the double-stranded DNA molecule of this application may comprise i) the single-stranded DNA molecule of this application, and ii) a second strand completely complementary to the single-stranded DNA molecule. In some embodiments, the double-stranded DNA molecule is a double-stranded DNA molecule used for preparing circular RNA.

本申请还提供一种载体，其包含本申请的单链DNA分子、或双链DNA分子。在一些实施方式中，载体可以是用于制备环状RNA的载体，包含本申请的用于制备环状RNA的单链DNA分子、或用于制备环状RNA的双链DNA分子。载体可以是环状的、或线性的。在一些实施方式中，载体可以是线性的。在一些实施方式中，载体可以是环状的、并经处理成为线性的。本申请的载体可以转录出约500至约10000nt的环状RNA。This application also provides a vector comprising the single-stranded DNA molecule or double-stranded DNA molecule of this application. In some embodiments, the vector may be a vector for preparing circular RNA, comprising the single-stranded DNA molecule of this application for preparing circular RNA, or the double-stranded DNA molecule for preparing circular RNA. The vector may be circular or linear. In some embodiments, the vector may be linear. In some embodiments, the vector may be circular and processed to become linear. The vector of this application can transcribe approximately 500 to approximately 10,000 nt of circular RNA.

在第二个方面，本申请提供一种可环化的RNA分子，其可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、目的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接。In a second aspect, this application provides a circularizable RNA molecule that may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

3'自剪接型内含子片段和5'自剪接型内含子片段能够使得该RNA分子环化并在环化过程中从该RNA分子中去除。3'自剪接型内含子片段和5'自剪接型内含子片段可以得自同一个自剪接型内含子，尤其是I类自剪接型内含子。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自炭疽杆菌recA基因中的自剪接型内含子、或贝氏柯克斯体23S核糖体基因中的自剪接型内含子。5'自剪接型内含子片段和3'自剪接型内含子片段，当以此顺序排布时，可以连接形成有自剪接功能的自剪接型内含子，例如完整的自剪接型内含子。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自炭疽杆菌RecA基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:10所示的核苷酸序列。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:21所示的核苷酸序列。The 3' and 5' self-splicing intron fragments enable the RNA molecule to circularize and be removed from the RNA molecule during the circularization process. The 3' and 5' self-splicing intron fragments can be derived from the same self-splicing intron, particularly a type I self-splicing intron. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Bacillus anthracis* recA gene or the self-splicing intron in the *Coxiella belladonna* 23S ribosome gene. When arranged in this order, the 5' and 3' self-splicing intron fragments can be joined to form a self-splicing intron with self-splicing function, such as a complete self-splicing intron. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing introns of the *Bacillus anthracis* RecA gene, and the 3' self-splicing intron fragment may contain the nucleotide sequence shown in SEQ ID NO: 10. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing introns of the *Coxiella belladonna* 23S ribosome gene, and the 3' self-splicing intron fragment may contain the nucleotide sequence shown in SEQ ID NO: 21.

内部引导序列的5'端序列可以与目的序列的5'端序列反向互补。内部引导序列的5'端序列可以与目的序列的5'端的任意数量的核苷酸反向互补。例如，内部引导序列的5'端序列可以与目的序列的5'端的2-15nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14或15nt)，例如2-12nt核苷酸反向互补。例如，内部引导序列的5'端序列可以与目的序列的5'端的3-6nt(例如3、4、5或6nt)核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，本申请的RNA分子还可以在5'自剪接型内含子片段的5'侧包含天然相邻外显子或对应于天然相邻外显子的序列。该天然相邻外显子可以包含部分或截短的序列。天然相邻外显子或对应于天然相邻外显子的序列可以包含长度3-20nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)的核苷酸，例如3-10nt。本申请的单链DNA分子可以至少不在3'自剪接型内含子片段的3'侧包含天然相邻外显子或对应于天然相邻外显子的序列。The 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence. The 5' end sequence of the internal guide sequence may be anticomplementary to any number of nucleotides at the 5' end of the target sequence. For example, the 5' end sequence of the internal guide sequence may be anticomplementary to 2-15 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 nt) nucleotides at the 5' end of the target sequence, such as 2-12 nt nucleotides. For example, the 5' end sequence of the internal guide sequence may be anticomplementary to 3-6 nt (e.g., 3, 4, 5, or 6 nt) nucleotides at the 5' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence, and the RNA molecule of this application may also include a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 5' side of the 5' self-splicing intron fragment. This naturally adjacent exon may contain a partial or truncated sequence. Naturally adjacent exons or sequences corresponding to naturally adjacent exons may contain nucleotides of length 3-20 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt), such as 3-10 nt. The single-stranded DNA molecules of this application may at least not contain naturally adjacent exons or sequences corresponding to naturally adjacent exons on the 3' side of the 3' self-splicing intron fragment.

内部引导序列的3’端序列可以与目的序列的3'端序列反向互补。内部引导序列的3’端序列可以以碱基为G的核苷酸为起始，并与目的序列的3'端序列反向互补。目的序列的3'端序列末尾可以是碱基为U的核苷酸。内部引导序列的3'端序列可以与目的序列的3'端的任意数量的核苷酸反向互补。例如，内部引导序列的3'端序列可以与目的序列的3'端的3-20nt(例如3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)，例如3-15nt核苷酸反向互补。在一些实施方式中，内部引导序列的3'端序列可以与目的序列的3'端的3-10nt，例如3-9nt核苷酸反向互补。在一些实施方式中，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列末尾开始反向互补，且目的序列的3'端序列末尾可以是碱基为U的核苷酸，本申请的RNA分子还可以在3'自剪接型内含子片段的3'侧包含天然相邻外显子或对应于天然相邻外显子的序列。在一些实施方式中，内部引导序列的3'端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列反向互补，其中该目的序列的3'端序列末尾可以是碱基为U的核苷酸，该内部引导序列的3'端序列碱基为G的起始核苷酸能够与目的序列的3'端序列末尾的碱基为U的核苷酸形成G:U配对。The 3' end sequence of the internal guide sequence may be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the internal guide sequence may begin with a G-base nucleotide and be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the target sequence may end with a U-base nucleotide. The 3' end sequence of the internal guide sequence may be anticomplementary to any number of nucleotides at the 3' end of the target sequence. For example, the 3' end sequence of the internal guide sequence may be anticomplementary to 3-20 nt (e.g., 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt) of the 3' end of the target sequence, for example, 3-15 nt nucleotides. In some embodiments, the 3' end sequence of the internal guide sequence may be anticomplementary to 3-10 nt (e.g., 3-9 nt) of the 3' end of the target sequence. In some embodiments, the 3' end sequence of the internal guide sequence may begin with a G-base nucleotide and be reverse complementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence may be a U-base nucleotide. The RNA molecule of this application may also contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon on the 3' side of the 3' self-splicing intron fragment. In some embodiments, the 3' end sequence of the internal guide sequence may begin with a G-base nucleotide and be reverse complementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence may be a U-base nucleotide. The G-base initiating nucleotide of the internal guide sequence can form a G:U pair with the U-base nucleotide at the 3' end sequence of the target sequence.

该天然相邻外显子可以包含部分或截短的序列。天然相邻外显子或对应于天然相邻外显子的序列可以包含长度3-20nt(例如2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20nt)的核苷酸，例如3-10nt。本申请的单链DNA分子可以至少不在5'自剪接型内含子片段的5'侧包含天然相邻外显子或对应于天然相邻外显子的序列。The naturally adjacent exon may contain a partial or truncated sequence. The naturally adjacent exon, or the sequence corresponding to the naturally adjacent exon, may contain nucleotides of length 3-20 nt (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nt), for example, 3-10 nt. The single-stranded DNA molecule of this application may at least not contain a naturally adjacent exon or the sequence corresponding to the naturally adjacent exon on the 5' side of the 5' self-splicing intron fragment.

在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，内部引导序列的3’端序列可以与目的序列的3'端序列反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列末尾开始反向互补，且目的序列在3'端序列末尾可以是碱基为U的核苷酸。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端序列反向互补，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始并与目的序列的3'端序列反向互补，且目的序列在3'端序列末尾可以是碱基为U的核苷酸，该内部引导序列3'端序列碱基为G的起始核苷酸能够与目的序列的3'端序列末尾的碱基为U的核苷酸形成G:U配对。本申请的RNA分子可以不在5'自剪接型内含子片段的5'端包含天然相邻外显子或对应于天然相邻外显子的序列，且可以不在3'自剪接型内含子片段的3'端包含天然相邻外显子或对应于天然相邻外显子的序列。In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence, and the 3' end sequence of the internal guide sequence may be anticomplementary to the 3' end sequence of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence, the 3' end sequence of the internal guide sequence may begin with a G-base and be anticomplementary to the 3' end sequence of the target sequence, and the 3' end sequence of the target sequence may be a U-base. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence, the 3' end sequence of the internal guide sequence may begin with a G-base and be anticomplementary to the 3' end sequence of the target sequence, and the 3' end sequence of the target sequence may be a U-base, wherein the G-base initiating nucleotide of the internal guide sequence's 3' end sequence can form a G:U pair with the U-base at the end of the target sequence's 3' end sequence. The RNA molecule of this application may not contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon at the 5' end of the 5' self-splicing intron fragment, and may not contain a naturally adjacent exon or a sequence corresponding to a naturally adjacent exon at the 3' end of the 3' self-splicing intron fragment.

目的序列可以包含编码目的肽或蛋白的开放阅读框、和翻译功能元件。翻译功能元件可以选自翻译起始元件、和翻译增强元件。翻译起始元件可以是开启RNA翻译的序列，例如内部核糖体进入位点(IRES)。翻译增强元件可以是增强RNA翻译的序列，例如poly(A)、poly(C)、非cap依赖性翻译增强子(CITE)、或EIF4配体家族识别序列等。在本申请可环化RNA制备的环状RNA分子中，翻译增强元件可以位于编码目的肽或蛋白的开放阅读框的5'侧或3'侧。在本申请可环化RNA制备的环状RNA分子中，翻译增强元件可以位于翻译起始元件的5'侧或3'侧。在一些实施方式中，翻译功能元件可以包含翻译起始元件。在一些实施方式中，翻译功能元件可以为翻译起始元件。在一些实施方式中，翻译起始元件可以为IRES。The target sequence may contain an open reading frame encoding a target peptide or protein and a translational functional element. The translational functional element may be selected from translation initiation elements and translation enhancement elements. The translation initiation element may be a sequence that initiates RNA translation, such as an internal ribosome entry site (IRES). The translation enhancement element may be a sequence that enhances RNA translation, such as poly(A), poly(C), cap-independent translation enhancer (CITE), or EIF4 ligand family recognition sequences. In the circular RNA molecules prepared from circularizable RNA of this application, the translation enhancement element may be located on the 5' or 3' side of the open reading frame encoding the target peptide or protein. In the circular RNA molecules prepared from circularizable RNA of this application, the translation enhancement element may be located on the 5' or 3' side of the translation initiation element. In some embodiments, the translational functional element may include a translation initiation element. In some embodiments, the translational functional element may be a translation initiation element. In some embodiments, the translation initiation element may be an IRES.

目的序列可以包含编码目的肽或蛋白的开放阅读框、和内部核糖体进入位点(IRES)。目的序列可以从5'端到3'端包含(a)该翻译功能元件(例如IRES)、和该编码目的肽或蛋白的开放阅读框，(b)该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)，(c)该翻译功能元件(例如IRES)的3'端序列、该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的5'端序列，其中该翻译功能元件的5'端序列和该翻译功能元件的3'端序列在以此顺序排列时形成该翻译功能元件，或(d)该编码目的肽或蛋白的开放阅读框的3'端序列、该翻译功能元件(例如IRES)、和该编码目的肽或蛋白的开放阅读框的5'端序列，其中该编码目的肽或蛋白的开放阅读框的5'端序列和该编码目的肽或蛋白的开放阅读框的3'端序列在以此顺序排列时形成该编码目的肽或蛋白的开放阅读框。在一些实施方式中，目的序列可以包含一个或多个编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)。目的序列可以从5'端到3'端包含(a)该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)，(b)该翻译功能元件(例如IRES)、和该一个或多个编码目的肽或蛋白的开放阅读框，(c)该翻译功能元件(例如IRES)的3'端序列、该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的5'端序列，或(d)其中一个编码目的肽或蛋白的开放阅读框的3'端序列、其他编码目的肽或蛋白的开放阅读框(如存在的情况下)、该翻译功能元件(例如IRES)、和其中一个编码目的肽或蛋白的开放阅读框的5'端序列。The target sequence may contain an open reading frame encoding the target peptide or protein, and an internal ribosome entry site (IRES). The target sequence may include, from the 5' end to the 3' end, (a) the translational functional element (e.g., IRES) and the open reading frame encoding the target peptide or protein, (b) the open reading frame encoding the target peptide or protein and the translational functional element (e.g., IRES), (c) the 3' end sequence of the translational functional element (e.g., IRES), the open reading frame encoding the target peptide or protein, and the 5' end sequence of the translational functional element (e.g., IRES), wherein the 5' end sequence and the 3' end sequence of the translational functional element form the translational functional element when arranged in this order, or (d) the 3' end sequence of the open reading frame encoding the target peptide or protein, the translational functional element (e.g., IRES), and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence and the 3' end sequence of the open reading frame encoding the target peptide or protein form the open reading frame encoding the target peptide or protein when arranged in this order. In some embodiments, the target sequence may comprise one or more open reading frames encoding a target peptide or protein, and a translational functional element (e.g., IRES). The target sequence may comprise, from its 5' end to its 3' end, (a) the one or more open reading frames encoding the target peptide or protein, and the translational functional element (e.g., IRES); (b) the translational functional element (e.g., IRES), and the one or more open reading frames encoding the target peptide or protein; (c) the 3' end sequence of the translational functional element (e.g., IRES), the one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of the translational functional element (e.g., IRES); or (d) the 3' end sequence of one of the open reading frames encoding the target peptide or protein, other open reading frames encoding the target peptide or protein (if present), the translational functional element (e.g., IRES), and the 5' end sequence of one of the open reading frames encoding the target peptide or protein.

目的序列可以包含非编码RNA。目的序列可以(a)包含该非编码RNA，或(b)从5'端到3'端包含该非编码RNA的3'端序列、和该非编码RNA的5'端序列，其中该非编码RNA序列的5'端序列和该非编码RNA序列的3'端序列在以此顺序排列时形成该非编码RNA序列。The target sequence may contain non-coding RNA. The target sequence may (a) contain the non-coding RNA, or (b) contain the 3' end sequence of the non-coding RNA and the 5' end sequence of the non-coding RNA from the 5' end to the 3' end, wherein the 5' end sequence of the non-coding RNA and the 3' end sequence of the non-coding RNA form the non-coding RNA sequence when arranged in this order.

目的序列的长度可以为50-10000个核苷酸。目的序列的长度也可以为50-8000个核苷酸。例如，可以是200个、500个、1000个、1500个、2000个、3000个、4000个、5000个、6000个、7000个、8000个、9000个或10000个核苷酸。The target sequence can be 50-10000 nucleotides in length. Alternatively, it can be 50-8000 nucleotides. For example, it can be 200, 500, 1000, 1500, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000, or 10000 nucleotides.

目的肽或蛋白可以是真核或原核来源的肽或蛋白。目的肽或蛋白可以是人源或非人源的肽或蛋白。在一些实施方式中，目的肽或蛋白可以是抗原蛋白、抗体、或Cas9核酸内切酶等。在一些实施方式中，目的肽或蛋白可以是萤火虫荧光素酶、长腹水蚤荧光素酶、高斯荧光素酶、绿色荧光蛋白等。The target peptide or protein can be of eukaryotic or prokaryotic origin. It can be human or non-human. In some embodiments, the target peptide or protein can be an antigen protein, antibody, or Cas9 endonuclease, etc. In some embodiments, the target peptide or protein can be firefly luciferase, long-horned luciferase, Gaussian luciferase, green fluorescent protein, etc.

内部核糖体进入位点(IRES)可以为源自柯萨奇病毒B3(CVB3)、肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)、肠病毒A(EV-A)、人鼻病毒B6(HRVB6)、柯萨奇病毒A(CVB1/2)、Taura综合征病毒、吸血猎蝽病毒、泰累尔氏脑脊髓炎病毒、猿猴病毒40、红火蚁病毒1、禾谷缢管蚜病毒、网状内皮增生症病毒、福曼脊髓灰质炎病毒1、大豆尺蠖病毒、克什米尔蜂病毒、人鼻病毒2、琉璃叶蝉病毒-1、人免疫缺陷病毒1型、琉璃叶蝉病毒-1、虱P病毒、丙型肝炎病毒、甲型肝炎病毒、GB型肝炎病毒、口蹄疫病毒、人肠道病毒71、马鼻病毒、茶尺蠖样病毒、脑心肌炎病毒(EMCV)、果蝇C病毒、十字花科烟草病毒、蟋蟀麻痹病毒、牛病毒性腹泻病毒1、黑皇后细胞病毒、蚜虫致死麻痹病毒、禽脑脊髓炎病毒、急性蜂麻痹病毒、芙蓉枯黄环斑病毒、猪瘟病毒、人类FGF2、人类SFTPA1、人类AMLl/RUNXl、果蝇触角足、人类AQP4、人类AT1R、人类BAG-1、人类BCL2、人类BiP、人类c-IAPl、人类c-myc、人类eIF4G、小鼠NDST4L、人类LEF1、小鼠HIF1α、人类n.myc、小鼠Gtx、人类p27kipl、人类PDGF2/c-sis、人类p53、人类Pim-1、小鼠Rbm3、果蝇reaper、犬Scamper、果蝇Ubx、唾液病毒、柯萨病毒、双埃柯病毒、人类UNR、小鼠UtrA、人类VEGF-A、人类XIAP、果蝇hairless、酿酒酵母TFIID、酿酒酵母YAP1、人类c-src、人类FGF-1、猿猴小核糖核酸病毒、芜菁绉缩病病毒、或eIF4G适体的IRES。在一些实施方式中，IRES可以选自柯萨奇病毒B3(CVB3)、肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)或人鼻病毒B6(HRVB6)的IRES。在一些实施方式中，IRES可以为柯萨奇病毒B3(CVB3)的IRES。编码IRES的序列可以包含与SEQ ID NO:4具有至少60％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为肠病毒B107(EVB107)的IRES。编码IRES的序列可以包含与SEQ ID NO:61具有至少60％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为人鼻病毒B3(HRVB3)的IRES。编码IRES的序列可以包括与SEQ ID NO:74具有至少60％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。在一些实施方式中，IRES可以为肠病毒A(EV-A)的IRES。编码IRES的序列可以包括与SEQ ID NO:75具有至少60％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。Internal ribosome entry sites (IRES) can originate from Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human rhinovirus B3 (HRVB3), Enterovirus A (EV-A), Human rhinovirus B6 (HRVB6), Coxsackievirus A (CVB1/2), Taura syndrome virus, blood-sucking assassin bug virus, Tyrell's encephalomyelitis virus, simian virus 40, red imported fire ant virus 1, rice constrictor aphid virus, reticuloendotheliosis virus, and Forman poliovirus. Toxin 1, Soybean Looper Virus, Kashmir Bee Virus, Human Rhinovirus 2, Glass Leafhopper Virus-1, Human Immunodeficiency Virus Type 1, Glass Leafhopper Virus-1, Lice P Virus, Hepatitis C Virus, Hepatitis A Virus, GB Hepatitis Virus, Foot-and-Mouth Disease Virus, Human Enterovirus 71, Equine Rhinovirus, Tea Looper-like Virus, Encephalomyelitis Virus (EMCV), Fruit Fly C Virus, Cruciferous Tobacco Virus, Cricket Paralysis Virus, Bovine Viral Diarrhea Virus 1, Black Queen Cell Virus, Aphid Lethal Paralysis Virus, Avian Encephalomyelitis Virus Acute bee paralysis virus, Hibiscus rosa-spot virus, classical swine fever virus, human FGF2, human SFTPA1, human AML1/RUNX1, Drosophila antennae and legs, human AQP4, human AT1R, human BAG-1, human BCL2, human BiP, human c-IAPl, human c-myc, human eIF4G, mouse NDST4L, human LEF1, mouse HIF1α, human n.myc, mouse Gtx, human p27kipl, human PDGF2/ c-sis, human p53, human Pim-1, mouse Rbm3, fruit fly reaper, canine scamper, fruit fly UBX, salivary virus, Coxsackievirus, bi-echovirus, human UNR, mouse UtrA, human VEGF-A, human XIAP, fruit fly hairless, Saccharomyces cerevisiae TFIID, Saccharomyces cerevisiae YAP1, human c-src, human FGF-1, simian microRNA virus, turnip shrunkenness virus, or IRES of eIF4G aptamers. In some embodiments, the IRES may be selected from IRES of Coxsackievirus B3 (CVB3), enterovirus B107 (EVB107), human rhinovirus B3 (HRVB3), or human rhinovirus B6 (HRVB6). In some embodiments, the IRES may be an IRES of Coxsackievirus B3 (CVB3). The sequence encoding IRES may comprise a nucleotide sequence having at least 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:4. In some embodiments, IRES may be an IRES of enterovirus B107 (EVB107). The sequence encoding IRES may comprise a nucleotide sequence having at least 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:61. In some embodiments, IRES may be an IRES of human rhinovirus B3 (HRVB3). The sequence encoding IRES may include a nucleotide sequence having at least 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:74. In some embodiments, IRES may be an IRES of enterovirus A (EV-A). The sequence encoding IRES may include a nucleotide sequence having at least 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO:75.

在一些实施方式中，目的序列可以从5'端到3'端包含(a)IRES的3'端序列、一个或多个编码目的肽或蛋白的开放阅读框、和IRES的5'端序列，其中IRES为包含SEQ ID NO:4所示核苷酸序列的柯萨奇病毒B3(CVB3)的IRES。编码IRES的序列的3'端序列和编码IRES的序列的5'端序列分别包含SEQ ID NO:26和27所示的核苷酸序列，分别包含SEQ ID NO:40和41所示的核苷酸序列，或者分别包含SEQ ID NO:46和47所示的核苷酸序列。In some embodiments, the target sequence may include, from the 5' end to the 3' end, (a) the 3' end sequence of IRES, one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of IRES, wherein the IRES is the IRES of Coxsackievirus B3 (CVB3) containing the nucleotide sequence shown in SEQ ID NO:4. The 3' end sequence encoding IRES and the 5' end sequence encoding IRES may contain the nucleotide sequences shown in SEQ ID NO:26 and 27, respectively, the nucleotide sequences shown in SEQ ID NO:40 and 41, respectively, or the nucleotide sequences shown in SEQ ID NO:46 and 47, respectively.

在一些实施方式中，目的序列可以从5'端到3'端包含(a)IRES的3'端序列、一个或多个编码目的肽或蛋白的开放阅读框、和IRES的5'端序列，其中IRES为包含SEQ ID NO:61所示核苷酸序列的EVB107的IRES。编码IRES的序列的3'端序列和编码IRES的序列的5'端序列分别包含SEQ ID NO:55和56所示的核苷酸序列。In some embodiments, the target sequence may comprise, from the 5' end to the 3' end, (a) the 3' end sequence of IRES, one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of IRES, wherein the IRES is the IRES of EVB107 comprising the nucleotide sequence shown in SEQ ID NO:61. The 3' end sequence encoding the IRES and the 5' end sequence encoding the IRES comprise the nucleotide sequences shown in SEQ ID NO:55 and 56, respectively.

在一些实施方式中，目的序列可以从5'端到3'端包含(a)IRES的3'端序列、一个或多个编码目的肽或蛋白的开放阅读框、和IRES的5'端序列，其中IRES为包含SEQ ID NO:74所示核苷酸序列的HRVB3的IRES。编码IRES的序列的3'端序列和编码IRES的序列的5'端序列分别包含SEQ ID NO:62和63所示的核苷酸序列。In some embodiments, the target sequence may include, from the 5' end to the 3' end, (a) the 3' end sequence of IRES, one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of IRES, wherein the IRES is the IRES of HRVB3 comprising the nucleotide sequence shown in SEQ ID NO:74. The 3' end sequence encoding the IRES and the 5' end sequence encoding the IRES comprise the nucleotide sequences shown in SEQ ID NO:62 and 63, respectively.

在一些实施方式中，目的序列可以从5'端到3'端包含(a)IRES的3'端序列、一个或多个编码目的肽或蛋白的开放阅读框、和IRES的5'端序列，其中IRES为包含SEQ ID NO:75所示核苷酸序列的EV-A的IRES。编码IRES的序列的3'端序列和编码IRES的序列的5'端序列分别包含SEQ ID NO:65和66所示的核苷酸序列。In some embodiments, the target sequence may comprise, from the 5' end to the 3' end, (a) the 3' end sequence of the IRES, one or more open reading frames encoding the target peptide or protein, and the 5' end sequence of the IRES, wherein the IRES is the IRES of EV-A comprising the nucleotide sequence shown in SEQ ID NO:75. The 3' end sequence encoding the IRES and the 5' end sequence encoding the IRES comprise the nucleotide sequences shown in SEQ ID NO:65 and 66, respectively.

在一些实施方式中，该编码目的肽或蛋白的开放阅读框为编码绿色荧光蛋白的开放阅读框，其中编码绿色荧光蛋白的开放阅读框包含SEQ ID NO:5所示的核苷酸序列。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、IRES、和编码目的肽或蛋白的开放阅读框的5'端序列，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:12所示的核苷酸。In some embodiments, the open reading frame encoding the target peptide or protein is an open reading frame encoding green fluorescent protein, wherein the open reading frame encoding green fluorescent protein comprises the nucleotide sequence shown in SEQ ID NO:5. In some embodiments, the target sequence may comprise, from the 5' end to the 3' end, the 3' end sequence encoding the target peptide or protein, IRES, and the 5' end sequence encoding the target peptide or protein, wherein the 3' end sequence encoding the target peptide or protein and the 5' end sequence encoding the target peptide or protein may respectively comprise the nucleotides shown in SEQ ID NO:12.

在一些实施方式中，编码目的肽或蛋白的开放阅读框可以为编码荧光素酶的开放阅读框，其中编码荧光素酶的开放阅读框可以包含SEQ ID NO:28所示的核苷酸序列。在一些实施方式中，目的序列可以从5'端到3'端包含编码目的肽或蛋白的开放阅读框的3'端序列、IRES、和编码目的肽或蛋白的开放阅读框的5'端序列，其中编码目的肽或蛋白的开放阅读框的3'端序列、和编码目的肽或蛋白的开放阅读框的5'端序列可以分别包含AA和SEQ ID NO:29，或者SEQ ID NO:32和33所示的核苷酸序列。In some embodiments, the open reading frame encoding the target peptide or protein may be an open reading frame encoding luciferase, wherein the open reading frame encoding luciferase may contain the nucleotide sequence shown in SEQ ID NO:28. In some embodiments, the target sequence may contain, from the 5' end to the 3' end, the 3' end sequence encoding the target peptide or protein, IRES, and the 5' end sequence encoding the target peptide or protein, wherein the 3' end sequence encoding the target peptide or protein and the 5' end sequence encoding the target peptide or protein may respectively contain AA and SEQ ID NO:29, or the nucleotide sequences shown in SEQ ID NO:32 and 33.

本申请的RNA分子还可以在3'自剪接型内含子片段的上游或5'侧包含5'同源臂，以及在5'自剪接型内含子片段的下游或3'侧包含3'同源臂。在一些实施方式中，RNA分子可以从5'端到3'端依次包含5'同源臂、包含3'剪接位点的3'自剪接型内含子片段、目的序列、包含5'剪接位点的5'自剪接型内含子片段、和3'同源臂。5'同源臂和3'同源臂可以互补配对，例如形成至少约15％、25％、35％、45％、55％、65％、75％、80％、85％、90％、95％、96％、97％、98％、99％或100％碱基配对。在一些实施方式中，5'同源臂和3'同源臂可以互补形成含有一个回文茎和一个茎-环的结构，其中回文茎与茎-环的茎部连接，其中5'同源臂的5'端和3'同源臂的3'端在回文茎部相邻。在一些实施方式中，5'同源臂和3'同源臂可以互补形成含有一个回文茎和两个茎-环的结构，其中回文茎与两个茎-环的茎部连接，两个茎-环的茎部连接，其中5'同源臂的5'端和3'同源臂的3'端在回文茎或其中一个茎-环的环部相邻。在一些实施方式中，5'同源臂和3'同源臂可以分别包含与SEQ ID NO:2和7具有至少85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％或100％序列同一性的核苷酸序列。The RNA molecule of this application may also include a 5' homologous arm upstream of or 5' to the 3' self-splicing intron fragment, and a 3' homologous arm downstream of or 3' to the 5' self-splicing intron fragment. In some embodiments, the RNA molecule may sequentially include a 5' homologous arm, a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, a 5' self-splicing intron fragment containing a 5' splice site, and a 3' homologous arm from the 5' end to the 3' end. The 5' homologous arm and the 3' homologous arm may be complementary, for example, forming at least about 15%, 25%, 35%, 45%, 55%, 65%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% base pairing. In some embodiments, the 5' and 3' homologous arms can complementarily form a structure containing a palindromic stem and a stem-loop, wherein the palindromic stem is connected to the stem portion of the stem-loop, and the 5' end of the 5' homologous arm and the 3' end of the 3' homologous arm are adjacent at the palindromic stem portion. In some embodiments, the 5' and 3' homologous arms can complementarily form a structure containing a palindromic stem and two stem-loops, wherein the palindromic stem is connected to the stem portions of two stem-loops, and the stem portions of two stem-loops are connected, wherein the 5' end of the 5' homologous arm and the 3' end of the 3' homologous arm are adjacent at the loop portion of the palindromic stem or one of the stem-loops. In some embodiments, the 5' and 3' homologous arms can each comprise a nucleotide sequence having at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with SEQ ID NO: 2 and 7.

本申请的RNA分子可以在3'自剪接型内含子片段与目的序列之间包含5'间隔序列，和/或在目的序列与5'自剪接型内含子片段之间包含3'间隔序列。特别地，本申请的RNA分子可以不在3'自剪接型内含子片段与目的序列之间包含5'间隔序列，和/或不在目的序列与5'自剪接型内含子片段之间包含3'间隔序列。The RNA molecule of this application may include a 5' spacer sequence between the 3' self-splicing intron fragment and the target sequence, and/or include a 3' spacer sequence between the target sequence and the 5' self-splicing intron fragment. In particular, the RNA molecule of this application may not include a 5' spacer sequence between the 3' self-splicing intron fragment and the target sequence, and/or may not include a 3' spacer sequence between the target sequence and the 5' self-splicing intron fragment.

本申请的RNA分子，在一些实施方式中，可以是由本申请第一个方面的单链DNA分子的互补链、双链DNA分子或载体转录而来的。In some embodiments, the RNA molecule of this application may be transcribed from the complementary strand of a single-stranded DNA molecule, a double-stranded DNA molecule, or a vector of the first aspect of this application.

在第三个方面，本申请提供一种制备环状RNA的方法，包括i)在合适的条件下从本申请第一方面的单链DNA分子的互补链、双链DNA分子或载体中体外转录出RNA分子，以及ii)在合适的条件下孵育该RNA分子，其中第ii)步的合适条件包括存在镁离子和三磷酸鸟苷(GTP)。在一些实施方式中，合适的条件包括，在三磷酸鸟苷和Mg²⁺的存在下，约37℃孵育约2小时。在一些实施方式中，合适的条件包括，在三磷酸鸟苷和Mg²⁺的存在下，约37℃孵育约2小时，以及加入DNA酶I，约37℃孵育约30分钟。GTP的浓度可以为2mM，镁离子的浓度可以为10mM-20mM。In a third aspect, this application provides a method for preparing circular RNA, comprising i) in vitro transcription of RNA molecules from the complementary strand of a single-stranded DNA molecule, a double-stranded DNA molecule, or a vector of the first aspect of this application under suitable conditions, and ii) incubating the RNA molecule under suitable conditions, wherein the suitable conditions in step ii) include the presence of magnesium ions and guanosine triphosphate (GTP). In some embodiments, suitable conditions include incubation at approximately 37°C for approximately 2 hours in the presence of guanosine triphosphate and ^Mg²⁺ . In some embodiments, suitable conditions include incubation at approximately 37°C for approximately 2 hours in the presence of guanosine triphosphate and ^Mg²⁺ , and the addition of DNase I, followed by incubation at approximately 37°C for approximately 30 minutes. The concentration of GTP may be 2 mM, and the concentration of magnesium ions may be 10 mM-20 mM.

或者，本申请的制备环状RNA的方法可以包括，在合适的条件下孵育本申请第二方面的可环化RNA分子，其中合适的条件包括存在镁离子和三磷酸鸟苷(GTP)。在一些实施方式中，合适的条件包括，在三磷酸鸟苷和Mg²⁺的存在下，约37℃孵育约2小时。在一些实施方式中，合适的条件包括，在三磷酸鸟苷和Mg²⁺的存在下，约37℃孵育约2小时，以及加入DNA酶I，约37℃孵育约30分钟。GTP的浓度可以为2mM，镁离子的浓度可以为10mM-20mM。Alternatively, the method for preparing circular RNA according to this application may include incubating the circularizable RNA molecule of the second aspect of this application under suitable conditions, wherein suitable conditions include the presence of magnesium ions and guanosine triphosphate (GTP). In some embodiments, suitable conditions include incubation at approximately 37°C for approximately 2 hours in the presence of guanosine triphosphate and ^Mg²⁺ . In some embodiments, suitable conditions include incubation at approximately 37°C for approximately 2 hours in the presence of guanosine triphosphate and ^Mg²⁺ , and incubation at approximately 37°C for approximately 30 minutes with the addition of DNase I. The concentration of GTP may be 2 mM, and the concentration of magnesium ions may be 10 mM-20 mM.

本申请也保护由本申请的第一方面的单链DNA分子、双链DNA分子或载体或第二个方面的可环化RNA分子制备出的环状RNA，以及经本申请的制备环状RNA的方法制备出的环状RNA。This application also protects circular RNA prepared from single-stranded DNA molecules, double-stranded DNA molecules or vectors of the first aspect of this application or circularizable RNA molecules of the second aspect of this application, as well as circular RNA prepared by the method for preparing circular RNA of this application.

环状RNA分子可以在细胞内翻译蛋白、或作为有生物学活性的非编码RNA存在。环状RNA的长度可以为至少500、550、600、650、700、750、800、850、900、950、1000、1100、1200、1300、1400、1500、1600、1700、1800、1900、2000、2500、3000、3500、4000、4500、5000、5500、6000、6500、7000、7500、8000、8500、9000、9500个核苷酸。Circular RNA molecules can be translated into proteins within cells or exist as biologically active non-coding RNAs. The length of circular RNA can be at least 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, or 9500 nucleotides.

在第四个方面，本申请提供一种根据本申请第三方面制备环状RNA方法得到的环状RNA。在一些实施方式中，该环状RNA可以不包含与3'自剪接型内含子片段相邻的外显子或5'自剪接型内含子片段相邻的外显子。在另一些实施方式中，该环状RNA可以不包含与3'自剪接型内含子片段相邻的外显子和5'自剪接型内含子片段相邻的外显子。In a fourth aspect, this application provides a circular RNA obtained according to a method for preparing circular RNA according to a third aspect of this application. In some embodiments, the circular RNA may not contain exons adjacent to either the 3' self-splicing intron fragment or the 5' self-splicing intron fragment. In other embodiments, the circular RNA may not contain exons adjacent to either the 3' self-splicing intron fragment or the 5' self-splicing intron fragment.

本申请也提供一种环状RNA，该环状RNA可以包含目的序列、可选择地翻译功能元件以及可选择地与3'自剪接型内含子片段相邻的外显子或5'自剪接型内含子片段相邻的外显子。在一些实施方式中，该环状RNA可以包含目的序列、翻译功能元件以及与3'自剪接型内含子片段相邻的外显子。一些实施方式中，该环状RNA可以包含目的序列、翻译功能元件以及与5'自剪接型内含子片段相邻的外显子。在一些优选实施方式中，该环状RNA可以由目的序列和翻译功能元件组成。在另一些实施方式中，该环状RNA可以包含目的序列以及与3'自剪接型内含子片段相邻的外显子。在一些实施方式中，该环状RNA可以包含目的序列以及与5'自剪接型内含子片段相邻的外显子。在一些优选实施方式中，该环状RNA可以仅由目的序列组成。在一些优选实施方式中，该环状RNA可以仅由目的序列和翻译功能元件组成；或由目的序列组成。This application also provides a circular RNA, which may comprise a target sequence, a selectively translatable functional element, and exons selectively adjacent to either a 3' self-splicing intron fragment or a 5' self-splicing intron fragment. In some embodiments, the circular RNA may comprise a target sequence, a translatable functional element, and exons adjacent to a 3' self-splicing intron fragment. In some embodiments, the circular RNA may comprise a target sequence, a translatable functional element, and exons adjacent to a 5' self-splicing intron fragment. In some preferred embodiments, the circular RNA may consist of a target sequence and a translatable functional element. In other embodiments, the circular RNA may comprise a target sequence and exons adjacent to a 3' self-splicing intron fragment. In some embodiments, the circular RNA may comprise a target sequence and exons adjacent to a 5' self-splicing intron fragment. In some preferred embodiments, the circular RNA may consist only of the target sequence. In some preferred embodiments, the circular RNA may consist only of the target sequence and a translatable functional element; or consist only of the target sequence.

翻译功能元件可以是翻译起始元件，优选地，可以是内部核糖体进入位点(IRES)。该IRES可以选自柯萨奇病毒B3(CVB3)、肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)、肠病毒A(EV-A)或人鼻病毒B6(HRVB6)的IRES。本申请中描述的其他来源的IRES同样适用。The translational functional element can be a translation initiation element, preferably an internal ribosome entry site (IRES). This IRES can be selected from IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), Enterovirus A (EV-A), or Human Rhinovirus B6 (HRVB6). IRES from other sources described in this application are also applicable.

在第五个方面，本申请提供一种组合物，其可以包含本申请第四方面提到的环状RNA分子和该环状RNA的开环RNA，其中，该环状RNA在组合物中的摩尔百分比为至少30％、至少40％、至少50％、至少60％、至少70％、至少80％。在一些实施方式中，该环状RNA在组合物中的摩尔百分比可以为至少40％。在一些实施方式中，该环状RNA在组合物中的摩尔百分比可以是通过毛细管电泳实验检测的。In a fifth aspect, this application provides a composition that may comprise the circular RNA molecule mentioned in the fourth aspect of this application and its open circular RNA, wherein the molar percentage of the circular RNA in the composition is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, or at least 80%. In some embodiments, the molar percentage of the circular RNA in the composition may be at least 40%. In some embodiments, the molar percentage of the circular RNA in the composition may be detected by capillary electrophoresis.

该环状RNA的开环RNA可以是任何物理、化学或其他因素使得环状RNA的闭合环状结构变成线性结构的RNA。The open circular RNA of this circular RNA can be any RNA whose closed circular structure is transformed into a linear structure by physical, chemical, or other factors.

该组合物中还可以包含本申请第二方面用于制备该环状RNA的可环化RNA分子。该组合物还可以包含残留内含子序列的RNA。The composition may also contain a circularizable RNA molecule used in the second aspect of this application for preparing the circular RNA. The composition may also contain RNA with residual intron sequences.

在第六个方面，本申请提供一种宿主细胞，其可以包含本申请的单链DNA分子、双链DNA分子、载体、可环化RNA分子或环状RNA分子。In a sixth aspect, this application provides a host cell that may contain the single-stranded DNA molecule, double-stranded DNA molecule, vector, circularizable RNA molecule, or circular RNA molecule of this application.

本申请还提供一种组合物，其可以包含本申请的单链DNA分子、双链DNA分子、载体、可环化RNA分子，或含有本申请单链DNA分子、双链DNA分子、载体、可环化RNA分子宿主细胞。该组合物可以用于制备可环化的RNA、和/或环状RNA，特别是在体外或细胞体内产生可翻译出蛋白或具有生物学活性的环状RNA。具有生物学活性的环状RNA可以是例如miRNA海绵体、或非编码RNA。本申请的组合物(包含单链DNA分子、双链DNA分子)可以用于制备环状RNA制备用载体。This application also provides a composition that may comprise the single-stranded DNA molecule, double-stranded DNA molecule, vector, or circularizable RNA molecule of this application, or a host cell containing the single-stranded DNA molecule, double-stranded DNA molecule, vector, or circularizable RNA molecule of this application. This composition can be used to prepare circularizable RNA and/or circular RNA, particularly to generate translatable proteins or biologically active circular RNA in vitro or in vivo. Biologically active circular RNA may be, for example, miRNA sponges or non-coding RNA. The composition of this application (comprising single-stranded DNA molecules and double-stranded DNA molecules) can be used to prepare a vector for preparing circular RNA.

本申请还提供一种组合物，其可以包含本申请的环状RNA分子、或含有本申请环状RNA分子的宿主细胞。该组合物可以是药物组合物，还可以包含药学上可接受的载体。该组合物可以经例如脂质体转染、电穿孔、或经纳米载体包裹而转染到细胞中。This application also provides a composition that may comprise the circular RNA molecule of this application or a host cell containing the circular RNA molecule of this application. The composition may be a pharmaceutical composition and may also comprise a pharmaceutically acceptable carrier. The composition may be transfected into cells via, for example, liposome transfection, electroporation, or encapsulation with a nanocarrier.

本申请也提供组合物在用于制备环状RNA、或用于体内治疗等中的用途。例如，在一个实施方式中，本申请提供一种用于在有需要的受试者中表达疫苗、治疗用蛋白、或其他类型蛋白的方法，包括对受试者施用本申请的包含环状RNA的组合物。治疗用蛋白可以为例如抗体、融合蛋白等。This application also provides the use of the composition in the preparation of circular RNA or for in vivo therapy. For example, in one embodiment, this application provides a method for expressing a vaccine, therapeutic protein, or other type of protein in a subject in need, comprising administering the composition of this application containing circular RNA to the subject. The therapeutic protein may be, for example, an antibody, a fusion protein, etc.

在第七个方面，本申请提供一种在有需要的受试者中治疗或预防疾病的方法，包括向受试者施用本申请的包含本申请环状RNA分子的药物组合物。其中，该环状RNA分子包含编码目的肽或蛋白的开放阅读框。目的肽或蛋白可以是疾病相关抗原、或治疗剂。疾病相关抗原可以是处于微生物，例如病毒、细菌、支原体等的外表上的肽或蛋白，或肿瘤相关抗原。治疗剂可以是例如抗体。In a seventh aspect, this application provides a method for treating or preventing a disease in a subject in need, comprising administering to the subject a pharmaceutical composition comprising a circular RNA molecule of this application. The circular RNA molecule comprises an open reading frame encoding a target peptide or protein. The target peptide or protein may be a disease-associated antigen or a therapeutic agent. Disease-associated antigens may be peptides or proteins located on the surface of microorganisms, such as viruses, bacteria, mycoplasma, etc., or tumor-associated antigens. The therapeutic agent may be, for example, an antibody.

当目的肽或蛋白是处于微生物，例如病毒、细菌、支原体等的外表上的肽或蛋白时，本申请的方法可以用于治疗或预防与该微生物感染相关的疾病。When the target peptide or protein is a peptide or protein on the surface of a microorganism, such as a virus, bacteria, mycoplasma, etc., the method of this application can be used to treat or prevent diseases related to infection by that microorganism.

当目的肽或蛋白是肿瘤相关抗原、或靶向肿瘤相关抗原的蛋白如抗体时，本申请的方法可以用于治疗与该肿瘤相关抗原相关的肿瘤。When the target peptide or protein is a tumor-associated antigen, or a protein such as an antibody that targets a tumor-associated antigen, the method of this application can be used to treat tumors associated with that tumor-associated antigen.

目的肽或蛋白也可以是哺乳动物，例如人体内表达的正常蛋白，其可以对缺乏该正常蛋白的受试者进行补充治疗。The target peptide or protein can also be a normal protein expressed in mammals, such as humans, which can be used to supplement subjects who lack this normal protein.

受试者可以是哺乳动物，例如人。在本申请中，同一个核苷酸序列，如由同一个SEQ ID NO表示的核苷酸序列，可以既表示DNA序列，又表示RNA序列，差别仅在于T与U的替换。The subjects can be mammals, such as humans. In this application, the same nucleotide sequence, such as the nucleotide sequence represented by the same SEQ ID NO, can represent both a DNA sequence and an RNA sequence, differing only in the substitution of T and U.

基于以下的详细描述和实施例，当前公开的其他特征和有利之处将是非常明晰可见的，详细描述和实施例不应当解读为是限制性的。所有在说明书中引用的文献、Genbank登记号、专利和公开专利申请均通过引用的方式并入本文。Other features and advantages disclosed herein will become readily apparent from the following detailed description and embodiments, which should not be construed as limiting. All references, Genbank registration numbers, patents, and published patent applications cited in this specification are incorporated herein by reference.

Attached Figure Description

以下以示例方式给出但不意在将本发明限制于所述具体实施方式的具体描述，可以结合附图更好地进行理解。The following detailed description, given by way of example but not intended to limit the invention to the specific embodiments described, can be better understood in conjunction with the accompanying drawings.

图1示出I类自剪接型内含子的二级结构示意图，其中IGS为内部引导序列，其5'和3'端序列分别与内含子3'侧相邻外显子以及5'侧相邻外显子互补配对，三角形指向内含子的5'和3'剪接位点。Figure 1 shows a schematic diagram of the secondary structure of a type I self-splicing intron, where IGS is the internal guide sequence, and its 5' and 3' end sequences are complementary to the exons adjacent to the 3' and 5' sides of the intron, respectively. The triangle points to the 5' and 3' splicing sites of the intron.

图2A和2B示出炭疽杆菌recA基因中自剪接型内含子的示意性结构(2A)以及IGS中可以用于改造的核苷酸位点(2B)。Figures 2A and 2B show schematic structures of self-splicing introns in the Bacillus anthracis recA gene (2A) and nucleotide sites in the IGS that can be modified (2B).

图3A-3D示出用于制备环状RNA的含炭疽杆菌recA基因中自剪接型内含子相邻外显子E3和E5(3A)、仅含炭疽杆菌recA基因中自剪接型内含子相邻外显子E3(3B)、仅含炭疽杆菌recA基因中自剪接型内含子相邻外显子E5(3C)、以及不含炭疽杆菌recA基因中自剪接型内含子相邻外显子E3和E5(3D)的载体的结构示意图。Figures 3A-3D show schematic diagrams of vectors used to prepare circular RNA containing exons E3 and E5 adjacent to the self-splicing intron of the Bacillus anthracis recA gene (3A), containing only exon E3 adjacent to the self-splicing intron of the Bacillus anthracis recA gene (3B), containing only exon E5 adjacent to the self-splicing intron of the Bacillus anthracis recA gene (3C), and not containing exons E3 and E5 adjacent to the self-splicing intron of the Bacillus anthracis recA gene (3D).

图4A-4D示出使用仅含炭疽杆菌recA基因中自剪接型内含子相邻外显子E3的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(4A)和毛细管凝胶电泳图(4C)，以及用仅含炭疽杆菌recA基因中自剪接型内含子相邻外显子E5的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(4B)和毛细管凝胶电泳图(4D)，其中各E-Gel^TMEX凝胶电泳胶图中的3个泳道从左到右分别为体外转录得到的、经RNA环化得到的、以及经RNA酶R处理得到的RNA产物。Figures 4A-4D show E-Gel ^™ EX gel electrophoresis (4A) and capillary gel electrophoresis (4C) images of RNA molecules prepared using a vector containing only the exon E3 adjacent to the self-splicing intron of the Bacillus anthracis recA gene, and E-Gel™ EX gel electrophoresis (4B) and capillary gel electrophoresis (4D) images of RNA molecules prepared using a vector containing only the exon E5 adjacent to the self-splicing intron of the Bacillus anthracis recA gene. The three lanes in each E-Gel ^™ ^EX gel electrophoresis image, from left to right, represent the RNA products obtained by in vitro transcription, RNA circularization, and RNA enzyme R treatment, respectively.

图5示出贝氏柯克斯体23S核糖体基因中自剪接型内含子的示意性结构。Figure 5 shows a schematic structure of the self-splicing intron in the 23S ribosomal gene of the Coxisonian Behringer.

图6A-6D示出用于制备环状RNA的含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E3和E5(6A)、仅含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E3(6B)、仅含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E5(6C)、以及不含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E3和E5(6D)的载体的结构示意图。Figures 6A-6D show schematic diagrams of vectors used to prepare circular RNA containing exons E3 and E5 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene (6A), containing only exon E3 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene (6B), containing only exon E5 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene (6C), and not containing exons E3 and E5 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene (6D).

图7A-7D示出使用仅含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E3的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(7A)和毛细管凝胶电泳图(7C)，以及用仅含贝氏柯克斯体23S核糖体基因中自剪接型内含子相邻外显子E5的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(7B)和毛细管凝胶电泳图(7D)，其中各E-Gel^TMEX凝胶电泳胶图中的3个泳道从左到右分别为体外转录得到的、经RNA环化得到的、以及经RNA酶R处理得到的RNA产物。Figures 7A-7D show E-Gel ^™ EX gel electrophoresis (7A) and capillary gel electrophoresis (7C) images of RNA molecules prepared using a vector containing only the exon E3 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene, and E-Gel ^™ EX gel electrophoresis (7B) and capillary gel electrophoresis (7D) images of RNA molecules prepared using a vector containing only the exon E5 adjacent to the self-splicing intron of the Coxsell 23S ribosomal gene. The three lanes in each E-Gel ^™ EX gel electrophoresis image, from left to right, represent the RNA products obtained by in vitro transcription, RNA circularization, and RNA enzyme R treatment, respectively.

图8A-8C示出不含E3和E5且在炭疽杆菌recA基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含IRES和开放阅读框(ORF)(8A)，3'IRES、ORF、和5'IRES(8B)，以及3'ORF、IRES和5'ORF(8C)的载体的示意性结构图。Figures 8A-8C show schematic diagrams of vectors that do not contain E3 and E5 and contain IRES and open reading frames (ORF) sequentially between the 3' and 5' introns of the self-splicing introns in the Bacillus anthracis recA gene (8A), 3'IRES, ORF, and 5'IRES (8B), and 3'ORF, IRES, and 5'ORF (8C).

图9A-9F示出由不含E3和E5且在炭疽杆菌recA基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含CVB3 IRES和开放阅读框(ORF)的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(9A)和毛细管凝胶电泳图(9D)，由不含E3和E5且在3'内含子片段和5'内含子片段之间按序包含3'CVB3 IRES、ORF、和5'CVB3 IRES的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(9B)和毛细管凝胶电泳图(9E)，以及由不含E3和E5且在3'内含子片段和5'内含子片段之间按序包含3'ORF、CVB3 IRES和5'ORF的载体的制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(9C)和毛细管凝胶电泳图(9F)。Figures 9A-9F show E-Gel™ EX gel electrophoresis (9A) and capillary gel electrophoresis (9D) images of RNA molecules prepared from vectors lacking E3 and E5 and containing CVB3 IRES and an open reading frame (ORF) sequentially between the 3' and 5' introns of the Bacillus anthracis recA gene; E-Gel ^™ EX gel electrophoresis (9B) and capillary gel electrophoresis (9E) images of RNA molecules prepared from vectors lacking E3 and E5 and containing 3' CVB3 IRES, ORF, and 5' CVB3 IRES sequentially between the 3' and 5'introns; and E-Gel ^™ images of RNA molecules prepared from vectors lacking E3 and E5 and containing 3' ORF, CVB3 IRES, and 5' ORF sequentially between the 3' and 5' introns ^. EX gel electrophoresis image (9C) and capillary gel electrophoresis image (9F).

图10A和10B示出由不含E3和E5且在贝氏柯克斯体23S核糖体基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含3'CVB3 IRES、ORF、和5'CVB3 IRES的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(10A)和毛细管凝胶电泳图(10B)。Figures 10A and 10B show E-Gel™ EX gel electrophoresis (10A) and capillary gel electrophoresis (10B) images of RNA molecules prepared from vectors containing 3' CVB3 IRES, ORF, and 5' CVB3 IRES sequentially between the 3' and 5' intron fragments of the self-splicing introns in the Coxsell ^23S ribosomal gene, which do not contain E3 and E5.

图11A-11I示出由不含E3和E5、在贝氏柯克斯体23S核糖体基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含3'CVB3 IRES、ORF、和5'CVB3IRES，且CVB3 IRES的截断位置为第18个碱基后的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(11A：Coxie-B18-Gluc；B：Coxie-B18-EGFP；11C：Coxie-B18-Fluc)和毛细管凝胶电泳图(11D、11E：Coxie-B18-Gluc；11F、11G：Coxie-B18-EGFP；11H、11I：Coxie-B18-Fluc)，其中各E-Gel^TMEX凝胶电泳胶图中的3个泳道从左到右分别为体外转录得到的、经RNA环化得到的、以及经RNA酶R处理得到的RNA产物。Figures 11A-11I show E-Gel™ EX gel electrophoresis images (11A: Coxie-B18-Gluc; B: Coxie-B18-EGFP; 11C: Coxie-B18-Fluc) and capillary ^gel electrophoresis images (11D, 11E: Coxie-B18-Gluc; 11F, 11G: Coxie-B18-EGFP; 11H, 11I: Coxie-B18-Fluc) of RNA molecules prepared from vectors containing no E3 and E5, self-splicing introns in the Coxsell 23S ribosomal gene, and sequentially including 3'CVB3 IRES, ORF, and 5'CVB3 IRES, with the CVB3 ^IRES truncated after the 18th base. The three lanes in the EX gel electrophoresis image, from left to right, represent the RNA products obtained from in vitro transcription, RNA circularization, and RNA enzyme R treatment, respectively.

图12A-12D示出由不含E3和E5、在贝氏柯克斯体23S核糖体基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含3'EVB107 IRES、ORF、和5'EVB107 IRES，且EVB107 IRES的截断位置为第18个碱基后的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(12A：Coxie-EVB107-B18-Gluc；12B：Coxie-EVB107-B18-EGFP)和毛细管凝胶电泳图(12C：Coxie-EVB107-B18-Gluc；12D：Coxie-EVB107-B18-EGFP)。其中各E-Gel^TMEX凝胶电泳胶图中的3个泳道从左到右分别为体外转录得到的、经RNA环化得到的、以及经RNA酶R处理得到的RNA产物。Figures 12A-12D show E-Gel™ EX gel electrophoresis images (12A: Coxie-EVB107-B18-Gluc; 12B: Coxie-EVB107-B18-EGFP) and capillary ^gel electrophoresis images (12C: Coxie-EVB107-B18-Gluc; 12D: Coxie-EVB107-B18-EGFP) of RNA molecules prepared from vectors containing 3' intron fragments without E3 and E5, self-splicing introns in the Coxsell 23S ribosomal gene, and containing 3'EVB107 IRES, ORF, and 5'EVB107 IRES in sequence, with the EVB107 IRES truncated after the 18th base. The three lanes in each E-Gel ^™ EX gel electrophoresis image, from left to right, represent RNA products obtained through in vitro transcription, RNA circularization, and RNA enzyme R treatment, respectively.

图13A-13C示出线性EGFP、环状Ana-EGFP及本申请实施例制备的环状coxie-B18-EGFP RNA的细胞表达测试与免疫原性测试结果，其中图13A和图13B分别为转染HEK293T细胞及A549细胞，在24小时、48小时、72小时、6天(6D)及8天(8D)后表达荧光强度，图13C为环状coxie-B18-EGFP转染A549细胞后48小时的免疫原性检测结果。Figures 13A-13C show the cell expression and immunogenicity test results of linear EGFP, circular Ana-EGFP, and circular coxie-B18-EGFP RNA prepared in the embodiments of this application. Figures 13A and 13B show the fluorescence intensity of HEK293T cells and A549 cells after 24 hours, 48 hours, 72 hours, 6 days (6D), and 8 days (8D), respectively. Figure 13C shows the immunogenicity test results of circular coxie-B18-EGFP transfected into A549 cells 48 hours later.

图14A-14H示出由不含E3和E5、在贝氏柯克斯体23S核糖体基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含3'HRVB3 IRES、ORF、和5'HRVB3 IRES，且HRVB3 IRES的截断位置为第18个碱基后的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(14A:Coxie-HRVB3-B18-Gluc,14B:Coxie-HRVB3-B18-EGFP)和毛细管凝胶电泳图(14E:Coxie-HRVB3-B18-Gluc,14F:Coxie-HRVB3-B18-EGFP)；由不含E3和E5、在贝氏柯克斯体23S核糖体基因中自剪接型内含子的3'内含子片段和5'内含子片段之间按序包含3'EV-A-mut IRES、ORF、和5'EV-A-mut IRES，且EV-A-mut IRES的截断位置为第374个碱基后的载体制备的RNA分子的E-Gel^TMEX凝胶电泳胶图(14C：Coxie-EV-A-mut-B374-Gluc，14D:Coxie-EV-A-mut-B374-EGFP)和毛细管凝胶电泳图(14G：Coxie-EV-A-mut-B374-Gluc，14H:Coxie-EV-A-mut-B374-EGFP)。其中各E-Gel^TMEX凝胶电泳胶图中的3个泳道从左到右分别为体外转录得到的、经RNA环化得到的、以及经RNA酶R处理得到的RNA产物。Figures 14A-14H show the E-Gel™ of RNA molecules prepared from vectors containing 3' HRVB3 IRES, ORF, and 5' HRVB3 IRES sequentially between the 3' and 5' introns of the self-splicing introns in the Coxsell 23S ribosomal gene, excluding E3 and ^E5 , with the HRVB3 IRES truncated after the 18th base. EX gel electrophoresis images (14A: Coxie-HRVB3-B18-Gluc, 14B: Coxie-HRVB3-B18-EGFP) and capillary gel electrophoresis images (14E: Coxie-HRVB3-B18-Gluc, 14F: Coxie-HRVB3-B18-EGFP); E-Gel™ images of RNA molecules prepared from a vector containing no E3 and E5, with 3' EV-A-mut IRES, ORF, and 5' EV-A-mut IRES sequentially between the 3' and 5' intron fragments of the self-splicing intronic introns in the Coxsell 23S ribosomal gene, and with the EV-A-mut IRES truncated after the 374th base ^. EX gel electrophoresis images (14C: Coxie-EV-A-mut-B374-Gluc, 14D: Coxie-EV-A-mut-B374-EGFP) and capillary gel electrophoresis images (14G: Coxie-EV-A-mut-B374-Gluc, 14H: Coxie-EV-A-mut-B374-EGFP). The three lanes in each E-Gel ^™ EX gel electrophoresis image, from left to right, represent the RNA products obtained through in vitro transcription, RNA circularization, and RNase R treatment, respectively.

Detailed Implementation

本文中用到的术语，除非特别指出，均具有字典、教科书、技术工具书中的普通含义，或本领域技术人员通常所理解的意思。以下对于一些术语的描述，仅出于便于理解本申请的目的，而不意在对这些术语进行特别的限定，除非特别指出。Unless otherwise specified, the terms used herein have their common meanings as found in dictionaries, textbooks, and technical reference books, or as commonly understood by those skilled in the art. The following descriptions of some terms are for the purpose of understanding this application only and are not intended to impose any particular limitations on these terms, unless otherwise specified.

如本文和所附权利要求书中所使用的，单数形式“一个”、“一种”、和“该”包括所指对象的复数形式，除非上下文另有明确规定。As used herein and in the appended claims, the singular forms “a,” “an,” and “the” include the plural form of the object referred to, unless the context clearly specifies otherwise.

术语“或”是指列举的可选择要素中的单个要素，除非上下文明确地另外指出。The term "or" refers to a single element among the listed selectable elements, unless the context explicitly indicates otherwise.

术语“包含”或“包括”是指将所述的要素、整数或步骤包括在内，但是不排除任意其他要素、整数或步骤的加入。在文中，当使用术语“包含”或“包括”时，除非另有指明，也涵盖所述及的要素、整数或步骤的组合。术语“由…构成”或“由…组成”一般是指仅含有所述的要素、整数或步骤，而不得加入其他要素、整数或步骤。The terms "comprising" or "including" mean that the stated elements, integers, or steps are included, but do not exclude the inclusion of any other elements, integers, or steps. In this document, when the terms "comprising" or "including" are used, unless otherwise specified, they also cover combinations of the stated elements, integers, or steps. The terms "consisting of" or "comprises of" generally mean that only the stated elements, integers, or steps are included, without the addition of other elements, integers, or steps.

各元件“可操作地连接”，是指各元件的排布可以使得完成所需的功能。例如，本申请中各元件“可操作地连接”的DNA链可以完成RNA体外或体内转录，本申请中各元件“可操作地连接”的RNA分子可以完成环化等。The term "operably linked" means that the arrangement of the elements enables the completion of the required function. For example, the DNA strands in which the elements are "operably linked" in this application can perform RNA transcription in vitro or in vivo, and the RNA molecules in which the elements are "operably linked" in this application can perform circularization, etc.

核酸分子的5'端可以是具有游离磷酸基的末端，3'端可以是具有游离羟基的末端。“上游”一般是指在核酸序列中相对更靠近5'端的位置，而“下游”一般是指在核酸序列中相对更靠近3'端的位置。The 5' end of a nucleic acid molecule can be a terminal with a free phosphate group, and the 3' end can be a terminal with a free hydroxyl group. "Upstream" generally refers to a position relatively closer to the 5' end in the nucleic acid sequence, while "downstream" generally refers to a position relatively closer to the 3' end in the nucleic acid sequence.

“体外转录”或“IVT”是指在含有RNA转录酶、NTP等的条件下，在体外无细胞体系中，以DNA为模板，模拟体内转录过程形成RNA的过程。当使用质粒载体作为DNA模板时，在体外转录前，利用酶切位点对质粒线进行酶切，使其线性化。"In vitro transcription" or "IVT" refers to the process of forming RNA using DNA as a template in a cell-free system under conditions containing RNA transcriptase, NTPs, etc., mimicking the in vivo transcription process. When using a plasmid vector as a DNA template, the plasmid line is linearized by enzyme digestion sites before in vitro transcription.

“前体RNA”、“IVT转录产物”、“信使RNA”、“mRNA”或“可环化RNA”是指由本申请的载体转录得到的RNA产物，其可以在合适条件下环化，形成环状RNA。"Precursor RNA", "IVT transcript", "messenger RNA", "mRNA" or "circularizable RNA" refers to the RNA product transcribed from the vector of this application, which can be circularized under suitable conditions to form circular RNA.

外显子是指DNA中出现在成熟RNA分子上的序列。外显子被内含子隔开，在转录后经加工去除内含子而连接在一起。相对地，内含子即为间隔外显子的序列，其转录在前体RNA中，经剪接而去除，最终不显示在成熟RNA分子中。内含子的去除，通常需要剪接小体，即由核小RNA(snRNA，U1、U2、U4、U5、U6等)和蛋白质因子(约100多种)动态组成、识别RNA前体的剪接位点并催化剪接反应的核糖核蛋白复合体，的参与。然而，极少数形成核酶的内含子会发生自剪接，由RNA单独形式剪接酶的功能。Exons are sequences in DNA that appear on mature RNA molecules. Exons are separated by introns and are joined together after transcription by intron removal. Conversely, introns are sequences that separate exons; they are transcribed into precursor RNA, removed by splicing, and ultimately not displayed in mature RNA molecules. Intron removal typically requires the participation of splice bodies, ribonucleoprotein complexes dynamically composed of nuclear small RNAs (snRNAs, U1, U2, U4, U5, U6, etc.) and protein factors (approximately 100 types), which recognize splice sites on the RNA precursor and catalyze the splicing reaction. However, a very small number of introns that form ribozymes undergo self-splicing, functioning as spliceases in RNA alone.

本申请中的“自剪接型内含子”就是指能自行发生剪接，而无需剪接小体参与的内含子。自剪接型内含子发生剪接的条件包括存在GTP和镁离子。在一些实施方式中，自剪接型内含子发生剪接的条件包括，体外转录后纯化的RNA产物在约70℃孵育5分钟，置于冰上3分钟，加入GTP和Mg²⁺，约55℃孵育5-10分钟。根据结构和剪接的机制，自剪接型内含子通常分为两类，即I类和II类。In this application, "self-splicing introns" refers to introns that can splice themselves without the involvement of spliceosomes. Conditions for splicing of self-splicing introns include the presence of GTP and magnesium ions. In some embodiments, the conditions for splicing of self-splicing introns include incubating the purified RNA product after in vitro transcription at approximately 70°C for 5 minutes, placing it on ice for 3 minutes, adding GTP and ^Mg²⁺ , and incubating at approximately 55°C for 5-10 minutes. Based on their structure and splicing mechanism, self-splicing introns are generally classified into two categories: Class I and Class II.

本文中的“互补”、“配对”、“互补配对”或“碱基配对”是指两个核苷酸或两个碱基能够根据A-T、A-U、C-G的碱基互补原则进行配对结合。当一个核苷酸序列与另一核苷酸序列“互补”时，可以指两个核苷酸序列100％相互互补，也可以指两个核苷酸序列具有高度互补，例如90％以上的互补。“互补”的两条链互为对方的“互补链”。“反向互补”是指同一条核苷酸链上的两个片段、或两条核苷酸链，当分别从5'到3'的方向和从3'到5'的方向读取时，可以进行互补配对。例如，同一条核苷酸链上的GGAAA片段和UUUCC片段反向互补，或者一条核苷酸链上的GGAAA片段和另一条核苷酸链上的UUUCC片段反向互补。特别地，在本申请中，同一条核苷酸链上P1序列GGAAAU片段与P1反义链的GUUUCC片段呈反向互补。In this document, "complementary," "pairing," "complementary pairing," or "base pairing" refers to the ability of two nucleotides or two bases to pair and bind according to the A-T, A-U, C-G base complementarity principle. When one nucleotide sequence is "complementary" to another, it can mean that the two nucleotide sequences are 100% complementary, or that the two nucleotide sequences are highly complementary, such as more than 90% complementary. The two strands that are "complementary" are each other's "complementary strands." "Anti-complementarity" refers to the ability of two fragments or two nucleotide chains on the same nucleotide chain to pair complementarily when read from the 5' to 3' direction and from the 3' to 5' direction, respectively. For example, the GGAAA fragment and the UUUCC fragment on the same nucleotide chain are anti-complementary, or the GGAAA fragment on one nucleotide chain and the UUUCC fragment on another nucleotide chain are anti-complementary. In particular, in this application, the GGAAAU fragment of the P1 sequence on the same nucleotide chain is anti-complementary to the GUUUCC fragment of the P1 antisense strand.

本文中的“启动子”是指通过提供RNA聚合酶的识别和结合位点来控制转录子合成的序列。启动子区域还可以包括对于其他参与转录调控的因子的识别或结合位点。启动子可以是诱导型的，响应于诱导信号而起始转录，或者可以是组成型的。诱导型启动子，在不存在诱导信号时，引起的转录量很小，或几乎没有。In this article, a "promoter" refers to a sequence that controls transcriptomic synthesis by providing recognition and binding sites for RNA polymerase. The promoter region may also include recognition or binding sites for other factors involved in transcriptional regulation. Promoters can be inducible, initiating transcription in response to an induction signal, or constitutive. Inducible promoters, in the absence of an induction signal, elicit very little or no transcription.

“开放阅读框”或“ORF”是指从起始密码子开始、结束于终止密码子的连续核苷酸序列，可编码完整的多肽链。在mRNA序列中，每三个连续核苷酸(即三联“密码子”)编码相应的氨基酸。其中有一个起始密码子AUG和三个终止密码子UAA、UAG、UGA。核糖体从起始密码子开始翻译，沿着mRNA序列合成多肽链并不断延伸，遇到终止密码子时，多肽链的延伸反应终止。An "open reading frame" or "ORF" is a continuous sequence of nucleotides that begins with a start codon and ends with a stop codon, encoding a complete polypeptide chain. In an mRNA sequence, every three consecutive nucleotides (i.e., a triplet "codon") encode a corresponding amino acid. There is one start codon (AUG) and three stop codons (UAA, UAG, and UGA). The ribosomes begin translation from the start codon, synthesizing the polypeptide chain along the mRNA sequence and continuously elongating it. The elongation process terminates when a stop codon is encountered.

本文中的“翻译功能元件”是指对RNA分子中对其自身翻译有作用的序列，例如可以促进其自身翻译的序列，可以包括使其开启翻译的序列(在本文中又称为“翻译起始元件”)，如IRES，或增强其翻译的序列(在本文中又称为“翻译增强元件”)，例如poly(A)尾、非cap依赖性翻译增强子(CITE)等。其中poly(A)尾可以包含连续的腺苷酸，或由连续腺苷酸组成。或者，poly(A)尾可以包含2-5个由间隔序列隔开的连续腺苷酸片段，其中间隔序列包含1-20个核苷酸，各个连续腺苷酸片段包含10-100个连续腺苷酸。非cap依赖性翻译增强子(CITE)是指不依赖于cap帽子而通过例如招募翻译起始因子或核糖体亚基等而进行RNA高效翻译的序列。“内部核糖体进入位点”或“IRES”是指形成二级结构以吸引翻译起始复合物前体到达翻译起始密码子例如AUG的RNA序列。本文中不同来源的IRES均包含野生型及相应的变体，例如来源于CVB3、EVB107、HRB3或EV-A病毒的IRES包括但不限于来源于这些病毒的野生型IRES和野生型IRES的突变体。这里野生型IRES的突变体，可以是野生型IRES不同功能区段组装的IRES，也可以是相对于野生型IRES包含核苷酸取代、删除或插入的IRES。在一些实施方案中，EV-AIRES包含SEQ ID NO:76所示的核苷酸序列，EV-AIRES突变体包含与SEQ ID NO:76具有至少70％序列一致性的核苷酸序列。在一个具体实施例中，EV-AIRES突变体包含SEQ ID NO:75所示的核苷酸序列。In this article, "translational functional elements" refers to sequences in RNA molecules that function in their own translation. These include sequences that promote translation, such as sequences that initiate translation (referred to as "translation initiation elements" in this article), like IRES, or sequences that enhance translation (referred to as "translation enhancement elements" in this article), such as poly(A) tails and cap-independent translation enhancers (CITEs). A poly(A) tail can contain consecutive adenosine nucleotides or consist of consecutive adenosine nucleotides. Alternatively, a poly(A) tail can contain 2-5 consecutive adenosine nucleotide segments separated by spacer sequences, where the spacer sequences contain 1-20 nucleotides and each consecutive adenosine nucleotide segment contains 10-100 consecutive adenosine nucleotides. A cap-independent translation enhancer (CITE) is a sequence that enables efficient RNA translation without relying on a cap, by recruiting translation initiation factors or ribosomal subunits. An "internal ribosome entry site" or "IRES" refers to an RNA sequence that forms a secondary structure to attract the precursor of the translation initiation complex to the translation initiation codon, such as AUG. The IRES from different sources mentioned herein include wild-type and corresponding variants, such as IRES derived from CVB3, EVB107, HRB3, or EV-A viruses, including but not limited to wild-type IRES and mutants of wild-type IRES derived from these viruses. Mutants of wild-type IRES can be IRES assembled from different functional regions of wild-type IRES, or IRES containing nucleotide substitutions, deletions, or insertions relative to wild-type IRES. In some embodiments, EV-AIRES comprise the nucleotide sequence shown in SEQ ID NO:76, and EV-AIRES mutants comprise a nucleotide sequence having at least 70% sequence identity with SEQ ID NO:76. In one specific embodiment, EV-AIRES mutants comprise the nucleotide sequence shown in SEQ ID NO:75.

对于本申请中含“自剪接型内含子”、“开放阅读框”、“翻译功能元件”等在DNA分子或RNA分子中采用相同的名称进行描述，不区分DNA分子转录成RNA分子后核苷酸序列类型不同。In this application, terms such as "self-splicing intron," "open reading frame," and "translational element" are used interchangeably in both DNA and RNA molecules, without distinguishing between the different nucleotide sequence types after DNA molecules are transcribed into RNA molecules.

“同源臂”是指RNA序列，其能够与另一RNA(例如另一同源臂)形成至少约15％、25％、35％、45％、55％、65％、75％、80％、85％、90％、95％、96％、97％、98％、99％或100％碱基配对。A "homologous arm" refers to an RNA sequence that can form at least about 15%, 25%, 35%, 45%, 55%, 65%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% base pairings with another RNA (e.g., another homologous arm).

“间隔序列”是指为避免相邻或邻近的、或甚至相距较远的元件在转录、RNA折叠等过程中相互干扰而加入的序列。载体元件是否会在转录、RNA折叠等过程中相互干扰，可以通过计算机软件如RNA折叠软件(如RNAFold)来预测。间隔序列也可以借助于计算机软件如RNA折叠软件(如RNAFold)来设计。"Spacer sequences" are sequences added to prevent interference between adjacent, nearby, or even distant elements during transcription, RNA folding, and other processes. Whether vector elements will interfere with each other during transcription, RNA folding, etc., can be predicted using computer software such as RNA folding software (e.g., RNAFold). Spacer sequences can also be designed using computer software such as RNA folding software (e.g., RNAFold).

本文中提到的“同一性”或“序列同一性”是指在进行序列比对后，一条序列中与参照序列中核苷酸/氨基酸残基相同的核苷酸/氨基酸百分比，如果需要的话，在序列对比中引入空格来达到两条序列间最大的序列一致性百分比。本领域技术人员可以通过多种方法，例如使用计算机软件，来进行两两序列对比或多序列比对，以确定两条或多条核酸或氨基酸序列之间的序列一致性百分比，此类计算机软件为例如ClustalOmega、T-coffee、Kalign和MAFFT等。The term "identity" or "sequence identity" as used herein refers to the percentage of nucleotides/amino acids in a sequence that are identical to those in a reference sequence after sequence alignment. If necessary, spaces are introduced in the sequence alignment to achieve the maximum percentage of sequence similarity between the two sequences. Those skilled in the art can use various methods, such as computer software, to perform pairwise or multiple sequence alignments to determine the percentage of sequence similarity between two or more nucleic acid or amino acid sequences. Such computer software includes, for example, ClustalOmega, T-coffee, Kalign, and MAFFT.

术语“受试者”包括任何人或非人动物。术语“非人动物”包括所有脊椎动物，例如哺乳类和非哺乳类，例如非人灵长类、羊、狗、猫、牛、马、鸡、两栖类、和爬行类，尽管优选哺乳动物，例如非人灵长类、羊、狗、猫、牛和马。The term “subject” includes any human or non-human animal. The term “non-human animal” includes all vertebrates, such as mammals and non-mammalians, such as non-human primates, sheep, dogs, cats, cattle, horses, chickens, amphibians, and reptiles, although mammals, such as non-human primates, sheep, dogs, cats, cattle, and horses, are preferred.

术语“载体”是指一段天然存在的或合成的核苷酸片段，如DNA或RNA片段，包括单链和双链DNA片段，例如化学合成的DNA片段、天然的质粒、或经改造的病毒基因组等。载体中可以插入有外源DNA片段，以进行该外源DNA片段的克隆和/或表达。载体可以包含，例如复制起点、选择性标记或报告基因、多克隆位点(MCS)等。该术语包括线性的DNA片段(如PCR产物、线性化的质粒片段等)、质粒载体、病毒载体、细菌人工染色体(BAC)、酵母人工染色体(YAC)等。当载体为双链DNA时，对于元件的排序、以及元件序列的方向的描述，是针对其中一条DNA链而言的。在双链载体中，两条DNA链基本互补或完全互补，即两条DNA链之间形成至少约75％、80％、85％、90％、95％、96％、97％、98％、99％或100％碱基配对。The term "vector" refers to a naturally occurring or synthetic nucleotide fragment, such as a DNA or RNA fragment, including single-stranded and double-stranded DNA fragments, such as chemically synthesized DNA fragments, natural plasmids, or modified viral genomes. A foreign DNA fragment may be inserted into a vector for the cloning and/or expression of that foreign DNA fragment. Vectors may contain, for example, origins of replication, selectivity markers or reporter genes, multiple cloning sites (MCS), etc. The term includes linear DNA fragments (such as PCR products, linearized plasmid fragments, etc.), plasmid vectors, viral vectors, bacterial artificial chromosomes (BACs), yeast artificial chromosomes (YACs), etc. When the vector is double-stranded DNA, the description of the element sequencing and the orientation of the element sequence refers to one of the DNA strands. In a double-stranded vector, the two DNA strands are substantially complementary or completely complementary, meaning that at least approximately 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% base pairing exists between the two DNA strands.

“核苷”是DNA和RNA的组成部分，包含核糖(对于RNA)或脱氧核糖(对于DNA)与碱基。“核苷酸”是指由核苷与磷酸基团组成的分子，可以在5’位包含羟基且在3’位包含磷酸基、或者在3’位包含羟基且在5’位包含磷酸基。在本申请中，在某些情境下，“核苷酸”和“碱基”可互换使用。"Nucleoside" is a component of DNA and RNA, consisting of ribose (for RNA) or deoxyribose (for DNA) and a base. "Nucleotide" refers to a molecule composed of a nucleoside and a phosphate group, which may contain a hydroxyl group at the 5' position and a phosphate group at the 3' position, or vice versa. In this application, "nucleotide" and "base" may be used interchangeably in certain contexts.

“内部引导序列”或“IGS”是指自I类剪接型内含子中靠近5’端的为自剪接所需的一段序列，其可以与内含子5’侧外显子和内含子3’侧外显子形成互补配对，使内含子形成正确的二级和三级结构，从而推进内含子剪接程序的进行。“剪接位点”是指内含子和外显子的交界处。The "internal guide sequence" or "IGS" refers to a sequence near the 5' end of a type I spliced intron that is required for self-splicing. This sequence can form complementary pairs with the 5' and 3' exons of the intron, enabling the intron to form the correct secondary and tertiary structures, thereby advancing the intron splicing process. The "splicing site" refers to the junction between the intron and the exon.

在利用自剪切型内含子进行环状RNA的制备时，难免会带入一些非目的序列。When preparing circular RNA using self-cleaving introns, it is inevitable that some non-target sequences will be introduced.

例如，自剪接型内含子需要借助两侧相邻的外显子序列形成所需的二级和三级结构，从而推进自剪接程序。具体地，如图1所示，内含子中靠近5'端的内部引导序列IGS需要与内含子5’侧外显子和内含子3’侧外显子形成互补配对，在两个三角形指向的剪接位点发生剪接，内含子脱除，两侧外显子连接。在使用自剪接型内含子构建用于制备环状RNA的载体时，通常将内含子分成3'内含子片段(及3'侧外显子E3)和5'内含子片段(及5'侧外显子E5)两部分，将3'内含子片段(及3'侧外显子E3)和5'内含子片段(及5'侧外显子E5)分别装载在待环化RNA序列的5'和3'侧。对于如何将内含子分成两半，即在哪些位点将内含子分成两半，本领域技术人员能够根据内含子的一级、二级、三级和/或四级结构判断出，且对于分成两半的内含子片段是否可以使待环化RNA成功环化，可使用本申请实施例中列出的方法等进行测试。当RNA进行环化时，自剪接型内含子和两侧相邻外显子发生上述互补配对，内含子从整体结构中脱离，E3和E5连接并留在环状RNA产物中。E3和E5就是环状RNA中的非目的序列，其存在可能会增加免疫原性以及相关的治疗风险，从而使得环状RNA的应用范围受限。For example, self-splicing introns require adjacent exon sequences on both sides to form the necessary secondary and tertiary structures, thereby advancing the self-splicing process. Specifically, as shown in Figure 1, the internal guide sequence IGS near the 5' end of the intron needs to form complementary pairings with the 5' and 3' exons of the intron. Splicing occurs at the two triangular splice sites, resulting in intron removal and the joining of the exons on both sides. When constructing vectors for preparing circular RNA using self-splicing introns, the intron is typically divided into a 3' intron fragment (and 3' exon E3) and a 5' intron fragment (and 5' exon E5). The 3' intron fragment (and 3' exon E3) and the 5' intron fragment (and 5' exon E5) are then loaded onto the 5' and 3' sides of the RNA sequence to be circularized, respectively. Regarding how to split an intron in half, i.e., at which sites the intron is split in half, those skilled in the art can determine this based on the primary, secondary, tertiary, and/or quaternary structure of the intron. Furthermore, the methods listed in the embodiments of this application can be used to test whether the split intron fragments can successfully circularize the RNA to be circularized. During RNA circularization, the self-splicing intron and its adjacent exons undergo the aforementioned complementary pairing, and the intron detaches from the overall structure. E3 and E5 connect and remain in the circular RNA product. E3 and E5 are non-target sequences in the circular RNA; their presence may increase immunogenicity and related therapeutic risks, thus limiting the application scope of circular RNA.

图2A和图5分别示出炭疽杆菌recA基因中自剪接型内含子、和贝氏柯克斯体23S核糖体基因中自剪接型内含子的示意性结构图。其中，IGS的5’端和3’端序列分别称为P10和P1，5’侧外显子中与P1互补配对的序列称为P1反义链，而3’侧外显子中与P10互补配对的序列称为P10反义链。在临近5’剪接位点处，IGS中的G与5’外显子最后一个核苷酸U形成一个较为保守的G:U配对。Figures 2A and 5 show schematic structural diagrams of self-splicing introns in the *Bacillus anthracis* *recA* gene and the *Coxiella behnkeni* 23S ribosomal gene, respectively. The 5' and 3' sequences of the IGS are designated P10 and P1, respectively. The sequence complementary to P1 in the 5' exon is called the P1 antisense strand, while the sequence complementary to P10 in the 3' exon is called the P10 antisense strand. Near the 5' splice site, the G in the IGS forms a relatively conserved G:U pair with the last nucleotide U of the 5' exon.

本申请的发明人，通过将待环化RNA中的3’端序列3-20nt核苷酸作为P1反义链，并将内含子IGS中的P1改造成与P1反义链反向互补，可以使用不含内含子5’侧外显子的自剪接型内含子实现RNA的环化，得到的环状RNA中自然也不含该E5序列。在改造中，保留P1与P1反义链形成的保守G:U配对。因而，待环化RNA的3’端序列末尾应当，或最好是碱基为U的核苷酸。The inventors of this application have achieved RNA circularization using a self-splicing intron that does not contain the 5' exon of the intron, by using the 3-20 nt nucleotide sequence at the 3' end of the RNA to be circularized as the P1 antisense strand and modifying the P1 in the intron IGS to be anticomplementary to the P1 antisense strand. The resulting circular RNA naturally does not contain the E5 sequence. In the modification, the conserved G:U pairing formed by P1 and the P1 antisense strand is preserved. Therefore, the 3' end of the RNA to be circularized should, or preferably should, be a U-terminated nucleotide.

此外，发明人还通过将待环化RNA中的5’端序列2-15nt核苷酸作为P10反义链，并将内含子IGS中的P10改造成与P10反义链反向互补，可以使用不含内含子3’侧外显子的自剪接型内含子实现RNA的环化，得到的环状RNA中自然也不含该E3序列。In addition, the inventors also used the 2-15nt nucleotide sequence at the 5' end of the RNA to be circularized as the P10 antisense strand and modified the P10 in the intron IGS to be reverse complementary to the P10 antisense strand. This allowed them to use self-splicing introns that do not contain the 3' exon of the intron to achieve RNA circularization. Naturally, the resulting circular RNA does not contain the E3 sequence.

在同时做出上述P1和P10改造时，可以使用不含E3和E5的自剪接型内含子完成RNA环化。得到的环状RNA中自然也不含E3和E5这些非目的序列。When performing the P1 and P10 modifications simultaneously, RNA circularization can be accomplished using self-splicing introns that do not contain E3 and E5. The resulting circular RNA will naturally not contain the non-target sequences E3 and E5.

发明人对两种I类自剪接型内含子，即炭疽杆菌recA基因中的自剪接型内含子、和贝氏柯克斯体23S核糖体基因中的自剪接型内含子，进行了IGS的改造，并将IGS改造后的内含子用于环化RNA的制备。结果表明，这种内含子IGS的改造、以及改造后内含子的应用，具有普适性，即都可以用于RNA的环化，且RNA环化的效率较高，达到了生产要求。因而，对于其他来源的内含子，也同样可以采取本发明的策略，对内含子的IGS进行改造，并将IGS改造后的内含子用于环化RNA的制备。The inventors modified two types of class I self-splicing introns—one from the recA gene of *Bacillus anthracis* and the other from the 23S ribosome gene of *Coxiella behnkeni*—using introns with IGS (Intracytoplasmic Spoken RNA). The modified introns were then used to prepare circularized RNA. Results showed that this intron IGS modification and the application of the modified introns are universally applicable; both can be used for RNA circularization with high efficiency, meeting production requirements. Therefore, the strategy of this invention can also be applied to introns from other sources, modifying their IGS and using the modified introns for the preparation of circularized RNA.

因而，本申请提供一种用于制备环状RNA的单链DNA分子，其可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、目的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接。Therefore, this application provides a single-stranded DNA molecule for preparing circular RNA, which may sequentially comprise from the 5' end to the 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

3'自剪接型内含子片段和5'自剪接型内含子片段能够使得由该单链DNA分子的互补链转录出的RNA分子环化。3'自剪接型内含子片段和5'自剪接型内含子片段可以得自同一个自剪接型内含子。Both 3' and 5' self-splicing intron fragments can cause the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule to become circular. Both 3' and 5' self-splicing intron fragments can originate from the same self-splicing intron.

内部引导序列的5'端序列可以与目的序列的5'端序列反向互补。内部引导序列的5'端序列可以与目的序列的5'端的2-15nt，例如2-12nt核苷酸反向互补。在一些实施方式中，内部引导序列的5'端序列可以与目的序列的5'端的3-6nt核苷酸反向互补。The 5' end sequence of the internal guide sequence may be anticomplementary to the 5' end sequence of the target sequence. The 5' end sequence of the internal guide sequence may be anticomplementary to 2-15 nt, for example, 2-12 nt nucleotides at the 5' end of the target sequence. In some embodiments, the 5' end sequence of the internal guide sequence may be anticomplementary to 3-6 nt nucleotides at the 5' end of the target sequence.

内部引导序列的3’端序列可以与目的序列的3'端序列反向互补。内部引导序列的3'端序列可以与目的序列的3'端的3-20nt。例如3-15nt核苷酸反向互补。在一些实施方式中，内部引导序列的3'端序列可以与目的序列的3'端的3-10nt核苷酸反向互补。在另一些实施方式中，在一些实施方式中，内部引导序列的3'端序列可以与目的序列的3'端的3-9nt核苷酸反向互补。如上所述，为了保持P1和P1反义链中的保守G:U配对，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始，且目的序列在3'端序列末尾可以是碱基为T的核苷酸。The 3' end sequence of the internal guide sequence can be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the internal guide sequence can be anticomplementary to 3-20 nt (e.g., 3-15 nt) nucleotides at the 3' end of the target sequence. In some embodiments, the 3' end sequence of the internal guide sequence can be anticomplementary to 3-10 nt nucleotides at the 3' end of the target sequence. In other embodiments, the 3' end sequence of the internal guide sequence can be anticomplementary to 3-9 nt nucleotides at the 3' end of the target sequence. As described above, to maintain the conserved G:U pairing in the P1 and P1 antisense strands, the 3' end sequence of the internal guide sequence can be initiated by a G-based nucleotide, and the 3' end sequence of the target sequence can be an T-based nucleotide.

当选定用于RNA环化的自剪接型内含子后，3'内含子片段可以在一些情况下固定，而5'内含子片段会随着待环化RNA的序列而变，因为其中的IGS需要与待环化RNA的两端序列反应互补。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自炭疽杆菌RecA基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:10所示的核苷酸序列。在一些实施方式中，3'自剪接型内含子片段和5'自剪接型内含子片段可以得自贝氏柯克斯体23S核糖体基因中的自剪接型内含子，且该3'自剪接型内含子片段可以包含SEQ ID NO:21所示的核苷酸序列。Once a self-splicing intron for RNA circularization is selected, the 3' intron fragment can be fixed in some cases, while the 5' intron fragment varies depending on the sequence of the RNA to be circularized, because the IGS within it needs to react complementaryly with the sequences at both ends of the RNA to be circularized. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Bacillus anthracis* RecA gene, and the 3' self-splicing intron fragment may contain the nucleotide sequence shown in SEQ ID NO: 10. In some embodiments, the 3' and 5' self-splicing intron fragments can be derived from the self-splicing intron in the *Coxiella belladonna* 23S ribosome gene, and the 3' self-splicing intron fragment may contain the nucleotide sequence shown in SEQ ID NO: 21.

目的序列可以包含编码目的肽或蛋白的开放阅读框、翻译功能元件的序列、编码非编码RNA的序列、单克隆位点、或多克隆位点。The target sequence may contain an open reading frame encoding the target peptide or protein, a sequence of translational functional elements, a sequence encoding non-coding RNA, a single cloning site, or a multiple cloning site.

翻译功能元件可以选自翻译起始元件、和翻译增强元件。在本申请单链DNA所制备的环状RNA分子中，翻译增强元件可以位于编码目的肽或蛋白的开放阅读框的5'侧或3'侧。例如，poly(A)尾、或3’CITE可以位于编码目的肽或蛋白的开放阅读框的3'侧，而得自Hox基因的翻译增强子则可以位于编码目的肽或蛋白的开放阅读框的5'侧。在本申请单链DNA所制备的环状RNA分子中，翻译增强元件可以位于翻译起始元件的5'侧或3'侧。本领域的技术人员，可以根据环状RNA分子的最终结构，来安排本申请DNA单链分子中翻译功能元件中翻译起始元件、和翻译增强元件的位置。例如，在最终得到的环状RNA分子中，使得poly(A)尾、或3’CITE等位于编码目的肽或蛋白的开放阅读框的3'侧，而得自Hox基因的翻译增强子等位于编码目的肽或蛋白的开放阅读框的5'侧。Translational functional elements can be selected from translation initiation elements and translation enhancement elements. In the circular RNA molecules prepared from single-stranded DNA of this application, the translation enhancement elements can be located on the 5' side or 3' side of the open reading frame encoding the target peptide or protein. For example, a poly(A) tail or 3'CITE can be located on the 3' side of the open reading frame encoding the target peptide or protein, while a translation enhancer derived from a Hox gene can be located on the 5' side of the open reading frame encoding the target peptide or protein. In the circular RNA molecules prepared from single-stranded DNA of this application, the translation enhancement elements can be located on the 5' side or 3' side of the translation initiation element. Those skilled in the art can arrange the positions of the translation initiation elements and translation enhancement elements in the single-stranded DNA molecules of this application according to the final structure of the circular RNA molecules. For example, in the final obtained circular RNA molecules, a poly(A) tail or 3'CITE is located on the 3' side of the open reading frame encoding the target peptide or protein, while a translation enhancer derived from a Hox gene is located on the 5' side of the open reading frame encoding the target peptide or protein.

目的序列可以包含编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)的序列。当该翻译功能元件(例如IRES)的序列的3'端序列末尾是T时，目的序列可以从5'端到3'端包含该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列。当该编码IRES的序列的3'端序列末尾不是T时，可以将翻译功能元件(例如IRES)的序列分拆成两段，并使其5'端序列的3'端序列末尾是T，从而目的序列从5'端到3'端包含该翻译功能元件(例如IRES)的序列的3'端序列、该编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列的5'端序列。当该编码目的肽或蛋白的开放阅读框的3'端序列末尾是T时，目的序列可以从5'端到3'端包含该翻译功能元件(例如IRES)的序列、和该编码目的肽或蛋白的开放阅读框。当该编码目的肽或蛋白的开放阅读框的3'端序列末尾不是T时，可以将编码目的肽或蛋白的开放阅读框分拆成两段，并使其5'端序列的3'端序列末尾是T，从而目的序列从5'端到3'端包含该编码目的肽或蛋白的开放阅读框的3'端序列、该翻译功能元件(例如IRES)的序列、和该编码目的肽或蛋白的开放阅读框的5'端序列。目的序列可以包含一个或多个编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)的序列。目的序列可以从5'端到3'端包含(a)该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列，(b)该翻译功能元件(例如IRES)的序列、和该一个或多个编码目的肽或蛋白的开放阅读框，(c)该翻译功能元件(例如IRES)的序列的3'端序列、该一个或多个编码目的肽或蛋白的开放阅读框、和该翻译功能元件(例如IRES)的序列的5'端序列，或(d)其中一个编码目的肽或蛋白的开放阅读框的3'端序列、其他编码目的肽或蛋白的开放阅读框(如存在的情况下)、该翻译功能元件(例如IRES)的序列、和其中一个编码目的肽或蛋白的开放阅读框的5'端序列。同样地，目的序列的3'端序列末尾需要安排成带T碱基的核苷酸。目的序列可以包含编码非编码RNA的序列。根据编码非编码RNA的序列的3'端序列末尾是否为带T的核苷酸，目的序列可以(a)包含该编码非编码RNA的序列，或(b)从5'端到3'端包含该编码非编码RNA的序列的3'端序列、和该编码非编码RNA的序列的5'端序列。The target sequence may contain an open reading frame encoding a target peptide or protein, and a sequence of a translational functional element (e.g., IRES). When the 3' end of the sequence of the translational functional element (e.g., IRES) is T, the target sequence can contain the open reading frame encoding the target peptide or protein and the sequence of the translational functional element (e.g., IRES) from the 5' end to the 3' end. When the 3' end of the sequence encoding IRES is not T, the sequence of the translational functional element (e.g., IRES) can be split into two segments, and the 3' end of the 5' end of each segment can be T, so that the target sequence contains the 3' end sequence of the sequence of the translational functional element (e.g., IRES), the open reading frame encoding the target peptide or protein, and the 5' end sequence of the sequence of the translational functional element (e.g., IRES) from the 5' end to the 3' end. When the 3' end of the open reading frame (OPF) encoding the target peptide or protein ends in a T, the target sequence can include the sequence of the translational functional element (e.g., IRES) and the ORF encoding the target peptide or protein from the 5' end to the 3' end. When the 3' end of the ORF encoding the target peptide or protein does not end in a T, the ORF encoding the target peptide or protein can be split into two segments, with the 3' end of each segment ending in a T. Thus, the target sequence can include the 3' end sequence of the ORF encoding the target peptide or protein, the sequence of the translational functional element (e.g., IRES), and the 5' end sequence of the ORF encoding the target peptide or protein from the 5' end to the 3' end. The target sequence can contain one or more ORFs encoding the target peptide or protein and sequences of translational functional elements (e.g., IRES). The target sequence may include, from the 5' end to the 3' end, (a) the open reading frame (OPF) encoding the target peptide or protein and the sequence of the translational element (e.g., IRES); (b) the sequence of the translational element (e.g., IRES) and the one or more ORF encoding the target peptide or protein; (c) the 3' end sequence of the translational element (e.g., IRES), the one or more ORF encoding the target peptide or protein and the 5' end sequence of the translational element (e.g., IRES); or (d) the 3' end sequence of one of the ORF encoding the target peptide or protein, other ORF encoding the target peptide or protein (if present), the sequence of the translational element (e.g., IRES), and the 5' end sequence of one of the ORF encoding the target peptide or protein. Similarly, the 3' end of the target sequence must be arranged as a nucleotide with a T base. The target sequence may contain a sequence encoding non-coding RNA. Depending on whether the 3' end of the sequence encoding non-coding RNA ends with a nucleotide containing a T, the target sequence may (a) contain the sequence encoding non-coding RNA, or (b) contain the 3' end of the sequence encoding non-coding RNA and the 5' end of the sequence encoding non-coding RNA from the 5' end to the 3' end.

目的序列可以包含编码目的肽或蛋白的开放阅读框。根据编码目的肽或蛋白的开放阅读框的3'端序列末尾是否为带T的核苷酸，目的序列可以(a)包含该编码目的肽或蛋白的开放阅读框，或(b)从5'端到3'端包含该编码目的肽或蛋白的开放阅读框的3'端序列、和该编码目的肽或蛋白的开放阅读框的5'端序列。在一些实施方式中，目的序列可以包含一个或多个编码目的肽或蛋白的开放阅读框。根据一个或多个编码目的肽或蛋白的开放阅读框的3'端序列末尾是否为带T的核苷酸，目的序列可以从5'端到3'端包含(a)该一个或多个编码目的肽或蛋白的开放阅读框，或(b)其中一个编码目的肽或蛋白的开放阅读框的3'端序列、其他编码目的肽或蛋白的开放阅读框(如存在的情况下)、和其中一个编码目的肽或蛋白的开放阅读框的5'端序列。The target sequence may contain an open reading frame (OPF) encoding a target peptide or protein. Depending on whether the 3' end of the OPF encoding the target peptide or protein is a T-terminated nucleotide, the target sequence may (a) contain the OPF encoding the target peptide or protein, or (b) contain the 3' end sequence of the OPF encoding the target peptide or protein and the 5' end sequence of the OPF encoding the target peptide or protein from the 5' end to the 3' end. In some embodiments, the target sequence may contain one or more OPF encoding the target peptide or protein. Depending on whether the 3' end sequence of one or more OPF encoding the target peptide or protein is a T-terminated nucleotide, the target sequence may contain (a) the one or more OPF encoding the target peptide or protein from the 5' end to the 3' end, or (b) the 3' end sequence of one OPF encoding the target peptide or protein, other OPF encoding the target peptide or protein (if present), and the 5' end sequence of one OPF encoding the target peptide or protein.

目的序列可以包含单克隆位点、或多克隆位点。可以通过单克隆位点、或多克隆位点在本申请的单链DNA分子中插入任何想要的序列，例如编码目的肽或蛋白的开放阅读框、和翻译功能元件(例如IRES)的序列等。在单克隆或多克隆位点处插入所需序列时，需考虑插入后的序列的3'端末序列尾是否为带T的核苷酸。The target sequence can contain a single-cloning site or a multiple-cloning site. Any desired sequence, such as an open reading frame encoding a target peptide or protein, or a sequence of a translational functional element (e.g., IRES), can be inserted into the single-stranded DNA molecule of this application through a single-cloning site or multiple-cloning site. When inserting the desired sequence at a single-cloning or multiple-cloning site, it is necessary to consider whether the 3' end of the inserted sequence is a T-containing nucleotide.

目的序列可以包含单克隆位点或多克隆位点、和翻译功能元件(例如IRES)的序列。目的序列可以从5'端到3'端包含该翻译功能元件(例如IRES)的序列的3'端序列、该单克隆位点或多克隆位点、和该翻译功能元件(例如IRES)的序列的5'端序列，其中该翻译功能元件(例如IRES)的序列的5'端序列的3'端序列末尾可以为带T的核苷酸。可以通过单克隆位点、或多克隆位点在本申请的单链DNA分子中插入任何想要的序列，例如编码目的肽或蛋白的开放阅读框等，此时无需考虑3'端序列末尾的核苷酸种类。The target sequence may contain a single cloning site or a multiple cloning site, and a sequence of a translational element (e.g., IRES). The target sequence may include, from the 5' end to the 3' end, the 3' end sequence of the translational element (e.g., IRES), the single cloning site or multiple cloning site, and the 5' end sequence of the translational element (e.g., IRES), wherein the 3' end of the 5' end sequence of the translational element (e.g., IRES) may end with a T-nucleotide. Any desired sequence, such as an open reading frame encoding a target peptide or protein, can be inserted into the single-stranded DNA molecule of this application through the single cloning site or multiple cloning site, without considering the type of nucleotide at the 3' end.

相应地，本申请还保护一种可环化的RNA分子，其可以从5'端到3'端依次包含：包含3'剪接位点的3'自剪接型内含子片段、目的序列、和包含5'剪接位点的5'自剪接型内含子片段，其中各元件可操作地连接。Accordingly, this application also protects a circularizable RNA molecule which may comprise, from the 5' end to the 3' end, the following in sequence: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

3'自剪接型内含子片段和5'自剪接型内含子片段能够使得该RNA分子环化并在环化过程中从该RNA分子中去除。The 3' self-splicing intron fragment and the 5' self-splicing intron fragment can cause the RNA molecule to circularize and be removed from the RNA molecule during the circularization process.

内部引导序列的5'端序列可以与目的序列的5'端序列反向互补。内部引导序列的5'端序列可以与目的序列的5'端的2-15nt，例如2-12nt核苷酸反向互补。The 5' end sequence of the internal guide sequence can be reverse complementary to the 5' end sequence of the target sequence. The 5' end sequence of the internal guide sequence can be reverse complementary to 2-15 nt (e.g., 2-12 nt) nucleotides at the 5' end of the target sequence.

内部引导序列的3’端序列可以与目的序列的3'端序列反向互补。内部引导序列的3'端序列可以与目的序列的3'端的3-20nt，例如3-15nt核苷酸反向互补。如上所述，为了保持P1和P1反义链中的保守G:U配对，内部引导序列的3’端序列可以以碱基为G的核苷酸为起始，且目的序列可以在3'端序列末尾是碱基为U的核苷酸。The 3' end sequence of the internal guide sequence can be anticomplementary to the 3' end sequence of the target sequence. The 3' end sequence of the internal guide sequence can be anticomplementary to a 3-20 nt (e.g., 3-15 nt) nucleotide at the 3' end of the target sequence. As mentioned above, to maintain the conserved G:U pairing in the P1 and P1 antisense strands, the 3' end sequence of the internal guide sequence can begin with a G-based nucleotide, and the target sequence can end with a U-based nucleotide at its 3' end.

同理，根据目的序列中所含的序列的3'端序列末尾核苷酸的情况，对序列进行顺序调整、或者对序列一拆为二，将5'端序列置于目的序列的3'端，且使调整后目的序列3'端序列末尾是碱基为U的核苷酸。Similarly, depending on the nucleotides at the end of the 3' end of the target sequence, the sequence can be rearranged or split into two, with the 5' end placed at the 3' end of the target sequence, and the adjusted target sequence ending with a U nucleotide at the 3' end.

如此，可以根据需要选用本申请DNA分子或RNA分子来制备环状RNA分子，使得环状RNA分子不带自剪接型内含子的5'侧外显子序列、或3'侧外显子序列，或者既不带自剪接型内含子的5'侧外显子序列也不带3'侧外显子序列，从而降低环状RNA分子中的非目的序列。目的序列不同拆分位置的本申请DNA分子或RNA分子，均可以制备既不带自剪接型内含子的5'侧外显子序列也不带3'侧外显子序列的环状RNA分子。Thus, the DNA or RNA molecules described in this application can be selected as needed to prepare circular RNA molecules, ensuring that the circular RNA molecules do not contain the 5' exon sequence of self-splicing introns, or the 3' exon sequence, or neither the 5' exon sequence of self-splicing introns nor the 3' exon sequence, thereby reducing the non-target sequences in the circular RNA molecules. Circular RNA molecules without either the 5' exon sequence of self-splicing introns or the 3' exon sequence can be prepared from the DNA or RNA molecules described in this application at different splitting positions of the target sequence.

此外，在本申请的DNA分子和RNA分子中，不另外包含人工间隔序列，使得进一步降低非目的序列。而本申请的DNA分子和RNA分子在制备环状RNA时，仍能保持较高的RNA环化效率，能够应用于生产，特别是大规模生产。Furthermore, the DNA and RNA molecules of this application do not contain additional artificial spacer sequences, thereby further reducing non-target sequences. Moreover, the DNA and RNA molecules of this application maintain high RNA circularization efficiency during the preparation of circular RNA, enabling their application in production, especially large-scale production.

在本申请的DNA分子和RNA分子中，目的序列中可以包含翻译功能元件IRES，如CVB3 IRES、EVB107 IRES、HRVB3 IRES，EV-AIRES(包含EV-AIRES的变体，以下称EV-A-mut IRES)。针对IRES元件进行调整或者一拆为二，拆分成5'端序列和3'端序列，将IRES序列的5'端序列置于编码目的肽或蛋白的开放阅读框之后，且使得调整后目的序列3'端序列末尾为带碱基T或U的核苷酸。IRES不同位置拆分均可以制备既不带自剪接型内含子的5'侧外显子序列也不带3'侧外显子序列的环状RNA分子。在本申请一些优选实施方式中，在CVB3 IRES、EVB107 IRES或HRVB3IRES第18个碱基后、或在EV-A-mut IRES第374个碱基后截断为二的DNA分子或RNA分子制备环状RNA的环化率会有较明显的提升，RNA酶R处理环化产物后能够获得纯度更高的环状RNA。In the DNA and RNA molecules of this application, the target sequence may contain translational functional elements (IRES), such as CVB3 IRES, EVB107 IRES, HRVB3 IRES, and EV-AIRES (variants containing EV-AIRES, hereinafter referred to as EV-A-mut IRES). The IRES element can be adjusted or split into a 5' end sequence and a 3' end sequence. The 5' end sequence of the IRES is placed after the open reading frame encoding the target peptide or protein, and the 3' end sequence of the adjusted target sequence ends with a nucleotide containing the bases T or U. Splitting the IRES at different positions can prepare circular RNA molecules that do not contain either the 5' exon sequence or the 3' exon sequence of self-splicing introns. In some preferred embodiments of this application, the cyclization rate of DNA or RNA molecules truncated into two after the 18th base of CVB3 IRES, EVB107 IRES or HRVB3 IRES, or after the 374th base of EV-A-mut IRES, is significantly improved. RNase R treatment of the cyclization product can yield cyclized RNA with higher purity.

本申请还提供一种组合物，其可以包含本申请的单链DNA分子、双链DNA分子、载体、可环化RNA分子，或包含上述单链DNA分子、双链DNA分子、载体、或可环化RNA分子的细胞。该组合物可以用于制备可环化的RNA、和/或环状RNA，特别是在体外或细胞体内产生可翻译出蛋白或具有生物学活性的环状RNA。具有生物学活性的环状RNA可以是例如miRNA海绵体、或非编码RNA。This application also provides a composition that may comprise the single-stranded DNA molecule, double-stranded DNA molecule, vector, or circularizable RNA molecule of this application, or a cell comprising the aforementioned single-stranded DNA molecule, double-stranded DNA molecule, vector, or circularizable RNA molecule. This composition can be used to prepare circularizable RNA and/or circular RNA, particularly to generate translatable proteins or biologically active circular RNA in vitro or in vivo. Biologically active circular RNA may be, for example, miRNA sponges or non-coding RNA.

本申请还提供一种组合物，其可以包含本申请的环状RNA分子、或包含环状RNA分子的细胞。该组合物可以是药物组合物，还可以包含药学上可接受的载体、赋形剂或稀释剂。This application also provides a composition that may comprise the circular RNA molecule of this application or a cell comprising the circular RNA molecule. The composition may be a pharmaceutical composition and may also comprise a pharmaceutically acceptable carrier, excipient, or diluent.

药学上可接受的载体可以是组合物中除活性成分(即本申请的载体、细胞、前体RNA、或环状RNA)以外的成分。药学上可接受的载体可以包括但不限于缓冲剂、赋形剂、稳定剂或防腐剂。药学上可接受的载体的实例是生理上相容的溶剂、分散介质、包衣、抗细菌和抗真菌剂、等渗剂和吸收延迟剂等，如盐、缓冲液、糖类、抗氧化剂、水性或非水性载体、防腐剂、润湿剂、表面活性剂或乳化剂或其组合。可以基于载体的活性和制剂的所需特性，如稳定性和/或最小氧化，通过实验确定药物组合物中的药学上可接受的载体的量。Pharmaceutically acceptable carriers can be components in a composition other than the active ingredient (i.e., the carrier, cell, precursor RNA, or circular RNA of this application). Pharmaceutically acceptable carriers can include, but are not limited to, buffers, excipients, stabilizers, or preservatives. Examples of pharmaceutically acceptable carriers are physiologically compatible solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic agents, and absorption delay agents, such as salts, buffers, sugars, antioxidants, aqueous or non-aqueous carriers, preservatives, wetting agents, surfactants, or emulsifiers, or combinations thereof. The amount of a pharmaceutically acceptable carrier in a drug composition can be determined experimentally based on the activity of the carrier and the desired properties of the formulation, such as stability and/or minimal oxidation.

在某些实施方式中，本申请的组合物可以包含缓冲液，如乙酸、柠檬酸、组氨酸、硼酸、甲酸、琥珀酸、磷酸、碳酸、苹果酸、天冬氨酸、Tris缓冲液、HEPPSO、HEPES、中性缓冲盐水、磷酸盐缓冲盐水等；碳水化合物，如葡萄糖、蔗糖、甘露糖或葡聚糖、甘露醇；蛋白质；多肽或氨基酸，如甘氨酸；抗氧化剂；螯合剂，如EDTA或谷胱甘肽；佐剂(例如，氢氧化铝)；抗细菌和抗真菌剂；和防腐剂。In some embodiments, the compositions of this application may comprise buffer solutions such as acetic acid, citric acid, histidine, boric acid, formic acid, succinic acid, phosphoric acid, carbonic acid, malic acid, aspartic acid, Tris buffer, HEPPSO, HEPES, neutral buffered saline, phosphate buffered saline, etc.; carbohydrates such as glucose, sucrose, mannose or dextran, mannitol; proteins; peptides or amino acids such as glycine; antioxidants; chelating agents such as EDTA or glutathione; adjuvants (e.g., aluminum hydroxide); antibacterial and antifungal agents; and preservatives.

本申请也提供组合物在用于制备环状RNA、或用于体内治疗等中的用途。例如，在一个实施方式中，本申请提供一种用于在有需要的受试者中表达疫苗、治疗用蛋白、或其他类型蛋白的方法，包括对受试者施用本申请的包含环状RNA的组合物。治疗用蛋白可以为例如抗体、融合蛋白等。在一个实施方式中，受试者为某一功能基因丧失的受试者，本申请组合物中的环状RNA可以行使该缺失基因所转录的RNA的功能，或在体内持续表达缺失基因所翻译的蛋白。This application also provides the use of the composition in the preparation of circular RNA or for in vivo therapy. For example, in one embodiment, this application provides a method for expressing a vaccine, therapeutic protein, or other type of protein in a subject in need, including administering the composition of this application containing circular RNA to the subject. The therapeutic protein may be, for example, an antibody, a fusion protein, etc. In one embodiment, the subject is a subject with a loss of a functional gene, and the circular RNA in the composition of this application may perform the function of the RNA transcribed from the missing gene, or continuously express the protein translated from the missing gene in vivo.

当环状RNA翻译表达治疗用蛋白、或发挥治疗作用时，向受试者施用治疗有效量的包含环状RNA的组合物。“治疗有效量”的本申请组合物优选地引起疾病症状严重程度的降低、无疾病症状时期的频率和持续时间的上升、或防止由疾病引起的损伤或失能。例如，对于荷肿瘤受试者的治疗，“治疗有效量”是指，相对于未治疗的受试者，优选地，肿瘤生长抑制至少约40％，更优选抑制至少约60％，更优选抑制至少约80％，更优选抑制至少约99％。治疗有效量的本申请融合蛋白可以在受试者(通常为人，或者可以为另一种哺乳动物)中减小肿瘤体积、或者减轻症状。When the circular RNA is translated to express a therapeutic protein or exerts a therapeutic effect, a therapeutically effective amount of the composition containing the circular RNA is administered to the subject. The "therapeutically effective amount" of the composition preferably causes a reduction in the severity of disease symptoms, an increase in the frequency and duration of symptom-free periods, or prevention of damage or disability caused by the disease. For example, in the treatment of a tumor-bearing subject, a "therapeuticly effective amount" means, relative to an untreated subject, preferably, tumor growth is inhibited by at least about 40%, more preferably by at least about 60%, more preferably by at least about 80%, and more preferably by at least about 99%. The therapeutically effective amount of the fusion protein of this application can reduce tumor volume or alleviate symptoms in a subject (typically a human, or possibly another mammal).

当需要的时候，本申请中的前体RNA或环状RNA可以进行纯化。例如，在向受试者施用本申请的组合物时，环状RNA优选是纯化的。环状RNA或前体RNA的纯化可以通过在高效液相色谱(HPLC)系统中施用尺寸排阻柱等方法来进行。When necessary, the precursor RNA or circular RNA in this application can be purified. For example, when administering the composition of this application to a subject, the circular RNA is preferably purified. Purification of the circular RNA or precursor RNA can be performed by methods such as applying a size exclusion column in a high-performance liquid chromatography (HPLC) system.

对于药物组合物的施用，具体可以由医学工作者，如医生，根据受试者的具体情况，如性别、年龄、既往病史等进行确定。The specific administration of the drug composition can be determined by medical professionals, such as doctors, based on the subject's specific circumstances, such as gender, age, and medical history.

本申请的药物组合物可以配置用于经口、静脉内、肌内、皮下、肠道外、脊柱或表皮施用(例如通过注射或推注)。“肠道外施用”是指不同于肠道和局部外用的方式，通常通过注射进行，包括但不限于静脉内、肌内、动脉内、膜内、囊内、眶内、心脏内、皮内、腹膜内、经气管、皮下、表皮下、关节内、囊下、蛛网膜下、脊柱内、硬脑膜上和胸骨内注射和推注。在一个实施方式中，可以将组合物配制成用于输注或静脉内施用。本文公开的组合物可以例如作为无菌液体制剂提供，例如等渗水溶液、乳液、悬浮液、分散体或粘性组合物，其可以缓冲至所需的pH。在某些实施方式中，组合物的施用可以以任何方式进行，例如通过肠胃外或非胃肠外施用，包括通过气雾吸入、注射、输注、摄取、输血、植入或移植。例如，本文所述的组合物可通过静脉内注射、鼻内、鞘内或腹膜内经动脉、皮内、皮下、肿瘤内、髓内、结节内、肌内施用于患者。The pharmaceutical compositions of this application can be formulated for oral, intravenous, intramuscular, subcutaneous, parenteral, spinal, or epidermal administration (e.g., by injection or bolus). "Parenteral administration" refers to methods other than intestinal and topical application, typically administered by injection, including but not limited to intravenous, intramuscular, intra-arterial, intramembranous, intracystic, intraorbital, intracardiac, intradermal, intraperitoneal, tracheal, subcutaneous, subepidermal, intra-articular, sub-bursular, subarachnoid, intraspinal, supradural, and intrasternal injections and boluses. In one embodiment, the composition can be formulated for infusion or intravenous administration. The compositions disclosed herein can be provided, for example, as sterile liquid formulations, such as isotonic aqueous solutions, emulsions, suspensions, dispersions, or viscous compositions that can be buffered to the desired pH. In some embodiments, the composition can be administered in any manner, such as via parenteral or non-parenteral administration, including via aerosol inhalation, injection, infusion, ingestion, transfusion, implantation, or transplantation. For example, the compositions described herein can be administered to patients via intravenous injection, intranasal administration, intrathecal administration, intrasheath administration, intraperitoneal administration via artery, intradermal administration, subcutaneous administration, intratumoral administration, intramedullary administration, intranodal administration, or intramuscular administration.

本申请的组合物可以经例如脂质体转染、电穿孔、或经纳米载体包裹而转染到细胞中。纳米载体可以是例如脂质、聚合物或脂质-聚合物杂合体。The compositions of this application can be transfected into cells via, for example, liposome transfection, electroporation, or encapsulation with nanocarriers. Nanocarriers can be, for example, lipids, polymers, or lipid-polymer hybrids.

本申请的组合物可以经释放递送系统到达受试者体内。释放递送系统包括基于聚合物的系统，如聚丙交酯-乙交酯、共聚草酸酯、聚酯酰胺、聚原酸酯、聚己内酯、聚羟基丁酸和聚酸酐。递送系统还包括非聚合物系统，该非聚合物系统是脂质，包括固醇，如胆固醇、胆固醇酯，以及脂肪酸或中性脂肪，如单甘油二酯和甘油三酯；基于肽的系统；水凝胶释放系统；蜡涂层；使用常规粘合剂和赋形剂的压片；部分融合的植入物；等等。在某些实施方式中，脂质纳米颗粒或聚合物被用作本文所述的治疗性环状RNA的递送载体，包括将RNA递送至组织。The compositions of this application can be delivered to a subject via a release delivery system. Release delivery systems include polymer-based systems such as polylactide-glycolic acid, copolyoxalate, polyesteramide, polyorthoester, polycaprolactone, polyhydroxybutyrate, and polyanhydride. Delivery systems also include non-polymer systems, which are lipids, including sterols such as cholesterol and cholesterol esters, and fatty acids or neutral fats such as monoglycerides and triglycerides; peptide-based systems; hydrogel release systems; wax coatings; tablets using conventional adhesives and excipients; partially fused implants; and so on. In some embodiments, lipid nanoparticles or polymers are used as delivery carriers for the therapeutic circular RNA described herein, including the delivery of RNA to tissues.

药物组合物可以是缓释试剂，包括植入体、和微胶囊递送系统。可以使用生物可降解、生物相容的聚合物，例如乙烯-醋酸乙烯、聚酸酐、聚乙醇酸、胶原蛋白、聚原酸酯、和聚乳酸。药物组合物可以经医学设备来给药，例如(1)无针皮下注射设备(例如，美国专利5,399,163；5,383,851；5,312,335；5,064,413；4,941,880；4,790,824；和4,596,556)；(2)微量输液泵(美国专利4,487,603)；(3)经皮给药设备(美国专利4,486,194)；(4)推注设备(美国专利4,447,233和4,447,224)；和(5)渗透设备(美国专利4,439,196和4,475,196)。The pharmaceutical composition can be a sustained-release agent, including implants and microcapsule delivery systems. Biodegradable and biocompatible polymers such as ethylene-vinyl acetate, polyanhydride, polyglycolic acid, collagen, polyorthoester, and polylactic acid can be used. The pharmaceutical composition can be administered via medical devices, such as (1) needle-free subcutaneous injection devices (e.g., U.S. Patents 5,399,163; 5,383,851; 5,312,335; 5,064,413; 4,941,880; 4,790,824; and 4,596,556); (2) microinfusion pumps (U.S. Patent 4,487,603); (3) transdermal drug delivery devices (U.S. Patent 4,486,194); (4) bolus injection devices (U.S. Patents 4,447,233 and 4,447,224); and (5) permeation devices (U.S. Patents 4,439,196 and 4,475,196).

本申请的组合物可以和其他治疗剂、疫苗等组合使用。本文讨论的治疗剂的组合可以作为在药学可接受载体中的单一组合物同时施用，或者作为分开的组合物同时施用，其中各药剂处于药学可接受载体中。在另一个实施方式中，治疗剂的组合可以按序施用。The compositions of this application can be used in combination with other therapeutic agents, vaccines, etc. The combinations of therapeutic agents discussed herein can be administered simultaneously as a single composition in a pharmaceutically acceptable carrier, or as separate compositions, wherein each agent is contained in a pharmaceutically acceptable carrier. In another embodiment, the combinations of therapeutic agents can be administered sequentially.

此外，如果进行多次联合疗法施用，且药剂按序施用，则在各时间点的按序施用的次序可以反转或保持相同，按序施用可以与同时施用或其任何组合相结合。Furthermore, if multiple combination therapies are administered and the drugs are administered sequentially, the order of administration at each time point can be reversed or kept the same, and sequential administration can be combined with simultaneous administration or any combination thereof.

本申请将参照以下非限制性实施例进行进一步说明。This application will be further described with reference to the following non-limiting embodiments.

实施例Example

实施例1.对于炭疽杆菌RecA基因中自剪接型内含子的内部引导序列(IGS)的改Example 1. Modification of the internal guide sequence (IGS) of the self-splicing intron in the RecA gene of Bacillus anthracis. 造make

取Minsu KO发表文章^[2]中炭疽杆菌RecA基因所含的I类自剪接型内含子，对其内部引导序列(IGS)中除用于保守性G:U配对的G外的核苷酸序列进行改造，并根据Deniel G发表文章^[1]构建出可制备可环化RNA的载体，以得到不含自剪接型内含子及其相邻外显子的环状RNA。The class I self-splicing introns contained in the RecA gene of Bacillus anthracis published by Minsu KO ^[2] were modified, and the nucleotide sequence of its internal guide sequence (IGS) except for the G used for conserved G:U pairing was modified. Based on the article published by Deniell G ^[1] , a vector for preparing circular RNA was constructed to obtain circular RNA without self-splicing introns and their adjacent exons.

图2A示出炭疽杆菌RecA基因所含的自剪接型内含子及其相邻外显子的结构，包括IGS序列中的P1序列(GUUUCC)和P10序列(UGG)，5'端相邻外显子中与IGS中P1反向互补的序列区称为P1反义链(GGAAAU)，3'端相邻外显子中与IGS中P10反向互补的序列区称为P10反义链(CCA)。发生自剪接反应时，P1与P1反义链、P10与P10反义链互补配对，在P1与P1反义链互补结构中的G-U保守碱基对的碱基U后发生断裂，并在P10与P10反义链互补结构中的P10反义链5’端处发生断裂，P1反义链与P10反义链连接。图2B示出对IGS序列及其互补配对区序列的改造示意图，其中可以加以改造的核苷酸以N示出。Figure 2A shows the structure of the self-splicing intron and its adjacent exons in the *Bacillus anthracis* RecA gene, including the P1 sequence (GUUUCC) and P10 sequence (UGG) in the IGS sequence. The region in the 5' adjacent exon that is anticomplementary to P1 in the IGS is called the P1 antisense strand (GGAAAU), and the region in the 3' adjacent exon that is anticomplementary to P10 in the IGS is called the P10 antisense strand (CCA). During the self-splicing reaction, P1 and P1 antisense strands, and P10 and P10 antisense strands, pair complementaryly. A break occurs after the U base in the conserved G-U base pair in the complementary structure of P1 and P1 antisense strands, and a break occurs at the 5' end of the P10 antisense strand in the complementary structure of P10 and P10 antisense strands. The P1 antisense strands then connect with the P10 antisense strands. Figure 2B shows a schematic diagram of modifications to the IGS sequence and its complementary pairing regions, where nucleotides that can be modified are indicated by N.

图3A示出IGS未改造的载体的构造(左)、以及由该载体得到的环状RNA的结构(右)，其中载体可以按序包含自剪接型内含子的3'内含子及其相邻的3'外显子(E3)、编码IRES的序列、编码EGFP的序列、相邻的5'外显子(E5)、以及自剪接型内含子的5'内含子，其中IGS与E3以及E5的序列反向互补。具体地，载体可以依次包含3'内含子和3'外显子(E3或P10反义链)(SEQ ID NO:3)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子(E5或P1反义链)和5'内含子(SEQ ID NO:6)。在3'内含子和3'外显子的序列(SEQ ID NO:3)中，3'端的大写下划线序列为E3或P10反义链序列。在5'外显子和5'内含子序列 (SEQ ID NO:6)中，5'端的大写斜体为E5或P1反义链序列，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与E3反向互补的序列，IGS中的大写加粗的为与E5反向互补的序列。Figure 3A shows the construction of an unmodified IGS vector (left) and the structure of the circular RNA obtained from the vector (right). The vector can sequentially contain a 3' intron of a self-splicing intron and its adjacent 3' exon (E3), a sequence encoding IRES, a sequence encoding EGFP, an adjacent 5' exon (E5), and the 5' intron of the self-splicing intron, wherein the IGS sequence is inversely complementary to the sequences of E3 and E5. Specifically, the vector can sequentially contain a 3' intron and a 3' exon (E3 or P10 antisense strand) (SEQ ID NO:3), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon (E5 or P1 antisense strand), and a 5' intron (SEQ ID NO:6). The sequences of the 3' intron and 3' exon... In (SEQ ID NO:3), the capitalized underscore sequence at the 3' end is the E3 or P10 antisense strand sequence. The 5' exon and 5' intron sequences... In (SEQ ID NO:6), the uppercase italic at the 5' end represents the E5 or P1 antisense sequence, and the underlined part represents the IGS sequence. The lowercase bold underlined part of the IGS sequence represents the sequence that is the anticomplement of E3, and the uppercase bold part of the IGS sequence represents the sequence that is the anticomplement of E5.

图3B示出去除E5且IGS序列改造成与E3以及编码EGFP的序列的3'端序列反向互补的可环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子和3'外显子(E3或P10反义链)(SEQ ID NO:3)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、5'内含子(SEQ ID NO:9)。在3'内含子和3'外显子的序列(SEQ ID NO:3)中，3'端的大写下划线序列为E3或P10反义链序列。在5'内含子序列 (SEQ ID NO:9)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与E3反向互补的序列(即为P10序列)，IGS中的大写加粗的为与编码EGFP的序列的3'序列反向互补的序列(即为P1序列)。Figure 3B shows the construction (left) of a vector for preparing circular RNA with E5 removed and the IGS sequence modified to be inversely complementary to the 3' end sequence of E3 and the sequence encoding EGFP, and the structure of the circular RNA obtained from this vector (right). Specifically, the vector may sequentially contain a 3' intron and a 3' exon (E3 or P10 antisense strand) (SEQ ID NO:3), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks two AA segments at the 3' end compared to SEQ ID NO:5), and a 5' intron (SEQ ID NO:9). The sequences of the 3' intron and 3' exon... In (SEQ ID NO:3), the capitalized underscore sequence at the 3' end is the E3 or P10 antisense sequence. The 5' intron sequence... In (SEQ ID NO:9), the underlined part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is inversely complementary to E3 (i.e., the P10 sequence), and the uppercase bold part of IGS is the sequence that is inversely complementary to the 3' sequence of the sequence encoding EGFP (i.e., the P1 sequence).

图3C示出去除E3且IGS序列改造成与编码IRES的序列的5'端序列以及E5反向互补的环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子(SEQ ID NO:10)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子和5'内含子(SEQ ID NO:11)。其中3'内含子的序列如Aaagagatgaagagatagtccggactatagagatggaaaatctatagatagtg(SEQ ID NO:10)所示。在5'外显子和5'内含子的序列 (SEQ ID NO:11)中，5'端的大写斜体为E5或P1反义链序列，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与E5反向互补的序列。Figure 3C shows the construction (left) of a vector for preparing circular RNA with E3 removed and the IGS sequence modified to be the 5' end sequence encoding IRES, and the reverse complement of E5, and the structure (right) of the circular RNA obtained from this vector. Specifically, the vector may sequentially contain a 3' intron (SEQ ID NO:10), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon, and a 5' intron (SEQ ID NO:11). The sequence of the 3' intron is shown as Aaagagatgaagagatagtccggactatagagatggaaaatctatagatagtg (SEQ ID NO:10). The sequences of the 5' exon and 5' intron are also shown. In (SEQ ID NO:11), the uppercase italic at the 5' end is the E5 or P1 antisense sequence, and the underlined one is the IGS sequence. The lowercase bold underlined one in the IGS is the sequence that is the reverse complement of the 5' end sequence encoding IRES, and the uppercase bold one in the IGS is the sequence that is the reverse complement of E5.

图3D示出去除E3和E5、将编码EGFP的序列的最3'端的两个AA置于编码IRES的序列的5'端、且IGS改造成与编码IRES的序列的5'端序列及其5'侧两个AA以及编码EGFP的序列的3'端序列反向互补的环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子(SEQ ID NO:10)、两个AA核苷酸、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、5'内含子(SEQ ID NO:13)。在5'内含子的序列 (SEQ ID NO:13)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码IRES的序列的5'端序列以及位于其5'侧两个AA反向互补的序列，IGS中的大写加粗的为与编码EGFP序列中的3'端序列反向互补的序列。Figure 3D shows the construction (left) of a vector for preparing circular RNA, in which E3 and E5 are removed, the two AA nucleotides at the 3' end of the EGFP-encoding sequence are placed at the 5' end of the IRES-encoding sequence, and the IGS is modified to be inversely complementary to the 5' end sequence of the IRES-encoding sequence and its two AA nucleotides on its 5' side, as well as the 3' end sequence of the EGFP-encoding sequence, and the structure (right) of the circular RNA obtained from this vector. Specifically, the vector may sequentially contain a 3' intron (SEQ ID NO:10), two AA nucleotides, a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks the two AA nucleotides at the 3' end compared to SEQ ID NO:5), and a 5' intron (SEQ ID NO:13). The sequence of the 5' intron... In (SEQ ID NO:13), the underlined part is the IGS sequence, where the lowercase bold underlined part of IGS is the 5' end sequence of the sequence encoding IRES and the two AA sequences located on its 5' side are inversely complementary. The uppercase bold part of IGS is the sequence inversely complementary to the 3' end sequence of the sequence encoding EGFP.

1.1载体构建1.1 Carrier Construction

构建图3B和图3C中所示的两个载体。Construct the two carriers shown in Figures 3B and 3C.

具体地，对于图3B中示出的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子和3’外显子(SEQ ID NO:3)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、5'内含子(SEQ ID NO:9)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。Specifically, for the vector shown in Figure 3B, a synthesized coding DNA fragment is formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron and a 3' exon (SEQ ID NO:3), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks two AA segments at the 3' end compared to SEQ ID NO:5), a 5' intron (SEQ ID NO:9), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand are cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product is obtained by EcoRV digestion.

对于图3C中示出的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:10)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子和5'内含子(SEQ ID NO:11)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补片段克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。For the vector shown in Figure 3C, a synthesized coding DNA fragment was formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:10), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon and a 5' intron (SEQ ID NO:11), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary fragment were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion.

上述的基因合成、载体构建和质粒线性化均由GenScript完成。The gene synthesis, vector construction, and plasmid linearization described above were all performed using GenScript.

1.2体外转录1.2 In vitro transcription

将上述获得的线性化质粒载体进行体外转录(IVT)，获得RNA产物。具体地，以表1所示的反应体系，37℃孵育2h，之后加入2μl DNA酶I(无RNA酶，ON-109，Hongene)，混匀，37℃孵育30min。The linearized plasmid vector obtained above was subjected to in vitro transcription (IVT) to obtain RNA product. Specifically, the reaction system shown in Table 1 was incubated at 37°C for 2 h, followed by the addition of 2 μl of DNase I (RNase-free, ON-109, Hongene), mixed well, and incubated at 37°C for 30 min.

表1.体外转录的反应体系
Table 1. Reaction system for in vitro transcription

1.3 LiCl沉淀纯化RNA1.3 LiCl precipitation purification of RNA

对上述获得的IVT原液，经LiCl沉淀进行纯化得到RNA产物。具体地，向IVT原液中加入22μl无酶水(AM9937，Thermo Fisher)及20μl 8M LiCl溶液(无DNA酶及RNA酶，ST498-100 ml，Beyotime)，使LiCl浓度为2.5M，混匀，置于-20℃孵育30min以上。4℃、12000g离心15min，弃上清。加入500μl 70％乙醇，颠倒混匀，4℃、12000g离心3min，弃上清；重复该步骤。4℃、12000g离心1min，吸尽上清，加入100μl无酶水，溶解RNA。The IVT stock solution obtained above was purified by LiCl precipitation to obtain RNA product. Specifically, 22 μl of enzyme-free water (AM9937, Thermo Fisher) and 20 μl of 8M LiCl solution (DNase and RNase-free, ST498-100 ml, Beyotime) were added to the IVT stock solution to make the LiCl concentration 2.5M. The mixture was stirred and incubated at -20℃ for at least 30 min. The mixture was centrifuged at 12000g for 15 min at 4℃, and the supernatant was discarded. 500 μl of 70% ethanol was added, the mixture was inverted and stirred, and centrifuged at 12000g for 3 min at 4℃, and the supernatant was discarded; this step was repeated. The mixture was centrifuged at 12000g for 1 min at 4℃, the supernatant was aspirated, and 100 μl of enzyme-free water was added to dissolve the RNA.

使用Nanodrop ONEC微量紫外-可见光分光光度计(ND-ONEC-W，Thermo scientific)检测得到纯化后的RNA的浓度。The concentration of purified RNA was determined using a Nanodrop ONEC micro-ultraviolet-visible spectrophotometer (ND-ONEC-W, Thermo Scientific).

1.4 RNA环化1.4 RNA circularization

对上述取得的纯化RNA进行环化处理(一步环化法)。具体地，取20μg纯化后的RNA，用无酶水调整至约0.5-2μg/μl，70℃孵育5min，随后立即置于冰上3min。加入GTP(100mM)至终浓度为2mM，并加入含有Mg²⁺的T4 RNA连接酶缓冲液(10×)(B0216S，NEB)，使得Mg²⁺终浓度为10mM。55℃孵育10min。The purified RNA obtained above was subjected to circularization (one-step circularization method). Specifically, 20 μg of purified RNA was taken and adjusted to approximately 0.5-2 μg/μl with enzyme-free water, incubated at 70°C for 5 min, and then immediately placed on ice for 3 min. GTP (100 mM) was added to a final concentration of 2 mM, and T4 RNA ligase buffer (10×) (BO216S, NEB) containing Mg2 ⁺ was added to bring the final Mg2 ⁺ concentration to 10 mM. The mixture was incubated at 55°C for 10 min.

产物经上述LiCl沉淀进行纯化。具体地，向RNA环化液中加入8M LiCl溶液，使LiCl浓度为2.5M。纯化后的RNA浓度可由Nanodrop检测。The product was purified by LiCl precipitation. Specifically, 8M LiCl solution was added to the RNA circularization solution to make the LiCl concentration 2.5M. The concentration of the purified RNA could be detected by Nanodrop.

1.5 RNA酶R消化处理1.5 RNase R digestion treatment

取1.4得到的LiCl纯化后RNA产物，加入0.6μl Tris-HCl、1μl KCl-3M、0.003μlMgCl₂-1M、和0.3μl RNase R(48U/μl)，吹打混匀，37℃孵育15min。Take the purified RNA product obtained from LiCl in step 1.4, add 0.6 μl Tris-HCl, 1 μl KCl-3M, _0.003 μl MgCl2-1M, and 0.3 μl RNase R (48 U/μl), mix well by pipetting, and incubate at 37°C for 15 min.

产物经上述LiCl沉淀进行纯化。具体地，向RNA液中加入8M LiCl溶液，使LiCl浓度为2.5M。纯化后的RNA浓度可由Nanodrop检测。The product was purified by LiCl precipitation. Specifically, 8M LiCl solution was added to the RNA solution to make the LiCl concentration 2.5M. The concentration of the purified RNA could be detected by Nanodrop.

1.6凝胶电泳分析1.6 Gel electrophoresis analysis

取上述1.3、1.4和1.5得到的RNA，通过E-Gel^TMEX凝胶电泳进行分析。具体地，取纯化后的RNA 150-200ng，用无酶水调整至10μl，随后加入10μl凝胶上样缓冲液II(AM8546G，ThermoFisher)，混匀。取出E-Gel^TMEX琼脂糖凝胶(2％，G401002，ThermoFisher)，去除梳子，将凝胶放入电泳系统的凝胶卡槽,在胶孔中依次加入20μl混匀的样品。设置电泳时间10-20min，开始电泳。电泳结束后，打开滤光片冷却5-10min。使用E-Gel^TMEX设备(G8300，ThermoFisher)自带的图像采集系统观察电泳结果，并采集图片。The RNA obtained in steps 1.3, 1.4, and 1.5 above was analyzed by E-Gel ^™ EX gel electrophoresis. Specifically, 150-200 ng of purified RNA was taken, adjusted to 10 μl with enzyme-free water, and then 10 μl of gel loading buffer II (AM8546G, ThermoFisher) was added and mixed well. The E-Gel ^™ EX agarose gel (2%, G401002, ThermoFisher) was removed, the comb was removed, and the gel was placed into the gel slot of the electrophoresis system. 20 μl of the mixed sample was added to each well. The electrophoresis time was set to 10-20 min, and electrophoresis began. After electrophoresis, the filter was turned off and the gel was cooled for 5-10 min. The electrophoresis results were observed and images were captured using the image acquisition system built into the E-Gel ^™ EX device (G8300, ThermoFisher).

根据图3B构建的载体所产生的RNA的E-Gel^TMEX凝胶电泳胶图示于图4A，根据图3C构建的载体所产生的RNA的E-Gel^TMEX凝胶电泳胶图示于图4B。各图中的三个泳道从左到右分别为1.3、1.4和1.5得到的RNA，其中最上面的条带为IVT得到的线性RNA，由于含有自剪接内含子，比环化后的环状RNA条带稍微大一点，且移动较慢。从图中可以看出，经RNA酶R酶处理的RNA中几乎不含线性前体RNA，环状RNA得到富集。Figure 4A shows the E-Gel ^™ EX gel electrophoresis results of RNA generated from the vector constructed according to Figure 3B, and Figure 4B shows the E-Gel ^™ EX gel electrophoresis results of RNA generated from the vector constructed according to Figure 3C. The three lanes in each figure, from left to right, represent RNA obtained in lanes 1.3, 1.4, and 1.5, respectively. The top band represents the linear RNA obtained by IVT; due to the presence of self-splicing introns, it is slightly larger than the circular RNA band and moves more slowly. The figures show that the RNA treated with RNase R contains almost no linear precursor RNA, while circular RNA is enriched.

1.7毛细管凝胶电泳1.7 Capillary gel electrophoresis

为了检测RNA的成环效率，对环化后的RNA用Agilent 5200片段分析仪(M53100AA，Agilent)和Agilent DNF-471RNA试剂盒(15nt，DNF-471-0500，Agilent)进行毛细管凝胶电泳实验。To detect the efficiency of RNA circularization, the circularized RNA was subjected to capillary gel electrophoresis using an Agilent 5200 fragment analyzer (M53100AA, Agilent) and an Agilent DNF-471 RNA kit (15nt, DNF-471-0500, Agilent).

具体地，取1.4中的纯化RNA，用无酶水调整至浓度为100-150ng/μl。在样品板中加入22μL稀释液标准物(RNA稀释液标准物)、2μL待测样品和2μLRNA分子量标准物，混合后置于80℃反应2min，之后立即放在4℃冰盒上，瞬时降温。随后参照Agilent 5200片段分析仪说明书，进行毛细管电泳实验。Specifically, take the purified RNA from step 1.4 and adjust its concentration to 100-150 ng/μl with enzyme-free water. Add 22 μL of dilution standard (RNA dilution standard), 2 μL of the sample to be tested, and 2 μL of RNA molecular weight standard to the sample plate. After mixing, incubate at 80°C for 2 min, then immediately place on an ice box at 4°C for instantaneous cooling. Subsequently, perform capillary electrophoresis according to the Agilent 5200 fragment analyzer manual.

根据图3B构建的载体所产生的RNA的毛细管凝胶电泳图示于图4C，根据图3C构建的载体所产生的RNA的毛细管凝胶电泳图示于图4D。The capillary gel electrophoresis diagram of RNA generated from the vector constructed according to Figure 3B is shown in Figure 4C, and the capillary gel electrophoresis diagram of RNA generated from the vector constructed according to Figure 3C is shown in Figure 4D.

从图中的条带大小可以看出，环化前的体外转录产物条带略大于环化后的环状RNA条带，因为线性前体RNA中含有自剪接内含子序列。环化后环状RNA的纯度均在50％以上，即分别为62.2％和75.3％。As can be seen from the band sizes in the figure, the band of the in vitro transcription product before circularization is slightly larger than the band of the circular RNA after circularization, because the linear precursor RNA contains self-splicing intron sequences. The purity of the circular RNA after circularization is above 50%, specifically 62.2% and 75.3%, respectively.

1.8环状RNA的测序1.8 Sequencing of circular RNA

对1.4得到的环状RNA进行逆转录、PCR、和胶回收，并对环化接头位置测序。使用II第一链cDNA合成试剂盒(加gDNA去除剂)(R212-01，诺唯赞)，对上述发生体外自剪接反应而得到的产物进行逆转录。具体地，按表2所示配置反应体系，移液器吹打混匀，42℃加热2min。The circular RNA obtained in step 1.4 was reverse transcribed, PCR-mediated, and gel-recovered, and the circular adapter sites were sequenced. II. First-strand cDNA Synthesis Kit (with gDNA removal agent) (R212-01, Novizan) was used to reverse transcribe the product obtained from the above in vitro self-splicing reaction. Specifically, the reaction system was prepared as shown in Table 2, mixed by pipetting, and heated at 42°C for 2 min.

表2.逆转录反应体系

Table 2. Reverse transcription reaction system

之后，在上述混合液中加入4μl 5×HiScript II qRT SuperMix II，移液器吹打混匀，50℃15 min，85℃5 sec。逆转录结束后，将所得的cDNA加入表3所示PCR反应体系，移液器吹打混匀，并按照表4所示的程序进行PCR反应。Next, add 4 μl of 5×HiScript II qRT SuperMix II to the above mixture, pipette to mix, and incubate at 50℃ for 15 min and 85℃ for 5 sec. After reverse transcription, add the obtained cDNA to the PCR reaction system shown in Table 3, pipette to mix, and perform the PCR reaction according to the procedure shown in Table 4.

表3.PCR反应体系
Table 3. PCR reaction system

表4.PCR程序设定
Table 4. PCR Program Settings

对全部反应产物进行电泳回收，于蓝光切胶仪(OSE-470L，天根生化科技有限公司)下切取目的条带，参照凝胶回收试剂盒(D2500-01/D2500-02，omega)使用说明书进行凝胶回收。回收的产物送Sanger测序，环化接头测序引物序列为SEQ ID NO:16和17。All reaction products were recovered by electrophoresis, and the target band was cut using a blue light gel cutter (OSE-470L, Tiangen Biotech Co., Ltd.) as a reference. Gel recovery was performed using the gel recovery kit (D2500-01/D2500-02, omega) according to the instructions. The recovered product was sent for Sanger sequencing, and the sequencing primer sequences for the circular adapter were SEQ ID NO: 16 and 17.

根据图3B和图3C构建的载体所产生的环状RNA测序结果分别如SEQ ID NO:44和45所示，其中斜体加粗为残留的相邻外显子序列。

The sequencing results of the circular RNA generated by the vectors constructed according to Figures 3B and 3C are shown in SEQ ID NO:44 and 45, respectively, where the italicized and bold text represents the residual adjacent exon sequences.

可以看出，在去掉E5并使得内含子IGS中的P1序列与编码EGFP的序列的3’端序列ACAAGT互补配对后，载体得到的RNA可以环化，且环状RNA中仅残留3’侧外显子序列CCA。在去掉E3并使得IGS中P10与编码IRES的序列的5’端TTA互补配对后，载体得到的RNA可以环化，且环状RNA中仅残留5’侧外显子序列GGAAAU。It can be seen that after removing E5 and making the P1 sequence in the intron IGS complementary to the 3' end sequence ACAAGT of the EGFP-encoding sequence, the resulting RNA can be circularized, and only the 3' exon sequence CCA remains in the circular RNA. After removing E3 and making the P10 in the IGS complementary to the 5' end TTA of the IRES-encoding sequence, the resulting RNA can be circularized, and only the 5' exon sequence GGAAAU remains in the circular RNA.

实施例2.对于贝氏柯克斯体23S核糖体基因中自剪接型内含子的内部引导序列的Example 2. Internal guide sequence for self-splicing introns in the Coxiella 23S ribosomal gene. 改造Transformation

类似地，参照实施例1，对贝氏柯克斯体23S核糖体基因中的自剪接型内含子进行IGS的改造。图5示出该内含子及其相邻外显子的结构，其中IGS中的P1序列(GUAGUUACG)与相邻外显子中P1反义链(CGUAACUAU)反向互补，IGS中的P10序列(ACCGUU)与相邻外显子中P10反义链(AACGGU)反向互补。Similarly, referring to Example 1, the self-splicing intron in the Coxionemal 23S ribosomal gene was modified by IGS. Figure 5 shows the structure of this intron and its adjacent exon, wherein the P1 sequence (GUAGUUACG) in the IGS is anticomplementary to the P1 antisense strand (CGUAACUAU) in the adjacent exon, and the P10 sequence (ACCGUU) in the IGS is anticomplementary to the P10 antisense strand (AACGGU) in the adjacent exon.

图6A示出IGS未改造的载体的构造(左)、以及由该载体得到的环状RNA的结构(右)，其中载体可以按序包含自剪接型内含子的3'内含子及其相邻的3'外显子(E3)、编码IRES的序列、编码EGFP的序列、相邻的5'外显子(E5)、以及自剪接型内含子的5'内含子，其中IGS与E3以及E5的序列反向互补。具体地，载体可以依次包含3'内含子和3'外显子(E3或P10反义链)(SEQ ID NO:18)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子(E5或P1反义链)和5'内含子(SEQ ID NO:19)。在3'内含子和3'外显子的序列 (SEQ ID NO:18)中，3'端的大写下划线序列为E3或P10反义链序列。在5'外显子和5'内含子序列 (SEQ ID NO:19)中，5'端的大写斜体为E5或P1反义链序列，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与E3反向互补的序列(P10)，IGS中的大写加粗的为与E5反向互补的序列(P1)。Figure 6A shows the structure of an unmodified IGS vector (left) and the structure of the circular RNA obtained from the vector (right). The vector can sequentially contain a 3' intron of a self-splicing intron and its adjacent 3' exon (E3), a sequence encoding IRES, a sequence encoding EGFP, an adjacent 5' exon (E5), and the 5' intron of the self-splicing intron, wherein the IGS sequence is inversely complementary to the sequences of E3 and E5. Specifically, the vector can sequentially contain a 3' intron and a 3' exon (E3 or P10 antisense strand) (SEQ ID NO:18), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon (E5 or P1 antisense strand), and a 5' intron (SEQ ID NO:19). The sequences of the 3' intron and 3' exon... In (SEQ ID NO:18), the capitalized underscore sequence at the 3' end is the E3 or P10 antisense strand sequence. The 5' exon and 5' intron sequences... In (SEQ ID NO:19), the uppercase italic at the 5' end represents the E5 or P1 antisense sequence, and the underlined part represents the IGS sequence. The lowercase bold underlined part of the IGS sequence represents the sequence that is the antisense complement of E3 (P10), and the uppercase bold part of the IGS sequence represents the sequence that is the antisense complement of E5 (P1).

图6B示出去除E5且IGS序列改造成与E3以及编码EGFP的序列的3'端序列反向互补的可环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子和3'外显子(E3或P10反义链)(SEQ ID NO:18)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、和5'内含子(SEQ ID NO:20)。在5'外显子和5'内含子序列 (SEQ ID NO:20)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与E3反向互补的序列，IGS中的大写加粗的为与编码EGFP序列中的3'端序列反向互补的序列。Figure 6B shows the construction (left) of a vector for preparing circularizable RNA with E5 removed and the IGS sequence modified to be inversely complementary to the 3' end sequence of E3 and the sequence encoding EGFP, and the structure (right) of the circular RNA obtained from this vector. Specifically, the vector may sequentially contain a 3' intron and a 3' exon (E3 or P10 antisense strand) (SEQ ID NO:18), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks two AA segments at the 3' end compared to SEQ ID NO:5), and a 5' intron (SEQ ID NO:20). The 5' exon and 5' intron sequences... In (SEQ ID NO:20), the underlined part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is inversely complementary to E3, and the uppercase bold part of IGS is the sequence that is inversely complementary to the 3' end sequence of the EGFP encoding sequence.

图6C示出去除E3且IGS序列改造成与编码IRES的序列的5'端序列以及E5反向互补的环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子(SEQ ID NO:21)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子和5'内含子(SEQ ID NO:22)。其中3'内含子的序列如aaattgaggtagtaaaaatctgcgaggtgcggacaatcagcaggaaagactaatctttttgattagaatcctcagagactaatacgccgagcgtacttagaaaaaatccattctaagtgtgatgatagagtccgatctctatagcgatatagagggttttcaaaagaaacctgttaatcattatcccatgattaacataacgtcaata(SEQ ID NO:21)所示。在5'外显子和5'内含子的序列 (SEQ ID NO:22)中，5'端的大写斜体为E5或P1反义链序列，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与E5反向互补的序列。Figure 6C shows the construction (left) of a vector for preparing circular RNA with E3 removed and the IGS sequence modified to be the 5' end sequence encoding the IRES sequence and E5 reverse complementary, and the structure (right) of the circular RNA obtained from the vector. Specifically, the vector may sequentially contain a 3' intron (SEQ ID NO:21), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon, and a 5' intron (SEQ ID NO:22). The sequence of the 3' intron is shown in SEQ ID NO:21. The sequences of the 5' exon and 5' intron are also shown. In (SEQ ID NO:22), the uppercase italic at the 5' end is the E5 or P1 antisense sequence, and the underlined one is the IGS sequence. The lowercase bold underlined one in the IGS is the sequence that is the reverse complement of the 5' end sequence encoding IRES, and the uppercase bold one in the IGS is the sequence that is the reverse complement of E5.

图6D示出去除E3和E5、将编码EGFP的序列的最3'端的两个AA置于编码IRES的序列的5'侧、且IGS改造成与编码IRES的序列的5'端序列及其5'侧两个AA以及编码EGFP的序列的3'端序列反向互补的环化RNA制备用载体的构造(左)，以及由该载体得到的环状RNA的结构(右)。具体地，载体可以依次包含3'内含子(SEQ ID NO:21)、两个AA核苷酸、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、5'内含子(SEQ ID NO:23)。在5'内含子的序列 (SEQ ID NO:23)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码IRES的序列的5'端序列以及位于其5'侧的两个AA反向互补的序列，IGS中的大写加粗的为与编码EGFP中的3'端序列反向互补的序列。Figure 6D shows the construction (left) of a circular RNA preparation vector with E3 and E5 removed, the two AA nucleotides at the 3' end of the EGFP-encoding sequence placed on the 5' side of the IRES-encoding sequence, and the IGS modified to be reverse complementary to the 5' end sequence of the IRES-encoding sequence and its two AA nucleotides on the 5' side, as well as the 3' end sequence of the EGFP-encoding sequence, and the structure (right) of the circular RNA obtained from this vector. Specifically, the vector may sequentially contain a 3' intron (SEQ ID NO:21), two AA nucleotides, a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks the two AA nucleotides at the 3' end compared to SEQ ID NO:5), and a 5' intron (SEQ ID NO:23). The sequence of the 5' intron... In (SEQ ID NO:23), the underlined part is the IGS sequence, where the lowercase bold underlined part of IGS is the 5' end sequence of the sequence encoding IRES and the two AA sequences located on its 5' side, which are inversely complementary. The uppercase bold part of IGS is the sequence inversely complementary to the 3' end sequence of the sequence encoding EGFP.

构建图6B和图6C示出的用于制备可环化RNA的载体。Construct the vectors for preparing circularizable RNA as shown in Figures 6B and 6C.

具体地，对于图6B示出的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子和3’外显子(SEQ ID NO:18)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:12，相比SEQ ID NO:5少3'端两个AA)、5'内含子(SEQ ID NO:20)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。Specifically, for the vector shown in Figure 6B, a synthesized coding DNA fragment is formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron and a 3' exon (SEQ ID NO:18), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:12, which lacks two AA segments at the 3' end compared to SEQ ID NO:5), a 5' intron (SEQ ID NO:20), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand are cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product is obtained by EcoRV digestion.

对于图6C中示出的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码CVB3 IRES的序列(SEQ ID NO:4)、编码绿色荧光蛋白(EGFP)的序列(SEQ ID NO:5)、5'外显子和5'内含子(SEQ ID NO:22)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。For the vector shown in Figure 6C, a synthesized coding DNA fragment was formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding green fluorescent protein (EGFP) (SEQ ID NO:5), a 5' exon and a 5' intron (SEQ ID NO:22), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion.

参照实施例1中的1.2至1.7，对得到的线性化质粒进行体外转录、环化条件处理、RNA酶R消化处理、E-Gel^TMEX凝胶电泳、以及毛细管电泳检测。Referring to steps 1.2 to 1.7 in Example 1, the obtained linearized plasmid was subjected to in vitro transcription, circularization treatment, RNase R digestion, E-Gel ^™ EX gel electrophoresis, and capillary electrophoresis detection.

根据图6B构建的载体所产生的RNA的E-Gel^TMEX凝胶电泳胶图示于图7A，根据图6C构建的载体所产生的RNA的E-Gel^TMEX凝胶电泳胶图示于图7B。各图中的三个泳道从左到右分别为1.3、1.4和1.5得到的RNA，其中最上面的条带为IVT得到的线性RNA，由于含有自剪接内含子，比环化后的环状RNA条带稍微大一点，且移动较慢。从图中可以看出，经RNA酶R酶处理的RNA中几乎不含线性前体RNA，环状RNA得到富集。Figure 7A shows the E-Gel ^™ EX gel electrophoresis results of RNA generated from the vector constructed according to Figure 6B, and Figure 7B shows the E-Gel ^™ EX gel electrophoresis results of RNA generated from the vector constructed according to Figure 6C. The three lanes in each figure, from left to right, represent RNA obtained in lanes 1.3, 1.4, and 1.5, respectively. The top band represents the linear RNA obtained by IVT; due to the presence of self-splicing introns, it is slightly larger than the circular RNA band and moves more slowly. The figures show that the RNA treated with RNase R contains almost no linear precursor RNA, while circular RNA is enriched.

根据图6B构建的载体所产生的RNA的毛细管凝胶电泳图示于图7C，根据图6C构建的载体所产生的RNA的毛细管凝胶电泳图示于图7D。从条带大小可以看出，环化前的体外转录产物条带略大于环化后的环状RNA条带，因为线性前体RNA中含有自剪接内含子序列。环化后环状RNA纯度均在50％以上，即分别为52.2％和60.6％。Figure 7C shows the capillary gel electrophoresis results of RNA generated from the vector constructed according to Figure 6B, and Figure 7D shows the capillary gel electrophoresis results of RNA generated from the vector constructed according to Figure 6C. The band sizes indicate that the band of the in vitro transcription product before circularization is slightly larger than the band of the circular RNA after circularization, because the linear precursor RNA contains self-splicing intron sequences. The purity of the circular RNA after circularization is above 50%, specifically 52.2% and 60.6%, respectively.

参照实施例1中1.8的方法步骤，对参照1.4得到的纯化RNA进行逆转录、PCR和胶回收，并对环化接头位置测序。其中PCR所用的引物及测序引物为SEQ ID NO:16和17。Following the method steps in 1.8 of Example 1, the purified RNA obtained in 1.4 was subjected to reverse transcription, PCR, and gel recovery, and the circular adapter sites were sequenced. The primers used for PCR and sequencing were SEQ ID NO: 16 and 17.

根据图6B和图6C构建的载体所产生的环状RNA测序结果分别如SEQ ID NO:24和25所示，其中斜体加粗为残留的相邻外显子序列。
The sequencing results of the circular RNA generated by the vectors constructed according to Figures 6B and 6C are shown in SEQ ID NO:24 and 25, respectively, where the italicized and bold text represents the residual adjacent exon sequences.

可以看出，在去掉E5并使得内含子IGS中的P1序列与编码EGFP的序列的3’端序列TGTACAAGT互补配对后，载体得到的RNA可以环化，且环状RNA中仅残留3’外显子序列AACGGU。在去掉E3并使得IGS中P10与编码IRES的序列的5’端TTAAAA互补配对后，载体得到的RNA可以环化，且环状RNA中仅残留5’外显子序列CGUAACUAU。It can be seen that after removing E5 and making the P1 sequence in the intron IGS complementary to the 3' end sequence TGTACAAGT of the EGFP-encoding sequence, the resulting RNA can be circularized, and only the 3' exon sequence AACGGU remains in the circular RNA. Similarly, after removing E3 and making the P10 sequence in the IGS complementary to the 5' end TTAAAA of the IRES-encoding sequence, the resulting RNA can be circularized, and only the 5' exon sequence CGUAACUAU remains in the circular RNA.

实施例3.含有IGS改造的炭疽杆菌RecA基因自剪接型内含子的环状RNA制备用Example 3. Preparation of circular RNA containing self-splicing introns of the RecA gene from Bacillus anthracis modified with IGS. 载体的构建Carrier construction

构建结构如图8A所示的载体，以验证同时去除炭疽杆菌RecA基因中与自剪接型内含子相邻的外显子E3和E5的RNA环化效果，其与图3D所示的结构原理相同，编码的目的基因从EGFP换成荧光素酶(Fluc)。具体地，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:10)、两个AA核苷酸、编码CVB3 IRES的序列(SEQ ID NO:4)、编码荧光素酶(Fluc)的序列(SEQ ID NO:29，相比SEQ ID NO:28少3'端两个AA)、5'内含子(SEQ ID NO:30)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:30)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码IRES的序列的5'端序列以及位于其5'的两个AA反向互补的序列，IGS中的大写加粗的为与编码Fluc序列中的3'端序列反向互补的序列。A vector with the structure shown in Figure 8A was constructed to verify the effect of simultaneously removing RNA circularization from exons E3 and E5 adjacent to the self-splicing intron in the *Bacillus anthracis* RecA gene. The structural principle is the same as that shown in Figure 3D, except that the target gene encoded by *Fluc* was replaced with luciferase. Specifically, a coding DNA fragment was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:10), two AA nucleotides, a sequence encoding CVB3 IRES (SEQ ID NO:4), a sequence encoding luciferase (Fluc) (SEQ ID NO:29, lacking the two AA nucleotides at the 3' end compared to SEQ ID NO:28), a 5' intron (SEQ ID NO:30), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:30), the underlined part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is inversely complementary to the 5' end sequence of the sequence encoding IRES and the two AAs located at its 5'. The uppercase bold part of IGS is the sequence that is inversely complementary to the 3' end sequence of the sequence encoding Fluc.

同时，同样以制备无相邻外显子E3和E5的环状RNA为目的，制备构建另外2种载体，如图8B和图8C所示。Meanwhile, with the same aim of preparing circular RNA without adjacent exons E3 and E5, two other vectors were prepared and constructed, as shown in Figure 8B and Figure 8C.

图8B所示的载体，将IRES截断放在编码目的基因如Fluc的开放阅读框(ORF)的两端，并保证5’IRES的最后一个碱基为T，同时将自剪接型内含子的IGS中的P1和P10序列分别改造成与5’IRES的3’端序列、以及3’IRES的5’端序列反向互补。The vector shown in Figure 8B places the IRES truncated at both ends of the open reading frame (ORF) encoding the target gene such as Fluc, ensuring that the last base of 5’IRES is T. At the same time, the P1 and P10 sequences in the IGS of the self-splicing intron are modified to be reverse complementary to the 3’ end sequence of 5’IRES and the 5’ end sequence of 3’IRES, respectively.

在图8C所示的载体，将编码目的基因如Fluc的ORF截断放在IRES的两端，并保证5’ORF的最后一个碱基为T，同时将自剪接型内含子的IGS中的P1和P10序列分别改造成与5’ORF的3’端、以及3’ORF的5’端序列反向互补。In the vector shown in Figure 8C, the ORF encoding the target gene such as Flu is truncated and placed at both ends of IRES, ensuring that the last base of the 5’ORF is T. At the same time, the P1 and P10 sequences in the IGS of the self-splicing intron are modified to be reverse complementary to the 3’ end of the 5’ORF and the 5’ end of the 3’ORF, respectively.

对于图8B所示的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:10)、3’CVB3 IRES编码序列(SEQ ID NO:26)、编码荧光素酶(Fluc)的序列(SEQ ID NO:28)、5’CVB3 IRES组件(SEQ ID NO:27)、5'内含子(SEQ ID NO:31)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:31)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。For the vector shown in Figure 8B, a synthesized coding DNA fragment was formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:10), a 3' CVB3 IRES coding sequence (SEQ ID NO:26), a sequence encoding luciferase (Fluc) (SEQ ID NO:28), a 5' CVB3 IRES assembly (SEQ ID NO:27), a 5' intron (SEQ ID NO:31), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:31), the underlined part is the IGS sequence, where the lowercase bold underlined part of the IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of the IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

对于图8C所示的载体，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:10)、3'荧光素酶(Fluc)编码序列(SEQ ID NO:32)、编码CVB3 IRES的序列(SEQ ID NO:4)、5'荧光素酶(Fluc)编码序列(SEQ ID NO:33)、5'内含子(SEQ ID NO:34)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:34)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'Fluc的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'Fluc的序列中的3'端序列反向互补的序列。For the vector shown in Figure 8C, a synthesized coding DNA fragment was formed, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:10), a 3' luciferase (Fluc) coding sequence (SEQ ID NO:32), a sequence encoding CVB3 IRES (SEQ ID NO:4), a 5' luciferase (Fluc) coding sequence (SEQ ID NO:33), a 5' intron (SEQ ID NO:34), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:34), the underlined part is the IGS sequence, where the lowercase bold underlined part of the IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'Fluc, and the uppercase bold part of the IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'Fluc.

参照实施例1中的1.2至1.7，对得到的线性化质粒进行体外转录、环化条件处理(用于内含子剪接)、RNA酶R消化处理、E-Gel^TMEX凝胶电泳、以及毛细管电泳检测。Referring to steps 1.2 to 1.7 in Example 1, the obtained linearized plasmid was subjected to in vitro transcription, circularization (for intron splicing), RNase R digestion, E-Gel ^™ EX gel electrophoresis, and capillary electrophoresis detection.

根据图8A-8C构建的载体所产生的RNA的E-Gel^TM凝胶电泳结果分别如图9A-9C所示。可以看出，3种载体均能够产生环状RNA。The E-Gel ^™ electrophoresis results of the RNA generated by the vectors constructed according to Figures 8A-8C are shown in Figures 9A-9C, respectively. It can be seen that all three vectors can produce circular RNA.

根据图8A-8C构建的载体所产生的RNA的毛细管电泳检测的结果分别如图9D-9F所示，环化后的RNA纯度分别为45.9％、57.3％和58.5％，可见根据图8B和8C构建的载体可以产出环状RNA，且环状RNA的纯度较高，能达到生产要求。ORF-Fluc的常规生产环化率在50％左右。The capillary electrophoresis results of the RNA generated by the vectors constructed according to Figures 8A-8C are shown in Figures 9D-9F, respectively. The purity of the circularized RNA was 45.9%, 57.3%, and 58.5%, respectively. It can be seen that the vectors constructed according to Figures 8B and 8C can produce circular RNA with high purity, meeting production requirements. The conventional production circularization rate of ORF-Fluc is around 50%.

参照实施例1中1.8的方法步骤，对参照1.4得到的纯化RNA进行逆转录、PCR和胶回收，并对环化接头位置测序。其中，对于图8A构建的载体所产出的环状RNA，PCR所用的引物及测序引物均如SEQ ID NO:17和14所示，对于图8B构建的载体所产出的环状RNA，PCR所用的引物及测序引物均如SEQ ID NO:43和14所示，对于图8C构建的载体所产出的环状RNA，PCR所用的引物及测序引物如SEQ ID NO:15和35所示。Following the method steps in 1.8 of Example 1, the purified RNA obtained according to 1.4 was subjected to reverse transcription, PCR, and gel recovery, and the circular adapter positions were sequenced. Specifically, for the circular RNA produced by the vector constructed in Figure 8A, the primers used for PCR and sequencing are shown in SEQ ID NO: 17 and 14; for the circular RNA produced by the vector constructed in Figure 8B, the primers used for PCR and sequencing are shown in SEQ ID NO: 43 and 14; and for the circular RNA produced by the vector constructed in Figure 8C, the primers used for PCR and sequencing are shown in SEQ ID NO: 15 and 35.

由图8A-8C的载体所产生的环状RNA的接头处测序结果分别如SEQ ID NO:36-38所示。可以看到，3个环化产物均不含外源残留序列(即无自剪接型相邻外显子残留)。The sequencing results of the adapter sites of the circular RNAs generated from the vectors in Figures 8A-8C are shown in SEQ ID NO:36-38, respectively. As can be seen, none of the three circularized products contain exogenous residual sequences (i.e., no self-splicing type adjacent exon residues).

Gttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtcaagtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacgcaagaaaaatcagagagatcctcataaaggccaagaagggcggaaagatcgccgtgt/aaTTAAAACAGCCTGTGGGTTGATCCCACCCACAGGCCCATTGGGCGCTAGCACTCTGGTATCACGGTACCTTTGTG(SEQ ID NO:36，小写为Fluc3’端序列，大写为IRES5’端序列，/为连接处)
Gttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtcaagtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacgcaagaaaaatcagagagatcctcataaaggccaagaagggcggaaagatcg ccgtgt / aaT TAAAACAGCCTGTGGGTTGATCCCACCCACAGGCCCATTGGGCGCTAGCACTCTGGTATCACGGTACCTTTGTG(SEQ ID NO:36, lowercase is the Fluc3' end sequence, uppercase is the IRES5' end sequence, / is the linker)

(SEQ ID NO:37，小写为Fluc序列，大写为IRES序列，其中下划线加粗的为5’端IRES序列，/为连接处)(SEQ ID NO:37, lowercase for Fluc sequence, uppercase for IRES sequence, where the underlined and bolded part is the 5' IRES sequence, and / indicates the junction)

Acatcacttacgctgagtacttcgaaatgtccgttcggttggcagaagctatgaaacgatatgggctgaatacaaatcacagaatcgtcgtatgcagtgaaaactctcttcaattctttatgccggtgttgggcgcgttatttatcggagttgcagttgcgcccgcgaacgacatttataatgaacgtgaattgctcaacagtatgggcatttcgcagcctaccgtggtgttcgttt/CCAAAAAGGGGTTGCAAAAAATTTTGAACGTGCAAAAAAAGCTCCCAATCATCCAAAAAATTATTATCATGGATTCTAAAACGGATTACCAGGGATTTCAGTCGATGTACACGTTCGTCACATCTCATCTACCTCCCGGTTTTAATGAATACGATTTTGTGCCAGAGTCCTTCGATAGGGACAAGACAATTGCACTGATCATGAACTCCTCT(SEQ ID NO:38，小写为Fluc 5’端序列，大写为Fluc 3’端序列，/为连接处)Acatcacttacgctgagtacttcgaaatgtccgttcggttggcagaagctatgaaacgatatgggctgaatacaaatcacagaatcgtcgtatgcagtgaaaactctcttcaattctttatgcc ggtgttgggcgcgttatttatcggagttgcagttgcgcccgcgaacgacatttataatgaacgtgaattgctcaacagtatgggcatttcgcagcctaccgtggtgttcgttt/CCAAAAAGGGG TTGCAAAAAATTTTGAACGTGCAAAAAAAGCTCCCAATCATCCAAAAAATTATTATCATGGATTCTAAAACGGATTACCAGGGATTTCAGTCGATGTACACGTTCGTCACATCTCATCTACCTCCCGGTTTTAATGAATACGATTTTGTGCCAGAGTCCTTCGATAGGGACAAGACAATTGCACTGATCATGAACTCCTCT (SEQ ID NO:38, lowercase represents the 5' end sequence of Fluc, uppercase represents the 3' end sequence of Fluc, / represents the linker)

实施例4.含有IGS改造的贝氏柯克斯体23S核糖体基因自剪接型内含子的环状Example 4. Circular introns containing self-splicing introns of the IGS-modified Bethlehem Coxison 23S ribosomal gene. RNA制备用载体的构建Construction of vectors for RNA preparation

采用贝氏柯克斯体23S核糖体基因中的自剪接型内含子，构建如图8B所示的载体，以验证同时去除与自剪接型内含子相邻外显子E3和E5的RNA环化效果。Using the self-splicing intron from the Coxison 23S ribosomal gene, a vector as shown in Figure 8B was constructed to verify the RNA circularization effect of simultaneously removing exons E3 and E5 adjacent to the self-splicing intron.

具体地，合成编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’CVB3 IRES的序列(SEQ ID NO:40)、编码绿色荧光蛋白EGFP的序列(SEQ ID NO:5)、编码5’CVB3 IRES的序列(SEQ ID NO:41)、5'内含子(SEQ ID NO:39)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列(SEQ ID NO:39)中，加下划线的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, a synthetic coding DNA fragment was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3' CVB3 IRES (SEQ ID NO:40), a sequence encoding green fluorescent protein EGFP (SEQ ID NO:5), a sequence encoding 5' CVB3 IRES (SEQ ID NO:41), a 5' intron (SEQ ID NO:39), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:39), the underlined part is the IGS sequence, where the lowercase bold underlined part of the IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of the IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

所构建载体所产出的RNA的E-Gel^TM凝胶电泳结果如图10A所示，存在环状RNA条带。The E-Gel ^™ gel electrophoresis results of the RNA produced by the constructed vector are shown in Figure 10A, showing the presence of circular RNA bands.

所构建载体所产出的RNA的毛细管电泳检测的结果如图10B所示，环化后的RNA纯度为64.5％，纯度较高，能达到生产要求。The results of capillary electrophoresis of the RNA produced by the constructed vector are shown in Figure 10B. The purity of the circularized RNA was 64.5%, which is high and meets the production requirements.

参照实施例1中1.8的方法步骤，对参照1.5得到的纯化RNA进行逆转录、PCR和胶回收，并对环化接头位置测序。PCR所用的引物及测序引物为SEQ ID NO:16和17，接头处的测序结果如SEQ ID NO:42所示。可以看到，环化产物不含外源残留序列(即无自剪接型相邻外显子残留)。Following the method steps in 1.8 of Example 1, the purified RNA obtained in 1.5 was subjected to reverse transcription, PCR, and gel recovery, and the circular adapter site was sequenced. The primers used for PCR and sequencing were SEQ ID NO: 16 and 17, and the sequencing results at the adapter site are shown in SEQ ID NO: 42. It can be seen that the circularized product does not contain exogenous residual sequences (i.e., no self-splicing type adjacent exon residues).

(SEQ ID NO:42，小写为EGFP序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:42, lowercase represents the EGFP sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates the linker)

实施例5.基于IRES不同截断位置的环状RNA制备载体的构建Example 5. Construction of circular RNA preparation vectors based on different IRES truncation positions

为测试图8B所示载体中IRES不同断开位点是否仍然可以制备无外源残留序列的环状RNA，同时测试IRES不同断开位点的环化效果。采用贝氏柯克斯体23S核糖体基因中的自剪接型内含子和截断位置为第18个碱基后断开的CVB3 IRES，分别构建3种如图8B所示的载体Coxie-B18-Gluc、Coxie-B18-EGFP、Coxie-B18-Fluc，以验证同时去除与自剪接型内含子相邻外显子E3和E5的RNA环化效果。To test whether circular RNA without exogenous residual sequences could still be prepared from different IRES breakpoints in the vector shown in Figure 8B, and to test the circularization effect of different IRES breakpoints, three vectors, Coxie-B18-Gluc, Coxie-B18-EGFP, and Coxie-B18-Fluc, were constructed using a self-splicing intron from the Coxsell 23S ribosomal gene and a CVB3 IRES truncated at the 18th base. These vectors were used to verify the RNA circularization effect of simultaneously removing exons E3 and E5 adjacent to the self-splicing intron.

具体地，合成Coxie-B18-Gluc载体编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’CVB3 IRES的序列(SEQ ID NO:46)、编码高斯荧光素酶Gluc的序列(SEQ ID NO:49)、编码5’CVB3 IRES的序列(SEQ ID NO:47)、5'内含子(SEQ ID NO:48)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:48)中，加粗的为IGS序列，其中IGS中的小写加粗加下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, a DNA fragment encoding the Coxie-B18-Gluc vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3' CVB3 IRES (SEQ ID NO:46), a sequence encoding Gaussian luciferase Gluc (SEQ ID NO:49), a sequence encoding 5' CVB3 IRES (SEQ ID NO:47), a 5' intron (SEQ ID NO:48), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:48), the bold part is the IGS sequence, where the lowercase bold and underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-B18-EGFP载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’CVB3 IRES的序列(SEQ ID NO:46)、编码绿色荧光蛋白EGFP的序列(SEQ ID NO:5)、编码5’CVB3 IRES的序列(SEQ ID NO:47)、5'内含子(SEQ ID NO:48)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:48)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-B18-EGFP vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3' CVB3 IRES (SEQ ID NO:46), a sequence encoding green fluorescent protein EGFP (SEQ ID NO:5), a sequence encoding 5' CVB3 IRES (SEQ ID NO:47), a 5' intron (SEQ ID NO:48), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:48), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-B18-Fluc载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’CVB3 IRES的序列(SEQ ID NO:46)、编码萤火虫荧光素酶Fluc的序列(SEQ ID NO:28)、编码5’CVB3 IRES的序列(SEQ ID NO:47)、5'内含子(SEQ ID NO:48)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:48)中，加粗的为IGS序列，其中IGS中的小写加粗加下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-B18-Fluc vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3' CVB3 IRES (SEQ ID NO:46), a sequence encoding firefly luciferase Flu (SEQ ID NO:28), a sequence encoding 5' CVB3 IRES (SEQ ID NO:47), a 5' intron (SEQ ID NO:48), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:48), the bold part is the IGS sequence, where the lowercase bold and underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

所构建载体所产出的RNA的E-Gel^TM凝胶电泳结果如图11A-11C所示，通过RNA酶R(RR)后处理，判定环状条带位置，即所构建的载体均能生产出环状RNA。The E-Gel ^™ gel electrophoresis results of the RNA produced by the constructed vectors are shown in Figures 11A-11C. The position of the circular band was determined by post-treatment with RNase R (RR), indicating that all constructed vectors can produce circular RNA.

所构建载体所产出的RNA的毛细管电泳检测的结果如图11D-11I所示，Coxie-B18-Gluc环化后的RNA纯度为78.5％，使用RNA酶R消化处理后的RNA纯度为99.4％；Coxie-B18-EGFP环化后的RNA纯度为80.9％，使用RNA酶R消化处理后的RNA纯度为100％；Coxie-B18-Fluc环化后RNA纯度为83.5％，使用RNA酶R消化处理后的RNA纯度为98.7％。结果显示不同ORF环化后纯度均优于已发表策略。The capillary electrophoresis results of the RNA produced by the constructed vectors are shown in Figures 11D-11I. The purity of the RNA after circularization with Coxie-B18-Gluc was 78.5%, and the purity after digestion with RNase R was 99.4%; the purity of the RNA after circularization with Coxie-B18-EGFP was 80.9%, and the purity after digestion with RNase R was 100%; the purity of the RNA after circularization with Coxie-B18-Fluc was 83.5%, and the purity after digestion with RNase R was 98.7%. The results show that the purity after circularization with different ORFs is superior to that of published strategies.

参照实施例1中1.8的方法步骤，对参照1.5得到的纯化RNA进行逆转录、PCR和胶回收，并对环化接头位置测序。Circ-coxie-B18-Gluc(即环化的coxie-B18-Gluc)的PCR所用的引物及测序引物为SEQ ID NO:53和54，接头处的测序结果如SEQ IDNO:50所示。Circ-coxie-B18-EGFP(即环化的coxie-B18-EGFP)的PCR所用的引物及测序引物为SEQ ID NO:16和54，接头处的测序结果如SEQ ID NO:51所示。Circ-coxie-B18-Fluc(即环化的coxie-B18-Fluc)的PCR所用的引物及测序引物为SEQ ID NO:14和54，接头处的测序结果如SEQ ID NO:52所示。可以看到，环化产物不含外源残留序列(即无自剪接型相邻外显子残留)。Following the method steps in 1.8 of Example 1, the purified RNA obtained according to 1.5 was subjected to reverse transcription, PCR, and gel recovery, and the circular adapter sites were sequenced. The primers used for PCR and sequencing of Circ-coxie-B18-Gluc (i.e., circularized coxie-B18-Gluc) were SEQ ID NO: 53 and 54, and the sequencing results at the adapter sites are shown in SEQ ID NO: 50. The primers used for PCR and sequencing of Circ-coxie-B18-EGFP (i.e., circularized coxie-B18-EGFP) were SEQ ID NO: 16 and 54, and the sequencing results at the adapter sites are shown in SEQ ID NO: 51. The primers used for PCR and sequencing of Circ-coxie-B18-Fluc (i.e., circularized coxie-B18-Fluc) were SEQ ID NO: 14 and 54, and the sequencing results at the adapter sites are shown in SEQ ID NO: 52. As can be seen, the cyclized product does not contain any exogenous residual sequences (i.e., there are no self-splicing type adjacent exon residues).

Circ-coxie-B18-Gluc测序结果Circ-coxie-B18-Gluc sequencing results

(SEQ ID NO:50，小写为Gluc序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:50, lowercase represents the Gluc sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates a connection point)

Circ-coxie-B18-EGFP测序结果Circ-coxie-B18-EGFP sequencing results

(SEQ ID NO:51，小写为EGFP序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:51, lowercase represents the EGFP sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates the linker)

Circ-Coxie-B18-Fluc测序结果Circ-Coxie-B18-Fluc sequencing results

(SEQ ID NO:52，小写为EGFP序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:52, lowercase represents the EGFP sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates the linker)

基于本申请图8B用于制备无外源残留序列的环状RNA载体的策略，可对CVB3IRES进行不同位置截断，均能够生产出无外源残留序列的环状RNA，同时在CVB3IRES第18个碱基后截断环化率提升较为明显，RNA酶R处理后能够获得纯度更高的环状RNA，显著优于已发表策略。Based on the strategy in Figure 8B of this application for preparing circular RNA vectors without exogenous residual sequences, CVB3IRES can be truncated at different positions, and circular RNA without exogenous residual sequences can be produced. At the same time, the circularization rate is significantly improved after truncating the 18th base of CVB3IRES. After treatment with RNase R, circular RNA with higher purity can be obtained, which is significantly better than the published strategies.

实施例6.不同类型IRES的环状RNA制备载体的构建Example 6. Construction of circular RNA preparation vectors for different types of IRES

为了测试图8B所示载体是否适用于其他的IRES类型。筛选到肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)的IRES和肠病毒A(EV-A)的IRES变体^[10]序列分别如SEQ ID NO:61，74和75所示，我们分别使用来自肠病毒B107(EVB107)、人鼻病毒B3(HRVB3)IRES和EV-A的IRES变体(EV-A-mut)替换CVB3 IRES，同时采用贝氏柯克斯体23S核糖体基因中的自剪接型内含子，分别构建2种如图8B所示的载体Coxie-EVB107-B18-Gluc，Coxie-EVB107-B18-EGFP，Coxie-HRVB3-B18-Gluc，Coxie-HRVB3-B18-EGFP，Coxie-EV-A-mut-B374-Gluc和Coxie-EV-A-mut-B374-EGFP。以验证同时去除与自剪接型内含子相邻外显子E3和E5的RNA环化效果，EVB107 IRES截断位置为第18个碱基后，HRVB3 IRES截断位置为第18个碱基后，EV-A-mut IRES截断位置为第374个碱基后。To test whether the carrier shown in Figure 8B is suitable for other IRES types. The IRES sequences of enterovirus B107 (EVB107), human rhinovirus B3 (HRVB3) and enterovirus A (EV-A) ^[10] were screened as shown in SEQ ID NO:61, 74 and 75, respectively. We replaced the CVB3 IRES with the IRES of enterovirus B107 (EVB107), human rhinovirus B3 (HRVB3) and the IRES variant of EV-A (EV-A-mut), respectively. At the same time, we used the self-splicing intron in the 23S ribosomal gene of Coxsella becci to construct two vectors as shown in Figure 8B: Coxie-EVB107-B18-Gluc, Coxie-EVB107-B18-EGFP, Coxie-HRVB3-B18-Gluc, Coxie-HRVB3-B18-EGFP, Coxie-EV-A-mut-B374-Gluc and Coxie-EV-A-mut-B374-EGFP. To verify the effect of simultaneously removing exons E3 and E5 adjacent to the self-splicing intron, the IRES of EVB107 were truncated after the 18th base, the IRES of HRVB3 were truncated after the 18th base, and the IRES of EV-A-mut were truncated after the 374th base.

具体地，合成Coxie-EVB107-B18-Gluc载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’EVB107 IRES的序列(SEQ ID NO:55)、编码高斯荧光素酶Gluc的序列(SEQ ID NO:49)、编码5’EVB107 IRES的序列(SEQ ID NO:56)、5'内含子(SEQ ID NO:57)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:57)中，加粗的为IGS序列，其中IGS中的小写加粗加下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-EVB107-B18-Gluc vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'EVB107 IRES (SEQ ID NO:55), a sequence encoding Gaussian luciferase Gluc (SEQ ID NO:49), a sequence encoding 5'EVB107 IRES (SEQ ID NO:56), a 5' intron (SEQ ID NO:57), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:57), the bold part is the IGS sequence, where the lowercase bold and underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-EVB107-B18-EGFP载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’EVB107 IRES的序列(SEQ ID NO:55)、编码绿色荧光蛋白EGFP的序列(SEQ ID NO:5)、编码5’EVB107 IRES的序列(SEQ ID NO:56)、5'内含子(SEQ ID NO:57)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:57)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-EVB107-B18-EGFP vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'EVB107 IRES (SEQ ID NO:55), a sequence encoding green fluorescent protein EGFP (SEQ ID NO:5), a sequence encoding 5'EVB107 IRES (SEQ ID NO:56), a 5' intron (SEQ ID NO:57), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:57), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-HRVB3-B18-Gluc载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’HRVB3 IRES的序列(SEQ ID NO:62)、编码高斯荧光素酶Gluc的序列(SEQ ID NO:49)、编码5’HRVB3 IRES的序列(SEQ ID NO:63)、5'内含子(SEQ ID NO:64)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:64)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-HRVB3-B18-Gluc vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'HRVB3 IRES (SEQ ID NO:62), a sequence encoding Gaussian luciferase Gluc (SEQ ID NO:49), a sequence encoding 5'HRVB3 IRES (SEQ ID NO:63), a 5' intron (SEQ ID NO:64), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:64), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-HRVB3-B18-EGFP载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’HRVB3 IRES的序列(SEQ ID NO:62)、编码绿色荧光蛋白EGFP的序列(SEQ ID NO:5)、编码5’HRVB3 IRES的序列(SEQ ID NO:63)、5'内含子(SEQ ID NO:64)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:64)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-HRVB3-B18-EGFP vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'HRVB3 IRES (SEQ ID NO:62), a sequence encoding green fluorescent protein EGFP (SEQ ID NO:5), a sequence encoding 5'HRVB3 IRES (SEQ ID NO:63), a 5' intron (SEQ ID NO:64), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:64), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-EV-A-mut-B374-Gluc载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’EV-A-mut IRES的序列(SEQ ID NO:65)、编码高斯荧光素酶Gluc的序列(SEQ ID NO:49)、编码5’EV-A-mut IRES的序列(SEQ ID NO:66)、5'内含子(SEQ ID NO:67)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:67)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-EV-A-mut-B374-Gluc vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'EV-A-mut IRES (SEQ ID NO:65), a sequence encoding Gaussian luciferase Gluc (SEQ ID NO:49), a sequence encoding 5'EV-A-mut IRES (SEQ ID NO:66), a 5' intron (SEQ ID NO:67), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:67), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

具体地，合成Coxie-EV-A-mut-B374-EGFP载体的编码链DNA片段，其从5'到3'方向依次包含：T7启动子(SEQ ID NO:1)、5'同源臂(SEQ ID NO:2)、3'内含子(SEQ ID NO:21)、编码3’EV-A-mut IRES的序列(SEQ ID NO:65)、编码绿色荧光蛋白EGFP的序列(SEQ ID NO:5)、编码5’EV-A-mut IRES的序列(SEQ ID NO:66)、5'内含子(SEQ ID NO:67)、3'同源臂(SEQ ID NO:7)、以及用于质粒线性化的EcoRV限制性酶切位点(GATATC)。将上述合成的DNA片段及其互补链克隆至pUC57载体(SEQ ID NO:8)中，经EcoRV酶切获得线性化载体产物。在5'内含子序列 (SEQ ID NO:67)中，加粗的为IGS序列，其中IGS中的小写加粗下划线的为与编码3'IRES的序列的5'端序列反向互补的序列，IGS中的大写加粗的为与编码5'IRES的序列中的3'端序列反向互补的序列。Specifically, the coding DNA fragment of the Coxie-EV-A-mut-B374-EGFP vector was synthesized, which, from 5' to 3', sequentially includes: a T7 promoter (SEQ ID NO:1), a 5' homologous arm (SEQ ID NO:2), a 3' intron (SEQ ID NO:21), a sequence encoding 3'EV-A-mut IRES (SEQ ID NO:65), a sequence encoding green fluorescent protein EGFP (SEQ ID NO:5), a sequence encoding 5'EV-A-mut IRES (SEQ ID NO:66), a 5' intron (SEQ ID NO:67), a 3' homologous arm (SEQ ID NO:7), and an EcoRV restriction enzyme site (GATATC) for plasmid linearization. The synthesized DNA fragment and its complementary strand were cloned into the pUC57 vector (SEQ ID NO:8), and the linearized vector product was obtained by EcoRV digestion. The 5' intron sequence... In (SEQ ID NO:67), the bold part is the IGS sequence, where the lowercase bold underlined part of IGS is the sequence that is the reverse complement of the 5' end sequence of the sequence encoding 3'IRES, and the uppercase bold part of IGS is the sequence that is the reverse complement of the 3' end sequence of the sequence encoding 5'IRES.

所构建的Coxie-EVB107-B18-Gluc载体所产出的RNA的E-Gel^TM凝胶电泳结果如图12A，所构建的Coxie-EVB107-B18-EGFP载体所产出的RNA的E-Gel^TM凝胶电泳结果如12B所示；Coxie-HRVB3-B18-Gluc载体所产出的RNA的E-Gel^TM凝胶电泳结果如图14A，所构建的Coxie-HRVB3-B18-EGFP载体所产出的RNA的E-Gel^TM凝胶电泳结果如14B所示；所构建的Coxie-EV-A-mut-B374-Gluc载体所产出的RNA的E-Gel^TM凝胶电泳结果如14C所示，所构建的Coxie-EV-A-mut-B374-EGFP载体所产出的RNA的E-Gel^TM凝胶电泳结果如14D所示。The E-Gel ^™ electrophoresis results of RNA produced by the constructed Coxie-EVB107-B18-Gluc vector are shown in Figure 12A; the E-Gel ^™ electrophoresis results of RNA produced by the constructed Coxie-EVB107-B18-EGFP vector are shown in Figure 12B; the E-Gel ^™ electrophoresis results of RNA produced by the Coxie-HRVB3-B18-Gluc vector are shown in Figure 14A; the E-Gel ^™ electrophoresis results of RNA produced by the constructed Coxie-HRVB3-B18-EGFP vector are shown in Figure 14B; the E-Gel ^™ electrophoresis results of RNA produced by the constructed Coxie-EV-A-mut-B374-Gluc vector are shown in Figure 14C; and the E-Gel ^™ electrophoresis results of RNA produced by the constructed Coxie-EV-A-mut-B374-EGFP vector are shown in Figure 14D.

各图中的三个泳道从左到右分别为参照实施例1步骤1.3、1.4和1.5得到的RNA，环化后的RNA(Circ)和经RNA酶R酶处理的RNA(RR)中均存在环状RNA条带。所构建载体Coxie-EVB107-B18-Gluc和Coxie-EVB107-B18-EGFP所产出的RNA的毛细管电泳检测的结果分别如图12C和12D所示，Coxie-EVB107-B18-Gluc载体环化后的RNA纯度为80.2％，Coxie-EVB107-B18-EGFP载体环化后的RNA纯度为68.3％。Coxie-HRVB3-B18-Gluc、Coxie-HRVB3-B18-EGFP、Coxie-EV-A-mut-B374-Gluc和Coxie-EV-A-mut-B374-EGFP所产出的RNA的毛细管电泳检测的结果分别如图14E，14F，14G和14H所示，Coxie-HRVB3-B18-Gluc载体环化后的RNA纯度为80.2％，Coxie-HRVB3-B18-EGFP载体环化后的RNA纯度为78％,Coxie-EV-A-mut-B374-Gluc载体环化后的RNA纯度为76.3％，Coxie-EV-A-mut-B374-EGFP载体环化后的RNA纯度为78.9％。不同ORF环化后纯度与已发表策略纯度差异不大。The three lanes in each figure, from left to right, represent the RNA obtained according to steps 1.3, 1.4, and 1.5 of Example 1. Circular RNA bands are present in both the circularized RNA (Circ) and the RNA treated with RNase R (RR). The capillary electrophoresis results of the RNA produced by the constructed vectors Coxie-EVB107-B18-Gluc and Coxie-EVB107-B18-EGFP are shown in Figures 12C and 12D, respectively. The purity of the circularized RNA from the Coxie-EVB107-B18-Gluc vector is 80.2%, and the purity of the circularized RNA from the Coxie-EVB107-B18-EGFP vector is 68.3%. The capillary electrophoresis results of RNA produced by Coxie-HRVB3-B18-Gluc, Coxie-HRVB3-B18-EGFP, Coxie-EV-A-mut-B374-Gluc, and Coxie-EV-A-mut-B374-EGFP are shown in Figures 14E, 14F, 14G, and 14H, respectively. The purity of RNA after circularization using the Coxie-HRVB3-B18-Gluc vector was 80.2%, the Coxie-HRVB3-B18-EGFP vector was 78%, the Coxie-EV-A-mut-B374-Gluc vector was 76.3%, and the Coxie-EV-A-mut-B374-EGFP vector was 78.9%. The purity of RNA after circularization using different ORFs was not significantly different from that of published strategies.

参照实施例1中1.8的方法步骤，对参照1.5得到的纯化RNA进行逆转录、PCR和胶回收，并对环化接头位置测序。Circ-EVB107-B18-Gluc的PCR所用的引物及测序引物为SEQ ID NO:53和60，接头处的测序结果如SEQ ID NO:58所示。Circ-EVB107-B18-EGFP的PCR所用的引物及测序引物为SEQ ID NO:16和60，接头处的测序结果如SEQ ID NO:59所示。Circ-HRVB3-B18-Gluc的PCR所用的引物及测序引物为SEQ ID NO:53和72，接头处的测序结果如SEQ ID NO:68所示。Circ-HRVB3-B18-EGFP的PCR所用的引物及测序引物为SEQ ID NO:16和72，接头处的测序结果如SEQ ID NO:69所示。Circ-EV-A-mut-B374-Gluc的PCR所用的引物及测序引物为SEQ ID NO:53和73，接头处的测序结果如SEQ ID NO:70所示。Circ-EV-A-mut-B374-EGFP的PCR所用的引物及测序引物为SEQ ID NO:16和73，接头处的测序结果如SEQ ID NO:71所示。可以看到，环化产物不含外源残留序列(即无自剪接型相邻外显子残留)。Following the method steps in 1.8 of Example 1, the purified RNA obtained according to 1.5 was subjected to reverse transcription, PCR, and gel recovery, and the circular adapter sites were sequenced. The primers used for PCR and sequencing of Circ-EVB107-B18-Gluc were SEQ ID NO:53 and 60, and the sequencing results at the adapter sites are shown in SEQ ID NO:58. The primers used for PCR and sequencing of Circ-EVB107-B18-EGFP were SEQ ID NO:16 and 60, and the sequencing results at the adapter sites are shown in SEQ ID NO:59. The primers used for PCR and sequencing of Circ-HRVB3-B18-Gluc were SEQ ID NO:53 and 72, and the sequencing results at the adapter sites are shown in SEQ ID NO:68. The primers used for PCR and sequencing of Circ-HRVB3-B18-EGFP were SEQ ID NO:16 and 72, and the sequencing results at the adapter sites are shown in SEQ ID NO:69. The primers used for PCR and sequencing of Circ-EV-A-mut-B374-Gluc are SEQ ID NO:53 and 73, and the sequencing results at the adapter are shown in SEQ ID NO:70. The primers used for PCR and sequencing of Circ-EV-A-mut-B374-EGFP are SEQ ID NO:16 and 73, and the sequencing results at the adapter are shown in SEQ ID NO:71. It can be seen that the cyclized product does not contain any exogenous residual sequences (i.e., no self-splicing type adjacent exon residues).

Circ-EVB107-B18-Gluc测序结果Circ-EVB107-B18-Gluc sequencing results

(SEQ ID NO:58，小写为Gluc序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:58, lowercase represents the Gluc sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates a connection point)

Circ-EVB107-B18-EGFP测序结果Sequencing results of Circ-EVB107-B18-EGFP

(SEQ ID NO:59，小写为Gluc序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:59, lowercase represents the Gluc sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates a connection point)

Circ-HRVB-B18-Gluc测序结果Circ-HRVB-B18-Gluc sequencing results

(SEQ ID NO:68，小写为Gluc序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:68, lowercase represents the Gluc sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates a connection point)

Circ-HRVB-B18-EGFP测序结果Circ- HRVB -B18-EGFP sequencing results

(SEQ ID NO:69，小写为EGFP序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:69, lowercase represents the EGFP sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates the linker)

Circ-EV-A-mut-B374-Gluc测序结果Circ-EV-A-mut-B374-Gluc sequencing results

(SEQ ID NO:70，小写为Gluc序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处) (SEQ ID NO:70, lowercase represents the Gluc sequence, uppercase represents the IRES sequence, where the bold underline is the 5' IRES sequence, and / indicates a connection point)

Circ-EV-A-mut-B374-EGFP测序结果Circ-EV-A-mut-B374-EGFP sequencing results

(SEQ ID NO:71，小写为EGFP序列，大写为IRES序列，其中加粗下划线为5’IRES序列，/为连接处)基于本申请中制备无外源残留序列的环状RNA载体的设计策略，可对IRES进行替换，由IRES-CVB3替换为IRES-EVB107、IRES-HRVB3和IRES-EV-A-mut，其中IRES-EVB107、IRES-HRVB3基因序列第18位碱基后截断，IRES-EV-A-mut在374位置截断，这三种IRES均能够生产出无外源残留序列的无痕环状RNA。 (SEQ ID NO:71, lowercase is the EGFP sequence, uppercase is the IRES sequence, where the bold underline is the 5' IRES sequence, and / is the link) Based on the design strategy of preparing circular RNA vectors without exogenous residual sequences in this application, IRES can be replaced by IRES-CVB3 with IRES-EVB107, IRES-HRVB3 and IRES-EV-A-mut. The gene sequences of IRES-EVB107 and IRES-HRVB3 are truncated after the 18th base, and IRES-EV-A-mut is truncated at position 374. All three IRES can produce traceless circular RNA without exogenous residual sequences.

实施例7.环状RNA的细胞表达测试与免疫原性测试Example 7. Cellular expression assay and immunogenicity assay of circular RNA

为了测试本申请策略生产的环状RNA的细胞表达水平和免疫原性，我们使用线性mRNA-EGFP，已发表策略制备的环状RNA(circ-Ana-EGFP)^[1]及本申请实施例5制备的环状RNA(circ-Coxie-B18-EGFP)进行细胞表达水平测试和免疫原性测试。To test the cellular expression level and immunogenicity of the circular RNA produced by the strategy of this application, we used linear mRNA-EGFP, circular RNA (circ-Ana-EGFP) prepared by the published strategy ^[1] and circular RNA (circ-Coxie-B18-EGFP) prepared in Example 5 of this application to test the cellular expression level and immunogenicity.

将线性mRNA-EGFP，已发表策略制备circ-Ana-EGFP及实施例5制备的circ-Coxie-B18-EGFP，转染HEK293T细胞和A549细胞(购自美国典型培养物保藏中心)，进行真核细胞表达检测。具体地，使用补充有10％胎牛血清(BI)和青霉素/链霉素抗生素(100U/ml青霉素，100μg/ml链霉素；Gibco)的Dulbecco改良Eagle培养基(DMEM，BI)，将细胞接种到24孔板中，24孔板中每孔1×10⁵个细胞，于37℃、5％CO₂和90％相对湿度下培养。第二天，在24孔板中加入上述RNA样品(线性mRNA-EGFP，circ-Ana-EGFP，circ-Coxie-B18-EGFP)，24孔板中每孔500ng，用lipofectamine MessengerMax(Invitrogen，LRNA003)转染，此时细胞培养物必须具有>90％的活力并达到70％汇合度。Linear mRNA-EGFP, circ-Ana-EGFP prepared using a published strategy, and circ-Coxie-B18-EGFP prepared in Example 5 were transfected into HEK293T cells and A549 cells (purchased from the American Center for Type Culture Collection) for eukaryotic cell expression detection. Specifically, cells were seeded into 24-well plates using Dulbecco modified Eagle medium (DMEM, BI) supplemented with 10% fetal bovine serum (BI) and penicillin/streptomycin antibiotics (100 U/ml penicillin, 100 μg/ml streptomycin; Gibco), with 1 × ^10⁵ cells per well, and cultured at 37°C, 5% _CO₂ , and 90% relative humidity. The next day, the above RNA samples (linear mRNA-EGFP, circ-Ana-EGFP, circ-Coxie-B18-EGFP) were added to 24-well plates, 500 ng per well, and transfected with lipofectamine MessengerMax (Invitrogen, LRNA003). At this time, the cell culture must have >90% viability and reach 70% confluence.

转染后24小时、48小时、72小时、6天和8天后，将细胞置于倒置荧光显微镜上，进行荧光拍照，后使用ImageJ进行荧光读值。Cells were placed on an inverted fluorescence microscope for fluorescence photography at 24, 48, 72, 6 and 8 days after transfection, and the fluorescence values were then read using ImageJ.

在转染后48h时收集细胞培养上清，离心去除A549细胞碎片后取100ul上清，使用Human IL-6ELISA试剂盒参照说明书进行检测(具体方法：酶标板中分别加入100ul样品和标准品，室温孵育90min；弃液体加入100ul生物素化抗体孵育60min；洗板；加入100ul HRP酶结合物孵育30min；洗板；加90ul TMB孵育15min后加终止液终止)，最后用酶标仪在450nm波长处检测OD值，通过绘制标准曲线计算出样品中IL-6的浓度。48 h post-transfection, cell culture supernatant was collected. After centrifugation to remove A549 cell debris, 100 μL of supernatant was taken and detected using the Human IL-6 ELISA kit according to the instructions (specific method: 100 μL of sample and standard were added to the ELISA plate and incubated at room temperature for 90 min; the liquid was discarded and 100 μL of biotinylated antibody was added and incubated for 60 min; the plate was washed; 100 μL of HRP enzyme conjugate was added and incubated for 30 min; the plate was washed; 90 μL of TMB was added and incubated for 15 min, and then stop solution was added to terminate the incubation). Finally, the OD value was detected at 450 nm using an ELISA reader, and the concentration of IL-6 in the sample was calculated by plotting a standard curve.

细胞转染表达的结果如图13A和13B所示，由本申请实施例5制备的circ-coxie-B18-EGFP在转染HEK293T细胞后48h和8天时，其荧光强度均极显著高于线性mRNA，和已发表策略制备的环状RNA(图13A)。由本申请实施例5制备的circ-coxie-B18-EGFP在转染A549细胞后48h时，其荧光强度均显著高于已发表策略的环状RNA，在8天时，其荧光强度均极显著高于线性mRNA，和已发表策略的环状RNA(图13B)。The results of cell transfection expression are shown in Figures 13A and 13B. The fluorescence intensity of circ-coxie-B18-EGFP prepared in Example 5 of this application was significantly higher than that of linear mRNA and circular RNA prepared using published strategies at 48 h and 8 days after transfection into HEK293T cells (Figure 13A). Similarly, the fluorescence intensity of circ-coxie-B18-EGFP prepared in Example 5 of this application was significantly higher than that of circular RNA prepared using published strategies at 48 h after transfection into A549 cells, and extremely significantly higher than that of linear mRNA and circular RNA prepared using published strategies at 8 days (Figure 13B).

免疫原性检测结果如图13C所示，其中NC为空白细胞组，其IL-6相对值记为1，由本申请中使用贝氏柯克斯体-B18构建的circ-coxie-B18-EGFP在A549细胞中免疫原性最低。The immunogenicity test results are shown in Figure 13C, where NC is the blank cell group and its relative IL-6 value is recorded as 1. The circ-coxie-B18-EGFP constructed using Coxella behnea-B18 in this application showed the lowest immunogenicity in A549 cells.

综上，由本申请中使用的贝氏柯克斯体的内含子和IRES第18个碱基后截断构建载体生产的环状RNA，在哺乳动物细胞中蛋白表达水平优于已发表策略，且免疫原性低于已发表策略的环状RNA。In summary, the circular RNA produced by constructing a vector using introns of the Coxsoe body used in this application and truncating IRES after the 18th base shows superior protein expression levels in mammalian cells compared to published strategies, and exhibits lower immunogenicity than circular RNAs from published strategies.

本申请中涉及的部分序列如下所示。

The sequences involved in this application are shown below.

尽管本申请已经结合一个或多个实施方式进行了描述，应当理解的是，本申请并不受限于这些实施方式。本申请中的描述意在涵盖所有变体形式以及等同物，均包含在所附权利要求的主旨和范围内。所有在本文中引用的文献通过引用的方式全部并入本文。Although this application has been described in conjunction with one or more embodiments, it should be understood that this application is not limited to these embodiments. The description in this application is intended to cover all variations and equivalents, all of which are included within the spirit and scope of the appended claims. All references cited herein are incorporated herein by reference in their entirety.

参考文献References

1.Wesselhoeft,R.A.,Kowalski,P.S.&Anderson,D.G.Engineering circular RNAfor potent and stable translation in eukaryotic cells.Nat.Commun.9,2629(2018).1. Wesselhoeft, R.A., Kowalski, P.S. & Anderson, D.G. Engineering circular RNA for potent and stable translation in eukaryotic cells. Nat. Commun. 9, 2629 (2018).

2.Ko M,Choi H,Park C.Group I self-splicing intron in the recAgene of Bacillus anthracis.J Bacteriol.2002Jul；184(14):3917-22.2.Ko M, Choi H, Park C.Group I self-splicing intron in the recAgene of Bacillus anthracis.J Bacteriol.2002Jul;184(14):3917-22.

3.Cech,T.R.1990.Self-splicing of group I introns.Annu.Rev.Biochem.59:543–568.3.Cech,T.R.1990.Self-splicing of group I introns.Annu.Rev.Biochem.59:543–568.

4.Saldanha,R.,G.Mohr,M.Belfort,and A.M.Lambowitz.1993.Group I and group II introns.FASEB J.7:15–244. Saldanha, R., G. Mohr, M. Belfort, and A.M. Lambowitz.1993. Group I and group II introns. FASEB J.7:15–24

5.Jacquier,A.1996.Group II introns:elaborate ribozymes.Biochimie 78:474–487.5.Jacquier,A.1996.Group II introns:elaborate ribozymes.Biochimie 78:474–487.

6.Chen R,Wang SK,Belk JA,Amaya L,Li Z,Cardenas A,Abe BT,Chen CK,Wender PA,Chang HY.Engineering circular RNA for enhanced protein production.Nat Biotechnol.2023Feb；41(2):262-272.6.Chen R, Wang SK, Belk JA, Amaya L, Li Z, Cardenas A, Abe BT, Chen CK, Wender PA, Chang HY. Engineering circular RNA for enhanced protein production. Nat Biotechnol. 2023 Feb; 41(2):262-272.

7.Chen,C.,Wei,H.,Zhang,K.,Li,Z.,Wei,T.,Tang,C.,Yang,Y.,&Wang,Z.(2022).A flexible,efficient,and scalable platform to produce circular RNAs as new therapeutics.bioRxiv.7.Chen, C., Wei, H., Zhang, K., Li, Z., Wei, T., Tang, C., Yang, Y., & Wang, Z. (2022). A flexible, efficient, and scalable platform to produce circular RNAs as new therapeutics.bioRxiv.

8.Qiu,Z.,Zhao,Y.,Hou,Q.,Zhu,J.,Zhai,M.,Li,D.,Li,Y.,Liu,C.,Li,N.,Cao,Y.,Yang,J.,Sun,Z.,&Zuo,C.(2022).Clean-PIE:a novel strategy for efficiently constructing precise circRNA with thoroughly minimized immunogenicity to direct potent and durable protein expression.bioRxiv.8. Qiu, Z., Zhao, Y., Hou, Q., Zhu, J., Zhai, M., Li, D., Li, Y., Liu, C., Li, N., Cao, Y., Yang, J., Sun, Z., & Zuo, C. (2022). Clean-PIE: a novel strategy for eff iciently constructing precise circRNA with thoroughly minimized immunogenicity to direct potent and durable protein expression.bioRxiv.

9.Hausner G,Hafez M,Edgell DR.Bacterial group I introns:mobile RNA catalysts.Mob DNA.2014Mar 10；5(1):8.9.Hausner G,Hafez M,Edgell DR.Bacterial group I introns:mobile RNA catalysts.Mob DNA.2014Mar 10;5(1):8.

10.Yu H,Wen Y,Yu W,et al.Optimized circular RNA vaccines for superior cancer immunotherapy.Theranostics.2025；15(4):1420-1438.Published 2025Jan 2.doi:10.7150/thno.10469810.Yu H,Wen Y,Yu W,et al.Optimized circular RNA vaccines for superior cancer immunotherapy.Theranostics.2025;15(4):1420-1438.Published 2025Jan 2.doi:10.7150/thno.104698

Claims

An RNA molecule comprising, from its 5' end to its 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

The 3' self-splicing intron fragment and the 5' self-splicing intron fragment can cause the RNA molecule to circularize and be removed from the RNA molecule during the circularization process.

The 5' self-splicing intron contains an internal guide sequence.

Where i) the 5' end sequence of the internal guide sequence is inversely complementary to the 5' end sequence of the target sequence, or

ii) The 3' end sequence of the internal guide sequence is initiated by a nucleotide with a base of G and is reverse complementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence ends with a nucleotide with a base of U, and the initiating nucleotide with a base of G in the 3' end sequence of the internal guide sequence can form a G:U pair with the nucleotide with a base of U at the end of the 3' end sequence of the target sequence.

The RNA molecule of claim 1, wherein the 5' end sequence of the internal guide sequence is anticomplementary to the 2-15 nt nucleotides at the 5' end of the target sequence, or the 3' end sequence of the internal guide sequence is anticomplementary to the 3-20 nt nucleotides at the 3' end of the target sequence.

The RNA molecule of claim 1, wherein i) the 5' end sequence of the internal guide sequence is anticomplementary to the 5' end sequence of the target sequence, and ii) the 3' end sequence of the internal guide sequence is initiated by a G-base nucleotide and is anticomplementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence ends with a U-base nucleotide, and the G-base initiating nucleotide of the 3' end sequence of the internal guide sequence can form a G:U pair with the U-base nucleotide at the end of the 3' end sequence of the target sequence.

The RNA molecule as described in claim 3, wherein,

i) The target sequence contains an open reading frame encoding a target peptide or protein and a translational functional element, wherein the translational functional element is selected from translation initiation elements and translation enhancement elements, and wherein the target sequence contains from the 5' end to the 3' end.

(a) The translational functional element and the open reading frame encoding the target peptide or protein.

(b) the open reading frame encoding the target peptide or protein, and the translational functional element.

(c) The 3' end sequence of the translational element, the open reading frame encoding the target peptide or protein, and the 5' end sequence of the translational element, wherein the 5' end sequence and the 3' end sequence of the translational element are arranged in this order to form the translational element, or

(d) The 3' end sequence of the open reading frame encoding the target peptide or protein, the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence of the open reading frame encoding the target peptide or protein and the 3' end sequence of the open reading frame encoding the target peptide or protein are arranged in this order to form the open reading frame encoding the target peptide or protein.

ii) The target sequence contains a non-coding RNA sequence, which

(a) Contains the non-coding RNA sequence, or

(b) The sequence from the 5' end to the 3' end comprises the 3' end sequence of the non-coding RNA sequence and the 5' end sequence of the non-coding RNA sequence, wherein the 5' end sequence of the non-coding RNA sequence and the 3' end sequence of the non-coding RNA sequence are arranged in this order to form the non-coding RNA sequence, or

iii) The target sequence contains an open reading frame encoding the target peptide or protein.

(a) An open reading frame containing the target peptide or protein, or

(b) The 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein are included from the 5' end to the 3' end, wherein the 5' end sequence of the open reading frame encoding the target peptide or protein and the 3' end sequence of the open reading frame encoding the target peptide or protein are arranged in this order to form the open reading frame encoding the target peptide or protein.

The RNA molecule of claim 4, wherein the translational functional element is a translation initiation element, wherein the translation initiation element is an internal ribosome entry site (IRES).

The RNA molecule as described in claim 5, wherein the IRES is selected from the IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), Enterovirus A (EV-A) or Human Rhinovirus B6 (HRVB6).

The RNA molecule of claim 6, wherein the IRES comprises a nucleotide sequence having at least 80% sequence identity with SEQ ID NO:4, SEQ ID NO:61, SEQ ID NO:74, SEQ ID NO:75 or SEQ ID NO:76.

The RNA molecule of claim 5, wherein the translational functional element is an IRES, wherein the IRES is selected from the IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), or Enterovirus A (EV-A), and the target sequence comprises, from the 5' end to the 3' end, the 3' end sequence of the IRES, the open reading frame encoding the target peptide or protein, and the 5' end sequence of the IRES, wherein the 3' end sequence and the 5' end sequence of the IRES respectively comprise:

(1) The nucleotide sequences shown in SEQ ID NO:26 and 27,

(2) The nucleotide sequences shown in SEQ ID NO:40 and 41,

(3) The nucleotide sequences shown in SEQ ID NO:46 and 47,

(4) The nucleotide sequences shown in SEQ ID NO:55 and 56,

(5) The nucleotide sequences shown in SEQ ID NO: 62 and 63, or

(6) The nucleotide sequences shown in SEQ ID NO:65 and 66.

The RNA molecule of claim 4, wherein the open reading frame encoding the target peptide or protein is an open reading frame encoding green fluorescent protein, wherein the target sequence comprises, from the 5' end to the 3' end, the 3' end sequence of the open reading frame encoding the target peptide or protein, the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein; preferably, the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein respectively comprise nucleotides AA and SEQ ID NO:12, or

The open reading frame encoding the target peptide or protein is an open reading frame encoding luciferase, wherein the target sequence from the 5' end to the 3' end includes the 3' end sequence of the open reading frame encoding the target peptide or protein, the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein; preferably, the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein respectively contain AA and the nucleotide sequences shown in SEQ ID NO:29, or SEQ ID NO:32 and 33.

The RNA molecule of claim 1, wherein the 3' self-splicing intron fragment and the 5' self-splicing intron fragment are derived from the self-splicing intron in the RecA gene of Bacillus anthracis, or from the self-splicing intron in the 23S ribosomal gene of Coxiella behnkeni.

The RNA molecule of claim 1 further includes a 5' homologous arm at the 5' end of the 3' self-splicing intron fragment containing a 3' splice site, and a 3' homologous arm at the 3' end of the 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively connected, and wherein the 5' homologous arm and the 3' homologous arm are complementary.

A single-stranded DNA molecule comprising, from its 5' end to its 3' end: a 3' self-splicing intron fragment containing a 3' splice site, a target sequence, and a 5' self-splicing intron fragment containing a 5' splice site, wherein the elements are operatively linked.

The 3' self-splicing intron fragment and the 5' self-splicing intron fragment enable the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule to circularize.

The 5' self-splicing intron contains an internal guide sequence.

ii) The 3' end sequence of the internal guide sequence is initiated by a nucleotide with a base of G and is reverse complementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence ends with a nucleotide with a base of T. The 3' end sequence of the internal guide sequence of the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule is able to form a G:U pair with the nucleotide with a base of U at the end of the 3' end sequence of the transcribed target sequence.

The single-stranded DNA molecule of claim 12, wherein i) the 5' end sequence of the internal guide sequence is anticomplementary to the 5' end sequence of the target sequence, and ii) the 3' end sequence of the internal guide sequence is initiated by a G-base nucleotide and is anticomplementary to the 3' end sequence of the target sequence, wherein the 3' end sequence of the target sequence ends with a T-base nucleotide, wherein the G-base initiating nucleotide of the 3' end sequence of the internal guide sequence of the RNA molecule transcribed from the complementary strand of the single-stranded DNA molecule can form a G:U pairing with the U-base nucleotide at the end of the 3' end sequence of the transcribed target sequence.

The single-stranded DNA molecule as described in claim 13, wherein,

i) The target sequence contains an open reading frame encoding a target peptide or protein and a sequence of translational functional elements, wherein the translational functional elements are selected from translation initiation elements and translation enhancement elements, and wherein the target sequence contains from the 5' end to the 3' end...

(a) The sequence of the translational functional element and the open reading frame encoding the target peptide or protein.

(b) The open reading frame encoding the target peptide or protein, and the sequence of the translational functional element.

(c) The 3' end sequence of the translational functional element, the open reading frame encoding the target peptide or protein, and the 5' end sequence of the translational functional element, wherein the 5' end sequence of the translational functional element and the 3' end sequence of the translational functional element are arranged in this order to form the sequence of the translational functional element, or

(d) The 3' end sequence of the open reading frame encoding the target peptide or protein, the sequence of the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence of the open reading frame encoding the target peptide or protein and the 3' end sequence of the open reading frame encoding the target peptide or protein are arranged in this order to form the open reading frame encoding the target peptide or protein.

ii) The target sequence contains a non-coding DNA sequence, which

(a) A sequence containing the non-coding DNA, or

(b) The sequence from the 5' end to the 3' end comprises the 3' end sequence of the non-coding DNA sequence and the 5' end sequence of the non-coding DNA sequence, wherein the 5' end sequence of the non-coding DNA sequence and the 3' end sequence of the non-coding DNA sequence are arranged in this order to form the sequence of the non-coding DNA.

(a) An open reading frame containing the target peptide or protein, or

(b) The sequence from the 5' end to the 3' end comprises the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein, wherein the 5' end sequence of the open reading frame encoding the target peptide or protein and the 3' end sequence of the open reading frame encoding the target peptide or protein, when arranged in this order, form the open reading frame encoding the target peptide or protein.

iv) The target sequence contains a single cloning site or a multiple cloning site, or

v) The target sequence comprises a monoclonal or multiple clonal site and a sequence of a translational element, wherein from the 5' end to the 3' end it comprises the 3' end sequence of the translational element sequence, the monoclonal or multiple clonal site, and the 5' end sequence of the translational element sequence, wherein the 5' end sequence of the translational element sequence and the 3' end sequence of the translational element sequence are arranged in this order to form the sequence of the translational element.

The single-stranded DNA molecule of claim 14, wherein the translational functional element is a translation initiation element, wherein the translation initiation element is an internal ribosome entry site (IRES).

The single-stranded DNA molecule of claim 15, wherein the IRES is selected from the IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), Enterovirus A (EV-A) or Human Rhinovirus B6 (HRVB6).

The single-stranded DNA molecule of claim 15, wherein the translational functional element is an IRES, wherein the IRES is an IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), or Enterovirus A (EV-A), wherein the target sequence comprises, from the 5' end to the 3' end, the 3' end sequence of the IRES sequence, the open reading frame encoding the target peptide or protein, and the 5' end sequence of the IRES sequence, or comprises, from the 5' end to the 3' end, the 3' end sequence of the IRES sequence, the single cloning site or multiple cloning site, and the 5' end sequence of the IRES sequence, wherein the 3' end sequence of the IRES sequence and the 5' end sequence of the IRES sequence respectively comprise:

(1) The nucleotide sequences shown in SEQ ID NO:26 and 27,

(2) The nucleotide sequences shown in SEQ ID NO:40 and 41,

(3) The nucleotide sequences shown in SEQ ID NO:46 and 47,

(4) The nucleotide sequences shown in SEQ ID NO:55 and 56,

(5) The nucleotide sequences shown in SEQ ID NO: 62 and 63, or

(6) The nucleotide sequences shown in SEQ ID NO:65 and 66.

The single-stranded DNA molecule of claim 15, wherein the open reading frame encoding the target peptide or protein is an open reading frame encoding green fluorescent protein, wherein the target sequence comprises, from the 5' end to the 3' end, the 3' end sequence of the open reading frame encoding the target peptide or protein, the sequence of the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein; preferably, the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein respectively comprise nucleotides AA and SEQ ID NO: 12, or

The open reading frame encoding the target peptide or protein is an open reading frame encoding luciferase, wherein the target sequence from the 5' end to the 3' end includes the 3' end sequence of the open reading frame encoding the target peptide or protein, the sequence of the translational functional element, and the 5' end sequence of the open reading frame encoding the target peptide or protein; preferably, the 3' end sequence of the open reading frame encoding the target peptide or protein and the 5' end sequence of the open reading frame encoding the target peptide or protein respectively contain AA and the nucleotide sequences shown in SEQ ID NO:29, or SEQ ID NO:32 and 33.

The single-stranded DNA molecule of claim 12, wherein the 3' self-splicing intron fragment and the 5' self-splicing intron fragment are derived from the self-splicing intron in the RecA gene of Bacillus anthracis, or from the self-splicing intron in the 23S ribosome gene of Coxiella behnkeni.

The single-stranded DNA molecule of claim 19, wherein the 3' self-splicing intron fragment comprises the nucleotide sequence shown in SEQ ID NO: 10 or 21.

The single-stranded DNA molecule of claim 12 further includes a sequence of a 5' homologous arm at the 5' end of the 3' self-splicing intron fragment and a sequence of a 3' homologous arm at the 3' end of the 5' self-splicing intron fragment, wherein the 5' homologous arm and the 3' homologous arm are complementary.

The single-stranded DNA molecule of claim 20 further comprises an RNA polymerase promoter sequence at the 5' end of the sequence of the 5' homologous arm; or further comprises a restriction endonuclease site at the 3' end of the sequence of the 3' homologous arm.

A single-stranded DNA molecule whose complementary strand is capable of being transcribed into an RNA molecule according to any one of claims 1-11.

A double-stranded DNA molecule comprising i) a single-stranded DNA molecule as described in any one of claims 12-23, and ii) a second strand complementary to the single-stranded DNA molecule.

A vector for preparing circular RNA, comprising an RNA molecule according to any one of claims 1-11, a single-stranded DNA molecule according to any one of claims 12-23, or a double-stranded DNA molecule according to claim 24.

A method for preparing circular RNA, comprising:

i) Incubate the RNA molecule of any one of claims 1-11 under suitable conditions, wherein suitable conditions include the presence of magnesium ions and guanosine triphosphate (GTP), or

(ii) (a) Transcribe RNA in vitro from the complementary strand of the single-stranded DNA molecule of any one of claims 12-23, the double-stranded DNA molecule of claim 24, or the vector of claim 25 under suitable conditions, and (b) Incubate the RNA under suitable conditions, wherein suitable conditions in step (b) include the presence of magnesium ions and guanosine triphosphate (GTP).

A host cell comprising an RNA molecule according to any one of claims 1-11, a single-stranded DNA molecule according to any one of claims 12-23, a double-stranded DNA molecule according to claim 24, or a vector according to claim 25.

Use of the RNA molecule of any one of claims 1-11, the single-stranded DNA molecule of any one of claims 12-23, or the double-stranded DNA molecule of claim 24 in the preparation of a circular RNA molecule that does not contain an exon adjacent to a 3' self-splicing intron fragment or a 5' self-splicing intron fragment.

The circular RNA molecule prepared by the method according to claim 26.

The circular RNA molecule of claim 29 does not contain an exon adjacent to a 3' self-splicing intron fragment or an exon adjacent to a 5' self-splicing intron fragment.

A circular RNA comprising a target sequence, a selectable translational functional element, and an exon optionally adjacent to a 3' self-splicing intron fragment or a 5' self-splicing intron fragment.

The circular RNA of claim 31 comprises a target sequence, a translational functional element, and an exon adjacent to a 3' self-splicing intron fragment.

The circular RNA of claim 31 comprises a target sequence, a translational functional element, and an exon adjacent to a 5' self-splicing intron fragment.

The circular RNA according to claim 31 has a sequence consisting of a target sequence and a translational functional element; or its sequence consists of a target sequence.

The circular RNA according to claim 34, wherein the translational functional element is a translation initiation element; preferably, the translation initiation element is an internal ribosome entry site (IRES).

According to claim 35, the circular RNA is selected from the IRES of Coxsackievirus B3 (CVB3), Enterovirus B107 (EVB107), Human Rhinovirus B3 (HRVB3), Enterovirus A (EV-A), or Human Rhinovirus B6 (HRVB6).

A composition comprising the circular RNA of any one of claims 29-36 and an open circular RNA thereof, wherein the circular RNA comprises at least 30% molar percentage in the composition, preferably at least 40%.

The composition according to claim 37, wherein the molar percentage of the circular RNA in the composition is determined by capillary electrophoresis.