CN111876432B

CN111876432B - Acquisition and application of a group of novel liver-targeted adeno-associated viruses

Info

Publication number: CN111876432B
Application number: CN202010742484.2A
Authority: CN
Inventors: 张婷婷; 王超; 丰硕
Original assignee: Beijing Solobio Genetechnology Co Ltd; Staidson Beijing Biopharmaceutical Co Ltd
Current assignee: Beijing Solobio Genetechnology Co Ltd
Priority date: 2020-07-29
Filing date: 2020-07-29
Publication date: 2022-10-28
Anticipated expiration: 2040-07-29
Also published as: CN115960921B; CN115927398A; CN116121274B; CN115960921A; CN116121275B; CN111876432A; CN116121275A; CN115927398B; CN116121274A

Abstract

The present invention provides a set of adeno-associated viruses having properties of interest, e.g., targeting properties and/or neutralizing properties, obtained by directed evolution and in vivo screening methods. The present invention also provides adeno-associated virus capsid protein and viral particles comprising the adeno-associated virus capsid protein, which have excellent or superior targeting and/or neutralizing properties.

Description

The acquisition and application of a group of liver-targeted novel adeno-associated viruses

技术领域technical field

本发明涉及一组通过定向进化和体内筛选的方法而获得的腺相关病毒，以及优化的腺相关病毒衣壳蛋白和包含所述衣壳蛋白的病毒载体。The invention relates to a group of adeno-associated viruses obtained through directed evolution and in vivo screening, optimized adeno-associated virus capsid proteins and virus vectors containing the capsid proteins.

背景技术Background technique

迄今以血清型定名的腺相关病毒(AAV)亚型分为AAV1-AAV12(高光平等，2004，JVirolm 78:6381-6388；Mori S等，2004，Virology 330:375-383；Schmidt M等，2008，JVirol 82:1399-1406)，主要以人类和灵长类动物为宿主，其中AAV1-6是从人类组织中分离获得，并且具有明确的抗体反应性，因而较为经典并得以公认。AAV7和AAV8是高光平等通过基因工程手段从猕猴心脏组织拯救得到的，疑为在进化中已经绝迹的AAV亚型，这一策略为AAV新亚型的发现和重组病毒的设计改造提供了新的思路和范例(高光平等，2002，ProcNatl Acad Sci USA 99:11854-11859)，而AAV9-12是利用同类技术路线分别从人和短尾猴的组织中制备的。尽管不同AAV血清型的病毒都具有正二十面体的结构，但其衣壳蛋白在序列与空间构象上的多样性，使得其细胞表面的结合受体以及对细胞的感染嗜性存在显著差异(Timpe J等，2005，Curr Gene Ther 5:273-284)。在感染嗜性方面，AAV2的感染谱较广，尤对神经细胞效果最好；AAV1和AAV7在骨骼肌中的转导效率较高；AAV3易于转导巨核细胞；AAV5和AAV6感染呼吸道上皮细胞的优势显著；而AAV8转导肝细胞的效率优于其它亚型。Adeno-associated virus (AAV) subtypes named by serotypes are divided into AAV1-AAV12 (Gao Guangping, 2004, JVirolm 78:6381-6388; Mori S et al., 2004, Virology 330:375-383; Schmidt M et al., 2008 , JVirol 82:1399-1406), which mainly use humans and primates as hosts, among which AAV1-6 is isolated from human tissues and has clear antibody reactivity, so it is more classic and recognized. AAV7 and AAV8 were rescued by Gao Guangping and others from macaque heart tissue through genetic engineering. They are suspected to be AAV subtypes that have been extinct in evolution. This strategy provides a new way for the discovery of new AAV subtypes and the design and transformation of recombinant viruses. Ideas and examples (Gao Guangping, 2002, ProcNatl Acad Sci USA 99:11854-11859), and AAV9-12 was prepared from human and macaque tissues using similar technical routes. Although viruses of different AAV serotypes all have icosahedral structures, the diversity of their capsid proteins in sequence and spatial conformation makes their cell surface binding receptors and infection tropism to cells significantly different ( Timpe J et al., 2005, Curr Gene Ther 5:273-284). In terms of infection tropism, AAV2 has a wider infection spectrum, especially for nerve cells; AAV1 and AAV7 have higher transduction efficiency in skeletal muscle; AAV3 is easy to transduce megakaryocytes; The advantages are significant; and the efficiency of AAV8 transducing hepatocytes is better than other subtypes.

AAV病毒本身是复制缺陷型的，在没有辅助病毒存在的情况下只能在宿主细胞呈潜伏状态。AAV病毒载体的生产则需要辅助质粒(helper)提供腺病毒(Ad)参与AAV复制的关键基因。这些Ad基因包括：早期基因E1A负责AAV的转录活性，早期基因E1B和E4参与AAVmRNA的成熟化，早期基因E2A和VA增强AAV RNA翻译(Berns等，1984，Adeno-associatedvirus 563–592)。The AAV virus itself is replication-deficient and can only be latent in the host cell in the absence of a helper virus. The production of AAV viral vectors requires a helper plasmid (helper) to provide key genes involved in AAV replication from adenovirus (Ad). These Ad genes include: the early gene E1A is responsible for the transcriptional activity of AAV, the early genes E1B and E4 are involved in the maturation of AAV mRNA, and the early genes E2A and VA enhance AAV RNA translation (Berns et al., 1984, Adeno-associated virus 563–592).

重组腺相关病毒(rAAV)作为基因治疗载体，具有非常多的优势，如感染效率高、感染范围广、长期表达和高安全性等(David AF等，2007，BMC Bio 7:75)，已被广泛应用于临床实验。目前以AAV为载体的基因治疗项目进入临床研究的有100多项，治疗的疾病范围扩展到肿瘤、视网膜疾病、关节炎、艾滋病、心力衰竭、肌营养不良症、神经系统疾病及其它一系列基因缺陷疾病。As a gene therapy vector, recombinant adeno-associated virus (rAAV) has many advantages, such as high infection efficiency, wide infection range, long-term expression and high safety (David AF et al., 2007, BMC Bio 7:75), has been Widely used in clinical trials. At present, more than 100 gene therapy projects using AAV as vectors have entered clinical research, and the range of diseases treated has expanded to tumors, retinal diseases, arthritis, AIDS, heart failure, muscular dystrophy, nervous system diseases and a series of other genes deficiency disease.

目前正在开展的基于rAAV载体的眼部疾病基因治疗临床研究，大多数是针对视网膜色素上皮特异性65kDa蛋白编码基因(RPE65)突变引起的先天性黑蒙(LCA，Leber’scongenital amaurosis)。Spark公司研发的Luxturna药物已于2017年12月通过FDA审批上市，此药物是rAAV2携带hRPE65v2基因治疗LCA及遗传性视网膜发育不良。受试者在注射后的测试结果显示其药物的有效性，并且没有发现明显副作用，尤其是载体相关的副作用(Russell S等，2017，Lancet 390:849-860)。其它进入临床的基因治疗眼科疾病还有无脉络膜症，多数研究处于临床I、II期，其中阿尔伯塔大学利用rAAV2携带Rab护卫蛋白1编码基因(REP1)治疗项目，在2018年公布的I期研究结果显示了药物的安全性及有效性。Most of the ongoing clinical research on eye disease gene therapy based on rAAV vectors is aimed at congenital amaurosis (LCA, Leber’s congenital amaurosis) caused by mutations in the retinal pigment epithelium-specific 65kDa protein coding gene (RPE65). The Luxturna drug developed by Spark has been approved by the FDA for marketing in December 2017. This drug is rAAV2 carrying the hRPE65v2 gene to treat LCA and hereditary retinal dysplasia. The test results of the subjects after injection showed the effectiveness of the drug, and no obvious side effects were found, especially vehicle-related side effects (Russell S et al., 2017, Lancet 390:849-860). Other gene therapy eye diseases that have entered the clinic include choroideremia, most of which are in clinical phase I and II. Among them, the University of Alberta uses rAAV2 to carry the gene encoding Rab guard protein 1 (REP1) treatment project, which was announced in phase I in 2018. The results of the study showed the safety and effectiveness of the drug.

目前正在开展的基于rAAV载体的血友病基因治疗临床研究有二十多个，其中以B型血友病研究最多。具有长期表达能力的AAV基因药物是治疗血友病B的理想选择。UniQure公司正在进行临床I/II期研究，用AAV5携带人凝血因子IX编码基因(hFIX)治疗B型血友病，给药后1年的随访数据显示了药物安全性。患者给药后1周出现体液免疫应答，但不影响凝血因子IX的表达水平，使用现在T细胞检测“金标准”系统检测不到T细胞活化。转氨酶有升高但是不影响FIX活性，也没有发现肝毒性反应(Miesbach W等，2018，Blood 131:1022-1031)。目前还有一些利用AAV8载体治疗血友病的基因治疗项目，这些都是以肝脏为靶向性的临床研究(Nathwani AC等，2006，Blood.107:2653-2661)，AAV8也是目前公认的肝脏靶向最优的AAV血清型。Currently, there are more than 20 clinical studies on gene therapy for hemophilia based on rAAV vectors, among which hemophilia B is the most researched. AAV gene medicine with long-term expression ability is an ideal choice for the treatment of hemophilia B. UniQure is conducting phase I/II clinical research, using AAV5 carrying the gene encoding human coagulation factor IX (hFIX) to treat hemophilia B, and the follow-up data of 1 year after administration shows the safety of the drug. The patient developed a humoral immune response 1 week after administration, but it did not affect the expression level of coagulation factor IX, and T cell activation could not be detected using the current "gold standard" system for T cell detection. Transaminases increased but did not affect FIX activity, and no hepatotoxicity was found (Miesbach W et al., 2018, Blood 131:1022-1031). At present, there are still some gene therapy projects using AAV8 vectors to treat hemophilia, and these are liver-targeted clinical studies (Nathwani AC et al., 2006, Blood.107:2653-2661), and AAV8 is also currently recognized as a liver-targeted gene therapy project. Targets optimal AAV serotypes.

脊髓性肌萎缩(spinal muscular atrophy，SMA)系指一类由于以脊髓前角细胞为主的变性导致肌无力和肌萎缩的疾病。AveXis公司的AVXS-101目前已经上市，利用AAV9在神经系统感染的优势，携带运动神经元存活蛋白编码基因(SMN1)治疗SMA取得了很好的效果，并且在临床试验中所有的患者没有出现载体副作用相关的临床症状(Mendell JR等，2017，N Engl J Med 377:1713-1722)。Spinal muscular atrophy (SMA) refers to a group of diseases that cause muscle weakness and atrophy due to the degeneration of the anterior horn cells of the spinal cord. AveXis’s AVXS-101 is currently on the market. Taking advantage of AAV9’s advantages in nervous system infection, carrying the gene encoding surviving motor neuron (SMN1) in the treatment of SMA has achieved good results, and no carrier appeared in all patients in clinical trials Clinical symptoms associated with side effects (Mendell JR et al., 2017, N Engl J Med 377:1713-1722).

其它以AAV为载体的罕见病基因治疗项目，例如，利用AAV1携带酸性α-葡萄糖苷酶编码基因(GAA)治疗庞贝氏病(Smith BK等，2013，Hum Gene Ther 24:630-640)，利用改造后的AAV2.5携带微小抗肌萎缩蛋白编码基因(minidystrophin)治疗杜氏肌营养不良(DMD)(Bowles DE等，2012，Mol Ther 20:443-455)，这些临床研究结果均显示出药物的有效性及安全性。Other rare disease gene therapy projects using AAV as a vector, for example, using AAV1 to carry the gene encoding acid α-glucosidase (GAA) to treat Pompe disease (Smith BK et al., 2013, Hum Gene Ther 24:630-640), Duchenne muscular dystrophy (DMD) was treated with the modified AAV2.5 carrying the minidystrophin gene (minidystrophin) (Bowles DE et al., 2012, Mol Ther 20:443-455). The results of these clinical studies all showed that the drug effectiveness and safety.

天然AAV靶向性是有限的，尤其是在使用AAV载体进行系统给药时，根据血清型选择的不同，能够有效感染靶细胞组织的比例差异很大，没有使AAV的利用率最大化；另外其它非靶向组织细胞具有被感染的潜在风险。另外，又因为人和其它灵长类动物受AAV自然感染产生了针对天然型AAV的中和抗体，这会大大降低AAV的半衰期，而影响其药物活性的发挥。Natural AAV targeting is limited, especially when using AAV vectors for systemic administration, depending on the selection of serotypes, the proportion of cells and tissues that can effectively infect target cells varies greatly, which does not maximize the utilization of AAV; in addition Other non-targeted tissue cells are potentially at risk of infection. In addition, because humans and other primates are naturally infected by AAV, they produce neutralizing antibodies against natural AAV, which will greatly reduce the half-life of AAV and affect its drug activity.

目前对于AAV外壳蛋白改造的研究越来越多，改造的目的：一方面能够增强病毒载体的靶向性，另一方面能够降低病毒载体的免疫原性反应。At present, there are more and more studies on the modification of AAV coat protein. The purpose of modification is: on the one hand, it can enhance the targeting of viral vectors, and on the other hand, it can reduce the immunogenic response of viral vectors.

对于病毒细胞受体机制研究比较明确的载体血清型，可直接针对其细胞受体相关区域的氨基酸进行小范围或定点改造。AAV2在细胞上的受体是目前研究比较明确的。硫酸乙酰肝素蛋白多糖(HSPG)是AAV2和AAV3型的主要细胞受体，2型AAV外壳蛋白上R484、R487、K532、R585、R588氨基酸位点的改变会影响其与HSPG的结合(Opie,S.R等，2003，J Virol77:6995–7006；Summerford,C等，1999，Nat Med 5:78–82)。For the carrier serotypes with relatively clear research on the mechanism of virus cell receptors, small-scale or fixed-point modification can be directly targeted at the amino acids in the relevant regions of the cell receptors. The receptors of AAV2 on cells are relatively clear in current research. Heparan sulfate proteoglycan (HSPG) is the main cellular receptor of AAV2 and AAV3 types, and the changes of R484, R487, K532, R585, R588 amino acid sites on the type 2 AAV coat protein will affect its binding to HSPG (Opie, S.R et al., 2003, J Virol 77:6995–7006; Summerford, C et al., 1999, Nat Med 5:78–82).

改造后的新型AAV载体已被用于基因治疗临床研究中，例如AAV2.5携带minidystrophin基因治疗DMD(Bowles DE等，2012，Mol Ther 20:443-455)，其I期临床研究已经完成，并且针对新型AAV的安全性进行了大量研究。该项目所使用AAV2.5外壳蛋白为嵌合体，将AAV1外壳蛋白中与骨骼肌靶向相关的5个氨基酸移植到AAV2的外壳蛋白上。此嵌合体不但增强了对骨骼肌的靶向性，而且体液免疫反应明显低于AAV2，这些临床试验证明了改造后的病毒载体的安全性。The modified new AAV vector has been used in gene therapy clinical research, such as AAV2.5 carrying minidystrophin gene therapy for DMD (Bowles DE et al., 2012, Mol Ther 20:443-455), its phase I clinical research has been completed, and Numerous studies have been conducted on the safety of novel AAVs. The AAV2.5 coat protein used in this project is a chimera, and the 5 amino acids related to skeletal muscle targeting in the AAV1 coat protein are grafted onto the AAV2 coat protein. This chimera not only enhances the targeting of skeletal muscle, but also the humoral immune response is significantly lower than that of AAV2. These clinical trials have proved the safety of the modified viral vector.

另外，病毒的细胞受体机制不明确的情况下，我们也可以利用DNA改组(DNAshuffling)或易错PCR的方法，获得功能优化的新型AAV病毒载体的外壳蛋白。例如Grimm等人利用AAV构建的改组文库，在静脉注射免疫球蛋白的严格筛选情况下，得到一个AAV2，8，9组成的嵌合体AAV-DJ，该载体对成纤维细胞和肺等多种细胞系有更高的转导效率(Grimm D等，2008，J Virol 82：5887-5911)。Jang等人利用DNA改组文库筛选到了一个能高效感染神经干细胞的变体(Jang J H，2011，Mol Ther 19：667-675)。In addition, when the cell receptor mechanism of the virus is not clear, we can also use DNA shuffling (DNA shuffling) or error-prone PCR methods to obtain a function-optimized coat protein of a new AAV viral vector. For example, Grimm et al. used the shuffled library constructed by AAV to obtain a chimeric AAV-DJ composed of AAV2, 8, and 9 under the strict screening of intravenous immunoglobulin. lines have higher transduction efficiencies (Grimm D et al., 2008, J Virol 82:5887-5911). Jang et al. used a DNA shuffling library to screen a variant that can efficiently infect neural stem cells (Jang J H, 2011, Mol Ther 19:667-675).

目前基因治疗药物已日趋成为国内外研究的热点，而且为了使基因治疗药物更好更长久地发挥作用，寻找一种功能优化的新型AAV载体使其更好地满足做为基因治疗载体的需求是亟待解决的问题。At present, gene therapy drugs have increasingly become a research hotspot at home and abroad, and in order to make gene therapy drugs work better and longer, it is important to find a new type of AAV vector with optimized functions to better meet the needs of gene therapy vectors. Problems to be solved.

发明内容SUMMARY OF THE INVENTION

基于寻找新型AAV载体的需求，本发明提供了一组通过体内筛选来实现病毒定向进化的方法而获得的包含感兴趣特质AAV衣壳的病毒，例如，靶向性特质和/或中和性特质(例如，逃避中和抗体的能力)。本发明还提供了AAV衣壳和包含AAV衣壳的病毒粒子。Based on the need to find novel AAV vectors, the present invention provides a set of viruses containing AAV capsids with interesting traits, such as targeting and/or neutralizing traits, obtained through in vivo screening to achieve directed evolution of viruses (eg, the ability to evade neutralizing antibodies). The invention also provides AAV capsids and virions comprising AAV capsids.

一方面，本发明提供了一种编码AAV衣壳蛋白的核酸，所述核酸包含选自以下组中的AAV衣壳蛋白编码序列：In one aspect, the invention provides a nucleic acid encoding an AAV capsid protein, said nucleic acid comprising an AAV capsid protein coding sequence selected from the group consisting of:

(a)图3A的核苷酸序列(L1)(SEQ ID NO:1)；(a) the nucleotide sequence (L1) of Figure 3A (SEQ ID NO: 1);

(b)图3C的核苷酸序列(L4)(SEQ ID NO:3)；(b) the nucleotide sequence (L4) of Figure 3C (SEQ ID NO:3);

(c)图3E的核苷酸序列(L10)(SEQ ID NO:5)；(c) the nucleotide sequence (L10) of Figure 3E (SEQ ID NO:5);

(d)图3G的核苷酸序列(L52)(SEQ ID NO:7)；(d) the nucleotide sequence (L52) of Figure 3G (SEQ ID NO: 7);

(e)图3I的核苷酸序列(L58)(SEQ ID NO:9)；(e) the nucleotide sequence (L58) of Figure 3I (SEQ ID NO:9);

(f)图3K的核苷酸序列(L84)(SEQ ID NO:11)；(f) the nucleotide sequence (L84) of Figure 3K (SEQ ID NO: 11);

(g)图3M的核苷酸序列(L37)(SEQ ID NO:13)；(g) the nucleotide sequence (L37) of Figure 3M (SEQ ID NO: 13);

(h)图3O的核苷酸序列(L107)(SEQ ID NO:15)；(h) the nucleotide sequence (L107) of Figure 30 (SEQ ID NO: 15);

(i)图3Q的核苷酸序列(L57)(SEQ ID NO:17)；或(i) the nucleotide sequence (L57) of Figure 3Q (SEQ ID NO: 17); or

(j)编码由(a)-(i)任一核苷酸编码的AAV衣壳蛋白但由于遗传密码的简并性而不同于(a)-(i)的核苷酸序列。(j) A nucleotide sequence encoding the AAV capsid protein encoded by any one of (a)-(i) but different from (a)-(i) due to the degeneracy of the genetic code.

另一方面，本发明提供了由本发明的核酸编码的AAV衣壳蛋白，所述AAV衣壳蛋白的氨基酸序列选自SEQ ID NO:2，SEQ ID NO:4，SEQ ID NO:6，SEQ ID NO:8，SEQ ID NO:10，SEQ ID NO:12，SEQ ID NO:14，SEQ ID NO:16，或SEQ ID NO:18中的任一种。In another aspect, the present invention provides the AAV capsid protein encoded by the nucleic acid of the present invention, the amino acid sequence of the AAV capsid protein is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID Any of NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18.

本发明进一步提供了包含一种病毒基因组和本发明AAV衣壳蛋白的重组病毒粒子，其中病毒基因组包裹于AAV衣壳蛋白中。本发明进一步提供了一种AAV病毒基因组和本发明AAV衣壳蛋白的重组腺相关病毒粒子，其中病毒基因组包裹于AAV衣壳蛋白中。在具体的实施例中，病毒基因组是一个包含异源核酸的重组载体基因组。The present invention further provides a recombinant virion comprising a viral genome and the AAV capsid protein of the present invention, wherein the viral genome is encapsulated in the AAV capsid protein. The present invention further provides a recombinant adeno-associated virus particle of an AAV viral genome and the AAV capsid protein of the present invention, wherein the viral genome is encapsulated in the AAV capsid protein. In specific embodiments, the viral genome is a recombinant vector genome comprising heterologous nucleic acid.

本发明提供了一种细胞，该细胞包含本发明的核酸、AAV衣壳蛋白、重组病毒粒子和/或重组腺相关病毒粒子。The invention provides a cell comprising the nucleic acid of the invention, AAV capsid protein, recombinant virus particles and/or recombinant adeno-associated virus particles.

本发明提供了一种药物组合物，该药物组合物包含一种药学上可接受的载体和本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子和/或细胞。The present invention provides a pharmaceutical composition, which comprises a pharmaceutically acceptable carrier and the nucleic acid of the present invention, AAV capsid protein, recombinant virus particles, recombinant adeno-associated virus particles and/or cells.

本发明提供了本文中所述的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子、细胞和/或药物组合物在制备用于预防或治疗疾病的药物中的用途。The present invention provides the use of the nucleic acid, AAV capsid protein, recombinant virus particle, recombinant adeno-associated virus particle, cell and/or pharmaceutical composition described herein in the preparation of medicines for preventing or treating diseases.

本发明提供了一种生产包含AAV衣壳蛋白的重组病毒粒子的方法，所述方法包括：在体外向细胞提供本发明的核酸、AAV Rep蛋白编码序列、包含异源核酸的重组载体基因组，以及产生生产性感染的辅助子，允许包含AAV衣壳蛋白的重组病毒粒子的组装，以及重组载体基因组的衣壳化。The present invention provides a method for producing recombinant virion comprising AAV capsid protein, said method comprising: providing nucleic acid of the present invention, AAV Rep protein coding sequence, recombinant vector genome comprising heterologous nucleic acid to cells in vitro, and Produces helpers for productive infection, allowing assembly of recombinant virions containing AAV capsid proteins, and encapsidation of recombinant vector genomes.

本发明提供了一种生产包含AAV衣壳蛋白的重组AAV粒子的方法，所述方法包括：在体外向细胞提供本发明的核酸、AAV Rep蛋白编码序列、包含异源核酸的rAAV基因组，以及产生生产性感染的辅助子，允许包含AAV衣壳蛋白的重组病毒粒子的组装，以及重组载体基因组的衣壳化。The present invention provides a method for producing a recombinant AAV particle comprising an AAV capsid protein, the method comprising: providing the nucleic acid of the present invention, an AAV Rep protein coding sequence, an rAAV genome comprising a heterologous nucleic acid to cells in vitro, and producing Helper for productive infection, allowing assembly of recombinant virions containing AAV capsid proteins, and encapsidation of recombinant vector genomes.

本发明提供了一种将感兴趣的核酸递送给细胞的方法，所述方法包括向细胞提供本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子和/或药物组合物。The invention provides a method of delivering a nucleic acid of interest to a cell, the method comprising providing the cell with a nucleic acid of the invention, an AAV capsid protein, a recombinant virion, a recombinant adeno-associated virion, and/or a pharmaceutical composition.

本发明提供了一种向哺乳动物个体递送感兴趣核酸的方法，所述方法包括：将有效剂量本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子、细胞和/或药物组合物施用于哺乳动物。The present invention provides a method for delivering a nucleic acid of interest to a mammalian individual, the method comprising: administering an effective dose of the nucleic acid of the present invention, AAV capsid protein, recombinant virion, recombinant adeno-associated virion, cell and/or drug The compositions are administered to mammals.

本发明还提供了一种识别具有某种感兴趣嗜性特性的病毒载体例如AAV载体或AAV衣壳蛋白的方法，所述方法包括：The present invention also provides a method of identifying a viral vector such as an AAV vector or an AAV capsid protein having a certain tropism property of interest, the method comprising:

(a)提供病毒载体例如AAV载体集合，其中所述集合中的每种病毒载体包括：(a) providing a collection of viral vectors, such as AAV vectors, wherein each viral vector in the collection comprises:

(i)一种AAV衣壳蛋白，其包含通过改组两个或多个不同AAV衣壳蛋白编码序列而产生的衣壳蛋白，其中两个或多个不同AAV衣壳蛋白的氨基酸序列至少有两个氨基酸的不同；和(ii)一种病毒载体基因组例如一种AAV病毒基因组，包括一种编码(i)所述AAV衣壳蛋白的编码序列、一种AAV Rep蛋白编码序列、至少一种与AAV Rep蛋白相互作用的末端重复序列(例如5’和/或3’末端重复)，其中病毒载体基因组包裹于AAV衣壳蛋白中。(i) an AAV capsid protein comprising a capsid protein produced by shuffling two or more different AAV capsid protein coding sequences, wherein the amino acid sequences of the two or more different AAV capsid proteins have at least two and (ii) a viral vector genome such as an AAV viral genome comprising a coding sequence encoding (i) the AAV capsid protein, an AAV Rep protein coding sequence, at least one and AAV Rep protein interacting terminal repeats (eg 5' and/or 3' terminal repeats) where the viral vector genome is encapsulated in the AAV capsid protein.

(b)将病毒载体集合施用于哺乳动物个体，和(b) administering the collection of viral vectors to a mammalian individual, and

(c)从目标组织中回收多个病毒粒子或编码AAV衣壳蛋白的病毒基因组的病毒载体，从而鉴定出具有感兴趣嗜性的病毒载体或AAV衣壳蛋白。(c) recovering multiple virions or viral vectors encoding the viral genome of the AAV capsid protein from the target tissue, thereby identifying viral vectors or AAV capsid proteins with a tropism of interest.

本发明的积极效果在于：The positive effects of the present invention are:

本发明提供了一组通过定向进化和体内筛选的方法而获得的新型AAV病毒载体、AAV衣壳蛋白以及包含所述AAV衣壳蛋白的病毒粒子，相对于现有技术中的AAV病毒载体而言，具有感兴趣特质AAV衣壳的病毒，例如，靶向性特质(更高的肝组织靶向性)和/或中和性特质(例如，逃避中和抗体的能力)。The present invention provides a group of novel AAV viral vectors, AAV capsid proteins and virus particles comprising the AAV capsid proteins obtained by directed evolution and in vivo screening methods, compared to the AAV viral vectors in the prior art , viruses with AAV capsids with properties of interest, eg, targeting properties (higher liver tissue targeting) and/or neutralizing properties (eg, ability to evade neutralizing antibodies).

下面结合附图和具体实施方式对本公开做进一步说明，并非对本公开的限制。凡是依照本专利申请公开内容所进行的任何本领域等同替换，均属于本专利的保护范围。The present disclosure will be further described below in conjunction with the accompanying drawings and specific embodiments, which are not intended to limit the present disclosure. Any equivalent replacement in the field based on the disclosure content of this patent application falls within the scope of protection of this patent.

附图说明Description of drawings

图1.质粒文库随机挑选阳性克隆酶切图谱。1-12分别对应随机挑选阳性克隆样品，M代表5000bp DNA Marker。Figure 1. Restriction pattern of positive clones randomly selected from the plasmid library. 1-12 respectively correspond to randomly selected positive clone samples, and M stands for 5000bp DNA Marker.

图2.两轮体内筛选后，AAV突变体在肝脏、骨骼肌和心脏中出现的次数。经第一次体内筛选后共获得370个阳性克隆，经第二次体内筛选后共获得9组高频靶向肝脏的新型AAV序列。Figure 2. Number of occurrences of AAV mutants in liver, skeletal muscle, and heart after two rounds of in vivo selection. A total of 370 positive clones were obtained after the first in vivo screening, and a total of 9 groups of novel AAV sequences targeting the liver were obtained after the second in vivo screening.

图3A-3R.新型AAV-Cap序列。其中序列图分为核苷酸序列及编码的氨基酸序列。Figures 3A-3R. Novel AAV-Cap sequences. The sequence diagram is divided into nucleotide sequence and encoded amino acid sequence.

图4A-4F.不同体外细胞系感染比较(CAG-EGFP)。不同AAV载体包装成携带CAG-EGFP的相应病毒，在体外通过不同MOI感染不同细胞系，48h后进行流式细胞术检测分析。数值为平均值±标准差。Figures 4A-4F. Comparison of infection of different in vitro cell lines (CAG-EGFP). Different AAV vectors were packaged into corresponding viruses carrying CAG-EGFP, and different cell lines were infected with different MOIs in vitro, and flow cytometry analysis was performed 48 hours later. Values are mean ± standard deviation.

图5A-5E.不同细胞系体外感染比较(CAG-Luciferase)。不同AAV载体包装成携带CAG-Luciferase的相应病毒，在体外通过MOI 500感染不同细胞系，48h后检测Luciferase活性。数值为平均值±标准差。Figures 5A-5E. Comparison of in vitro infection of different cell lines (CAG-Luciferase). Different AAV vectors were packaged into corresponding viruses carrying CAG-Luciferase, and different cell lines were infected in vitro by MOI 500, and Luciferase activity was detected after 48 hours. Values are mean ± standard deviation.

图6.系统注射AAV载体后，小鼠不同组织中载体基因组拷贝数。不同AAV载体包装成携带CAG-Luciferase的相应病毒，以1E+11vg剂量尾部静脉注射小鼠2周后，不同组织中载体基因组拷贝数比较。数值为平均值±标准差。Figure 6. Vector genome copy number in different tissues of mice after systemic injection of AAV vector. Different AAV vectors were packaged into corresponding viruses carrying CAG-Luciferase, and after 2 weeks of tail vein injection into mice at a dose of 1E+11vg, the copy number of the vector genome in different tissues was compared. Values are mean ± standard deviation.

图7A-7E.系统注射AAV载体后，小鼠不同组织中Luciferase活性。不同AAV载体包装成携带CAG-Luciferase的相应病毒以1E+11vg剂量尾部静脉注射小鼠2周后，不同组织中Luciferase活性比较。数值为平均值±标准差。Figures 7A-7E. Luciferase activity in different tissues of mice after systemic injection of AAV vector. Different AAV vectors packaged into the corresponding virus carrying CAG-Luciferase were injected into the tail vein of mice at a dose of 1E+11vg for 2 weeks, and the Luciferase activity in different tissues was compared. Values are mean ± standard deviation.

具体实施方式Detailed ways

本发明提供了一组通过体内筛选来实现病毒定向进化的方法而获得的包含感兴趣特质AAV衣壳的病毒载体，例如，靶向性特质和/或中和性特质(例如，逃避中和抗体的能力)。本发明还提供了AAV衣壳和包含AAV衣壳的病毒粒子。The present invention provides a set of viral vectors comprising AAV capsids for traits of interest, e.g., targeting traits and/or neutralizing traits (e.g., evasion of neutralizing antibodies) obtained by in vivo screening for directed evolution of viruses Ability). The invention also provides AAV capsids and virions comprising AAV capsids.

本发明将在下文中进行更详细的阐述。本说明书并非旨在详细列出实施本发明的所有不同方式，此外，本发明所建议的各种实施例的诸多变化在不背离本发明的前提下对于本领域技术人员而言是显而易见的。因此，以下说明书旨在说明本发明的一些具体实施例，而不是详尽地说明其所有排列、组合和变化。The present invention will be explained in more detail hereinafter. This description is not intended to be an exhaustive list of all the different ways of carrying out the invention; moreover, many variations of the various embodiments suggested by the invention will be apparent to those skilled in the art without departing from the invention. Therefore, the following description is intended to illustrate some specific embodiments of the present invention, rather than exhaustively explain all permutations, combinations and changes thereof.

除非另有定义，否则本发明所用的所有技术和科学术语的含义与本发明所属领域的普通技术人员通常理解的含义相同。本发明中用于描述本发明的术语仅用于描述特定实施例，并不用于限制本发明。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used in the present invention to describe the present invention are only used to describe specific embodiments, and are not used to limit the present invention.

除另有说明外，本领域技术人员已知的标准方法可用于生产重组和合成多肽、抗体或其抗原结合片段、操纵核酸序列、生产转化细胞、构建重组AAV、修饰衣壳蛋白、包装包含有AAV rep和/或cap编码序列的载体，以及瞬时或稳定地转染包装细胞。这些技术为本领域技术人员所熟知，参见SAMBROOK等,MOLECULAR CLONING:A LABORATORY MANUAL 2nd Ed.(1989，Cold Spring Harbor,N.Y)；F.M.AUSUBEL等,CURRENT PROTOCOLS IN MOLECULARBIOLOGY(Green Publishing Associates,Inc.and John Wiley&Sons,Inc.,New York)。Standard methods known to those skilled in the art can be used to produce recombinant and synthetic polypeptides, antibodies or antigen-binding fragments thereof, manipulate nucleic acid sequences, produce transformed cells, construct recombinant AAV, modify capsid proteins, package containing Vectors for AAV rep and/or cap coding sequences, and for transient or stable transfection of packaging cells. These techniques are well known to those skilled in the art, see SAMBROOK et al., MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed. (1989, Cold Spring Harbor, N.Y); F.M. AUSUBEL et al., CURRENT PROTOCOLS IN MOLECULARBIOLOGY (Green Publishing Associates, Inc. and John Wiley & Sons, Inc., New York).

本文所提及的所有出版物、专利申请、专利、核苷酸序列、氨基酸序列和其它参考文献全部以引用方式并入。All publications, patent applications, patents, nucleotide sequences, amino acid sequences and other references mentioned herein are incorporated by reference in their entirety.

I定义I define

本发明说明书及其权利要求书中AAV衣壳亚单位中所有氨基酸位置的指定与VP1衣壳亚单位编号有关。The designation of all amino acid positions in the AAV capsid subunits in the specification and claims of the present invention is related to the numbering of the VP1 capsid subunits.

在本发明的说明书及其权利要求书中，除非上下文另有明确说明，否则单数形式“一个”、“一种”和“所述”也包括复数形式。In the description and claims of the present invention, unless the context clearly dictates otherwise, the singular forms "a", "an" and "the" also include the plural forms.

本发明中，“和/或”指包含一个或多个所述内容的任何和所有可能组合。In the present invention, "and/or" refers to any and all possible combinations including one or more of the stated contents.

如本文所用，术语“基本上包含”与核酸、蛋白质或衣壳结构有关，意指所述核酸、蛋白质或衣壳结构包含任何可显著改变所述感兴趣核酸、蛋白质或衣壳结构功能的元素，例如，所述蛋白质或衣壳或所述核酸编码的蛋白质或衣壳的靶向性特性或中和特性。As used herein, the term "substantially comprising" in relation to a nucleic acid, protein or capsid structure means that said nucleic acid, protein or capsid structure comprises any element that significantly alters the function of said nucleic acid, protein or capsid structure of interest , for example, the targeting or neutralizing properties of the protein or capsid or of the protein or capsid encoded by the nucleic acid.

本发明上下文中的术语“腺相关病毒(AAV)”，包括但不限于AAV1、AAV2、AAV3A、AAV3B、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、禽AAV、牛AAV、犬AAV、马AAV和羊AAV以及现在已知的或后来发现的任何其它AAV(BERNARD N.FIELDS等,VIROLOGY,volume 2,chapter 69，4th ed.,Lippincott-Raven Publishers)。已经确定的一些额外的AAV血清型和分支，也包括在本发明术语“AAV”中(高光平等，2004，J.Virology 78:6381-6388)。The term "adeno-associated virus (AAV)" in the context of the present invention includes, but is not limited to, AAV1, AAV2, AAV3A, AAV3B, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, avian AAV, bovine AAV, canine AAV, AAV, equine AAV and ovine AAV and any other AAV now known or later discovered (BERNARD N. FIELDS et al., VIROLOGY, volume 2, chapter 69, 4th ed., Lippincott-Raven Publishers). Some additional AAV serotypes and clades that have been identified are also included in the term "AAV" of the present invention (Gao Guangping, 2004, J. Virology 78:6381-6388).

本领域已知各种AAV和细小病毒的基因组序列，以及ITRS、Rep蛋白和衣壳蛋白亚单位的序列，此类序列可在文献或公共数据库中找到。例如，在GenBank数据库中，Genbank登记号NC 002077、NC 001401、NC 001729、NC 001863、NC 001829、NC 001862、NC 000883、NC001701、NC 001510、AF063497、U89790、AF043303、AF028705、AF028704、J02275、J01901、J02275、X01457、AF288061、AH009962、AY028226、AY028223、NC 00135、NC 001540、AF513851、AF5 13852、AY530579、AY63 1965，AY63 1966，这些公开内容通过引用并入本文中。另外，srivistava等，1983，J.Virology 45:555；Chiorini等，1998，J.Virology 71:6823；Chiorini等，1999，J.Virology 73:1309；Bantel-Schaal等，1999，J.Virology 73:939；Xiao等，1999，J.Virology 73:3994；Muramatsu等，1996，Virology 221:208；Shade等，1986，J.Virol.58:921；Gao等，2002，Proc.Nat.Acad.Sci.USA 99:11854；国际公开专利WO00/28061、WO 99/61601、WO 98/11244；美国专利6156303；这些公开内容也通过引用整体并入本文中。AAV1、AAV2和AAV3末端重复序列的早期描述，参见Xiao,X,1996,"Characterization of Adeno-associated virus(AAV)DNA replication andintegration,"Ph.D.Dissertation,University of Pittsburgh,Pittsburgh,Pa，这些内容通过引用整体并入本文中。The genome sequences of various AAV and parvoviruses, as well as the sequences of ITRS, Rep proteins and capsid protein subunits are known in the art and can be found in the literature or in public databases. For example, in the GenBank database, Genbank accession numbers NC 002077, NC 001401, NC 001729, NC 001863, NC 001829, NC 001862, NC 000883, NC001701, NC 001510, AF063497, U89790, AF043303, J01, AF0282705, 74 J02275, X01457, AF288061, AH009962, AY028226, AY028223, NC 00135, NC 001540, AF513851, AF5 13852, AY530579, AY63 1965, AY63 1966, the disclosures of which are incorporated herein by reference. Additionally, srivistava et al., 1983, J. Virology 45:555; Chiorini et al., 1998, J. Virology 71:6823; Chiorini et al., 1999, J. Virology 73:1309; Bantel-Schaal et al., 1999, J. Virology 73: 939; Xiao et al., 1999, J. Virology 73:3994; Muramatsu et al., 1996, Virology 221:208; Shade et al., 1986, J. Virol.58:921; Gao et al., 2002, Proc. Nat. Acad. Sci. USA 99:11854; International Published Patents WO00/28061, WO 99/61601, WO 98/11244; US Patent 6156303; the disclosures of which are also incorporated herein by reference in their entirety. For the early description of AAV1, AAV2 and AAV3 terminal repeat sequences, see Xiao, X, 1996, "Characterization of Adeno-associated virus (AAV) DNA replication and integration," Ph.D. Dissertation, University of Pittsburgh, Pittsburgh, Pa, these contents Incorporated herein by reference in its entirety.

“改组”或“嵌合”的AAV衣壳编码序列或AAV衣壳蛋白是将两个或多个不同的AAV衣壳蛋白序列混合后产生的核酸序列和氨基酸序列结合了两个或多个衣壳序列的部分。一种“改组”或“嵌合”的AAV病毒粒子包含一种被“改组的”或“嵌合的”AAV衣壳蛋白。A "shuffled" or "chimeric" AAV capsid coding sequence or AAV capsid protein is a nucleic acid sequence and amino acid sequence produced by mixing two or more different AAV capsid protein sequences that combine two or more Part of the shell sequence. A "shuffled" or "chimeric" AAV virion comprises a "shuffled" or "chimeric" AAV capsid protein.

此处使用的“靶向性”一词是指病毒优先进入某些细胞或组织类型和/或优先与细胞表面相互作用从而促进其进入某些细胞或组织类型，任选地且优选地在细胞中表达病毒基因组携带的序列，例如，重组病毒表达异源核苷酸序列。在重组AAV基因组的情况下，病毒基因组的基因表达可来自稳定整合的前病毒和/或非整合的附加体，以及病毒核酸可能在细胞内发生的任何其它形式。The term "targeted" as used herein means that the virus preferentially enters certain cell or tissue types and/or preferentially interacts with the cell surface to facilitate its entry into certain cell or tissue types, optionally and preferably in cells A sequence carried by a viral genome is expressed in a recombinant virus, for example, a heterologous nucleotide sequence is expressed in a recombinant virus. In the case of recombinant AAV genomes, gene expression of the viral genome may be from stably integrated proviruses and/or non-integrating episomes, as well as any other form in which viral nucleic acid may occur intracellularly.

术语“靶向性特性”是指一个或多个靶细胞、组织和/或器官的转导模式。例如，一些改组的AAV衣壳可能显示肝脏、性腺和/或生殖细胞的有效转导，一些改组的AAV衣壳对骨骼肌、横膈膜肌和/或心肌组织的只有低水平的转导，典型改组AAV衣壳具有的靶向性特征是肝脏的高转导和骨骼肌的低转导。The term "targeting properties" refers to the mode of transduction of one or more target cells, tissues and/or organs. For example, some shuffled AAV capsids may show efficient transduction of liver, gonad, and/or germ cells, some shuffled AAV capsids have only low levels of transduction of skeletal muscle, diaphragmatic muscle, and/or cardiac tissue, The targeting characteristics of typical shuffled AAV capsids are high transduction to liver and low transduction to skeletal muscle.

如本文所用，由病毒载体对细胞的“转导”是指通过将核酸由病毒载体携带并随后通过病毒载体将遗传物质转移到细胞中。As used herein, "transduction" of a cell by a viral vector refers to the transfer of genetic material into a cell by the carrying of nucleic acid by the viral vector and subsequent transfer of genetic material by the viral vector.

如本文所用，除非上下文另有说明，否则病毒粒子、载体、衣壳或衣壳蛋白质的“集合”或“多个”意指两个或多个。As used herein, unless the context dictates otherwise, a "collection" or "plurality" of virions, vectors, capsids or capsid proteins means two or more.

除非另有说明，“有效转导”或“有效靶向性”或类似术语可参照合适的对照品，例如，相对于对照品至少50％、60％、70％、80％、85％、90％、95％或更多的转导或靶向性来确定。Unless otherwise stated, "effective transduction" or "effective targeting" or similar terms may refer to a suitable control, e.g., at least 50%, 60%, 70%, 80%, 85%, 90% relative to the control %, 95% or more transduction or targeting.

类似地，可通过参考适当的对照来确定病毒对目标细胞或组织是否“非有效地转导”或“非具有有效的靶向性”。在具体实施例中，病毒载体对骨骼肌、心肌细胞非有效地转导，在具体实施例中，组织的非有效性转导是有效转导组织转导水平的20％或更少、10％或更少、5％或更少、1％或更少、0.1％或更少。Similarly, whether a virus "does not efficiently transduce" or "does not effectively target" a target cell or tissue can be determined by reference to an appropriate control. In a specific embodiment, the viral vector ineffectively transduces skeletal muscle and cardiomyocytes. In a specific embodiment, the non-effective transduction of the tissue is 20% or less, 10% of the transduction level of the effective transduction tissue or less, 5% or less, 1% or less, 0.1% or less.

如本文所用，除非另有说明，术语"多肽"包括肽和蛋白质。As used herein, unless otherwise stated, the term "polypeptide" includes peptides and proteins.

“核酸”或“核苷酸序列”是核苷酸碱基序列，可以是RNA、DNA或DNA-RNA杂交序列，包括自然发生和非自然发生的核苷酸，但最好是单链或双链DNA序列。"Nucleic acid" or "nucleotide sequence" is a sequence of nucleotide bases, which may be RNA, DNA, or DNA-RNA hybrid sequences, including naturally occurring and non-naturally occurring nucleotides, but preferably single-stranded or double-stranded strand DNA sequence.

如本文所用，"分离的"核酸或核酸序列是指与天然存在的生物体或病毒的至少一些其它成分分离的核酸或核酸序列，所述其它成分，例如，通常与核酸或核酸序列相关的细胞或病毒结构成分或其它多肽或核酸。As used herein, an "isolated" nucleic acid or nucleic acid sequence refers to a nucleic acid or nucleic acid sequence that is separated from at least some other components of a naturally occurring organism or virus, e.g., the cells with which the nucleic acid or nucleic acid sequence is ordinarily associated Or viral structural components or other polypeptides or nucleic acids.

同样，“分离的”多肽是指从自然发生的有机体或病毒的至少一些其它成分中分离的多肽，所述其它成分例如细胞或病毒结构成分或通常与该多肽相关联的其它多肽或核酸。Likewise, an "isolated" polypeptide refers to a polypeptide that is separated from at least some other component of a naturally occurring organism or virus, such as cellular or viral structural components or other polypeptides or nucleic acids with which the polypeptide is ordinarily associated.

术语“治疗”或语法上等同的术语，是指受试者病情的严重程度降低或至少部分改善或改善，和/或至少达到一种临床症状的缓解或减少，和/或病情进展有延迟和/或预防或延缓疾病或障碍的发生。此处术语“治疗”也包括对受试者的预防性治疗，例如，预防感染、癌症或疾病的发生。如本文所用，术语“预防”及其语法上等同的术语，包括任何类型的预防降低病情发生率、延缓病情发生和/或进展和/或减轻与病情相关症状的治疗。因此，除非上下文另有说明，术语“治疗”或语法等效术语是指预防和治疗方法或方案。The term "treatment" or grammatically equivalent terms means a reduction in the severity or at least partial amelioration or amelioration of a subject's condition, and/or achieving remission or reduction of at least one clinical symptom, and/or a delay in the progression of the condition and and/or prevent or delay the onset of a disease or disorder. The term "treatment" herein also includes prophylactic treatment of a subject, eg, preventing the development of infection, cancer or disease. As used herein, the term "prevention" and its grammatical equivalents include any type of preventive treatment that reduces the incidence of a condition, delays the onset and/or progression of a condition, and/or alleviates symptoms associated with a condition. Accordingly, unless the context dictates otherwise, the term "treatment" or grammatical equivalents refers to prophylactic and therapeutic methods or regimens.

在此使用的"有效的"剂量是足以对受试者获得某种改进或益处的剂量。或者说，"有效的"剂量是使受试者至少一种临床症状缓解或减少的剂量。本领域技术人员将理解，只要受试者获得改进或益处，治疗效果不必是完全的或治愈的。As used herein, an "effective" dose is a dose sufficient to achieve some improvement or benefit in a subject. Alternatively, an "effective" dose is that dose that alleviates or reduces at least one clinical symptom in a subject. Those skilled in the art will appreciate that the effect of the treatment need not be complete or curative as long as the subject obtains an improvement or benefit.

“异源核苷酸序列”或“异源核酸”通常不是在病毒中自然发生的序列。通常，异源核酸或核苷酸序列包含编码多肽和/或非翻译RNA的开放阅读框。A "heterologous nucleotide sequence" or "heterologous nucleic acid" is generally not a sequence that occurs naturally in a virus. Typically, a heterologous nucleic acid or nucleotide sequence comprises an open reading frame encoding a polypeptide and/or untranslated RNA.

"治疗性多肽"可以是一种多肽，可以减轻或减少因细胞或受试者中蛋白缺失或缺陷而导致的症状。此外，"治疗性多肽"可以是一种以其它方式给受试者带来益处的多肽，例如抗癌效果或移植存活率的提高。A "therapeutic polypeptide" may be a polypeptide that alleviates or reduces symptoms resulting from a missing or defective protein in a cell or subject. In addition, a "therapeutic polypeptide" may be a polypeptide that otherwise benefits the subject, such as an anti-cancer effect or an increase in transplant survival.

如本文所用，“载体”、“病毒载体”、“递送载体”通常指作为核酸递送载体的病毒粒子，它包括包装在病毒体内的病毒核酸即载体基因组。根据本发明的病毒载体包括本发明嵌合的AAV衣壳，并且可以包装AAV或重组AAV基因组或包括病毒核酸在内的任何其它核酸。或者，在某些情况下，术语“载体”、“病毒载体”、“递送载体”可用于指在没有病毒粒子情况下的载体基因组和/或指作为转运体的病毒衣壳，以传递与衣壳相连或包装在衣壳内的分子。As used herein, "vector", "viral vector", and "delivery vector" generally refer to virus particles as nucleic acid delivery vehicles, which include viral nucleic acid packaged in the virus body, that is, the vector genome. Viral vectors according to the invention comprise chimeric AAV capsids of the invention and can package AAV or recombinant AAV genome or any other nucleic acid including viral nucleic acid. Alternatively, in some cases, the terms "vector", "viral vector", "delivery vehicle" may be used to refer to the vector genome in the absence of the virion and/or to the viral capsid as a transporter to deliver and Shell-attached or packaged molecule within a capsid.

“重组AAV载体基因组”或“rAAV基因组”是一种AAV基因组，其包含至少一个反向末端重复和一个或多个异源核苷酸序列。rAAV载体通常将145个碱基末端重复序列(TRs)保留在顺式结构中以产生病毒；然而，修饰的AAV-TRs和非AAV-TRs也可用于此目的。所有其它的病毒序列是可有可无的，并且可以反式提供(Muzyczka，1992，curr-topicsmicrobial.Immunol.158:97)。rAAV载体视情况可包含两个TRs，例如AAV TRs，通常位于异源核苷酸序列的5'和3'端，但不必与之相邻。TRs可以是相同的，也可以是不同的。载体基因组也可以在3'或5'端包含一个TR。A "recombinant AAV vector genome" or "rAAV genome" is an AAV genome comprising at least one inverted terminal repeat and one or more heterologous nucleotide sequences. rAAV vectors typically retain 145-base terminal repeats (TRs) in cis for virus production; however, modified AAV-TRs and non-AAV-TRs can also be used for this purpose. All other viral sequences are dispensable and can be provided in trans (Muzyczka, 1992, curr-topicsmicrobial. Immunol. 158:97). rAAV vectors optionally contain two TRs, such as AAV TRs, usually located 5' and 3' to, but not necessarily adjacent to, the heterologous nucleotide sequence. TRs can be the same or different. The vector genome may also contain a TR at the 3' or 5' end.

术语“末端重复”或“tr”包括形成发夹结构并作为反向末端重复发挥作用的任何病毒末端重复或合成序列，即介导诸如复制、病毒包装、整合和/或前病毒拯救等所需功能。TR可以是AAV TR或非AAV TR。例如，非AAV TR序列可以是其它细小病毒，例如，犬细小病毒CPV、小鼠细小病毒MVM、人类细小病毒B-19，或作为SV40复制源的SV40发夹，并可以通过截断、替换、删除、插入进一步修改。此外，TR可以部分或完全合成，如美国专利US5478745中所述的“双D序列”。The term "terminal repeat" or "tr" includes any viral terminal repeat or synthetic sequence that forms a hairpin structure and functions as an inverted terminal repeat, i.e., required to mediate, for example, replication, viral packaging, integration and/or proviral rescue Function. TR can be AAV TR or non-AAV TR. For example, the non-AAV TR sequence can be other parvoviruses, e.g., canine parvovirus CPV, mouse parvovirus MVM, human parvovirus B-19, or the SV40 hairpin that is the source of SV40 replication, and can be modified by truncation, substitution, deletion , insert further modifications. In addition, TR can be partially or fully synthesized, such as the "double D sequence" described in US Patent No. 5,478,745.

“AAV-末端重复”或“AAV TR”可来自任何AAV，包括但不限于血清型1、2、3、4、5、6、7、8、9、10、11或任何其它已知或以后发现的AAV。只要末端重复介导所需功能，例如复制、病毒包装、整合和/或前病毒拯救等，则无需具有天然末端重复序列，例如，天然AAV TR序列可通过插入、删除、截断和/或错义突变而改变。"AAV-terminal repeat" or "AAV TR" may be from any AAV, including but not limited to serotypes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or any other known or later Discovery of AAV. It is not necessary to have native terminal repeat sequences as long as the terminal repeats mediate the desired function, such as replication, viral packaging, integration, and/or proviral rescue, etc., e.g., native AAV TR sequences can be modified by insertions, deletions, truncations, and/or missense change by mutation.

术语“重组AAV粒子”和“重组AAV颗粒”可以互换使用。“重组AAV粒子”或“重组AAV颗粒”包含包装在AAV衣壳内的重组AAV载体基因组。The terms "recombinant AAV particle" and "recombinant AAV particle" are used interchangeably. A "recombinant AAV particle" or "recombinant AAV particle" comprises a recombinant AAV vector genome packaged within an AAV capsid.

“基本保留”某种特性，是指至少保留约75％、85％、90％、95％、97％、98％、99％或100％的所述特性，例如，活性或其它可测量特性。By "substantially retaining" a property is meant that at least about 75%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% of said property, eg, activity or other measurable property, is retained.

II、定向进化和体内筛选法鉴定的嵌合AAV衣壳II. Chimeric AAV capsids identified by directed evolution and in vivo screening

发明人已鉴定出具有感兴趣特征的“嵌合”或“改组”AAV衣壳结构，例如，靶向性特性和/或中和特性。在具体实施例中，嵌合AAV衣壳显示肝脏的高效转导和/或骨骼肌和/或心肌的低效转导。The inventors have identified "chimeric" or "shuffled" AAV capsid structures with interesting characteristics, eg, targeting properties and/or neutralizing properties. In specific embodiments, chimeric AAV capsids exhibit efficient transduction of liver and/or inefficient transduction of skeletal and/or cardiac muscle.

因此，在一些实施例中，本发明提供包含或基本上包含以下氨基酸序列或由以下氨基酸序列组成的嵌合AAV衣壳以及包含所述嵌合AAV衣壳的病毒，所述氨基酸序列在图3B、3D、3F、3H、3J、3L、3N、3P或3R中展示。Accordingly, in some embodiments, the invention provides chimeric AAV capsids comprising or consisting essentially of or consisting of the amino acid sequence shown in Figure 3B and viruses comprising said chimeric AAV capsids , 3D, 3F, 3H, 3J, 3L, 3N, 3P or 3R.

在具体实施例中，所述嵌合AAV衣壳蛋白可以包含或基本上包含分别由图3B、图3D、图3F、图3H、图3J、图3L、图3N、图3P或图3R所示的氨基酸序列，或由这些图所示的氨基酸序列组成。In specific embodiments, the chimeric AAV capsid protein may comprise or substantially comprise the expression shown in Figure 3B, Figure 3D, Figure 3F, Figure 3H, Figure 3J, Figure 3L, Figure 3N, Figure 3P or Figure 3R, respectively. or consisting of the amino acid sequences shown in these figures.

此外，在非限制性实施例中，本发明的嵌合AAV衣壳蛋白可由核酸编码，所述核酸包含或基本上包含分别由图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q所示的核苷酸序列，或由这些图所示的核苷酸序列组成；或编码由上述任何一种核苷酸序列编码的AAV衣壳或衣壳蛋白的核苷酸序列，但由于密码子的简并性而与上述核苷酸序列不同。在本发明说明书和所附权利要求书中的所有氨基酸位置的指定均与VP1编号有关。本领域技术人员将理解，由于AAV衣壳编码序列的重叠，本文所述的修改也可能导致VP2和/或VP3衣壳亚基的改变。Furthermore, in a non-limiting example, a chimeric AAV capsid protein of the invention may be encoded by a nucleic acid comprising or essentially comprising the , the nucleotide sequence shown in Figure 3M, Figure 3O or Figure 3Q, or consist of the nucleotide sequence shown in these Figures; or encode the AAV capsid or capsid protein encoded by any of the above-mentioned nucleotide sequences nucleotide sequence, but differs from the above nucleotide sequence due to codon degeneracy. All designations of amino acid positions in the present specification and appended claims are relative to VP1 numbering. Those skilled in the art will appreciate that the modifications described herein may also result in changes in the VP2 and/or VP3 capsid subunits due to overlapping AAV capsid coding sequences.

本发明还提供了包含或基本上包含以下氨基酸序列或由以下氨基酸序列组成的嵌合AAV衣壳蛋白，评价诸如病毒转导和/或抗体中和等生物学性质的方法在本领域内是众所周知的。The present invention also provides a chimeric AAV capsid protein comprising or substantially comprising or consisting of the following amino acid sequence, methods for evaluating biological properties such as viral transduction and/or antibody neutralization are well known in the art of.

保守氨基酸取代在本领域已知。在具体实施例中，保守氨基酸取代包括以下组中的一个或多个取代：甘氨酸、丙氨酸；缬氨酸、异亮氨酸、亮氨酸；天冬氨酸、谷氨酸；天冬酰胺、谷氨酰胺；丝氨酸、苏氨酸；赖氨酸、精氨酸；和/或苯丙氨酸、酪氨酸。Conservative amino acid substitutions are known in the art. In particular embodiments, conservative amino acid substitutions include one or more of the following group: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; aspartic acid Amide, Glutamine; Serine, Threonine; Lysine, Arginine; and/or Phenylalanine, Tyrosine.

本领域技术人员显而易见地将图3B、3D、3F、3H、3J、3L、3N、3P、3R所示的嵌合AAV衣壳蛋白的氨基酸序列应用于本领域已知的其它任何修饰，通过进一步修饰以获得所需的特性。例如，对AAV2衣壳序列进行R484E和R585E突变，则改善AAV载体对心脏的转导(Muller等，2006，Cardiovascular Research 70:70-78)。做为进一步非限制性修饰的可能性，对衣壳蛋白进行修饰以纳入靶向序列或有助于纯化和/或检测的序列，例如衣壳蛋白可与谷胱甘肽-S-转移酶、麦芽糖结合蛋白、肝素/硫酸肝素结合域、多聚-HIS、配体和/或报告蛋白、免疫球蛋白Fc片段、单链抗体、血凝素、C-MYC、标签表位等的全部或部分融合以形成融合蛋白。本领域已知的将靶向肽插入AAV衣壳的方法，例如，国际专利WO 00/28004；Nicklin等，2001，Molecular Therapy 474-181；White等，2004，Circulation 109:513-319；Muller等，2003，Nature Biotech 21:1040-1046。Those skilled in the art will obviously apply the amino acid sequence of the chimeric AAV capsid protein shown in Figure 3B, 3D, 3F, 3H, 3J, 3L, 3N, 3P, 3R to any other modification known in the art, by further Modified to obtain desired properties. For example, R484E and R585E mutations in the AAV2 capsid sequence improved transduction of the heart by AAV vectors (Muller et al., 2006, Cardiovascular Research 70:70-78). As a further non-limiting modification possibility, the capsid protein can be modified to include targeting sequences or sequences that facilitate purification and/or detection, e.g. capsid protein can interact with glutathione-S-transferase, All or part of maltose binding protein, heparin/heparan sulfate binding domain, poly-HIS, ligand and/or reporter protein, immunoglobulin Fc fragment, single chain antibody, hemagglutinin, C-MYC, tag epitope, etc. fused to form a fusion protein. Methods known in the art for inserting targeting peptides into AAV capsids, for example, International Patent WO 00/28004; Nicklin et al., 2001, Molecular Therapy 474-181; White et al., 2004, Circulation 109:513-319; Muller et al. , 2003, Nature Biotech 21:1040-1046.

本发明的病毒可进一步包含如国际专利WO01/92551及美国专利US7465583中所述的双重病毒基因组。The virus of the present invention may further comprise a double viral genome as described in International Patent WO01/92551 and US Patent US7465583.

本发明还提供了包含本发明嵌合AAV衣壳蛋白的重组病毒粒子，其中载体基因组包裹于病毒粒子中，优选AAV载体基因组。在具体实施例中，本发明提供了一种重组AAV粒子，其包含本发明的嵌合AAV衣壳蛋白，其中AAV载体基因组包裹于AAV衣壳中。The present invention also provides a recombinant virion comprising the chimeric AAV capsid protein of the present invention, wherein the vector genome is encapsulated in the virion, preferably the AAV vector genome. In a specific embodiment, the present invention provides a recombinant AAV particle comprising the chimeric AAV capsid protein of the present invention, wherein the AAV vector genome is encapsulated in the AAV capsid.

在具体实施例中，病毒是包含感兴趣异源核酸的重组载体。因此，本发明可用于将核酸在体外和体内递送至细胞。在代表性实施例中，本发明的重组载体用于将核酸递送或转移到动物细胞，优选哺乳动物细胞。In specific embodiments, the virus is a recombinant vector comprising a heterologous nucleic acid of interest. Accordingly, the present invention can be used to deliver nucleic acids to cells in vitro and in vivo. In representative embodiments, the recombinant vectors of the invention are used to deliver or transfer nucleic acids into animal cells, preferably mammalian cells.

任何异源核苷酸序列均可由本发明病毒载体递送。感兴趣的核酸包括编码多肽的核酸，任选治疗性多肽和/或免疫原性多肽。Any heterologous nucleotide sequence can be delivered by the viral vectors of the invention. Nucleic acids of interest include nucleic acids encoding polypeptides, optionally therapeutic and/or immunogenic polypeptides.

治疗性多肽包括但不限于胰岛素、胰高血糖素、生长激素、甲状旁腺激素、生长激素释放因子、卵泡刺激素、黄体激素、人绒毛膜促性腺激素、血管内皮生长因子、促血管生成素、血管抑制素、粒细胞集落刺激因子、促红细胞生产素、结缔组织生长因子、碱性成纤维细胞生长因子、酸性成纤维细胞生长因子、表皮生长因子、血小板衍生的生长因子、胰岛素生长因子Ⅰ和Ⅱ、转化生长因子α超家族中的任一个、活化素、抑制素、骨形态发生蛋白中的任一个、神经生长因子、脑衍生神经营养因子、神经营养蛋白NT-3和NT-4/5、睫状神经营养因子、胶质细胞系衍生神经营养因子、聚集蛋白、脑信号蛋白/瓦解蛋白家族的任一个、导蛋白-1和导蛋白-2、肝细胞生长因子、肝配蛋白、头蛋白、sonic hedgehog蛋白和酪氨酸羟化酶、血小板生成素、白介素IL-1到1L-25、单核细胞化学诱导物蛋白、白血病抑制因子、粒细胞-巨噬细胞集落刺激因子、Fas配体、肿瘤坏死因子α和β、干扰素α、β和γ、干细胞因子、flk-2/flt3配体。免疫系统产生的基因产物也可用于本发明，这些产物包括但不限于免疫球蛋白1gG、IgM、IgA、IgD和IgE、嵌合免疫球蛋白、人源化抗体、单链抗体、T细胞受体、嵌合T细胞受体、单链T细胞受体、Ⅰ类和Ⅱ类MHC分子以及改造的免疫球蛋白、补体调节蛋白、膜辅因子蛋白、衰变加速因子、CR1、CF2和CD59，低密度脂蛋白受体、高密度脂蛋白受体、极低密度脂蛋白受体和清除受体、糖皮质激素受体和雌激素受体、维生素D受体和其它核受体、jun/fos、max、mad、血清效应因子、AP-1、AP2、myb、MyoD和肌细胞生成蛋白、TFE3、E2F、ATF1、ATF2、ATF3、ATF4、ZF5、NFAT、CREB、HNF-4、C/EBP、CCAAT-盒结合蛋白、干扰素调节因子、Wilms肿瘤蛋白、ETS-结合蛋白、STAT、GATA-盒结合蛋白和翼状螺旋蛋白的叉头蛋白家族，氨甲酰合成酶1、鸟氨酸转氨甲酰酶、精氨基琥珀酸合成酶、精氨基琥珀酸裂合酶、精氨酸酶、延胡索酰乙酰乙酸水解酶、苯丙氨酸羟化酶、α-1抗胰蛋白酶、葡萄糖苷酶、葡萄糖-6-磷酸酶、胆色素原脱氨酶、胱硫醚B合成酶、支链酮酸脱羧酶、异戊酰-CoA脱氢酶、丙酰-CoA羧化酶、甲基丙二酰CoA变位酶、戊二酰-CoA脱氢酶、胰岛素、β-葡糖苷酶、丙酮酸羧酸盐、肝磷酸化酶、磷酸化酶激酶、甘氨酸脱羧酶、H-蛋白、T-蛋白、囊性纤维化跨膜传导调节蛋白、抗肌萎缩蛋白、α-半乳糖苷酶、β-半乳糖苷酶、溶酶体酶、凝血因子、以及在需要的个体中具有治疗作用的任何其它多肽。Therapeutic polypeptides include but are not limited to insulin, glucagon, growth hormone, parathyroid hormone, growth hormone releasing factor, follicle stimulating hormone, luteinizing hormone, human chorionic gonadotropin, vascular endothelial growth factor, angiopoietin , angiostatin, granulocyte colony-stimulating factor, erythropoietin, connective tissue growth factor, basic fibroblast growth factor, acidic fibroblast growth factor, epidermal growth factor, platelet-derived growth factor, insulin growth factor I and II, any of the transforming growth factor alpha superfamily, activin, inhibin, any of the bone morphogenetic proteins, nerve growth factor, brain-derived neurotrophic factor, neurotrophins NT-3 and NT-4/ 5. Ciliary neurotrophic factor, glial cell line-derived neurotrophic factor, aggregation protein, any one of the semaphorin/disintegration protein family, netrin-1 and netrin-2, hepatocyte growth factor, ephrin, Noggin, sonic hedgehog protein and tyrosine hydroxylase, thrombopoietin, interleukins IL-1 to 1L-25, monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony-stimulating factor, Fas Ligands, tumor necrosis factor alpha and beta, interferon alpha, beta and gamma, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system can also be used in the present invention, these products include but are not limited to immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors , chimeric T cell receptor, single chain T cell receptor, MHC class I and class II molecules and engineered immunoglobulins, complement regulatory proteins, membrane cofactor proteins, decay accelerating factors, CR1, CF2 and CD59, low density Lipoprotein receptors, high-density lipoprotein receptors, very low-density lipoprotein receptors and clearance receptors, glucocorticoid receptors and estrogen receptors, vitamin D receptors and other nuclear receptors, jun/fos, max , mad, serum effectors, AP-1, AP2, myb, MyoD and myogenin, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, CCAAT- Box-binding proteins, interferon regulators, Wilms tumor protein, ETS-binding proteins, STAT, GATA-box-binding proteins and the forkhead protein family of winged helix proteins, carbamyl synthetase 1, ornithine transcarbamylase , argininosuccinate synthase, argininosuccinate lyase, arginase, fumaryl acetoacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucosidase, glucose-6 -Phosphatase, porphobilinogen deaminase, cystathionine B synthase, branched-chain ketoacid decarboxylase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, methylmalonyl-CoA transmutation Enzyme, glutaryl-CoA dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, liver phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, cystic fibrosis Transmembrane conductance regulator protein, dystrophin, α-galactosidase, β-galactosidase, lysosomal enzymes, coagulation factors, and any other polypeptide having a therapeutic effect in an individual in need thereof.

编码多肽的异源核苷酸序列包括编码报告基因多肽的序列。本领域已知的报告基因编码的多肽，包括但不限于绿色荧光蛋白、β-半乳糖苷酶、碱性磷酸酶、荧光素酶和氯霉素乙酰转移酶等。A heterologous nucleotide sequence encoding a polypeptide includes a sequence encoding a reporter polypeptide. Polypeptides encoded by reporter genes known in the art include, but are not limited to, green fluorescent protein, β-galactosidase, alkaline phosphatase, luciferase, and chloramphenicol acetyltransferase.

另一方面，异源核酸可编码一种反义寡核苷酸，包括核酶、干扰RNA，干扰RNA包括介导基因沉默的小干扰RNA(Sharp等，2000，Science 287:2431)、microRNA、其它非翻译功能RNAs，如“指导”RNAS(Gorman等，1998，Proc.Nat.Acad.Sci.USA 95:4929；美国专利US5869248)等。In another aspect, the heterologous nucleic acid can encode an antisense oligonucleotide, including ribozymes, interfering RNAs, including small interfering RNAs that mediate gene silencing (Sharp et al., 2000, Science 287:2431), microRNAs, Other non-translational functional RNAs, such as "guide" RNAS (Gorman et al., 1998, Proc. Nat. Acad. Sci. USA 95:4929; US Patent US5869248) and the like.

所属技术领域内已知，反义核酸和抑制性RNA序列可诱导“外显子跳跃”。因此，异源核酸可以编码反义核酸或抑制RNA，诱导适当的外显子跳跃。Antisense nucleic acids and inhibitory RNA sequences are known in the art to induce "exon skipping". Thus, the heterologous nucleic acid can encode an antisense nucleic acid or suppressor RNA, inducing proper exon skipping.

核酶是以位点特异性方式裂解核酸的RNA蛋白复合物。核酶具有特定的催化域，具有内切酶活性(Kim等，1987，Proc.Natl.Acad.Sci.USA 84:8788；Gerlach等，1987，Nature328:802；Forster and Symons，1987，Cell 49:211)。Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific manner. Ribozyme has a specific catalytic domain and has endonuclease activity (Kim et al., 1987, Proc. 211).

microRNAs是一种天然的细胞RNA分子，通过控制mRNA的稳定性来调节多个基因的表达。特定microRNA的过度表达或减少可用于治疗功能障碍，并已被证明在许多疾病状态和疾病动物模型中有效(Couzin，2008，Science 319：1782-1784)。嵌合AAV可用于将microRNA导入细胞、组织和受试者中，用于治疗遗传病和获得性疾病，或增强某些组织的功能并促进其生长，例如，mir-1、mir-133、mir-206和/或mir-208可用于治疗心脏和骨骼肌疾病(Chen等，2006，Genet 38：228-233；van Rooij等，2008，Trends Genet。24：159-166)，microRNA也可用于基因传递后调节免疫系统(Brown等，2007，Blood 110：4144-4152)。microRNAs are natural cellular RNA molecules that regulate the expression of multiple genes by controlling mRNA stability. Overexpression or reduction of specific microRNAs can be used to treat dysfunction and has been shown to be effective in many disease states and animal models of disease (Couzin, 2008, Science 319: 1782-1784). Chimeric AAVs can be used to introduce microRNAs into cells, tissues, and subjects for the treatment of genetic and acquired diseases, or to enhance the function and growth of certain tissues, e.g., mir-1, mir-133, mir -206 and/or mir-208 can be used to treat heart and skeletal muscle diseases (Chen et al., 2006, Genet 38: 228-233; van Rooij et al., 2008, Trends Genet. 24: 159-166), microRNA can also be used in gene Post-transmission modulation of the immune system (Brown et al., 2007, Blood 110:4144-4152).

本文所用术语“反义寡核苷酸”，包括“反义RNA”，是指与特定DNA或RNA序列互补并特异杂交的核酸。反义寡核苷酸和编码相同的核酸可以按照常规技术制造。As used herein, the term "antisense oligonucleotide", including "antisense RNA", refers to a nucleic acid that is complementary to and specifically hybridizes to a specific DNA or RNA sequence. Antisense oligonucleotides and nucleic acids encoding the same can be produced according to conventional techniques.

本领域技术人员理解，反义寡核苷酸不必与目标序列完全互补，只要序列相似程度足以使反义核苷酸序列特异地与目标序列杂交并减少蛋白质产物的产生，例如，相似度至少约30％、40％、50％、60％、70％、80％、90％、95％或更多。为了确定杂交的特异性，这种寡核苷酸与靶序列的杂交可以在弱、中、甚至严格的条件下进行。Those skilled in the art understand that the antisense oligonucleotide need not be completely complementary to the target sequence, as long as the sequence similarity is sufficient to allow the antisense nucleotide sequence to specifically hybridize to the target sequence and reduce the production of protein products, for example, the similarity is at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more. Hybridization of such oligonucleotides to target sequences can be performed under mildly, moderately or even stringent conditions in order to determine the specificity of hybridization.

反义寡核苷酸可通过本领域已知的程序化学合成和酶结合反应合成。例如，一个反义寡核苷酸可以用自然产生的核苷酸或各种修饰的核苷酸进行化学合成，其目的是增加分子的生物稳定性和/或增加反义链和正义链之间形成的双链稳定性，例如，可以使用硫代磷酸酯衍生物和吖啶取代的核苷酸。Antisense oligonucleotides can be synthesized by chemical synthesis and enzymatic conjugation reactions by procedures known in the art. For example, an antisense oligonucleotide can be chemically synthesized using naturally occurring nucleotides or various modified nucleotides in order to increase the biological stability of the molecule and/or to increase the distance between the antisense and sense strands. The duplex formed is stabilized, for example, using phosphorothioate derivatives and acridine-substituted nucleotides.

可用于产生反义寡核苷酸的修饰核苷酸，包括5-氟尿嘧啶、5-溴尿嘧啶、5-氯尿嘧啶、5-碘尿嘧啶、次黄嘌呤、黄嘌呤、4-乙酰胞嘧啶、5-羧甲基氨基甲基-2-硫尿嘧啶、5-羧甲基氨基甲基尿嘧啶、二氢尿嘧啶、β-D-半乳糖基喹啉、肌苷、N6-异戊烯基赖氨酸、1-甲基鸟嘌呤、1-甲基肌苷、2,2-二甲基鸟嘌呤、2-甲基腺嘌呤、2-甲基鸟嘌呤、3-甲基胞苷、5-甲基胞嘧啶、N6-腺嘌呤、7-甲基鸟嘌呤、5-甲基氨基甲基尿嘧啶、5-甲氧基氨基甲基-2-硫脲嘧啶、β-D-甘露糖基喹啉、5’-甲氧基羧甲基脲嘧啶、5-甲氧基脲嘧啶、2-甲硫基-N6-异戊烯基胺、脲嘧啶-5-氧乙酸、喹啉、2-硫胞嘧啶、5-甲基-2-硫脲嘧啶、2-硫脲嘧啶、4-硫脲嘧啶、5-甲基脲嘧啶、脲嘧啶-5-氧乙酸甲酯、尿嘧啶-5-氧乙酸、5-甲基-2-硫脲嘧啶、和2,6-二氨基嘌呤等。Modified nucleotides that can be used to generate antisense oligonucleotides, including 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine , 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, β-D-galactosylquinoline, inosine, N6-isopentene lysine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytidine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, β-D-mannose Quinoline, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-prenylamine, uracil-5-oxyacetic acid, quinoline, 2 -thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxoacetate methyl ester, uracil-5- Oxyacetic acid, 5-methyl-2-thiouracil, and 2,6-diaminopurine, etc.

反义寡核苷酸可经化学修饰与另一分子共价结合。例如，反义寡核苷酸可以与一种分子结合，这种分子有助于传递到感兴趣的细胞，增强鼻粘膜的吸收、提供可检测的标记物、增加寡核苷酸的生物利用度、提高寡核苷酸的稳定性、或改善制剂或药代动力学特性等。共轭分子包括但不限于胆固醇、脂类、多胺、聚酰胺、聚酯、报告分子、生物素、染料、聚乙二醇、人血清白蛋白、酶、抗体或抗体片段或细胞受体配体。Antisense oligonucleotides can be chemically modified to covalently bind another molecule. For example, antisense oligonucleotides can be conjugated to a molecule that facilitates delivery to cells of interest, enhances uptake in the nasal mucosa, provides a detectable marker, increases bioavailability of the oligonucleotide , Improve the stability of oligonucleotides, or improve formulation or pharmacokinetic properties, etc. Conjugated molecules include, but are not limited to, cholesterol, lipids, polyamines, polyamides, polyesters, reporter molecules, biotin, dyes, polyethylene glycol, human serum albumin, enzymes, antibodies or antibody fragments, or cell receptor ligands body.

为提高稳定性、核酸酶抗性、生物利用度、制剂特征和/或药代动力学特性，对核酸进行的其它修饰在本领域内也是已知的。Other modifications to nucleic acids to improve stability, nuclease resistance, bioavailability, formulation characteristics and/or pharmacokinetic properties are also known in the art.

RNA干扰是转录后基因沉默的一种机制，它将与靶序列相对应的双链RNA(DsRNA)导入细胞或生物体中，导致相应mRNA的降解。RNAi实现基因沉默的机制已在多篇综述文章中报道(Sharp等，2001，Genes Dcv 15:485-490；Hammond等，2001，Nature Rev Gen 2:110-119)。在基因表达恢复之前，RNAi效应在多个细胞分裂中持续存在。因此，RNAi是一种在RNA水平上进行靶向敲除的有效方法。RNAi已被证明在人类细胞，包括人类胚胎肾脏和HeLa细胞中获得成功(Elbashir等，2001，Nature 411:494-498)。现已证明，大约21个核苷酸的短合成dsRNA，也被称为“短干扰RNA”，可以在不触发抗病毒反应的情况下介导哺乳动物细胞的沉默(Elbashir等，2001，Nature 411:494-498；Caplen等，2001，Proc.Nat Acad.Sci.98:9742)。RNA interference is a mechanism of post-transcriptional gene silencing, which introduces double-stranded RNA (DsRNA) corresponding to the target sequence into cells or organisms, resulting in the degradation of the corresponding mRNA. The mechanism by which RNAi achieves gene silencing has been reported in several review articles (Sharp et al., 2001, Genes Dcv 15:485-490; Hammond et al., 2001, Nature Rev Gen 2:110-119). RNAi effects persisted across multiple cell divisions before gene expression was restored. Therefore, RNAi is an effective method for targeted knockout at the RNA level. RNAi has been shown to be successful in human cells, including human embryonic kidney and HeLa cells (Elbashir et al., 2001, Nature 411:494-498). Short synthetic dsRNAs of approximately 21 nucleotides, also known as "short interfering RNAs", have been shown to mediate silencing in mammalian cells without triggering antiviral responses (Elbashir et al., 2001, Nature 411 :494-498; Caplen et al., 2001, Proc. Nat Acad. Sci. 98:9742).

RNAi分子可以是短发夹RNA(Paddison等，2002，PNAS USA 99:1443-1448)，它在细胞中通过RNaseIII酶切作用加工成20-25长度的sirna分子。ShRNA通常有一个茎环结构，即两个反向重复序列由一个短间隔序列连接。The RNAi molecule can be short hairpin RNA (Paddison et al., 2002, PNAS USA 99:1443-1448), which is processed into 20-25 siRNA molecules in cells by RNaseIII enzyme cleavage. ShRNA usually has a stem-loop structure, that is, two inverted repeats connected by a short spacer.

产生RNAi的方法包括化学合成、体外转录、Dicer体外或体内消化长dsRNA、递送载体在体内表达以及从PCR来源RNAi表达盒在体内表达。Methods to generate RNAi include chemical synthesis, in vitro transcription, Dicer digestion of long dsRNA in vitro or in vivo, in vivo expression from delivery vectors, and in vivo expression from PCR-derived RNAi expression cassettes.

RNAi分子的反义区可以完全与目标序列互补，但只要它与目标序列特异性杂交并减少蛋白质产物的产生，如至少约30％、40％、50％、60％、70％、80％、90％、95％或更多，则不需要与目标序列完全互补。在一些实施例中，所述寡核苷酸与目标序列的杂交可在如上文所定义的弱严格性、中等严格性或甚至高等严格性条件下进行。The antisense region of the RNAi molecule can be completely complementary to the target sequence, but as long as it specifically hybridizes to the target sequence and reduces the production of protein product, such as at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more, it does not need to be completely complementary to the target sequence. In some embodiments, the hybridization of the oligonucleotide to the target sequence may be performed under conditions of low stringency, medium stringency or even high stringency as defined above.

在其它实施方案中，RNAi的反义区域与目标序列具有至少约60％、70％、80％、90％、95％、97％、98％或更高的序列一致性，并减少蛋白质产物的产生至少约30％、40％、50％、60％、70％、80％、90％、95％或更多。在一些实施例中，与目标序列相比，反义区域包含1、2、3、4、5、6、7、8、9或10个不匹配。不匹配通常在dsRNA末端比在中心部分更容易接受。RNAi分子可以包含修饰糖、修饰核苷酸、骨架连接以及上述反义寡核苷酸的其它修饰。In other embodiments, the antisense region of the RNAi has at least about 60%, 70%, 80%, 90%, 95%, 97%, 98% or more sequence identity to the target sequence and reduces the protein product. Produces at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more. In some embodiments, the antisense region comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches compared to the target sequence. Mismatches are generally more acceptable at the ends of the dsRNA than in the center. RNAi molecules may contain modified sugars, modified nucleotides, backbone linkages, and other modifications of the antisense oligonucleotides described above.

本发明还提供表达免疫原性多肽的重组病毒载体。异源核酸可编码本领域已知的任何感兴趣的免疫原，包括但不限于来自人类免疫缺陷病毒、流感病毒、Gag蛋白、肿瘤抗原、癌症抗原、细菌抗原、病毒抗原等。或者，该免疫原可呈现在病毒衣壳中，例如，通过共价修饰与病毒外壳结合。The invention also provides recombinant viral vectors expressing immunogenic polypeptides. The heterologous nucleic acid may encode any immunogen of interest known in the art, including but not limited to those from human immunodeficiency virus, influenza virus, Gag proteins, tumor antigens, cancer antigens, bacterial antigens, viral antigens, and the like. Alternatively, the immunogen can be presented in the viral capsid, eg, associated with the viral capsid by covalent modification.

细小病毒用作疫苗为本领域所公知(US5916563、US5905040、US5882652)。抗原可能存在于病毒衣壳中，或者抗原可以从引入重组载体基因组的异源核酸中表达。The use of parvoviruses as vaccines is well known in the art (US5916563, US5905040, US5882652). The antigen may be present in the viral capsid, or the antigen may be expressed from heterologous nucleic acid introduced into the recombinant vector genome.

免疫原性多肽或免疫原可以是任何适用于保护受试者免受疾病的多肽，包括但不限于微生物、细菌、原生动物、寄生虫、真菌和病毒性疾病。免疫原可以是原粘液病毒免疫原，例如，流感病毒免疫原，流感病毒血凝素表面蛋白或流感病毒核蛋白基因；或慢病毒免疫原，例如，马传染性贫血病毒免疫原、猿免疫缺陷病毒免疫原或人类免疫缺陷病毒免疫原；或沙粒病毒免疫原，例如，拉沙热病毒免疫原；或痘病毒免疫原、黄病毒免疫原、丝状病毒免疫原、布尼亚病毒免疫原、冠状病毒免疫原或严重急性呼吸综合征免疫原。免疫原还可以是脊髓灰质炎免疫原、疱疹免疫原、腮腺炎免疫原、麻疹免疫原、风疹免疫原、白喉毒素或其它白喉免疫原、百日咳抗原、肝炎免疫原或本领域已知的任何其它疫苗免疫原。An immunogenic polypeptide or immunogen can be any polypeptide suitable for protecting a subject from disease, including but not limited to microbial, bacterial, protozoan, parasitic, fungal, and viral diseases. The immunogen can be a native myxovirus immunogen, e.g., influenza virus immunogen, influenza virus hemagglutinin surface protein, or influenza virus nucleoprotein gene; or a lentiviral immunogen, e.g., equine infectious anemia virus immunogen, simian immunodeficiency Viral immunogen or human immunodeficiency virus immunogen; or arenavirus immunogen, e.g., Lassa fever virus immunogen; or poxvirus immunogen, flavivirus immunogen, filovirus immunogen, bunyavirus immunogen , coronavirus immunogen or severe acute respiratory syndrome immunogen. The immunogen can also be a polio immunogen, a herpes immunogen, a mumps immunogen, a measles immunogen, a rubella immunogen, a diphtheria toxin or other diphtheria immunogen, a pertussis antigen, a hepatitis immunogen, or any other immunogen known in the art. Vaccine immunogen.

或者，免疫原可以是任何肿瘤或癌细胞抗原。任选地，肿瘤或癌抗原表达于癌细胞表面。示例性癌症和肿瘤抗原包括但不限于：BRCA1基因产物、BRCA2基因产物、GP100、酪氨酸酶、GAGE-1/2、RAGE、NY-ESO-1、CDK-4、3-连环蛋白、MUM-1、caspase-8、HPVE、SART-1、PRAME、p15、黑色素瘤肿瘤抗原、HER-2/neu基因产物、雌激素受体、乳脂球蛋白、p53肿瘤抑制蛋白、粘蛋白抗原、端粒酶、核基质蛋白、前列腺酸磷酸酶、乳头瘤病毒抗原、以及与下列癌症包括黑色素瘤、腺癌、胸腺瘤、肉瘤、肺癌、肝癌、结直肠癌、非霍奇金淋巴瘤、霍奇金淋巴瘤、白血病、子宫癌、乳腺癌、前列腺癌、卵巢癌、宫颈癌、膀胱癌、肾癌、胰腺癌、脑癌、肾癌、胃癌、食管癌和头颈癌等相关的抗原。Alternatively, the immunogen can be any tumor or cancer cell antigen. Optionally, tumor or cancer antigens are expressed on the surface of cancer cells. Exemplary cancer and tumor antigens include, but are not limited to: BRCA1 gene product, BRCA2 gene product, GP100, tyrosinase, GAGE-1/2, RAGE, NY-ESO-1, CDK-4, 3-catenin, MUM -1, caspase-8, HPVE, SART-1, PRAME, p15, melanoma tumor antigen, HER-2/neu gene product, estrogen receptor, lactoglobulin, p53 tumor suppressor protein, mucin antigen, telomere Enzyme, nuclear matrix protein, prostatic acid phosphatase, papillomavirus antigen, and the following cancers including melanoma, adenocarcinoma, thymoma, sarcoma, lung cancer, liver cancer, colorectal cancer, non-Hodgkin lymphoma, Hodgkin Antigens related to lymphoma, leukemia, uterine cancer, breast cancer, prostate cancer, ovarian cancer, cervical cancer, bladder cancer, kidney cancer, pancreatic cancer, brain cancer, kidney cancer, gastric cancer, esophageal cancer and head and neck cancer.

或者，异源核苷酸序列可以编码在体外或体内细胞中产生的任何多肽。例如，可将病毒载体引入培养细胞，并从中分离出表达的蛋白产物。Alternatively, the heterologous nucleotide sequence may encode any polypeptide produced in a cell in vitro or in vivo. For example, viral vectors can be introduced into cultured cells and the expressed protein product can be isolated therefrom.

本领域的技术人员理解，感兴趣的异源核酸可与适当的控制序列进行可操作性连接。例如，异源核酸可能与转录翻译控制信号、复制起源、多聚腺苷酸化信号、内核糖体进入位点、启动子、增强子等表达控制元件相连接。Those skilled in the art understand that a heterologous nucleic acid of interest may be operably linked to appropriate control sequences. For example, a heterologous nucleic acid may be linked to transcriptional translational control signals, replication origins, polyadenylation signals, endosomal entry sites, promoters, enhancers, and other expression control elements.

本领域的技术人员将进一步理解，根据所需的表达水平和组织特异性表达，可以使用各种启动子/增强子元件。启动子/增强子可能是组成型或诱导型，这取决于所需的表达模式。启动子/增强子可以是天然的或外来的，也可以是自然的或合成的序列。Those skilled in the art will further appreciate that various promoter/enhancer elements may be used depending on the desired level of expression and tissue-specific expression. Promoters/enhancers may be constitutive or inducible, depending on the desired expression pattern. Promoters/enhancers can be native or foreign, and can be natural or synthetic sequences.

启动子/增强子元件可以是靶细胞或受试者固有的，也可以是异源核酸序列固有的。通常选择启动子/增强子元件，使其在感兴趣的目标细胞中发挥作用。在代表性的实施例中，启动子/增强子元件是哺乳动物启动子/增强子元件，启动子/增强元件可以是组成型或可诱导型。The promoter/enhancer element can be native to the target cell or subject, or it can be native to the heterologous nucleic acid sequence. Promoter/enhancer elements are generally chosen such that they function in the target cell of interest. In representative embodiments, the promoter/enhancer element is a mammalian promoter/enhancer element, which can be constitutive or inducible.

诱导表达控制元件通常用于那些需要调节异源核酸序列过度表达的应用中。用于基因传递的可诱导启动子/增强子元件可以是组织特异性或组织优选的启动子/增强子元件，并且包括肌肉特异性或优选、神经组织特异性或优选、眼睛(包括视网膜特异性和角膜性)特异性或优选、肝脏特异性或优选、骨髓特异性或优选、胰腺特异性或优选、脾脏特异性或优选、肺特异性或优选。其它可诱导的启动子/增强子元件包括激素诱导和金属诱导元件。示例性诱导启动子/增强子元件包括但不限于Tet开/关元件、RU486诱导启动子、雷帕霉素诱导启动子和金属硫蛋白启动子。Inducible expression control elements are commonly used in those applications requiring regulation of overexpression of heterologous nucleic acid sequences. Inducible promoter/enhancer elements for gene delivery can be tissue-specific or tissue-preferred promoter/enhancer elements, and include muscle-specific or preferred, neural tissue-specific or preferred, eye (including retina-specific) and corneal) specific or preferred, liver specific or preferred, bone marrow specific or preferred, pancreas specific or preferred, spleen specific or preferred, lung specific or preferred. Other inducible promoter/enhancer elements include hormone-inducible and metal-inducible elements. Exemplary inducible promoter/enhancer elements include, but are not limited to, Tet on/off elements, RU486 inducible promoters, rapamycin inducible promoters, and metallothionein promoters.

在将异源核酸序列转录并在目标细胞中翻译的实施例中，通常使用特定的起始信号来有效翻译插入的蛋白质编码序列。这些外源翻译控制序列，可能包括ATG起始密码子和相邻序列，可以起始的多种形式，包括天然的和合成的。In embodiments where a heterologous nucleic acid sequence is transcribed and translated in a target cell, specific initiation signals are typically used to efficiently translate the inserted protein coding sequence. These exogenous translational control sequences, possibly including the ATG initiation codon and adjacent sequences, can be initiated in a variety of forms, both natural and synthetic.

本发明提供包含一种嵌合AAV衣壳和一种AAV基因组的嵌合AAV粒子。本发明还提供了所述嵌合AAV颗粒的集合或库，其中所述集合或库包含2个或更多、10个或更多、50个或更多、10²个或更多、10³个或更多、10⁴或更多、10⁵或更多、或10⁶或更多不同的序列。The present invention provides chimeric AAV particles comprising a chimeric AAV capsid and an AAV genome. The present invention also provides a collection or library of said chimeric AAV particles, wherein said collection or library comprises ² or more, 10 or more, 50 or more, 102 or more, ¹⁰³ or more, 10 ⁴ or more, 10 ⁵ or more, or 10 ⁶ or more different sequences.

本发明还包括“空”衣壳粒子，其包含、组成或基本上由本发明的嵌合AAV衣壳蛋白组成，参见美国专利US5863541所述。本发明的嵌合AAV衣壳可用作“衣壳运载体”，可被共价连接、结合或包装并转移到细胞中的分子包括DNA、RNA、脂质、碳水化合物、多肽、小有机分子或这些分子的组合。此外，分子可以与病毒衣壳的外部相关联，以便将分子转移到宿主的靶细胞中。在本发明的一个实施例中，分子与衣壳蛋白共价连接。共价连接分子的方法已为本领域的技术人员所熟知。The present invention also includes "empty" capsid particles comprising, consisting of, or consisting essentially of the chimeric AAV capsid protein of the present invention, as described in US Pat. No. 5,863,541. The chimeric AAV capsids of the present invention can be used as "capsid vehicles", molecules that can be covalently linked, bound or packaged and transferred into cells include DNA, RNA, lipids, carbohydrates, polypeptides, small organic molecules or combinations of these molecules. In addition, molecules can be associated with the exterior of the viral capsid in order to transfer the molecule into target cells of the host. In one embodiment of the invention, the molecule is covalently linked to the capsid protein. Methods of covalently linking molecules are well known to those skilled in the art.

本发明的病毒衣壳还可用于提高针对新的衣壳结构的抗体。作为另一种选择，可以将外源氨基酸序列插入病毒衣壳中，以便将抗原呈递给细胞，例如给受试者施用以产生对外源氨基酸序列的免疫反应。The viral capsids of the invention can also be used to raise antibodies against novel capsid structures. Alternatively, exogenous amino acid sequences can be inserted into viral capsids for presentation of antigens to cells, eg, administered to a subject to generate an immune response to the exogenous amino acid sequences.

本发明还提供了编码本发明的嵌合衣壳蛋白的核酸。进一步提供包含所述核酸的载体和包含本发明核酸和/或载体的细胞。例如，所述核酸、载体和细胞可用做产生本文所述病毒载体的试剂。The invention also provides nucleic acids encoding the chimeric capsid proteins of the invention. Further provided are vectors comprising said nucleic acids and cells comprising nucleic acids and/or vectors of the invention. For example, the nucleic acids, vectors, and cells are useful as reagents for producing the viral vectors described herein.

在示例性实施方案中，本发明提供了编码图3B、3D、3F、3H、3J、3L、3N、3P、或3R所示的AAV衣壳的核酸序列。代表性的核酸序列包含或基本上包含分别由图3A(L1)图3C(L4)、图3E(L10)、图3G(L52)、图3I(L58)、图3K(L84)、图3M(L37)、图3O(L107)或图3Q(L57)所示的核苷酸序列，或由这些图所示的核苷酸序列组成，或编码由上述任何一种核苷酸序列编码的AAV衣壳或衣壳蛋白的核苷酸序列，但由于密码子的简并性而与上述核苷酸序列不同，其允许不同的核酸序列编码相同的AAV衣壳。In exemplary embodiments, the present invention provides a nucleic acid sequence encoding an AAV capsid as shown in Figure 3B, 3D, 3F, 3H, 3J, 3L, 3N, 3P, or 3R. The representative nucleic acid sequence comprises or comprises substantially respectively by Fig. 3A (L1) Fig. 3C (L4), Fig. 3E (L10), Fig. 3G (L52), Fig. 3I (L58), Fig. 3K (L84), Fig. 3M ( L37), the nucleotide sequence shown in Figure 3O (L107) or Figure 3Q (L57), or consists of the nucleotide sequence shown in these figures, or encodes an AAV coat encoded by any of the above-mentioned nucleotide sequences The nucleotide sequence of the shell or capsid protein, but differs from the above nucleotide sequences due to codon degeneracy, which allows different nucleic acid sequences to encode the same AAV capsid.

本发明还提供编码上述AAV衣壳蛋白的变体和融合蛋白的核酸。在具体实施例中，在本领域技术人员已知的标准条件下，核酸与本文特别公开的核酸序列的互补链杂交，编码变体衣壳蛋白。本文特别公开的核酸序列参见图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q，任选地，所述变体衣壳蛋白实质上保留由图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q所示的核酸序列编码的衣壳蛋白的至少一种性质。例如，变体衣壳蛋白的病毒粒子可以基本上保留包含由如图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O、图3Q所示的核酸编码序列编码衣壳蛋白的病毒粒子的靶向性特征。这种序列的杂交可以在弱、中甚至严格的条件下进行(Sambrook等，1989，Molecular Cloning,A Laboratory Manual 2dEd，Cold Spring Harbor Laboratory)。The present invention also provides nucleic acids encoding variants and fusion proteins of the above-mentioned AAV capsid proteins. In specific embodiments, the nucleic acid encodes a variant capsid protein by hybridizing to the complement of a nucleic acid sequence specifically disclosed herein under standard conditions known to those skilled in the art. See Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, Figure 3M, Figure 3O or Figure 3Q for the nucleic acid sequences disclosed herein. At least one property of the capsid protein encoded by the nucleic acid sequence shown in Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, Figure 3M, Figure 3O or Figure 3Q. For example, the virion of the variant capsid protein can substantially retain and comprise the nucleic acid coding sequence coding shown in Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, Figure 3M, Figure 3O, Figure 3Q. Virion targeting characteristics of capsid proteins. Hybridization of such sequences can be performed under weak, moderate or even stringent conditions (Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual 2dEd, Cold Spring Harbor Laboratory).

如本领域所知，可使用许多不同的程序来确定核酸或多肽是否与已知序列具有一致性或相似性百分比。本文所用的百分比标识是指当使用BLASTN与其它核酸比对时，核酸或其片段与另一核酸具有指定的百分比，其中BLASTN程序可通过互联网从美国国家生物技术信息中心(NCBI)获取。As is known in the art, a number of different programs can be used to determine whether a nucleic acid or polypeptide has a percent identity or similarity to a known sequence. A percentage designation as used herein refers to a nucleic acid or fragment thereof having a specified percentage to another nucleic acid when aligned to another nucleic acid using BLASTN, the program available on the Internet from the National Center for Biotechnology Information (NCBI).

当提及多肽时，百分比同一性或相似性表明，当与另一种蛋白质或其一部分在使用BLASTP测定的共同长度上进行比较时，所述多肽表现出特定百分比的同一性或相似性。这也可通过互联网从美国国家生物技术信息中心(NCBI)获取。多肽的一致性或相似性百分比通常用序列分析软件，例如，参见威斯康星大学生物技术中心遗传学计算机组的序列分析软件包。蛋白质分析软件使用各种替换、缺失和其它修饰的同源性来匹配类似的序列。保守的替换通常包括以下几类中的替换：甘氨酸、丙氨酸；缬氨酸、异亮氨酸、亮氨酸；天冬氨酸、谷氨酸；天冬酰胺、谷氨酰胺；丝氨酸、苏氨酸；赖氨酸、精氨酸；苯丙氨酸、酪氨酸。When referring to polypeptides, percent identity or similarity indicates that the polypeptide exhibits the specified percent identity or similarity when compared to another protein or portion thereof over a common length as determined using BLASTP. This is also available via the Internet from the National Center for Biotechnology Information (NCBI). The percent identity or similarity of polypeptides is typically determined using sequence analysis software, eg, see the Sequence Analysis Package of the Genetics Computer Group at the University of Wisconsin Biotechnology Center. Protein analysis software uses the homology of various substitutions, deletions, and other modifications to match similar sequences. Conservative substitutions typically include substitutions in the following categories: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; Threonine; Lysine, Arginine; Phenylalanine, Tyrosine.

在特定实施方案中，核酸可以包含、组成或基本上包含但不限于质粒、噬菌体、病毒载体、细菌人工染色体或酵母人工染色体等载体。病毒载体包含但不限于腺相关病毒载体、腺病毒载体、疱疹病毒载体、杆状病毒载体或杂合病毒载体。In certain embodiments, a nucleic acid may comprise, consist of, or consist essentially of, but is not limited to, vectors such as plasmids, bacteriophages, viral vectors, bacterial artificial chromosomes, or yeast artificial chromosomes. Viral vectors include, but are not limited to, adeno-associated viral vectors, adenoviral vectors, herpes viral vectors, baculoviral vectors, or hybrid viral vectors.

在一些实施例中，编码嵌合AAV衣壳蛋白的核酸进一步包括AAV Rep蛋白编码序列。In some embodiments, the nucleic acid encoding the chimeric AAV capsid protein further includes an AAV Rep protein encoding sequence.

本发明还提供稳定地包含本发明核酸的细胞。例如，核酸可以稳定地转入细胞的基因组中，或者可以稳定地维持在附加体形式，例如，“基于EBV的核附加体”。The invention also provides cells stably comprising a nucleic acid of the invention. For example, a nucleic acid can be stably transferred into the genome of a cell, or can be stably maintained in an episomal form, eg, an "EBV-based nuclear episome".

核酸可插入递送载体中，例如，病毒递送载体。例如，本发明的核酸可以封装在AAV粒子、腺病毒粒子、疱疹病毒粒子、杆状病毒粒子或任何其它合适的病毒粒子中。Nucleic acids can be inserted into delivery vehicles, eg, viral delivery vehicles. For example, nucleic acids of the invention may be encapsulated in AAV particles, adenovirus particles, herpes virus particles, baculovirus particles, or any other suitable virus particles.

此外，所述核酸可以与启动子元件可操作地连接。Additionally, the nucleic acid may be operably linked to a promoter element.

本发明还提供了一种生产本发明病毒载体的方法。在一个有代表性的实施例中，本发明提供了一种生产重组病毒载体的方法，该方法包括在体外向细胞提供，包括异源核酸以及足以将AAV模板包装成病毒粒子的信号序列，例如，AAV末端重复序列；还包括足以将模板复制和包装成病毒粒子的AAV序列，例如AAV Rep序列，和编码本发明的AAV衣壳的序列。该方法还进一步包括从细胞中收集病毒粒子的步骤，病毒颗粒可以从培养基和/或裂解细胞中收集。The present invention also provides a method for producing the viral vector of the present invention. In an exemplary embodiment, the invention provides a method of producing a recombinant viral vector comprising providing, in vitro to a cell, a signal sequence comprising a heterologous nucleic acid and sufficient to package an AAV template into a virion, e.g. , AAV terminal repeat sequence; also includes AAV sequences sufficient for template replication and packaging into virions, such as AAV Rep sequences, and sequences encoding AAV capsids of the present invention. The method further comprises the step of collecting the virus particles from the cells, the virus particles can be collected from the culture medium and/or lysed cells.

在一个说明性的实施方案中，本发明提供了一种制备包含AAV衣壳的重组AAV颗粒的方法，所述方法包括：向体外细胞提供编码本发明的嵌合AAV衣壳的核酸、AAV Rep编码序列、包含异源核酸的AAV载体基因组，和用于生产感染性AAV的辅助因子，允许AAV载体基因组包封于AAV衣壳中并完成AAV颗粒的组装。In an illustrative embodiment, the invention provides a method of producing a recombinant AAV particle comprising an AAV capsid, the method comprising: providing to a cell in vitro a nucleic acid encoding a chimeric AAV capsid of the invention, an AAV Rep The coding sequence, AAV vector genome comprising heterologous nucleic acid, and cofactors for production of infectious AAV allow encapsulation of the AAV vector genome into an AAV capsid and complete assembly of the AAV particle.

细胞通常是允许AAV病毒复制的细胞。可使用本领域已知的任何适宜细胞，包括但不限于HEK293细胞系、HEK293T细胞系、HEK293A细胞系、HEK293S细胞系、HEK293FT细胞系、HEK293F细胞系、HEK293H细胞系、HeLa细胞系、SF9细胞系、SF21细胞系、SF900细胞系、BHK细胞系中的一种或几种。The cells are typically cells that allow replication of the AAV virus. Any suitable cell known in the art may be used, including but not limited to HEK293 cell line, HEK293T cell line, HEK293A cell line, HEK293S cell line, HEK293FT cell line, HEK293F cell line, HEK293H cell line, HeLa cell line, SF9 cell line , SF21 cell line, SF900 cell line, BHK cell line or one or more.

AAV复制和衣壳序列可以通过本领域已知的任何方法提供。目前的方法通常在单个质粒上表达AAV rep和cap基因。AAV复制和包装序列不需要一起提供。AAV rep和/或cap基因序列可由任何病毒或非病毒载体提供。例如，rep和/或cap基因序列可由杂交腺病毒或疱疹病毒载体提供。EBV载体也可用于表达AAVcap和/或rep基因序列。作为另一种选择，rep和/或cap基因序列可在细胞内稳定存在，处于游离或整合状态。AAV replication and capsid sequences can be provided by any method known in the art. Current methods typically express the AAV rep and cap genes on a single plasmid. AAV replication and packaging sequences need not be provided together. AAV rep and/or cap gene sequences can be provided by any viral or non-viral vector. For example, rep and/or cap gene sequences can be provided by hybrid adenoviral or herpes viral vectors. EBV vectors can also be used to express AAVcap and/or rep gene sequences. Alternatively, the rep and/or cap gene sequences can be stably present in the cell, either in an episomal or integrated state.

通常AAV rep和/或cap基因序列不会被AAV包装序列所包围，以防止对这些序列进行救援和/或包装。Typically AAV rep and/or cap gene sequences are not surrounded by AAV packaging sequences to prevent rescue and/or packaging of these sequences.

模板或载体基因组可以使用本领域已知的任何方法提供给细胞。模板或载体基因组可由非病毒载体或病毒载体提供。在具体实施例中，模板或载体基因组由疱疹病毒或腺病毒载体提供。杆状病毒载体、EBV载体也可用于传递模板或载体基因组。在另一个代表性实施例中，模板或载体基因组由复制的rAAV病毒提供。在其它实施例中，AAV原病毒稳定地整合到细胞的染色体中。Template or vector genomes can be provided to cells using any method known in the art. The template or vector genome can be provided by a non-viral vector or a viral vector. In specific embodiments, the template or vector genome is provided by a herpesvirus or adenovirus vector. Baculovirus vectors, EBV vectors can also be used to deliver templates or vector genomes. In another representative embodiment, the template or vector genome is provided by a replicating rAAV virus. In other embodiments, the AAV provirus is stably integrated into the chromosome of the cell.

为了获得最大的病毒滴度，通常向细胞提供辅助病毒，例如腺病毒或疱疹病毒，这些辅助病毒是产生感染性AAV所必需的。本领域已知的用于AAV复制所必需的辅助病毒序列，通常由辅助腺病毒或疱疹病毒载体提供。可选择地，腺病毒或疱疹病毒序列可由另一非病毒或病毒载体提供(Ferrari等，1997，Nature Med.3:1295)。此外，辅助病毒功能可由辅助基因整合在包装细胞的染色体中来提供或作为稳定的染色体外元件来维持。To achieve maximum viral titers, cells are often provided with helper viruses, such as adenoviruses or herpesviruses, which are required for the production of infectious AAV. The necessary helper viral sequences known in the art for AAV replication are typically provided by helper adenoviral or herpes viral vectors. Alternatively, the adenoviral or herpes viral sequences can be provided by another non-viral or viral vector (Ferrari et al., 1997, Nature Med. 3:1295). In addition, helper viral function can be provided by integration of the helper gene in the chromosome of the packaging cell or maintained as a stable extrachromosomal element.

本领域技术人员理解，在单个辅助构建体上提供AAV复制和衣壳序列以及辅助病毒序列可能是有利的。该辅助构建体可以是非病毒或病毒构建体，也可以选择是包含AAVrep/cap基因序列的杂交腺病毒或杂交疱疹病毒。Those skilled in the art appreciate that it may be advantageous to provide AAV replication and capsid sequences as well as helper virus sequences on a single helper construct. The auxiliary construct can be a non-viral or viral construct, and can also optionally be a hybrid adenovirus or a hybrid herpes virus containing the AAVrep/cap gene sequence.

在一个具体实施例中，AAV rep和/或cap基因序列和腺病毒辅助序列由单个腺病毒辅助载体提供。这个载体进一步包含rAAV基因组模板。可将AAV rep和/或cap序列和/或rAAV模板插入腺病毒的删除区域，例如，Ela或E3区域。In a specific embodiment, the AAV rep and/or cap gene sequences and the adenoviral helper sequences are provided by a single adenoviral helper vector. This vector further contains the rAAV genome template. AAV rep and/or cap sequences and/or rAAV templates can be inserted into deleted regions of the adenovirus, eg, the Ela or E3 regions.

在另一个实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒辅助载体提供。rAAV基因组模板由质粒提供。在另一个说明性实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒辅助载体提供，并且rAAV基因组模板作为前体整合到细胞中。或者，rAAV模板由作为染色体外元件维持在细胞内的EBV载体提供。在另一个示例性实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒载体提供。rAAV基因组模板作为单独的复制病毒载体提供。例如，rAAV模板可以由rAAV粒子或第二种重组腺病毒粒子提供。In another embodiment, the AAV rep and/or cap sequences and the adenoviral helper sequences are provided by a single adenoviral helper vector. The rAAV genome template is provided by a plasmid. In another illustrative embodiment, the AAV rep and/or cap sequences and adenoviral helper sequences are provided by a single adenoviral helper vector, and the rAAV genomic template is integrated into the cell as a precursor. Alternatively, the rAAV template is provided by an EBV vector maintained inside the cell as an extrachromosomal element. In another exemplary embodiment, the AAV rep and/or cap sequences and the adenoviral helper sequences are provided by a single adenoviral vector. The rAAV genome template is provided as a separate replicating viral vector. For example, the rAAV template can be provided by an rAAV particle or a second recombinant adenoviral particle.

疱疹病毒也可用作AAV包装方法中的辅助病毒。Herpes viruses can also be used as helper viruses in AAV packaging methods.

作为另一种替代方法，本发明的病毒载体可在昆虫细胞中使用杆状病毒载体来传递rep和/或cap基因和rAAV模板(Urabe等，2002，Human Gene Therapy 13:1935-1943)。As another alternative, the viral vectors of the present invention can use baculovirus vectors to deliver rep and/or cap genes and rAAV templates in insect cells (Urabe et al., 2002, Human Gene Therapy 13:1935-1943).

生产AAV的其它方法也可以使用稳定转化的包装细胞(参见美国专US5658785)。Other methods of producing AAV may also use stably transformed packaging cells (see US Pat. No. 5,658,785).

无辅助病毒污染的AAV载体可通过本领域已知的任何方法获得。例如，AAV和辅助病毒可以很容易地根据大小进行区分。也可根据对肝素底物的亲和力将AAV与辅助病毒分离。在代表性实施例中，使用复制缺陷辅助病毒，使得任何污染的辅助病毒都不能复制。作为另一种选择，可以使用缺乏晚期基因表达的辅助腺病毒，因为只有腺病毒早期基因表达介导AAV病毒的包装。本领域已知对晚期基因表达有缺陷的腺病毒突变体，例如，TS100K和TS149腺病毒突变体。AAV vectors free of helper virus contamination can be obtained by any method known in the art. For example, AAV and helper viruses can be easily distinguished based on size. AAV can also be separated from the helper virus based on its affinity for the heparin substrate. In representative embodiments, a replication deficient helper virus is used such that any contaminating helper virus cannot replicate. Alternatively, a helper adenovirus lacking late gene expression can be used, since only adenoviral early gene expression mediates packaging of AAV virus. Adenovirus mutants that are defective for late gene expression are known in the art, eg, TS100K and TS149 adenovirus mutants.

本发明的包装方法可用于生产高滴度病毒粒子。在具体实施例中，病毒储备液的滴度至少为约10⁵Tu/ml、至少约10⁶Tu/ml、至少约10⁷Tu/ml、至少约10⁸Tu/ml、至少约10⁹Tu/ml、至少约10¹⁰Tu/ml。The packaging method of the present invention can be used to produce high titer virus particles. In specific embodiments, the viral stock has a titer of at least about 10 ⁵ Tu/ml, at least about 10 ⁶ Tu/ml, at least about 10 ⁷ Tu/ml, at least about 10 ⁸ Tu/ml, at least about 10 ⁹ Tu /ml, at least about 10 ¹⁰ Tu/ml.

新型的衣壳蛋白和衣壳结构可用于产生抗体，例如用于诊断或治疗用途或用做研究试剂。因此，本发明还提供了针对本发明的新型衣壳蛋白的抗体。Novel capsid proteins and capsid structures can be used to generate antibodies, for example, for diagnostic or therapeutic applications or as research reagents. Accordingly, the present invention also provides antibodies against the novel capsid proteins of the present invention.

本文所用术语“抗体”或“抗体片段”是指所有类型的免疫球蛋白，包括IgG、IgM、IgA、IgD和IgE。抗体可为单克隆或多克隆，且可为来源于任何物种，包括小鼠、大鼠、兔、马、山羊、绵羊、鸡、猴、羊驼或人类，或可为嵌合抗体、人源化抗体、人抗体。抗体可以是重组单克隆抗体、或从噬菌体库、酵母库、哺乳细胞展示库中筛选。The term "antibody" or "antibody fragment" as used herein refers to all classes of immunoglobulins, including IgG, IgM, IgA, IgD and IgE. Antibodies can be monoclonal or polyclonal, and can be of any species origin, including mouse, rat, rabbit, horse, goat, sheep, chicken, monkey, alpaca, or human, or can be chimeric, human Antibodies, human antibodies. Antibodies can be recombinant monoclonal antibodies, or screened from phage libraries, yeast libraries, and mammalian cell display libraries.

包括在本发明范围内的抗体片段包括Fab、F(ab’)₂和Fc片段，以及从除IgG以外的抗体获得的相应片段。这种片段可以通过已知的技术产生。例如，F(ab’)₂片段可由抗体分子通过胃蛋白酶消化产生，Fab片段可通过减少F(ab’)₂片段的二硫键生成。或者，可以构建Fab表达库，以便快速、容易地识别具有所需特异性的单克隆Fab片段。Antibody fragments included within the scope of the present invention include Fab, F(ab') ₂ and Fc fragments, as well as corresponding fragments obtained from antibodies other than IgG. Such fragments can be produced by known techniques. For example, F(ab') ₂ fragments can be produced from antibody molecules by pepsin digestion, and Fab fragments can be generated by reducing the disulfide bonds of F(ab') ₂ fragments. Alternatively, Fab expression libraries can be constructed to quickly and easily identify monoclonal Fab fragments with the desired specificity.

多克隆抗体可以通过病毒免疫合适的动物，例如兔、山羊等，从动物收集免疫血清，并从免疫血清中分离所得。Polyclonal antibodies can be immunized with viruses to suitable animals, such as rabbits, goats, etc., and the immune serum is collected from the animals and isolated from the immune serum.

本发明还包括将异源核苷酸序列传递至广泛细胞范围的方法，包括分裂细胞和非分裂细胞。本发明的病毒载体可用于向体外细胞递送感兴趣的核苷酸序列，例如，在体外产生多肽或用于体外基因治疗。载体还可用于将核苷酸序列传递给需要个体的方法中，例如，表达免疫原性或治疗性多肽。The invention also includes methods of delivering heterologous nucleotide sequences to a wide range of cells, including dividing and non-dividing cells. The viral vectors of the present invention can be used to deliver nucleotide sequences of interest to cells in vitro, for example, to produce polypeptides in vitro or for gene therapy in vitro. Vectors can also be used in methods of delivering a nucleotide sequence to an individual in need thereof, eg, to express an immunogenic or therapeutic polypeptide.

一般而言，本发明病毒载体可用于递送具有生物效应的任何外源核酸以治疗或改善与基因表达相关的任何疾病。此外，本发明可用于治疗递送治疗性多肽可改善的任何疾病。示例性疾病症状包括但不限于：囊性纤维化(囊性纤维化跨膜调节蛋白)和肺部其它疾病、血友病A(因子VIII)、血友病B(因子IX)、地中海贫血(β-珠蛋白)、贫血(促红细胞生成素)和其它血液疾病、老年痴呆症(GDF)、多发性硬化症(β-干扰素)、帕金森氏病(神经胶质细胞源性神经营养因子)、亨廷顿氏病(消除重复的抑制性RNA，包括但不限于RNAi，如sirna或shrna、反义RNA或microRNA)、肌萎缩侧索硬化症、癫痫(甘丙肽、神经营养因子)和其它神经疾病、癌症(内皮抑素、血管抑素、TRAIL、FAS配体、包括干扰素的细胞因子、抑制VEGF的抑制性RNA包括但不限于siRNA、shRNA、反义RNA、microRNA、多重耐药基因产物、癌症免疫原)、糖尿病(胰岛素、PGC-α1、GLP-1、肌抑素前体肽、葡萄糖转运蛋白)、肌营养不良包括杜氏肌营养不良和贝克肌营养不良(例如肌萎缩蛋白、mini-萎缩蛋白、micro-肌萎缩蛋白、胰岛素样生长因子等)、糖原贮积病例如Fabry病(α-半乳糖苷酶)和Pompe病(溶酶体酸α-葡萄糖苷酶)、先天性肺气肿(抗胰蛋白酶)、Lesch-Nyhan综合征(次黄嘌呤鸟嘌呤磷酸核糖基转移酶)、Niemann-Pick病(神经鞘磷脂酶)、视网膜退行性疾病以及眼睛和视网膜的其它疾病(治疗黄斑变性的PDGF、内皮抑素和/或血管抑素)、星形细胞瘤(内皮抑素、血管抑素和/或抑制VEGF的RNAi)、胶质母细胞瘤(内皮生长因子、血管内皮生长因子和/或RNAi抗血管内皮生长因子)、肝脏(用于乙肝和/或丙肝基因的RNAi，例如siRNA或shRNA、microRNA或反义RNA)、充血性心力衰竭或外周动脉疾病(磷酸酶蛋白抑制剂I、磷脂酶、肌浆内网Ca2-ATPase、调节磷脂酶基因的锌指蛋白、磷脂酶抑制剂等)、关节炎(胰岛素样生长因子)、艾滋病(可溶性CD4)、肌肉萎缩(胰岛素样生长因子I、肌生长抑制素前肽、抗凋亡因子等)、肢体缺血(VEGF、FGF、PGC-Iα、EC-SOD、HIF)、肾虚(促红细胞生成素)、关节炎(IRAP和TNFα等可溶性受体等抗炎因子)、肝炎(α-干扰素)、低密度脂蛋白受体缺乏(LDL受体)、高氨血症(鸟氨酸转氨酶)、苯丙酮尿症(苯丙氨酸羟化酶)、自身免疫性疾病等。In general, the viral vectors of the present invention can be used to deliver any foreign nucleic acid with biological effects to treat or improve any disease related to gene expression. In addition, the present invention can be used to treat any disease that can be ameliorated by delivery of a therapeutic polypeptide. Exemplary disease symptoms include, but are not limited to: cystic fibrosis (cystic fibrosis transmembrane regulatory protein) and other diseases of the lung, hemophilia A (factor VIII), hemophilia B (factor IX), thalassemia ( β-globin), anemia (erythropoietin) and other blood disorders, Alzheimer’s disease (GDF), multiple sclerosis (β-interferon), Parkinson’s disease (glial cell-derived neurotrophic factor ), Huntington's disease (elimination of repetitive inhibitory RNA, including but not limited to RNAi, such as siRNA or shRNA, antisense RNA or microRNA), ALS, epilepsy (galanin, neurotrophic factor) and others Neurological diseases, cancer (endostatin, angiostatin, TRAIL, FAS ligand, cytokines including interferon, inhibitory RNA that inhibits VEGF including but not limited to siRNA, shRNA, antisense RNA, microRNA, multidrug resistance gene products, cancer immunogens), diabetes (insulin, PGC-α1, GLP-1, myostatin pro-peptide, glucose transporter), muscular dystrophies including Duchenne muscular dystrophy and Becker muscular dystrophy (e.g. dystrophin, mini-dystrophin, micro-dystrophin, insulin-like growth factor, etc.), glycogen storage diseases such as Fabry disease (α-galactosidase) and Pompe disease (lysosomal acid α-glucosidase), congenital emphysema (antitrypsin), Lesch-Nyhan syndrome (hypoxanthine-guanine phosphoribosyltransferase), Niemann-Pick disease (sphingomyelinase), retinal degenerative diseases, and other diseases of the eye and retina (PDGF, endostatin, and/or angiostatin for macular degeneration), astrocytoma (endostatin, angiostatin, and/or RNAi that inhibits VEGF), glioblastoma (endothelial growth factor, angiostatin endothelial growth factor and/or RNAi anti-vascular endothelial growth factor), liver (RNAi for hepatitis B and/or hepatitis C genes, such as siRNA or shRNA, microRNA or antisense RNA), congestive heart failure or peripheral arterial disease (phosphatase protein inhibitor I, phospholipase, intrasarcoplasmic reticulum Ca2-ATPase, zinc finger protein regulating phospholipase gene, phospholipase inhibitor, etc.), arthritis (insulin-like growth factor), AIDS (soluble CD4), muscle atrophy ( Insulin-like growth factor I, myostatin propeptide, anti-apoptotic factors, etc.), limb ischemia (VEGF, FGF, PGC-Iα, EC-SOD, HIF), kidney deficiency (erythropoietin), arthritis ( Anti-inflammatory factors such as soluble receptors such as IRAP and TNFα), hepatitis (α-interferon), low-density lipoprotein receptor deficiency (LDL receptor), hyperammonemia (ornithine aminotransferase), phenylketonuria ( phenylalanine hydroxylase), autoimmune diseases, etc.

本发明还可在器官移植后用于增加移植的成功率和/或减少器官移植或辅助疗法的副作用，例如，通过给予免疫抑制剂或抑制性核酸来阻断细胞因子产物。The invention may also be used after organ transplantation to increase the success rate of transplantation and/or reduce the side effects of organ transplantation or adjuvant therapy, for example, by administering immunosuppressants or inhibitory nucleic acids to block cytokine production.

基因转移在认识和治疗疾病方面有着巨大的潜在用途。有许多遗传性疾病中有缺陷的基因是已知的，并且已经被克隆。一般来说，上述疾病状态分为两类：第一为缺陷状态，通常为酶缺陷，通常以隐性方式遗传；第二为不平衡状态，可能涉及调节蛋白或结构蛋白，通常以显性方式遗传。对于缺陷状态疾病，基因转移可将正常基因带到受影响的组织中进行替代治疗，以及使用抑制性RNA为疾病创建动物模型，抑制性RNA包括siRNA或shRNA、microRNA或反义RNA。对于不平衡的疾病状态，基因转移可用于在模型系统中创建疾病状态，然后可用于治疗疾病状态。因此，根据本发明的病毒载体允许治疗遗传疾病。如本文所用，通过部分或全部补救疾病或使疾病更严重的缺陷或失衡来治疗疾病状态。Gene transfer has enormous potential utility in understanding and treating disease. There are many genetic disorders in which defective genes are known and have been cloned. In general, the above disease states fall into two categories: first, deficient states, usually enzyme deficient, usually inherited in a recessive manner; second, imbalanced states, which may involve regulatory or structural proteins, usually in a dominant manner genetic. For deficient state diseases, gene transfer can bring normal genes into affected tissues for replacement therapy, as well as create animal models of disease using inhibitory RNA, including siRNA or shRNA, microRNA or antisense RNA. For unbalanced disease states, gene transfer can be used to create the disease state in a model system, which can then be used to treat the disease state. Thus, the viral vectors according to the invention allow the treatment of genetic diseases. As used herein, a disease state is treated by partially or fully remediating the disease or a defect or imbalance that makes the disease worse.

此外，本发明的病毒载体在诊断和筛选方法中有进一步用途，其中感兴趣的基因在细胞培养系统或转基因动物模型中短暂或稳定地表达。本发明还可用于递送核酸以用于蛋白质生产，例如用于实验室、工业或商业目的。In addition, the viral vectors of the present invention have further utility in diagnostic and screening methods in which a gene of interest is transiently or stably expressed in cell culture systems or transgenic animal models. The invention can also be used to deliver nucleic acids for protein production, eg for laboratory, industrial or commercial purposes.

或者，可将病毒载体投予细胞并且将改变的细胞投予个体。将异源核酸引入细胞，并将细胞给予受试者，其中任选地表达编码免疫原的异源核酸并诱导受试者对免疫原的免疫反应。在具体实施例中，所述细胞是抗原呈递细胞，例如树突细胞。Alternatively, the viral vector can be administered to the cells and the altered cells administered to the individual. The heterologous nucleic acid is introduced into the cell, and the cell is administered to the subject, wherein the heterologous nucleic acid encoding the immunogen is optionally expressed and an immune response to the immunogen is induced in the subject. In specific embodiments, said cells are antigen presenting cells, such as dendritic cells.

“主动免疫反应”或“主动免疫”的特征是“与免疫原接触后宿主组织和细胞的参与”。它涉及到淋巴组织中免疫调节细胞的分化和增殖，从而导致抗体的合成或细胞介导的反应，或两者兼而有之。或者，宿主通过感染或接种疫苗暴露于免疫原后，会产生积极的免疫反应。主动免疫可与被动免疫形成对比，后者是通过“将预先形成的物质，例如抗体、转移因子、胸腺移植物、白细胞介素-2，从主动免疫宿主转移到非免疫宿主”获得的。An "active immune response" or "active immunity" is characterized by the "involvement of host tissues and cells following contact with the immunogen". It involves the differentiation and proliferation of immunoregulatory cells in lymphoid tissues, leading to antibody synthesis or cell-mediated responses, or both. Alternatively, the host mounts an aggressive immune response following exposure to the immunogen through infection or vaccination. Active immunity can be contrasted with passive immunity, which is obtained by "transferring preformed substances, such as antibodies, transfer factors, thymus grafts, interleukin-2, from an actively immunized host to a non-immune host".

本文所用的“保护性”免疫反应或“保护性”免疫表明，免疫反应给予受试者一些益处，因为它可以预防或降低疾病的发生率。或者，保护性免疫反应或保护性免疫可用于治疗疾病，特别是癌症或肿瘤，例如，引起癌症或肿瘤的消退和/或防止转移和/或防止转移结节的生长。只要治疗的益处大于缺点，保护作用可以是完全的或部分的。A "protective" immune response or "protective" immunity as used herein means that the immune response confers some benefit on the subject in that it prevents or reduces the incidence of disease. Alternatively, a protective immune response or protective immunity can be used to treat a disease, in particular a cancer or a tumor, for example, to cause regression of the cancer or tumor and/or to prevent metastasis and/or to prevent the growth of metastatic nodules. Protection can be complete or partial as long as the benefits of the treatment outweigh the disadvantages.

本发明病毒载体也可通过投予癌细胞抗原或免疫相似分子或任何其它免疫原来用于癌症免疫治疗，对癌细胞产生免疫反应。举例而言，在治疗癌症患者时，可通过投予包含编码癌细胞抗原异源核苷酸序列的病毒载体来产生针对个体中癌细胞抗原的免疫反应。如本文所述，病毒载体可在体外或通过活体外方法投予个体。The viral vectors of the present invention can also be used in cancer immunotherapy by administering cancer cell antigens or immunologically similar molecules or any other immunogen to generate an immune response to cancer cells. For example, in treating a cancer patient, an immune response against a cancer cell antigen in the individual can be generated by administering a viral vector comprising a heterologous nucleotide sequence encoding a cancer cell antigen. As described herein, viral vectors can be administered to a subject in vitro or by in vitro methods.

如本文所用，“癌症”一词包括形成肿瘤的癌症。同样，“癌组织”一词也包括肿瘤。“癌细胞抗原”包括肿瘤抗原。As used herein, the term "cancer" includes tumor-forming cancers. Likewise, the term "cancerous tissue" includes tumors. "Cancer cell antigen" includes tumor antigens.

术语“癌症”在本领域中具有其理解的含义，例如，具有扩散或转移到身体遥远部位的不受控制的组织生长。示例性癌症包括但不限于白血病、淋巴瘤、结直肠癌、肾癌、肝癌、乳腺癌、肺癌、前列腺癌、睾丸癌、卵巢癌、子宫癌、宫颈癌、脑癌、骨癌、肉瘤、黑色素瘤、头颈部癌、食管癌、甲状腺癌等。在本发明实施例中，本发明实施于治疗和/或预防肿瘤形成癌症。The term "cancer" has its understood meaning in the art, eg, uncontrolled tissue growth with spread or metastasis to distant parts of the body. Exemplary cancers include, but are not limited to, leukemia, lymphoma, colorectal cancer, kidney cancer, liver cancer, breast cancer, lung cancer, prostate cancer, testicular cancer, ovarian cancer, uterine cancer, cervical cancer, brain cancer, bone cancer, sarcoma, melanoma tumor, head and neck cancer, esophageal cancer, thyroid cancer, etc. In an embodiment of the invention, the invention is practiced in the treatment and/or prevention of neoplastic cancer.

癌细胞抗原已在上文中进行了描述。术语“治疗癌症”，旨在降低癌症的严重程度，预防或至少部分消除癌症。例如，在特定情况下，这些术语表明癌症的治疗是预防或减少的，或至少部分消除的。在又一代表性实施例中，这些术语表明阻止或减少或至少部分地消除转移结节的生长，例如，在外科切除原发肿瘤后。根据“预防癌症”术语，旨在至少部分地消除或减少癌症的发生率或发病率。或者说，受试者的癌症发病或进展可能被减缓、控制、可能性或概率降低或延迟。Cancer cell antigens have been described above. The term "treating cancer" aims at reducing the severity of, preventing or at least partially eliminating cancer. For example, these terms indicate that the treatment of cancer prevents or reduces, or at least partially eliminates, under certain circumstances. In yet another representative embodiment, these terms indicate arresting or reducing or at least partially eliminating the growth of metastatic nodules, eg, following surgical resection of the primary tumor. According to the term "prevention of cancer", the aim is to at least partially eliminate or reduce the occurrence or incidence of cancer. In other words, the onset or progression of the subject's cancer may be slowed, controlled, reduced or delayed in likelihood or probability.

在具体实施例中，细胞可从患有癌症的个体中移出并与本发明的病毒载体接触。然后将修饰后的细胞给予受试者，由此引发对癌细胞抗原的免疫反应。这种方法特别适用于免疫功能低下的受试者，这些受试者在体内不能产生足够的免疫反应，即不能产生足够数量的增强抗体。In a specific example, cells can be removed from an individual with cancer and contacted with a viral vector of the invention. The modified cells are then administered to the subject, thereby eliciting an immune response to the cancer cell antigen. This approach is particularly useful in immunocompromised subjects who are unable to mount an adequate immune response in vivo, ie, do not produce sufficient quantities of boosting antibodies.

本领域已知免疫调节性细胞因子，例如α-干扰素、β-干扰素、γ-干扰素、ω-干扰素、τ-干扰素、白细胞介素-lα、白细胞介素-1β、白细胞介素-2、白细胞介素-3、白细胞介素-4、白细胞介素5、白细胞介素-6、白细胞介素-7、白细胞介素-8，白细胞介素-9，白细胞介素-10，白细胞介素-11，白细胞介素12，白细胞介素-13，白细胞介素-14，白细胞介素-18，B细胞生长因子，CD40配体，肿瘤坏死因子-α，肿瘤坏死因子-β，单核细胞趋化蛋白-1，粒细胞-巨噬细胞集落刺激因子，和淋巴毒素，和免疫调节性细胞因子，例如CTL诱导性细胞因子，可与病毒载体一起给予受试者。Immunomodulatory cytokines are known in the art, such as α-interferon, β-interferon, γ-interferon, ω-interferon, τ-interferon, interleukin-1α, interleukin-1β, interleukin Interleukin-2, Interleukin-3, Interleukin-4, Interleukin-5, Interleukin-6, Interleukin-7, Interleukin-8, Interleukin-9, Interleukin-10 , interleukin-11, interleukin-12, interleukin-13, interleukin-14, interleukin-18, B cell growth factor, CD40 ligand, tumor necrosis factor-alpha, tumor necrosis factor-beta , monocyte chemoattractant protein-1, granulocyte-macrophage colony-stimulating factor, and lymphotoxin, and immunomodulatory cytokines, such as CTL-inducing cytokines, can be administered to a subject together with a viral vector.

细胞因子可以通过本领域已知的任何方法进行注射。可以将外源性细胞因子注射到受试者体内，或者使用合适的载体将编码细胞因子的核苷酸序列传递给受试者，并在体内产生细胞因子。Cytokines can be injected by any method known in the art. The exogenous cytokine can be injected into the subject, or the nucleotide sequence encoding the cytokine can be delivered to the subject using a suitable vector, and the cytokine can be produced in vivo.

根据本发明的重组病毒载体可在兽医和医学中应用。适合的对象包括鸟类和哺乳动物。本文所用术语“鸟类”包括但不限于鸡、鸭、鹅、鹌鹑、火鸡、野鸡、鹦鹉。本文所用术语“哺乳动物”包括但不限于人类、灵长类、非人类灵长类、牛、绵羊、山羊、猪、马、猫、狗、兔子、啮齿动物等。人类受试者包括新生儿、婴儿、少年和成人。任选地，所述“需要”本发明方法的个体，例如，因为所述个体具有或被认为具有包括本发明所述的疾病风险，或者将从包括本发明所述的核酸递送中受益。作为进一步的选择，受试者可以是实验室动物和/或疾病的动物模型。The recombinant viral vector according to the present invention can be used in veterinary medicine and medicine. Suitable subjects include birds and mammals. The term "bird" as used herein includes, but is not limited to, chicken, duck, goose, quail, turkey, pheasant, parrot. The term "mammal" as used herein includes, but is not limited to, humans, primates, non-human primates, cattle, sheep, goats, pigs, horses, cats, dogs, rabbits, rodents, and the like. Human subjects include neonates, infants, juveniles and adults. Optionally, the individual is "in need of" the method of the invention, for example, because the individual has or is believed to be at risk of a disease comprising the invention, or would benefit from delivery of a nucleic acid comprising the invention. As a further option, the subject can be a laboratory animal and/or an animal model of disease.

在具体实施例中，本发明提供包含本发明病毒载体的药物组合物，所述病毒载体位于药学上可接受的载体中，并且任选的包括其它药剂、稳定剂、缓冲剂、载体、佐剂、稀释剂等，用于注射的载体通常是液体。对于其它的方法，运输载体既可以是固体的，也可以是液体的。吸入给药时，载体将是可呼吸的，最好是固体或液体微粒形式。In a specific embodiment, the present invention provides a pharmaceutical composition comprising the viral vector of the present invention, the viral vector is located in a pharmaceutically acceptable carrier, and optionally includes other agents, stabilizers, buffers, carriers, adjuvants , diluent, etc., the carrier for injection is usually a liquid. For other methods, the delivery vehicle can be either solid or liquid. For administration by inhalation, the carrier will be respirable, preferably in solid or liquid particulate form.

“药学上可接受”是指一种无毒或无其它不需要特性的材料，即，该材料可以给受试者服用而不会产生任何不需要的生物效应。"Pharmaceutically acceptable" means a material that is non-toxic or otherwise undesired, ie, the material can be administered to a subject without producing any undesired biological effects.

本发明的一个方面是在体外将核苷酸序列转移到细胞的方法。病毒载体可根据适合特定靶细胞的标准转导方法以适当的感染倍数导入细胞。要施用的病毒载体或衣壳的滴度可根据目标细胞类型和数量以及特定病毒载体或衣壳而变化。在具体实施例中，将至少约10³感染单位、更优选地至少约10⁵感染单位导入细胞。One aspect of the invention is a method of transferring a nucleotide sequence to a cell in vitro. Viral vectors can be introduced into cells at appropriate multiplies of infection according to standard transduction methods appropriate for the particular target cell. The titer of the viral vector or capsid to be administered will vary depending on the target cell type and number and the particular viral vector or capsid. In specific embodiments, at least about 10 ³ infectious units, more preferably at least about 10 ⁵ infectious units, are introduced into the cells.

要导入病毒载体的细胞可以是任何类型的，包括但不限于神经细胞，包括外周和中枢神经系统的细胞，特别是脑细胞，例如神经元、寡树突细胞、神经胶质细胞、星形胶质细胞，肺细胞、包括视网膜细胞、视网膜色素上皮和角膜细胞的眼部细胞、包括肠和呼吸上皮细胞的上皮细胞、包括成肌细胞，肌管和肌纤维的骨骼肌细胞、膈肌细胞、树突细胞、包括胰岛细胞的胰腺细胞、肝细胞、包括平滑肌细胞和上皮细胞的胃肠道细胞、包括心肌细胞的心脏细胞、骨髓细胞、造血干细胞、脾细胞、角质细胞、成纤维细胞、内皮细胞、前列腺细胞、包括例如软骨、半月板、滑膜和骨髓的关节细胞、生殖细胞等。作为另一种选择，细胞可以是干细胞，例如，神经干细胞、肝干细胞。作为另一种选择，细胞可以是癌症或肿瘤细胞，如上所述的癌症和肿瘤。此外，如上所述，细胞可以来自任何物种的来源。The cells into which the viral vector is to be introduced can be of any type, including but not limited to nerve cells, including cells of the peripheral and central nervous system, especially brain cells, such as neurons, oligodendrocytes, glial cells, astrocytes Pulmonary cells, ocular cells including retinal cells, retinal pigment epithelium and corneal cells, epithelial cells including intestinal and respiratory epithelial cells, skeletal muscle cells including myoblasts, myotubes and muscle fibers, diaphragm cells, dendrites cells, pancreatic cells including islet cells, liver cells, gastrointestinal cells including smooth muscle cells and epithelial cells, cardiac cells including cardiomyocytes, bone marrow cells, hematopoietic stem cells, spleen cells, keratinocytes, fibroblasts, endothelial cells, Prostate cells, joint cells including, for example, cartilage, meniscus, synovium, and bone marrow, germ cells, and the like. Alternatively, the cells may be stem cells, eg, neural stem cells, hepatic stem cells. Alternatively, the cells may be cancer or tumor cells, cancers and tumors as described above. Furthermore, as noted above, cells can be derived from any species of source.

可将病毒载体引入离体细胞以向个体投予经修饰的细胞。在具体实施例中，将细胞从个体中移出，在其中导入病毒载体，然后将细胞回输回个体中。从受试者身上移出细胞用于活体外治疗，然后再导入受试者的方法为本领域所知，参见例如美国专利US5399346。或者，将重组病毒载体导入来自另一个体的细胞、培养细胞或来自任何其它合适来源的细胞中，并且将细胞给予需要的个体。Viral vectors can be introduced into cells ex vivo to administer the modified cells to an individual. In specific embodiments, cells are removed from the individual, a viral vector is introduced therein, and the cells are then returned to the individual. Methods of removing cells from a subject for ex vivo therapy and then reintroducing them into the subject are known in the art, see eg US Pat. No. 5,399,346. Alternatively, the recombinant viral vector is introduced into cells from another individual, in culture, or from any other suitable source, and the cells are administered to the individual in need.

适合于体外基因治疗的细胞如上文所述。给予受试者的细胞剂量将随受试者的年龄、条件和种类、细胞类型、细胞表达的核酸、给药方式等而变化。通常，在药学上可接受的载体中，每剂量至少给药10²至10⁸或约10³至约10⁶个细胞。在具体实施例中，将用病毒载体转导的细胞与药物载体组合以有效量给予个体。Cells suitable for in vitro gene therapy are described above. The dose of cells administered to a subject will vary with the subject's age, condition and species, cell type, nucleic acid expressed by the cells, mode of administration, and the like. Typically, at least ¹⁰² to ¹⁰⁸ or about ¹⁰³ to about ¹⁰⁶ cells are administered per dose in a pharmaceutically acceptable carrier. In specific embodiments, cells transduced with a viral vector are administered to an individual in an effective amount in combination with a pharmaceutical carrier.

本发明的另一方面是向受试者施用本发明病毒载体或衣壳的方法。在具体实施例中，所述方法包括将感兴趣的核酸传递给动物个体的方法，所述方法包括：向动物个体施用有效量的本发明病毒载体。可通过本领域已知的任何方法将本发明病毒载体投予人类个体或需要其的动物。任选地，病毒载体以有效剂量在药学上可接受的载体中递送。Another aspect of the invention is a method of administering a viral vector or capsid of the invention to a subject. In specific embodiments, the method includes a method of delivering a nucleic acid of interest to an individual animal, the method comprising: administering to the individual animal an effective amount of the viral vector of the present invention. Administration of the viral vectors of the invention to a human subject or animal in need thereof can be by any method known in the art. Optionally, the viral vector is delivered in a pharmaceutically acceptable carrier in an effective dose.

本发明病毒载体可进一步投予个体以引起免疫原性反应，例如，作为疫苗。通常，本发明疫苗包含有效量的病毒与药学上可接受的载体组合。任选地，剂量足以产生保护性免疫反应。只要给予免疫原性多肽的益处大于其任何缺陷，所给予的保护程度就不必是完全的或永久的。受试者和免疫原如上文所述。The viral vectors of the invention can further be administered to an individual to elicit an immunogenic response, eg, as a vaccine. Typically, the vaccines of the invention comprise an effective amount of virus in combination with a pharmaceutically acceptable carrier. Optionally, the dose is sufficient to produce a protective immune response. The degree of protection conferred need not be complete or permanent as long as the benefits of conferring the immunogenic polypeptide outweigh any disadvantages. Subjects and immunogens are as described above.

给受试者注射的病毒载体的剂量将取决于给药方式、待治疗的疾病或状况、个体的情况、特定的病毒载体和待递送的核酸，并且可以常规方式确定。达到治疗效果的示范性剂量是至少约10⁵，10⁶，10⁷，10⁸，10⁹，10¹⁰，10¹¹，10¹²，10¹³，10¹⁴，10¹⁵Tu或更多的病毒滴度，优选约10⁷或10⁸～10¹²，10¹³或10¹⁴Tu、更优选约10¹²Tu的病毒滴度。The dose of viral vector injected into a subject will depend on the mode of administration, the disease or condition being treated, the individual's condition, the particular viral vector and nucleic acid to be delivered, and can be determined in a routine manner. Exemplary doses to achieve a therapeutic effect are viral titers of at least about 10 ⁵ , 10 ⁶ , 10 ⁷ , 10 ⁸ , 10 ⁹ , 10 ¹⁰ , 10 ¹¹ , 10 ¹² , 10 ¹³ , 10 ¹⁴ , 10 ¹⁵ Tu or more , preferably about 10 ⁷ or 10 ⁸ to 10 ¹² , 10 ¹³ or 10 ¹⁴ Tu, more preferably about 10 ¹² Tu.

在具体实施例中，可以使用不止一次的给药，例如，两次、三次、四次或更多次给药，可以在不同的时间间隔内来实现所需的基因表达水平。In particular embodiments, more than one dose may be used, eg, two, three, four or more doses, may be at different time intervals to achieve the desired level of gene expression.

给药方式的实例包括口服、直肠、粘膜、外用、鼻内、吸入、口腔、阴道、鞘内、眼内、经皮、子宫内、肠外、静脉内、皮下、皮内、肌肉内、皮内、胸膜内、脑内和关节内、皮肤和粘膜表面、内淋巴管内等，以及直接组织或器官注射，例如在肝、骨骼肌、心肌、膈肌或脑内直接注射。也可对肿瘤施用，例如，在肿瘤或淋巴结内或附近注射。在任何给定情况下，最合适的路径将取决于所治疗条件的性质和严重性以及所使用的特定载体的性质。Examples of modes of administration include oral, rectal, mucosal, topical, intranasal, inhalation, buccal, vaginal, intrathecal, intraocular, transdermal, intrauterine, parenteral, intravenous, subcutaneous, intradermal, intramuscular, dermal Intrapleural, intracerebral and intraarticular, skin and mucosal surfaces, intralymphatic vessels, etc., and direct tissue or organ injections, such as direct injection in the liver, skeletal muscle, cardiac muscle, diaphragm or brain. Administration to tumors can also be performed, for example, by injection in or near the tumor or lymph nodes. The most appropriate route in any given case will depend upon the nature and severity of the condition being treated and the nature of the particular vector employed.

注射物可以常规形式制备，既可以是液体溶液或悬浮液，也可以是适合于注射前液体溶液或悬浮液的固体形式，或者是乳状液。或者，可以局部而不是全身性的方式来施用病毒载体，例如，以一种特定的方式，如缓释制剂。此外，病毒载体可以干燥形式输送至外科植入基质，例如骨移植替代物、缝合线、支架等。Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms suitable for liquid solution or suspension prior to injection, or as emulsions. Alternatively, viral vectors may be administered in a local rather than systemic manner, for example, in a specific manner, such as a sustained release formulation. In addition, viral vectors can be delivered in dry form to surgical implant substrates, such as bone graft substitutes, sutures, scaffolds, and the like.

适于经口服给药的医药组合物可以离散单位呈现，例如胶囊、缓存剂、含片或片剂，每一单位均包含预定量的本发明组合物。作为粉末或颗粒、作为水性液体或非水性液体的溶液或悬浮液、或作为水包油或油包水乳化液。口服可通过将本发明病毒载体复合到能够抵抗动物肠道内消化酶降解的载体来完成。此类载体的实例包括本领域已知的胶囊或片剂。此类制剂是通过任何合适的药学方法制备的，所述方法包括将所述成分与合适的载体结合起来的步骤，所述载体可包含一种或多种如上文所述的辅料成分。一般而言，根据本发明实施例的药物组合物是通过将所述组合物与液体或细分固体载体或两者均匀且紧密地混合，然后对所得混合物进行成形来制备的。例如，可以通过压缩或成型含有所述组合物的粉末或颗粒来制备片剂，可以选择使用一种或多种辅料成分。压片是通过在适当的机器中压缩自由流动形式的组合物来制备的，例如任选地与粘合剂、润滑剂、惰性稀释剂和/或表面活性/分散剂混合的粉末或颗粒。通过在适当的机器中用惰性液体粘合剂湿润粉末状化合物来成型片剂。Pharmaceutical compositions suitable for oral administration may be presented as discrete units such as capsules, cachets, troches or tablets, each unit containing a predetermined amount of a composition of the invention. As a powder or granules, as a solution or suspension in an aqueous or non-aqueous liquid, or as an oil-in-water or water-in-oil emulsion. Oral administration can be accomplished by complexing the viral vector of the present invention to a vector that is resistant to degradation by digestive enzymes in the gut of the animal. Examples of such carriers include capsules or tablets known in the art. Such formulations are prepared by any suitable method of pharmacy which includes the step of bringing into association the ingredients with a suitable carrier which may contain one or more accessory ingredients as hereinbefore described. In general, pharmaceutical compositions according to embodiments of the present invention are prepared by uniformly and intimately mixing said composition with liquid or finely divided solid carriers or both, and then shaping the resulting mixture. For example, a tablet may be prepared by compressing or molding a powder or granules containing the composition, optionally with one or more accessory ingredients. Compressed tablets are prepared by compressing in a suitable machine the composition in a free-flowing form, such as a powder or granules, optionally mixed with a binder, lubricant, inert diluent and/or surface active/dispersing agents. Tablets are formed by wetting the powdered compound with an inert liquid binder in a suitable machine.

适用于口腔给药的药物组合物包括含片，其在调味基中包含本发明的成分，调味基通常为蔗糖和阿拉伯树胶或黄芪胶，以及包含如明胶和甘油或蔗糖和阿拉伯树胶惰性基成分的锭剂。Pharmaceutical compositions suitable for oral administration include lozenges comprising the ingredients of the invention in a flavored base, typically sucrose and acacia or tragacanth, and an inert base such as gelatin and glycerin or sucrose and acacia. lozenges.

适于非经肠给药的药物组合物可包含本发明组合物的无菌水性和非水性注射溶液，所述制剂任选地与预期受体的血液等渗。这些制剂可含有抗氧化剂、缓冲剂、抑菌剂和溶质，使成分与预期受体的血液等渗。水性和非水性无菌悬浮液、溶液和乳剂可包括悬浮剂和增稠剂。非水溶剂的实例包括丙二醇、聚乙二醇、植物油和可注射有机酯。水性载体包括水、酒精/水溶液、乳液或悬浮液，包括盐水和缓冲介质。非肠道药物包括氯化钠溶液、林格氏葡萄糖、葡萄糖和氯化钠、乳酸林格氏或固定油。静脉注射的载体包括液体和营养补充剂、电解质补充剂，例如基于林格氏葡萄糖的补充剂等。防腐剂和其它添加剂例如抗菌剂、抗氧化剂、螯合剂和惰性气体等也可存在。Pharmaceutical compositions suitable for parenteral administration may comprise sterile aqueous and non-aqueous injectable solutions of the compositions of this invention, the formulations being optionally isotonic with the blood of the intended recipient. These formulations may contain antioxidants, buffers, bacteriostats, and solutes to render the ingredients isotonic with the blood of the intended recipient. Aqueous and non-aqueous sterile suspensions, solutions and emulsions may contain suspending and thickening agents. Examples of non-aqueous solvents include propylene glycol, polyethylene glycol, vegetable oils, and injectable organic esters. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral agents include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers such as those based on Ringer's dextrose, and the like. Preservatives and other additives such as antimicrobials, antioxidants, chelating agents and inert gases and the like may also be present.

这些成分可以单位剂量或多剂量容器表示，例如，在密封的安瓿和小瓶中，并且可以储存在冷冻干燥条件下，只需要在使用前立即添加无菌液体载体，例如，盐水或注射用水。The compositions can be presented in unit-dose or multi-dose containers, for example, in sealed ampoules and vials, and can be stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier, for example, saline or water for injection, immediately before use.

临时注射溶液和悬浮液可由上述无菌粉末、颗粒和片剂制备。例如，可以在密封容器中以单位剂量形式提供本发明的可注射、稳定、无菌的组合物。所述组合物可以冻干物的形式提供，所述冻干物可以用合适的药学上可接受的载体重组，以形成适于注射到个体中的液体组合物。单位剂型可为本发明组合物的约1μg至约10g。当所述组合物基本上不溶于水时，可以加入生理上可接受的足够量的乳化剂，以使所述组合物在水性载体中乳化。其中一种有用的乳化剂是磷脂酰胆碱。Extemporaneous injection solutions and suspensions can be prepared from sterile powders, granules and tablets as described above. For example, the injectable, stable, sterile compositions of the invention can be presented in unit dosage form in sealed containers. The composition may be provided in the form of a lyophilizate which can be reconstituted with a suitable pharmaceutically acceptable carrier to form a liquid composition suitable for injection into an individual. Unit dosage forms may be from about 1 μg to about 10 g of the compositions of the invention. When the composition is substantially water-insoluble, an emulsifying agent may be added in a physiologically acceptable amount sufficient to emulsify the composition in the aqueous carrier. One useful emulsifier is phosphatidylcholine.

适用于直肠给药的药物成分可以用单位剂量栓剂表示。这些可以通过将该组分与一种或多种传统固体载体混合，然后形成所得到的混合物来制备。Pharmaceutical compositions suitable for rectal administration may be presented as unit dose suppositories. These can be prepared by mixing the components with one or more conventional solid carriers and forming the resulting mixture.

本发明的适合局部应用于皮肤的药物组合物可以采取软膏、乳膏、乳液、糊剂、凝胶、喷雾、气雾剂或油的形式。可使用的载体包括但不限于凡士林、羊毛脂、聚乙二醇、醇类、透皮增强剂和其两种或两种以上的组合。例如，在一些实施例中，局部递送可通过将本发明的药物组合物与能够进入皮肤的亲脂性试剂混合来执行。Pharmaceutical compositions of the present invention suitable for topical application to the skin may take the form of ointments, creams, lotions, pastes, gels, sprays, aerosols or oils. Carriers that can be used include, but are not limited to, petrolatum, lanolin, polyethylene glycol, alcohols, skin penetration enhancers and combinations of two or more thereof. For example, in some embodiments, topical delivery can be performed by admixing a pharmaceutical composition of the invention with a lipophilic agent capable of entering the skin.

适合经皮给药的药物成分可以是离散的贴片形式，适合与受试者的表皮长时间保持密切接触。适合经皮给药的组合物也可以通过离子导入递送，并且通常采用本发明组合物的任选缓冲水溶液的形式。适宜调配物可包含柠檬酸盐或bis/tris缓冲液或乙醇/水，且可含有0.1至0.2M活性成分。A pharmaceutical composition suitable for transdermal administration may be in the form of a discrete patch, adapted to remain in close contact with the epidermis of a subject for an extended period of time. Compositions suitable for transdermal administration may also be delivered by iontophoresis, and generally take the form of optionally buffered aqueous solutions of compositions of the invention. Suitable formulations may comprise citrate or bis/tris buffer or ethanol/water and may contain from 0.1 to 0.2M active ingredient.

本文所揭示之病毒载体可通过任何适宜方法投予个体之肺，例如投予由该个体吸入的病毒载体组成的可呼吸粒子的悬浮液。可吸入颗粒物可以是液体或固体。包含病毒载体的液体粒子的气溶胶可以通过任何适当的方法产生，例如使用本领域技术人员已知的压力驱动气溶胶雾化器或超声波雾化器。包含病毒载体的固体颗粒的气溶胶同样可以通过医药领域已知的技术与任何固体颗粒药物气溶胶发生器一起产生。The viral vectors disclosed herein may be administered to the lungs of a subject by any suitable method, such as administration of a suspension of respirable particles consisting of the viral vectors inhaled by the subject. Respirable particulate matter can be liquid or solid. Aerosols of liquid particles comprising viral vectors may be generated by any suitable method, for example using pressure-driven aerosol nebulizers or ultrasonic nebulizers known to those skilled in the art. Aerosols of solid particles comprising viral vectors can likewise be generated by techniques known in the medical art with any solid particle drug aerosol generator.

III嵌合AAV衣壳病毒载体的定向进化和体内筛选方法III Directed evolution and in vivo screening methods of chimeric AAV capsid virus vectors

本发明还包括一种方法，用于制备包含嵌合AAV衣壳的病毒库，然后在体内筛选具有一种或多种所需特性的嵌合AAV衣壳或病毒。所需特性的非限制性例子包括靶向性特征、逃避中和抗体的能力以及改善细胞内运输等。The present invention also includes a method for preparing a viral library comprising chimeric AAV capsids and then selecting in vivo for chimeric AAV capsids or viruses having one or more desired properties. Non-limiting examples of desirable properties include targeting characteristics, the ability to evade neutralizing antibodies, and improved intracellular trafficking, among others.

在代表性的实施方案中，本发明提供了一种识别感兴趣特性的病毒载体的方法，例如，一种识别感兴趣特性的AAV载体或AAV衣壳的方法，该方法包括：In representative embodiments, the invention provides a method of identifying a viral vector with a property of interest, e.g., a method of identifying an AAV vector or an AAV capsid with a property of interest, the method comprising:

第一，提供病毒载体例如AAV颗粒的集合，其中该集合内的每个AAV载体包括：包含通过对两种或多种不同AAV的衣壳编码序列进行改组而产生的衣壳蛋白，其中所述两种或多种不同AAV的衣壳氨基酸序列相差至少两个氨基酸；和病毒载体基因组，例如AAV载体基因组，所述病毒载体基因组包含前述改组产生的AAV衣壳蛋白编码序列、一种AAV Rep的编码序列以及至少一个末端重复例如AAV的5’和/或3’末端重复，其中所述病毒载体基因组包裹于AAV衣壳中。First, a collection of viral vectors, such as AAV particles, is provided, wherein each AAV vector within the collection comprises a capsid protein produced by shuffling the capsid coding sequences of two or more different AAVs, wherein the The capsid amino acid sequences of two or more different AAVs differ by at least two amino acids; and a viral vector genome, such as an AAV vector genome, comprising the aforementioned shuffled AAV capsid protein coding sequence, an AAV Rep A coding sequence and at least one terminal repeat such as the 5' and/or 3' terminal repeats of AAV, wherein the viral vector genome is encapsulated in an AAV capsid.

第二，向受试者施用病毒载体的集合；Second, administering the collection of viral vectors to the subject;

第三，从目标组织中回收多个作为病毒粒子或作为编码AAV衣壳的病毒载体基因组的病毒载体,从而鉴定具有感兴趣特性的病毒载体或AAV衣壳。Third, multiple viral vectors are recovered from target tissues as virions or as viral vector genomes encoding AAV capsids, thereby identifying viral vectors or AAV capsids with properties of interest.

本发明还可用于识别一种在体内具有逃避中和抗体能力的嵌合AAV衣壳或病毒颗粒，例如，在人血清中发现的中和抗体。例如，可通过向受试者注射人类免疫球蛋白来进行对中和抗体抗性的体内筛选。例如，向非人类哺乳动物受试者注射IVIG。IVIG天然含有针对人类所有常见AAV的抗体混合物。或者，可以给受试者注射特定的中和抗体，然后将嵌合病毒库注射到受试者中，对进入感兴趣的目标组织(例如心脏、骨骼肌、肝脏等)的病毒基因组进行选择，从目标组织分离出的基因组与能够逃避中和的衣壳相对应。The invention can also be used to identify a chimeric AAV capsid or virion that has the ability to evade neutralizing antibodies in vivo, eg, neutralizing antibodies found in human serum. For example, in vivo screening for neutralizing antibody resistance can be performed by injecting a subject with human immunoglobulin. For example, IVIG is injected into a non-human mammalian subject. IVIG naturally contains a cocktail of antibodies against all common AAVs in humans. Alternatively, the subject can be injected with specific neutralizing antibodies, and then the chimeric virus library can be injected into the subject to select for viral genomes that enter the target tissue of interest (e.g., heart, skeletal muscle, liver, etc.), Genomes isolated from target tissues correspond to capsids that escape neutralization.

因此，在具有代表性的实施例中，本发明提供了一种识别具有逃避中和IgGs能力的嵌合AAV衣壳或病毒载体的方法，所述方法包括：Accordingly, in an exemplary embodiment, the invention provides a method of identifying chimeric AAV capsids or viral vectors that have the ability to evade neutralizing IgGs, the method comprising:

第一，向哺乳动物个体施用IgGs；First, administering IgGs to a mammalian individual;

第二，提供病毒载体例如AAV颗粒的集合，其中该集合内的每个AAV载体包括：包含由通过对两种或多种不同AAV的衣壳编码序列进行改组而产生的衣壳蛋白，其中所述两种或多种不同AAV的衣壳氨基酸序列相差至少两个氨基酸；和病毒载体基因组，例如AAV载体基因组，所述病毒载体基因组包含前述改组产生的AAV衣壳蛋白编码序列、一种AAV Rep的编码序列以及至少一个末端重复例如AAV的5’和/或3’末端重复，其中所述病毒载体基因组包裹于AAV衣壳中。Second, a collection of viral vectors, such as AAV particles, is provided, wherein each AAV vector within the collection comprises a capsid protein produced by shuffling the capsid coding sequences of two or more different AAVs, wherein the The capsid amino acid sequences of the two or more different AAVs differ by at least two amino acids; and a viral vector genome, such as an AAV vector genome, which comprises the AAV capsid protein coding sequence produced by the aforementioned shuffling, an AAV Rep The coding sequence and at least one terminal repeat such as the 5' and/or 3' terminal repeat of AAV, wherein the viral vector genome is wrapped in the AAV capsid.

第三，向受试者施用病毒载体的集合；Third, administering the collection of viral vectors to the subject;

第四，从目标组织中回收多个作为病毒粒子或作为编码AAV衣壳的病毒载体基因组的病毒载体，从而鉴定具有逃避中和抗体特性的病毒载体或AAV衣壳。Fourth, multiple viral vectors are recovered from target tissues as virions or as viral vector genomes encoding AAV capsids, thereby identifying viral vectors or AAV capsids with properties to evade neutralizing antibodies.

“逃避”中和抗体，意指与适当的对照组相比，中和至少部分减少，但只要中和程度与对照组相比有所降低，并且只要一些载体能够到达并转导目标组织，则“逃避”的程度就不需要完全。"Evading" neutralizing antibodies, meaning neutralization is at least partially reduced compared to an appropriate control, but as long as the degree of neutralization is reduced compared to a control, and as long as some vector is able to reach and transduce the target tissue, then The degree of "escape" does not need to be complete.

在具体实施例中，目标组织是肝脏、骨骼肌、心肌、横膈膜肌、肾脏、胰腺、脾脏、胃肠道、肺、关节组织、舌头、卵巢、睾丸、生殖细胞、癌细胞或上述的组合。In specific embodiments, the target tissue is liver, skeletal muscle, cardiac muscle, diaphragmatic muscle, kidney, pancreas, spleen, gastrointestinal tract, lung, joint tissue, tongue, ovary, testis, germ cells, cancer cells, or any of the above combination.

两个或多个AAV衣壳的任意组合的序列，无论是自然发生的还是修改的，无论是现在已知的还是以后发现的，都可以被“改组”来产生包含嵌合衣壳的AAV载体集合。在代表性实施例中，AAV载体的集合包含通过从以下两个或多个衣壳序列改组而生成嵌合衣壳，所述AAV衣壳包括：AAV1、AAV2、AAV3A、AAV3B、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、禽AAV、牛AAV、犬AAV、马AAV、绵羊AAV、鹅AAV或蛇AAV。如上所述，AAV衣壳的集合可以进一步包含目前已知或后来发现的非自然发生的AAV衣壳。这种修饰包括替换，包括修饰核酸和/或氨基酸的替换，以及删除和/或插入。The sequences of any combination of two or more AAV capsids, whether naturally occurring or modified, whether now known or later discovered, can be "shuffled" to generate AAV vectors containing chimeric capsids gather. In representative embodiments, the collection of AAV vectors comprises chimeric capsids generated by shuffling from two or more capsid sequences comprising: AAV1, AAV2, AAV3A, AAV3B, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, Avian AAV, Bovine AAV, Canine AAV, Equine AAV, Sheep AAV, Goose AAV or Snake AAV. As noted above, the collection of AAV capsids may further comprise currently known or later discovered non-naturally occurring AAV capsids. Such modifications include substitutions, including substitutions of modified nucleic acids and/or amino acids, as well as deletions and/or insertions.

通过本领域已知的将突变引入核酸和/或氨基酸序列的任何方法，例如使用化学诱变剂、易错的PCR、盒式突变等，可以实现嵌合衣壳的进一步多样性。Further diversity of chimeric capsids can be achieved by any method known in the art to introduce mutations into nucleic acid and/or amino acid sequences, eg, use of chemical mutagens, error-prone PCR, cassette mutagenesis, and the like.

任选地，可将体内筛选方法与一轮或多轮体外筛选组合以进一步优化载体。例如，可以进行体内选择，以识别具有所需特性的嵌合AAV衣壳，然后体外选择可用于识别具有逃避抗体中和能力的AAV衣壳。Optionally, the in vivo screening method can be combined with one or more rounds of in vitro selection to further optimize the vector. For example, in vivo selection can be performed to identify chimeric AAV capsids with desired properties, and then in vitro selection can be used to identify AAV capsids with the ability to evade antibody neutralization.

AAV颗粒的集合可以通过任何合适的方法给受试者服用。在具体实施例中，将集合物投予受试者的血液，例如静脉或关节内。The collection of AAV particles can be administered to a subject by any suitable method. In specific embodiments, the collection is administered into the blood of a subject, eg, intravenously or intra-articularly.

给药方式和受试者如本文其它地方所述。Administration and subjects are as described elsewhere herein.

本发明可用于在体内识别具有所需特性的嵌合病毒或病毒衣壳。因此，在具体实施例中，本发明的方法包括从两个或多个目标组织中回收AAV粒子或编码相同AAV粒子的病毒基因组，并识别对所述两个或多个目标组织具有所需特性的嵌合病毒或嵌合AAV衣壳。例如，在具体实施例中，识别对骨骼肌和/或心肌具有低效靶向性和或对肝脏具有高效靶向性的嵌合病毒或嵌合AAV衣壳。The present invention can be used to identify chimeric viruses or viral capsids with desired properties in vivo. Thus, in specific embodiments, the methods of the invention comprise recovering AAV particles or viral genomes encoding the same AAV particles from two or more target tissues, and identifying a protein having desired properties for said two or more target tissues. chimeric virus or chimeric AAV capsid. For example, in particular embodiments, chimeric viruses or chimeric AAV capsids with inefficient targeting of skeletal muscle and/or cardiac muscle and or efficient targeting of the liver are identified.

目标细胞或组织或其之一也可以是癌细胞或肿瘤组织。例如，可以对癌症的动物模型施用嵌合病毒，并且从癌细胞或肿瘤中分离编码该病毒的嵌合病毒外壳或病毒基因组。在具有代表性的实施例中，动物模型可以是具有增加形成癌症或肿瘤可能性的模型，或者可以是将人类肿瘤细胞移植到动物中的异种移植模型。The target cell or tissue or one of them may also be a cancer cell or a tumor tissue. For example, a chimeric virus can be administered to an animal model of cancer and the chimeric viral coat or viral genome encoding the virus isolated from cancer cells or tumors. In representative embodiments, the animal model can be a model with an increased likelihood of developing a cancer or tumor, or can be a xenograft model in which human tumor cells are transplanted into an animal.

DNA“改组”或“嵌合“的示例性方法，也称为“分子育种”、“快速强制进化”等，在本领域是已知的(US5605793、US6165793、US6117679，Stemmer,1994，Proc.Nati.Acad.Sci91:10747-10751，和Soong等,2000，Nature Genetics 25:436-439)。这种方法也被应用于病毒的定向进化(美国专利US6096548，美国专利US6596539)。在一个代表性实施例中，通过同源和/或非同源重组将AAV衣壳蛋白编码序列的集合在体外片段化和重组，以形成“嵌合”AAV衣壳蛋白的集合。每个嵌合衣壳包裹一种包含相应衣壳编码序列的核酸，例如AAV基因组，从而生成嵌合病毒。嵌合病毒集合被施用于受试者，并根据感兴趣的特性进行体内选择。例如，可以从一个或多个靶组织中分离出嵌合病毒，以识别具有所需特性的优化的衣壳蛋白。Exemplary methods of DNA "shuffling" or "chimerism", also known as "molecular breeding", "rapid forced evolution", etc., are known in the art (US5605793, US6165793, US6117679, Stemmer, 1994, Proc. Nati Acad. Sci91:10747-10751, and Soong et al., 2000, Nature Genetics 25:436-439). This method has also been applied to directed evolution of viruses (US Patent US6096548, US Patent US6596539). In a representative embodiment, collections of AAV capsid protein coding sequences are fragmented and recombined in vitro by homologous and/or non-homologous recombination to form collections of "chimeric" AAV capsid proteins. Each chimeric capsid encapsulates a nucleic acid, such as an AAV genome, comprising the corresponding capsid coding sequence, thereby generating a chimeric virus. Collections of chimeric viruses are administered to subjects and selected in vivo for properties of interest. For example, chimeric viruses can be isolated from one or more target tissues to identify optimized capsid proteins with desired properties.

Ⅳ具体实施方式Ⅳ specific implementation

在一方面，本发明提供了一种编码AAV衣壳蛋白的核酸，所述核酸包含选自以下组中任一核苷酸序列：In one aspect, the invention provides a nucleic acid encoding an AAV capsid protein, said nucleic acid comprising any nucleotide sequence selected from the group consisting of:

(a)核苷酸序列SEQ ID NO:1；(a) nucleotide sequence SEQ ID NO: 1;

(b)核苷酸序列SEQ ID NO:3；(b) nucleotide sequence SEQ ID NO: 3;

(c)核苷酸序列SEQ ID NO:5；(c) nucleotide sequence SEQ ID NO:5;

(d)核苷酸序列SEQ ID NO:7；(d) nucleotide sequence SEQ ID NO:7;

(e)核苷酸序列SEQ ID NO:9；(e) nucleotide sequence SEQ ID NO:9;

(f)核苷酸序列SEQ ID NO:11；(f) nucleotide sequence SEQ ID NO: 11;

(g)核苷酸序列SEQ ID NO:13；(g) nucleotide sequence SEQ ID NO: 13;

(h)核苷酸序列SEQ ID NO:15；(h) nucleotide sequence SEQ ID NO: 15;

(i)核苷酸序列SEQ ID NO:17；或(i) the nucleotide sequence of SEQ ID NO: 17; or

(j)由(a)-(i)任一核苷酸序列编码的AAV衣壳蛋白的核苷酸序列，其因遗传密码的简并性而与(a)-(i)的核苷酸序列不同。(j) the nucleotide sequence of the AAV capsid protein encoded by any nucleotide sequence of (a)-(i), which is different from the nucleotide of (a)-(i) due to the degeneracy of the genetic code The sequence is different.

所述核酸为质粒、噬菌体、病毒载体、细菌人工染色体、酵母人工染色体，优选包含有编码序列的AAV载体，更优选地所述核酸还包含一种AAV Rep蛋白的编码序列。The nucleic acid is a plasmid, phage, virus vector, bacterial artificial chromosome, yeast artificial chromosome, preferably an AAV vector containing a coding sequence, more preferably the nucleic acid also contains a coding sequence of an AAV Rep protein.

在一方面，本发明还提供一种由以上所述核酸编码的AAV衣壳蛋白，所述AAV衣壳蛋白的氨基酸序列包含选自SEQ ID NO:2，SEQ ID NO:4，SEQ ID NO:6，SEQ ID NO:8，SEQID NO:10，SEQ ID NO:12，SEQ ID NO:14，SEQ ID NO:16，或SEQ ID NO:18中的任一种。优选地，所述AAV衣壳共价连接、结合或包裹一种组合物，所述组合物选自DNA分子、RNA分子、多肽、碳水化合物、脂质体和小的有机分子中的一种或几种。In one aspect, the present invention also provides an AAV capsid protein encoded by the nucleic acid described above, the amino acid sequence of the AAV capsid protein comprises a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 6. Any one of SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18. Preferably, the AAV capsid covalently links, binds or encapsulates a composition selected from one or more of DNA molecules, RNA molecules, polypeptides, carbohydrates, liposomes and small organic molecules Several kinds.

在另一方面，本发明提供了一种重组病毒粒子，其包含前面所述的核酸和/或衣壳蛋白。所述重组病毒粒子选自重组AAV病毒粒子，重组腺病毒粒子，重组疱疹病毒粒子、重组杆状病毒粒子，或重组杂合病毒粒子。In another aspect, the present invention provides a recombinant virus particle comprising the aforementioned nucleic acid and/or capsid protein. The recombinant virus particles are selected from recombinant AAV virus particles, recombinant adenovirus particles, recombinant herpes virus particles, recombinant baculovirus particles, or recombinant hybrid virus particles.

在另一方面，本发明提供了一种重组AAV病毒粒子，其包含AAV载体基因组和前述AAV衣壳蛋白，其中AAV载体基因组包裹于AAV衣壳中。所述AAV载体基因组中包含异源核酸序列。所述异源核酸序列编码选自反义RNA、microRNA、shRNA、多肽和免疫原中一种或几种。优选地，所述异源核酸序列编码的多肽为治疗性多肽或报告基因，其中所述异源核酸编码的治疗性多肽选自胰岛素、胰高血糖素、生长激素、生长激素释放因子、促红细胞生成素、胰岛素生长因子、转化生长因子α、肝细胞生长因子、酪氨酸羟化酶、血小板生成素、白介素1-白介素25、低密度脂蛋白受体、糖皮质激素受体、维生素D受体、干扰素调节因子、Ⅷ因子、Ⅸ因子、葡萄糖苷酶、葡萄糖-6-磷酸酶、异戊酰-CoA脱氢酶、丙酰-CoA羧化酶、β-葡糖苷酶、肝磷酸化酶、磷酸化酶激酶、甘氨酸脱羧酶、α-半乳糖苷酶、β-半乳糖苷酶和溶酶体酶所组成组中的一种或几种。In another aspect, the present invention provides a recombinant AAV virion comprising the AAV vector genome and the aforementioned AAV capsid protein, wherein the AAV vector genome is wrapped in the AAV capsid. The AAV vector genome contains heterologous nucleic acid sequences. The heterologous nucleic acid sequence encodes one or more selected from antisense RNA, microRNA, shRNA, polypeptide and immunogen. Preferably, the polypeptide encoded by the heterologous nucleic acid sequence is a therapeutic polypeptide or a reporter gene, wherein the therapeutic polypeptide encoded by the heterologous nucleic acid sequence is selected from insulin, glucagon, growth hormone, growth hormone releasing factor, erythrocyte-stimulating Propoietin, insulin growth factor, transforming growth factor alpha, hepatocyte growth factor, tyrosine hydroxylase, thrombopoietin, interleukin 1-interleukin 25, low-density lipoprotein receptor, glucocorticoid receptor, vitamin D receptor body, interferon regulatory factor, factor VIII, factor IX, glucosidase, glucose-6-phosphatase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, beta-glucosidase, liver phosphorylation One or more of the group consisting of enzyme, phosphorylase kinase, glycine decarboxylase, α-galactosidase, β-galactosidase and lysosomal enzyme.

在一方面，本发明提供一种细胞，其包含前述的核酸、AAV衣壳蛋白、重组病毒粒子和/或重组AAV病毒粒子。In one aspect, the present invention provides a cell comprising the aforementioned nucleic acid, AAV capsid protein, recombinant virion and/or recombinant AAV virion.

在另一方面，本发明提供一种药物组合物，其包含一种药学上可接受载体或辅料和选自权前述的核酸、AAV衣壳蛋白、重组病毒粒子、重组AAV病毒粒子和/或前述细胞中的一种或几种。In another aspect, the present invention provides a pharmaceutical composition, which comprises a pharmaceutically acceptable carrier or adjuvant and selected from the aforementioned nucleic acid, AAV capsid protein, recombinant virion, recombinant AAV virion and/or the aforementioned one or more types of cells.

在另一方面，本发明还提供了前述的核酸、AAV衣壳蛋白、重组病毒粒子、重组AAV病毒粒子、细胞和/或前述药物组合物中的一种或几种在制备用于预防或治疗疾病的药物中的用途，所述疾病选自由囊性纤维化和肺部其它疾病、血友病A、血友病B、地中海贫血、贫血和其它血液疾病、老年痴呆症、多发性硬化症、帕金森氏病、亨廷顿氏病、肌萎缩侧索硬化症、癫痫、癌症、糖尿病、肌营养不良、糖原贮积病和其它代谢缺陷、先天性肺气肿、Lesch-Nyhan综合征、Niemann-Pick病、艾滋病、肝炎、高氨血症和脊髓性脑共济失调所组成组中的一种或几种。In another aspect, the present invention also provides one or more of the aforementioned nucleic acid, AAV capsid protein, recombinant virion, recombinant AAV virion, cell and/or the aforementioned pharmaceutical composition in preparation for prevention or treatment Use in medicine for a disease selected from the group consisting of cystic fibrosis and other diseases of the lung, hemophilia A, hemophilia B, thalassemia, anemia and other blood diseases, Alzheimer's disease, multiple sclerosis, Parkinson's disease, Huntington's disease, ALS, epilepsy, cancer, diabetes, muscular dystrophy, glycogen storage disease and other metabolic defects, congenital emphysema, Lesch-Nyhan syndrome, Niemann- One or more of the group consisting of Pick's disease, AIDS, hepatitis, hyperammonemia, and spinal ataxia.

在一方面，本发明提供了一种制备重组AAV病毒粒子的方法，所述方法包括在体外给细胞提供一种前述的核酸、一种AAV Rep蛋白的核酸编码序列、一种携带异源核酸序列的AAV载体基因组、有助于产生感染性AAV的辅助因子，以及允许AAV载体基因组包裹于前述核酸编码的AAV衣壳中，并实现重组AAV病毒粒子的组装，所述方法为AAV载体制备系统，包括两质粒包装系统、三质粒包装系统、杆状病毒包装系统和以Ad或HSV为辅毒的AAV包装系统等。In one aspect, the present invention provides a method for preparing recombinant AAV virions, the method comprising providing cells with the aforementioned nucleic acid, a nucleic acid coding sequence of AAV Rep protein, a heterologous nucleic acid sequence carrying The AAV vector genome, the auxiliary factor that helps to produce infectious AAV, and allows the AAV vector genome to be wrapped in the AAV capsid encoded by the aforementioned nucleic acid, and realizes the assembly of the recombinant AAV virion, the method is an AAV vector preparation system, Including two-plasmid packaging system, three-plasmid packaging system, baculovirus packaging system and AAV packaging system with Ad or HSV as co-virus, etc.

在一方面，本发明还提供了一种在体外向细胞递送异源核酸的方法，所述方法包括向细胞施用前述的核酸、病毒衣壳蛋白、重组病毒粒子、重组AAV病毒粒子和/或药物组合物，所述细胞是哺乳动物细胞，优选人细胞，优选人干细胞或肝细胞。In one aspect, the present invention also provides a method for delivering heterologous nucleic acid to cells in vitro, said method comprising administering the aforementioned nucleic acid, viral capsid protein, recombinant virion, recombinant AAV virion and/or drug to the cell Compositions, the cells are mammalian cells, preferably human cells, preferably human stem cells or hepatocytes.

在另一方面，本发明还提供了一种向哺乳动物递送异源核酸的方法，所述方法包括向哺乳动物个体施用有效剂量前述的核酸、病毒衣壳蛋白、重组病毒粒子、重组AAV病毒粒子、前述细胞和/或前述的药物组合物，其中所述哺乳动物为人类个体或灵长类个体。In another aspect, the present invention also provides a method for delivering heterologous nucleic acid to mammals, said method comprising administering an effective dose of the aforementioned nucleic acid, viral capsid protein, recombinant virion, or recombinant AAV virion to a mammalian individual . The aforementioned cell and/or the aforementioned pharmaceutical composition, wherein the mammal is a human individual or a primate individual.

在描述本发明之后，将在以下实施例中更详细地阐述本发明，这些实施例仅用于说明目的，并不用于限制本发明。After describing the present invention, it will be illustrated in more detail in the following examples, which are provided for illustrative purposes only and are not intended to limit the present invention.

实施例Example

下面的实施例描述了通过定向进化和体内筛选方法对AAV衣壳基因进行改组，以产生一组对肝脏靶向性优良或优异的载体。采用DNA改组技术对AAV衣壳基因进行改组并构建AAV衣壳基因库，用于小鼠模型体内筛选。分离小鼠肝脏富集的AAV突变体，并通过对AAV突变体外壳的体外和体内活性测试来对其靶向性进行确定。The following examples describe the shuffling of AAV capsid genes by directed evolution and in vivo screening methods to generate a set of vectors with good or excellent liver targeting. The AAV capsid gene was shuffled by DNA shuffling technology and an AAV capsid gene library was constructed for in vivo screening in mouse models. Mouse liver-enriched AAV mutants were isolated and their targeting determined by in vitro and in vivo activity testing of the AAV mutant coat.

实施例1嵌合AAV质粒文库构建Example 1 Chimeric AAV plasmid library construction

为了获得嵌合的AAV外壳基因Cap，首先利用上游引物primerA和下游引物primerB，分别从不同血清型AAV母本扩增外壳全长基因，选择的母本AAV外壳基因包括AAV1外壳(NCBI SequenceID:AF063497.1，Cap的核酸编码序列2223-4433)、AAV2外壳(NCBISequenceID:AF043303.1，Cap的核酸编码序列2203-4410)、AAV3B外壳(NCBI SequenceID:AF028705.1，Cap的核酸编码序列2208-4418)、AAV7外壳(NCBI SequenceID:AF513851.1，Cap的核酸编码序列2222-4435)、AAV8外壳(NCBI SequenceID:AF513852.1，Cap的核酸编码序列2121-4337)、AAV9外壳(NCBI SequenceID:530579.1，Cap的核酸编码序列1-2211)。上游引物primerA为5’-CCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3’，下游引物primerB为5’-ATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3’，在primeSTAR Max DNA polymerase(TaKaRa公司，货号R045A)作用下进行PCR扩增，扩增条件：95℃5min 1个循环；98℃8s，60℃5s，72℃15s，35个循环；72℃5min，1个循环。In order to obtain the chimeric AAV coat gene Cap, first use the upstream primer primerA and the downstream primer primerB to amplify the full-length coat gene from different serotype AAV mothers, and the selected maternal AAV coat gene includes the AAV1 coat (NCBI SequenceID: AF063497 .1, Cap's nucleic acid coding sequence 2223-4433), AAV2 shell (NCBI SequenceID: AF043303.1, Cap's nucleic acid coding sequence 2203-4410), AAV3B shell (NCBI SequenceID: AF028705.1, Cap's nucleic acid coding sequence 2208-4418 ), AAV7 shell (NCBI SequenceID: AF513851.1, the nucleic acid coding sequence 2222-4435 of Cap), AAV8 shell (NCBI SequenceID: AF513852.1, the nucleic acid coding sequence 2121-4337 of Cap), AAV9 shell (NCBI SequenceID: 530579.1, Cap's nucleic acid coding sequence 1-2211). The upstream primer primerA is 5'-CCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3', and the downstream primerB is 5'-ATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3'. PCR amplification is performed under the action of primeSTAR Max DNA polymerase (TaKaRa Company, product number R045A). Amplification conditions: 95°C for 5min 1 cycle; 98°C for 8s, 60°C for 5s, 72°C for 15s, 35 cycles; 72°C for 5min, 1 cycle.

将扩增获得的Cap基因等质量混合，获得总质量为4ug的DNA模板，用0.04U的DNaseI进行随机破碎消化，通常条件下22℃消化6-8min后，75℃灭活DNaseI酶10min。琼脂糖凝胶电泳获得弥散性的DNA条带，其长度大部分集中在500-1000bp，回收这部分长度范围的DNA片段。The equal masses of the amplified Cap genes were mixed to obtain a DNA template with a total mass of 4ug, which was randomly fragmented and digested with 0.04U of DNaseI. Under normal conditions, the DNaseI enzyme was inactivated at 75°C for 10 minutes after digestion at 22°C for 6-8min. Diffuse DNA bands were obtained by agarose gel electrophoresis, most of which were concentrated in the length of 500-1000bp, and the DNA fragments in this part of the length range were recovered.

回收的DNA片段首先互为引物进行随机的无引物扩增，为增加其多样性及扩增效率，采用非常规单一退火PCR模式(94℃60s，65℃90s，72℃90s，10个循环；94℃60s，62℃90s，72℃90s，10个循环；94℃60s，59℃90s，72℃90s，10个循环；94℃60s，56℃90s，72℃90s，10个循环；94℃60s，53℃90s，72℃90s，10个循环；94℃60s，50℃90s，72℃90s，10个循环；94℃60s，47℃90s，72℃90s，10个循环)。The recovered DNA fragments were first used as primers for random amplification without primers. In order to increase its diversity and amplification efficiency, an unconventional single annealing PCR mode was adopted (94°C for 60s, 65°C for 90s, 72°C for 90s, 10 cycles; 94°C for 60s, 62°C for 90s, 72°C for 90s, 10 cycles; 94°C for 60s, 59°C for 90s, 72°C for 90s, 10 cycles; 94°C for 60s, 56°C for 90s, 72°C for 90s, 10 cycles; 94°C 60s, 53°C 90s, 72°C 90s, 10 cycles; 94°C 60s, 50°C 90s, 72°C 90s, 10 cycles; 94°C 60s, 47°C 90s, 72°C 90s, 10 cycles).

以无引物PCR产物为模板，利用上游引物T-primerA(5’-ACGCCTGCCGTTCGACGATTCCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3’)和下游引物T-primerB(5’-ACGCGCGGATCTTCCAGAGATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3’)，在HiFi DNA聚合酶(TransGenBiotech公司，货号AS131-22)作用下，进行嵌合Cap基因的全长扩增。扩增程序94℃30s，62℃30s，72℃2.5min，共40循环。获得长度约为2500bp的嵌合全长Cap DNA片段，用HindIII和NotI进行双酶切，与经过同样双酶切处理的pSNAV2.3载体(含有单链AAV2基因组ITR及聚合酶Rep基因序列，缺失Cap基因)连接，连接产物电击转化到E.coli HST08细胞中，制备了一个库容大于1E+6个克隆的嵌合型AAV质粒文库。PCR鉴定为阳性克隆的质粒分别进行PstI、HaeIII、TaqI酶切鉴定，观察每个质粒酶切形态差异(图1)，以判定质粒文库的多样性。Using the primerless PCR product as a template, using the upstream primer T-primerA (5'-ACGCCTGCCGTTCGACGATTCCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3') and the downstream primer T-primerB (5'-ACGCGCGGATCTTCCAGAGATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3'), HiFi DNA polymerase (TransGenBiotech, product number AS131 Under the action of -22), the full-length amplification of the chimeric Cap gene is carried out. The amplification program was 94°C for 30s, 62°C for 30s, and 72°C for 2.5min, a total of 40 cycles. A chimeric full-length Cap DNA fragment with a length of about 2500bp was obtained, which was double-digested with HindIII and NotI, and pSNAV2.3 vector (containing single-stranded AAV2 genome ITR and polymerase Rep gene sequence, deleted Cap gene) ligation, the ligation product was transformed into E.coli HST08 cells by electric shock, and a chimeric AAV plasmid library with a library capacity greater than 1E+6 clones was prepared. Plasmids identified as positive clones by PCR were identified by PstI, HaeIII, and TaqI digestion, and the differences in the shape of each plasmid were observed (Figure 1) to determine the diversity of the plasmid library.

实施例2新型AAV病毒文库的包装及滴度检测Embodiment 2 Packaging and titer detection of novel AAV virus library

上述嵌合质粒文库在helper辅助下进行自我包装复制，包装方法为两步法，首先用文库质粒、R2C2、helper质粒以质量比1:1:2的比例进行转染，包装成AAV中间病毒库，此时组成病毒库的AAV为杂合型AAV，即病毒外壳蛋白不一定是由其包装的基因组表达，收获液进行氯仿抽提纯化后，检测基因组滴度；第二步以感染指数MOI为100感染293T细胞，保证每个细胞感染的病毒基因组拷贝数＜10，此时产生的最终AAV病毒库为纯合型，收获液进行氯仿抽提纯化后，检测基因组滴度(参见Muller等，2003，Nature Biotechnology，21:1040-1046)。在包装时使用文库质粒质量非常低，用来保证每一个新型AAV基因组能够被包装于其自身表达的Cap蛋白形成的外壳内。包装的病毒颗粒能够正常表达外壳蛋白，证明其能够形成完整的病毒外壳，具备感染能力。The above-mentioned chimeric plasmid library is self-packaged and replicated with helper assistance. The packaging method is a two-step method. First, the library plasmid, R2C2, and helper plasmids are transfected at a mass ratio of 1:1:2, and packaged into an AAV intermediate virus library. At this time, the AAV that constitutes the virus library is a heterozygous AAV, that is, the viral coat protein is not necessarily expressed by the genome packaged by it, and the titer of the genome is detected after the harvest solution is extracted and purified with chloroform; the second step is to use the infection index MOI as 100 infected 293T cells to ensure that the genome copy number of virus infected by each cell was less than 10, and the final AAV virus pool produced at this time was homozygous. , Nature Biotechnology, 21:1040-1046). The quality of library plasmids used in packaging is very low to ensure that each novel AAV genome can be packaged in the shell formed by its own expressed Cap protein. The packaged virus particles can normally express the coat protein, which proves that it can form a complete virus coat and has the ability to infect.

取适量纯化AAV样品，按下表(表1)配制DNase I消化反应混合液，37℃孵育30min，75℃孵育10min，失活DNase I。Take an appropriate amount of purified AAV sample, prepare a DNase I digestion reaction mixture as shown in the following table (Table 1), incubate at 37°C for 30 minutes, and incubate at 75°C for 10 minutes to inactivate DNase I.

表1Table 1

AAV样品AAV samples 5μl5μl 10×Dnase I缓冲液10×DNase I buffer 5μl5μl Dnase IDNase I 1μl1μl 无Rnase的水RNase-free water 39μl39μl 合计total 50μl50μl

处理后的AAV纯化样品稀释适当的倍数后，参照下表(表2)配置Q-PCR反应体系，并按照下列程序进行检测。After the treated AAV purified sample was diluted to an appropriate multiple, the Q-PCR reaction system was configured with reference to the following table (Table 2), and the detection was performed according to the following procedure.

表2Table 2

其中使用的引物参见下表(表3)：The primers used wherein refer to the following table (Table 3):

表3table 3

正向引物(5’-3’)Forward primer (5'-3') AAGGTGGTGGATGAGTGCTACAAAGGTGGTGGATGAGTGCTACA 反向引物(5’-3’)Reverse primer (5'-3') TGGAGCTCAGGCTGGGTTTTGGAGCTCAGGCTGGGTTT 探针引物Probe primer CCCCAATTACTTGCTCCCCCCAATTACTTGCTCC

包装产量结果参见下表(表4)：See the following table (Table 4) for the packaging output results:

表4Table 4

病毒载体名称Virus vector name 基因组滴度(vg/ml)Genome titer (vg/ml) 新型AAV病毒库Novel AAV Virus Library 2.5E+112.5E+11

实施例3新型AAV载体在小鼠体内筛选与富集Example 3 Screening and enrichment of novel AAV vectors in mice

新型AAV载体包装而成的病毒以1.5E+11vg/只剂量通过尾部静脉注射到8周龄C57BL/6J小鼠体内，进行肝脏靶向筛选与富集。注射三天后处死小鼠，收集全部肝脏组织，液氮匀浆后提取总DNA。再次用T-primerA/T-primerB这对引物进行扩增，扩增后的新型AAV全长Cap基因混合而成的DNA用上述方法构建到pSNAV2.3载体上。通过PCR鉴定获得370个阳性单克隆进行等质量混合，形成第二次的质粒文库。再包装成病毒文库，再次以1.5E+11vg/只尾部静脉注射到8周龄C57BL/6J小鼠体内，注射三天后处死小鼠，收集全部肝脏组织、心脏组织及骨骼肌。分别从以上组织随机挑选100、50、50个阳性克隆进行测序。测序后的阳性克隆用BioEdit，VectorNTI，ClustalX2及TreeViewX进行序列比对分析。The virus packaged by the new AAV vector was injected into 8-week-old C57BL/6J mice at a dose of 1.5E+11vg/mouse through the tail vein for liver-targeted screening and enrichment. Three days after the injection, the mice were sacrificed, all the liver tissues were collected, and the total DNA was extracted after liquid nitrogen homogenization. The pair of primers T-primerA/T-primerB was used to amplify again, and the DNA obtained by mixing the amplified novel AAV full-length Cap gene was constructed on the pSNAV2.3 vector by the above method. 370 positive single clones obtained through PCR identification were mixed with equal mass to form the second plasmid library. Repackage into a virus library, inject 1.5E+11vg/tail vein into 8-week-old C57BL/6J mice again, kill the mice three days after injection, and collect all liver tissues, heart tissues and skeletal muscles. 100, 50, and 50 positive clones were randomly selected from the above tissues for sequencing. The positive clones after sequencing were compared and analyzed by BioEdit, VectorNTI, ClustalX2 and TreeViewX.

选择肝脏高频出现，心脏和骨骼肌低频出现或不出现的新型AAV载体。共获得9组高频肝脏靶向新型AAV突变体(图2)，分析以上9种新型AAV-Cap突变体的核苷酸序列和氨基酸序列(图3)。由于其它4种包装滴度低，不适合工业化需要。所以，选择其中5种新型AAV，即L37、L57、L58、L107、L10进行后续感染活性测试实验。Select novel AAV vectors with high frequency in the liver and low or no heart and skeletal muscle. A total of 9 groups of high-frequency liver-targeting novel AAV mutants were obtained (Fig. 2), and the nucleotide and amino acid sequences of the above 9 novel AAV-Cap mutants were analyzed (Fig. 3). Because the titer of other 4 kinds of packages is low, it is not suitable for industrialization needs. Therefore, five of the novel AAVs, namely L37, L57, L58, L107, and L10, were selected for subsequent infection activity test experiments.

实施例4新型AAV对体外人肝细胞系感染活性Example 4 Novel AAV Infection Activity of Human Liver Cell Lines in Vitro

4.1采用绿色荧光蛋白检测系统测试新型AAV对体外人肝细胞系感染活性4.1 Using the green fluorescent protein detection system to test the infection activity of the new AAV on human liver cell lines in vitro

为了初步测试新型AAV对肝脏靶向性感染活性，首先对6种人正常肝脏或肝癌细胞系进行体外感染活性测试。将获得的新型AAV Cap基因L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-EGFP外源基因，包装出的病毒命名为AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。In order to preliminarily test the liver-targeted infection activity of the new AAV, the in vitro infection activity test was first performed on six human normal liver or liver cancer cell lines. The obtained new AAV Cap genes L57, L58, L107, and L10 were respectively constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-EGFP exogenous gene, and the packaged viruses were named AAV2/57, AAV2/ 58, AAV2/107, AAV2/10, and virus titer was detected (results not shown).

将上述4种携带绿色荧光蛋白的重组新型AAV病毒和携带相同绿色荧光蛋白的AAV2/8重组病毒分别同时感染人多种肝细胞系，感染48小时后，流式检测感染活性。为了避免病毒载体在某一种细胞系上感染效率过低或过饱和，每种细胞系至少使用两种感染指数(MOI)进行感染，从图(4A-4F)可看出，当使用2-3种感染指数感染人肝脏细胞系7721、HepG2、Huh7、L02时，新型AAV2/10、AAV2/57、AAV2/58、AAV2/107对人肝脏细胞系的感染均表现出明显的剂量依赖关系。接下来以其中一种MOI针对每种细胞的感染结果进行描述，其余MOI结果详见图4。The above four recombinant AAV viruses carrying the green fluorescent protein and the AAV2/8 recombinant virus carrying the same green fluorescent protein were simultaneously infected with various human liver cell lines. After 48 hours of infection, the infection activity was detected by flow cytometry. In order to avoid low infection efficiency or oversaturation of viral vectors on a certain cell line, each cell line is infected with at least two infection indices (MOI). It can be seen from Figures (4A-4F) that when using 2- When the three infection indices infected human liver cell lines 7721, HepG2, Huh7, and L02, the infection of the new AAV2/10, AAV2/57, AAV2/58, and AAV2/107 on human liver cell lines showed a significant dose-dependent relationship. Next, the infection results of each type of cell are described with one of the MOIs, and the results of other MOIs are shown in Figure 4.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系7402，感染48h后进行流式细胞术检测分析，结果(图4A)显示：在此MOI下，AAV2/10感染效率是AAV2/8感染效率的4.6倍，AAV2/57感染效率是AAV2/8感染效率的2.2倍，AAV2/58感染效率是AAV2/8感染效率的3.9倍，AAV2/107感染效率是AAV2/8感染效率的1.6倍。When the above four recombinant viruses infected the human liver cell line 7402 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4A) showed that: under this MOI, the infection efficiency of AAV2/10 was The infection efficiency of AAV2/8 is 4.6 times, the infection efficiency of AAV2/57 is 2.2 times that of AAV2/8, the infection efficiency of AAV2/58 is 3.9 times that of AAV2/8, and the infection efficiency of AAV2/107 is that of AAV2/8 1.6 times.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系7721，感染48h后进行流式细胞术检测分析，结果(图4B)显示，在此MOI下，AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的23.4倍，AAV2/107感染效率是AAV2/8感染效率的21.8倍，AAV2/57感染效率是AAV2/8感染效率的4.6倍，AAV2/58感染效率是AAV2/8感染效率的15.9倍。When the above four recombinant viruses infected human liver cell line 7721 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4B) showed that under this MOI, AAV2/10 had the highest infection efficiency , AAV2/107 infection efficiency is close to it, AAV2/10 infection efficiency is 23.4 times of AAV2/8 infection efficiency, AAV2/107 infection efficiency is 21.8 times of AAV2/8 infection efficiency, AAV2/57 infection efficiency is AAV2/8 infection efficiency 4.6 times of AAV2/58 infection efficiency is 15.9 times of AAV2/8 infection efficiency.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系HepG2，感染48h后进行流式细胞术检测分析，结果(图4C)显示，在此MOI下，AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的14.8倍，AAV2/107感染效率是AAV2/8感染效率的13.5倍。AAV2/57感染效率是AAV2/8感染效率的5.5倍，AAV2/58感染效率是AAV2/8感染效率的10.9倍。When the above four recombinant viruses infected the human liver cell line HepG2 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4C) showed that AAV2/10 had the highest infection efficiency at this MOI , the infection efficiency of AAV2/107 is close to it, the infection efficiency of AAV2/10 is 14.8 times that of AAV2/8, and the infection efficiency of AAV2/107 is 13.5 times that of AAV2/8. The infection efficiency of AAV2/57 was 5.5 times that of AAV2/8, and the infection efficiency of AAV2/58 was 10.9 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系Huh7，感染48h后进行流式细胞术检测分析，结果(图4D)显示，在此MOI下AAV2/10感染效率最高，是AAV2/8的31.8倍。AAV2/57感染效率是AAV2/8的1.4倍，AAV2/58感染效率是AAV2/8的6.6倍，AAV2/107感染效率是AAV2/8的19.5倍。When the above four recombinant viruses infected the human liver cell line Huh7 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4D) showed that AAV2/10 had the highest infection efficiency at this MOI, It is 31.8 times that of AAV2/8. The infection efficiency of AAV2/57 was 1.4 times that of AAV2/8, the infection efficiency of AAV2/58 was 6.6 times that of AAV2/8, and the infection efficiency of AAV2/107 was 19.5 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为10000感染人肝脏细胞系Huh6，感染48h后进行流式细胞术检测分析，结果(图4E)显示，在此MOI下AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的2.9倍，AAV2/107感染效率是AAV2/8感染效率的2.7倍。AAV2/57感染效率是AAV2/8感染效率的1.8倍，AAV2/58感染效率是AAV2/8感染效率的2.6倍。When the above four recombinant viruses infected the human liver cell line Huh6 with an infection index (MOI) of 10,000, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4E) showed that AAV2/10 had the highest infection efficiency at this MOI, The infection efficiency of AAV2/107 was close to it, the infection efficiency of AAV2/10 was 2.9 times that of AAV2/8, and the infection efficiency of AAV2/107 was 2.7 times that of AAV2/8. The infection efficiency of AAV2/57 was 1.8 times that of AAV2/8, and the infection efficiency of AAV2/58 was 2.6 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为50000感染人肝脏细胞系L02，感染48h后进行流式细胞术检测分析，结果(图4F)显示，在此MOI下，AAV2/10感染效率最高，是AAV2/8感染效率的42倍，AAV2/107感染效率是AAV2/8感染效率的26.8倍，AAV2/57感染效率是AAV2/8感染效率的2.5倍，AAV2/58感染效率是AAV2/8感染效率的10.8倍。When the above four recombinant viruses infected the human liver cell line L02 with an infection index (MOI) of 50,000, flow cytometry analysis was performed 48 hours after infection, and the results (Figure 4F) showed that AAV2/10 had the highest infection efficiency at this MOI , is 42 times the infection efficiency of AAV2/8, the infection efficiency of AAV2/107 is 26.8 times the infection efficiency of AAV2/8, the infection efficiency of AAV2/57 is 2.5 times the infection efficiency of AAV2/8, and the infection efficiency of AAV2/58 is AAV2/8 10.8 times the infection efficiency.

由以上结果可知，这4种重组新型AAV对人肝细胞或肝癌细胞系感染活性均高于阳性对照AAV2/8。From the above results, it can be seen that the infection activity of these four recombinant novel AAVs on human hepatocytes or liver cancer cell lines is higher than that of the positive control AAV2/8.

4.2采用荧光素酶检测系统测试新型AAV对体外人肝细胞系感染活性4.2 Using the luciferase detection system to test the infection activity of the novel AAV on human hepatic cell lines in vitro

将获得的新型AAV Cap基因L37、L57、L58、L107、L10构建到含有2型AAV Rep的RC质粒载体上，该质粒上携带有CAG-Luciferase外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。上述重组新型AAV病毒以感染指数MOI为500对5种正常肝脏或肝癌细胞系进行感染，48小时后，采用荧光素酶检测系统检测感染活性。在每个检测值上扣除NC背景值之后进行感染活性的比较。The obtained new AAV Cap genes L37, L57, L58, L107, L10 were constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-Luciferase exogenous gene, and the packaged virus was named AAV2/37 , AAV2/57, AAV2/58, AAV2/107, AAV2/10, and detect virus titers (results not shown). The above-mentioned recombinant novel AAV virus was used to infect five normal liver or liver cancer cell lines with an MOI of 500. After 48 hours, the infection activity was detected by a luciferase detection system. Comparisons of infective activity were performed after subtracting the NC background value from each assay.

上述5种重组新型AAV病毒对人肝细胞系Huh7感染活性比较，结果(图5A)显示，感染活性由高到低依次为AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。The comparison of the infective activity of the above five recombinant novel AAV viruses on the human liver cell line Huh7 showed that the infective activity from high to low was AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/ 57.

上述5种重组新型AAV病毒对人肝细胞系7402感染活性比较，结果(图5B)显示，AAV2/10的活性最高，AAV2/107与其接近，AAV2/58与AAV2/37感染活性接近，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。The comparison of the infection activity of the above five recombinant new AAV viruses on the human liver cell line 7402 showed that AAV2/10 had the highest activity, AAV2/107 was close to it, AAV2/58 was close to AAV2/37, and the infection activity was close to that of AAV2/37. The order from high to low is AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/57.

上述5种重组新型AAV病毒对人肝细胞系7721感染活性比较，结果(图5C)显示，AAV2/10的活性最高，AAV2/58与AAV2/37感染活性接近，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。The comparison of the infection activity of the above five recombinant new AAV viruses on the human liver cell line 7721 showed that AAV2/10 had the highest activity, AAV2/58 was close to AAV2/37, and the order of infection activity from high to low was AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/57.

上述5种重组新型AAV病毒对人肝细胞系HepG2感染活性比较，结果(图5D)显示，AAV2/10的活性最高，感染活性由高到低依次是AAV2/10＞AAV2/37＞AAV2/107＞AAV2/58＞AAV2/57。The comparison of the infection activity of the above five recombinant new AAV viruses on the human liver cell line HepG2 showed that AAV2/10 had the highest activity, and the order of infection activity from high to low was AAV2/10>AAV2/37>AAV2/107 > AAV2/58 > AAV2/57.

上述5种重组新型AAV病毒在人肝细胞系L02感染活性比较，结果(图5E)显示，AAV2/10的活性最高，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/37＞AAV2/58＞AAV2/57。The comparison of the infection activity of the above five recombinant new AAV viruses in the human liver cell line L02 showed that AAV2/10 had the highest activity, and the order of infection activity from high to low was AAV2/10>AAV2/107>AAV2/37 > AAV2/58 > AAV2/57.

由以上结果可知，在上述几种新型AAV中，AAV2/10活性相对最高，AAV2/57活性相对较低。From the above results, it can be seen that among the above-mentioned several novel AAVs, AAV2/10 has the highest activity and AAV2/57 has the lowest activity.

实施例5新型AAV体内活性测试Example 5 In vivo activity test of novel AAV

5种新型AAV Cap外壳基因L37、L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-Luciferase外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，检测病毒滴度(结果未显示)，并将这些新型AAV病毒进行体内感染活性比较。选择6-8周C57BL/6J小鼠以1E+11vg/只的剂量进行尾部静脉注射，2周后处死小鼠，提取组织DNA，测试肝脏、心脏、骨骼肌中AAV载体基因组拷贝数，并分别测试这3种组织中荧光素酶表达情况。Five new AAV Cap coat genes L37, L57, L58, L107, and L10 were respectively constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the exogenous CAG-Luciferase gene, and the packaged virus was named AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10, the virus titers were detected (results not shown), and the in vivo infection activities of these novel AAV viruses were compared. Select 6-8 week old C57BL/6J mice with a dose of 1E+11vg/mouse for tail vein injection, kill the mice 2 weeks later, extract tissue DNA, test the copy number of AAV vector genome in liver, heart, and skeletal muscle, and respectively The expression of luciferase in these three tissues was tested.

载体基因组拷贝数检测结果(图6)显示，AAV2/58相较于其它新型AAV在肝脏中的载体基因组拷贝数最高，且与同组的心脏和骨骼肌相比，肝脏中的基因组拷贝数显著升高；虽然AAV2/37、AAV2/57、AAV2/107和AAV2/10载体基因组拷贝数比AAV2/58载体基因组拷贝数低，但相较于同组的心脏和骨骼肌中的基因拷贝数来比，肝脏中的基因组拷贝数较高。The detection results of the vector genome copy number (Figure 6) showed that AAV2/58 had the highest vector genome copy number in the liver compared with other new AAVs, and compared with the heart and skeletal muscle of the same group, the genome copy number in the liver was significantly Elevated; although AAV2/37, AAV2/57, AAV2/107 and AAV2/10 vector genome copy number is lower than AAV2/58 vector genome copy number, compared with the gene copy number in heart and skeletal muscle of the same group Genome copy number was higher in the liver than in the liver.

荧光素酶表达情况检测结果(图7A-E)显示，AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10在肝脏中的荧光素酶表达情况比其在同组的心脏和骨骼肌中荧光素酶的表达水平高。The detection results of luciferase expression (Figure 7A-E) showed that the luciferase expression of AAV2/37, AAV2/57, AAV2/58, AAV2/107, and AAV2/10 in the liver was higher than that in the heart of the same group. and high expression levels of luciferase in skeletal muscle.

这些结果表明这5种新型AAV载体都具有优良的肝脏靶向性，且AAV2/10相较于其它新型AAV载体在肝脏中的荧光素酶表达水平最高，其次是AAV2/58，之后是AAV2/37和AAV2/107在肝脏中荧光素酶表达水平接近，AAV2/57相对最低。These results indicated that these five novel AAV vectors all had excellent liver targeting, and AAV2/10 had the highest luciferase expression level in the liver compared with other novel AAV vectors, followed by AAV2/58, followed by AAV2/ 37 and AAV2/107 had similar levels of luciferase expression in the liver, and AAV2/57 was relatively the lowest.

实施例6猴血清或人血清中新型AAV中和抗体检测Example 6 Detection of novel AAV neutralizing antibodies in monkey serum or human serum

因为AAV自然被人和其他灵长类感染，产生的针对天然型AAV的中和抗体，会大大降低AAV的半衰期。这里检测新型AAV的中和抗体水平。Because AAV is naturally infected by humans and other primates, neutralizing antibodies against natural AAV will greatly reduce the half-life of AAV. The level of neutralizing antibodies to novel AAV was detected here.

6.1猴血清中新型AAV和AAV2/8中和抗体的检测与比较6.1 Detection and comparison of novel AAV and AAV2/8 neutralizing antibodies in monkey serum

本实验采用病毒感染，固定其MOI值，将血清进行稀释，检测食蟹猴血清中新型AAV载体与AAV2/8的中和抗体水平。在一系列血清稀释度中找到血清病毒混合物感染细胞效率为病毒无血清感染细胞效率的50％，以此稀释度的倒数作为中和抗体量，针对每种病毒载体评价其中和抗体(参见Lochrie MA等，2006，Virology 353：68-82；Mori S等，2006，Jpn JInfect Dis 59：285-293等)。本实验共对10只食蟹猴血清进行检测，血清系列稀释后，分别判定并比较每个样品中新型AAV的中和抗体与AAV2/8的中和抗体。In this experiment, the virus was used to infect, the MOI value was fixed, and the serum was diluted to detect the neutralizing antibody levels of the new AAV vector and AAV2/8 in the serum of cynomolgus monkeys. In a series of serum dilutions, the efficiency of infecting cells with serum-virus mixture was found to be 50% of the efficiency of virus-free serum infecting cells, and the reciprocal of this dilution was used as the amount of neutralizing antibodies, and neutralizing antibodies were evaluated for each viral vector (see Lochrie MA et al., 2006, Virology 353:68-82; Mori S et al., 2006, Jpn JInfect Dis 59:285-293, etc.). In this experiment, the serum of 10 cynomolgus monkeys was tested. After the serum was serially diluted, the neutralizing antibody of the new AAV and the neutralizing antibody of AAV2/8 in each sample were determined and compared.

具体步骤如下：将获得的新型AAV Cap基因L37、L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-EGFP外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。7402细胞接种24孔板，食蟹猴血清样品进行系列稀释，新型AAV病毒AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10和携带相同绿色荧光蛋白的AAV2/8重组病毒感染指数MOI为2000，稀释的血清样品与病毒液1:1进行混合，混合后37℃共孵育1h，然后细胞中加入血清样品与病毒液的混合液。48h后收获细胞，流式细胞术检测感染效率。The specific steps are as follows: the obtained new AAV Cap genes L37, L57, L58, L107, and L10 were respectively constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-EGFP exogenous gene, and the packaged virus was named as AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10, and virus titers were detected (results not shown). 7402 cells were inoculated into 24-well plates, cynomolgus monkey serum samples were serially diluted, new AAV viruses AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10 and AAV2/8 recombinant viruses carrying the same green fluorescent protein The MOI of infection index is 2000, and the diluted serum sample is mixed with the virus solution at 1:1, and incubated at 37°C for 1 hour after mixing, and then the mixture of the serum sample and the virus solution is added to the cells. The cells were harvested after 48 h, and the infection efficiency was detected by flow cytometry.

中和抗体检测结果如表5所示，中和抗体量低于5时，视为针对该种病毒中和抗体为阴性。10份血清样本分别编号为1#、2#、3#、4#、5#、6#、7#、8#、9#、10#，其中7#针对所有病毒载体中和抗体均为阴性；1#、9#针对新型AAV载体中和抗体均为阴性，而针对AAV2/8的中和抗体均为阳性；2#、3#、4#、5#、6#、8#、10#样品的新型AAV中和抗体均明显低于AAV2/8。The results of the neutralizing antibody test are shown in Table 5. When the amount of neutralizing antibody is less than 5, it is considered negative for the neutralizing antibody against the virus. The 10 serum samples were numbered 1#, 2#, 3#, 4#, 5#, 6#, 7#, 8#, 9#, 10#, of which 7# was negative for all virus vector neutralizing antibodies ; 1#, 9# were negative for the neutralizing antibody against the new AAV vector, but were positive for the neutralizing antibody against AAV2/8; 2#, 3#, 4#, 5#, 6#, 8#, 10# The new AAV neutralizing antibodies of the samples were significantly lower than those of AAV2/8.

因此通过对食蟹猴血清中新型AAV和AAV2/8中和抗体的综合比较，在猴群体中新型AAV的中和抗体明显低于AAV2/8的中和抗体，新型AAV作为药物递送载体，具有更低的免疫原性。Therefore, through a comprehensive comparison of the new AAV and AAV2/8 neutralizing antibodies in the serum of cynomolgus monkeys, the neutralizing antibodies of the new AAV in monkey populations are significantly lower than those of AAV2/8, and the new AAV, as a drug delivery carrier, has Lower immunogenicity.

表5食蟹猴10份血清中和抗体检测结果Table 5 Test results of neutralizing antibody in 10 sera of cynomolgus monkeys

6.2人血清中新型AAV和AAV2/8中和抗体的检测与比较6.2 Detection and comparison of novel AAV and AAV2/8 neutralizing antibodies in human serum

本实验采用固定病毒感染指数MOI，将血清进行稀释，检测个体人血清中新型AAV的中和抗体的水平与AAV2/8中和抗体水平。在一系列血清稀释度中找到血清病毒混合物感染细胞效率为病毒无血清感染细胞效率的50％，以此稀释度的倒数作为中和抗体量，针对每种病毒载体评价其中和抗体。本实验共进行10人份血清检测，对血清进行了一系列稀释，分别比较每个样品中新型AAV的中和抗体与AAV2/8的中和抗体。In this experiment, fixed virus infection index MOI was used to dilute the serum to detect the level of neutralizing antibody to new AAV and the level of neutralizing antibody to AAV2/8 in individual human serum. In a series of serum dilutions, find that the cell infection efficiency of the serum-virus mixture is 50% of the cell infection efficiency of the virus-free serum, and the reciprocal of this dilution is used as the amount of neutralizing antibodies, and the neutralizing antibodies are evaluated for each virus vector. In this experiment, a total of 10 human sera were tested, and the sera were serially diluted to compare the neutralizing antibodies of the novel AAV and the neutralizing antibodies of AAV2/8 in each sample.

具体步骤如同6.1中具体步骤所示，不同之处在于，所选用的血清为正常人血清。本实验随机挑选10份正常人血清进行中和抗体实验，分别比较每个样品中新型AAV中和抗体和AAV2/8中和抗体。中和抗体检测结果如表6所示，其中3#样品针对所有病毒中和抗体均表现为阴性；2#样品针对新型AAV中和抗体均为阴性，而针对AAV2/8中和抗体为阳性；1#、4#、5#、6#、7#、8#、9#、10#样品针对新型AAV的中和抗体均明显低于AAV2/8的中和抗体。实验结果证明在人类群体中，针对新型AAV的中和抗体与AAV2/8的中和抗体相比较，新型AAV更适合作为药物递送载体，能够降低反应原性。The specific steps are as shown in the specific steps in 6.1, the difference is that the serum used is normal human serum. In this experiment, 10 normal human sera were randomly selected for neutralizing antibody experiments, and the new AAV neutralizing antibodies and AAV2/8 neutralizing antibodies in each sample were compared. The neutralizing antibody detection results are shown in Table 6, wherein the 3# sample is negative for all virus neutralizing antibodies; the 2# sample is negative for the new AAV neutralizing antibody, and positive for the AAV2/8 neutralizing antibody; 1#, 4#, 5#, 6#, 7#, 8#, 9#, 10# samples had significantly lower neutralizing antibodies against the new AAV than AAV2/8. The experimental results prove that in the human population, compared with neutralizing antibodies against AAV2/8, the new AAV is more suitable as a drug delivery carrier and can reduce reactogenicity.

表6 10份人血清中和抗体检测结果Table 6 Test results of neutralizing antibodies in 10 human sera

综合以上结果，通过新型AAV与AAV2/8的血清中中和抗体的比较，新型AAV相比AAV2/8载体具有更低的免疫原性，更适合作为基因治疗载体。Based on the above results, the comparison of serum neutralizing antibodies between the new AAV and AAV2/8 shows that the new AAV has lower immunogenicity than the AAV2/8 vector and is more suitable as a gene therapy vector.

序列表sequence listing

<110> 舒泰神（北京) 生物制药股份有限公司；北京三诺佳邑生物技术有限责任公司<110> Shutaishen (Beijing) Biopharmaceutical Co., Ltd.; Beijing Sannuojiayi Biotechnology Co., Ltd.

<120> 一组肝靶向新型腺相关病毒的获得及其应用<120> Obtainment and application of a group of liver-targeting novel adeno-associated viruses

<130> 2020070729<130> 2020070729

<160> 18<160> 18

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 1<400> 1

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60

gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120

aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaaaaaga ggatccttga gcctcttggt ctggttgagg aagcggctaa gacggctcct 420gcgaaaaaga ggatccttga gcctcttggt ctggttgagg aagcggctaa gacggctcct 420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480ggaaagaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480

aaatcgggtg cacagcccgc taaaaagaga ctcaactttg gtcagactgg cgactcagag 540aaatcgggtg cacagcccgc taaaaagaga ctcaactttg gtcagactgg cgactcagag 540

tcagttccag accctcaacc tctcggagaa ccaccagcag cgccctctgg tgtgggacct 600tcagttccag accctcaacc tctcggagaa ccaccagcag cgccctctgg tgtgggacct 600

aatacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660aatacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720

accaccagca ccagaacctg ggccctgccc acttacaaca accatctcta caagcaaatc 780accaccagca ccagaacctg ggccctgccc acttacaaca accatctcta caagcaaatc 780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840

gggtattttg actttaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg actttaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080

ggttgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140ggttgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt accgcctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt accgcctgga atatttccct 1200

tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggaagtgcct 1260tctcagatgc tgagaacgggg caacaacttt accttcagct acacctttga ggaagtgcct 1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac 1380cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac 1380

ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500

tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct 1560tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct 1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat aagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat aagcggtgtc 1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca cgttatggga 1800gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca cgttatggga 1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980

gagttttcag ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040gagttttcag ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040

gtggagattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag 2100gtggagattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag 2100

tacacatcca attatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160tacacatcca attatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160

tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a 2211tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a 2211

<210> 2<210> 2

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 2<400> 2

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu SerMet Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys ProGlu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro

50 55 60 50 55 60

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr AspVal Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp

65 70 75 8065 70 75 80

Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His AlaGln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu Pro

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile GlyPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly

145 150 155 160145 150 155 160

Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro ProGly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly GlyAla Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn AlaAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His LeuThr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu

245 250 255 245 250 255

Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn HisTyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His

260 265 270 260 265 270

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg PheTyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe

275 280 285 275 280 285

His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn AsnHis Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn

290 295 300 290 295 300

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile GlnTrp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln

305 310 315 320305 310 315 320

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn AsnVal Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn

325 330 335 325 330 335

Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu ProLeu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro

340 345 350 340 345 350

Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro AlaTyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala

355 360 365 355 360 365

Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn GlyAsp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly

370 375 380 370 375 380

Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Arg Leu Glu Tyr Phe ProSer Gln Ala Val Gly Arg Ser Ser Phe Tyr Arg Leu Glu Tyr Phe Pro

385 390 395 400385 390 395 400

Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr PheSer Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe

405 410 415 405 410 415

Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu AspGlu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp

420 425 430 420 425 430

Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn ArgArg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg

435 440 445 435 440 445

Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe SerThr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser

450 455 460 450 455 460

Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu ProArg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro

465 470 475 480465 470 475 480

Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp AsnGly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn

485 490 495 485 490 495

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu AsnAsn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn

500 505 510 500 505 510

Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His LysGly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys

515 520 525 515 520 525

Asp Asp Lys Asp Lys Phe Phe Pro Ile Ser Gly Val Met Ile Phe GlyAsp Asp Lys Asp Lys Phe Phe Pro Ile Ser Gly Val Met Ile Phe Gly

530 535 540 530 535 540

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met IleLys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile

545 550 555 560545 550 555 560

Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu ArgThr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg

565 570 575 565 570 575

Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro AlaPhe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala

580 585 590 580 585 590

Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp GlnThr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln

595 600 605 595 600 605

Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro HisAsp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His

610 615 620 610 615 620

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly LeuThr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu

625 630 635 640625 630 635 640

Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro AlaLys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala

645 650 655 645 650 655

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile ThrAsn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr

660 665 670 660 665 670

Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu GlnGln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln

675 680 685 675 680 685

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser AsnLys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn

690 695 700 690 695 700

Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly LeuTyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu

705 710 715 720705 710 715 720

Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro LeuTyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu

725 730 735 725 730 735

<210> 3<210> 3

<211> 2214<211> 2214

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 3<400> 3

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300

caggggcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggggcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg 420gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg 420

ggaaaaaaga ggccggtaga gccgccacct cagcgttccc ccgactcctc cacgggcatc 480ggaaaaaaga ggccggtaga gccgccacct cagcgttccc ccgactcctc cacgggcatc 480

ggcaagaaag gccagcagcc cgccaaaaag agactcaatt ttggtcagac tggcgactca 540ggcaagaaag gccagcagcc cgccaaaaag agactcaatt ttggtcagac tggcgactca 540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga 600gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga 600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac 660cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac 660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc 720ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc 720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa 780atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa 780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc 840atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcacccccc 840

tgggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga 900tggggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga 900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaaact cttcaacatc 960ctcatcaaca acaactgggg attccgaccc aagagactca acttcaaact cttcaacatc 960

caagtcaagg aggtcacgac gaatgacggc gttacgacca tcgctaataa ccttaccagc 1020caagtcaagg aggtcacgac gaatgacggc gttacgacca tcgctaataa ccttaccagc 1020

acgattcagg tattctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080acgattcagg tattctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcaata cggctacctg 1140cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcaata cggctacctg 1140

acgctcaaca atggcagcca agccgtggga cgttcatcct tttactgcct ggaatatttc 1200acgctcaaca atggcagcca agccgtggga cgttcatcct tttactgcct ggaatatttc 1200

ccttctcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggaagtg 1260ccttctcaga tgctgagaac gggcaacaac tttacccttca gctacacctt tgaggaagtg 1260

cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc 1320cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc 1320

gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag 1380gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag 1380

gacttgctgt ttagccgtgg gtcaccagct ggcatgtctg ttcagcccaa aaactggcta 1440gacttgctgt ttagccgtgg gtcaccagct ggcatgtctg ttcagcccaa aaactggcta 1440

cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500

aactttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560aactttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560

cctggcactg ctatggcctc acacaaagac gacgaagaca agttctttcc catgagcggt 1620cctggcactg ctatggcctc acacaaagac gacgaagaca agttctttcc catgagcggt 1620

gtcatgattt ttggaaaaga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg 1680gtcatgattt ttggaaaaga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg 1680

attacagacg aagaggaaat taaagccact aaccccgtgg ccaccgaaag atttgggacc 1740attacagacg aagaggaaat taaagccact aaccccgtgg ccaccgaaag atttgggacc 1740

gtggcagtca atttccagag cagcagcaca gaccctgcga ccggagatgt gcatgctatg 1800gtggcagtca atttccagag cagcagcaca gaccctgcga ccggagatgt gcatgctatg 1800

ggagcattac ctggcatggt gtggcaagat agagacgtgt acctgcaggg tcccatttgg 1860ggagcattac ctggcatggt gtggcaagat agagacgtgt acctgcaggg tcccatttgg 1860

gccaaaattc ctcacacaga tggacacttt cacccgtctc ctcttatggg cggctttgga 1920gccaaaattc ctcacacaga tggacacttt cacccgtctc ctcttatggg cggctttgga 1920

ctcaagaacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg 1980ctcaagaacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg 1980

gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg 2040gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg 2040

agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg 2100agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg 2100

cagtatacat ctaactatgc aaaatctgcc aacgttgatt ttactgagga caacaatgga 2160cagtatacat ctaactatgc aaaatctgcc aacgttgatt ttactgagga caacaatgga 2160

ctttatactg agcctcgccc cattggcacc cgttacctta cccgtcccct gtaa 2214ctttatactg agcctcgccc cattggcacc cgttacctta cccgtcccct gtaa 2214

<210> 4<210> 4

<211> 737<211> 737

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 4<400> 4

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His AlaGln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Gly Arg Leu Gln Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Gly Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Pro Pro Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly IlePro Val Glu Pro Pro Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile

145 150 155 160145 150 155 160

Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly GlnGly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln

165 170 175 165 170 175

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu ProThr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro

180 185 190 180 185 190

Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly GlyPro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly

195 200 205 195 200 205

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly AsnGly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn

210 215 220 210 215 220

Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg ValAla Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val

225 230 235 240225 230 235 240

Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn HisIle Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His

245 250 255 245 250 255

Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp AsnLeu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn

260 265 270 260 265 270

His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn ArgHis Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg

275 280 285 275 280 285

Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn AsnPhe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn

290 295 300 290 295 300

Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn IleAsn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile

305 310 315 320305 310 315 320

Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala AsnGln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn

325 330 335 325 330 335

Asn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln LeuAsn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu

340 345 350 340 345 350

Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe ProPro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro

355 360 365 355 360 365

Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn AsnAla Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn

370 375 380 370 375 380

Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr PheGly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe

385 390 395 400385 390 395 400

Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr ThrPro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr

405 410 415 405 410 415

Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser LeuPhe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu

420 425 430 420 425 430

Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu AsnAsp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn

435 440 445 435 440 445

Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu PheArg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe

450 455 460 450 455 460

Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp LeuSer Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu

465 470 475 480465 470 475 480

Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr AspPro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp

485 490 495 485 490 495

Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn LeuAsn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu

500 505 510 500 505 510

Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser HisAsn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His

515 520 525 515 520 525

Lys Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile PheLys Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe

530 535 540 530 535 540

Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val MetGly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met

545 550 555 560545 550 555 560

Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr GluIle Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu

565 570 575 565 570 575

Arg Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp ProArg Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp Pro

580 585 590 580 585 590

Ala Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val TrpAla Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp

595 600 605 595 600 605

Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile ProGln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro

610 615 620 610 615 620

His Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe GlyHis Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly

625 630 635 640625 630 635 640

Leu Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val ProLeu Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro

645 650 655 645 650 655

Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe IleAla Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile

660 665 670 660 665 670

Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu LeuThr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu

675 680 685 675 680 685

Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr SerGln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser

690 695 700 690 695 700

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Glu Asp Asn Asn GlyAsn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Glu Asp Asn Asn Gly

705 710 715 720705 710 715 720

Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg ProLeu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro

725 730 735 725 730 735

LeuLeu

<210> 5<210> 5

<211> 2208<211> 2208

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 5<400> 5

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180

cagcagctca aagccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300cagcagctca aagccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420gcgaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480

aagacaggcc agcagcccgc taaaaaaaga ctcaattttg gtcagactgg cgactcagag 540aagacaggcc agcagcccgc taaaaaaaga ctcaattttg gtcagactgg cgactcagag 540

tcagttccag accctcaacc tctcggagaa cctccagcag cgccctctgg tgtgggacct 600tcagttccag accctcaacc tctcggagaa cctccagcag cgccctctgg tgtgggacct 600

aatacaatgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660aatacaatgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtagtt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720gtggggtagtt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt 780accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt 780

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg 840tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg 840

tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatt 900tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatt 900

aacaacaatt ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc 960aacaacaatt ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc 960

aaggaggtca cgacgaatga tggcgtcaca accatcgcta ataaccttac cagcacggtt 1020aaggaggtca cgacgaatga tggcgtcaca accatcgcta ataaccttac cagcacggtt 1020

caagtcttct cggactcgga gtaccagctt ccgtacgtcc tcggctctgc gcaccagggc 1080caagtcttct cggactcgga gtaccagctt ccgtacgtcc tcggctctgc gcaccagggc 1080

tgcctgcctc cgttcccggc ggacgtcttc atgattcctc agtacggcta cctgactccc 1140tgcctgcctc cgttcccggc ggacgtcttc atgattcctc agtacggcta cctgactccc 1140

aacaatggca gtcagtctgt gggacgttcc tccttctact gcctggagta cttcccttct 1200aacaatggca gtcagtctgt gggacgttcc tccttctact gcctggagta cttcccttct 1200

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga agtgcctttc 1260cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga agtgcctttc 1260

cacagcagct acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag 1320cacagcagct acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag 1320

tacctgtatt acctgaacag aactcaaaat cagtccggaa gtgcccaaaa caaggacttg 1380tacctgtatt acctgaacag aactcaaaat cagtccggaa gtgcccaaaa caaggacttg 1380

ctgtttagcc gtgggtctcc agctggcatg tctgttcagc ccaaaaactg gctacctgga 1440ctgtttagcc gtgggtctcc agctggcatg tctgttcagc ccaaaaactg gctacctgga 1440

ccctgttatc agcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt 1500ccctgttatc agcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt 1500

acctggactg gtgcttcaaa atataacctc aatgggcgtg aatccatcat caaccctggc 1560acctggactg gtgcttcaaa atataacctc aatgggcgtg aatccatcat caaccctggc 1560

actgctatgg cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg 1620actgctatgg cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg 1620

atttttggaa aggagagcgc cggagcttca aacactgcat tggacaatgt catgatcaca 1680attttggaa aggagagcgc cggagcttca aacactgcat tggacaatgt catgatcaca 1680

gacgaagagg aaatcaaagc cactaacccc gtggccaccg aaagatttgg gaccgtggca 1740gacgaagagg aaatcaaagc cactaaccccc gtggccaccg aaagatttgg gaccgtggca 1740

gtcaatctcc agagcagcag cacagaccct gcgaccggag atgtgcatgt tatgggagcc 1800gtcaatctcc agagcagcag cacagaccct gcgaccggag atgtgcatgt tatggggagcc 1800

ttacctggaa tggtgtggca agacagagac gtatacctgc agggtcctat ttgggccaaa 1860ttacctggaa tggtgtggca aagacagagac gtatacctgc agggtcctat ttgggccaaa 1860

attcctcaca cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag 1920attcctcaca cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag 1920

cacccgcctc ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag 1980cacccgcctc ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag 1980

ttttcggcta caaagtttgc ttcattcatc acccagtatt ccacaggacg agtgagcgtg 2040ttttcggcta caaagtttgc ttcattcatc accccagtatt ccacaggacg agtgagcgtg 2040

gagattgaat gggagccgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat 2100gagattgaat gggagccgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat 2100

acatctaact atgcaaaatc tgccaacgtt gattttactg tggacaacaa tggactttat 2160acatctaact atgcaaaatc tgccaacgtt gattttactg tggacaacaa tggactttat 2160

actgagcctc gccccattgg cacccgttac cttacccgtc ccctgtaa 2208actgagcctc gccccattgg cacccgttac cttacccgtc ccctgtaa 2208

<210> 6<210> 6

<211> 735<211> 735

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 6<400> 6

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys ProGlu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His AlaGln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile GlyPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Ser Gly Ile Gly

145 150 155 160145 150 155 160

Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly GlyAla Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly Gly

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser SerAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

245 250 255 245 250 255

Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His TyrTyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr

260 265 270 260 265 270

Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe HisPhe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His

275 280 285 275 280 285

Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn TrpCys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp

290 295 300 290 295 300

Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln ValGly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val

305 310 315 320305 310 315 320

Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn LeuLys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn Leu

325 330 335 325 330 335

Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro TyrThr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro Tyr

340 345 350 340 345 350

Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala AspVal Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp

355 360 365 355 360 365

Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Pro Asn Asn Gly SerVal Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Pro Asn Asn Gly Ser

370 375 380 370 375 380

Gln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro SerGln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser

385 390 395 400385 390 395 400

Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe GluGln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu

405 410 415 405 410 415

Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp ArgGlu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg

420 425 430 420 425 430

Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg ThrLeu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg Thr

435 440 445 435 440 445

Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser ArgGln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser Arg

450 455 460 450 455 460

Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro GlyGly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro Gly

465 470 475 480465 470 475 480

Pro Cys Tyr Gln Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn AsnPro Cys Tyr Gln Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn Asn

485 490 495 485 490 495

Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn GlyAsn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn Gly

500 505 510 500 505 510

Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys AspArg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys Asp

515 520 525 515 520 525

Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly LysAsp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly Lys

530 535 540 530 535 540

Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile ThrGlu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile Thr

545 550 555 560545 550 555 560

Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg PheAsp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg Phe

565 570 575 565 570 575

Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala ThrGly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala Thr

580 585 590 580 585 590

Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln AspGly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln Asp

595 600 605 595 600 605

Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His ThrArg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr

610 615 620 610 615 620

Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu LysAsp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys

625 630 635 640625 630 635 640

His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala AsnHis Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn

645 650 655 645 650 655

Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr GlnPro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr Gln

660 665 670 660 665 670

Tyr Ser Thr Gly Arg Val Ser Val Glu Ile Glu Trp Glu Pro Gln LysTyr Ser Thr Gly Arg Val Ser Val Glu Ile Glu Trp Glu Pro Gln Lys

675 680 685 675 680 685

Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn TyrGlu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn Tyr

690 695 700 690 695 700

Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu TyrAla Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu Tyr

705 710 715 720705 710 715 720

Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro LeuThr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu

725 730 735 725 730 735

<210> 7<210> 7

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 7<400> 7

atggctgccg atggttatct tccagattgg ctcgaggaca acctttctga aggcattcgt 60atggctgccg atggttatct tccagattgg ctcgaggaca acctttctga aggcattcgt 60

gagtggtggg ctctgaaacc tggagtccct caacccaaag cgaaccaaca acaccaggac 120gagtggtggg ctctgaaacc tggagtccct caacccaaag cgaaccaaca acacaggac 120

aaccgtcggg gtcttgtgct tccgggttac aaatacctcg gacccggtaa cggactcgac 180aaccgtcggg gtcttgtgct tccgggttac aaatacctcg gacccggtaa cggactcgac 180

aaaggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcatacgac 240aaaggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcatacgac 240

cagcagctca aggccggtga caacccgtac ctcaagtaca accacgccga cgccgagttt 300cagcagctca aggccggtga caacccgtac ctcaagtaca accacgccga cgccgagttt 300

gccaaaaaga ggatccttga gcctcttggt ctggttgagg aagcagctaa gacggctcct 420gccaaaaaga ggatccttga gcctcttggt ctggttgagg aagcagctaa gacggctcct 420

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag 540aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag 540

tcagtcccag accctcaacc aatcggagaa cctccagcag cgccctctgg tgtgggacct 600tcagtcccag accctcaacc aatcggagaa cctccagcag cgccctctgg tgtgggacct 600

aatacaatgg ctgcgggcgg tggcgcacca atggcagaca ataacgaggg tgccgatgga 660aatacaatgg ctgcgggcgg tggcgcacca atggcagaca ataacgaggg tgccgatgga 660

gtgggtaatt cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720gtgggtaatt cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct 1200

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcacgatc 1680atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcacgatc 1680

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800

aagattcctc acacggatgg caactttcac ccgtctcctc tcatgggcgg ctttggactt 1920aagattcctc acacggatgg caactttcac ccgtctcctc tcatgggcgg ctttggactt 1920

gagttttcgg ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt 2040gagttttcgg ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt 2040

gtggaaattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100gtggaaattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100

<210> 8<210> 8

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 8<400> 8

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln ProGlu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro

50 55 60 50 55 60

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr AspVal Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro ProGly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro

180 185 190 180 185 190

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn SerAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe ProSer Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Thr IleLys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Thr Ile

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly LeuThr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 9<210> 9

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 9<400> 9

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgtctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgtctgga atatttccct 1200

ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc 1620

ctgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa tgtcatgatt 1680ctgatttttg gaaaagagag cgccggagct tcaaacactg cattggaca tgtcatgatt 1680

acagacgagg aggaaattaa agccactaac cccgtggcca ccgaaaaatt tgggactgtg 1740acagacgagg aggaaattaa agccactaac cccgtggcca ccgaaaaatt tgggactgtg 1740

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100

tatacatcta actatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160tatacatcta actatgcaaa atctgccaac gttgatttta ctgtggaca caatggactt 2160

<210> 10<210> 10

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 10<400> 10

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Leu Ile Phe GlyAsp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Leu Ile Phe Gly

530 535 540 530 535 540

545 550 555 560545 550 555 560

Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu LysThr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Lys

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 11<210> 11

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 11<400> 11

atggctggcg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60atggctggcg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60

gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120

aagggggagc cggtcaacgc agcagacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc cggtcaacgc agcagacgca gcggccctcg agcacgacaa ggcctacgac 240

gcgaaaaagc ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420gcgaaaaagc ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattagc 480ggaaagaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattagc 480

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg cgccgacgga 660cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720

accaccagca cccgaacttg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgaacttg ggccttgccc acctataaca accacctcta caagcaaatc 780

tccagtgctt caacgggggc cagcaacgat aaccactact tcggctacag caccccctgg 840tccagtgctt caacgggggc cagcaacgat aaccactact tcggctacag caccccctgg 840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

atcaacaaca actggggatt ccggcccaag agactcagct tcaagctctt caacatccag 960atcaacaaca actggggatt ccggcccaag agactcagct tcaagctctt caacatccag 960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataatct taccagcacg 1020gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataatct taccagcacg 1020

gtccaggtct tcacggactc agactatcag ctcccgtacg ttctcggctc tgcccaccag 1080gtccagggtct tcacggactc agactatcag ctcccgtacg ttctcggctc tgcccaccag 1080

ggctgcctgc ctccgttccc ggcggacgtg ttcatgattc cccagtacgg ctacctgaca 1140ggctgcctgc ctccgttccc ggcggacgtg ttcatgattc cccagtacgg ctacctgaca 1140

ctcaacaacg gtagtcaggc cgtgggacgc tcctccttct actgcctgga atactttcct 1200ctcaacaacg gtagtcaggc cgtgggacgc tcctccttct actgcctgga atactttcct 1200

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactc 1920aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactc 1920

aagcacccgc ctcctcagat cctcatcaaa aacacacccg ttcctgcgaa tcctccggca 1980aagcacccgc ctcctcagat cctcatcaaa aacacacccg ttcctgcgaa tcctccggca 1980

tacacatcca attatgcaaa atctgccgac gttgatttta ctgtggacaa caatggactt 2160tacacatcca attatgcaaa atctgccgac gttgatttta ctgtggacaa caatggactt 2160

<210> 12<210> 12

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 12<400> 12

Met Ala Gly Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu SerMet Ala Gly Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile SerPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Ser

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly GlyAla Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn AlaAla Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn Ile GlnTrp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn Ile Gln

305 310 315 320305 310 315 320

Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn AsnVal Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn

325 330 335 325 330 335

Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu ProLeu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe GlyAsp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

Tyr Ala Lys Ser Ala Asp Val Asp Phe Thr Val Asp Asn Asn Gly LeuTyr Ala Lys Ser Ala Asp Val Asp Phe Thr Val Asp Asn Asn Gly Leu

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 13<210> 13

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 13<400> 13

atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga gggcattcgc 60atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga gggcattcgc 60

aggggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aggggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccgg 360caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccgg 360

gccaagaagc gggttctcga acctctcggt ctggttgagg aagcagctaa aacggctcct 420gccaagaagc gggttctcga acctctcggt ctggttgagg aagcagctaa aacggctcct 420

ggaaagaaga ggcctgtaga tcagtctcct caggaaccgg actcatcatc tggtgttggc 480ggaaagaaga ggcctgtaga tcagtctcct caggaaccgg actcatcatc tggtgttggc 480

aaatcgggca aacagcctgc cagaaaaaga ctcaatttcg gtcagactgg cgactcagag 540aaatcgggca aacagcctgc cagaaaaaga ctcaatttcg gtcagactgg cgactcagag 540

tcagtccccg accctcaacc tctcggagaa cctccagcag cgccctctag tgtgggatct 600tcagtccccg accctcaacc tctcggagaa cctccagcag cgccctctag tgtgggatct 600

ggtacagtgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660ggtacagtgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc catttgggcc 1860gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc catttgggcc 1860

<210> 14<210> 14

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 14<400> 14

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Arg Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Arg Gly Glu Pro

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu ProAsn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu Pro

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Val GlyPro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Ser Gly Val Gly

145 150 155 160145 150 155 160

Lys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly GlyAla Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly Gly

195 200 205 195 200 205

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 15<210> 15

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 15<400> 15

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggaataaga 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggaataaga 60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac 120cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac 120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac 180gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac 180

aagggggagc cggtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc cggtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

caggagcgtc tgcaagaaga tacgtcattt gggggcaacc tcgggcgagc agtcttccag 360caggagcgtc tgcaagaaga tacgtcattt gggggcaacc tcgggcgagc agtcttccag 360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420

ggaaagaaac gtccggtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480ggaaagaaac gtccggtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480

aaatcgggtg cacggcccgc taaaaagaga ctcaattttg gtcagactgg cgacacagag 540aaatcgggtg cacggcccgc taaaaagaga ctcaattttg gtcagactgg cgacacagag 540

tcagtcccag accctcaacc aatcggagaa cctcccgcag cccccacaag tttgggatct 600tcagtcccag accctcaacc aatcggagaa cctcccgcag cccccacaag tttgggatct 600

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcgtc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcgtc 720

accaccagca cccgcacctg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgcacctg ggccttgccc acctataaca accacctcta caagcaaatc 780

gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

gtccaggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag 1080gtccagggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag 1080

ctcaacaatg gcagccaagc cgtgggtcgt tcgtcctttt actgcctgga atatttcccg 1200ctcaacaatg gcagccaagc cgtgggtcgt tcgtcctttt actgcctgga atatttcccg 1200

tcgcaaatgc taagaacagg caacaacttt accttcagct acacctttga ggaagtgcct 1260tcgcaaatgc taagaacagg caacaacttt accttcagct acacctttga ggaagtgcct 1260

gcagtcaatc tccagagcag cagcacggac cctgcgactg gagatgtgca tgttatggga 1800gcagtcaatc tccagagcag cagcacggac cctgcgactg gagatgtgca tgttatggga 1800

<210> 16<210> 16

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 16<400> 16

1 5 10 151 5 10 15

Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro ProGlu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro Pro

20 25 30 20 25 30

Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu ProLys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

Lys Ser Gly Ala Arg Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Ala Arg Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly GlyAla Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val ValSer Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Val

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro AlaTyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro Ala

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 17<210> 17

<211> 2214<211> 2214

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 17<400> 17

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggcattcgt 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggcattcgt 60

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgcggagttt 300cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgcggagttt 300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag 360caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag 360

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc 480ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc 480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt tcggtcagac tggcgacaca 540ggcaagaaag gccaacagcc cgccagaaaa agactcaatt tcggtcagac tggcgacaca 540

gagtcagtcc cagaccctca accaatcgga gaacctcccg cagccccctc aggtgtggaa 600gagtcagtcc cagaccctca accaatcgga gaacctcccg cagccccctc aggtgtggaa 600

tctcttacaa tggcttcagg tggtggcgca ccagtggcag acagtaacga aggcgccgac 660tctcttacaa tggcttcagg tggtggcgca ccagtggcag acagtaacga aggcgccgac 660

atcaccaaca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa 780atcaccaaca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa 780

tgggggtatt ttgatttcaa cagattccac tgccactttt caccacgtga ctggcagcga 900tgggggtatt ttgatttcaa cagattccac tgccactttt caccacgtga ctggcagcga 900

ctcatcaaca acaattgggg attccggccc aagagactca acttcaaact cttcaacatc 960ctcatcaaca acaattgggg attccggccc aagagactca acttcaaact cttcaacatc 960

caagtcaagg aggtcacgac gaatgatggc gtcacaacca tcgctaataa ccttaccagc 1020caagtcaagg aggtcacgac gaatgatggc gtcacaacca tcgctaataa ccttaccagc 1020

acggttcaag tcttctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080acggttcaag tcttctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080

gaccaatacc tgtattacct gaacagaact caaaatcagt ccggaagtgc ccaaaacaag 1380gaccaatacc tgtattacct gaacagaact caaaatcagt ccggaagtgc ccaaaacaag 1380

gacttgctgt ttagccgtgg gtctccagct ggcatgtctg ttcagcccaa aaactggcta 1440gacttgctgt ttagccgtgg gtctccagct ggcatgtctg ttcagcccaa aaactggcta 1440

cctggaccct gttatcggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500cctggaccct gttatcggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500

aattttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560aattttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560

attacagacg aagaggaaat taaagccact aaccctgtgg ccaccgaaag atttgggacc 1740attacagacg aagaggaaat taaagccact aaccctgtgg ccaccgaaag atttgggacc 1740

gcggagtttt cagctacaaa gtttgcttca ttcatcaccc aatactccac aggacaagtg 2040gcggagtttt cagctacaaa gtttgcttca ttcatcaccc aatactccac aggacaagtg 2040

agtgtggaaa ttgaatggga gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg 2100agtgtggaaa ttgaatggga gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg 2100

cagtacacat ccaattatgc aaaatctgcc aacgttgatt ttactgtgga caacaatgga 2160cagtacacat ccaattatgc aaaatctgcc aacgttgatt ttactgtgga caacaatgga 2160

<210> 18<210> 18

<211> 737<211> 737

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 18<400> 18

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly IlePro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile

145 150 155 160145 150 155 160

Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly GlnGly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln

165 170 175 165 170 175

Thr Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu ProThr Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro

180 185 190 180 185 190

Pro Ala Ala Pro Ser Gly Val Glu Ser Leu Thr Met Ala Ser Gly GlyPro Ala Ala Pro Ser Gly Val Glu Ser Leu Thr Met Ala Ser Gly Gly

195 200 205 195 200 205

Gly Ala Pro Val Ala Asp Ser Asn Glu Gly Ala Asp Gly Val Gly AsnGly Ala Pro Val Ala Asp Ser Asn Glu Gly Ala Asp Gly Val Gly Asn

210 215 220 210 215 220

225 230 235 240225 230 235 240

Ile Thr Asn Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn HisIle Thr Asn Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln LeuAsn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn GlyAsn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly

705 710 715 720705 710 715 720

725 730 735 725 730 735

LeuLeu

Claims

1. A nucleic acid, characterized in that, the nucleic acid comprises:

A nucleotide sequence shown by SEQ ID NO: 5 for encoding an adeno-associated virus capsid protein; or

The nucleotide sequence of the adeno-associated virus capsid protein encoded by the nucleotide sequence shown in SEQ ID NO:5, which is different from the nucleotide sequence shown in SEQ ID NO:5 due to the degeneracy of the genetic code .

2. The nucleic acid according to claim 1, wherein the nucleic acid is a plasmid, phage, viral vector, bacterial artificial chromosome or yeast artificial chromosome.

3. The nucleic acid according to claim 2, characterized in that the nucleic acid is an adeno-associated virus vector containing a coding sequence.

4. The nucleic acid of claim 3, further comprising a coding sequence of an adeno-associated virus Rep protein.

5. The adeno-associated virus capsid protein encoded by the nucleotide sequence shown in SEQ ID NO:5 according to claim 1.

6. The adeno-associated virus capsid protein of claim 5, wherein the amino acid sequence of the adeno-associated virus capsid protein is the amino acid sequence shown in SEQ ID NO:6.

7. The adeno-associated virus capsid protein according to claim 5 or 6, characterized in that, the adeno-associated virus capsid protein is covalently linked, bound or encapsulated with a composition selected from the group consisting of DNA, RNA One or more of peptides, carbohydrates, liposomes and small organic molecules.

8. A recombinant virion, characterized in that the recombinant virion comprises the nucleic acid according to any one of claims 1-4 and/or the capsid protein according to any one of claims 5-7.

9. The recombinant virus particle according to claim 8, characterized in that, the recombinant virus particle is a recombinant adeno-associated virus particle, a recombinant adenovirus particle, a recombinant herpes virus particle, a recombinant baculovirus particle, or a recombinant hybrid virus particle .

10. A recombinant adeno-associated virus particle, characterized in that, the recombinant adeno-associated virus particle comprises an adeno-associated virus vector genome and the adeno-associated virus capsid protein according to claim 5 or 6, wherein the adeno-associated virus vector genome wraps in the adeno-associated virus capsid protein.

11. The recombinant adeno-associated virus particle according to claim 10, characterized in that the genome of the adeno-associated virus vector contains heterologous nucleic acid sequences.

12. The recombinant adeno-associated virus particle according to claim 11, characterized in that, the heterologous nucleic acid sequence encodes one or more selected from antisense RNA, microRNA, shRNA, polypeptide and immunogen.

13. The recombinant adeno-associated virus particle according to claim 12, characterized in that, the polypeptide encoded by the heterologous nucleic acid sequence is a therapeutic polypeptide or a reporter gene.

14. The recombinant adeno-associated virus particle according to claim 13, wherein the therapeutic polypeptide encoded by the heterologous nucleic acid is selected from insulin, glucagon, growth hormone, growth hormone releasing factor, erythropoietin, Insulin growth factor, transforming growth factor alpha, hepatocyte growth factor, tyrosine hydroxylase, thrombopoietin, interleukin 1-interleukin 25, low-density lipoprotein receptor, glucocorticoid receptor, vitamin D receptor, interference Factor Ⅷ, factor Ⅸ, glucosidase, glucose-6-phosphatase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, β-glucosidase, liver phosphorylase, phosphate One or more of the group consisting of karase kinase, glycine decarboxylase, α-galactosidase, β-galactosidase and lysosomal enzymes.

15. A cell, characterized in that, the cell comprises the nucleic acid according to any one of claims 1-4, the adeno-associated virus capsid protein according to any one of claims 5-7, the adeno-associated virus capsid protein according to any one of claims 8-9 The recombinant virus particle described in any one of claims and/or the recombinant adeno-associated virus particle described in any one of claims 10-14.

16. The cell according to claim 15, wherein the cell is selected from Escherichia coli, HEK293 cell line, HEK293T cell line, HEK293A cell line, HEK293S cell line, HEK293FT cell line, HEK293F cell line, HEK293H cell line, One or more of HeLa cell line, SF9 cell line, SF21 cell line, SF900 cell line, and BHK cell line.

17. A pharmaceutical composition, characterized in that the pharmaceutical composition comprises a pharmaceutically acceptable carrier or adjuvant and a nucleic acid selected from any of claims 1-4, any of claims 5-7, A described adeno-associated virus capsid protein, a recombinant adeno-associated virus particle described in any one of claims 8-9, a recombinant adeno-associated virus particle described in any one of claims 10-14 and/or a described recombinant adeno-associated virus particle described in claim 15 one or more of the cells.

18. The nucleic acid according to any one of claims 1-4, the adeno-associated virus capsid protein according to any one of claims 5-7, the recombinant virus particle according to any one of claims 8-9, any one of the claims One or more of the recombinant adeno-associated virus particles described in any one of 10-14, the cells described in claim 15 and/or the pharmaceutical composition described in claim 17 are prepared for the prevention or treatment of diseases Use in medicine, wherein the disease is one or more selected from the group consisting of hemophilia A, hemophilia B, diabetes, glycogen storage disease, Niemann Pick disease, hepatitis and hyperammonemia.

19. A method for preparing recombinant adeno-associated virus particles, characterized in that the method comprises providing cells with a nucleic acid according to claim 1, a nucleic acid coding sequence of an adeno-associated virus Rep protein, a an adeno-associated virus vector genome carrying a heterologous nucleic acid sequence, an auxiliary factor that helps produce an infectious adeno-associated virus, and allows the adeno-associated virus vector genome to be wrapped in the adeno-associated virus capsid protein encoded by the nucleic acid described in claim 1, And realize the assembly of recombinant adeno-associated virus particles.

20. The method for preparing recombinant adeno-associated virus particles according to claim 19, characterized in that, the method is an adeno-associated virus vector preparation system, comprising a two-plasmid packaging system, a three-plasmid packaging system, a baculovirus packaging system and Adeno-associated virus packaging system with adenovirus or herpes simplex virus as the auxiliary virus, etc.

21. A method for delivering heterologous nucleic acid to cells in vitro, characterized in that the method comprises administering the nucleic acid according to any one of claims 1-4, the nucleic acid according to any one of claims 5-7 to cells Adeno-associated virus capsid protein, the recombinant virus particle described in claim 8 or 9, the recombinant adeno-associated virus particle described in any one of claims 10-14 and/or the pharmaceutical composition described in claim 17, wherein the Said cells are mammalian hepatocytes.

22. The method of delivering heterologous nucleic acid to a cell of claim 21, wherein the cell is a human hepatocyte.