CN111876432A

CN111876432A - Acquisition and application of a group of novel liver-targeted adeno-associated viruses

Info

Publication number: CN111876432A
Application number: CN202010742484.2A
Authority: CN
Inventors: 张婷婷; 王超; 丰硕
Original assignee: Beijing Solobio Genetechnology Co Ltd; Staidson Beijing Biopharmaceutical Co Ltd
Current assignee: Beijing Solobio Genetechnology Co Ltd
Priority date: 2020-07-29
Filing date: 2020-07-29
Publication date: 2020-11-03
Anticipated expiration: 2040-07-29
Also published as: CN116121274B; CN111876432B; CN115927398A; CN116121274A; CN115927398B; CN116121275A; CN115960921A; CN116121275B; CN115960921B

Abstract

The present invention provides a set of adeno-associated viruses having properties of interest, e.g., targeting properties and/or neutralizing properties, obtained by directed evolution and in vivo screening methods. The present invention also provides adeno-associated virus capsid protein and viral particles comprising the adeno-associated virus capsid protein, which have excellent or superior targeting and/or neutralizing properties.

Description

Acquisition and application of a group of novel liver-targeted adeno-associated viruses

技术领域technical field

本发明涉及一组通过定向进化和体内筛选的方法而获得的腺相关病毒，以及优化的腺相关病毒衣壳蛋白和包含所述衣壳蛋白的病毒载体。The present invention relates to a group of adeno-associated viruses obtained by methods of directed evolution and in vivo screening, as well as optimized adeno-associated virus capsid proteins and viral vectors comprising the capsid proteins.

背景技术Background technique

迄今以血清型定名的腺相关病毒(AAV)亚型分为AAV1-AAV12(高光平等，2004，JVirolm 78:6381-6388；Mori S等，2004，Virology 330:375-383；Schmidt M等，2008，JVirol 82:1399-1406)，主要以人类和灵长类动物为宿主，其中AAV1-6是从人类组织中分离获得，并且具有明确的抗体反应性，因而较为经典并得以公认。AAV7和AAV8是高光平等通过基因工程手段从猕猴心脏组织拯救得到的，疑为在进化中已经绝迹的AAV亚型，这一策略为AAV新亚型的发现和重组病毒的设计改造提供了新的思路和范例(高光平等，2002，ProcNatl Acad Sci USA 99:11854-11859)，而AAV9-12是利用同类技术路线分别从人和短尾猴的组织中制备的。尽管不同AAV血清型的病毒都具有正二十面体的结构，但其衣壳蛋白在序列与空间构象上的多样性，使得其细胞表面的结合受体以及对细胞的感染嗜性存在显著差异(Timpe J等，2005，Curr Gene Ther 5:273-284)。在感染嗜性方面，AAV2的感染谱较广，尤对神经细胞效果最好；AAV1和AAV7在骨骼肌中的转导效率较高；AAV3易于转导巨核细胞；AAV5和AAV6感染呼吸道上皮细胞的优势显著；而AAV8转导肝细胞的效率优于其它亚型。Adeno-associated virus (AAV) subtypes named by serotype so far are divided into AAV1-AAV12 (Gao Guangping et al., 2004, JVirolm 78:6381-6388; Mori S et al., 2004, Virology 330:375-383; Schmidt M et al., 2008 , JVirol 82:1399-1406), mainly in humans and primates as hosts, of which AAV1-6 are isolated from human tissues and have clear antibody reactivity, so they are more classic and recognized. AAV7 and AAV8 were rescued from macaque heart tissue by Gao Guangping through genetic engineering. They are suspected to be AAV subtypes that have disappeared in evolution. This strategy provides new opportunities for the discovery of new AAV subtypes and the design and modification of recombinant viruses. Ideas and paradigms (Gao Guang et al., 2002, ProcNatl Acad Sci USA 99: 11854-11859), and AAV9-12 were prepared from human and cynomolgus monkey tissues using the same technical route. Although viruses of different AAV serotypes have an icosahedral structure, the diversity of their capsid proteins in sequence and spatial conformation makes their cell surface binding receptors and cell infection tropisms significantly different ( Timpe J et al, 2005, Curr Gene Ther 5:273-284). In terms of infection tropism, AAV2 has a broad infection spectrum, especially for nerve cells; AAV1 and AAV7 have higher transduction efficiency in skeletal muscle; AAV3 is easy to transduce megakaryocytes; AAV5 and AAV6 infect respiratory epithelial cells. The advantage is significant; while AAV8 transduces hepatocytes more efficiently than other subtypes.

AAV病毒本身是复制缺陷型的，在没有辅助病毒存在的情况下只能在宿主细胞呈潜伏状态。AAV病毒载体的生产则需要辅助质粒(helper)提供腺病毒(Ad)参与AAV复制的关键基因。这些Ad基因包括：早期基因E1A负责AAV的转录活性，早期基因E1B和E4参与AAVmRNA的成熟化，早期基因E2A和VA增强AAV RNA翻译(Berns等，1984，Adeno-associatedvirus 563–592)。The AAV virus itself is replication-deficient and can only remain latent in host cells in the absence of a helper virus. The production of AAV viral vectors requires a helper plasmid (helper) to provide adenovirus (Ad) with key genes involved in AAV replication. These Ad genes include: the early gene E1A is responsible for the transcriptional activity of AAV, the early genes E1B and E4 are involved in the maturation of AAV mRNA, and the early genes E2A and VA enhance AAV RNA translation (Berns et al., 1984, Adeno-associated virus 563–592).

重组腺相关病毒(rAAV)作为基因治疗载体，具有非常多的优势，如感染效率高、感染范围广、长期表达和高安全性等(David AF等，2007，BMC Bio 7:75)，已被广泛应用于临床实验。目前以AAV为载体的基因治疗项目进入临床研究的有100多项，治疗的疾病范围扩展到肿瘤、视网膜疾病、关节炎、艾滋病、心力衰竭、肌营养不良症、神经系统疾病及其它一系列基因缺陷疾病。Recombinant adeno-associated virus (rAAV) as a gene therapy vector has many advantages, such as high infection efficiency, wide infection range, long-term expression and high safety (David AF et al., 2007, BMC Bio 7:75), and has been used by Widely used in clinical experiments. At present, more than 100 gene therapy projects using AAV as a vector have entered clinical research, and the treatment range of diseases has expanded to tumors, retinal diseases, arthritis, AIDS, heart failure, muscular dystrophy, nervous system diseases and a series of other genes Defective disease.

目前正在开展的基于rAAV载体的眼部疾病基因治疗临床研究，大多数是针对视网膜色素上皮特异性65kDa蛋白编码基因(RPE65)突变引起的先天性黑蒙(LCA，Leber’scongenital amaurosis)。Spark公司研发的Luxturna药物已于2017年12月通过FDA审批上市，此药物是rAAV2携带hRPE65v2基因治疗LCA及遗传性视网膜发育不良。受试者在注射后的测试结果显示其药物的有效性，并且没有发现明显副作用，尤其是载体相关的副作用(Russell S等，2017，Lancet 390:849-860)。其它进入临床的基因治疗眼科疾病还有无脉络膜症，多数研究处于临床I、II期，其中阿尔伯塔大学利用rAAV2携带Rab护卫蛋白1编码基因(REP1)治疗项目，在2018年公布的I期研究结果显示了药物的安全性及有效性。Currently, most of the clinical studies on gene therapy for eye diseases based on rAAV vectors are aimed at Leber's congenital amaurosis (LCA) caused by mutations in the retinal pigment epithelium-specific 65kDa protein-encoding gene (RPE65). The Luxturna drug developed by Spark has been approved by the FDA in December 2017. This drug is rAAV2 carrying hRPE65v2 gene therapy for LCA and hereditary retinal dysplasia. The test results of the subjects after injection showed the effectiveness of the drug, and no obvious side effects, especially vehicle-related side effects, were found (Russell S et al., 2017, Lancet 390:849-860). Other gene therapy ophthalmic diseases that have entered the clinic include choroideremia, most of which are in clinical phases I and II. Among them, the University of Alberta uses rAAV2 to carry the Rab guard protein 1 encoding gene (REP1) treatment project, which was announced in 2018. Phase I The results of the study showed the safety and efficacy of the drug.

目前正在开展的基于rAAV载体的血友病基因治疗临床研究有二十多个，其中以B型血友病研究最多。具有长期表达能力的AAV基因药物是治疗血友病B的理想选择。UniQure公司正在进行临床I/II期研究，用AAV5携带人凝血因子IX编码基因(hFIX)治疗B型血友病，给药后1年的随访数据显示了药物安全性。患者给药后1周出现体液免疫应答，但不影响凝血因子IX的表达水平，使用现在T细胞检测“金标准”系统检测不到T细胞活化。转氨酶有升高但是不影响FIX活性，也没有发现肝毒性反应(Miesbach W等，2018，Blood 131:1022-1031)。目前还有一些利用AAV8载体治疗血友病的基因治疗项目，这些都是以肝脏为靶向性的临床研究(Nathwani AC等，2006，Blood.107:2653-2661)，AAV8也是目前公认的肝脏靶向最优的AAV血清型。Currently, there are more than 20 clinical studies on hemophilia gene therapy based on rAAV vector, among which hemophilia B is the most studied. AAV gene drugs with long-term expression ability are ideal for the treatment of hemophilia B. UniQure is conducting a clinical phase I/II study with AAV5 carrying the gene encoding human coagulation factor IX (hFIX) in the treatment of hemophilia B, and the 1-year follow-up data after administration shows the safety of the drug. The patient developed a humoral immune response 1 week after administration, but did not affect the expression level of coagulation factor IX, and T cell activation could not be detected using the current "gold standard" system for T cell detection. Transaminases were elevated but did not affect FIX activity, and no hepatotoxicity was found (Miesbach W et al., 2018, Blood 131:1022-1031). At present, there are some gene therapy projects using AAV8 vectors to treat hemophilia. These are clinical studies targeting the liver (Nathwani AC et al., 2006, Blood. 107:2653-2661). AAV8 is also currently recognized as the liver Targets optimal AAV serotypes.

脊髓性肌萎缩(spinal muscular atrophy，SMA)系指一类由于以脊髓前角细胞为主的变性导致肌无力和肌萎缩的疾病。AveXis公司的AVXS-101目前已经上市，利用AAV9在神经系统感染的优势，携带运动神经元存活蛋白编码基因(SMN1)治疗SMA取得了很好的效果，并且在临床试验中所有的患者没有出现载体副作用相关的临床症状(Mendell JR等，2017，N Engl J Med 377:1713-1722)。Spinal muscular atrophy (SMA) refers to a class of diseases that cause muscle weakness and muscle atrophy due to the degeneration of cells in the anterior horn of the spinal cord. AveXis' AVXS-101 is currently on the market. Taking advantage of the advantages of AAV9 in nervous system infections, carrying the survivin motor neuron-encoding gene (SMN1) has achieved good results in the treatment of SMA, and all patients in the clinical trial did not have the carrier. Clinical symptoms associated with side effects (Mendell JR et al, 2017, N Engl J Med 377:1713-1722).

其它以AAV为载体的罕见病基因治疗项目，例如，利用AAV1携带酸性α-葡萄糖苷酶编码基因(GAA)治疗庞贝氏病(Smith BK等，2013，Hum Gene Ther 24:630-640)，利用改造后的AAV2.5携带微小抗肌萎缩蛋白编码基因(minidystrophin)治疗杜氏肌营养不良(DMD)(Bowles DE等，2012，Mol Ther 20:443-455)，这些临床研究结果均显示出药物的有效性及安全性。Other rare disease gene therapy projects using AAV as a vector, for example, using AAV1 to carry the gene encoding acid α-glucosidase (GAA) to treat Pompe disease (Smith BK et al., 2013, Hum Gene Ther 24: 630-640), Treatment of Duchenne muscular dystrophy (DMD) with modified AAV2.5 carrying the gene encoding minidystrophin (Bowles DE et al., 2012, Mol Ther 20:443-455), the results of these clinical studies have shown that the drug effectiveness and safety.

天然AAV靶向性是有限的，尤其是在使用AAV载体进行系统给药时，根据血清型选择的不同，能够有效感染靶细胞组织的比例差异很大，没有使AAV的利用率最大化；另外其它非靶向组织细胞具有被感染的潜在风险。另外，又因为人和其它灵长类动物受AAV自然感染产生了针对天然型AAV的中和抗体，这会大大降低AAV的半衰期，而影响其药物活性的发挥。Natural AAV targeting is limited, especially when using AAV vectors for systemic administration, depending on the serotype selection, the proportion of target cells and tissues that can be effectively infected varies greatly, which does not maximize the utilization of AAV; Other non-targeted tissue cells are potentially at risk of infection. In addition, because humans and other primates are naturally infected by AAV, neutralizing antibodies against natural AAV are produced, which will greatly reduce the half-life of AAV and affect its drug activity.

目前对于AAV外壳蛋白改造的研究越来越多，改造的目的：一方面能够增强病毒载体的靶向性，另一方面能够降低病毒载体的免疫原性反应。At present, there are more and more studies on the modification of AAV coat protein. The purpose of the modification is: on the one hand, it can enhance the targeting of the viral vector, and on the other hand, it can reduce the immunogenic response of the viral vector.

对于病毒细胞受体机制研究比较明确的载体血清型，可直接针对其细胞受体相关区域的氨基酸进行小范围或定点改造。AAV2在细胞上的受体是目前研究比较明确的。硫酸乙酰肝素蛋白多糖(HSPG)是AAV2和AAV3型的主要细胞受体，2型AAV外壳蛋白上R484、R487、K532、R585、R588氨基酸位点的改变会影响其与HSPG的结合(Opie,S.R等，2003，J Virol77:6995–7006；Summerford,C等，1999，Nat Med 5:78–82)。For the carrier serotypes whose mechanism of viral cell receptors is relatively clear, small-scale or site-specific modifications can be carried out directly on the amino acids in the relevant regions of their cell receptors. The receptor of AAV2 on cells is relatively clear in current research. Heparan sulfate proteoglycan (HSPG) is the major cellular receptor for AAV2 and AAV3, and changes in amino acid sites R484, R487, K532, R585, and R588 on the coat protein of type 2 AAV affect its binding to HSPG (Opie, S.R. et al, 2003, J Virol 77:6995-7006; Summerford, C et al, 1999, Nat Med 5:78-82).

改造后的新型AAV载体已被用于基因治疗临床研究中，例如AAV2.5携带minidystrophin基因治疗DMD(Bowles DE等，2012，Mol Ther 20:443-455)，其I期临床研究已经完成，并且针对新型AAV的安全性进行了大量研究。该项目所使用AAV2.5外壳蛋白为嵌合体，将AAV1外壳蛋白中与骨骼肌靶向相关的5个氨基酸移植到AAV2的外壳蛋白上。此嵌合体不但增强了对骨骼肌的靶向性，而且体液免疫反应明显低于AAV2，这些临床试验证明了改造后的病毒载体的安全性。Modified novel AAV vectors have been used in gene therapy clinical studies, such as AAV2.5 carrying minidystrophin gene therapy DMD (Bowles DE et al., 2012, Mol Ther 20:443-455), whose phase I clinical studies have been completed, and A lot of research has been done on the safety of novel AAVs. The AAV2.5 coat protein used in this project is a chimera, and 5 amino acids related to skeletal muscle targeting in the AAV1 coat protein are grafted onto the AAV2 coat protein. This chimera not only enhanced the targeting to skeletal muscle, but also the humoral immune response was significantly lower than that of AAV2. These clinical trials demonstrated the safety of the modified viral vector.

另外，病毒的细胞受体机制不明确的情况下，我们也可以利用DNA改组(DNAshuffling)或易错PCR的方法，获得功能优化的新型AAV病毒载体的外壳蛋白。例如Grimm等人利用AAV构建的改组文库，在静脉注射免疫球蛋白的严格筛选情况下，得到一个AAV2，8，9组成的嵌合体AAV-DJ，该载体对成纤维细胞和肺等多种细胞系有更高的转导效率(Grimm D等，2008，J Virol 82：5887-5911)。Jang等人利用DNA改组文库筛选到了一个能高效感染神经干细胞的变体(Jang J H，2011，Mol Ther 19：667-675)。In addition, when the cellular receptor mechanism of the virus is not clear, we can also use DNA shuffling (DNA shuffling) or error-prone PCR to obtain the coat protein of a novel AAV viral vector with optimized functions. For example, Grimm et al. used the shuffling library constructed by AAV to obtain a chimera AAV-DJ composed of AAV2, 8, and 9 under the strict screening of intravenous immunoglobulin. The vector is suitable for fibroblasts and lung cells. lines have higher transduction efficiencies (Grimm D et al., 2008, J Virol 82:5887-5911). Jang et al. screened a variant that can efficiently infect neural stem cells using a DNA shuffling library (Jang J H, 2011, Mol Ther 19: 667-675).

目前基因治疗药物已日趋成为国内外研究的热点，而且为了使基因治疗药物更好更长久地发挥作用，寻找一种功能优化的新型AAV载体使其更好地满足做为基因治疗载体的需求是亟待解决的问题。At present, gene therapy drugs have increasingly become a research hotspot at home and abroad, and in order to make gene therapy drugs work better and longer, it is necessary to find a new type of AAV vector with optimized functions to better meet the needs of gene therapy vectors. Problems to be solved.

发明内容SUMMARY OF THE INVENTION

基于寻找新型AAV载体的需求，本发明提供了一组通过体内筛选来实现病毒定向进化的方法而获得的包含感兴趣特质AAV衣壳的病毒，例如，靶向性特质和/或中和性特质(例如，逃避中和抗体的能力)。本发明还提供了AAV衣壳和包含AAV衣壳的病毒粒子。Based on the need to find novel AAV vectors, the present invention provides a set of viruses containing AAV capsids with traits of interest, such as targeting traits and/or neutralizing traits, obtained by in vivo screening methods for virus directed evolution (eg, the ability to evade neutralizing antibodies). The present invention also provides AAV capsids and virions comprising AAV capsids.

一方面，本发明提供了一种编码AAV衣壳蛋白的核酸，所述核酸包含选自以下组中的AAV衣壳蛋白编码序列：In one aspect, the invention provides a nucleic acid encoding an AAV capsid protein, the nucleic acid comprising an AAV capsid protein coding sequence selected from the group consisting of:

(a)图3A的核苷酸序列(L1)(SEQ ID NO:1)；(a) Nucleotide sequence (L1) of Figure 3A (SEQ ID NO: 1);

(b)图3C的核苷酸序列(L4)(SEQ ID NO:3)；(b) the nucleotide sequence (L4) of Figure 3C (SEQ ID NO: 3);

(c)图3E的核苷酸序列(L10)(SEQ ID NO:5)；(c) the nucleotide sequence (L10) of Figure 3E (SEQ ID NO: 5);

(d)图3G的核苷酸序列(L52)(SEQ ID NO:7)；(d) the nucleotide sequence (L52) of Figure 3G (SEQ ID NO: 7);

(e)图3I的核苷酸序列(L58)(SEQ ID NO:9)；(e) the nucleotide sequence (L58) of Figure 3I (SEQ ID NO: 9);

(f)图3K的核苷酸序列(L84)(SEQ ID NO:11)；(f) the nucleotide sequence (L84) of Figure 3K (SEQ ID NO: 11);

(g)图3M的核苷酸序列(L37)(SEQ ID NO:13)；(g) the nucleotide sequence (L37) of Figure 3M (SEQ ID NO: 13);

(h)图3O的核苷酸序列(L107)(SEQ ID NO:15)；(h) the nucleotide sequence (L107) of Figure 30 (SEQ ID NO: 15);

(i)图3Q的核苷酸序列(L57)(SEQ ID NO:17)；或(i) the nucleotide sequence (L57) of Figure 3Q (SEQ ID NO: 17); or

(j)编码由(a)-(i)任一核苷酸编码的AAV衣壳蛋白但由于遗传密码的简并性而不同于(a)-(i)的核苷酸序列。(j) A nucleotide sequence encoding the AAV capsid protein encoded by any of nucleotides (a)-(i) but different from (a)-(i) due to the degeneracy of the genetic code.

另一方面，本发明提供了由本发明的核酸编码的AAV衣壳蛋白，所述AAV衣壳蛋白的氨基酸序列选自SEQ ID NO:2，SEQ ID NO:4，SEQ ID NO:6，SEQ ID NO:8，SEQ ID NO:10，SEQ ID NO:12，SEQ ID NO:14，SEQ ID NO:16，或SEQ ID NO:18中的任一种。In another aspect, the present invention provides an AAV capsid protein encoded by the nucleic acid of the present invention, wherein the amino acid sequence of the AAV capsid protein is selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID Any of NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, or SEQ ID NO:18.

本发明进一步提供了包含一种病毒基因组和本发明AAV衣壳蛋白的重组病毒粒子，其中病毒基因组包裹于AAV衣壳蛋白中。本发明进一步提供了一种AAV病毒基因组和本发明AAV衣壳蛋白的重组腺相关病毒粒子，其中病毒基因组包裹于AAV衣壳蛋白中。在具体的实施例中，病毒基因组是一个包含异源核酸的重组载体基因组。The present invention further provides recombinant virions comprising a viral genome and the AAV capsid protein of the present invention, wherein the viral genome is encapsulated in the AAV capsid protein. The present invention further provides an AAV virus genome and a recombinant adeno-associated virus particle of the AAV capsid protein of the present invention, wherein the viral genome is encapsulated in the AAV capsid protein. In specific embodiments, the viral genome is a recombinant vector genome comprising heterologous nucleic acid.

本发明提供了一种细胞，该细胞包含本发明的核酸、AAV衣壳蛋白、重组病毒粒子和/或重组腺相关病毒粒子。The present invention provides a cell comprising the nucleic acid, AAV capsid protein, recombinant virus particle and/or recombinant adeno-associated virus particle of the present invention.

本发明提供了一种药物组合物，该药物组合物包含一种药学上可接受的载体和本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子和/或细胞。The present invention provides a pharmaceutical composition comprising a pharmaceutically acceptable carrier and the nucleic acid, AAV capsid protein, recombinant virus particle, recombinant adeno-associated virus particle and/or cell of the present invention.

本发明提供了本文中所述的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子、细胞和/或药物组合物在制备用于预防或治疗疾病的药物中的用途。The present invention provides the use of the nucleic acid, AAV capsid protein, recombinant virus particle, recombinant adeno-associated virus particle, cell and/or pharmaceutical composition described herein in the manufacture of a medicament for preventing or treating a disease.

本发明提供了一种生产包含AAV衣壳蛋白的重组病毒粒子的方法，所述方法包括：在体外向细胞提供本发明的核酸、AAV Rep蛋白编码序列、包含异源核酸的重组载体基因组，以及产生生产性感染的辅助子，允许包含AAV衣壳蛋白的重组病毒粒子的组装，以及重组载体基因组的衣壳化。The present invention provides a method of producing a recombinant virion comprising an AAV capsid protein, the method comprising: providing a nucleic acid of the present invention, an AAV Rep protein coding sequence, a recombinant vector genome comprising a heterologous nucleic acid to a cell in vitro, and A helper that produces productive infection, allows assembly of recombinant virions containing AAV capsid proteins, and encapsidation of recombinant vector genomes.

本发明提供了一种生产包含AAV衣壳蛋白的重组AAV粒子的方法，所述方法包括：在体外向细胞提供本发明的核酸、AAV Rep蛋白编码序列、包含异源核酸的rAAV基因组，以及产生生产性感染的辅助子，允许包含AAV衣壳蛋白的重组病毒粒子的组装，以及重组载体基因组的衣壳化。The present invention provides a method of producing recombinant AAV particles comprising an AAV capsid protein, the method comprising: providing a nucleic acid of the present invention, an AAV Rep protein coding sequence, an rAAV genome comprising a heterologous nucleic acid to a cell in vitro, and producing Helper for productive infection, allowing assembly of recombinant virions containing AAV capsid proteins, and encapsidation of recombinant vector genomes.

本发明提供了一种将感兴趣的核酸递送给细胞的方法，所述方法包括向细胞提供本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子和/或药物组合物。The present invention provides a method of delivering a nucleic acid of interest to a cell, the method comprising providing the cell with the nucleic acid, AAV capsid protein, recombinant virion, recombinant adeno-associated virion and/or pharmaceutical composition of the invention.

本发明提供了一种向哺乳动物个体递送感兴趣核酸的方法，所述方法包括：将有效剂量本发明的核酸、AAV衣壳蛋白、重组病毒粒子、重组腺相关病毒粒子、细胞和/或药物组合物施用于哺乳动物。The present invention provides a method of delivering a nucleic acid of interest to a mammalian subject, the method comprising: administering an effective dose of the nucleic acid, AAV capsid protein, recombinant virus particle, recombinant adeno-associated virus particle, cell and/or drug of the present invention The composition is administered to a mammal.

本发明还提供了一种识别具有某种感兴趣嗜性特性的病毒载体例如AAV载体或AAV衣壳蛋白的方法，所述方法包括：The present invention also provides a method for identifying viral vectors, such as AAV vectors or AAV capsid proteins, having a certain tropism characteristic of interest, the method comprising:

(a)提供病毒载体例如AAV载体集合，其中所述集合中的每种病毒载体包括：(a) providing a collection of viral vectors, such as AAV vectors, wherein each viral vector in the collection comprises:

(i)一种AAV衣壳蛋白，其包含通过改组两个或多个不同AAV衣壳蛋白编码序列而产生的衣壳蛋白，其中两个或多个不同AAV衣壳蛋白的氨基酸序列至少有两个氨基酸的不同；和(ii)一种病毒载体基因组例如一种AAV病毒基因组，包括一种编码(i)所述AAV衣壳蛋白的编码序列、一种AAV Rep蛋白编码序列、至少一种与AAV Rep蛋白相互作用的末端重复序列(例如5’和/或3’末端重复)，其中病毒载体基因组包裹于AAV衣壳蛋白中。(i) an AAV capsid protein comprising a capsid protein produced by shuffling two or more different AAV capsid protein coding sequences, wherein the amino acid sequences of the two or more different AAV capsid proteins have at least two and (ii) a viral vector genome, such as an AAV viral genome, comprising a coding sequence encoding (i) the AAV capsid protein, an AAV Rep protein coding sequence, at least one AAV Rep protein interacting terminal repeats (eg, 5' and/or 3' terminal repeats) in which the viral vector genome is encapsulated in the AAV capsid protein.

(b)将病毒载体集合施用于哺乳动物个体，和(b) administering the viral vector collection to a mammalian individual, and

(c)从目标组织中回收多个病毒粒子或编码AAV衣壳蛋白的病毒基因组的病毒载体，从而鉴定出具有感兴趣嗜性的病毒载体或AAV衣壳蛋白。(c) Recovery of multiple virions or viral vectors of viral genomes encoding AAV capsid proteins from the tissue of interest, thereby identifying viral vectors or AAV capsid proteins with a tropism of interest.

本发明的积极效果在于：The positive effects of the present invention are:

本发明提供了一组通过定向进化和体内筛选的方法而获得的新型AAV病毒载体、AAV衣壳蛋白以及包含所述AAV衣壳蛋白的病毒粒子，相对于现有技术中的AAV病毒载体而言，具有感兴趣特质AAV衣壳的病毒，例如，靶向性特质(更高的肝组织靶向性)和/或中和性特质(例如，逃避中和抗体的能力)。The present invention provides a set of novel AAV viral vectors, AAV capsid proteins and virus particles comprising the AAV capsid proteins obtained by methods of directed evolution and in vivo screening, compared with the AAV viral vectors in the prior art , viruses with AAV capsids having traits of interest, eg, targeting traits (higher liver tissue targeting) and/or neutralizing traits (eg, ability to evade neutralizing antibodies).

下面结合附图和具体实施方式对本公开做进一步说明，并非对本公开的限制。凡是依照本专利申请公开内容所进行的任何本领域等同替换，均属于本专利的保护范围。The present disclosure will be further described below with reference to the accompanying drawings and specific embodiments, but is not intended to limit the present disclosure. Any equivalent replacement in the art based on the disclosure content of this patent application shall fall within the protection scope of this patent.

附图说明Description of drawings

图1.质粒文库随机挑选阳性克隆酶切图谱。1-12分别对应随机挑选阳性克隆样品，M代表5000bp DNA Marker。Figure 1. The restriction map of randomly selected positive clones from the plasmid library. 1-12 correspond to randomly selected positive clone samples respectively, and M represents 5000bp DNA Marker.

图2.两轮体内筛选后，AAV突变体在肝脏、骨骼肌和心脏中出现的次数。经第一次体内筛选后共获得370个阳性克隆，经第二次体内筛选后共获得9组高频靶向肝脏的新型AAV序列。Figure 2. Frequency of AAV mutants in liver, skeletal muscle, and heart after two rounds of in vivo screening. After the first in vivo screening, a total of 370 positive clones were obtained, and after the second in vivo screening, a total of 9 groups of novel AAV sequences targeting the liver with high frequency were obtained.

图3A-3R.新型AAV-Cap序列。其中序列图分为核苷酸序列及编码的氨基酸序列。Figures 3A-3R. Novel AAV-Cap sequences. The sequence diagram is divided into nucleotide sequence and encoded amino acid sequence.

图4A-4F.不同体外细胞系感染比较(CAG-EGFP)。不同AAV载体包装成携带CAG-EGFP的相应病毒，在体外通过不同MOI感染不同细胞系，48h后进行流式细胞术检测分析。数值为平均值±标准差。Figures 4A-4F. Comparison of infection of different in vitro cell lines (CAG-EGFP). Different AAV vectors were packaged into corresponding viruses carrying CAG-EGFP, and different cell lines were infected with different MOIs in vitro, and flow cytometry analysis was performed 48 hours later. Values are mean ± standard deviation.

图5A-5E.不同细胞系体外感染比较(CAG-Luciferase)。不同AAV载体包装成携带CAG-Luciferase的相应病毒，在体外通过MOI 500感染不同细胞系，48h后检测Luciferase活性。数值为平均值±标准差。Figures 5A-5E. Comparison of in vitro infection of different cell lines (CAG-Luciferase). Different AAV vectors were packaged into corresponding viruses carrying CAG-Luciferase, and different cell lines were infected with MOI 500 in vitro, and Luciferase activity was detected after 48 hours. Values are mean ± standard deviation.

图6.系统注射AAV载体后，小鼠不同组织中载体基因组拷贝数。不同AAV载体包装成携带CAG-Luciferase的相应病毒，以1E+11vg剂量尾部静脉注射小鼠2周后，不同组织中载体基因组拷贝数比较。数值为平均值±标准差。Figure 6. Vector genome copy number in different tissues of mice after systemic injection of AAV vector. Different AAV vectors were packaged into the corresponding viruses carrying CAG-Luciferase. After 2 weeks of tail vein injection into mice at a dose of 1E+11vg, the number of vector genome copies in different tissues was compared. Values are mean ± standard deviation.

图7A-7E.系统注射AAV载体后，小鼠不同组织中Luciferase活性。不同AAV载体包装成携带CAG-Luciferase的相应病毒以1E+11vg剂量尾部静脉注射小鼠2周后，不同组织中Luciferase活性比较。数值为平均值±标准差。Figures 7A-7E. Luciferase activity in different tissues of mice after systemic injection of AAV vector. Different AAV vectors packaged into corresponding viruses carrying CAG-Luciferase were injected into mice by tail vein at a dose of 1E+11vg for 2 weeks, and the Luciferase activity in different tissues was compared. Values are mean ± standard deviation.

具体实施方式Detailed ways

本发明提供了一组通过体内筛选来实现病毒定向进化的方法而获得的包含感兴趣特质AAV衣壳的病毒载体，例如，靶向性特质和/或中和性特质(例如，逃避中和抗体的能力)。本发明还提供了AAV衣壳和包含AAV衣壳的病毒粒子。The present invention provides a set of viral vectors comprising AAV capsids with traits of interest, e.g., targeting traits and/or neutralizing traits (e.g., evading neutralizing antibodies, obtained by in vivo screening for directed evolution of viruses) Ability). The present invention also provides AAV capsids and virions comprising AAV capsids.

本发明将在下文中进行更详细的阐述。本说明书并非旨在详细列出实施本发明的所有不同方式，此外，本发明所建议的各种实施例的诸多变化在不背离本发明的前提下对于本领域技术人员而言是显而易见的。因此，以下说明书旨在说明本发明的一些具体实施例，而不是详尽地说明其所有排列、组合和变化。The present invention will be explained in more detail below. This description is not intended to enumerate in detail all the different ways to carry out the invention, and moreover, many variations of the various embodiments proposed by the invention will be apparent to those skilled in the art without departing from the invention. Accordingly, the following description is intended to illustrate some specific embodiments of the invention, but not to be exhaustive of all permutations, combinations and variations thereof.

除非另有定义，否则本发明所用的所有技术和科学术语的含义与本发明所属领域的普通技术人员通常理解的含义相同。本发明中用于描述本发明的术语仅用于描述特定实施例，并不用于限制本发明。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used to describe the present invention in the present invention are only used to describe specific embodiments, and are not used to limit the present invention.

除另有说明外，本领域技术人员已知的标准方法可用于生产重组和合成多肽、抗体或其抗原结合片段、操纵核酸序列、生产转化细胞、构建重组AAV、修饰衣壳蛋白、包装包含有AAV rep和/或cap编码序列的载体，以及瞬时或稳定地转染包装细胞。这些技术为本领域技术人员所熟知，参见SAMBROOK等,MOLECULAR CLONING:A LABORATORY MANUAL 2nd Ed.(1989，Cold Spring Harbor,N.Y)；F.M.AUSUBEL等,CURRENT PROTOCOLS IN MOLECULARBIOLOGY(Green Publishing Associates,Inc.and John Wiley&Sons,Inc.,New York)。Unless otherwise indicated, standard methods known to those skilled in the art can be used to produce recombinant and synthetic polypeptides, antibodies or antigen-binding fragments thereof, manipulate nucleic acid sequences, produce transformed cells, construct recombinant AAVs, modify capsid proteins, package AAV rep and/or cap coding sequences, and transient or stable transfection of packaging cells. These techniques are well known to those skilled in the art, see SAMBROOK et al, MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed. (1989, Cold Spring Harbor, N.Y); F.M. AUSUBEL et al, CURRENT PROTOCOLS IN MOLECULARBIOLOGY (Green Publishing Associates, Inc. and John Wiley & Sons, Inc., New York).

本文所提及的所有出版物、专利申请、专利、核苷酸序列、氨基酸序列和其它参考文献全部以引用方式并入。All publications, patent applications, patents, nucleotide sequences, amino acid sequences, and other references mentioned herein are incorporated by reference in their entirety.

I定义IDefinition

本发明说明书及其权利要求书中AAV衣壳亚单位中所有氨基酸位置的指定与VP1衣壳亚单位编号有关。The assignment of all amino acid positions in the AAV capsid subunit in the present specification and its claims relates to the VP1 capsid subunit numbering.

在本发明的说明书及其权利要求书中，除非上下文另有明确说明，否则单数形式“一个”、“一种”和“所述”也包括复数形式。In this specification and its claims, the singular forms "a," "an," and "the" include the plural forms as well, unless the context clearly dictates otherwise.

本发明中，“和/或”指包含一个或多个所述内容的任何和所有可能组合。As used herein, "and/or" is meant to include any and all possible combinations of one or more of the stated items.

如本文所用，术语“基本上包含”与核酸、蛋白质或衣壳结构有关，意指所述核酸、蛋白质或衣壳结构包含任何可显著改变所述感兴趣核酸、蛋白质或衣壳结构功能的元素，例如，所述蛋白质或衣壳或所述核酸编码的蛋白质或衣壳的靶向性特性或中和特性。As used herein, the term "substantially comprises" in relation to a nucleic acid, protein or capsid structure means that the nucleic acid, protein or capsid structure comprises any element that significantly alters the function of the nucleic acid, protein or capsid structure of interest For example, the targeting or neutralizing properties of the protein or capsid or the protein or capsid encoded by the nucleic acid.

本发明上下文中的术语“腺相关病毒(AAV)”，包括但不限于AAV1、AAV2、AAV3A、AAV3B、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、禽AAV、牛AAV、犬AAV、马AAV和羊AAV以及现在已知的或后来发现的任何其它AAV(BERNARD N.FIELDS等,VIROLOGY,volume 2,chapter 69，4th ed.,Lippincott-Raven Publishers)。已经确定的一些额外的AAV血清型和分支，也包括在本发明术语“AAV”中(高光平等，2004，J.Virology 78:6381-6388)。The term "adeno-associated virus (AAV)" in the context of the present invention includes, but is not limited to, AAV1, AAV2, AAV3A, AAV3B, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, avian AAV, bovine AAV, canine AAV AAV, equine AAV and sheep AAV and any other AAV now known or later discovered (BERNARD N. FIELDS et al., VIROLOGY, volume 2, chapter 69, 4th ed., Lippincott-Raven Publishers). Some additional AAV serotypes and clades have been identified and are also included in the term "AAV" of the present invention (Gao Guang et al., 2004, J. Virology 78:6381-6388).

本领域已知各种AAV和细小病毒的基因组序列，以及ITRS、Rep蛋白和衣壳蛋白亚单位的序列，此类序列可在文献或公共数据库中找到。例如，在GenBank数据库中，Genbank登记号NC 002077、NC 001401、NC 001729、NC 001863、NC 001829、NC 001862、NC 000883、NC001701、NC 001510、AF063497、U89790、AF043303、AF028705、AF028704、J02275、J01901、J02275、X01457、AF288061、AH009962、AY028226、AY028223、NC 00135、NC 001540、AF513851、AF5 13852、AY530579、AY63 1965，AY63 1966，这些公开内容通过引用并入本文中。另外，srivistava等，1983，J.Virology 45:555；Chiorini等，1998，J.Virology 71:6823；Chiorini等，1999，J.Virology 73:1309；Bantel-Schaal等，1999，J.Virology 73:939；Xiao等，1999，J.Virology 73:3994；Muramatsu等，1996，Virology 221:208；Shade等，1986，J.Virol.58:921；Gao等，2002，Proc.Nat.Acad.Sci.USA 99:11854；国际公开专利WO00/28061、WO 99/61601、WO 98/11244；美国专利6156303；这些公开内容也通过引用整体并入本文中。AAV1、AAV2和AAV3末端重复序列的早期描述，参见Xiao,X,1996,"Characterization of Adeno-associated virus(AAV)DNA replication andintegration,"Ph.D.Dissertation,University of Pittsburgh,Pittsburgh,Pa，这些内容通过引用整体并入本文中。The genomic sequences of various AAV and parvoviruses are known in the art, as well as the sequences of the ITRS, Rep protein and capsid protein subunits, and such sequences can be found in the literature or in public databases. For example, in the GenBank database, Genbank accession numbers NC 002077, NC 001401, NC 001729, NC 001863, NC 001829, NC 001862, NC 000883, NC001701, NC 001510, AF063497, U89790, AF043303, AF028715, AF08 J02275, X01457, AF288061, AH009962, AY028226, AY028223, NC 00135, NC 001540, AF513851, AF5 13852, AY530579, AY63 1965, AY63 1966, the disclosures of which are incorporated herein by reference. In addition, Srivistava et al., 1983, J. Virology 45:555; Chiorini et al., 1998, J. Virology 71:6823; Chiorini et al., 1999, J. Virology 73:1309; Bantel-Schaal et al., 1999, J. Virology 73: 939; Xiao et al., 1999, J. Virology 73:3994; Muramatsu et al., 1996, Virology 221:208; Shade et al., 1986, J.Virol. 58:921; Gao et al., 2002, Proc.Nat.Acad.Sci. USA 99:11854; International Publications WO 00/28061, WO 99/61601, WO 98/11244; US Patent 6,156,303; these disclosures are also incorporated herein by reference in their entirety. For an early description of AAV1, AAV2, and AAV3 terminal repeats, see Xiao, X, 1996, "Characterization of Adeno-associated virus (AAV) DNA replication and integration," Ph.D. Dissertation, University of Pittsburgh, Pittsburgh, Pa. Incorporated herein by reference in its entirety.

“改组”或“嵌合”的AAV衣壳编码序列或AAV衣壳蛋白是将两个或多个不同的AAV衣壳蛋白序列混合后产生的核酸序列和氨基酸序列结合了两个或多个衣壳序列的部分。一种“改组”或“嵌合”的AAV病毒粒子包含一种被“改组的”或“嵌合的”AAV衣壳蛋白。"Shuffled" or "chimeric" AAV capsid coding sequences or AAV capsid proteins are nucleic acid and amino acid sequences produced by mixing two or more different AAV capsid protein sequences that combine two or more capsid sequences. part of the shell sequence. A "shuffled" or "chimeric" AAV virion comprises a "shuffled" or "chimeric" AAV capsid protein.

此处使用的“靶向性”一词是指病毒优先进入某些细胞或组织类型和/或优先与细胞表面相互作用从而促进其进入某些细胞或组织类型，任选地且优选地在细胞中表达病毒基因组携带的序列，例如，重组病毒表达异源核苷酸序列。在重组AAV基因组的情况下，病毒基因组的基因表达可来自稳定整合的前病毒和/或非整合的附加体，以及病毒核酸可能在细胞内发生的任何其它形式。The term "targeted" as used herein means that the virus preferentially enters certain cells or tissue types and/or preferentially interacts with the cell surface to facilitate its entry into certain cells or tissue types, optionally and preferably in cells Sequences carried by the viral genome are expressed in a recombinant virus, for example, a recombinant virus expresses a heterologous nucleotide sequence. In the case of recombinant AAV genomes, the gene expression of the viral genome may be from stably integrated proviruses and/or non-integrated episomes, as well as any other form of viral nucleic acid that may occur intracellularly.

术语“靶向性特性”是指一个或多个靶细胞、组织和/或器官的转导模式。例如，一些改组的AAV衣壳可能显示肝脏、性腺和/或生殖细胞的有效转导，一些改组的AAV衣壳对骨骼肌、横膈膜肌和/或心肌组织的只有低水平的转导，典型改组AAV衣壳具有的靶向性特征是肝脏的高转导和骨骼肌的低转导。The term "targeting property" refers to the transduction pattern of one or more target cells, tissues and/or organs. For example, some shuffled AAV capsids may show efficient transduction of liver, gonads and/or germ cells, some shuffled AAV capsids have only low levels of transduction to skeletal, diaphragm and/or cardiac tissue, Typical shuffled AAV capsids have targeting features of high transduction in liver and low transduction in skeletal muscle.

如本文所用，由病毒载体对细胞的“转导”是指通过将核酸由病毒载体携带并随后通过病毒载体将遗传物质转移到细胞中。As used herein, "transduction" of a cell by a viral vector refers to the transfer of genetic material into a cell by carrying nucleic acid by the viral vector and subsequent transfer of genetic material by the viral vector.

如本文所用，除非上下文另有说明，否则病毒粒子、载体、衣壳或衣壳蛋白质的“集合”或“多个”意指两个或多个。As used herein, a "collection" or "plurality" of a virion, vector, capsid or capsid protein means two or more unless the context dictates otherwise.

除非另有说明，“有效转导”或“有效靶向性”或类似术语可参照合适的对照品，例如，相对于对照品至少50％、60％、70％、80％、85％、90％、95％或更多的转导或靶向性来确定。Unless otherwise stated, "effective transduction" or "effective targeting" or similar terms may refer to a suitable control, eg, at least 50%, 60%, 70%, 80%, 85%, 90% relative to the control %, 95% or more transduction or targeting.

类似地，可通过参考适当的对照来确定病毒对目标细胞或组织是否“非有效地转导”或“非具有有效的靶向性”。在具体实施例中，病毒载体对骨骼肌、心肌细胞非有效地转导，在具体实施例中，组织的非有效性转导是有效转导组织转导水平的20％或更少、10％或更少、5％或更少、1％或更少、0.1％或更少。Similarly, it can be determined whether a virus "ineffectively transduces" or "ineffectively targets" a target cell or tissue by reference to an appropriate control. In specific embodiments, the viral vector transduces skeletal muscle, cardiomyocytes inefficiently, and in specific embodiments, the ineffective transduction of tissue is 20% or less, 10% of the level of transduction of efficiently transduced tissue or less, 5% or less, 1% or less, 0.1% or less.

如本文所用，除非另有说明，术语"多肽"包括肽和蛋白质。As used herein, unless otherwise specified, the term "polypeptide" includes both peptides and proteins.

“核酸”或“核苷酸序列”是核苷酸碱基序列，可以是RNA、DNA或DNA-RNA杂交序列，包括自然发生和非自然发生的核苷酸，但最好是单链或双链DNA序列。A "nucleic acid" or "nucleotide sequence" is a sequence of nucleotide bases, which may be RNA, DNA, or DNA-RNA hybrid sequences, including naturally occurring and non-naturally occurring nucleotides, but preferably single-stranded or double-stranded Strand DNA sequence.

如本文所用，"分离的"核酸或核酸序列是指与天然存在的生物体或病毒的至少一些其它成分分离的核酸或核酸序列，所述其它成分，例如，通常与核酸或核酸序列相关的细胞或病毒结构成分或其它多肽或核酸。As used herein, an "isolated" nucleic acid or nucleic acid sequence refers to a nucleic acid or nucleic acid sequence that is separated from at least some other component of a naturally occurring organism or virus, eg, a cell with which the nucleic acid or nucleic acid sequence is normally associated or viral structural components or other polypeptides or nucleic acids.

同样，“分离的”多肽是指从自然发生的有机体或病毒的至少一些其它成分中分离的多肽，所述其它成分例如细胞或病毒结构成分或通常与该多肽相关联的其它多肽或核酸。Likewise, an "isolated" polypeptide refers to a polypeptide that is isolated from at least some other components of a naturally occurring organism or virus, such as cellular or viral structural components or other polypeptides or nucleic acids with which the polypeptide is commonly associated.

术语“治疗”或语法上等同的术语，是指受试者病情的严重程度降低或至少部分改善或改善，和/或至少达到一种临床症状的缓解或减少，和/或病情进展有延迟和/或预防或延缓疾病或障碍的发生。此处术语“治疗”也包括对受试者的预防性治疗，例如，预防感染、癌症或疾病的发生。如本文所用，术语“预防”及其语法上等同的术语，包括任何类型的预防降低病情发生率、延缓病情发生和/或进展和/或减轻与病情相关症状的治疗。因此，除非上下文另有说明，术语“治疗”或语法等效术语是指预防和治疗方法或方案。The term "treating" or a grammatically equivalent term refers to a reduction in the severity or at least partial improvement or amelioration of a subject's condition, and/or a relief or reduction of at least one clinical symptom, and/or a delay in the progression of the disease and /or prevent or delay the onset of a disease or disorder. The term "treatment" as used herein also includes prophylactic treatment of a subject, eg, to prevent the development of infection, cancer or disease. As used herein, the term "prophylaxis" and its grammatical equivalents includes any type of treatment that reduces the incidence, delays the onset and/or progression of the disease and/or alleviates symptoms associated with the disease. Thus, unless the context dictates otherwise, the term "treatment" or a grammatical equivalent refers to both prophylactic and therapeutic methods or regimens.

在此使用的"有效的"剂量是足以对受试者获得某种改进或益处的剂量。或者说，"有效的"剂量是使受试者至少一种临床症状缓解或减少的剂量。本领域技术人员将理解，只要受试者获得改进或益处，治疗效果不必是完全的或治愈的。As used herein, an "effective" dose is a dose sufficient to obtain some improvement or benefit in a subject. Alternatively, an "effective" dose is one that relieves or reduces at least one clinical symptom in a subject. Those skilled in the art will understand that the therapeutic effect need not be complete or curative so long as the subject experiences improvement or benefit.

“异源核苷酸序列”或“异源核酸”通常不是在病毒中自然发生的序列。通常，异源核酸或核苷酸序列包含编码多肽和/或非翻译RNA的开放阅读框。A "heterologous nucleotide sequence" or "heterologous nucleic acid" is generally not a sequence that occurs naturally in a virus. Typically, a heterologous nucleic acid or nucleotide sequence comprises an open reading frame encoding a polypeptide and/or untranslated RNA.

"治疗性多肽"可以是一种多肽，可以减轻或减少因细胞或受试者中蛋白缺失或缺陷而导致的症状。此外，"治疗性多肽"可以是一种以其它方式给受试者带来益处的多肽，例如抗癌效果或移植存活率的提高。A "therapeutic polypeptide" can be a polypeptide that alleviates or reduces symptoms resulting from a deletion or deficiency of a protein in a cell or subject. In addition, a "therapeutic polypeptide" can be one that otherwise confers a benefit to the subject, such as an anticancer effect or an increase in transplant survival.

如本文所用，“载体”、“病毒载体”、“递送载体”通常指作为核酸递送载体的病毒粒子，它包括包装在病毒体内的病毒核酸即载体基因组。根据本发明的病毒载体包括本发明嵌合的AAV衣壳，并且可以包装AAV或重组AAV基因组或包括病毒核酸在内的任何其它核酸。或者，在某些情况下，术语“载体”、“病毒载体”、“递送载体”可用于指在没有病毒粒子情况下的载体基因组和/或指作为转运体的病毒衣壳，以传递与衣壳相连或包装在衣壳内的分子。As used herein, "vector", "viral vector" and "delivery vector" generally refer to virions as nucleic acid delivery vehicles, which include viral nucleic acid packaged within the virion, ie, the vector genome. The viral vector according to the present invention comprises the chimeric AAV capsid of the present invention and can package AAV or recombinant AAV genome or any other nucleic acid including viral nucleic acid. Alternatively, in some cases, the terms "vector", "viral vector", "delivery vector" may be used to refer to the vector genome in the absence of virions and/or to the viral capsid as a transporter to deliver the Shell-attached or packaged molecules within a capsid.

“重组AAV载体基因组”或“rAAV基因组”是一种AAV基因组，其包含至少一个反向末端重复和一个或多个异源核苷酸序列。rAAV载体通常将145个碱基末端重复序列(TRs)保留在顺式结构中以产生病毒；然而，修饰的AAV-TRs和非AAV-TRs也可用于此目的。所有其它的病毒序列是可有可无的，并且可以反式提供(Muzyczka，1992，curr-topicsmicrobial.Immunol.158:97)。rAAV载体视情况可包含两个TRs，例如AAV TRs，通常位于异源核苷酸序列的5'和3'端，但不必与之相邻。TRs可以是相同的，也可以是不同的。载体基因组也可以在3'或5'端包含一个TR。A "recombinant AAV vector genome" or "rAAV genome" is an AAV genome comprising at least one inverted terminal repeat and one or more heterologous nucleotide sequences. rAAV vectors typically retain 145 base terminal repeats (TRs) in cis to generate viruses; however, modified AAV-TRs and non-AAV-TRs can also be used for this purpose. All other viral sequences are dispensable and available in trans (Muzyczka, 1992, curr-topicsmicrobial. Immunol. 158:97). An rAAV vector may optionally contain two TRs, eg, AAV TRs, usually located 5' and 3' to the heterologous nucleotide sequence, but not necessarily adjacent to it. The TRs can be the same or different. The vector genome may also contain a TR at the 3' or 5' end.

术语“末端重复”或“tr”包括形成发夹结构并作为反向末端重复发挥作用的任何病毒末端重复或合成序列，即介导诸如复制、病毒包装、整合和/或前病毒拯救等所需功能。TR可以是AAV TR或非AAV TR。例如，非AAV TR序列可以是其它细小病毒，例如，犬细小病毒CPV、小鼠细小病毒MVM、人类细小病毒B-19，或作为SV40复制源的SV40发夹，并可以通过截断、替换、删除、插入进一步修改。此外，TR可以部分或完全合成，如美国专利US5478745中所述的“双D序列”。The term "terminal repeat" or "tr" includes any viral terminal repeat or synthetic sequence that forms a hairpin structure and functions as an inverted terminal repeat, i.e. required to mediate factors such as replication, viral packaging, integration and/or proviral rescue, etc. Function. TRs can be AAV TRs or non-AAV TRs. For example, non-AAV TR sequences can be other parvoviruses, e.g., canine parvovirus CPV, mouse parvovirus MVM, human parvovirus B-19, or the SV40 hairpin that is the source of replication for SV40, and can be modified by truncation, substitution, deletion , Insert further modifications. In addition, TR can be partially or fully synthesized, as described in US Pat. No. 5,478,745 for the "double D sequence".

“AAV-末端重复”或“AAV TR”可来自任何AAV，包括但不限于血清型1、2、3、4、5、6、7、8、9、10、11或任何其它已知或以后发现的AAV。只要末端重复介导所需功能，例如复制、病毒包装、整合和/或前病毒拯救等，则无需具有天然末端重复序列，例如，天然AAV TR序列可通过插入、删除、截断和/或错义突变而改变。An "AAV-terminal repeat" or "AAV TR" can be from any AAV, including but not limited to serotypes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or any other known or later Discovered AAV. It is not necessary to have native terminal repeats as long as the terminal repeats mediate the desired function, such as replication, viral packaging, integration and/or proviral rescue, etc., e.g., native AAV TR sequences can be modified by insertion, deletion, truncation and/or missense change by mutation.

术语“重组AAV粒子”和“重组AAV颗粒”可以互换使用。“重组AAV粒子”或“重组AAV颗粒”包含包装在AAV衣壳内的重组AAV载体基因组。The terms "recombinant AAV particle" and "recombinant AAV particle" are used interchangeably. A "recombinant AAV particle" or "recombinant AAV particle" comprises a recombinant AAV vector genome packaged within an AAV capsid.

“基本保留”某种特性，是指至少保留约75％、85％、90％、95％、97％、98％、99％或100％的所述特性，例如，活性或其它可测量特性。"Substantially retaining" a property means retaining at least about 75%, 85%, 90%, 95%, 97%, 98%, 99% or 100% of the property, eg, activity or other measurable property.

II、定向进化和体内筛选法鉴定的嵌合AAV衣壳II. Chimeric AAV capsids identified by directed evolution and in vivo screening

发明人已鉴定出具有感兴趣特征的“嵌合”或“改组”AAV衣壳结构，例如，靶向性特性和/或中和特性。在具体实施例中，嵌合AAV衣壳显示肝脏的高效转导和/或骨骼肌和/或心肌的低效转导。The inventors have identified "chimeric" or "shuffled" AAV capsid structures with characteristics of interest, eg, targeting properties and/or neutralizing properties. In specific embodiments, the chimeric AAV capsids exhibit efficient transduction of liver and/or inefficient transduction of skeletal and/or cardiac muscle.

因此，在一些实施例中，本发明提供包含或基本上包含以下氨基酸序列或由以下氨基酸序列组成的嵌合AAV衣壳以及包含所述嵌合AAV衣壳的病毒，所述氨基酸序列在图3B、3D、3F、3H、3J、3L、3N、3P或3R中展示。Accordingly, in some embodiments, the present invention provides chimeric AAV capsids comprising or substantially comprising or consisting of the following amino acid sequences, and viruses comprising the chimeric AAV capsids, the amino acid sequences in Figure 3B , 3D, 3F, 3H, 3J, 3L, 3N, 3P or 3R.

在具体实施例中，所述嵌合AAV衣壳蛋白可以包含或基本上包含分别由图3B、图3D、图3F、图3H、图3J、图3L、图3N、图3P或图3R所示的氨基酸序列，或由这些图所示的氨基酸序列组成。In particular embodiments, the chimeric AAV capsid protein may comprise or consist essentially of that shown in Figure 3B, Figure 3D, Figure 3F, Figure 3H, Figure 3J, Figure 3L, Figure 3N, Figure 3P, or Figure 3R, respectively the amino acid sequence of, or consists of the amino acid sequences shown in these figures.

此外，在非限制性实施例中，本发明的嵌合AAV衣壳蛋白可由核酸编码，所述核酸包含或基本上包含分别由图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q所示的核苷酸序列，或由这些图所示的核苷酸序列组成；或编码由上述任何一种核苷酸序列编码的AAV衣壳或衣壳蛋白的核苷酸序列，但由于密码子的简并性而与上述核苷酸序列不同。在本发明说明书和所附权利要求书中的所有氨基酸位置的指定均与VP1编号有关。本领域技术人员将理解，由于AAV衣壳编码序列的重叠，本文所述的修改也可能导致VP2和/或VP3衣壳亚基的改变。Furthermore, in a non-limiting example, the chimeric AAV capsid proteins of the present invention may be encoded by nucleic acids comprising or substantially comprising the components shown in Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, respectively , the nucleotide sequence shown in Figure 3M, Figure 3O or Figure 3Q, or consist of the nucleotide sequences shown in these Figures; or encode an AAV capsid or capsid protein encoded by any of the above nucleotide sequences , but differs from the above nucleotide sequence due to the degeneracy of codons. All assignments of amino acid positions in the present specification and in the appended claims relate to VP1 numbering. Those of skill in the art will appreciate that the modifications described herein may also result in changes to the VP2 and/or VP3 capsid subunits due to overlapping AAV capsid coding sequences.

本发明还提供了包含或基本上包含以下氨基酸序列或由以下氨基酸序列组成的嵌合AAV衣壳蛋白，评价诸如病毒转导和/或抗体中和等生物学性质的方法在本领域内是众所周知的。The present invention also provides chimeric AAV capsid proteins comprising or consisting essentially of the following amino acid sequences, methods of assessing biological properties such as viral transduction and/or antibody neutralization are well known in the art of.

保守氨基酸取代在本领域已知。在具体实施例中，保守氨基酸取代包括以下组中的一个或多个取代：甘氨酸、丙氨酸；缬氨酸、异亮氨酸、亮氨酸；天冬氨酸、谷氨酸；天冬酰胺、谷氨酰胺；丝氨酸、苏氨酸；赖氨酸、精氨酸；和/或苯丙氨酸、酪氨酸。Conservative amino acid substitutions are known in the art. In specific embodiments, conservative amino acid substitutions include one or more of the following group substitutions: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; aspartic acid amide, glutamine; serine, threonine; lysine, arginine; and/or phenylalanine, tyrosine.

本领域技术人员显而易见地将图3B、3D、3F、3H、3J、3L、3N、3P、3R所示的嵌合AAV衣壳蛋白的氨基酸序列应用于本领域已知的其它任何修饰，通过进一步修饰以获得所需的特性。例如，对AAV2衣壳序列进行R484E和R585E突变，则改善AAV载体对心脏的转导(Muller等，2006，Cardiovascular Research 70:70-78)。做为进一步非限制性修饰的可能性，对衣壳蛋白进行修饰以纳入靶向序列或有助于纯化和/或检测的序列，例如衣壳蛋白可与谷胱甘肽-S-转移酶、麦芽糖结合蛋白、肝素/硫酸肝素结合域、多聚-HIS、配体和/或报告蛋白、免疫球蛋白Fc片段、单链抗体、血凝素、C-MYC、标签表位等的全部或部分融合以形成融合蛋白。本领域已知的将靶向肽插入AAV衣壳的方法，例如，国际专利WO 00/28004；Nicklin等，2001，Molecular Therapy 474-181；White等，2004，Circulation 109:513-319；Muller等，2003，Nature Biotech 21:1040-1046。It will be obvious to those skilled in the art to apply the amino acid sequence of the chimeric AAV capsid protein shown in Figures 3B, 3D, 3F, 3H, 3J, 3L, 3N, 3P, 3R to any other modifications known in the art, by further Retouch to obtain desired properties. For example, R484E and R585E mutations in the AAV2 capsid sequence improved transduction of the heart by AAV vectors (Muller et al., 2006, Cardiovascular Research 70:70-78). As a further non-limiting modification possibility, the capsid protein is modified to incorporate targeting sequences or sequences that facilitate purification and/or detection, e.g. the capsid protein can interact with glutathione-S-transferase, All or part of maltose binding protein, heparin/heparin sulfate binding domain, poly-HIS, ligand and/or reporter protein, immunoglobulin Fc fragment, single chain antibody, hemagglutinin, C-MYC, tag epitopes, etc. fused to form fusion proteins. Methods of inserting targeting peptides into AAV capsids are known in the art, eg, International Patent WO 00/28004; Nicklin et al., 2001, Molecular Therapy 474-181; White et al., 2004, Circulation 109:513-319; Muller et al. , 2003, Nature Biotech 21:1040-1046.

本发明的病毒可进一步包含如国际专利WO01/92551及美国专利US7465583中所述的双重病毒基因组。The virus of the present invention may further comprise a dual viral genome as described in International Patent WO01/92551 and US Patent US7465583.

本发明还提供了包含本发明嵌合AAV衣壳蛋白的重组病毒粒子，其中载体基因组包裹于病毒粒子中，优选AAV载体基因组。在具体实施例中，本发明提供了一种重组AAV粒子，其包含本发明的嵌合AAV衣壳蛋白，其中AAV载体基因组包裹于AAV衣壳中。The present invention also provides a recombinant virus particle comprising the chimeric AAV capsid protein of the present invention, wherein the vector genome is encapsulated in the virus particle, preferably the AAV vector genome. In a specific embodiment, the present invention provides a recombinant AAV particle comprising the chimeric AAV capsid protein of the present invention, wherein the AAV vector genome is encapsulated in the AAV capsid.

在具体实施例中，病毒是包含感兴趣异源核酸的重组载体。因此，本发明可用于将核酸在体外和体内递送至细胞。在代表性实施例中，本发明的重组载体用于将核酸递送或转移到动物细胞，优选哺乳动物细胞。In specific embodiments, the virus is a recombinant vector comprising a heterologous nucleic acid of interest. Thus, the present invention can be used to deliver nucleic acids to cells in vitro and in vivo. In representative embodiments, the recombinant vectors of the present invention are used to deliver or transfer nucleic acids to animal cells, preferably mammalian cells.

任何异源核苷酸序列均可由本发明病毒载体递送。感兴趣的核酸包括编码多肽的核酸，任选治疗性多肽和/或免疫原性多肽。Any heterologous nucleotide sequence can be delivered by the viral vector of the present invention. Nucleic acids of interest include nucleic acids encoding polypeptides, optionally therapeutic polypeptides and/or immunogenic polypeptides.

治疗性多肽包括但不限于胰岛素、胰高血糖素、生长激素、甲状旁腺激素、生长激素释放因子、卵泡刺激素、黄体激素、人绒毛膜促性腺激素、血管内皮生长因子、促血管生成素、血管抑制素、粒细胞集落刺激因子、促红细胞生产素、结缔组织生长因子、碱性成纤维细胞生长因子、酸性成纤维细胞生长因子、表皮生长因子、血小板衍生的生长因子、胰岛素生长因子Ⅰ和Ⅱ、转化生长因子α超家族中的任一个、活化素、抑制素、骨形态发生蛋白中的任一个、神经生长因子、脑衍生神经营养因子、神经营养蛋白NT-3和NT-4/5、睫状神经营养因子、胶质细胞系衍生神经营养因子、聚集蛋白、脑信号蛋白/瓦解蛋白家族的任一个、导蛋白-1和导蛋白-2、肝细胞生长因子、肝配蛋白、头蛋白、sonic hedgehog蛋白和酪氨酸羟化酶、血小板生成素、白介素IL-1到1L-25、单核细胞化学诱导物蛋白、白血病抑制因子、粒细胞-巨噬细胞集落刺激因子、Fas配体、肿瘤坏死因子α和β、干扰素α、β和γ、干细胞因子、flk-2/flt3配体。免疫系统产生的基因产物也可用于本发明，这些产物包括但不限于免疫球蛋白1gG、IgM、IgA、IgD和IgE、嵌合免疫球蛋白、人源化抗体、单链抗体、T细胞受体、嵌合T细胞受体、单链T细胞受体、Ⅰ类和Ⅱ类MHC分子以及改造的免疫球蛋白、补体调节蛋白、膜辅因子蛋白、衰变加速因子、CR1、CF2和CD59，低密度脂蛋白受体、高密度脂蛋白受体、极低密度脂蛋白受体和清除受体、糖皮质激素受体和雌激素受体、维生素D受体和其它核受体、jun/fos、max、mad、血清效应因子、AP-1、AP2、myb、MyoD和肌细胞生成蛋白、TFE3、E2F、ATF1、ATF2、ATF3、ATF4、ZF5、NFAT、CREB、HNF-4、C/EBP、CCAAT-盒结合蛋白、干扰素调节因子、Wilms肿瘤蛋白、ETS-结合蛋白、STAT、GATA-盒结合蛋白和翼状螺旋蛋白的叉头蛋白家族，氨甲酰合成酶1、鸟氨酸转氨甲酰酶、精氨基琥珀酸合成酶、精氨基琥珀酸裂合酶、精氨酸酶、延胡索酰乙酰乙酸水解酶、苯丙氨酸羟化酶、α-1抗胰蛋白酶、葡萄糖苷酶、葡萄糖-6-磷酸酶、胆色素原脱氨酶、胱硫醚B合成酶、支链酮酸脱羧酶、异戊酰-CoA脱氢酶、丙酰-CoA羧化酶、甲基丙二酰CoA变位酶、戊二酰-CoA脱氢酶、胰岛素、β-葡糖苷酶、丙酮酸羧酸盐、肝磷酸化酶、磷酸化酶激酶、甘氨酸脱羧酶、H-蛋白、T-蛋白、囊性纤维化跨膜传导调节蛋白、抗肌萎缩蛋白、α-半乳糖苷酶、β-半乳糖苷酶、溶酶体酶、凝血因子、以及在需要的个体中具有治疗作用的任何其它多肽。Therapeutic polypeptides include, but are not limited to, insulin, glucagon, growth hormone, parathyroid hormone, growth hormone releasing factor, follicle stimulating hormone, progesterone, human chorionic gonadotropin, vascular endothelial growth factor, angiopoietin , angiostatin, granulocyte colony stimulating factor, erythropoietin, connective tissue growth factor, basic fibroblast growth factor, acidic fibroblast growth factor, epidermal growth factor, platelet-derived growth factor, insulin growth factor I and II, any of the transforming growth factor alpha superfamily, activin, inhibin, any of bone morphogenetic proteins, nerve growth factor, brain-derived neurotrophic factor, neurotrophin NT-3 and NT-4/ 5. Ciliary neurotrophic factor, glial cell line-derived neurotrophic factor, aggregin, any of the semaphorin/disintegrin family, transin-1 and transin-2, hepatocyte growth factor, ephrin, Noggin, sonic hedgehog protein and tyrosine hydroxylase, thrombopoietin, interleukins IL-1 to IL-25, monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas Ligands, tumor necrosis factor alpha and beta, interferon alpha, beta and gamma, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system can also be used in the present invention, including but not limited to immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors , chimeric T cell receptors, single-chain T cell receptors, class I and class II MHC molecules, and engineered immunoglobulins, complement regulatory proteins, membrane cofactor proteins, decay accelerating factors, CR1, CF2, and CD59, low density Lipoprotein receptors, high density lipoprotein receptors, very low density lipoprotein receptors and clearance receptors, glucocorticoid receptors and estrogen receptors, vitamin D receptors and other nuclear receptors, jun/fos, max , mad, serum effector, AP-1, AP2, myb, MyoD and myogenin, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, CCAAT- Forkhead protein family of box-binding proteins, interferon regulators, Wilms tumor proteins, ETS-binding proteins, STATs, GATA-box-binding proteins, and winged helical proteins, carbamoyl synthase 1, ornithine transcarbamylase , argininosuccinate synthase, argininosuccinate lyase, arginase, fumaric acetoacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucosidase, glucose-6 - Phosphatase, porphobilinogen deaminase, cystathionine B synthase, branched-chain ketoacid decarboxylase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, methylmalonyl-CoA metathesis Enzymes, Glutaryl-CoA Dehydrogenase, Insulin, Beta-Glucosidase, Pyruvate Carboxylate, Liver Phosphorylase, Phosphorylase Kinase, Glycine Decarboxylase, H-Protein, T-Protein, Cystic Fiber transmembrane conductance regulator proteins, dystrophin, alpha-galactosidase, beta-galactosidase, lysosomal enzymes, coagulation factors, and any other polypeptides that have therapeutic effects in an individual in need thereof.

编码多肽的异源核苷酸序列包括编码报告基因多肽的序列。本领域已知的报告基因编码的多肽，包括但不限于绿色荧光蛋白、β-半乳糖苷酶、碱性磷酸酶、荧光素酶和氯霉素乙酰转移酶等。A heterologous nucleotide sequence encoding a polypeptide includes a sequence encoding a reporter gene polypeptide. Polypeptides encoded by reporter genes known in the art include, but are not limited to, green fluorescent protein, β-galactosidase, alkaline phosphatase, luciferase, and chloramphenicol acetyltransferase.

另一方面，异源核酸可编码一种反义寡核苷酸，包括核酶、干扰RNA，干扰RNA包括介导基因沉默的小干扰RNA(Sharp等，2000，Science 287:2431)、microRNA、其它非翻译功能RNAs，如“指导”RNAS(Gorman等，1998，Proc.Nat.Acad.Sci.USA 95:4929；美国专利US5869248)等。On the other hand, the heterologous nucleic acid can encode an antisense oligonucleotide, including ribozymes, interfering RNAs including small interfering RNAs that mediate gene silencing (Sharp et al., 2000, Science 287:2431), microRNAs, Other non-translated functional RNAs, such as "guide" RNAS (Gorman et al., 1998, Proc. Nat. Acad. Sci. USA 95:4929; US Pat. No. 5,869,248) and the like.

所属技术领域内已知，反义核酸和抑制性RNA序列可诱导“外显子跳跃”。因此，异源核酸可以编码反义核酸或抑制RNA，诱导适当的外显子跳跃。It is known in the art that antisense nucleic acid and inhibitory RNA sequences can induce "exon skipping". Thus, the heterologous nucleic acid can encode an antisense nucleic acid or inhibitory RNA that induces appropriate exon skipping.

核酶是以位点特异性方式裂解核酸的RNA蛋白复合物。核酶具有特定的催化域，具有内切酶活性(Kim等，1987，Proc.Natl.Acad.Sci.USA 84:8788；Gerlach等，1987，Nature328:802；Forster and Symons，1987，Cell 49:211)。Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific manner. Ribozymes have specific catalytic domains with endonuclease activity (Kim et al., 1987, Proc. Natl. Acad. Sci. USA 84:8788; Gerlach et al., 1987, Nature 328:802; Forster and Symons, 1987, Cell 49: 211).

microRNAs是一种天然的细胞RNA分子，通过控制mRNA的稳定性来调节多个基因的表达。特定microRNA的过度表达或减少可用于治疗功能障碍，并已被证明在许多疾病状态和疾病动物模型中有效(Couzin，2008，Science 319：1782-1784)。嵌合AAV可用于将microRNA导入细胞、组织和受试者中，用于治疗遗传病和获得性疾病，或增强某些组织的功能并促进其生长，例如，mir-1、mir-133、mir-206和/或mir-208可用于治疗心脏和骨骼肌疾病(Chen等，2006，Genet 38：228-233；van Rooij等，2008，Trends Genet。24：159-166)，microRNA也可用于基因传递后调节免疫系统(Brown等，2007，Blood 110：4144-4152)。MicroRNAs are natural cellular RNA molecules that regulate the expression of multiple genes by controlling mRNA stability. Overexpression or reduction of specific microRNAs can be used to treat dysfunction and has been shown to be effective in many disease states and animal models of disease (Couzin, 2008, Science 319:1782-1784). Chimeric AAVs can be used to introduce microRNAs into cells, tissues, and subjects for the treatment of genetic and acquired diseases, or to enhance the function and growth of certain tissues, e.g., mir-1, mir-133, mir -206 and/or mir-208 can be used in the treatment of cardiac and skeletal muscle diseases (Chen et al., 2006, Genet 38:228-233; van Rooij et al., 2008, Trends Genet. 24:159-166), and microRNAs can also be used in genes Modulation of the immune system after delivery (Brown et al., 2007, Blood 110:4144-4152).

本文所用术语“反义寡核苷酸”，包括“反义RNA”，是指与特定DNA或RNA序列互补并特异杂交的核酸。反义寡核苷酸和编码相同的核酸可以按照常规技术制造。The term "antisense oligonucleotide," including "antisense RNA," as used herein, refers to a nucleic acid that is complementary to and specifically hybridizes to a specific DNA or RNA sequence. Antisense oligonucleotides and nucleic acids encoding the same can be made according to conventional techniques.

本领域技术人员理解，反义寡核苷酸不必与目标序列完全互补，只要序列相似程度足以使反义核苷酸序列特异地与目标序列杂交并减少蛋白质产物的产生，例如，相似度至少约30％、40％、50％、60％、70％、80％、90％、95％或更多。为了确定杂交的特异性，这种寡核苷酸与靶序列的杂交可以在弱、中、甚至严格的条件下进行。It is understood by those of skill in the art that an antisense oligonucleotide does not have to be completely complementary to a target sequence, so long as the sequence similarity is sufficient to allow the antisense nucleotide sequence to specifically hybridize to the target sequence and reduce production of a protein product, eg, at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more. Hybridization of such oligonucleotides to target sequences can be performed under weak, moderate, or even stringent conditions in order to determine the specificity of hybridization.

反义寡核苷酸可通过本领域已知的程序化学合成和酶结合反应合成。例如，一个反义寡核苷酸可以用自然产生的核苷酸或各种修饰的核苷酸进行化学合成，其目的是增加分子的生物稳定性和/或增加反义链和正义链之间形成的双链稳定性，例如，可以使用硫代磷酸酯衍生物和吖啶取代的核苷酸。Antisense oligonucleotides can be synthesized by chemical synthesis and enzymatic conjugation reactions by procedures known in the art. For example, an antisense oligonucleotide can be chemically synthesized using naturally occurring nucleotides or various modified nucleotides with the purpose of increasing the biological stability of the molecule and/or increasing the distance between the antisense and sense strands For the stability of the formed duplex, for example, phosphorothioate derivatives and acridine-substituted nucleotides can be used.

可用于产生反义寡核苷酸的修饰核苷酸，包括5-氟尿嘧啶、5-溴尿嘧啶、5-氯尿嘧啶、5-碘尿嘧啶、次黄嘌呤、黄嘌呤、4-乙酰胞嘧啶、5-羧甲基氨基甲基-2-硫尿嘧啶、5-羧甲基氨基甲基尿嘧啶、二氢尿嘧啶、β-D-半乳糖基喹啉、肌苷、N6-异戊烯基赖氨酸、1-甲基鸟嘌呤、1-甲基肌苷、2,2-二甲基鸟嘌呤、2-甲基腺嘌呤、2-甲基鸟嘌呤、3-甲基胞苷、5-甲基胞嘧啶、N6-腺嘌呤、7-甲基鸟嘌呤、5-甲基氨基甲基尿嘧啶、5-甲氧基氨基甲基-2-硫脲嘧啶、β-D-甘露糖基喹啉、5’-甲氧基羧甲基脲嘧啶、5-甲氧基脲嘧啶、2-甲硫基-N6-异戊烯基胺、脲嘧啶-5-氧乙酸、喹啉、2-硫胞嘧啶、5-甲基-2-硫脲嘧啶、2-硫脲嘧啶、4-硫脲嘧啶、5-甲基脲嘧啶、脲嘧啶-5-氧乙酸甲酯、尿嘧啶-5-氧乙酸、5-甲基-2-硫脲嘧啶、和2,6-二氨基嘌呤等。Modified nucleotides that can be used to generate antisense oligonucleotides, including 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine , 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, β-D-galactosylquinoline, inosine, N6-isoamyl lysine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytidine, 5-Methylcytosine, N6-Adenine, 7-Methylguanine, 5-Methylaminomethyluracil, 5-Methoxyaminomethyl-2-thiouracil, β-D-Mannose quinoline, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-prenylamine, uracil-5-oxyacetic acid, quinoline, 2 -thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxoacetate methyl ester, uracil-5- Oxyacetic acid, 5-methyl-2-thiouracil, and 2,6-diaminopurine, etc.

反义寡核苷酸可经化学修饰与另一分子共价结合。例如，反义寡核苷酸可以与一种分子结合，这种分子有助于传递到感兴趣的细胞，增强鼻粘膜的吸收、提供可检测的标记物、增加寡核苷酸的生物利用度、提高寡核苷酸的稳定性、或改善制剂或药代动力学特性等。共轭分子包括但不限于胆固醇、脂类、多胺、聚酰胺、聚酯、报告分子、生物素、染料、聚乙二醇、人血清白蛋白、酶、抗体或抗体片段或细胞受体配体。Antisense oligonucleotides can be chemically modified to covalently bind to another molecule. For example, antisense oligonucleotides can be conjugated to a molecule that facilitates delivery to cells of interest, enhances absorption from the nasal mucosa, provides a detectable label, and increases the bioavailability of the oligonucleotide , improve the stability of oligonucleotides, or improve formulation or pharmacokinetic properties, etc. Conjugated molecules include, but are not limited to, cholesterol, lipids, polyamines, polyamides, polyesters, reporter molecules, biotin, dyes, polyethylene glycol, human serum albumin, enzymes, antibodies or antibody fragments, or cellular receptor ligands. body.

为提高稳定性、核酸酶抗性、生物利用度、制剂特征和/或药代动力学特性，对核酸进行的其它修饰在本领域内也是已知的。Other modifications to nucleic acids are also known in the art to improve stability, nuclease resistance, bioavailability, formulation characteristics, and/or pharmacokinetic properties.

RNA干扰是转录后基因沉默的一种机制，它将与靶序列相对应的双链RNA(DsRNA)导入细胞或生物体中，导致相应mRNA的降解。RNAi实现基因沉默的机制已在多篇综述文章中报道(Sharp等，2001，Genes Dcv 15:485-490；Hammond等，2001，Nature Rev Gen 2:110-119)。在基因表达恢复之前，RNAi效应在多个细胞分裂中持续存在。因此，RNAi是一种在RNA水平上进行靶向敲除的有效方法。RNAi已被证明在人类细胞，包括人类胚胎肾脏和HeLa细胞中获得成功(Elbashir等，2001，Nature 411:494-498)。现已证明，大约21个核苷酸的短合成dsRNA，也被称为“短干扰RNA”，可以在不触发抗病毒反应的情况下介导哺乳动物细胞的沉默(Elbashir等，2001，Nature 411:494-498；Caplen等，2001，Proc.Nat Acad.Sci.98:9742)。RNA interference is a mechanism of post-transcriptional gene silencing that introduces double-stranded RNA (DsRNA) corresponding to the target sequence into cells or organisms, resulting in the degradation of the corresponding mRNA. The mechanism by which RNAi achieves gene silencing has been reported in several review articles (Sharp et al., 2001, Genes Dcv 15:485-490; Hammond et al., 2001, Nature Rev Gen 2:110-119). RNAi effects persist across multiple cell divisions until gene expression is restored. Therefore, RNAi is an efficient method for targeted knockout at the RNA level. RNAi has been shown to be successful in human cells, including human embryonic kidney and HeLa cells (Elbashir et al., 2001, Nature 411:494-498). It has been demonstrated that short synthetic dsRNAs of approximately 21 nucleotides, also known as "short interfering RNAs", can mediate silencing in mammalian cells without triggering an antiviral response (Elbashir et al., 2001, Nature 411 : 494-498; Caplen et al., 2001, Proc. Nat Acad. Sci. 98:9742).

RNAi分子可以是短发夹RNA(Paddison等，2002，PNAS USA 99:1443-1448)，它在细胞中通过RNaseIII酶切作用加工成20-25长度的sirna分子。ShRNA通常有一个茎环结构，即两个反向重复序列由一个短间隔序列连接。The RNAi molecule can be a short hairpin RNA (Paddison et al., 2002, PNAS USA 99:1443-1448), which is processed in cells into siRNA molecules of 20-25 length by RNaseIII cleavage. ShRNAs usually have a stem-loop structure, that is, two inverted repeats are connected by a short spacer sequence.

产生RNAi的方法包括化学合成、体外转录、Dicer体外或体内消化长dsRNA、递送载体在体内表达以及从PCR来源RNAi表达盒在体内表达。Methods of producing RNAi include chemical synthesis, in vitro transcription, Dicer digestion of long dsRNA in vitro or in vivo, delivery vector expression in vivo, and in vivo expression from PCR-derived RNAi expression cassettes.

RNAi分子的反义区可以完全与目标序列互补，但只要它与目标序列特异性杂交并减少蛋白质产物的产生，如至少约30％、40％、50％、60％、70％、80％、90％、95％或更多，则不需要与目标序列完全互补。在一些实施例中，所述寡核苷酸与目标序列的杂交可在如上文所定义的弱严格性、中等严格性或甚至高等严格性条件下进行。The antisense region of the RNAi molecule can be completely complementary to the target sequence, as long as it specifically hybridizes to the target sequence and reduces the production of protein products, such as at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more, it does not need to be fully complementary to the target sequence. In some embodiments, the hybridization of the oligonucleotide to the target sequence can be performed under conditions of weak, moderate or even high stringency as defined above.

在其它实施方案中，RNAi的反义区域与目标序列具有至少约60％、70％、80％、90％、95％、97％、98％或更高的序列一致性，并减少蛋白质产物的产生至少约30％、40％、50％、60％、70％、80％、90％、95％或更多。在一些实施例中，与目标序列相比，反义区域包含1、2、3、4、5、6、7、8、9或10个不匹配。不匹配通常在dsRNA末端比在中心部分更容易接受。RNAi分子可以包含修饰糖、修饰核苷酸、骨架连接以及上述反义寡核苷酸的其它修饰。In other embodiments, the antisense region of RNAi has at least about 60%, 70%, 80%, 90%, 95%, 97%, 98% or greater sequence identity to the target sequence and reduces the protein product Produce at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more. In some embodiments, the antisense region comprises 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 mismatches compared to the target sequence. Mismatches are generally more acceptable at the ends of the dsRNA than at the central portion. RNAi molecules may contain modified sugars, modified nucleotides, backbone linkages, and other modifications of the antisense oligonucleotides described above.

本发明还提供表达免疫原性多肽的重组病毒载体。异源核酸可编码本领域已知的任何感兴趣的免疫原，包括但不限于来自人类免疫缺陷病毒、流感病毒、Gag蛋白、肿瘤抗原、癌症抗原、细菌抗原、病毒抗原等。或者，该免疫原可呈现在病毒衣壳中，例如，通过共价修饰与病毒外壳结合。The present invention also provides recombinant viral vectors expressing immunogenic polypeptides. The heterologous nucleic acid can encode any immunogen of interest known in the art, including, but not limited to, from human immunodeficiency virus, influenza virus, Gag proteins, tumor antigens, cancer antigens, bacterial antigens, viral antigens, and the like. Alternatively, the immunogen can be presented in the viral capsid, eg, bound to the viral coat by covalent modification.

细小病毒用作疫苗为本领域所公知(US5916563、US5905040、US5882652)。抗原可能存在于病毒衣壳中，或者抗原可以从引入重组载体基因组的异源核酸中表达。The use of parvoviruses as vaccines is well known in the art (US5916563, US5905040, US5882652). The antigen may be present in the viral capsid, or the antigen may be expressed from a heterologous nucleic acid introduced into the recombinant vector genome.

免疫原性多肽或免疫原可以是任何适用于保护受试者免受疾病的多肽，包括但不限于微生物、细菌、原生动物、寄生虫、真菌和病毒性疾病。免疫原可以是原粘液病毒免疫原，例如，流感病毒免疫原，流感病毒血凝素表面蛋白或流感病毒核蛋白基因；或慢病毒免疫原，例如，马传染性贫血病毒免疫原、猿免疫缺陷病毒免疫原或人类免疫缺陷病毒免疫原；或沙粒病毒免疫原，例如，拉沙热病毒免疫原；或痘病毒免疫原、黄病毒免疫原、丝状病毒免疫原、布尼亚病毒免疫原、冠状病毒免疫原或严重急性呼吸综合征免疫原。免疫原还可以是脊髓灰质炎免疫原、疱疹免疫原、腮腺炎免疫原、麻疹免疫原、风疹免疫原、白喉毒素或其它白喉免疫原、百日咳抗原、肝炎免疫原或本领域已知的任何其它疫苗免疫原。An immunogenic polypeptide or immunogen can be any polypeptide suitable for protecting a subject from disease, including but not limited to microbial, bacterial, protozoan, parasitic, fungal, and viral diseases. The immunogen may be a protomyxovirus immunogen, eg, influenza virus immunogen, influenza virus hemagglutinin surface protein or influenza virus nucleoprotein gene; or a lentiviral immunogen, eg, equine infectious anemia virus immunogen, simian immunodeficiency Viral immunogens or human immunodeficiency virus immunogens; or arenavirus immunogens, e.g., Lassa fever virus immunogens; or poxvirus immunogens, flavivirus immunogens, filovirus immunogens, bunyavirus immunogens , coronavirus immunogens or severe acute respiratory syndrome immunogens. The immunogen may also be a polio immunogen, herpes immunogen, mumps immunogen, measles immunogen, rubella immunogen, diphtheria toxin or other diphtheria immunogen, pertussis antigen, hepatitis immunogen or any other known in the art Vaccine immunogens.

或者，免疫原可以是任何肿瘤或癌细胞抗原。任选地，肿瘤或癌抗原表达于癌细胞表面。示例性癌症和肿瘤抗原包括但不限于：BRCA1基因产物、BRCA2基因产物、GP100、酪氨酸酶、GAGE-1/2、RAGE、NY-ESO-1、CDK-4、3-连环蛋白、MUM-1、caspase-8、HPVE、SART-1、PRAME、p15、黑色素瘤肿瘤抗原、HER-2/neu基因产物、雌激素受体、乳脂球蛋白、p53肿瘤抑制蛋白、粘蛋白抗原、端粒酶、核基质蛋白、前列腺酸磷酸酶、乳头瘤病毒抗原、以及与下列癌症包括黑色素瘤、腺癌、胸腺瘤、肉瘤、肺癌、肝癌、结直肠癌、非霍奇金淋巴瘤、霍奇金淋巴瘤、白血病、子宫癌、乳腺癌、前列腺癌、卵巢癌、宫颈癌、膀胱癌、肾癌、胰腺癌、脑癌、肾癌、胃癌、食管癌和头颈癌等相关的抗原。Alternatively, the immunogen can be any tumor or cancer cell antigen. Optionally, tumor or cancer antigens are expressed on the surface of cancer cells. Exemplary cancer and tumor antigens include, but are not limited to: BRCA1 gene product, BRCA2 gene product, GP100, tyrosinase, GAGE-1/2, RAGE, NY-ESO-1, CDK-4, 3-catenin, MUM -1, caspase-8, HPVE, SART-1, PRAME, p15, melanoma tumor antigen, HER-2/neu gene product, estrogen receptor, milk fat globulin, p53 tumor suppressor protein, mucin antigen, telomere Enzymes, nuclear matrix proteins, prostatic acid phosphatase, papilloma virus antigens, and related to the following cancers including melanoma, adenocarcinoma, thymoma, sarcoma, lung cancer, liver cancer, colorectal cancer, non-Hodgkin lymphoma, Hodgkin Antigens related to lymphoma, leukemia, uterine cancer, breast cancer, prostate cancer, ovarian cancer, cervical cancer, bladder cancer, kidney cancer, pancreatic cancer, brain cancer, kidney cancer, stomach cancer, esophageal cancer and head and neck cancer.

或者，异源核苷酸序列可以编码在体外或体内细胞中产生的任何多肽。例如，可将病毒载体引入培养细胞，并从中分离出表达的蛋白产物。Alternatively, the heterologous nucleotide sequence can encode any polypeptide produced in a cell in vitro or in vivo. For example, viral vectors can be introduced into cultured cells and the expressed protein product isolated therefrom.

本领域的技术人员理解，感兴趣的异源核酸可与适当的控制序列进行可操作性连接。例如，异源核酸可能与转录翻译控制信号、复制起源、多聚腺苷酸化信号、内核糖体进入位点、启动子、增强子等表达控制元件相连接。One of skill in the art understands that a heterologous nucleic acid of interest can be operably linked with appropriate control sequences. For example, a heterologous nucleic acid may be linked to expression control elements such as transcriptional translational control signals, origin of replication, polyadenylation signals, endosome entry sites, promoters, enhancers, and the like.

本领域的技术人员将进一步理解，根据所需的表达水平和组织特异性表达，可以使用各种启动子/增强子元件。启动子/增强子可能是组成型或诱导型，这取决于所需的表达模式。启动子/增强子可以是天然的或外来的，也可以是自然的或合成的序列。Those skilled in the art will further appreciate that various promoter/enhancer elements can be used depending on the desired level of expression and tissue specific expression. Promoters/enhancers may be constitutive or inducible, depending on the desired expression pattern. Promoters/enhancers can be natural or foreign, and can be natural or synthetic sequences.

启动子/增强子元件可以是靶细胞或受试者固有的，也可以是异源核酸序列固有的。通常选择启动子/增强子元件，使其在感兴趣的目标细胞中发挥作用。在代表性的实施例中，启动子/增强子元件是哺乳动物启动子/增强子元件，启动子/增强元件可以是组成型或可诱导型。The promoter/enhancer element can be native to the target cell or subject, or native to the heterologous nucleic acid sequence. Promoter/enhancer elements are typically selected to function in the target cell of interest. In representative embodiments, the promoter/enhancer element is a mammalian promoter/enhancer element, which may be constitutive or inducible.

诱导表达控制元件通常用于那些需要调节异源核酸序列过度表达的应用中。用于基因传递的可诱导启动子/增强子元件可以是组织特异性或组织优选的启动子/增强子元件，并且包括肌肉特异性或优选、神经组织特异性或优选、眼睛(包括视网膜特异性和角膜性)特异性或优选、肝脏特异性或优选、骨髓特异性或优选、胰腺特异性或优选、脾脏特异性或优选、肺特异性或优选。其它可诱导的启动子/增强子元件包括激素诱导和金属诱导元件。示例性诱导启动子/增强子元件包括但不限于Tet开/关元件、RU486诱导启动子、雷帕霉素诱导启动子和金属硫蛋白启动子。Inducible expression control elements are commonly used in applications that require regulation of overexpression of heterologous nucleic acid sequences. Inducible promoter/enhancer elements for gene delivery can be tissue-specific or tissue-preferred promoter/enhancer elements, and include muscle-specific or preferred, neural tissue-specific or preferred, ocular (including retina-specific) and corneal) specific or preferred, liver specific or preferred, bone marrow specific or preferred, pancreas specific or preferred, spleen specific or preferred, lung specific or preferred. Other inducible promoter/enhancer elements include hormone-inducible and metal-inducible elements. Exemplary inducible promoter/enhancer elements include, but are not limited to, Tet on/off elements, RU486 inducible promoters, rapamycin inducible promoters, and metallothionein promoters.

在将异源核酸序列转录并在目标细胞中翻译的实施例中，通常使用特定的起始信号来有效翻译插入的蛋白质编码序列。这些外源翻译控制序列，可能包括ATG起始密码子和相邻序列，可以起始的多种形式，包括天然的和合成的。In embodiments where a heterologous nucleic acid sequence is transcribed and translated in a target cell, a specific initiation signal is typically used to efficiently translate the inserted protein-coding sequence. These exogenous translational control sequences, possibly including the ATG initiation codon and adjacent sequences, can be initiated in a variety of forms, both natural and synthetic.

本发明提供包含一种嵌合AAV衣壳和一种AAV基因组的嵌合AAV粒子。本发明还提供了所述嵌合AAV颗粒的集合或库，其中所述集合或库包含2个或更多、10个或更多、50个或更多、10²个或更多、10³个或更多、10⁴或更多、10⁵或更多、或10⁶或更多不同的序列。The present invention provides chimeric AAV particles comprising a chimeric AAV capsid and an AAV genome. The present invention also provides a collection or library of said chimeric AAV particles, wherein said collection or library comprises ² or more, 10 or more, 50 or more, 102 or more, ¹⁰³ or more, 10 ⁴ or more, 10 ⁵ or more, or 10 ⁶ or more different sequences.

本发明还包括“空”衣壳粒子，其包含、组成或基本上由本发明的嵌合AAV衣壳蛋白组成，参见美国专利US5863541所述。本发明的嵌合AAV衣壳可用作“衣壳运载体”，可被共价连接、结合或包装并转移到细胞中的分子包括DNA、RNA、脂质、碳水化合物、多肽、小有机分子或这些分子的组合。此外，分子可以与病毒衣壳的外部相关联，以便将分子转移到宿主的靶细胞中。在本发明的一个实施例中，分子与衣壳蛋白共价连接。共价连接分子的方法已为本领域的技术人员所熟知。The present invention also includes "empty" capsid particles comprising, consisting of, or consisting essentially of the chimeric AAV capsid proteins of the present invention, as described in US Pat. No. 5,863,541. The chimeric AAV capsids of the invention can be used as "capsid carriers", molecules that can be covalently linked, bound or packaged and transferred into cells including DNA, RNA, lipids, carbohydrates, polypeptides, small organic molecules or a combination of these molecules. In addition, the molecule can be associated with the exterior of the viral capsid in order to transfer the molecule into the target cell of the host. In one embodiment of the invention, the molecule is covalently linked to the capsid protein. Methods of covalently linking molecules are well known to those skilled in the art.

本发明的病毒衣壳还可用于提高针对新的衣壳结构的抗体。作为另一种选择，可以将外源氨基酸序列插入病毒衣壳中，以便将抗原呈递给细胞，例如给受试者施用以产生对外源氨基酸序列的免疫反应。The viral capsids of the present invention can also be used to raise antibodies against novel capsid structures. Alternatively, exogenous amino acid sequences can be inserted into the viral capsid for antigen presentation to cells, eg, administration to a subject to generate an immune response to the exogenous amino acid sequence.

本发明还提供了编码本发明的嵌合衣壳蛋白的核酸。进一步提供包含所述核酸的载体和包含本发明核酸和/或载体的细胞。例如，所述核酸、载体和细胞可用做产生本文所述病毒载体的试剂。The present invention also provides nucleic acids encoding the chimeric capsid proteins of the present invention. Further provided are vectors comprising the nucleic acids and cells comprising the nucleic acids and/or vectors of the invention. For example, the nucleic acids, vectors, and cells can be used as reagents for producing the viral vectors described herein.

在示例性实施方案中，本发明提供了编码图3B、3D、3F、3H、3J、3L、3N、3P、或3R所示的AAV衣壳的核酸序列。代表性的核酸序列包含或基本上包含分别由图3A(L1)图3C(L4)、图3E(L10)、图3G(L52)、图3I(L58)、图3K(L84)、图3M(L37)、图3O(L107)或图3Q(L57)所示的核苷酸序列，或由这些图所示的核苷酸序列组成，或编码由上述任何一种核苷酸序列编码的AAV衣壳或衣壳蛋白的核苷酸序列，但由于密码子的简并性而与上述核苷酸序列不同，其允许不同的核酸序列编码相同的AAV衣壳。In an exemplary embodiment, the invention provides nucleic acid sequences encoding the AAV capsids shown in Figures 3B, 3D, 3F, 3H, 3J, 3L, 3N, 3P, or 3R. Representative nucleic acid sequences comprise or consist essentially of those shown in Figure 3A (L1) Figure 3C (L4), Figure 3E (L10), Figure 3G (L52), Figure 3I (L58), Figure 3K (L84), Figure 3M ( L37), Figure 3O (L107) or Figure 3Q (L57), or consist of the nucleotide sequences shown in these Figures, or encode an AAV coat encoded by any of the above nucleotide sequences Nucleotide sequence of a shell or capsid protein, but differs from the above-mentioned nucleotide sequence due to the degeneracy of codons, which allows different nucleic acid sequences to encode the same AAV capsid.

本发明还提供编码上述AAV衣壳蛋白的变体和融合蛋白的核酸。在具体实施例中，在本领域技术人员已知的标准条件下，核酸与本文特别公开的核酸序列的互补链杂交，编码变体衣壳蛋白。本文特别公开的核酸序列参见图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q，任选地，所述变体衣壳蛋白实质上保留由图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O或图3Q所示的核酸序列编码的衣壳蛋白的至少一种性质。例如，变体衣壳蛋白的病毒粒子可以基本上保留包含由如图3A、图3C、图3E、图3G、图3I、图3K、图3M、图3O、图3Q所示的核酸编码序列编码衣壳蛋白的病毒粒子的靶向性特征。这种序列的杂交可以在弱、中甚至严格的条件下进行(Sambrook等，1989，Molecular Cloning,A Laboratory Manual 2dEd，Cold Spring Harbor Laboratory)。The present invention also provides nucleic acids encoding variants and fusion proteins of the AAV capsid proteins described above. In specific embodiments, the nucleic acid hybridizes to the complementary strand of the nucleic acid sequence specifically disclosed herein, encoding a variant capsid protein, under standard conditions known to those of skill in the art. See Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, Figure 3M, Figure 3O, or Figure 3Q for nucleic acid sequences specifically disclosed herein, optionally, the variant capsid protein substantially retains the structure shown in Figure 3 At least one property of a capsid protein encoded by a nucleic acid sequence shown in 3A, 3C, 3E, 3G, 3I, 3K, 3M, 3O or 3Q. For example, a virion of a variant capsid protein may substantially retain a nucleic acid encoding sequence encoded by the nucleic acid shown in Figure 3A, Figure 3C, Figure 3E, Figure 3G, Figure 3I, Figure 3K, Figure 3M, Figure 3O, Figure 3Q Targeting characterization of virions with capsid proteins. Hybridization of such sequences can be performed under weak, moderate or even stringent conditions (Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual 2dEd, Cold Spring Harbor Laboratory).

如本领域所知，可使用许多不同的程序来确定核酸或多肽是否与已知序列具有一致性或相似性百分比。本文所用的百分比标识是指当使用BLASTN与其它核酸比对时，核酸或其片段与另一核酸具有指定的百分比，其中BLASTN程序可通过互联网从美国国家生物技术信息中心(NCBI)获取。As known in the art, many different programs can be used to determine whether a nucleic acid or polypeptide has percent identity or similarity to a known sequence. Percent designation as used herein means that a nucleic acid or fragment thereof has a specified percentage of another nucleic acid when aligned to other nucleic acids using BLASTN, the program available from the National Center for Biotechnology Information (NCBI) via the Internet.

当提及多肽时，百分比同一性或相似性表明，当与另一种蛋白质或其一部分在使用BLASTP测定的共同长度上进行比较时，所述多肽表现出特定百分比的同一性或相似性。这也可通过互联网从美国国家生物技术信息中心(NCBI)获取。多肽的一致性或相似性百分比通常用序列分析软件，例如，参见威斯康星大学生物技术中心遗传学计算机组的序列分析软件包。蛋白质分析软件使用各种替换、缺失和其它修饰的同源性来匹配类似的序列。保守的替换通常包括以下几类中的替换：甘氨酸、丙氨酸；缬氨酸、异亮氨酸、亮氨酸；天冬氨酸、谷氨酸；天冬酰胺、谷氨酰胺；丝氨酸、苏氨酸；赖氨酸、精氨酸；苯丙氨酸、酪氨酸。When referring to a polypeptide, percent identity or similarity indicates that the polypeptide exhibits a particular percent identity or similarity when compared to another protein, or a portion thereof, over a common length determined using BLASTP. This is also available from the National Center for Biotechnology Information (NCBI) over the Internet. Percent identity or similarity of polypeptides is typically determined using sequence analysis software, eg, see the sequence analysis software package of the Genetics Computer Group at the University of Wisconsin Biotechnology Center. Protein analysis software uses the homology of various substitutions, deletions and other modifications to match similar sequences. Conservative substitutions typically include substitutions in the following classes: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, Threonine; Lysine, Arginine; Phenylalanine, Tyrosine.

在特定实施方案中，核酸可以包含、组成或基本上包含但不限于质粒、噬菌体、病毒载体、细菌人工染色体或酵母人工染色体等载体。病毒载体包含但不限于腺相关病毒载体、腺病毒载体、疱疹病毒载体、杆状病毒载体或杂合病毒载体。In certain embodiments, the nucleic acid may comprise, consist of, or consist essentially of, but is not limited to, vectors such as plasmids, bacteriophages, viral vectors, bacterial artificial chromosomes, or yeast artificial chromosomes. Viral vectors include, but are not limited to, adeno-associated viral vectors, adenoviral vectors, herpes viral vectors, baculovirus vectors, or hybrid viral vectors.

在一些实施例中，编码嵌合AAV衣壳蛋白的核酸进一步包括AAV Rep蛋白编码序列。In some embodiments, the nucleic acid encoding the chimeric AAV capsid protein further comprises an AAV Rep protein coding sequence.

本发明还提供稳定地包含本发明核酸的细胞。例如，核酸可以稳定地转入细胞的基因组中，或者可以稳定地维持在附加体形式，例如，“基于EBV的核附加体”。The present invention also provides cells stably comprising the nucleic acids of the present invention. For example, nucleic acids can be stably transferred into the genome of a cell, or can be stably maintained in episomal form, eg, "EBV-based nuclear episomes."

核酸可插入递送载体中，例如，病毒递送载体。例如，本发明的核酸可以封装在AAV粒子、腺病毒粒子、疱疹病毒粒子、杆状病毒粒子或任何其它合适的病毒粒子中。The nucleic acid can be inserted into a delivery vector, eg, a viral delivery vector. For example, the nucleic acids of the invention can be encapsulated in AAV particles, adenovirus particles, herpes virus particles, baculovirus particles, or any other suitable virus particle.

此外，所述核酸可以与启动子元件可操作地连接。Furthermore, the nucleic acid can be operably linked to a promoter element.

本发明还提供了一种生产本发明病毒载体的方法。在一个有代表性的实施例中，本发明提供了一种生产重组病毒载体的方法，该方法包括在体外向细胞提供，包括异源核酸以及足以将AAV模板包装成病毒粒子的信号序列，例如，AAV末端重复序列；还包括足以将模板复制和包装成病毒粒子的AAV序列，例如AAV Rep序列，和编码本发明的AAV衣壳的序列。该方法还进一步包括从细胞中收集病毒粒子的步骤，病毒颗粒可以从培养基和/或裂解细胞中收集。The present invention also provides a method for producing the viral vector of the present invention. In a representative embodiment, the present invention provides a method of producing a recombinant viral vector comprising providing to a cell in vitro, a heterologous nucleic acid and a signal sequence sufficient to package an AAV template into a virion, e.g. , AAV terminal repeats; also include AAV sequences sufficient to replicate and package templates into virions, such as AAV Rep sequences, and sequences encoding the AAV capsids of the invention. The method further includes the step of collecting viral particles from the cells, the viral particles can be collected from the culture medium and/or lysed cells.

在一个说明性的实施方案中，本发明提供了一种制备包含AAV衣壳的重组AAV颗粒的方法，所述方法包括：向体外细胞提供编码本发明的嵌合AAV衣壳的核酸、AAV Rep编码序列、包含异源核酸的AAV载体基因组，和用于生产感染性AAV的辅助因子，允许AAV载体基因组包封于AAV衣壳中并完成AAV颗粒的组装。In an illustrative embodiment, the invention provides a method of making recombinant AAV particles comprising an AAV capsid, the method comprising: providing a nucleic acid encoding a chimeric AAV capsid of the invention, an AAV Rep, to an in vitro cell The coding sequence, the AAV vector genome comprising the heterologous nucleic acid, and the cofactors for the production of infectious AAV, allow the AAV vector genome to be encapsulated in the AAV capsid and complete the assembly of the AAV particle.

细胞通常是允许AAV病毒复制的细胞。可使用本领域已知的任何适宜细胞，包括但不限于HEK293细胞系、HEK293T细胞系、HEK293A细胞系、HEK293S细胞系、HEK293FT细胞系、HEK293F细胞系、HEK293H细胞系、HeLa细胞系、SF9细胞系、SF21细胞系、SF900细胞系、BHK细胞系中的一种或几种。The cells are typically cells that allow the AAV virus to replicate. Any suitable cell known in the art can be used, including but not limited to HEK293 cell line, HEK293T cell line, HEK293A cell line, HEK293S cell line, HEK293FT cell line, HEK293F cell line, HEK293H cell line, HeLa cell line, SF9 cell line , one or more of SF21 cell line, SF900 cell line and BHK cell line.

AAV复制和衣壳序列可以通过本领域已知的任何方法提供。目前的方法通常在单个质粒上表达AAV rep和cap基因。AAV复制和包装序列不需要一起提供。AAV rep和/或cap基因序列可由任何病毒或非病毒载体提供。例如，rep和/或cap基因序列可由杂交腺病毒或疱疹病毒载体提供。EBV载体也可用于表达AAVcap和/或rep基因序列。作为另一种选择，rep和/或cap基因序列可在细胞内稳定存在，处于游离或整合状态。AAV replication and capsid sequences can be provided by any method known in the art. Current methods typically express AAV rep and cap genes on a single plasmid. AAV replication and packaging sequences do not need to be provided together. AAV rep and/or cap gene sequences can be provided by any viral or non-viral vector. For example, rep and/or cap gene sequences can be provided by hybrid adenovirus or herpes virus vectors. EBV vectors can also be used to express AAVcap and/or rep gene sequences. Alternatively, the rep and/or cap gene sequences can be stably present in the cell in an episomal or integrated state.

通常AAV rep和/或cap基因序列不会被AAV包装序列所包围，以防止对这些序列进行救援和/或包装。Typically AAV rep and/or cap gene sequences are not surrounded by AAV packaging sequences to prevent rescue and/or packaging of these sequences.

模板或载体基因组可以使用本领域已知的任何方法提供给细胞。模板或载体基因组可由非病毒载体或病毒载体提供。在具体实施例中，模板或载体基因组由疱疹病毒或腺病毒载体提供。杆状病毒载体、EBV载体也可用于传递模板或载体基因组。在另一个代表性实施例中，模板或载体基因组由复制的rAAV病毒提供。在其它实施例中，AAV原病毒稳定地整合到细胞的染色体中。Template or vector genomes can be provided to cells using any method known in the art. Template or vector genomes can be provided by non-viral or viral vectors. In specific embodiments, the template or vector genome is provided by a herpes virus or adenovirus vector. Baculovirus vectors, EBV vectors can also be used to deliver template or vector genomes. In another representative embodiment, the template or vector genome is provided by a replicating rAAV virus. In other embodiments, the AAV provirus is stably integrated into the chromosome of the cell.

为了获得最大的病毒滴度，通常向细胞提供辅助病毒，例如腺病毒或疱疹病毒，这些辅助病毒是产生感染性AAV所必需的。本领域已知的用于AAV复制所必需的辅助病毒序列，通常由辅助腺病毒或疱疹病毒载体提供。可选择地，腺病毒或疱疹病毒序列可由另一非病毒或病毒载体提供(Ferrari等，1997，Nature Med.3:1295)。此外，辅助病毒功能可由辅助基因整合在包装细胞的染色体中来提供或作为稳定的染色体外元件来维持。To obtain maximal viral titers, cells are typically provided with helper viruses, such as adenovirus or herpes virus, which are required for the production of infectious AAV. Helper viral sequences known in the art to be necessary for AAV replication are typically provided by helper adenovirus or herpesvirus vectors. Alternatively, the adenovirus or herpesvirus sequence may be provided by another non-viral or viral vector (Ferrari et al., 1997, Nature Med. 3:1295). In addition, helper virus function can be provided by the integration of helper genes into the chromosome of the packaging cell or maintained as a stable extrachromosomal element.

本领域技术人员理解，在单个辅助构建体上提供AAV复制和衣壳序列以及辅助病毒序列可能是有利的。该辅助构建体可以是非病毒或病毒构建体，也可以选择是包含AAVrep/cap基因序列的杂交腺病毒或杂交疱疹病毒。Those skilled in the art understand that it may be advantageous to provide the AAV replication and capsid sequences and the helper virus sequences on a single helper construct. The helper construct may be a non-viral or viral construct, or may be selected to be a hybrid adenovirus or a hybrid herpes virus comprising the AAVrep/cap gene sequence.

在一个具体实施例中，AAV rep和/或cap基因序列和腺病毒辅助序列由单个腺病毒辅助载体提供。这个载体进一步包含rAAV基因组模板。可将AAV rep和/或cap序列和/或rAAV模板插入腺病毒的删除区域，例如，Ela或E3区域。In a specific embodiment, the AAV rep and/or cap gene sequences and adenoviral helper sequences are provided by a single adenoviral helper vector. This vector further contains the rAAV genomic template. AAV rep and/or cap sequences and/or rAAV templates can be inserted into deletion regions of the adenovirus, eg, the Ela or E3 regions.

在另一个实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒辅助载体提供。rAAV基因组模板由质粒提供。在另一个说明性实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒辅助载体提供，并且rAAV基因组模板作为前体整合到细胞中。或者，rAAV模板由作为染色体外元件维持在细胞内的EBV载体提供。在另一个示例性实施例中，AAV rep和/或cap序列和腺病毒辅助序列由单个腺病毒载体提供。rAAV基因组模板作为单独的复制病毒载体提供。例如，rAAV模板可以由rAAV粒子或第二种重组腺病毒粒子提供。In another embodiment, the AAV rep and/or cap sequences and adenoviral helper sequences are provided by a single adenoviral helper vector. The rAAV genome template is provided by the plasmid. In another illustrative embodiment, the AAV rep and/or cap sequences and adenoviral helper sequences are provided by a single adenoviral helper vector, and the rAAV genome template is integrated into the cell as a precursor. Alternatively, the rAAV template is provided by an EBV vector maintained within the cell as an extrachromosomal element. In another exemplary embodiment, the AAV rep and/or cap sequences and adenoviral helper sequences are provided by a single adenoviral vector. The rAAV genome template is provided as a separate replicating viral vector. For example, the rAAV template can be provided by rAAV particles or a second recombinant adenoviral particle.

疱疹病毒也可用作AAV包装方法中的辅助病毒。Herpes viruses can also be used as helper viruses in AAV packaging methods.

作为另一种替代方法，本发明的病毒载体可在昆虫细胞中使用杆状病毒载体来传递rep和/或cap基因和rAAV模板(Urabe等，2002，Human Gene Therapy 13:1935-1943)。As another alternative, the viral vectors of the present invention can use baculovirus vectors to deliver rep and/or cap genes and rAAV templates in insect cells (Urabe et al., 2002, Human Gene Therapy 13:1935-1943).

生产AAV的其它方法也可以使用稳定转化的包装细胞(参见美国专US5658785)。Other methods of producing AAV may also use stably transformed packaging cells (see US Pat. No. 5,658,785).

无辅助病毒污染的AAV载体可通过本领域已知的任何方法获得。例如，AAV和辅助病毒可以很容易地根据大小进行区分。也可根据对肝素底物的亲和力将AAV与辅助病毒分离。在代表性实施例中，使用复制缺陷辅助病毒，使得任何污染的辅助病毒都不能复制。作为另一种选择，可以使用缺乏晚期基因表达的辅助腺病毒，因为只有腺病毒早期基因表达介导AAV病毒的包装。本领域已知对晚期基因表达有缺陷的腺病毒突变体，例如，TS100K和TS149腺病毒突变体。AAV vectors free of helper virus contamination can be obtained by any method known in the art. For example, AAV and helper viruses can be easily distinguished by size. AAVs can also be separated from helper viruses based on their affinity for the heparin substrate. In a representative embodiment, a replication-deficient helper virus is used such that any contaminating helper virus cannot replicate. Alternatively, helper adenoviruses that lack late gene expression can be used, since only adenoviral early gene expression mediates packaging of AAV viruses. Adenovirus mutants deficient in late gene expression are known in the art, eg, TS100K and TS149 adenovirus mutants.

本发明的包装方法可用于生产高滴度病毒粒子。在具体实施例中，病毒储备液的滴度至少为约10⁵Tu/ml、至少约10⁶Tu/ml、至少约10⁷Tu/ml、至少约10⁸Tu/ml、至少约10⁹Tu/ml、至少约10¹⁰Tu/ml。The packaging method of the present invention can be used to produce high titer virus particles. In specific embodiments, the viral stock has a titer of at least about 10 ⁵ Tu/ml, at least about 10 ⁶ Tu/ml, at least about 10 ⁷ Tu/ml, at least about 10 ⁸ Tu/ml, at least about 10 ⁹ Tu /ml, at least about 10 ¹⁰ Tu/ml.

新型的衣壳蛋白和衣壳结构可用于产生抗体，例如用于诊断或治疗用途或用做研究试剂。因此，本发明还提供了针对本发明的新型衣壳蛋白的抗体。Novel capsid proteins and capsid structures can be used to generate antibodies, eg, for diagnostic or therapeutic use or as research reagents. Accordingly, the present invention also provides antibodies against the novel capsid proteins of the present invention.

本文所用术语“抗体”或“抗体片段”是指所有类型的免疫球蛋白，包括IgG、IgM、IgA、IgD和IgE。抗体可为单克隆或多克隆，且可为来源于任何物种，包括小鼠、大鼠、兔、马、山羊、绵羊、鸡、猴、羊驼或人类，或可为嵌合抗体、人源化抗体、人抗体。抗体可以是重组单克隆抗体、或从噬菌体库、酵母库、哺乳细胞展示库中筛选。The term "antibody" or "antibody fragment" as used herein refers to all types of immunoglobulins, including IgG, IgM, IgA, IgD and IgE. Antibodies may be monoclonal or polyclonal, and may be derived from any species, including mouse, rat, rabbit, horse, goat, sheep, chicken, monkey, alpaca, or human, or may be chimeric, human Antibodies, human antibodies. Antibodies can be recombinant monoclonal antibodies, or screened from phage libraries, yeast libraries, mammalian cell display libraries.

包括在本发明范围内的抗体片段包括Fab、F(ab’)₂和Fc片段，以及从除IgG以外的抗体获得的相应片段。这种片段可以通过已知的技术产生。例如，F(ab’)₂片段可由抗体分子通过胃蛋白酶消化产生，Fab片段可通过减少F(ab’)₂片段的二硫键生成。或者，可以构建Fab表达库，以便快速、容易地识别具有所需特异性的单克隆Fab片段。Antibody fragments included within the scope of the present invention include Fab, F(ab') ₂ and Fc fragments, as well as corresponding fragments obtained from antibodies other than IgG. Such fragments can be produced by known techniques. For example, F(ab') ₂ fragments can be produced by pepsin digestion of antibody molecules, and Fab fragments can be produced by reducing the disulfide bonds of F(ab') ₂ fragments. Alternatively, Fab expression libraries can be constructed for quick and easy identification of monoclonal Fab fragments with the desired specificity.

多克隆抗体可以通过病毒免疫合适的动物，例如兔、山羊等，从动物收集免疫血清，并从免疫血清中分离所得。Polyclonal antibodies can be obtained by immunizing suitable animals, such as rabbits, goats, etc., with viruses, collecting immune serum from the animals, and separating them from the immune serum.

本发明还包括将异源核苷酸序列传递至广泛细胞范围的方法，包括分裂细胞和非分裂细胞。本发明的病毒载体可用于向体外细胞递送感兴趣的核苷酸序列，例如，在体外产生多肽或用于体外基因治疗。载体还可用于将核苷酸序列传递给需要个体的方法中，例如，表达免疫原性或治疗性多肽。The present invention also includes methods of delivering heterologous nucleotide sequences to a wide range of cells, including dividing and non-dividing cells. The viral vectors of the present invention can be used to deliver nucleotide sequences of interest to cells in vitro, eg, to produce polypeptides in vitro or for gene therapy in vitro. Vectors can also be used in methods of delivering nucleotide sequences to an individual in need thereof, eg, expressing immunogenic or therapeutic polypeptides.

一般而言，本发明病毒载体可用于递送具有生物效应的任何外源核酸以治疗或改善与基因表达相关的任何疾病。此外，本发明可用于治疗递送治疗性多肽可改善的任何疾病。示例性疾病症状包括但不限于：囊性纤维化(囊性纤维化跨膜调节蛋白)和肺部其它疾病、血友病A(因子VIII)、血友病B(因子IX)、地中海贫血(β-珠蛋白)、贫血(促红细胞生成素)和其它血液疾病、老年痴呆症(GDF)、多发性硬化症(β-干扰素)、帕金森氏病(神经胶质细胞源性神经营养因子)、亨廷顿氏病(消除重复的抑制性RNA，包括但不限于RNAi，如sirna或shrna、反义RNA或microRNA)、肌萎缩侧索硬化症、癫痫(甘丙肽、神经营养因子)和其它神经疾病、癌症(内皮抑素、血管抑素、TRAIL、FAS配体、包括干扰素的细胞因子、抑制VEGF的抑制性RNA包括但不限于siRNA、shRNA、反义RNA、microRNA、多重耐药基因产物、癌症免疫原)、糖尿病(胰岛素、PGC-α1、GLP-1、肌抑素前体肽、葡萄糖转运蛋白)、肌营养不良包括杜氏肌营养不良和贝克肌营养不良(例如肌萎缩蛋白、mini-萎缩蛋白、micro-肌萎缩蛋白、胰岛素样生长因子等)、糖原贮积病例如Fabry病(α-半乳糖苷酶)和Pompe病(溶酶体酸α-葡萄糖苷酶)、先天性肺气肿(抗胰蛋白酶)、Lesch-Nyhan综合征(次黄嘌呤鸟嘌呤磷酸核糖基转移酶)、Niemann-Pick病(神经鞘磷脂酶)、视网膜退行性疾病以及眼睛和视网膜的其它疾病(治疗黄斑变性的PDGF、内皮抑素和/或血管抑素)、星形细胞瘤(内皮抑素、血管抑素和/或抑制VEGF的RNAi)、胶质母细胞瘤(内皮生长因子、血管内皮生长因子和/或RNAi抗血管内皮生长因子)、肝脏(用于乙肝和/或丙肝基因的RNAi，例如siRNA或shRNA、microRNA或反义RNA)、充血性心力衰竭或外周动脉疾病(磷酸酶蛋白抑制剂I、磷脂酶、肌浆内网Ca2-ATPase、调节磷脂酶基因的锌指蛋白、磷脂酶抑制剂等)、关节炎(胰岛素样生长因子)、艾滋病(可溶性CD4)、肌肉萎缩(胰岛素样生长因子I、肌生长抑制素前肽、抗凋亡因子等)、肢体缺血(VEGF、FGF、PGC-Iα、EC-SOD、HIF)、肾虚(促红细胞生成素)、关节炎(IRAP和TNFα等可溶性受体等抗炎因子)、肝炎(α-干扰素)、低密度脂蛋白受体缺乏(LDL受体)、高氨血症(鸟氨酸转氨酶)、苯丙酮尿症(苯丙氨酸羟化酶)、自身免疫性疾病等。In general, the viral vectors of the present invention can be used to deliver any exogenous nucleic acid that has a biological effect to treat or ameliorate any disease associated with gene expression. Furthermore, the present invention can be used to treat any disease that can be ameliorated by the delivery of a therapeutic polypeptide. Exemplary disease symptoms include, but are not limited to: cystic fibrosis (cystic fibrosis transmembrane regulatory protein) and other diseases of the lung, hemophilia A (factor VIII), hemophilia B (factor IX), thalassemia ( beta-globin), anemia (erythropoietin) and other blood disorders, Alzheimer's disease (GDF), multiple sclerosis (beta-interferon), Parkinson's disease (glial cell-derived neurotrophic factor) ), Huntington's disease (repeated inhibitory RNAs including but not limited to RNAi such as siRNA or shRNA, antisense RNA or microRNA), amyotrophic lateral sclerosis, epilepsy (galanin, neurotrophic factor) and others Neurological diseases, cancer (endostatin, angiostatin, TRAIL, FAS ligands, cytokines including interferons, inhibitory RNAs that inhibit VEGF including but not limited to siRNA, shRNA, antisense RNA, microRNA, multidrug resistance genes products, cancer immunogens), diabetes (insulin, PGC-α1, GLP-1, myostatin precursor peptide, glucose transporter), muscular dystrophies including Duchenne muscular dystrophy and Baker muscular dystrophy (e.g. dystrophin, mini-trophin, micro-dystrophin, insulin-like growth factor, etc.), glycogen storage diseases such as Fabry disease (alpha-galactosidase) and Pompe disease (lysosomal acid alpha-glucosidase), congenital Emphysema (antitrypsin), Lesch-Nyhan syndrome (hypoxanthine guanine phosphoribosyltransferase), Niemann-Pick disease (sphingomyelinase), retinal degenerative diseases, and other diseases of the eye and retina (PDGF, endostatin and/or angiostatin for macular degeneration), astrocytoma (endostatin, angiostatin and/or VEGF-inhibiting RNAi), glioblastoma (endothelial growth factor, angiostatin) Endothelial growth factor and/or RNAi anti-vascular endothelial growth factor), liver (RNAi for hepatitis B and/or hepatitis C genes, such as siRNA or shRNA, microRNA or antisense RNA), congestive heart failure or peripheral arterial disease (phosphatase Protein inhibitor I, phospholipase, sarcoplasmic reticulum Ca2-ATPase, zinc finger proteins that regulate phospholipase genes, phospholipase inhibitors, etc.), arthritis (insulin-like growth factor), AIDS (soluble CD4), muscle atrophy ( Insulin-like growth factor I, myostatin propeptide, anti-apoptotic factors, etc.), limb ischemia (VEGF, FGF, PGC-Iα, EC-SOD, HIF), kidney deficiency (erythropoietin), arthritis ( Anti-inflammatory factors such as soluble receptors such as IRAP and TNFα), hepatitis (α-interferon), low density lipoprotein receptor deficiency (LDL receptor), hyperammonemia (ornithine aminotransferase), phenylketonuria ( phenylalanine hydroxylase), autoimmune diseases, etc.

本发明还可在器官移植后用于增加移植的成功率和/或减少器官移植或辅助疗法的副作用，例如，通过给予免疫抑制剂或抑制性核酸来阻断细胞因子产物。The present invention may also be used after organ transplantation to increase the success rate of transplantation and/or reduce the side effects of organ transplantation or adjunctive therapy, eg, by administering immunosuppressive or inhibitory nucleic acids to block cytokine production.

基因转移在认识和治疗疾病方面有着巨大的潜在用途。有许多遗传性疾病中有缺陷的基因是已知的，并且已经被克隆。一般来说，上述疾病状态分为两类：第一为缺陷状态，通常为酶缺陷，通常以隐性方式遗传；第二为不平衡状态，可能涉及调节蛋白或结构蛋白，通常以显性方式遗传。对于缺陷状态疾病，基因转移可将正常基因带到受影响的组织中进行替代治疗，以及使用抑制性RNA为疾病创建动物模型，抑制性RNA包括siRNA或shRNA、microRNA或反义RNA。对于不平衡的疾病状态，基因转移可用于在模型系统中创建疾病状态，然后可用于治疗疾病状态。因此，根据本发明的病毒载体允许治疗遗传疾病。如本文所用，通过部分或全部补救疾病或使疾病更严重的缺陷或失衡来治疗疾病状态。Gene transfer has enormous potential use in understanding and treating disease. There are many genetic diseases where defective genes are known and have been cloned. Generally speaking, the above-mentioned disease states are divided into two categories: the first is a deficient state, usually an enzyme deficiency, and is usually inherited in a recessive manner; the second is an unbalanced state, which may involve regulatory or structural proteins, usually in a dominant manner. hereditary. For deficient state diseases, gene transfer can bring normal genes into affected tissues for replacement therapy, as well as create animal models of the disease using inhibitory RNAs, including siRNA or shRNA, microRNA or antisense RNA. For imbalanced disease states, gene transfer can be used to create disease states in model systems, which can then be used to treat disease states. Thus, the viral vectors according to the present invention allow the treatment of genetic diseases. As used herein, a disease state is treated by partially or fully remedying a disease or a deficiency or imbalance that makes the disease worse.

此外，本发明的病毒载体在诊断和筛选方法中有进一步用途，其中感兴趣的基因在细胞培养系统或转基因动物模型中短暂或稳定地表达。本发明还可用于递送核酸以用于蛋白质生产，例如用于实验室、工业或商业目的。In addition, the viral vectors of the present invention have further utility in diagnostic and screening methods in which a gene of interest is transiently or stably expressed in cell culture systems or transgenic animal models. The present invention can also be used to deliver nucleic acids for protein production, eg, for laboratory, industrial or commercial purposes.

或者，可将病毒载体投予细胞并且将改变的细胞投予个体。将异源核酸引入细胞，并将细胞给予受试者，其中任选地表达编码免疫原的异源核酸并诱导受试者对免疫原的免疫反应。在具体实施例中，所述细胞是抗原呈递细胞，例如树突细胞。Alternatively, viral vectors can be administered to cells and the altered cells administered to an individual. The heterologous nucleic acid is introduced into a cell, and the cell is administered to a subject, wherein the heterologous nucleic acid encoding the immunogen is optionally expressed and an immune response to the immunogen is induced in the subject. In specific embodiments, the cells are antigen presenting cells, such as dendritic cells.

“主动免疫反应”或“主动免疫”的特征是“与免疫原接触后宿主组织和细胞的参与”。它涉及到淋巴组织中免疫调节细胞的分化和增殖，从而导致抗体的合成或细胞介导的反应，或两者兼而有之。或者，宿主通过感染或接种疫苗暴露于免疫原后，会产生积极的免疫反应。主动免疫可与被动免疫形成对比，后者是通过“将预先形成的物质，例如抗体、转移因子、胸腺移植物、白细胞介素-2，从主动免疫宿主转移到非免疫宿主”获得的。An "active immune response" or "active immunity" is characterized by "the involvement of host tissues and cells following exposure to an immunogen." It involves the differentiation and proliferation of immunoregulatory cells in lymphoid tissue, leading to the synthesis of antibodies or a cell-mediated response, or both. Alternatively, after the host is exposed to the immunogen through infection or vaccination, a positive immune response occurs. Active immunity can be contrasted with passive immunity, which is obtained by "transferring preformed substances, such as antibodies, transfer factors, thymic grafts, interleukin-2, from an actively immunized host to a non-immunized host".

本文所用的“保护性”免疫反应或“保护性”免疫表明，免疫反应给予受试者一些益处，因为它可以预防或降低疾病的发生率。或者，保护性免疫反应或保护性免疫可用于治疗疾病，特别是癌症或肿瘤，例如，引起癌症或肿瘤的消退和/或防止转移和/或防止转移结节的生长。只要治疗的益处大于缺点，保护作用可以是完全的或部分的。A "protective" immune response or "protective" immunity, as used herein, indicates that an immune response confers some benefit to a subject because it prevents or reduces the incidence of disease. Alternatively, a protective immune response or protective immunity can be used to treat a disease, particularly a cancer or tumor, eg, cause regression of the cancer or tumor and/or prevent metastasis and/or prevent the growth of metastatic nodules. Protection may be complete or partial as long as the benefits of the treatment outweigh the disadvantages.

本发明病毒载体也可通过投予癌细胞抗原或免疫相似分子或任何其它免疫原来用于癌症免疫治疗，对癌细胞产生免疫反应。举例而言，在治疗癌症患者时，可通过投予包含编码癌细胞抗原异源核苷酸序列的病毒载体来产生针对个体中癌细胞抗原的免疫反应。如本文所述，病毒载体可在体外或通过活体外方法投予个体。The viral vectors of the present invention can also be used in cancer immunotherapy by administering cancer cell antigens or immunosimilar molecules or any other immunogen to generate an immune response against cancer cells. For example, in treating a cancer patient, an immune response against a cancer cell antigen in the individual can be generated by administering a viral vector comprising a heterologous nucleotide sequence encoding the cancer cell antigen. As described herein, viral vectors can be administered to an individual in vitro or by in vitro methods.

如本文所用，“癌症”一词包括形成肿瘤的癌症。同样，“癌组织”一词也包括肿瘤。“癌细胞抗原”包括肿瘤抗原。As used herein, the term "cancer" includes tumor-forming cancers. Likewise, the term "cancerous tissue" includes tumors. "Cancer cell antigen" includes tumor antigens.

术语“癌症”在本领域中具有其理解的含义，例如，具有扩散或转移到身体遥远部位的不受控制的组织生长。示例性癌症包括但不限于白血病、淋巴瘤、结直肠癌、肾癌、肝癌、乳腺癌、肺癌、前列腺癌、睾丸癌、卵巢癌、子宫癌、宫颈癌、脑癌、骨癌、肉瘤、黑色素瘤、头颈部癌、食管癌、甲状腺癌等。在本发明实施例中，本发明实施于治疗和/或预防肿瘤形成癌症。The term "cancer" has its meaning as understood in the art, eg, having uncontrolled tissue growth that has spread or metastasized to distant parts of the body. Exemplary cancers include, but are not limited to, leukemia, lymphoma, colorectal cancer, kidney cancer, liver cancer, breast cancer, lung cancer, prostate cancer, testicular cancer, ovarian cancer, uterine cancer, cervical cancer, brain cancer, bone cancer, sarcoma, melanoma cancer, head and neck cancer, esophageal cancer, thyroid cancer, etc. In embodiments of the present invention, the present invention is practiced in the treatment and/or prevention of tumor-forming cancers.

癌细胞抗原已在上文中进行了描述。术语“治疗癌症”，旨在降低癌症的严重程度，预防或至少部分消除癌症。例如，在特定情况下，这些术语表明癌症的治疗是预防或减少的，或至少部分消除的。在又一代表性实施例中，这些术语表明阻止或减少或至少部分地消除转移结节的生长，例如，在外科切除原发肿瘤后。根据“预防癌症”术语，旨在至少部分地消除或减少癌症的发生率或发病率。或者说，受试者的癌症发病或进展可能被减缓、控制、可能性或概率降低或延迟。Cancer cell antigens have been described above. The term "treating cancer" is intended to reduce the severity of cancer, prevent or at least partially eliminate cancer. For example, in certain circumstances, these terms indicate that the treatment of cancer is to prevent or reduce, or at least partially eliminate. In yet another representative embodiment, these terms indicate preventing or reducing or at least partially eliminating the growth of metastatic nodules, eg, following surgical resection of the primary tumor. According to the term "cancer prevention", it is intended to at least partially eliminate or reduce the incidence or incidence of cancer. Alternatively, the onset or progression of the subject's cancer may be slowed, controlled, less likely or likely, or delayed.

在具体实施例中，细胞可从患有癌症的个体中移出并与本发明的病毒载体接触。然后将修饰后的细胞给予受试者，由此引发对癌细胞抗原的免疫反应。这种方法特别适用于免疫功能低下的受试者，这些受试者在体内不能产生足够的免疫反应，即不能产生足够数量的增强抗体。In specific embodiments, cells can be removed from an individual with cancer and contacted with a viral vector of the invention. The modified cells are then administered to a subject, thereby eliciting an immune response to cancer cell antigens. This approach is particularly useful in immunocompromised subjects who do not develop an adequate immune response in the body, i.e. do not produce sufficient numbers of boosting antibodies.

本领域已知免疫调节性细胞因子，例如α-干扰素、β-干扰素、γ-干扰素、ω-干扰素、τ-干扰素、白细胞介素-lα、白细胞介素-1β、白细胞介素-2、白细胞介素-3、白细胞介素-4、白细胞介素5、白细胞介素-6、白细胞介素-7、白细胞介素-8，白细胞介素-9，白细胞介素-10，白细胞介素-11，白细胞介素12，白细胞介素-13，白细胞介素-14，白细胞介素-18，B细胞生长因子，CD40配体，肿瘤坏死因子-α，肿瘤坏死因子-β，单核细胞趋化蛋白-1，粒细胞-巨噬细胞集落刺激因子，和淋巴毒素，和免疫调节性细胞因子，例如CTL诱导性细胞因子，可与病毒载体一起给予受试者。Immunomodulatory cytokines are known in the art, such as alpha-interferon, beta-interferon, gamma-interferon, omega-interferon, tau-interferon, interleukin-1α, interleukin-1β, interleukin IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10 , interleukin-11, interleukin-12, interleukin-13, interleukin-14, interleukin-18, B cell growth factor, CD40 ligand, tumor necrosis factor-alpha, tumor necrosis factor-beta , monocyte chemoattractant protein-1, granulocyte-macrophage colony-stimulating factor, and lymphotoxin, and immunomodulatory cytokines, such as CTL-inducible cytokines, can be administered to a subject with a viral vector.

细胞因子可以通过本领域已知的任何方法进行注射。可以将外源性细胞因子注射到受试者体内，或者使用合适的载体将编码细胞因子的核苷酸序列传递给受试者，并在体内产生细胞因子。Cytokines can be injected by any method known in the art. Exogenous cytokines can be injected into a subject, or a nucleotide sequence encoding the cytokine can be delivered to the subject using a suitable vector, and the cytokine can be produced in vivo.

根据本发明的重组病毒载体可在兽医和医学中应用。适合的对象包括鸟类和哺乳动物。本文所用术语“鸟类”包括但不限于鸡、鸭、鹅、鹌鹑、火鸡、野鸡、鹦鹉。本文所用术语“哺乳动物”包括但不限于人类、灵长类、非人类灵长类、牛、绵羊、山羊、猪、马、猫、狗、兔子、啮齿动物等。人类受试者包括新生儿、婴儿、少年和成人。任选地，所述“需要”本发明方法的个体，例如，因为所述个体具有或被认为具有包括本发明所述的疾病风险，或者将从包括本发明所述的核酸递送中受益。作为进一步的选择，受试者可以是实验室动物和/或疾病的动物模型。The recombinant viral vectors according to the present invention can be used in veterinary and medical applications. Suitable subjects include birds and mammals. The term "bird" as used herein includes, but is not limited to, chickens, ducks, geese, quail, turkeys, pheasants, parrots. The term "mammal" as used herein includes, but is not limited to, humans, primates, non-human primates, cattle, sheep, goats, pigs, horses, cats, dogs, rabbits, rodents, and the like. Human subjects include neonates, infants, juveniles and adults. Optionally, the individual "in need" of the methods of the invention, eg, because the individual has or is believed to be at risk for, or would benefit from, the delivery of nucleic acids including the invention. As a further option, the subject can be a laboratory animal and/or an animal model of disease.

在具体实施例中，本发明提供包含本发明病毒载体的药物组合物，所述病毒载体位于药学上可接受的载体中，并且任选的包括其它药剂、稳定剂、缓冲剂、载体、佐剂、稀释剂等，用于注射的载体通常是液体。对于其它的方法，运输载体既可以是固体的，也可以是液体的。吸入给药时，载体将是可呼吸的，最好是固体或液体微粒形式。In particular embodiments, the present invention provides pharmaceutical compositions comprising a viral vector of the present invention in a pharmaceutically acceptable carrier and optionally including other agents, stabilizers, buffers, carriers, adjuvants , diluents, etc., and the carrier for injection is usually a liquid. For other methods, the transport vehicle can be either solid or liquid. For administration by inhalation, the carrier will be respirable, preferably in the form of solid or liquid particulates.

“药学上可接受”是指一种无毒或无其它不需要特性的材料，即，该材料可以给受试者服用而不会产生任何不需要的生物效应。"Pharmaceutically acceptable" refers to a material that is non-toxic or free of other undesirable properties, ie, the material can be administered to a subject without producing any undesirable biological effects.

本发明的一个方面是在体外将核苷酸序列转移到细胞的方法。病毒载体可根据适合特定靶细胞的标准转导方法以适当的感染倍数导入细胞。要施用的病毒载体或衣壳的滴度可根据目标细胞类型和数量以及特定病毒载体或衣壳而变化。在具体实施例中，将至少约10³感染单位、更优选地至少约10⁵感染单位导入细胞。One aspect of the invention is a method of transferring a nucleotide sequence into a cell in vitro. Viral vectors can be introduced into cells at appropriate infection folds according to standard transduction methods appropriate to the particular target cell. The titer of the viral vector or capsid to be administered can vary depending on the target cell type and quantity and the particular viral vector or capsid. In specific embodiments, at least about ¹⁰³ infectious units, more preferably at least about ¹⁰⁵ infectious units, are introduced into the cells.

要导入病毒载体的细胞可以是任何类型的，包括但不限于神经细胞，包括外周和中枢神经系统的细胞，特别是脑细胞，例如神经元、寡树突细胞、神经胶质细胞、星形胶质细胞，肺细胞、包括视网膜细胞、视网膜色素上皮和角膜细胞的眼部细胞、包括肠和呼吸上皮细胞的上皮细胞、包括成肌细胞，肌管和肌纤维的骨骼肌细胞、膈肌细胞、树突细胞、包括胰岛细胞的胰腺细胞、肝细胞、包括平滑肌细胞和上皮细胞的胃肠道细胞、包括心肌细胞的心脏细胞、骨髓细胞、造血干细胞、脾细胞、角质细胞、成纤维细胞、内皮细胞、前列腺细胞、包括例如软骨、半月板、滑膜和骨髓的关节细胞、生殖细胞等。作为另一种选择，细胞可以是干细胞，例如，神经干细胞、肝干细胞。作为另一种选择，细胞可以是癌症或肿瘤细胞，如上所述的癌症和肿瘤。此外，如上所述，细胞可以来自任何物种的来源。The cells into which the viral vector is to be introduced may be of any type, including but not limited to neural cells, including cells of the peripheral and central nervous systems, especially brain cells, such as neurons, oligodendritic cells, glial cells, astrocytes Plasma cells, lung cells, ocular cells including retinal cells, retinal pigment epithelium and corneal cells, epithelial cells including intestinal and respiratory epithelial cells, skeletal muscle cells including myoblasts, myotubes and muscle fibers, diaphragmatic muscle cells, dendrites cells, pancreatic cells including islet cells, hepatocytes, gastrointestinal cells including smooth muscle cells and epithelial cells, cardiac cells including cardiomyocytes, bone marrow cells, hematopoietic stem cells, spleen cells, keratinocytes, fibroblasts, endothelial cells, Prostate cells, including, for example, cartilage, meniscus, synovium, and bone marrow joint cells, germ cells, and the like. Alternatively, the cells may be stem cells, eg, neural stem cells, hepatic stem cells. Alternatively, the cells may be cancer or tumor cells, as described above. Furthermore, as mentioned above, the cells can be derived from any species of origin.

可将病毒载体引入离体细胞以向个体投予经修饰的细胞。在具体实施例中，将细胞从个体中移出，在其中导入病毒载体，然后将细胞回输回个体中。从受试者身上移出细胞用于活体外治疗，然后再导入受试者的方法为本领域所知，参见例如美国专利US5399346。或者，将重组病毒载体导入来自另一个体的细胞、培养细胞或来自任何其它合适来源的细胞中，并且将细胞给予需要的个体。Viral vectors can be introduced into cells ex vivo to administer the modified cells to an individual. In specific embodiments, the cells are removed from the subject, into which the viral vector is introduced, and the cells are then infused back into the subject. Methods for removing cells from a subject for in vitro treatment and then reintroducing the subject are known in the art, see eg, US Pat. No. 5,399,346. Alternatively, the recombinant viral vector is introduced into cells from another individual, cultured cells, or cells from any other suitable source, and the cells are administered to an individual in need.

适合于体外基因治疗的细胞如上文所述。给予受试者的细胞剂量将随受试者的年龄、条件和种类、细胞类型、细胞表达的核酸、给药方式等而变化。通常，在药学上可接受的载体中，每剂量至少给药10²至10⁸或约10³至约10⁶个细胞。在具体实施例中，将用病毒载体转导的细胞与药物载体组合以有效量给予个体。Cells suitable for in vitro gene therapy are as described above. The dose of cells administered to a subject will vary with the age, condition and species of the subject, cell type, nucleic acids expressed by the cells, mode of administration, and the like. Typically, at least 10 ² to 10 ⁸ or about 10 ³ to about 10 ⁶ cells are administered per dose in a pharmaceutically acceptable carrier. In specific embodiments, cells transduced with a viral vector are administered to an individual in an effective amount in combination with a pharmaceutical carrier.

本发明的另一方面是向受试者施用本发明病毒载体或衣壳的方法。在具体实施例中，所述方法包括将感兴趣的核酸传递给动物个体的方法，所述方法包括：向动物个体施用有效量的本发明病毒载体。可通过本领域已知的任何方法将本发明病毒载体投予人类个体或需要其的动物。任选地，病毒载体以有效剂量在药学上可接受的载体中递送。Another aspect of the invention is a method of administering to a subject a viral vector or capsid of the invention. In specific embodiments, the method includes a method of delivering a nucleic acid of interest to an animal subject, the method comprising administering to the animal subject an effective amount of a viral vector of the invention. The viral vectors of the present invention can be administered to human subjects or animals in need thereof by any method known in the art. Optionally, the viral vector is delivered in an effective dose in a pharmaceutically acceptable carrier.

本发明病毒载体可进一步投予个体以引起免疫原性反应，例如，作为疫苗。通常，本发明疫苗包含有效量的病毒与药学上可接受的载体组合。任选地，剂量足以产生保护性免疫反应。只要给予免疫原性多肽的益处大于其任何缺陷，所给予的保护程度就不必是完全的或永久的。受试者和免疫原如上文所述。The viral vectors of the present invention can be further administered to an individual to elicit an immunogenic response, eg, as a vaccine. Typically, the vaccines of the present invention comprise an effective amount of virus in combination with a pharmaceutically acceptable carrier. Optionally, the dose is sufficient to generate a protective immune response. As long as the benefits of administering the immunogenic polypeptide outweigh any disadvantages thereof, the degree of protection imparted need not be complete or permanent. Subjects and immunogens are as described above.

给受试者注射的病毒载体的剂量将取决于给药方式、待治疗的疾病或状况、个体的情况、特定的病毒载体和待递送的核酸，并且可以常规方式确定。达到治疗效果的示范性剂量是至少约10⁵，10⁶，10⁷，10⁸，10⁹，10¹⁰，10¹¹，10¹²，10¹³，10¹⁴，10¹⁵Tu或更多的病毒滴度，优选约10⁷或10⁸～10¹²，10¹³或10¹⁴Tu、更优选约10¹²Tu的病毒滴度。The dose of viral vector injected into a subject will depend on the mode of administration, the disease or condition to be treated, the circumstances of the individual, the particular viral vector and nucleic acid to be delivered, and can be determined in a conventional manner. Exemplary doses to achieve a therapeutic effect are viral titers of at least about ¹⁰⁵ , ¹⁰⁶ , ¹⁰⁷ , ¹⁰⁸ , ¹⁰⁹ , ¹⁰¹⁰ , ¹⁰¹¹ , ¹⁰¹² , ¹⁰¹³ , ¹⁰¹⁴ , ¹⁰¹⁵ Tu or more , preferably a virus titer of about 10 ⁷ or 10 ⁸ to 10 ¹² , 10 ¹³ or 10 ¹⁴ Tu, more preferably about 10 ¹² Tu.

在具体实施例中，可以使用不止一次的给药，例如，两次、三次、四次或更多次给药，可以在不同的时间间隔内来实现所需的基因表达水平。In particular embodiments, more than one administration may be used, eg, two, three, four or more administrations, at different time intervals, to achieve the desired level of gene expression.

给药方式的实例包括口服、直肠、粘膜、外用、鼻内、吸入、口腔、阴道、鞘内、眼内、经皮、子宫内、肠外、静脉内、皮下、皮内、肌肉内、皮内、胸膜内、脑内和关节内、皮肤和粘膜表面、内淋巴管内等，以及直接组织或器官注射，例如在肝、骨骼肌、心肌、膈肌或脑内直接注射。也可对肿瘤施用，例如，在肿瘤或淋巴结内或附近注射。在任何给定情况下，最合适的路径将取决于所治疗条件的性质和严重性以及所使用的特定载体的性质。Examples of modes of administration include oral, rectal, mucosal, topical, intranasal, inhalation, buccal, vaginal, intrathecal, intraocular, transdermal, intrauterine, parenteral, intravenous, subcutaneous, intradermal, intramuscular, dermal Intra, intrapleural, intracerebral and intraarticular, skin and mucosal surfaces, intralymphatic vessels, etc., as well as direct tissue or organ injection, such as direct injection into the liver, skeletal muscle, myocardium, diaphragm or brain. Tumors can also be administered, eg, injected in or near the tumor or lymph nodes. In any given situation, the most appropriate route will depend on the nature and severity of the condition being treated and the nature of the particular carrier used.

注射物可以常规形式制备，既可以是液体溶液或悬浮液，也可以是适合于注射前液体溶液或悬浮液的固体形式，或者是乳状液。或者，可以局部而不是全身性的方式来施用病毒载体，例如，以一种特定的方式，如缓释制剂。此外，病毒载体可以干燥形式输送至外科植入基质，例如骨移植替代物、缝合线、支架等。Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms suitable for liquid solutions or suspensions prior to injection, or as emulsions. Alternatively, the viral vector can be administered in a local rather than systemic manner, eg, in a specific manner, such as a sustained release formulation. In addition, viral vectors can be delivered in dry form to surgical implant matrices such as bone graft substitutes, sutures, scaffolds, and the like.

适于经口服给药的医药组合物可以离散单位呈现，例如胶囊、缓存剂、含片或片剂，每一单位均包含预定量的本发明组合物。作为粉末或颗粒、作为水性液体或非水性液体的溶液或悬浮液、或作为水包油或油包水乳化液。口服可通过将本发明病毒载体复合到能够抵抗动物肠道内消化酶降解的载体来完成。此类载体的实例包括本领域已知的胶囊或片剂。此类制剂是通过任何合适的药学方法制备的，所述方法包括将所述成分与合适的载体结合起来的步骤，所述载体可包含一种或多种如上文所述的辅料成分。一般而言，根据本发明实施例的药物组合物是通过将所述组合物与液体或细分固体载体或两者均匀且紧密地混合，然后对所得混合物进行成形来制备的。例如，可以通过压缩或成型含有所述组合物的粉末或颗粒来制备片剂，可以选择使用一种或多种辅料成分。压片是通过在适当的机器中压缩自由流动形式的组合物来制备的，例如任选地与粘合剂、润滑剂、惰性稀释剂和/或表面活性/分散剂混合的粉末或颗粒。通过在适当的机器中用惰性液体粘合剂湿润粉末状化合物来成型片剂。Pharmaceutical compositions suitable for oral administration may be presented in discrete units such as capsules, buffers, troches, or tablets, each unit containing a predetermined quantity of a composition of the present invention. As powders or granules, as solutions or suspensions in aqueous or non-aqueous liquids, or as oil-in-water or water-in-oil emulsions. Oral administration can be accomplished by complexing the viral vector of the present invention into a vector capable of resisting degradation by digestive enzymes in the intestinal tract of animals. Examples of such carriers include capsules or tablets known in the art. Such formulations are prepared by any suitable method of pharmacy which includes the step of bringing into association the ingredients with a suitable carrier, which may contain one or more excipient ingredients as described above. In general, pharmaceutical compositions according to embodiments of the present invention are prepared by uniformly and intimately admixing the composition with a liquid or finely divided solid carrier, or both, and then shaping the resulting mixture. For example, a tablet may be prepared by compressing or molding a powder or granules containing the composition, optionally with one or more excipient ingredients. Compressed tablets are prepared by compressing in a suitable machine a free-flowing form of the composition, such as powder or granules, optionally mixed with binders, lubricants, inert diluents and/or surface active/dispersing agents. Tablets are formed by moistening the powdered compound with an inert liquid binder in a suitable machine.

适用于口腔给药的药物组合物包括含片，其在调味基中包含本发明的成分，调味基通常为蔗糖和阿拉伯树胶或黄芪胶，以及包含如明胶和甘油或蔗糖和阿拉伯树胶惰性基成分的锭剂。Pharmaceutical compositions suitable for oral administration include lozenges containing the ingredients of the present invention in a flavored base, typically sucrose and acacia or tragacanth, and inert base ingredients such as gelatin and glycerin or sucrose and acacia lozenges.

适于非经肠给药的药物组合物可包含本发明组合物的无菌水性和非水性注射溶液，所述制剂任选地与预期受体的血液等渗。这些制剂可含有抗氧化剂、缓冲剂、抑菌剂和溶质，使成分与预期受体的血液等渗。水性和非水性无菌悬浮液、溶液和乳剂可包括悬浮剂和增稠剂。非水溶剂的实例包括丙二醇、聚乙二醇、植物油和可注射有机酯。水性载体包括水、酒精/水溶液、乳液或悬浮液，包括盐水和缓冲介质。非肠道药物包括氯化钠溶液、林格氏葡萄糖、葡萄糖和氯化钠、乳酸林格氏或固定油。静脉注射的载体包括液体和营养补充剂、电解质补充剂，例如基于林格氏葡萄糖的补充剂等。防腐剂和其它添加剂例如抗菌剂、抗氧化剂、螯合剂和惰性气体等也可存在。Pharmaceutical compositions suitable for parenteral administration may comprise sterile aqueous and non-aqueous injectable solutions of the compositions of the present invention, which preparations are optionally isotonic with the blood of the intended recipient. These formulations may contain antioxidants, buffers, bacteriostatic agents and solutes to render the components isotonic with the blood of the intended recipient. Aqueous and non-aqueous sterile suspensions, solutions and emulsions may include suspending and thickening agents. Examples of non-aqueous solvents include propylene glycol, polyethylene glycol, vegetable oils, and injectable organic esters. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral drugs include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers, such as those based on Ringer's dextrose, and the like. Preservatives and other additives such as antimicrobials, antioxidants, chelating agents, and inert gases and the like may also be present.

这些成分可以单位剂量或多剂量容器表示，例如，在密封的安瓿和小瓶中，并且可以储存在冷冻干燥条件下，只需要在使用前立即添加无菌液体载体，例如，盐水或注射用水。These ingredients can be presented in unit-dose or multi-dose containers, eg, in sealed ampoules and vials, and can be stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier, eg, saline or water for injection, immediately before use.

临时注射溶液和悬浮液可由上述无菌粉末、颗粒和片剂制备。例如，可以在密封容器中以单位剂量形式提供本发明的可注射、稳定、无菌的组合物。所述组合物可以冻干物的形式提供，所述冻干物可以用合适的药学上可接受的载体重组，以形成适于注射到个体中的液体组合物。单位剂型可为本发明组合物的约1μg至约10g。当所述组合物基本上不溶于水时，可以加入生理上可接受的足够量的乳化剂，以使所述组合物在水性载体中乳化。其中一种有用的乳化剂是磷脂酰胆碱。Extemporaneous injection solutions and suspensions may be prepared from sterile powders, granules and tablets as described above. For example, the injectable, stable, sterile compositions of the present invention can be presented in unit dosage form in sealed containers. The composition may be provided in the form of a lyophilisate, which may be reconstituted with a suitable pharmaceutically acceptable carrier to form a liquid composition suitable for injection into a subject. A unit dosage form can be from about 1 μg to about 10 g of the composition of the present invention. When the composition is substantially insoluble in water, an emulsifier may be added in a sufficient amount that is physiologically acceptable to emulsify the composition in the aqueous carrier. One such useful emulsifier is phosphatidylcholine.

适用于直肠给药的药物成分可以用单位剂量栓剂表示。这些可以通过将该组分与一种或多种传统固体载体混合，然后形成所得到的混合物来制备。Compositions of the drug suitable for rectal administration may be presented as unit dose suppositories. These can be prepared by mixing the components with one or more conventional solid carriers and then forming the resulting mixture.

本发明的适合局部应用于皮肤的药物组合物可以采取软膏、乳膏、乳液、糊剂、凝胶、喷雾、气雾剂或油的形式。可使用的载体包括但不限于凡士林、羊毛脂、聚乙二醇、醇类、透皮增强剂和其两种或两种以上的组合。例如，在一些实施例中，局部递送可通过将本发明的药物组合物与能够进入皮肤的亲脂性试剂混合来执行。Pharmaceutical compositions of the present invention suitable for topical application to the skin may take the form of ointments, creams, lotions, pastes, gels, sprays, aerosols or oils. Carriers that can be used include, but are not limited to, petrolatum, lanolin, polyethylene glycols, alcohols, transdermal enhancers, and combinations of two or more thereof. For example, in some embodiments, topical delivery can be performed by admixing a pharmaceutical composition of the present invention with a lipophilic agent capable of entering the skin.

适合经皮给药的药物成分可以是离散的贴片形式，适合与受试者的表皮长时间保持密切接触。适合经皮给药的组合物也可以通过离子导入递送，并且通常采用本发明组合物的任选缓冲水溶液的形式。适宜调配物可包含柠檬酸盐或bis/tris缓冲液或乙醇/水，且可含有0.1至0.2M活性成分。Pharmaceutical compositions suitable for transdermal administration may be in the form of discrete patches adapted to remain in intimate contact with the epidermis of a subject for extended periods of time. Compositions suitable for transdermal administration can also be delivered by iontophoresis, and typically take the form of an optionally buffered aqueous solution of the compositions of the present invention. Suitable formulations may contain citrate or bis/tris buffer or ethanol/water and may contain 0.1 to 0.2M active ingredient.

本文所揭示之病毒载体可通过任何适宜方法投予个体之肺，例如投予由该个体吸入的病毒载体组成的可呼吸粒子的悬浮液。可吸入颗粒物可以是液体或固体。包含病毒载体的液体粒子的气溶胶可以通过任何适当的方法产生，例如使用本领域技术人员已知的压力驱动气溶胶雾化器或超声波雾化器。包含病毒载体的固体颗粒的气溶胶同样可以通过医药领域已知的技术与任何固体颗粒药物气溶胶发生器一起产生。The viral vectors disclosed herein can be administered to the lungs of an individual by any suitable method, such as administering a suspension of respirable particles consisting of the viral vector inhaled by the individual. Respirable particulate matter can be liquid or solid. Aerosols containing liquid particles of viral vectors can be produced by any suitable method, such as using a pressure driven aerosol nebulizer or an ultrasonic nebulizer known to those skilled in the art. Aerosols of solid particles containing viral vectors can likewise be produced by techniques known in the pharmaceutical art with any solid particle pharmaceutical aerosol generator.

III嵌合AAV衣壳病毒载体的定向进化和体内筛选方法III Directed Evolution and In Vivo Screening Methods of Chimeric AAV Capsid Virus Vectors

本发明还包括一种方法，用于制备包含嵌合AAV衣壳的病毒库，然后在体内筛选具有一种或多种所需特性的嵌合AAV衣壳或病毒。所需特性的非限制性例子包括靶向性特征、逃避中和抗体的能力以及改善细胞内运输等。The present invention also includes a method for preparing a viral library comprising chimeric AAV capsids and then screening in vivo for chimeric AAV capsids or viruses having one or more desired properties. Non-limiting examples of desirable properties include targeting characteristics, the ability to evade neutralizing antibodies, and improved intracellular trafficking, among others.

在代表性的实施方案中，本发明提供了一种识别感兴趣特性的病毒载体的方法，例如，一种识别感兴趣特性的AAV载体或AAV衣壳的方法，该方法包括：In representative embodiments, the present invention provides a method of identifying a viral vector for a property of interest, e.g., a method of identifying an AAV vector or AAV capsid for a property of interest, the method comprising:

第一，提供病毒载体例如AAV颗粒的集合，其中该集合内的每个AAV载体包括：包含通过对两种或多种不同AAV的衣壳编码序列进行改组而产生的衣壳蛋白，其中所述两种或多种不同AAV的衣壳氨基酸序列相差至少两个氨基酸；和病毒载体基因组，例如AAV载体基因组，所述病毒载体基因组包含前述改组产生的AAV衣壳蛋白编码序列、一种AAV Rep的编码序列以及至少一个末端重复例如AAV的5’和/或3’末端重复，其中所述病毒载体基因组包裹于AAV衣壳中。First, a collection of viral vectors, such as AAV particles, is provided, wherein each AAV vector within the collection comprises a capsid protein produced by shuffling the capsid coding sequences of two or more different AAVs, wherein the The capsid amino acid sequences of two or more different AAVs differ by at least two amino acids; and a viral vector genome, such as an AAV vector genome, comprising the previously shuffled AAV capsid protein coding sequence, an AAV Rep The coding sequence and at least one terminal repeat such as the 5' and/or 3' terminal repeats of AAV in which the viral vector genome is packaged in the AAV capsid.

第二，向受试者施用病毒载体的集合；Second, administering to the subject a collection of viral vectors;

第三，从目标组织中回收多个作为病毒粒子或作为编码AAV衣壳的病毒载体基因组的病毒载体,从而鉴定具有感兴趣特性的病毒载体或AAV衣壳。Third, multiple viral vectors are recovered from target tissues as virions or as viral vector genomes encoding AAV capsids to identify viral vectors or AAV capsids with properties of interest.

本发明还可用于识别一种在体内具有逃避中和抗体能力的嵌合AAV衣壳或病毒颗粒，例如，在人血清中发现的中和抗体。例如，可通过向受试者注射人类免疫球蛋白来进行对中和抗体抗性的体内筛选。例如，向非人类哺乳动物受试者注射IVIG。IVIG天然含有针对人类所有常见AAV的抗体混合物。或者，可以给受试者注射特定的中和抗体，然后将嵌合病毒库注射到受试者中，对进入感兴趣的目标组织(例如心脏、骨骼肌、肝脏等)的病毒基因组进行选择，从目标组织分离出的基因组与能够逃避中和的衣壳相对应。The present invention can also be used to identify a chimeric AAV capsid or virus particle that has the ability to evade neutralizing antibodies in vivo, eg, neutralizing antibodies found in human serum. For example, in vivo screening for neutralizing antibody resistance can be performed by injecting a subject with human immunoglobulin. For example, IVIG is injected into a non-human mammalian subject. IVIG naturally contains a cocktail of antibodies against all common AAVs in humans. Alternatively, a subject can be injected with a specific neutralizing antibody, and then a pool of chimeric viruses can be injected into the subject to select for viral genomes that enter the target tissue of interest (eg, heart, skeletal muscle, liver, etc.), Genomes isolated from target tissues correspond to capsids capable of evading neutralization.

因此，在具有代表性的实施例中，本发明提供了一种识别具有逃避中和IgGs能力的嵌合AAV衣壳或病毒载体的方法，所述方法包括：Accordingly, in representative embodiments, the present invention provides a method of identifying a chimeric AAV capsid or viral vector capable of evading neutralizing IgGs, the method comprising:

第一，向哺乳动物个体施用IgGs；First, administering IgGs to individual mammals;

第二，提供病毒载体例如AAV颗粒的集合，其中该集合内的每个AAV载体包括：包含由通过对两种或多种不同AAV的衣壳编码序列进行改组而产生的衣壳蛋白，其中所述两种或多种不同AAV的衣壳氨基酸序列相差至少两个氨基酸；和病毒载体基因组，例如AAV载体基因组，所述病毒载体基因组包含前述改组产生的AAV衣壳蛋白编码序列、一种AAV Rep的编码序列以及至少一个末端重复例如AAV的5’和/或3’末端重复，其中所述病毒载体基因组包裹于AAV衣壳中。Second, a collection of viral vectors, such as AAV particles, is provided, wherein each AAV vector within the collection comprises a capsid protein produced by shuffling the capsid-encoding sequences of two or more different AAVs, wherein the The capsid amino acid sequences of the two or more different AAVs differ by at least two amino acids; and a viral vector genome, such as an AAV vector genome, comprising the AAV capsid protein coding sequence produced by the aforementioned shuffling, an AAV Rep and at least one terminal repeat, such as the 5' and/or 3' terminal repeat of AAV, wherein the viral vector genome is packaged in the AAV capsid.

第三，向受试者施用病毒载体的集合；third, administering the collection of viral vectors to the subject;

第四，从目标组织中回收多个作为病毒粒子或作为编码AAV衣壳的病毒载体基因组的病毒载体，从而鉴定具有逃避中和抗体特性的病毒载体或AAV衣壳。Fourth, multiple viral vectors as virions or as viral vector genomes encoding AAV capsids are recovered from target tissues to identify viral vectors or AAV capsids with neutralizing antibody evasion properties.

“逃避”中和抗体，意指与适当的对照组相比，中和至少部分减少，但只要中和程度与对照组相比有所降低，并且只要一些载体能够到达并转导目标组织，则“逃避”的程度就不需要完全。"Escape" neutralizing antibodies, meaning that neutralization is at least partially reduced compared to an appropriate control, but as long as the degree of neutralization is reduced compared to the control, and as long as some vector is able to reach and transduce the target tissue, then The degree of "escape" does not need to be complete.

在具体实施例中，目标组织是肝脏、骨骼肌、心肌、横膈膜肌、肾脏、胰腺、脾脏、胃肠道、肺、关节组织、舌头、卵巢、睾丸、生殖细胞、癌细胞或上述的组合。In specific embodiments, the target tissue is liver, skeletal muscle, cardiac muscle, diaphragm muscle, kidney, pancreas, spleen, gastrointestinal tract, lung, joint tissue, tongue, ovary, testis, germ cells, cancer cells, or the above combination.

两个或多个AAV衣壳的任意组合的序列，无论是自然发生的还是修改的，无论是现在已知的还是以后发现的，都可以被“改组”来产生包含嵌合衣壳的AAV载体集合。在代表性实施例中，AAV载体的集合包含通过从以下两个或多个衣壳序列改组而生成嵌合衣壳，所述AAV衣壳包括：AAV1、AAV2、AAV3A、AAV3B、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、禽AAV、牛AAV、犬AAV、马AAV、绵羊AAV、鹅AAV或蛇AAV。如上所述，AAV衣壳的集合可以进一步包含目前已知或后来发现的非自然发生的AAV衣壳。这种修饰包括替换，包括修饰核酸和/或氨基酸的替换，以及删除和/或插入。Sequences of any combination of two or more AAV capsids, whether naturally occurring or modified, whether now known or later discovered, can be "shuffled" to generate AAV vectors containing chimeric capsids gather. In representative embodiments, the collection of AAV vectors comprises the generation of chimeric capsids by shuffling from two or more of the following capsid sequences, the AAV capsids comprising: AAV1, AAV2, AAV3A, AAV3B, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, avian AAV, bovine AAV, canine AAV, equine AAV, ovine AAV, goose AAV or snake AAV. As noted above, the collection of AAV capsids may further comprise currently known or later discovered non-naturally occurring AAV capsids. Such modifications include substitutions, including substitutions of modified nucleic acids and/or amino acids, as well as deletions and/or insertions.

通过本领域已知的将突变引入核酸和/或氨基酸序列的任何方法，例如使用化学诱变剂、易错的PCR、盒式突变等，可以实现嵌合衣壳的进一步多样性。Further diversity of chimeric capsids can be achieved by any method known in the art for introducing mutations into nucleic acid and/or amino acid sequences, eg, the use of chemical mutagens, error-prone PCR, cassette mutagenesis, and the like.

任选地，可将体内筛选方法与一轮或多轮体外筛选组合以进一步优化载体。例如，可以进行体内选择，以识别具有所需特性的嵌合AAV衣壳，然后体外选择可用于识别具有逃避抗体中和能力的AAV衣壳。Optionally, in vivo screening methods can be combined with one or more rounds of in vitro screening to further optimize the vector. For example, in vivo selection can be performed to identify chimeric AAV capsids with desired properties, and then in vitro selection can be used to identify AAV capsids with the ability to evade neutralization by antibodies.

AAV颗粒的集合可以通过任何合适的方法给受试者服用。在具体实施例中，将集合物投予受试者的血液，例如静脉或关节内。The collection of AAV particles can be administered to a subject by any suitable method. In specific embodiments, the collection is administered to the blood of the subject, eg, intravenously or intra-articularly.

给药方式和受试者如本文其它地方所述。The mode of administration and subjects are as described elsewhere herein.

本发明可用于在体内识别具有所需特性的嵌合病毒或病毒衣壳。因此，在具体实施例中，本发明的方法包括从两个或多个目标组织中回收AAV粒子或编码相同AAV粒子的病毒基因组，并识别对所述两个或多个目标组织具有所需特性的嵌合病毒或嵌合AAV衣壳。例如，在具体实施例中，识别对骨骼肌和/或心肌具有低效靶向性和或对肝脏具有高效靶向性的嵌合病毒或嵌合AAV衣壳。The present invention can be used to identify chimeric viruses or viral capsids with desired properties in vivo. Thus, in specific embodiments, the methods of the present invention comprise recovering AAV particles or viral genomes encoding the same AAV particles from two or more target tissues, and identifying those two or more target tissues with desired properties chimeric virus or chimeric AAV capsid. For example, in specific embodiments, chimeric viruses or chimeric AAV capsids are identified that have inefficient targeting to skeletal muscle and/or cardiac muscle and/or have efficient targeting to liver.

目标细胞或组织或其之一也可以是癌细胞或肿瘤组织。例如，可以对癌症的动物模型施用嵌合病毒，并且从癌细胞或肿瘤中分离编码该病毒的嵌合病毒外壳或病毒基因组。在具有代表性的实施例中，动物模型可以是具有增加形成癌症或肿瘤可能性的模型，或者可以是将人类肿瘤细胞移植到动物中的异种移植模型。The target cell or tissue, or one of them, can also be a cancer cell or tumor tissue. For example, a chimeric virus can be administered to an animal model of cancer and the chimeric viral coat or viral genome encoding the virus isolated from cancer cells or tumors. In representative embodiments, the animal model can be a model with an increased likelihood of developing a cancer or tumor, or can be a xenograft model in which human tumor cells are transplanted into an animal.

DNA“改组”或“嵌合“的示例性方法，也称为“分子育种”、“快速强制进化”等，在本领域是已知的(US5605793、US6165793、US6117679，Stemmer,1994，Proc.Nati.Acad.Sci91:10747-10751，和Soong等,2000，Nature Genetics 25:436-439)。这种方法也被应用于病毒的定向进化(美国专利US6096548，美国专利US6596539)。在一个代表性实施例中，通过同源和/或非同源重组将AAV衣壳蛋白编码序列的集合在体外片段化和重组，以形成“嵌合”AAV衣壳蛋白的集合。每个嵌合衣壳包裹一种包含相应衣壳编码序列的核酸，例如AAV基因组，从而生成嵌合病毒。嵌合病毒集合被施用于受试者，并根据感兴趣的特性进行体内选择。例如，可以从一个或多个靶组织中分离出嵌合病毒，以识别具有所需特性的优化的衣壳蛋白。Exemplary methods of DNA "shuffling" or "chimerism", also known as "molecular breeding", "rapid forced evolution", etc., are known in the art (US5605793, US6165793, US6117679, Stemmer, 1994, Proc.Nati . Acad. Sci 91:10747-10751, and Soong et al., 2000, Nature Genetics 25:436-439). This approach has also been applied to directed evolution of viruses (US Patent US6096548, US Patent US6596539). In a representative embodiment, a collection of AAV capsid protein coding sequences is fragmented and recombined in vitro by homologous and/or non-homologous recombination to form a collection of "chimeric" AAV capsid proteins. Each chimeric capsid encapsulates a nucleic acid, eg, an AAV genome, comprising the corresponding capsid-encoding sequence, thereby creating a chimeric virus. Chimeric virus pools are administered to subjects and selected in vivo for properties of interest. For example, chimeric viruses can be isolated from one or more target tissues to identify optimized capsid proteins with desired properties.

Ⅳ具体实施方式IV Specific Embodiments

在一方面，本发明提供了一种编码AAV衣壳蛋白的核酸，所述核酸包含选自以下组中任一核苷酸序列：In one aspect, the invention provides a nucleic acid encoding an AAV capsid protein, the nucleic acid comprising any nucleotide sequence selected from the group consisting of:

(a)核苷酸序列SEQ ID NO:1；(a) the nucleotide sequence of SEQ ID NO: 1;

(b)核苷酸序列SEQ ID NO:3；(b) the nucleotide sequence of SEQ ID NO: 3;

(c)核苷酸序列SEQ ID NO:5；(c) the nucleotide sequence of SEQ ID NO: 5;

(d)核苷酸序列SEQ ID NO:7；(d) the nucleotide sequence of SEQ ID NO: 7;

(e)核苷酸序列SEQ ID NO:9；(e) the nucleotide sequence of SEQ ID NO: 9;

(f)核苷酸序列SEQ ID NO:11；(f) the nucleotide sequence of SEQ ID NO: 11;

(g)核苷酸序列SEQ ID NO:13；(g) the nucleotide sequence of SEQ ID NO: 13;

(h)核苷酸序列SEQ ID NO:15；(h) the nucleotide sequence of SEQ ID NO: 15;

(i)核苷酸序列SEQ ID NO:17；或(i) the nucleotide sequence of SEQ ID NO: 17; or

(j)由(a)-(i)任一核苷酸序列编码的AAV衣壳蛋白的核苷酸序列，其因遗传密码的简并性而与(a)-(i)的核苷酸序列不同。(j) The nucleotide sequence of the AAV capsid protein encoded by any of the nucleotide sequences (a)-(i), which, due to the degeneracy of the genetic code, is different from the nucleotides of (a)-(i) The sequence is different.

所述核酸为质粒、噬菌体、病毒载体、细菌人工染色体、酵母人工染色体，优选包含有编码序列的AAV载体，更优选地所述核酸还包含一种AAV Rep蛋白的编码序列。The nucleic acid is a plasmid, bacteriophage, viral vector, bacterial artificial chromosome, yeast artificial chromosome, preferably an AAV vector containing a coding sequence, more preferably, the nucleic acid further comprises a coding sequence of an AAV Rep protein.

在一方面，本发明还提供一种由以上所述核酸编码的AAV衣壳蛋白，所述AAV衣壳蛋白的氨基酸序列包含选自SEQ ID NO:2，SEQ ID NO:4，SEQ ID NO:6，SEQ ID NO:8，SEQID NO:10，SEQ ID NO:12，SEQ ID NO:14，SEQ ID NO:16，或SEQ ID NO:18中的任一种。优选地，所述AAV衣壳共价连接、结合或包裹一种组合物，所述组合物选自DNA分子、RNA分子、多肽、碳水化合物、脂质体和小的有机分子中的一种或几种。In one aspect, the present invention also provides an AAV capsid protein encoded by the above-mentioned nucleic acid, the amino acid sequence of the AAV capsid protein comprising selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 6. Any of SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, or SEQ ID NO: 18. Preferably, the AAV capsid covalently links, binds or encapsulates a composition selected from one of DNA molecules, RNA molecules, polypeptides, carbohydrates, liposomes and small organic molecules or several.

在另一方面，本发明提供了一种重组病毒粒子，其包含前面所述的核酸和/或衣壳蛋白。所述重组病毒粒子选自重组AAV病毒粒子，重组腺病毒粒子，重组疱疹病毒粒子、重组杆状病毒粒子，或重组杂合病毒粒子。In another aspect, the present invention provides a recombinant virus particle comprising the aforementioned nucleic acid and/or capsid protein. The recombinant virus particles are selected from recombinant AAV virus particles, recombinant adenovirus particles, recombinant herpes virus particles, recombinant baculovirus particles, or recombinant hybrid virus particles.

在另一方面，本发明提供了一种重组AAV病毒粒子，其包含AAV载体基因组和前述AAV衣壳蛋白，其中AAV载体基因组包裹于AAV衣壳中。所述AAV载体基因组中包含异源核酸序列。所述异源核酸序列编码选自反义RNA、microRNA、shRNA、多肽和免疫原中一种或几种。优选地，所述异源核酸序列编码的多肽为治疗性多肽或报告基因，其中所述异源核酸编码的治疗性多肽选自胰岛素、胰高血糖素、生长激素、生长激素释放因子、促红细胞生成素、胰岛素生长因子、转化生长因子α、肝细胞生长因子、酪氨酸羟化酶、血小板生成素、白介素1-白介素25、低密度脂蛋白受体、糖皮质激素受体、维生素D受体、干扰素调节因子、Ⅷ因子、Ⅸ因子、葡萄糖苷酶、葡萄糖-6-磷酸酶、异戊酰-CoA脱氢酶、丙酰-CoA羧化酶、β-葡糖苷酶、肝磷酸化酶、磷酸化酶激酶、甘氨酸脱羧酶、α-半乳糖苷酶、β-半乳糖苷酶和溶酶体酶所组成组中的一种或几种。In another aspect, the present invention provides a recombinant AAV virion comprising an AAV vector genome and the aforementioned AAV capsid protein, wherein the AAV vector genome is encapsulated in the AAV capsid. The AAV vector genome contains heterologous nucleic acid sequences. The heterologous nucleic acid sequence encodes one or more selected from antisense RNA, microRNA, shRNA, polypeptide and immunogen. Preferably, the polypeptide encoded by the heterologous nucleic acid sequence is a therapeutic polypeptide or a reporter gene, wherein the therapeutic polypeptide encoded by the heterologous nucleic acid is selected from insulin, glucagon, growth hormone, growth hormone releasing factor, erythropoietin Progenin, insulin growth factor, transforming growth factor alpha, hepatocyte growth factor, tyrosine hydroxylase, thrombopoietin, interleukin 1-interleukin 25, low density lipoprotein receptor, glucocorticoid receptor, vitamin D receptor body, interferon regulatory factor, factor VIII, factor IX, glucosidase, glucose-6-phosphatase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, beta-glucosidase, liver phosphorylation One or more of the group consisting of enzyme, phosphorylase kinase, glycine decarboxylase, α-galactosidase, β-galactosidase and lysosomal enzyme.

在一方面，本发明提供一种细胞，其包含前述的核酸、AAV衣壳蛋白、重组病毒粒子和/或重组AAV病毒粒子。In one aspect, the present invention provides a cell comprising the aforementioned nucleic acid, AAV capsid protein, recombinant virion and/or recombinant AAV virion.

在另一方面，本发明提供一种药物组合物，其包含一种药学上可接受载体或辅料和选自权前述的核酸、AAV衣壳蛋白、重组病毒粒子、重组AAV病毒粒子和/或前述细胞中的一种或几种。In another aspect, the present invention provides a pharmaceutical composition comprising a pharmaceutically acceptable carrier or adjuvant and selected from the aforementioned nucleic acid, AAV capsid protein, recombinant virus particle, recombinant AAV virus particle and/or the aforementioned one or more of the cells.

在另一方面，本发明还提供了前述的核酸、AAV衣壳蛋白、重组病毒粒子、重组AAV病毒粒子、细胞和/或前述药物组合物中的一种或几种在制备用于预防或治疗疾病的药物中的用途，所述疾病选自由囊性纤维化和肺部其它疾病、血友病A、血友病B、地中海贫血、贫血和其它血液疾病、老年痴呆症、多发性硬化症、帕金森氏病、亨廷顿氏病、肌萎缩侧索硬化症、癫痫、癌症、糖尿病、肌营养不良、糖原贮积病和其它代谢缺陷、先天性肺气肿、Lesch-Nyhan综合征、Niemann-Pick病、艾滋病、肝炎、高氨血症和脊髓性脑共济失调所组成组中的一种或几种。In another aspect, the present invention also provides that one or more of the aforementioned nucleic acid, AAV capsid protein, recombinant virus particle, recombinant AAV virus particle, cell and/or the aforementioned pharmaceutical composition are prepared for prevention or treatment Use in a medicament for a disease selected from cystic fibrosis and other lung diseases, hemophilia A, hemophilia B, thalassemia, anemia and other blood diseases, Alzheimer's disease, multiple sclerosis, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, epilepsy, cancer, diabetes, muscular dystrophy, glycogen storage disease and other metabolic defects, congenital emphysema, Lesch-Nyhan syndrome, Niemann- One or more of the group consisting of Pick disease, AIDS, hepatitis, hyperammonemia and spinal cerebral ataxia.

在一方面，本发明提供了一种制备重组AAV病毒粒子的方法，所述方法包括在体外给细胞提供一种前述的核酸、一种AAV Rep蛋白的核酸编码序列、一种携带异源核酸序列的AAV载体基因组、有助于产生感染性AAV的辅助因子，以及允许AAV载体基因组包裹于前述核酸编码的AAV衣壳中，并实现重组AAV病毒粒子的组装，所述方法为AAV载体制备系统，包括两质粒包装系统、三质粒包装系统、杆状病毒包装系统和以Ad或HSV为辅毒的AAV包装系统等。In one aspect, the present invention provides a method for preparing recombinant AAV virions, the method comprising providing a cell with the aforementioned nucleic acid, a nucleic acid encoding sequence for an AAV Rep protein, a nucleic acid sequence carrying a heterologous nucleic acid in vitro AAV vector genome, cofactors that help to produce infectious AAV, and allow AAV vector genome to be wrapped in the AAV capsid encoded by the aforementioned nucleic acid, and realize the assembly of recombinant AAV virus particles, the method is an AAV vector preparation system, Including two-plasmid packaging system, three-plasmid packaging system, baculovirus packaging system and AAV packaging system with Ad or HSV as co-drug.

在一方面，本发明还提供了一种在体外向细胞递送异源核酸的方法，所述方法包括向细胞施用前述的核酸、病毒衣壳蛋白、重组病毒粒子、重组AAV病毒粒子和/或药物组合物，所述细胞是哺乳动物细胞，优选人细胞，优选人干细胞或肝细胞。In one aspect, the present invention also provides a method of delivering heterologous nucleic acid to a cell in vitro, the method comprising administering to the cell the aforementioned nucleic acid, viral capsid protein, recombinant virion, recombinant AAV virion and/or drug A composition wherein the cells are mammalian cells, preferably human cells, preferably human stem cells or hepatocytes.

在另一方面，本发明还提供了一种向哺乳动物递送异源核酸的方法，所述方法包括向哺乳动物个体施用有效剂量前述的核酸、病毒衣壳蛋白、重组病毒粒子、重组AAV病毒粒子、前述细胞和/或前述的药物组合物，其中所述哺乳动物为人类个体或灵长类个体。In another aspect, the present invention also provides a method of delivering a heterologous nucleic acid to a mammal, the method comprising administering to the mammalian individual an effective dose of the aforementioned nucleic acid, viral capsid protein, recombinant virion, recombinant AAV virion , the aforementioned cell and/or the aforementioned pharmaceutical composition, wherein the mammal is a human individual or a primate individual.

在描述本发明之后，将在以下实施例中更详细地阐述本发明，这些实施例仅用于说明目的，并不用于限制本发明。Having described the present invention, the present invention will be illustrated in more detail in the following examples, which are for illustrative purposes only and are not intended to limit the present invention.

实施例Example

下面的实施例描述了通过定向进化和体内筛选方法对AAV衣壳基因进行改组，以产生一组对肝脏靶向性优良或优异的载体。采用DNA改组技术对AAV衣壳基因进行改组并构建AAV衣壳基因库，用于小鼠模型体内筛选。分离小鼠肝脏富集的AAV突变体，并通过对AAV突变体外壳的体外和体内活性测试来对其靶向性进行确定。The following examples describe the shuffling of AAV capsid genes by directed evolution and in vivo screening methods to generate a set of vectors that are superior or superior to liver targeting. The AAV capsid gene was shuffled by DNA shuffling technology and an AAV capsid gene library was constructed for in vivo screening of mouse models. Mouse liver-enriched AAV mutants were isolated and their targeting was determined by testing for in vitro and in vivo activity of the AAV mutant coat.

实施例1嵌合AAV质粒文库构建Example 1 Construction of chimeric AAV plasmid library

为了获得嵌合的AAV外壳基因Cap，首先利用上游引物primerA和下游引物primerB，分别从不同血清型AAV母本扩增外壳全长基因，选择的母本AAV外壳基因包括AAV1外壳(NCBI SequenceID:AF063497.1，Cap的核酸编码序列2223-4433)、AAV2外壳(NCBISequenceID:AF043303.1，Cap的核酸编码序列2203-4410)、AAV3B外壳(NCBI SequenceID:AF028705.1，Cap的核酸编码序列2208-4418)、AAV7外壳(NCBI SequenceID:AF513851.1，Cap的核酸编码序列2222-4435)、AAV8外壳(NCBI SequenceID:AF513852.1，Cap的核酸编码序列2121-4337)、AAV9外壳(NCBI SequenceID:530579.1，Cap的核酸编码序列1-2211)。上游引物primerA为5’-CCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3’，下游引物primerB为5’-ATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3’，在primeSTAR Max DNA polymerase(TaKaRa公司，货号R045A)作用下进行PCR扩增，扩增条件：95℃5min 1个循环；98℃8s，60℃5s，72℃15s，35个循环；72℃5min，1个循环。In order to obtain the chimeric AAV coat gene Cap, firstly, using the upstream primer primerA and the downstream primer primerB to amplify the full-length coat gene from different serotype AAV mothers, respectively, the selected maternal AAV coat gene includes the AAV1 coat (NCBI SequenceID: AF063497 .1, Nucleic acid coding sequence 2223-4433 of Cap), AAV2 coat (NCBISequenceID: AF043303.1, Nucleic acid coding sequence 2203-4410 of Cap), AAV3B coat (NCBI SequenceID: AF028705.1, Nucleic acid coding sequence 2208-4418 of Cap) ), AAV7 shell (NCBI SequenceID: AF513851.1, nucleic acid coding sequence 2222-4435 of Cap), AAV8 shell (NCBI SequenceID: AF513852.1, nucleic acid coding sequence 2121-4337 of Cap), AAV9 shell (NCBI SequenceID: 530579.1, Cap's nucleic acid coding sequence 1-2211). The upstream primer primerA is 5'-CCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3', and the downstream primer primerB is 5'-ATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3'. PCR amplification was performed under the action of primeSTAR Max DNA polymerase (TaKaRa Company, Cat. No. R045A). Amplification conditions: 95℃ for 5min 1 cycle; 98°C for 8s, 60°C for 5s, 72°C for 15s, 35 cycles; 72°C for 5min, 1 cycle.

将扩增获得的Cap基因等质量混合，获得总质量为4ug的DNA模板，用0.04U的DNaseI进行随机破碎消化，通常条件下22℃消化6-8min后，75℃灭活DNaseI酶10min。琼脂糖凝胶电泳获得弥散性的DNA条带，其长度大部分集中在500-1000bp，回收这部分长度范围的DNA片段。The amplified Cap genes were mixed with equal mass to obtain a DNA template with a total mass of 4ug, which was randomly fragmented and digested with 0.04U of DNaseI. Under normal conditions, after digestion at 22°C for 6-8 minutes, the DNaseI enzyme was inactivated at 75°C for 10 minutes. Dispersed DNA bands were obtained by agarose gel electrophoresis, most of which were concentrated in 500-1000 bp in length, and DNA fragments in this part of the length range were recovered.

回收的DNA片段首先互为引物进行随机的无引物扩增，为增加其多样性及扩增效率，采用非常规单一退火PCR模式(94℃60s，65℃90s，72℃90s，10个循环；94℃60s，62℃90s，72℃90s，10个循环；94℃60s，59℃90s，72℃90s，10个循环；94℃60s，56℃90s，72℃90s，10个循环；94℃60s，53℃90s，72℃90s，10个循环；94℃60s，50℃90s，72℃90s，10个循环；94℃60s，47℃90s，72℃90s，10个循环)。The recovered DNA fragments were firstly subjected to random primerless amplification as primers for each other. In order to increase their diversity and amplification efficiency, an unconventional single annealing PCR mode was used (94°C for 60s, 65°C for 90s, 72°C for 90s, 10 cycles; 94℃ for 60s, 62℃ for 90s, 72℃ for 90s, 10 cycles; 94℃ for 60s, 59℃ for 90s, 72℃ for 90s, 10 cycles; 94℃ for 60s, 56℃ for 90s, 72℃ for 90s, 10 cycles; 94℃ 60s, 53°C 90s, 72°C 90s, 10 cycles; 94°C 60s, 50°C 90s, 72°C 90s, 10 cycles; 94°C 60s, 47°C 90s, 72°C 90s, 10 cycles).

以无引物PCR产物为模板，利用上游引物T-primerA(5’-ACGCCTGCCGTTCGACGATTCCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3’)和下游引物T-primerB(5’-ACGCGCGGATCTTCCAGAGATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3’)，在HiFi DNA聚合酶(TransGenBiotech公司，货号AS131-22)作用下，进行嵌合Cap基因的全长扩增。扩增程序94℃30s，62℃30s，72℃2.5min，共40循环。获得长度约为2500bp的嵌合全长Cap DNA片段，用HindIII和NotI进行双酶切，与经过同样双酶切处理的pSNAV2.3载体(含有单链AAV2基因组ITR及聚合酶Rep基因序列，缺失Cap基因)连接，连接产物电击转化到E.coli HST08细胞中，制备了一个库容大于1E+6个克隆的嵌合型AAV质粒文库。PCR鉴定为阳性克隆的质粒分别进行PstI、HaeIII、TaqI酶切鉴定，观察每个质粒酶切形态差异(图1)，以判定质粒文库的多样性。Using the primerless PCR product as a template, the upstream primer T-primerA (5'-ACGCCTGCCGTTCGACGATTCCCAAGCTTCGATCAACTACGCAGACAGGTACCAA-3') and the downstream primer T-primerB (5'-ACGCGCGGATCTTTCCAGAGATAAGAATGCGGCCGCAGAGACCAAAGTTCAACTGAAACGA-3') were used in HiFi DNA polymerase (TransGenBiotech, Cat. No. AS131). Under the action of -22), the full-length amplification of the chimeric Cap gene is carried out. The amplification program was 94 °C for 30 s, 62 °C for 30 s, and 72 °C for 2.5 min, for a total of 40 cycles. A chimeric full-length Cap DNA fragment with a length of about 2500 bp was obtained, which was double digested with HindIII and NotI, and the pSNAV2.3 vector (containing the single-stranded AAV2 genome ITR and polymerase Rep gene sequences, which was subjected to the same double digestion treatment, was deleted). Cap gene) ligation, the ligation product was electroporated into E. coli HST08 cells, and a chimeric AAV plasmid library with a capacity of more than 1E+6 clones was prepared. The plasmids identified as positive clones by PCR were digested with PstI, HaeIII, and TaqI, respectively, and the differences in the shape of each plasmid were observed (Figure 1) to determine the diversity of the plasmid library.

实施例2新型AAV病毒文库的包装及滴度检测Example 2 Packaging and titer detection of novel AAV virus library

上述嵌合质粒文库在helper辅助下进行自我包装复制，包装方法为两步法，首先用文库质粒、R2C2、helper质粒以质量比1:1:2的比例进行转染，包装成AAV中间病毒库，此时组成病毒库的AAV为杂合型AAV，即病毒外壳蛋白不一定是由其包装的基因组表达，收获液进行氯仿抽提纯化后，检测基因组滴度；第二步以感染指数MOI为100感染293T细胞，保证每个细胞感染的病毒基因组拷贝数＜10，此时产生的最终AAV病毒库为纯合型，收获液进行氯仿抽提纯化后，检测基因组滴度(参见Muller等，2003，Nature Biotechnology，21:1040-1046)。在包装时使用文库质粒质量非常低，用来保证每一个新型AAV基因组能够被包装于其自身表达的Cap蛋白形成的外壳内。包装的病毒颗粒能够正常表达外壳蛋白，证明其能够形成完整的病毒外壳，具备感染能力。The above-mentioned chimeric plasmid library is self-packaged and replicated with helper assistance. The packaging method is a two-step method. First, the library plasmid, R2C2, and helper plasmid are used for transfection in a mass ratio of 1:1:2, and packaged into an AAV intermediate virus library. At this time, the AAV that constitutes the virus library is a hybrid AAV, that is, the viral coat protein is not necessarily expressed by its packaged genome. After the harvest liquid is extracted and purified by chloroform, the genome titer is detected; the second step is to take the infection index MOI as 100 infected 293T cells to ensure that the number of copies of the virus genome infected by each cell is less than 10, and the final AAV virus library generated at this time is homozygous. After the harvest liquid is extracted and purified with chloroform, the genome titer is detected (see Muller et al., 2003 , Nature Biotechnology, 21:1040-1046). The library plasmids used in packaging are of very low quality to ensure that each novel AAV genome can be packaged in a shell formed by its own expressed Cap protein. The packaged virus particles can express the coat protein normally, which proves that they can form a complete viral coat and have the ability to infect.

取适量纯化AAV样品，按下表(表1)配制DNase I消化反应混合液，37℃孵育30min，75℃孵育10min，失活DNase I。Take an appropriate amount of purified AAV sample, prepare a DNase I digestion reaction mixture according to the following table (Table 1), incubate at 37 °C for 30 min, and incubate at 75 °C for 10 min to inactivate DNase I.

表1Table 1

AAV样品AAV samples 5μl5μl 10×Dnase I缓冲液10×DNase I buffer 5μl5μl Dnase IDNAse I 1μl1μl 无Rnase的水RNase-free water 39μl39μl 合计total 50μl50μl

处理后的AAV纯化样品稀释适当的倍数后，参照下表(表2)配置Q-PCR反应体系，并按照下列程序进行检测。After the treated AAV purified sample is diluted by an appropriate factor, the Q-PCR reaction system is configured with reference to the following table (Table 2), and the detection is performed according to the following procedure.

表2Table 2

其中使用的引物参见下表(表3)：The primers used are shown in the following table (Table 3):

表3table 3

正向引物(5’-3’)Forward primer (5'-3') AAGGTGGTGGATGAGTGCTACAAAGGTGGTGGATGAGTGCTACA 反向引物(5’-3’)Reverse primer (5'-3') TGGAGCTCAGGCTGGGTTTTGGAGCTCAGGCTGGGTTT 探针引物probe primer CCCCAATTACTTGCTCCCCCCAATTACTTGCTCC

包装产量结果参见下表(表4)：The packaging yield results are shown in the table below (Table 4):

表4Table 4

病毒载体名称viral vector name 基因组滴度(vg/ml)Genome titer (vg/ml) 新型AAV病毒库Novel AAV virus library 2.5E+112.5E+11

实施例3新型AAV载体在小鼠体内筛选与富集Example 3 Screening and enrichment of novel AAV vectors in mice

新型AAV载体包装而成的病毒以1.5E+11vg/只剂量通过尾部静脉注射到8周龄C57BL/6J小鼠体内，进行肝脏靶向筛选与富集。注射三天后处死小鼠，收集全部肝脏组织，液氮匀浆后提取总DNA。再次用T-primerA/T-primerB这对引物进行扩增，扩增后的新型AAV全长Cap基因混合而成的DNA用上述方法构建到pSNAV2.3载体上。通过PCR鉴定获得370个阳性单克隆进行等质量混合，形成第二次的质粒文库。再包装成病毒文库，再次以1.5E+11vg/只尾部静脉注射到8周龄C57BL/6J小鼠体内，注射三天后处死小鼠，收集全部肝脏组织、心脏组织及骨骼肌。分别从以上组织随机挑选100、50、50个阳性克隆进行测序。测序后的阳性克隆用BioEdit，VectorNTI，ClustalX2及TreeViewX进行序列比对分析。The virus packaged by the novel AAV vector was injected into 8-week-old C57BL/6J mice at a dose of 1.5E+11vg/vesicle via tail vein for liver targeting screening and enrichment. Mice were sacrificed three days after injection, and all liver tissues were collected, and total DNA was extracted after homogenization in liquid nitrogen. The primer pair T-primerA/T-primerB was used for amplification again, and the amplified DNA obtained by mixing the full-length Cap genes of the novel AAV was constructed into the pSNAV2.3 vector by the above method. 370 positive monoclones were obtained by PCR identification and mixed with equal mass to form the second plasmid library. It was repackaged into a virus library and injected into 8-week-old C57BL/6J mice with 1.5E+11vg/tail vein again. The mice were sacrificed three days after injection, and all liver tissue, heart tissue and skeletal muscle were collected. 100, 50 and 50 positive clones were randomly selected from the above tissues for sequencing. The sequenced positive clones were analyzed by BioEdit, VectorNTI, ClustalX2 and TreeViewX.

选择肝脏高频出现，心脏和骨骼肌低频出现或不出现的新型AAV载体。共获得9组高频肝脏靶向新型AAV突变体(图2)，分析以上9种新型AAV-Cap突变体的核苷酸序列和氨基酸序列(图3)。由于其它4种包装滴度低，不适合工业化需要。所以，选择其中5种新型AAV，即L37、L57、L58、L107、L10进行后续感染活性测试实验。Novel AAV vectors with high frequency presence in the liver and low frequency presence or absence in the heart and skeletal muscle were selected. A total of 9 groups of high-frequency liver-targeted novel AAV mutants were obtained (Fig. 2), and the nucleotide and amino acid sequences of the above 9 novel AAV-Cap mutants were analyzed (Fig. 3). Due to the low titers of the other four packages, they are not suitable for industrialization. Therefore, five new AAVs, namely L37, L57, L58, L107, and L10, were selected for subsequent infection activity testing experiments.

实施例4新型AAV对体外人肝细胞系感染活性Example 4 Infectious activity of novel AAV on human liver cell lines in vitro

4.1采用绿色荧光蛋白检测系统测试新型AAV对体外人肝细胞系感染活性4.1 Using the green fluorescent protein detection system to test the infection activity of the new AAV on human liver cell lines in vitro

为了初步测试新型AAV对肝脏靶向性感染活性，首先对6种人正常肝脏或肝癌细胞系进行体外感染活性测试。将获得的新型AAV Cap基因L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-EGFP外源基因，包装出的病毒命名为AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。In order to preliminarily test the liver-targeted infection activity of the novel AAV, the in vitro infection activity was first tested on 6 human normal liver or liver cancer cell lines. The obtained novel AAV Cap genes L57, L58, L107, and L10 were respectively constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-EGFP foreign gene, and the packaged viruses were named AAV2/57, AAV2/ 58, AAV2/107, AAV2/10, and virus titers were detected (results not shown).

将上述4种携带绿色荧光蛋白的重组新型AAV病毒和携带相同绿色荧光蛋白的AAV2/8重组病毒分别同时感染人多种肝细胞系，感染48小时后，流式检测感染活性。为了避免病毒载体在某一种细胞系上感染效率过低或过饱和，每种细胞系至少使用两种感染指数(MOI)进行感染，从图(4A-4F)可看出，当使用2-3种感染指数感染人肝脏细胞系7721、HepG2、Huh7、L02时，新型AAV2/10、AAV2/57、AAV2/58、AAV2/107对人肝脏细胞系的感染均表现出明显的剂量依赖关系。接下来以其中一种MOI针对每种细胞的感染结果进行描述，其余MOI结果详见图4。The above four recombinant novel AAV viruses carrying green fluorescent protein and AAV2/8 recombinant viruses carrying the same green fluorescent protein were simultaneously infected with various human hepatocyte cell lines. After 48 hours of infection, the infective activity was detected by flow cytometry. In order to avoid the infection efficiency of viral vector is too low or supersaturated on a certain cell line, each cell line was infected with at least two indices of infection (MOI). When the three infection indexes infect human liver cell lines 7721, HepG2, Huh7, and L02, the new AAV2/10, AAV2/57, AAV2/58, and AAV2/107 all showed a significant dose-dependent relationship to the infection of human liver cell lines. Next, the infection results of each cell at one of the MOIs are described, and the results of the remaining MOIs are shown in Figure 4.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系7402，感染48h后进行流式细胞术检测分析，结果(图4A)显示：在此MOI下，AAV2/10感染效率是AAV2/8感染效率的4.6倍，AAV2/57感染效率是AAV2/8感染效率的2.2倍，AAV2/58感染效率是AAV2/8感染效率的3.9倍，AAV2/107感染效率是AAV2/8感染效率的1.6倍。When the above-mentioned four recombinant viruses infected human liver cell line 7402 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection. The results (Fig. 4A) showed that at this MOI, the infection efficiency of AAV2/10 was The infection efficiency of AAV2/8 is 4.6 times, the infection efficiency of AAV2/57 is 2.2 times that of AAV2/8, the infection efficiency of AAV2/58 is 3.9 times that of AAV2/8, and the infection efficiency of AAV2/107 is AAV2/8. 1.6 times.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系7721，感染48h后进行流式细胞术检测分析，结果(图4B)显示，在此MOI下，AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的23.4倍，AAV2/107感染效率是AAV2/8感染效率的21.8倍，AAV2/57感染效率是AAV2/8感染效率的4.6倍，AAV2/58感染效率是AAV2/8感染效率的15.9倍。When the above four recombinant viruses infect human liver cell line 7721 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection, the results (Fig. 4B) showed that at this MOI, AAV2/10 had the highest infection efficiency , AAV2/107 infection efficiency is close to it, AAV2/10 infection efficiency is 23.4 times higher than AAV2/8 infection efficiency, AAV2/107 infection efficiency is 21.8 times higher than AAV2/8 infection efficiency, AAV2/57 infection efficiency is AAV2/8 infection efficiency The infection efficiency of AAV2/58 was 15.9 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系HepG2，感染48h后进行流式细胞术检测分析，结果(图4C)显示，在此MOI下，AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的14.8倍，AAV2/107感染效率是AAV2/8感染效率的13.5倍。AAV2/57感染效率是AAV2/8感染效率的5.5倍，AAV2/58感染效率是AAV2/8感染效率的10.9倍。When the above four recombinant viruses infected the human liver cell line HepG2 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection. The results (Fig. 4C) showed that at this MOI, AAV2/10 had the highest infection efficiency , AAV2/107 infection efficiency is close to it, AAV2/10 infection efficiency is 14.8 times higher than AAV2/8 infection efficiency, AAV2/107 infection efficiency is 13.5 times higher than AAV2/8 infection efficiency. The infection efficiency of AAV2/57 was 5.5 times that of AAV2/8, and the infection efficiency of AAV2/58 was 10.9 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为2500感染人肝脏细胞系Huh7，感染48h后进行流式细胞术检测分析，结果(图4D)显示，在此MOI下AAV2/10感染效率最高，是AAV2/8的31.8倍。AAV2/57感染效率是AAV2/8的1.4倍，AAV2/58感染效率是AAV2/8的6.6倍，AAV2/107感染效率是AAV2/8的19.5倍。When the above four recombinant viruses infected human liver cell line Huh7 with an infection index (MOI) of 2500, flow cytometry analysis was performed 48 hours after infection. The results (Fig. 4D) showed that AAV2/10 had the highest infection efficiency at this MOI. It is 31.8 times that of AAV2/8. The infection efficiency of AAV2/57 was 1.4 times that of AAV2/8, the infection efficiency of AAV2/58 was 6.6 times that of AAV2/8, and the infection efficiency of AAV2/107 was 19.5 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为10000感染人肝脏细胞系Huh6，感染48h后进行流式细胞术检测分析，结果(图4E)显示，在此MOI下AAV2/10感染效率最高，AAV2/107感染效率与其接近，AAV2/10感染效率是AAV2/8感染效率的2.9倍，AAV2/107感染效率是AAV2/8感染效率的2.7倍。AAV2/57感染效率是AAV2/8感染效率的1.8倍，AAV2/58感染效率是AAV2/8感染效率的2.6倍。When the above four recombinant viruses infected the human liver cell line Huh6 with an infection index (MOI) of 10,000, flow cytometry analysis was performed 48 hours after infection. The results (Fig. 4E) showed that AAV2/10 had the highest infection efficiency at this MOI. The infection efficiency of AAV2/107 was close to it, the infection efficiency of AAV2/10 was 2.9 times that of AAV2/8, and the infection efficiency of AAV2/107 was 2.7 times that of AAV2/8. The infection efficiency of AAV2/57 was 1.8 times that of AAV2/8, and the infection efficiency of AAV2/58 was 2.6 times that of AAV2/8.

当上述4种重组病毒以感染指数(MOI)为50000感染人肝脏细胞系L02，感染48h后进行流式细胞术检测分析，结果(图4F)显示，在此MOI下，AAV2/10感染效率最高，是AAV2/8感染效率的42倍，AAV2/107感染效率是AAV2/8感染效率的26.8倍，AAV2/57感染效率是AAV2/8感染效率的2.5倍，AAV2/58感染效率是AAV2/8感染效率的10.8倍。When the above four recombinant viruses infected human liver cell line L02 with an infection index (MOI) of 50,000, flow cytometry analysis was performed 48 hours after infection. The results (Fig. 4F) showed that at this MOI, AAV2/10 had the highest infection efficiency , the infection efficiency of AAV2/8 is 42 times, the infection efficiency of AAV2/107 is 26.8 times that of AAV2/8, the infection efficiency of AAV2/57 is 2.5 times that of AAV2/8, and the infection efficiency of AAV2/58 is AAV2/8. 10.8 times the infection efficiency.

由以上结果可知，这4种重组新型AAV对人肝细胞或肝癌细胞系感染活性均高于阳性对照AAV2/8。It can be seen from the above results that the infection activities of these four new recombinant AAVs on human hepatocytes or liver cancer cell lines are higher than that of the positive control AAV2/8.

4.2采用荧光素酶检测系统测试新型AAV对体外人肝细胞系感染活性4.2 The luciferase detection system was used to test the infection activity of the novel AAV on human liver cell lines in vitro

将获得的新型AAV Cap基因L37、L57、L58、L107、L10构建到含有2型AAV Rep的RC质粒载体上，该质粒上携带有CAG-Luciferase外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。上述重组新型AAV病毒以感染指数MOI为500对5种正常肝脏或肝癌细胞系进行感染，48小时后，采用荧光素酶检测系统检测感染活性。在每个检测值上扣除NC背景值之后进行感染活性的比较。The obtained new AAV Cap genes L37, L57, L58, L107, L10 were constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-Luciferase foreign gene, and the packaged virus was named AAV2/37 , AAV2/57, AAV2/58, AAV2/107, AAV2/10, and virus titers were detected (results not shown). The above recombinant novel AAV virus was used to infect 5 normal liver or liver cancer cell lines with an infection index MOI of 500. After 48 hours, the luciferase detection system was used to detect the infection activity. Comparisons of infectious activity were performed after subtracting NC background values from each assay value.

上述5种重组新型AAV病毒对人肝细胞系Huh7感染活性比较，结果(图5A)显示，感染活性由高到低依次为AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。The comparison of the above-mentioned five recombinant novel AAV viruses on the infection activity of human liver cell line Huh7, the results (Fig. 5A) show that the infection activity from high to low is AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/ 57.

上述5种重组新型AAV病毒对人肝细胞系7402感染活性比较，结果(图5B)显示，AAV2/10的活性最高，AAV2/107与其接近，AAV2/58与AAV2/37感染活性接近，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。Comparison of the above-mentioned five recombinant novel AAV viruses on the infection activity of human hepatocyte cell line 7402, the results (Figure 5B) show that AAV2/10 has the highest activity, AAV2/107 is close to it, AAV2/58 is close to AAV2/37, and the infection activity is close to that of AAV2/58. The order from high to low is AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/57.

上述5种重组新型AAV病毒对人肝细胞系7721感染活性比较，结果(图5C)显示，AAV2/10的活性最高，AAV2/58与AAV2/37感染活性接近，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/58＞AAV2/37＞AAV2/57。The comparison of the above-mentioned five recombinant novel AAV viruses on the infection activity of human hepatocyte cell line 7721, the results (Fig. 5C) show that AAV2/10 has the highest activity, AAV2/58 and AAV2/37 have close infection activities, and the infection activities are from high to low. AAV2/10>AAV2/107>AAV2/58>AAV2/37>AAV2/57.

上述5种重组新型AAV病毒对人肝细胞系HepG2感染活性比较，结果(图5D)显示，AAV2/10的活性最高，感染活性由高到低依次是AAV2/10＞AAV2/37＞AAV2/107＞AAV2/58＞AAV2/57。Comparison of the above-mentioned five recombinant novel AAV viruses on the infection activity of human liver cell line HepG2, the results (Figure 5D) show that AAV2/10 has the highest activity, and the infection activity is AAV2/10>AAV2/37>AAV2/107 >AAV2/58>AAV2/57.

上述5种重组新型AAV病毒在人肝细胞系L02感染活性比较，结果(图5E)显示，AAV2/10的活性最高，感染活性由高到低依次是AAV2/10＞AAV2/107＞AAV2/37＞AAV2/58＞AAV2/57。Comparison of the above-mentioned five recombinant novel AAV viruses in the human liver cell line L02 infection activity, the results (Figure 5E) show that AAV2/10 has the highest activity, and the infection activity from high to low is AAV2/10>AAV2/107>AAV2/37 >AAV2/58>AAV2/57.

由以上结果可知，在上述几种新型AAV中，AAV2/10活性相对最高，AAV2/57活性相对较低。It can be seen from the above results that AAV2/10 has the highest activity and AAV2/57 has relatively low activity among the above-mentioned novel AAVs.

实施例5新型AAV体内活性测试Example 5 In vivo activity test of novel AAV

5种新型AAV Cap外壳基因L37、L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-Luciferase外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，检测病毒滴度(结果未显示)，并将这些新型AAV病毒进行体内感染活性比较。选择6-8周C57BL/6J小鼠以1E+11vg/只的剂量进行尾部静脉注射，2周后处死小鼠，提取组织DNA，测试肝脏、心脏、骨骼肌中AAV载体基因组拷贝数，并分别测试这3种组织中荧光素酶表达情况。Five novel AAV Cap coat genes L37, L57, L58, L107, and L10 were respectively constructed on the RC plasmid vector containing type 2 AAV Rep, which carried the CAG-Luciferase exogenous gene, and the packaged viruses were named AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10, virus titers were detected (results not shown), and these novel AAV viruses were compared for in vivo infectious activity. C57BL/6J mice at 6-8 weeks were selected for tail vein injection at a dose of 1E+11vg/mice. The mice were sacrificed after 2 weeks, and the tissue DNA was extracted to test the number of copies of the AAV vector genome in the liver, heart, and skeletal muscle. The luciferase expression in these three tissues was tested.

载体基因组拷贝数检测结果(图6)显示，AAV2/58相较于其它新型AAV在肝脏中的载体基因组拷贝数最高，且与同组的心脏和骨骼肌相比，肝脏中的基因组拷贝数显著升高；虽然AAV2/37、AAV2/57、AAV2/107和AAV2/10载体基因组拷贝数比AAV2/58载体基因组拷贝数低，但相较于同组的心脏和骨骼肌中的基因拷贝数来比，肝脏中的基因组拷贝数较高。The results of vector genome copy number detection (Fig. 6) showed that AAV2/58 had the highest vector genome copy number in the liver compared with other novel AAVs, and compared with the heart and skeletal muscle of the same group, the genome copy number in the liver was significantly higher. Elevated; although AAV2/37, AAV2/57, AAV2/107, and AAV2/10 vector genome copy numbers were lower than AAV2/58 vector genome copy numbers, they were higher than gene copy numbers in the heart and skeletal muscle of the same group The genome copy number in the liver is higher than that in the liver.

荧光素酶表达情况检测结果(图7A-E)显示，AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10在肝脏中的荧光素酶表达情况比其在同组的心脏和骨骼肌中荧光素酶的表达水平高。The detection results of luciferase expression (Figure 7A-E) showed that the luciferase expression of AAV2/37, AAV2/57, AAV2/58, AAV2/107, and AAV2/10 in the liver was higher than that in the heart of the same group. and high levels of luciferase expression in skeletal muscle.

这些结果表明这5种新型AAV载体都具有优良的肝脏靶向性，且AAV2/10相较于其它新型AAV载体在肝脏中的荧光素酶表达水平最高，其次是AAV2/58，之后是AAV2/37和AAV2/107在肝脏中荧光素酶表达水平接近，AAV2/57相对最低。These results show that these five novel AAV vectors all have excellent liver targeting, and AAV2/10 has the highest expression level of luciferase in liver compared with other novel AAV vectors, followed by AAV2/58, followed by AAV2/ 37 and AAV2/107 had similar luciferase expression levels in liver, and AAV2/57 was relatively the lowest.

实施例6猴血清或人血清中新型AAV中和抗体检测Example 6 Detection of novel AAV neutralizing antibodies in monkey serum or human serum

因为AAV自然被人和其他灵长类感染，产生的针对天然型AAV的中和抗体，会大大降低AAV的半衰期。这里检测新型AAV的中和抗体水平。Because AAV is naturally infected by humans and other primates, the production of neutralizing antibodies against native AAV will greatly reduce the half-life of AAV. Neutralizing antibody levels of novel AAVs were tested here.

6.1猴血清中新型AAV和AAV2/8中和抗体的检测与比较6.1 Detection and comparison of novel AAV and AAV2/8 neutralizing antibodies in monkey serum

本实验采用病毒感染，固定其MOI值，将血清进行稀释，检测食蟹猴血清中新型AAV载体与AAV2/8的中和抗体水平。在一系列血清稀释度中找到血清病毒混合物感染细胞效率为病毒无血清感染细胞效率的50％，以此稀释度的倒数作为中和抗体量，针对每种病毒载体评价其中和抗体(参见Lochrie MA等，2006，Virology 353：68-82；Mori S等，2006，Jpn JInfect Dis 59：285-293等)。本实验共对10只食蟹猴血清进行检测，血清系列稀释后，分别判定并比较每个样品中新型AAV的中和抗体与AAV2/8的中和抗体。In this experiment, virus infection was used, the MOI value was fixed, and the serum was diluted to detect the neutralizing antibody levels of the new AAV vector and AAV2/8 in the serum of cynomolgus monkeys. The serum virus mixture was found in a series of serum dilutions to infect cells with an efficiency of 50% of that of the virus without serum, and the inverse of this dilution was used as the amount of neutralizing antibody, which was evaluated against each viral vector (see Lochrie MA et al, 2006, Virology 353:68-82; Mori S et al, 2006, Jpn JInfect Dis 59:285-293 et al). A total of 10 cynomolgus monkey sera were tested in this experiment. After serial dilution of sera, the neutralizing antibodies of the novel AAV and the neutralizing antibodies of AAV2/8 in each sample were determined and compared.

具体步骤如下：将获得的新型AAV Cap基因L37、L57、L58、L107、L10分别构建到含有2型AAV Rep的RC质粒载体上，该质粒携带CAG-EGFP外源基因，包装出的病毒命名为AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10，并检测病毒滴度(结果未显示)。7402细胞接种24孔板，食蟹猴血清样品进行系列稀释，新型AAV病毒AAV2/37、AAV2/57、AAV2/58、AAV2/107、AAV2/10和携带相同绿色荧光蛋白的AAV2/8重组病毒感染指数MOI为2000，稀释的血清样品与病毒液1:1进行混合，混合后37℃共孵育1h，然后细胞中加入血清样品与病毒液的混合液。48h后收获细胞，流式细胞术检测感染效率。The specific steps are as follows: the obtained new AAV Cap genes L37, L57, L58, L107 and L10 are respectively constructed on the RC plasmid vector containing type 2 AAV Rep, the plasmid carries the CAG-EGFP exogenous gene, and the packaged virus is named as AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10, and virus titers were tested (results not shown). 7402 cells were seeded in 24-well plates, cynomolgus monkey serum samples were serially diluted, novel AAV viruses AAV2/37, AAV2/57, AAV2/58, AAV2/107, AAV2/10 and AAV2/8 recombinant viruses carrying the same green fluorescent protein The infection index MOI was 2000. The diluted serum sample and virus solution were mixed 1:1, and incubated at 37°C for 1 h, and then the mixture of serum sample and virus solution was added to the cells. Cells were harvested after 48 h, and the infection efficiency was detected by flow cytometry.

中和抗体检测结果如表5所示，中和抗体量低于5时，视为针对该种病毒中和抗体为阴性。10份血清样本分别编号为1#、2#、3#、4#、5#、6#、7#、8#、9#、10#，其中7#针对所有病毒载体中和抗体均为阴性；1#、9#针对新型AAV载体中和抗体均为阴性，而针对AAV2/8的中和抗体均为阳性；2#、3#、4#、5#、6#、8#、10#样品的新型AAV中和抗体均明显低于AAV2/8。The test results of neutralizing antibodies are shown in Table 5. When the amount of neutralizing antibodies is less than 5, it is regarded as negative for neutralizing antibodies against this virus. The 10 serum samples were numbered 1#, 2#, 3#, 4#, 5#, 6#, 7#, 8#, 9#, 10#, of which 7# was negative for neutralizing antibodies against all viral vectors ; 1#, 9# were negative for the neutralizing antibodies against the new AAV vector, while the neutralizing antibodies against AAV2/8 were positive; 2#, 3#, 4#, 5#, 6#, 8#, 10# The novel AAV neutralizing antibodies of the samples were all significantly lower than those of AAV2/8.

因此通过对食蟹猴血清中新型AAV和AAV2/8中和抗体的综合比较，在猴群体中新型AAV的中和抗体明显低于AAV2/8的中和抗体，新型AAV作为药物递送载体，具有更低的免疫原性。Therefore, through a comprehensive comparison of the new AAV and AAV2/8 neutralizing antibodies in cynomolgus monkey serum, the neutralizing antibody of the new AAV in the monkey population is significantly lower than that of AAV2/8. As a drug delivery carrier, the new AAV has lower immunogenicity.

表5食蟹猴10份血清中和抗体检测结果Table 5 Detecting results of neutralizing antibodies in 10 serums of cynomolgus monkeys

6.2人血清中新型AAV和AAV2/8中和抗体的检测与比较6.2 Detection and comparison of novel AAV and AAV2/8 neutralizing antibodies in human serum

本实验采用固定病毒感染指数MOI，将血清进行稀释，检测个体人血清中新型AAV的中和抗体的水平与AAV2/8中和抗体水平。在一系列血清稀释度中找到血清病毒混合物感染细胞效率为病毒无血清感染细胞效率的50％，以此稀释度的倒数作为中和抗体量，针对每种病毒载体评价其中和抗体。本实验共进行10人份血清检测，对血清进行了一系列稀释，分别比较每个样品中新型AAV的中和抗体与AAV2/8的中和抗体。In this experiment, a fixed MOI of viral infection index was used, and the serum was diluted to detect the level of neutralizing antibody of the new AAV and the level of AAV2/8 neutralizing antibody in individual human serum. The serum virus mixture was found in a series of serum dilutions to infect cells with an efficiency of 50% of the virus-free serum-infected cells, and the inverse of this dilution was used as the amount of neutralizing antibody, which was evaluated against each viral vector. A total of 10 serum samples were tested in this experiment, and the serum was serially diluted to compare the neutralizing antibodies of the novel AAV and the neutralizing antibodies of AAV2/8 in each sample.

具体步骤如同6.1中具体步骤所示，不同之处在于，所选用的血清为正常人血清。本实验随机挑选10份正常人血清进行中和抗体实验，分别比较每个样品中新型AAV中和抗体和AAV2/8中和抗体。中和抗体检测结果如表6所示，其中3#样品针对所有病毒中和抗体均表现为阴性；2#样品针对新型AAV中和抗体均为阴性，而针对AAV2/8中和抗体为阳性；1#、4#、5#、6#、7#、8#、9#、10#样品针对新型AAV的中和抗体均明显低于AAV2/8的中和抗体。实验结果证明在人类群体中，针对新型AAV的中和抗体与AAV2/8的中和抗体相比较，新型AAV更适合作为药物递送载体，能够降低反应原性。The specific steps are as shown in the specific steps in 6.1, the difference is that the selected serum is normal human serum. In this experiment, 10 normal human sera were randomly selected for neutralizing antibody experiments, and the new AAV neutralizing antibodies and AAV2/8 neutralizing antibodies in each sample were compared respectively. The neutralizing antibody detection results are shown in Table 6, in which the 3# sample was negative for all virus neutralizing antibodies; the 2# sample was negative for the new AAV neutralizing antibodies, while the AAV2/8 neutralizing antibodies were positive; The neutralizing antibodies of samples 1#, 4#, 5#, 6#, 7#, 8#, 9#, and 10# against the new AAV were significantly lower than those of AAV2/8. The experimental results demonstrate that in the human population, the neutralizing antibody against the novel AAV is more suitable as a drug delivery vehicle than the neutralizing antibody against AAV2/8, which can reduce the reactogenicity.

表6 10份人血清中和抗体检测结果Table 6 Test results of neutralizing antibodies in 10 human serums

综合以上结果，通过新型AAV与AAV2/8的血清中中和抗体的比较，新型AAV相比AAV2/8载体具有更低的免疫原性，更适合作为基因治疗载体。Based on the above results, by comparing the serum neutralizing antibodies of the new AAV and AAV2/8, the new AAV has lower immunogenicity than the AAV2/8 vector, and is more suitable as a gene therapy vector.

序列表sequence listing

<110> 舒泰神（北京) 生物制药股份有限公司；北京三诺佳邑生物技术有限责任公司<110> Shutaishen (Beijing) Biopharmaceutical Co., Ltd.; Beijing Sannuo Jiayi Biotechnology Co., Ltd.

<120> 一组肝靶向新型腺相关病毒的获得及其应用<120> Acquisition and application of a group of novel liver-targeted adeno-associated viruses

<130> 2020070729<130> 2020070729

<160> 18<160> 18

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 1<400> 1

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60

gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120

aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaaaaaga ggatccttga gcctcttggt ctggttgagg aagcggctaa gacggctcct 420gcgaaaaaga ggatccttga gcctcttggt ctggttgagg aagcggctaa gacggctcct 420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480

aaatcgggtg cacagcccgc taaaaagaga ctcaactttg gtcagactgg cgactcagag 540aaatcgggtg cacagcccgc taaaaagaga ctcaactttg gtcagactgg cgactcagag 540

tcagttccag accctcaacc tctcggagaa ccaccagcag cgccctctgg tgtgggacct 600tcagttccag accctcaacc tctcggagaa ccaccagcag cgccctctgg tgtgggacct 600

aatacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660aatacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720

accaccagca ccagaacctg ggccctgccc acttacaaca accatctcta caagcaaatc 780accaccagca ccagaacctg ggccctgccc acttacaaca accatctcta caagcaaatc 780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840

gggtattttg actttaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg actttaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080

ggttgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140ggttgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt accgcctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt accgcctgga atatttccct 1200

tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggaagtgcct 1260tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggaagtgcct 1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac 1380cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac 1380

ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500

tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct 1560tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct 1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat aagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat aagcggtgtc 1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca cgttatggga 1800gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca cgttatggga 1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980

gagttttcag ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040gagttttcag ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040

gtggagattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag 2100gtggagattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag 2100

tacacatcca attatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160tacacatcca attatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160

tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a 2211tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a 2211

<210> 2<210> 2

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 2<400> 2

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu SerMet Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys ProGlu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro

50 55 60 50 55 60

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr AspVal Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp

65 70 75 8065 70 75 80

Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His AlaGln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu Pro

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile GlyPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly

145 150 155 160145 150 155 160

Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro ProGly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly GlyAla Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn AlaAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His LeuThr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu

245 250 255 245 250 255

Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn HisTyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His

260 265 270 260 265 270

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg PheTyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe

275 280 285 275 280 285

His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn AsnHis Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn

290 295 300 290 295 300

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile GlnTrp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln

305 310 315 320305 310 315 320

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn AsnVal Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn

325 330 335 325 330 335

Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu ProLeu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro

340 345 350 340 345 350

Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro AlaTyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala

355 360 365 355 360 365

Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn GlyAsp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly

370 375 380 370 375 380

Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Arg Leu Glu Tyr Phe ProSer Gln Ala Val Gly Arg Ser Ser Phe Tyr Arg Leu Glu Tyr Phe Pro

385 390 395 400385 390 395 400

Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr PheSer Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe

405 410 415 405 410 415

Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu AspGlu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp

420 425 430 420 425 430

Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn ArgArg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg

435 440 445 435 440 445

Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe SerThr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser

450 455 460 450 455 460

Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu ProArg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro

465 470 475 480465 470 475 480

Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp AsnGly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn

485 490 495 485 490 495

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu AsnAsn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn

500 505 510 500 505 510

Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His LysGly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys

515 520 525 515 520 525

Asp Asp Lys Asp Lys Phe Phe Pro Ile Ser Gly Val Met Ile Phe GlyAsp Asp Lys Asp Lys Phe Phe Pro Ile Ser Gly Val Met Ile Phe Gly

530 535 540 530 535 540

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met IleLys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile

545 550 555 560545 550 555 560

Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu ArgThr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg

565 570 575 565 570 575

Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro AlaPhe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala

580 585 590 580 585 590

Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp GlnThr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln

595 600 605 595 600 605

Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro HisAsp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His

610 615 620 610 615 620

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly LeuThr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu

625 630 635 640625 630 635 640

Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro AlaLys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala

645 650 655 645 650 655

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile ThrAsn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr

660 665 670 660 665 670

Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu GlnGln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln

675 680 685 675 680 685

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser AsnLys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn

690 695 700 690 695 700

Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly LeuTyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu

705 710 715 720705 710 715 720

Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro LeuTyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu

725 730 735 725 730 735

<210> 3<210> 3

<211> 2214<211> 2214

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 3<400> 3

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300

caggggcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggggcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg 420gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg 420

ggaaaaaaga ggccggtaga gccgccacct cagcgttccc ccgactcctc cacgggcatc 480ggaaaaaaga ggccggtaga gccgccacct cagcgttccc ccgactcctc cacgggcatc 480

ggcaagaaag gccagcagcc cgccaaaaag agactcaatt ttggtcagac tggcgactca 540ggcaagaaag gccagcagcc cgccaaaaag agactcaatt ttggtcagac tggcgactca 540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga 600gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga 600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac 660cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac 660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc 720ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc 720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa 780atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa 780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc 840atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc 840

tgggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga 900tgggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga 900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaaact cttcaacatc 960ctcatcaaca acaactgggg attccgaccc aagagactca acttcaaact cttcaacatc 960

caagtcaagg aggtcacgac gaatgacggc gttacgacca tcgctaataa ccttaccagc 1020caagtcaagg aggtcacgac gaatgacggc gttacgacca tcgctaataa ccttaccagc 1020

acgattcagg tattctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080acgattcagg tattctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcaata cggctacctg 1140cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcaata cggctacctg 1140

acgctcaaca atggcagcca agccgtggga cgttcatcct tttactgcct ggaatatttc 1200acgctcaaca atggcagcca agccgtggga cgttcatcct tttactgcct ggaatatttc 1200

ccttctcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggaagtg 1260ccttctcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggaagtg 1260

cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc 1320cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc 1320

gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag 1380gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag 1380

gacttgctgt ttagccgtgg gtcaccagct ggcatgtctg ttcagcccaa aaactggcta 1440gacttgctgt ttagccgtgg gtcaccagct ggcatgtctg ttcagcccaa aaactggcta 1440

cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500

aactttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560aactttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560

cctggcactg ctatggcctc acacaaagac gacgaagaca agttctttcc catgagcggt 1620cctggcactg ctatggcctc acacaaagac gacgaagaca agttctttcc catgagcggt 1620

gtcatgattt ttggaaaaga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg 1680gtcatgattt ttggaaaaga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg 1680

attacagacg aagaggaaat taaagccact aaccccgtgg ccaccgaaag atttgggacc 1740attacagacg aagaggaaat taaagccact aaccccgtgg ccaccgaaag atttgggacc 1740

gtggcagtca atttccagag cagcagcaca gaccctgcga ccggagatgt gcatgctatg 1800gtggcagtca atttccagag cagcagcaca gaccctgcga ccggagatgt gcatgctatg 1800

ggagcattac ctggcatggt gtggcaagat agagacgtgt acctgcaggg tcccatttgg 1860ggagcattac ctggcatggt gtggcaagat agagacgtgt acctgcaggg tcccatttgg 1860

gccaaaattc ctcacacaga tggacacttt cacccgtctc ctcttatggg cggctttgga 1920gccaaaattc ctcacacaga tggacacttt cacccgtctc ctcttatggg cggctttgga 1920

ctcaagaacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg 1980ctcaagaacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg 1980

gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg 2040gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg 2040

agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg 2100agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg 2100

cagtatacat ctaactatgc aaaatctgcc aacgttgatt ttactgagga caacaatgga 2160cagtatacat ctaactatgc aaaatctgcc aacgttgatt ttactgagga caacaatgga 2160

ctttatactg agcctcgccc cattggcacc cgttacctta cccgtcccct gtaa 2214ctttatactg agcctcgccc cattggcacc cgttacctta cccgtcccct gtaa 2214

<210> 4<210> 4

<211> 737<211> 737

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 4<400> 4

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His AlaGln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Gly Arg Leu Gln Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Gly Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Pro Pro Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly IlePro Val Glu Pro Pro Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile

145 150 155 160145 150 155 160

Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly GlnGly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln

165 170 175 165 170 175

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu ProThr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro

180 185 190 180 185 190

Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly GlyPro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly

195 200 205 195 200 205

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly AsnGly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn

210 215 220 210 215 220

Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg ValAla Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val

225 230 235 240225 230 235 240

Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn HisIle Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His

245 250 255 245 250 255

Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp AsnLeu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn

260 265 270 260 265 270

His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn ArgHis Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg

275 280 285 275 280 285

Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn AsnPhe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn

290 295 300 290 295 300

Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn IleAsn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile

305 310 315 320305 310 315 320

Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala AsnGln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn

325 330 335 325 330 335

Asn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln LeuAsn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu

340 345 350 340 345 350

Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe ProPro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro

355 360 365 355 360 365

Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn AsnAla Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn

370 375 380 370 375 380

Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr PheGly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe

385 390 395 400385 390 395 400

Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr ThrPro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr

405 410 415 405 410 415

Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser LeuPhe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu

420 425 430 420 425 430

Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu AsnAsp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn

435 440 445 435 440 445

Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu PheArg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe

450 455 460 450 455 460

Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp LeuSer Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu

465 470 475 480465 470 475 480

Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr AspPro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp

485 490 495 485 490 495

Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn LeuAsn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu

500 505 510 500 505 510

Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser HisAsn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His

515 520 525 515 520 525

Lys Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile PheLys Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe

530 535 540 530 535 540

Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val MetGly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met

545 550 555 560545 550 555 560

Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr GluIle Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu

565 570 575 565 570 575

Arg Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp ProArg Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp Pro

580 585 590 580 585 590

Ala Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val TrpAla Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp

595 600 605 595 600 605

Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile ProGln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro

610 615 620 610 615 620

His Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe GlyHis Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly

625 630 635 640625 630 635 640

Leu Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val ProLeu Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro

645 650 655 645 650 655

Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe IleAla Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile

660 665 670 660 665 670

Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu LeuThr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu

675 680 685 675 680 685

Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr SerGln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser

690 695 700 690 695 700

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Glu Asp Asn Asn GlyAsn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Glu Asp Asn Asn Gly

705 710 715 720705 710 715 720

Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg ProLeu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro

725 730 735 725 730 735

LeuLeu

<210> 5<210> 5

<211> 2208<211> 2208

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 5<400> 5

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180

cagcagctca aagccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300cagcagctca aagccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360

gcgaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420gcgaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480

aagacaggcc agcagcccgc taaaaaaaga ctcaattttg gtcagactgg cgactcagag 540aagacaggcc agcagcccgc taaaaaaaga ctcaattttg gtcagactgg cgactcagag 540

tcagttccag accctcaacc tctcggagaa cctccagcag cgccctctgg tgtgggacct 600tcagttccag accctcaacc tctcggagaa cctccagcag cgccctctgg tgtgggacct 600

aatacaatgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660aatacaatgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtagtt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720gtgggtagtt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt 780accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt 780

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg 840tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg 840

tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatt 900tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatt 900

aacaacaatt ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc 960aacaacaatt ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc 960

aaggaggtca cgacgaatga tggcgtcaca accatcgcta ataaccttac cagcacggtt 1020aaggaggtca cgacgaatga tggcgtcaca accatcgcta ataaccttac cagcacggtt 1020

caagtcttct cggactcgga gtaccagctt ccgtacgtcc tcggctctgc gcaccagggc 1080caagtcttct cggactcgga gtaccagctt ccgtacgtcc tcggctctgc gcaccagggc 1080

tgcctgcctc cgttcccggc ggacgtcttc atgattcctc agtacggcta cctgactccc 1140tgcctgcctc cgttcccggc ggacgtcttc atgattcctc agtacggcta cctgactccc 1140

aacaatggca gtcagtctgt gggacgttcc tccttctact gcctggagta cttcccttct 1200aacaatggca gtcagtctgt gggacgttcc tccttctact gcctggagta cttcccttct 1200

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga agtgcctttc 1260cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga agtgcctttc 1260

cacagcagct acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag 1320cacagcagct acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag 1320

tacctgtatt acctgaacag aactcaaaat cagtccggaa gtgcccaaaa caaggacttg 1380tacctgtatt acctgaacag aactcaaaat cagtccggaa gtgcccaaaa caaggacttg 1380

ctgtttagcc gtgggtctcc agctggcatg tctgttcagc ccaaaaactg gctacctgga 1440ctgtttagcc gtgggtctcc agctggcatg tctgttcagc ccaaaaactg gctacctgga 1440

ccctgttatc agcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt 1500ccctgttatc agcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt 1500

acctggactg gtgcttcaaa atataacctc aatgggcgtg aatccatcat caaccctggc 1560acctggactg gtgcttcaaa atataacctc aatgggcgtg aatccatcat caaccctggc 1560

actgctatgg cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg 1620actgctatgg cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg 1620

atttttggaa aggagagcgc cggagcttca aacactgcat tggacaatgt catgatcaca 1680atttttggaa aggagagcgc cggagcttca aacactgcat tggacaatgt catgatcaca 1680

gacgaagagg aaatcaaagc cactaacccc gtggccaccg aaagatttgg gaccgtggca 1740gacgaagagg aaatcaaagc cactaacccc gtggccaccg aaagatttgg gaccgtggca 1740

gtcaatctcc agagcagcag cacagaccct gcgaccggag atgtgcatgt tatgggagcc 1800gtcaatctcc agagcagcag cacagaccct gcgaccggag atgtgcatgt tatgggagcc 1800

ttacctggaa tggtgtggca agacagagac gtatacctgc agggtcctat ttgggccaaa 1860ttacctggaa tggtgtggca agacagagac gtatacctgc agggtcctat ttgggccaaa 1860

attcctcaca cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag 1920attcctcaca cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag 1920

cacccgcctc ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag 1980cacccgcctc ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag 1980

ttttcggcta caaagtttgc ttcattcatc acccagtatt ccacaggacg agtgagcgtg 2040ttttcggcta caaagtttgc ttcattcatc acccagtatt ccacaggacg agtgagcgtg 2040

gagattgaat gggagccgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat 2100gagattgaat gggagccgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat 2100

acatctaact atgcaaaatc tgccaacgtt gattttactg tggacaacaa tggactttat 2160acatctaact atgcaaaatc tgccaacgtt gattttactg tggacaacaa tggactttat 2160

actgagcctc gccccattgg cacccgttac cttacccgtc ccctgtaa 2208actgagcctc gccccattgg cacccgttac cttacccgtc ccctgtaa 2208

<210> 6<210> 6

<211> 735<211> 735

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 6<400> 6

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys ProGlu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His AlaGln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala

85 90 95 85 90 95

Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly GlyAsp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly

100 105 110 100 105 110

115 120 125 115 120 125

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys ArgLeu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile GlyPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly

145 150 155 160145 150 155 160

Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly GlyAla Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly Gly

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser SerAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

245 250 255 245 250 255

Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His TyrTyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr

260 265 270 260 265 270

Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe HisPhe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His

275 280 285 275 280 285

Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn TrpCys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp

290 295 300 290 295 300

Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln ValGly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val

305 310 315 320305 310 315 320

Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn LeuLys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn Leu

325 330 335 325 330 335

Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro TyrThr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro Tyr

340 345 350 340 345 350

Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala AspVal Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp

355 360 365 355 360 365

Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Pro Asn Asn Gly SerVal Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Pro Asn Asn Gly Ser

370 375 380 370 375 380

Gln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro SerGln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser

385 390 395 400385 390 395 400

Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe GluGln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu

405 410 415 405 410 415

Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp ArgGlu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg

420 425 430 420 425 430

Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg ThrLeu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg Thr

435 440 445 435 440 445

Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser ArgGln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser Arg

450 455 460 450 455 460

Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro GlyGly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro Gly

465 470 475 480465 470 475 480

Pro Cys Tyr Gln Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn AsnPro Cys Tyr Gln Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn Asn

485 490 495 485 490 495

Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn GlyAsn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn Gly

500 505 510 500 505 510

Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys AspArg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys Asp

515 520 525 515 520 525

Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly LysAsp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly Lys

530 535 540 530 535 540

Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile ThrGlu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile Thr

545 550 555 560545 550 555 560

Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg PheAsp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg Phe

565 570 575 565 570 575

Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala ThrGly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala Thr

580 585 590 580 585 590

Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln AspGly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln Asp

595 600 605 595 600 605

Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His ThrArg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr

610 615 620 610 615 620

Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu LysAsp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys

625 630 635 640625 630 635 640

His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala AsnHis Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn

645 650 655 645 650 655

Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr GlnPro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr Gln

660 665 670 660 665 670

Tyr Ser Thr Gly Arg Val Ser Val Glu Ile Glu Trp Glu Pro Gln LysTyr Ser Thr Gly Arg Val Ser Val Glu Ile Glu Trp Glu Pro Gln Lys

675 680 685 675 680 685

Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn TyrGlu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn Tyr

690 695 700 690 695 700

Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu TyrAla Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu Tyr

705 710 715 720705 710 715 720

Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro LeuThr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu

725 730 735 725 730 735

<210> 7<210> 7

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 7<400> 7

atggctgccg atggttatct tccagattgg ctcgaggaca acctttctga aggcattcgt 60atggctgccg atggttatct tccagattgg ctcgaggaca acctttctga aggcattcgt 60

gagtggtggg ctctgaaacc tggagtccct caacccaaag cgaaccaaca acaccaggac 120gagtggtggg ctctgaaacc tggagtccct caacccaaag cgaaccaaca acaccaggac 120

aaccgtcggg gtcttgtgct tccgggttac aaatacctcg gacccggtaa cggactcgac 180aaccgtcggg gtcttgtgct tccgggttac aaatacctcg gacccggtaa cggactcgac 180

aaaggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcatacgac 240aaaggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcatacgac 240

cagcagctca aggccggtga caacccgtac ctcaagtaca accacgccga cgccgagttt 300cagcagctca aggccggtga caacccgtac ctcaagtaca accacgccga cgccgagttt 300

gccaaaaaga ggatccttga gcctcttggt ctggttgagg aagcagctaa gacggctcct 420gccaaaaaga ggatccttga gcctcttggt ctggttgagg aagcagctaa gacggctcct 420

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag 540aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag 540

tcagtcccag accctcaacc aatcggagaa cctccagcag cgccctctgg tgtgggacct 600tcagtcccag accctcaacc aatcggagaa cctccagcag cgccctctgg tgtgggacct 600

aatacaatgg ctgcgggcgg tggcgcacca atggcagaca ataacgaggg tgccgatgga 660aatacaatgg ctgcgggcgg tggcgcacca atggcagaca ataacgaggg tgccgatgga 660

gtgggtaatt cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720gtgggtaatt cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc 720

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct 1200

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcacgatc 1680atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcacgatc 1680

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800

aagattcctc acacggatgg caactttcac ccgtctcctc tcatgggcgg ctttggactt 1920aagattcctc acacggatgg caactttcac ccgtctcctc tcatgggcgg ctttggactt 1920

gagttttcgg ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt 2040gagttttcgg ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt 2040

gtggaaattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100gtggaaattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100

<210> 8<210> 8

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 8<400> 8

1 5 10 151 5 10 15

Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln ProGlu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln Pro

20 25 30 20 25 30

Lys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu ProLys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro

50 55 60 50 55 60

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr AspVal Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro ProGly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro

180 185 190 180 185 190

195 200 205 195 200 205

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn SerAla Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe ProSer Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Thr IleLys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Thr Ile

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly LeuThr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 9<210> 9

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 9<400> 9

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgtctgga atatttccct 1200ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgtctgga atatttccct 1200

ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc 1620

ctgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa tgtcatgatt 1680ctgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa tgtcatgatt 1680

acagacgagg aggaaattaa agccactaac cccgtggcca ccgaaaaatt tgggactgtg 1740acagacgagg aggaaattaa agccactaac cccgtggcca ccgaaaaatt tgggactgtg 1740

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100

tatacatcta actatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160tatacatcta actatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt 2160

<210> 10<210> 10

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 10<400> 10

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

195 200 205 195 200 205

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Leu Ile Phe GlyAsp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Leu Ile Phe Gly

530 535 540 530 535 540

545 550 555 560545 550 555 560

Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu LysThr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Lys

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 11<210> 11

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 11<400> 11

atggctggcg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60atggctggcg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60

gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120

aagggggagc cggtcaacgc agcagacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc cggtcaacgc agcagacgca gcggccctcg agcacgacaa ggcctacgac 240

gcgaaaaagc ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420gcgaaaaagc ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattagc 480ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattagc 480

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg cgccgacgga 660cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc 720

accaccagca cccgaacttg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgaacttg ggccttgccc acctataaca accacctcta caagcaaatc 780

tccagtgctt caacgggggc cagcaacgat aaccactact tcggctacag caccccctgg 840tccagtgctt caacgggggc cagcaacgat aaccactact tcggctacag caccccctgg 840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

atcaacaaca actggggatt ccggcccaag agactcagct tcaagctctt caacatccag 960atcaacaaca actggggatt ccggcccaag agactcagct tcaagctctt caacatccag 960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataatct taccagcacg 1020gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataatct taccagcacg 1020

gtccaggtct tcacggactc agactatcag ctcccgtacg ttctcggctc tgcccaccag 1080gtccaggtct tcacggactc agactatcag ctcccgtacg ttctcggctc tgcccaccag 1080

ggctgcctgc ctccgttccc ggcggacgtg ttcatgattc cccagtacgg ctacctgaca 1140ggctgcctgc ctccgttccc ggcggacgtg ttcatgattc cccagtacgg ctacctgaca 1140

ctcaacaacg gtagtcaggc cgtgggacgc tcctccttct actgcctgga atactttcct 1200ctcaacaacg gtagtcaggc cgtgggacgc tcctccttct actgcctgga atactttcct 1200

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactc 1920aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactc 1920

aagcacccgc ctcctcagat cctcatcaaa aacacacccg ttcctgcgaa tcctccggca 1980aagcacccgc ctcctcagat cctcatcaaa aacacacccg ttcctgcgaa tcctccggca 1980

tacacatcca attatgcaaa atctgccgac gttgatttta ctgtggacaa caatggactt 2160tacacatcca attatgcaaa atctgccgac gttgatttta ctgtggacaa caatggactt 2160

<210> 12<210> 12

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 12<400> 12

Met Ala Gly Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu SerMet Ala Gly Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu ProAsn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile SerPro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Ser

145 150 155 160145 150 155 160

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly GlyAla Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn AlaAla Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala

210 215 220 210 215 220

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn Ile GlnTrp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn Ile Gln

305 310 315 320305 310 315 320

Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn AsnVal Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn

325 330 335 325 330 335

Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu ProLeu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe GlyAsp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

Tyr Ala Lys Ser Ala Asp Val Asp Phe Thr Val Asp Asn Asn Gly LeuTyr Ala Lys Ser Ala Asp Val Asp Phe Thr Val Asp Asn Asn Gly Leu

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 13<210> 13

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 13<400> 13

atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga gggcattcgc 60atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga gggcattcgc 60

aggggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aggggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccgg 360caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccgg 360

gccaagaagc gggttctcga acctctcggt ctggttgagg aagcagctaa aacggctcct 420gccaagaagc gggttctcga acctctcggt ctggttgagg aagcagctaa aacggctcct 420

ggaaagaaga ggcctgtaga tcagtctcct caggaaccgg actcatcatc tggtgttggc 480ggaaagaaga ggcctgtaga tcagtctcct caggaaccgg actcatcatc tggtgttggc 480

aaatcgggca aacagcctgc cagaaaaaga ctcaatttcg gtcagactgg cgactcagag 540aaatcgggca aacagcctgc cagaaaaaga ctcaatttcg gtcagactgg cgactcagag 540

tcagtccccg accctcaacc tctcggagaa cctccagcag cgccctctag tgtgggatct 600tcagtccccg accctcaacc tctcggagaa cctccagcag cgccctctag tgtgggatct 600

ggtacagtgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660ggtacagtgg ctgcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc catttgggcc 1860gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc catttgggcc 1860

<210> 14<210> 14

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 14<400> 14

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Arg Gly Glu ProGly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Arg Gly Glu Pro

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

Asn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu ProAsn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu Pro

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Val GlyPro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Val Gly

145 150 155 160145 150 155 160

Lys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly GlyAla Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly Gly

195 200 205 195 200 205

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val IleSer Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 15<210> 15

<211> 2211<211> 2211

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 15<400> 15

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggaataaga 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggaataaga 60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac 120cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac 120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac 180gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac 180

aagggggagc cggtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240aagggggagc cggtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240

caggagcgtc tgcaagaaga tacgtcattt gggggcaacc tcgggcgagc agtcttccag 360caggagcgtc tgcaagaaga tacgtcattt gggggcaacc tcgggcgagc agtcttccag 360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420

ggaaagaaac gtccggtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480ggaaagaaac gtccggtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480

aaatcgggtg cacggcccgc taaaaagaga ctcaattttg gtcagactgg cgacacagag 540aaatcgggtg cacggcccgc taaaaagaga ctcaattttg gtcagactgg cgacacagag 540

tcagtcccag accctcaacc aatcggagaa cctcccgcag cccccacaag tttgggatct 600tcagtcccag accctcaacc aatcggagaa cctcccgcag cccccacaag tttgggatct 600

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcgtc 720gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcgtc 720

accaccagca cccgcacctg ggccttgccc acctataaca accacctcta caagcaaatc 780accaccagca cccgcacctg ggccttgccc acctataaca accacctcta caagcaaatc 780

gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900

gtccaggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag 1080gtccaggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag 1080

ctcaacaatg gcagccaagc cgtgggtcgt tcgtcctttt actgcctgga atatttcccg 1200ctcaacaatg gcagccaagc cgtgggtcgt tcgtcctttt actgcctgga atatttcccg 1200

tcgcaaatgc taagaacagg caacaacttt accttcagct acacctttga ggaagtgcct 1260tcgcaaatgc taagaacagg caacaacttt accttcagct acacctttga ggaagtgcct 1260

gcagtcaatc tccagagcag cagcacggac cctgcgactg gagatgtgca tgttatggga 1800gcagtcaatc tccagagcag cagcacggac cctgcgactg gagatgtgca tgttatggga 1800

<210> 16<210> 16

<211> 736<211> 736

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 16<400> 16

1 5 10 151 5 10 15

Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro ProGlu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro

20 25 30 20 25 30

Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu ProLys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

145 150 155 160145 150 155 160

Lys Ser Gly Ala Arg Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln ThrLys Ser Gly Ala Arg Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr

165 170 175 165 170 175

180 185 190 180 185 190

Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly GlyAla Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly

195 200 205 195 200 205

210 215 220 210 215 220

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val ValSer Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Val

225 230 235 240225 230 235 240

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

340 345 350 340 345 350

Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro AlaTyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro Ala

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

705 710 715 720705 710 715 720

725 730 735 725 730 735

<210> 17<210> 17

<211> 2214<211> 2214

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 17<400> 17

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggcattcgt 60atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga aggcattcgt 60

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgcggagttt 300cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgcggagttt 300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag 360caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag 360

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc 480ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc 480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt tcggtcagac tggcgacaca 540ggcaagaaag gccaacagcc cgccagaaaa agactcaatt tcggtcagac tggcgacaca 540

gagtcagtcc cagaccctca accaatcgga gaacctcccg cagccccctc aggtgtggaa 600gagtcagtcc cagaccctca accaatcgga gaacctcccg cagccccctc aggtgtggaa 600

tctcttacaa tggcttcagg tggtggcgca ccagtggcag acagtaacga aggcgccgac 660tctcttacaa tggcttcagg tggtggcgca ccagtggcag acagtaacga aggcgccgac 660

atcaccaaca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa 780atcaccaaca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa 780

tgggggtatt ttgatttcaa cagattccac tgccactttt caccacgtga ctggcagcga 900tgggggtatt ttgatttcaa cagattccac tgccactttt caccacgtga ctggcagcga 900

ctcatcaaca acaattgggg attccggccc aagagactca acttcaaact cttcaacatc 960ctcatcaaca acaattgggg attccggccc aagagactca acttcaaact cttcaacatc 960

caagtcaagg aggtcacgac gaatgatggc gtcacaacca tcgctaataa ccttaccagc 1020caagtcaagg aggtcacgac gaatgatggc gtcacaacca tcgctaataa ccttaccagc 1020

acggttcaag tcttctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080acggttcaag tcttctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080

gaccaatacc tgtattacct gaacagaact caaaatcagt ccggaagtgc ccaaaacaag 1380gaccaatacc tgtattacct gaacagaact caaaatcagt ccggaagtgc ccaaaacaag 1380

gacttgctgt ttagccgtgg gtctccagct ggcatgtctg ttcagcccaa aaactggcta 1440gacttgctgt ttagccgtgg gtctccagct ggcatgtctg ttcagcccaa aaactggcta 1440

cctggaccct gttatcggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500cctggaccct gttatcggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500

aattttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560aattttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560

attacagacg aagaggaaat taaagccact aaccctgtgg ccaccgaaag atttgggacc 1740attacagacg aagaggaaat taaagccact aaccctgtgg ccaccgaaag atttgggacc 1740

gcggagtttt cagctacaaa gtttgcttca ttcatcaccc aatactccac aggacaagtg 2040gcggagtttt cagctacaaa gtttgcttca ttcatcaccc aatactccac aggacaagtg 2040

agtgtggaaa ttgaatggga gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg 2100agtgtggaaa ttgaatggga gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg 2100

cagtacacat ccaattatgc aaaatctgcc aacgttgatt ttactgtgga caacaatgga 2160cagtacacat ccaattatgc aaaatctgcc aacgttgatt ttactgtgga caacaatgga 2160

<210> 18<210> 18

<211> 737<211> 737

<212> PRT<212> PRT

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 18<400> 18

1 5 10 151 5 10 15

20 25 30 20 25 30

35 40 45 35 40 45

50 55 60 50 55 60

65 70 75 8065 70 75 80

85 90 95 85 90 95

100 105 110 100 105 110

115 120 125 115 120 125

130 135 140 130 135 140

Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly IlePro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile

145 150 155 160145 150 155 160

Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly GlnGly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln

165 170 175 165 170 175

Thr Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu ProThr Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro

180 185 190 180 185 190

Pro Ala Ala Pro Ser Gly Val Glu Ser Leu Thr Met Ala Ser Gly GlyPro Ala Ala Pro Ser Gly Val Glu Ser Leu Thr Met Ala Ser Gly Gly

195 200 205 195 200 205

Gly Ala Pro Val Ala Asp Ser Asn Glu Gly Ala Asp Gly Val Gly AsnGly Ala Pro Val Ala Asp Ser Asn Glu Gly Ala Asp Gly Val Gly Asn

210 215 220 210 215 220

225 230 235 240225 230 235 240

Ile Thr Asn Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn HisIle Thr Asn Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His

245 250 255 245 250 255

260 265 270 260 265 270

275 280 285 275 280 285

290 295 300 290 295 300

305 310 315 320305 310 315 320

325 330 335 325 330 335

Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln LeuAsn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu

340 345 350 340 345 350

355 360 365 355 360 365

370 375 380 370 375 380

385 390 395 400385 390 395 400

405 410 415 405 410 415

420 425 430 420 425 430

435 440 445 435 440 445

450 455 460 450 455 460

465 470 475 480465 470 475 480

485 490 495 485 490 495

500 505 510 500 505 510

515 520 525 515 520 525

530 535 540 530 535 540

545 550 555 560545 550 555 560

565 570 575 565 570 575

580 585 590 580 585 590

595 600 605 595 600 605

610 615 620 610 615 620

625 630 635 640625 630 635 640

645 650 655 645 650 655

660 665 670 660 665 670

675 680 685 675 680 685

690 695 700 690 695 700

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn GlyAsn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly

705 710 715 720705 710 715 720

725 730 735 725 730 735

LeuLeu

Claims

1. A nucleic acid encoding an adeno-associated virus capsid protein, wherein the nucleic acid comprises a nucleotide sequence selected from any one of the group consisting of:

(a) 1, nucleotide sequence SEQ ID NO;

(b) 3, nucleotide sequence SEQ ID NO;

(c) nucleotide sequence SEQ ID NO 5;

(d) nucleotide sequence SEQ ID NO 7;

(e) the nucleotide sequence of SEQ ID NO 9;

(f) nucleotide sequence SEQ ID NO 11;

(g) 13 in nucleotide sequence SEQ ID NO;

(h) 15, nucleotide sequence SEQ ID NO;

(i) nucleotide sequence SEQ ID NO 17; or

(j) A nucleotide sequence of an adeno-associated virus capsid protein encoded by a nucleotide sequence of any one of (a) - (i) which differs from the nucleotide sequence of (a) - (i) due to the degeneracy of the genetic code.

2. The nucleic acid of claim 1, wherein the nucleic acid is a plasmid, a bacteriophage, a viral vector, a bacterial artificial chromosome, or a yeast artificial chromosome.

3. The nucleic acid of claim 2, wherein the nucleic acid is an adeno-associated viral vector comprising a coding sequence.

4. The nucleic acid of claim 3, further comprising a coding sequence for an adeno-associated viral Rep protein.

5. An adeno-associated virus capsid protein encoded by the nucleic acid of claim 1.

6. The adeno-associated virus capsid protein according to claim 5, wherein the amino acid sequence of the adeno-associated virus capsid protein comprises any one of SEQ ID NO 2, SEQ ID NO 4, SEQ ID NO 6, SEQ ID NO 8, SEQ ID NO 10, SEQ ID NO 12, SEQ ID NO 14, SEQ ID NO 16, or SEQ ID NO 18.

7. The adeno-associated virus capsid protein according to claim 5 or 6, wherein said adeno-associated virus capsid protein is covalently linked to, associated with or encapsulating a composition selected from the group consisting of one or more of a DNA molecule, an RNA molecule, a polypeptide, a carbohydrate, a liposome and a small organic molecule.

8. A recombinant viral particle comprising a nucleic acid according to any one of claims 1 to 4 and/or a capsid protein according to any one of claims 5 to 7.

9. The recombinant viral particle of claim 8, wherein the recombinant viral particle is a recombinant adeno-associated viral particle, a recombinant adenoviral particle, a recombinant herpes viral particle, a recombinant baculovirus particle, or a recombinant hybrid viral particle.

10. A recombinant adeno-associated viral particle comprising an adeno-associated viral vector genome and the adeno-associated viral capsid protein of claim 5 or 6, wherein the adeno-associated viral vector genome is enveloped in the adeno-associated viral capsid protein.

11. The recombinant adeno-associated viral particle of claim 10, wherein the genome of the adeno-associated viral vector comprises a heterologous nucleic acid sequence.

12. The recombinant adeno-associated viral particle according to claim 11 wherein the heterologous nucleic acid sequence encodes one or more selected from the group consisting of antisense RNA, microRNA, shRNA, polypeptides and immunogens.

13. The recombinant adeno-associated viral particle of claim 12, wherein the heterologous nucleic acid sequence encodes a polypeptide that is a therapeutic polypeptide or a reporter gene.

14. The recombinant adeno-associated viral particle according to claim 13, the therapeutic polypeptide encoded by the heterologous nucleic acid is selected from the group consisting of insulin, glucagon, growth hormone releasing factor, erythropoietin, insulin growth factor, transforming growth factor alpha, hepatocyte growth factor, tyrosine hydroxylase, thrombopoietin, interleukin 1-interleukin 25, low density lipoprotein receptor, glucocorticoid receptor, vitamin D receptor, interferon regulatory factor, factor viii, factor ix, glucosidase, glucose-6-phosphatase, isovaleryl-CoA dehydrogenase, propionyl-CoA carboxylase, beta-glucosidase, liver phosphorylase, phosphorylase kinase, glycine decarboxylase, alpha-galactosidase, beta-galactosidase, and lysosomal enzyme.

15. A cell comprising the nucleic acid of any one of claims 1 to 4, the adeno-associated virus capsid protein of any one of claims 5 to 7, the recombinant virion of any one of claims 8 to 9 and/or the recombinant adeno-associated virion of any one of claims 10 to 14.

16. The cell of claim 15, wherein the cell is selected from the group consisting of e.coli, a HEK293 cell line, a HEK293T cell line, a HEK293A cell line, a HEK293S cell line, a HEK293FT cell line, a HEK293F cell line, a HEK293H cell line, a HeLa cell line, a SF9 cell line, a SF21 cell line, a SF900 cell line, and a BHK cell line.

17. A pharmaceutical composition comprising a pharmaceutically acceptable carrier or excipient and one or more selected from the group consisting of the nucleic acid of any one of claims 1 to 4, the adeno-associated virus capsid protein of any one of claims 5 to 7, the recombinant virion of any one of claims 8 to 9, the recombinant adeno-associated virion of any one of claims 10 to 14 and/or the cell of claim 15.

18. Use of one or more of the nucleic acid of any one of claims 1-4, the adeno-associated virus capsid protein of any one of claims 5-7, the recombinant viral particle of any one of claims 8-9, the recombinant adeno-associated viral particle of any one of claims 10-14, the cell of claim 15 and/or the pharmaceutical composition of claim 17 in the manufacture of a medicament for the prevention or treatment of a disease.

19. The disease of claim 18, wherein the disease is selected from the group consisting of cystic fibrosis and other diseases of the lung, hemophilia a, hemophilia B, thalassemia, anemia and other blood disorders, senile dementia, multiple sclerosis, parkinson's disease, huntington's disease, amyotrophic lateral sclerosis, epilepsy, cancer, diabetes, muscular dystrophy, glycogen storage disease and other metabolic defects, congenital emphysema, Lesch-Nyhan syndrome, Niemann-Pick disease, aids, hepatitis, hyperammonemia, and spinocerebral ataxia.

20. A method of producing recombinant adeno-associated virions, the method comprising providing a nucleic acid of claim 1, a nucleic acid encoding a Rep protein of an adeno-associated virus, an adeno-associated viral vector genome carrying heterologous nucleic acid sequences, a helper factor that facilitates production of infectious gonadal-associated virus, and allowing the adeno-associated viral vector genome to be enveloped in adeno-associated viral capsid proteins encoded by the nucleic acid of claim 1 and allowing assembly of the recombinant adeno-associated virions in vitro.

21. The method of claim 20, wherein the method is an adeno-associated virus vector production system, comprising a two-plasmid packaging system, a three-plasmid packaging system, a baculovirus packaging system, and an adeno-associated virus packaging system using adenovirus or herpes simplex virus as a helper virus.

22. A method of delivering a heterologous nucleic acid to a cell in vitro, comprising administering to the cell the nucleic acid of any one of claims 1-4, the adeno-associated virus capsid protein of any one of claims 5-7, the recombinant virion of claim 8 or 9, the recombinant adeno-associated virion of any one of claims 10-14, and/or the pharmaceutical composition of claim 17.

23. The method of delivering a heterologous nucleic acid to a cell according to claim 22, wherein the cell is a mammalian cell, preferably a human stem cell or a liver cell.

24. A method of delivering a heterologous nucleic acid to a mammal, the method comprising administering to a mammalian subject an effective amount of the nucleic acid of any one of claims 1 to 4, the adeno-associated virus capsid protein of any one of claims 5 to 7, the recombinant virion of claim 8 or 9, the recombinant adeno-associated virion of any one of claims 10 to 14, the cell of claim 15, and/or the pharmaceutical composition of claim 17.

25. The method of claim 24, wherein the mammal is a human subject or a primate subject.