CN109777832B - 碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 - Google Patents
碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 Download PDFInfo
- Publication number
- CN109777832B CN109777832B CN201910105320.6A CN201910105320A CN109777832B CN 109777832 B CN109777832 B CN 109777832B CN 201910105320 A CN201910105320 A CN 201910105320A CN 109777832 B CN109777832 B CN 109777832B
- Authority
- CN
- China
- Prior art keywords
- man2b1
- repair
- sgrna
- mutation
- cells
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000008439 repair process Effects 0.000 title claims abstract description 48
- 238000000034 method Methods 0.000 title claims abstract description 33
- 101000979046 Homo sapiens Lysosomal alpha-mannosidase Proteins 0.000 title claims abstract description 21
- 102100023231 Lysosomal alpha-mannosidase Human genes 0.000 title claims abstract description 21
- 238000004088 simulation Methods 0.000 title claims abstract description 10
- 239000003153 chemical reaction reagent Substances 0.000 title abstract description 6
- 230000035772 mutation Effects 0.000 claims abstract description 57
- 238000002703 mutagenesis Methods 0.000 claims abstract description 4
- 231100000350 mutagenesis Toxicity 0.000 claims abstract description 4
- 238000010276 construction Methods 0.000 claims description 7
- 238000005516 engineering process Methods 0.000 abstract description 19
- 208000027933 Mannosidase Deficiency disease Diseases 0.000 abstract description 10
- 238000001514 detection method Methods 0.000 abstract description 8
- 239000003795 chemical substances by application Substances 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 71
- 239000000872 buffer Substances 0.000 description 36
- 108090000623 proteins and genes Proteins 0.000 description 33
- 102000004169 proteins and genes Human genes 0.000 description 30
- 239000013612 plasmid Substances 0.000 description 28
- 108091033409 CRISPR Proteins 0.000 description 26
- 239000013598 vector Substances 0.000 description 26
- 108091027544 Subgenomic mRNA Proteins 0.000 description 25
- 108020004414 DNA Proteins 0.000 description 24
- 238000002156 mixing Methods 0.000 description 21
- 238000013518 transcription Methods 0.000 description 19
- 230000035897 transcription Effects 0.000 description 19
- 230000001717 pathogenic effect Effects 0.000 description 18
- 239000000203 mixture Substances 0.000 description 16
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 15
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 15
- 239000007788 liquid Substances 0.000 description 15
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 15
- 239000012634 fragment Substances 0.000 description 14
- 238000000338 in vitro Methods 0.000 description 14
- 238000011084 recovery Methods 0.000 description 14
- 239000006228 supernatant Substances 0.000 description 14
- 241000894006 Bacteria Species 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 238000011144 upstream manufacturing Methods 0.000 description 12
- 239000012154 double-distilled water Substances 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 239000000499 gel Substances 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 239000000243 solution Substances 0.000 description 9
- 238000010354 CRISPR gene editing Methods 0.000 description 8
- 238000010828 elution Methods 0.000 description 8
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 8
- 239000002699 waste material Substances 0.000 description 8
- 238000010362 genome editing Methods 0.000 description 7
- 238000012165 high-throughput sequencing Methods 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 238000004113 cell culture Methods 0.000 description 6
- 239000007795 chemical reaction product Substances 0.000 description 6
- 229910017052 cobalt Inorganic materials 0.000 description 6
- 239000010941 cobalt Substances 0.000 description 6
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 239000006166 lysate Substances 0.000 description 6
- 238000005406 washing Methods 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102000003960 Ligases Human genes 0.000 description 5
- 108090000364 Ligases Proteins 0.000 description 5
- 210000001185 bone marrow Anatomy 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 102000012758 APOBEC-1 Deaminase Human genes 0.000 description 4
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 4
- 238000010453 CRISPR/Cas method Methods 0.000 description 4
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 4
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 4
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 4
- 101100129193 Mus musculus Man2b1 gene Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 229930182555 Penicillin Natural products 0.000 description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000001976 enzyme digestion Methods 0.000 description 4
- 238000010438 heat treatment Methods 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 238000004255 ion exchange chromatography Methods 0.000 description 4
- 229940049954 penicillin Drugs 0.000 description 4
- 238000001556 precipitation Methods 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- 239000012096 transfection reagent Substances 0.000 description 4
- 241000180579 Arca Species 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 108010053770 Deoxyribonucleases Proteins 0.000 description 3
- 102000016911 Deoxyribonucleases Human genes 0.000 description 3
- 101150030800 MAN2B1 gene Proteins 0.000 description 3
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 101710130181 Protochlorophyllide reductase A, chloroplastic Proteins 0.000 description 3
- 201000008333 alpha-mannosidosis Diseases 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000004440 column chromatography Methods 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 238000011068 loading method Methods 0.000 description 3
- 210000004698 lymphocyte Anatomy 0.000 description 3
- 239000011565 manganese chloride Substances 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000011535 reaction buffer Substances 0.000 description 3
- 230000003007 single stranded DNA break Effects 0.000 description 3
- 230000001502 supplementing effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- 240000000220 Panda oleosa Species 0.000 description 2
- 235000016496 Panda oleosa Nutrition 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 101150063416 add gene Proteins 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 238000007664 blowing Methods 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000012350 deep sequencing Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000002641 enzyme replacement therapy Methods 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 239000006481 glucose medium Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L magnesium chloride Substances [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 239000012266 salt solution Substances 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 239000012130 whole-cell lysate Substances 0.000 description 2
- BGFHMYJZJZLMHW-UHFFFAOYSA-N 4-[2-[[2-(1-benzothiophen-3-yl)-9-propan-2-ylpurin-6-yl]amino]ethyl]phenol Chemical compound N1=C(C=2C3=CC=CC=C3SC=2)N=C2N(C(C)C)C=NC2=C1NCCC1=CC=C(O)C=C1 BGFHMYJZJZLMHW-UHFFFAOYSA-N 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 101000760085 Daucus carota 21 kDa protein Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241001123946 Gaga Species 0.000 description 1
- 102000001696 Mannosidases Human genes 0.000 description 1
- 108010054377 Mannosidases Proteins 0.000 description 1
- 208000036626 Mental retardation Diseases 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 206010072610 Skeletal dysplasia Diseases 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000007605 air drying Methods 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- DZUDZSQDKOESQQ-UHFFFAOYSA-N cobalt hydrogen peroxide Chemical compound [Co].OO DZUDZSQDKOESQQ-UHFFFAOYSA-N 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000001815 facial effect Effects 0.000 description 1
- 108700014844 flt3 ligand Proteins 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 238000011134 hematopoietic stem cell transplantation Methods 0.000 description 1
- 210000003917 human chromosome Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 208000016245 inborn errors of metabolism Diseases 0.000 description 1
- 208000015978 inherited metabolic disease Diseases 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 230000026447 protein localization Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 238000009168 stem cell therapy Methods 0.000 description 1
- 238000009580 stem-cell therapy Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000002636 symptomatic treatment Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明提供了一种碱基编辑模拟、修复与甘露糖苷贮积症相关的MAN2B1C2248T突变的试剂和方法。高效模拟MAN2B1C2248T突变的试剂盒包括碱基编辑系统以及针对MAN2B1C2248T位点的突变mt‑sgRNA;高效修复MAN2B1C2248T突变的试剂盒包括碱基编辑系统以及针对MAN2B1C2248T位点的突变re‑sgRNA。本发明利用碱基编辑技术,通过精准的CT或AG单碱基突变,从而可以模拟、修复MAN2B1C2248T的突变,从而为治疗该类突变引起的甘露糖苷贮积症提供了高效安全的方法。
Description
技术领域
本发明涉及基因修复领域,更具体来说,在人细胞中利用碱基编辑模拟、修复与甘露糖苷贮积症相关的MAN2B1C2248T(α-mannosidosis)相关突变的试剂和方法。
背景技术
α-甘露糖苷贮积症(α-mannosidosis)是一种常染色体隐性遗传代谢病,主要是由于编码甘露糖苷酶的基因MAN2B1发生突变,导致α-甘露糖苷酶缺乏所引起的罕见溶酶体贮积症1。本病发病率大约为1/500000—1/1000000,在全球范围内均有发病。其临床症状多样,并没有一个统一的、独有的临床表型,主要表现为免疫缺失、听力受损、面容扭曲、骨骼发育异常、精神迟缓及其他并发症2。目前对α-甘露糖苷贮积症主要采用早期对症治疗,包括造血干细胞移植,酶替代疗法3。其中造血干细胞治疗必须权衡整个治疗过程相关的发病率和死亡率的风险,目前还不是一个最理想的选择;酶替代疗法只是暂时缓解症状,也不是一种最优选择。因此目前尚无特效疗法彻底根除此类疾病。鉴于α-甘露糖苷贮积症主要是由 MAN2B1单基因突变引起的,对突变基因进行修复将是根治此类疾病的有效手段。
高通量测序技术的发展与普及,使得我们可以获得更多的与疾病相关的突变,而如何对这些突变进行修复将是后基因组时代必须面对的问题。来自于细菌或古细菌中的CRISPR/Cas技术,自被应用于编辑哺乳动物细胞以来,已经成为基因编辑领域发展最快、应用最广的技术,引发了基因编辑领域的革命4。目前,CRISPR/Cas9系统已经被成功地用于DNA的敲除、敲入、替代、修饰、标记,RNA修饰和基因转录调节等研究5,6。并已经成功应用于多个物种的基因编辑6,7。CRISPR/Cas9介导的基因编辑是在sgRNA(single guided RNA) 通过靶序列互补引导Cas9蛋白定位剪切双链DNA,造成双链DNA断裂(double-strand breaks,DSB),在没有模板的条件下,发生非同源末端连接修复,造成移码突变(frameshiftmutation),导致基因敲除(knockout);在有模板的条件下,通过同源重组进行修复,实现基因敲入 (knockin),由于HDR效率低(整合很少发生),而且非同源性末端接合机制容易产生随机插入和删除(indel),使得在断裂点附近可能随机引入新的碱基,从而导致不精确的基因编辑8。
基于CRISPR/Cas9技术所构建的碱基编辑技术(Base editing),目前有胞嘧啶碱基编辑器(CBEs,cytosine base editors)和鸟嘌呤碱基编辑器(ABEs,Adenine baseeditors)。其能够在目标基因中精确而有效地引入点突变,而不需要双链DNA断裂或任何供体模板,展现出很大的基因编辑潜力9,10。目前利用碱基编辑,在酵母、植物、哺乳动物和人类细胞中进行了修正和遗传多样性研究11,12。MAN2B1基因位于人类第19号染色体上,该基因总长21.5kb13。目前HGMD突变数据库收录的和文献报道的MAN2B1基因突变已达一百多种,其中一半以上是错义或无义突变。其中一个错义突变c.2248C>T发生频率大约为27.5%。本发明将在人的细胞中利用碱基编辑技术实现安全高效地模拟与修复MAN2B1C2248T突变14,15,为临床上治疗相应突变引起的甘露糖苷贮积症提供可靠的试剂和方法。
参考文献
1.Paciotti,S,Codini,M,Tasegian,A,Ceccarini,MR,Cataldi,S,Arcuri,C,etal.(2017). Lysosomal alpha-mannosidase and alpha-mannosidosis.Front Biosci-Landmrk 22: 157-167.
2.Lund,AM,Borgwardt,L,Cattaneo,F,Ardigo,D,Geraci,S,Gil-Campos,M,etal.(2018). Comprehensive long-term efficacy and safety of recombinant humanalpha-mannosidase (velmanase alfa)treatment in patients with alpha-mannosidosis.Journal of inherited metabolic disease.
3.吴晓昀,张杰,and郭奕斌(2013).α-甘露糖苷贮积症的分子遗传学及其在临床诊断与防治方面的意义.分子诊断与治疗杂志5:261-267.
4.Gaj,T,Gersbach,CA,and Barbas,CF(2013).ZFN,TALEN,and CRISPR/Cas-based methods for genome engineering.Trends in Biotechnology 31:397-405.
5.Hsu,Patrick D,Lander,Eric S,and Zhang,F(2014).Development andApplications of CRISPR-Cas9for Genome Engineering.Cell 157:1262-1278.
6.Komor,AC,Badran,AH,and Liu,DR(2017).CRISPR-Based Technologies forthe Manipulation of Eukaryotic Genomes.Cell 168:20-36.
7.Barrangou,R,and Doudna,JA(2016).Applications of CRISPR technologiesin research and beyond.Nature Biotechnology 34:933.
8.Kosicki,M,Tomberg,K,and Bradley,A(2018).Repair of double-strandbreaks induced by CRISPR–Cas9leads to large deletions and complexrearrangements.Nature biotechnology.
9.Komor,AC,Kim,YB,Packer,MS,Zuris,JA,and Liu,DR(2016).Programmableediting of a target base in genomic DNA without double-stranded DNAcleavage.Nature 533: 420-424.
10.Gaudelli,NM,Komor,AC,Rees,HA,Packer,MS,Badran,AH,Bryson,DI,et al.(2017). Programmable base editing of A*T to G*C in genomic DNA without DNAcleavage. Nature 551:464-471.
11.Ren,B,Yan,F,Kuang,Y,Li,N,Zhang,D,Zhou,X,et al.(2018).Improved BaseEditor for Efficiently Inducing Genetic Variations in Rice with CRISPR/Cas9-Guided Hyperactive hAID Mutant.Molecular Plant 11:623-626.
12.Nishida,K,Arazoe,T,Yachie,N,Banno,S,Kakimoto,M,Tabata,M,et al.(2016). Targeted nucleotide editing using hybrid prokaryotic and vertebrateadaptive immune systems.Science 353.
13.Hilde Monica Frostad Riise,Thomas Berg, Nilssen,GiovanniRomeo,Ole Kristian Tollersrud,and Ceccherini,I(1997).Genomic structure of thehuman Lysosomal alpha-Mannosidase Gene(MANB).Genomics.
14.Riise Stensland,HM,Klenow,HB,Van Nguyen,L,Hansen,GM,Malm,D,andNilssen, O(2012).Identification of 83novel alpha-mannosidosis-associatedsequence variants: functional analysis of MAN2B1missense mutations.Humanmutation 33:511-520.
15.de Carvalho,TG,da Silveira Matte,U,Giugliani,R,and Baldo,G(2015).Genome Editing:Potential Treatment for Lysosomal Storage Diseases.CurrentStem Cell Reports 1:9-15.
发明内容
本发明的目的是提供一种高效模拟、修复MAN2B1C2248T突变的试剂盒和方法。
为了达到上述目的,本发明提供了一种制作甘露糖苷贮积症MAN2B1C2248T突变的方法,其特征在于,包括:在人细胞中,利用CRISPR/Cas9结合ssODN或者利用碱基编辑的方法制作含有MAN2B1C2248T的细胞,本方法包括基因编辑系统及所使用的mt-sgRNA。
优选地,利用CRISPR/Cas9结合ssODN方法时,所用mt-sgRNA对应的DNA序列为 SEQID NO.1,所用的ssODN序列为SEQ ID NO.2;利用碱基编辑时,所用mt-sgRNA对应的DNA序列为SEQ ID NO.3。
优选地,利用CRISPR/Cas9结合ssODN方法制作突变时,所用的Cas9为spCas9或XCas9。
优选地,利用碱基编辑制作突变时,所用的编辑系统为EE-XBE3、YE2-XBE3或者YEE-XBE3,优选选择EE-XBE3。
优选地,所述的含有MAN2B1C2248T的突变细胞为人HEK293T细胞或人的造血干细胞(HSC)。
优选地,所述的含有MAN2B1C2248T的突变细胞的构建方法为:根据MAN2B1C2248T位点设计突变mt-sgRNA和相应的突变ssODN;构建mt-sgRNA的表达载体,体外将Cas9蛋白和转录出来的mt-sgRNA组成RNP结合ssODN的方式并电转细胞,PCR产物deep sequencing 鉴定突变效率,并挑选出纯合子突变细胞株。
优选地,所述的含有MAN2B1C2248T的突变细胞的构建方法为:根据MAN2B1C2248T位点设计突变mt-sgRNA,构建mt-sgRNA的表达载体,利用碱基编辑系统EE-XBE3或者 YE2-XBE3或者YEE-XBE3和突变mt-sgRNA转染相应的细胞,鉴定突变效率。
本发明提供了一种高效修复MAN2B1C2248T突变的试剂盒,其特征在于:在含有MAN2B1C2248T的突变细胞中,利用针对MAN2B1C2248T位点的修复re-sgRNA引导碱基编辑系统到突变位点进行碱基编辑修复,收集转染后的细胞,鉴定修复率。本试剂盒和方法包括碱基编辑系统(base editor)以及针对MAN2B1C2248T位点的修复re-sgRNA。
优选地,所述的碱基编辑系统为spCas9-ABE,XCas9-ABE,优先XCas9-ABE。
优选地,所述的碱基编辑系统可以是质粒,mRNA或蛋白形式,优先选择蛋白形式。
优选地,所述的针对MAN2B1C2248T位点的修复re-sgRNA的序列,其对应的DNA序列为SEQ ID NO.4。
优选地,所使用的sgRNA可以是质粒形式,也可以是RNA形式,优先选择RNA形式。
优选地,所述的碱基编辑修复甘露糖苷贮积症突变方法还包括:Sanger测序检测编辑效率;高通量测序on-target,indel和off-target的效率。
本发明提供了一种在人细胞中制作突变和修复突变的组合,其特征在于,根据MAN2B1C2248T位点,对于CRISPR/Cas9方法设计突变mt-sgRNA和相应的突变ssODN;对于碱基编辑设计突变mt-sgRNA;针对MAN2B1C2248T位点设计修复re-sgRNA以及相应的碱基编辑系统。
优选地,所述的mt-sgRNA的序列为SEQ ID NO.1,ssODN的序列为SEQ ID NO.2或mt-sgRNA的序列为SEQ ID NO.3,re-sgRNA的序列为SEQ ID NO.4。
MAN2B1的突变是造成甘露糖苷贮积症的主要成因,从遗传上彻底修复此突变,将是治疗该疾病最有效的措施。碱基编辑系统提供了一种精确改变DNA,即将C变为T或者A变为G的方法。
发明人将利用基于碱基编辑或CRISPR/Cas9结合ssODN的同源重组方法制作含有MAN2B1C2248T突变的细胞株,其后利用base editor结合合适的sgRNA修复此突变,利用深度测序的方式检测修复效率和脱靶情况。而为治疗该类突变引起的甘露糖苷贮积症提供了高效安全的方法。
附图说明
图1A至图1D为在HEK293T细胞系中利用CRISPR/Cas9结合ssODN制作 MAN2B1C2248T突变的方法:图1A是相应突变在染色体上的位置。致病突变将引起氨基酸的改变,使Arg变成Trp;图1B是利用CRISPR/Cas9结合ssODN的方式可以制作相应突变的细胞株,ssODN长为113nt,使用的突变mt-sgRNA用下划线表示,黑色斜体是所用的PAM 序列;图1C为利用Cas9蛋白和mt-sgRNA结合ssODN的方式电转HEK293T细胞,通过流式分选单细胞,对单克隆细胞进行基因型鉴定;图1D为获得的纯合子突变的细胞株测序峰图。
图2A-2C为利用碱基编辑技术获得MAN2B1C2248T突变细胞:图2A是相应突变在染色体上的位置。致病突变将引起氨基酸的改变,使Arg变成Trp;图2B是利用碱基编辑, EE-XBE3/YE2-XBE3/YEE-XBE3,制作相应的突变细胞株,使用的突变mt-sgRNA用下划线表示,黑色斜体是所用的PAM序列;图2C为三种碱基编辑系统对致病位点及其临近位点的编辑效率,C5为致病位点。
图3A-3D为利用碱基编辑系统对突变细胞株进行修复:图3A为利用ABE对靶位点进行修复的模式图,使用的修复re-sgRNA用下划线表示,黑色斜体是所用的PAM序列;图 3B为利用XABE和ABE系统对靶位点进行修复,T6为相应的致病位点;图3C为对XABE 的不同形式进行修复效率的比较,分别有蛋白形式,mRNA形式和质粒形式;图3D为对修复过的细胞株进行测序,证明了修复的特异性。
图4A和图4B为突变细胞株进行脱靶分析:图4A为对修复细胞株通过高通量测序方法检测indel;图4B为对XABE所使用的修复sgRNA潜在的10个脱靶位点进行高通量测序检测off-target。
图5A和图5B为利用碱基编辑系统对HSC细胞进行突变和修复:图5A为获得的CD34+阳性细胞的比例;图5B为利用碱基编辑突变HSC细胞,致病突变的比例约为60%(Control),编辑过的HSC经过碱基编辑进行修复,利用XABE修复过的HSC致病突变的比例约为27%(XABE),利用ABE修复过的HSC致病突变的比例为55%(ABE)。
具体实施方式
下面将结合实施例对本发明的实施方案进行清楚、完整的描述,显然,所描述的实施例仅用于说明本发明的一部分实施例,而不应视为限制本发明的范围。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,视为可以通过市售购买获得的常规产品。
以上所述仅为本发明较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。
实施例1
本实施例中,在细胞株上利用Cas9/sgRNA结合ssODN制作突变MAN2B1C2248T突变细胞株,本方法将利用Cas9mRNA和sgRNA的RNA结合ssODN形式实现,参见图1A-图1D。
1.1质粒构建
在突变位点附近,设计突变mt-sgRNA(SEQ ID NO.1),合成oligos,上游序列为:5’-taggCTACACAGACAGCAATGGCC-3’(SEQ ID NO.(5)),下游序列为: 5’-aaacGGCCATTGCTGTCTGTGTAG-3’(SEQ ID NO.(6)),上下游序列通过程序(95℃, 5min;95℃-85℃at-2℃/s;85℃-25℃at-0.1℃/s;hold at 4℃)退火,连接到经过BsaI(NEB:R0539L)线性化的PUC57-T7sgRNA载体上(addgene:51132)上。线性化体系如下所示:PUC57-T7sgRNA 2μg;buffer(NEB:R0539L)6μL;BsaI 2μL;ddH2O补齐到60μL。37℃酶切过夜。所使用的同源模板ssODN(SEQ ID NO.4),利用PAGE纯化的方式由生工生物公司(http://www.sangon.com/)合成。连接体系如下:T4连接buffer(NEB:M0202L)1μL,线性化载体20ng,退火的oligo片段(10μM)5μL,T4连接酶(NEB:M0202L)0.5μL, ddH2O补齐到10μL.16℃连接过夜。连接的载体通过转化,挑菌,鉴定,鉴定引物上游序列:5’-CGCCAGGGTTTTCCCAGTCACGAC-3’(SEQ ID NO.7),下游序列为相应oligo 的下游序列。对阳性克隆摇菌提取质粒(Axygene:AP-MN-P-250G)测定浓度备用。获得的突变质粒命名为mt-T7-sgRNA。
1.2sgRNA的体外转录
以构建的mt-T7sgRNA为模板,扩增含有sgRNA的片段,所用引物为:F: 5’-TCTCGCGCGTTTCGGTGATGACGG-3’,(SEQ ID NO.8)R: 5’-AAAAAAAGCACCGACTCGGTGCCACTTTTTC-3’(SEQ ID NO.9)。扩增体系如下: 2Xbuffer(诺唯赞:P505)25μL;dNTP 1μL;F(10pmol/μL)2μL;R(10pmol/μL)2μL;模板1ng;DNA聚合酶(诺唯赞:P505)0.5μL;ddH2O补齐到50μL。扩增出来的PCR 产物经过下述步骤纯化:每100μL体积加4μL RNAsecure(Life:AM7005);60℃15分钟;加入三倍体积的PCR-A(Axygen:AP-PCR-250G)过柱,离心,12000转/分钟离心1分钟;加入500μL W2,离心1分钟;空转1分钟;加入20μL无RNAase水洗脱。
利用体外转录试剂盒(Ambion,Life Technologies,AM1354)转录,步骤如下:
反应体系为:reaction buffer 1μL;enzyme mix 1μL;A 1μL;T 1μL;G1μL;C 1μL;模板800ng; H2O补齐到10μL。上述体系混匀后37℃反应5个小时。加入1μL DNase,37℃反应15分钟。利用回收试剂盒(Ambion,Life Technologies,AM1908)回收转录的sgRNA,步骤如下:上步反应体积加入90μL Elution solution移植1.5mlEP管;加入350μL Bindingsolution混匀;加入250μL无水乙醇混匀;上柱;10000转/分钟离心30秒,倒掉废液;加入500μL Washing solution,10000转/分钟离心30秒,倒掉废液;空转1分钟;换收集管,加入100μL Elution solution洗脱;加入10μL醋酸铵(Ambion,Life Technologies,AM1908)混匀;加入275μL无水乙醇混匀;-20℃放置30分钟,同时准备70%乙醇放置-20℃;4℃环境下13000转/分钟离心15分钟。弃上清,加入500μL 70%乙醇;离心5分钟,吸走废液,晾干5分钟;加入20μL 的水溶解;取1μL测浓度。
1.3Cas9的体外转录
spCas9或者xCas9酶切回收。本步骤是将质粒Cas9进行线性化。体系如下:Cas9 10μg; buffer I(NEB:R0539L)10μL;BbsI 4μL(NEB:R0539L);H2O补齐到100μL。混匀之后,37℃酶切过夜。
线性化质粒的回收。酶切产物中加入4μL RNAsecure(Life:AM7005),60℃反应10分钟;利用回收试剂盒(QIAGEN:28004)进行操作其余步骤,加入5倍体积buffer PB,过柱;加入750μL buffer PE离心;空转1分钟;用10μL水洗脱,测定浓度。
体外转录。按照试剂盒(Invitrogen:AM1345)的要求依次加入体系:1入g线性化载体;10μL2XNTP/ARCA;补齐到20μL水;2μL T7ezyme mix;2μL 10xreaction buffer。混合之后37℃反应2小时。加入1μL DNasea反应15分钟。
加尾。转录产物进行加尾处理保证转录mRNA的稳定性。具体体系如下:20μL反应产物;36μL H2O;20μL 5xE-PAP buffer;10μL 25mM MnCl2;10μL ATP solution;4μL PEP。反应体系混匀后37℃反应30分钟。
回收。利用回收试剂盒进行(QIAGEN:74104)。步骤如下:上步反应产物加入350μLbuffer RLT;加入250μL无水乙醇,过柱,离心;加入500μL RPE,离心,加入500μL RPE,离心;空转;加入30μL水洗脱。测定浓度后-80℃保存。
1.4细胞的培养与电转
(1)以HEK293T细胞(购自ATCC)为例,本发明进行真核生物细胞的培养与转染:HEK293T细胞接种培养于添加10%FBS的DMEM高糖培养液中(HyClone,SH30022.01B), 其中含penicillin(100U/ml)和streptomycin(100μg/ml)。
(2)转染前两个小时换成无抗生素的培养基,利用LONZA转染试剂(SF KIT)按照说明书转染,细胞通过计数得1X106个。将Cas9的mRNA,sgRNA和ssODN按照3μg,1.5μg 和3μg的质量混合。电转程序采用DS150,电转后的细胞在6cm的平皿中培养两天。
(3)细胞通过流式分选仪,分选单细胞培养,等两周以后,通过裂解鉴定基因型,裂解液的成分为50mM KCl,1.5mM MgCl2,10mM Tris pH 8.0,0.5%Nonidet P-40,0.5%Tween 20,100μg/ml protease K。挑选纯合突变的细胞株扩大培养。
1.5突变细胞株的筛选与鉴定
通过对分选的单克隆细胞进行鉴定,在选择的14个克隆里,#9克隆是个纯合子突变(图 1D),将此细胞株扩大培养进行后续的应用。
实施例2
本实施例中,在细胞株上利用碱基编辑系统CBEs(Cytosine base editors)制作突变MAN2B1C2248T突变细胞株,本方法将利用相应的BE3的mRNA和sgRNA的RNA形式实现,参见图2A-2C。
1.1质粒构建
sgRNA载体构建:在突变位点附近,设计突变mt-sgRNA(SEQ ID NO.3),合成oligos,上游序列为:5’-taggTGGCCGGGAGATCCTGGAGA-3’(SEQ ID NO.(10)),下游序列为:5’-aaacTCTCCAGGATCTCCCGGCCA-3’(SEQ ID NO.(11)),上下游序列通过程序(95℃,5min;95℃-85℃at-2℃/s;85℃-25℃at-0.1℃/s;hold at 4℃)退火,连接到经过BsaI(NEB:R0539L)线性化的PUC57-T7sgRNA载体上(addgene:51132)上。线性化体系如下所示:PUC57-T7sgRNA 2μg;buffer(NEB:R0539L)6μL;BsaI 2μL;ddH2O 补齐到60μL。37℃酶切过夜。连接体系如下:T4连接buffer(NEB:M0202L)1μL,线性化载体20ng,退火的oligo片段(10μM)5μL,T4连接酶(NEB:M0202L)0.5μL, ddH2O补齐到10μL.16℃连接过夜。连接的载体通过转化,挑菌,鉴定,鉴定引物上游序列:5’-CGCCAGGGTTTTCCCAGTCACGAC-3’(SEQ ID NO.12),下游序列为相应oligo 的下游序列。对阳性克隆摇菌提取质粒(Axygene:AP-MN-P-250G)测定浓度备用。获得的突变质粒命名为mt-T7-sgRNA。
不同类型BE3的构建:构建不同版本的BE3,即EE-XBE3(SEQ ID NO.13)、YE2-XBE3(SEQ ID NO.14)、YEE-XBE3(SEQ ID NO.15)。原始版本XBE3(SEQ ID NO.16)购自 Addgene(108380)。
不同类型APOBEC1的合成:上述三种BE3只是在rAPOBEC1的编码框上有所差异,EE-rAPOBEC1、YE2-rAPOBEC1、YEE-rAPOBEC1,其序列分别由生工生物 (http://www.sangon.com/)合成。所合成片段的两端分别加有NotI和SmaI的酶切位点。合成的片段克隆到常用的pmd19t载体(TAKARA:6013)上。
载体的酶切与纯化:XBE3和上述三种合成的载体经过NotI(NEB:R0189L)、SmaI(NEB:R0141L)酶切,体系如下:Buffer(NEB:R0189L)6uL;质粒2ug;NotI 1μL; SmaI 1μL;ddH2O补齐到60μL。混样后于37℃过夜酶切。酶切产物经过1%的琼脂糖凝胶,进行回收(Axygen:AP-GX-250G)。其中XBE3回收大片段的骨架载体,合成载体回收 APOBEC1的小片段载体。回收按照试剂盒的使用说明进行(Axygen:AP-PCR-250G)。回收后的片段经过Nanodrop 2000检测浓度。
载体的连接、转化与质粒提取:回收的骨架载体和APOBEC1片段经过连接,连接体系如下:T4连接buffer(NEB:M0202L)1μL,骨架载体20ng,APOBEC1片段50ng, T4连接酶(NEB:M0202L)0.5μL,ddH2O补齐到10μL,16℃连接过夜。转化步骤如下:取20μL感受态细胞(TransGen:CD201)在冰上解冻;2μL的连接产物与感受态细胞混合,冰上放置20分钟;42℃热激60秒;冰上放置2分钟,加入400μL复苏LB培养基(MDBio: L001-1kg),摇床30分钟;取70μL涂氨苄平板(50μg/ml,37℃培养箱,培养14个小时。挑选单克隆菌,在4ml液体LB培养基中扩大培养,14小时后提取质粒(Axygene: AP-MN-P-250G)。步骤如下:菌液经过4000转/分钟离心10分钟,倒掉上清培养基;加入 350μL的buffer S1,将菌体吹散,转移到2ml离心管中;加入250μL的buffer S2,上下颠倒8次;加入250μL的buffer S3,颠倒混匀6次,产生沉淀;12000转/分钟离心10分钟,取上清过柱;离心1分钟,倒掉废液,加入500μL的W1,离心一分钟,倒掉废液;加入 750μL的W2,离心,倒掉上清;加入500μL的W2,离心,倒掉上清;空转1分钟;加入 50μL的洗脱液,静置2分钟,离心。获得质粒经过浓度检测,取10μL送测序,阳性质粒保存在-20℃。
最终构建成如下四种BE3:
(1)EE-XBE3,SEQ ID NO.13;
(2)YE2-XBE3,SEQ ID NO.14;
(3)YEE-XBE3,SEQ ID NO.15;
1.2sgRNA的体外转录
以构建的mt-T7sgRNA为模板,扩增含有sgRNA的片段,所用引物为:F: 5’-TCTCGCGCGTTTCGGTGATGACGG-3’,(SEQ ID NO.8)R: 5’-AAAAAAAGCACCGACTCGGTGCCACTTTTTC-3’(SEQ ID NO.9)。扩增体系如下: 2Xbuffer(诺唯赞:P505)25μL;dNTP 1μL;F(10pmol/μL)2μL;R(10pmol/μL)2μL;模板1ng;DNA聚合酶(诺唯赞:P505)0.5μL;ddH2O补齐到50μL。扩增出来的PCR 产物经过下述步骤纯化:每100μL体积加4μL RNAsecure(Life:AM7005);60℃15分钟;加入三倍体积的PCR-A(Axygen:AP-PCR-250G)过柱,离心,12000转/分钟离心1分钟;加入500μL W2,离心1分钟;空转1分钟;加入20μL无RNAase水洗脱。
利用体外转录试剂盒(Ambion,Life Technologies,AM1354)转录,步骤如下:
反应体系为:reaction buffer 1μL;enzyme mix 1μL;A 1μL;T 1μL;G1μL;C 1μL;模板800ng;H2O补齐到10μL。上述体系混匀后37℃反应5个小时。加入1μL DNase,37℃反应15分钟。利用回收试剂盒(Ambion,Life Technologies,AM1908)回收转录的sgRNA,步骤如下:上步反应体积加入90μL Elution solution移植1.5mlEP管;加入350μL Bindingsolution 混匀;加入250μL无水乙醇混匀;上柱;10000转/分钟离心30秒,倒掉废液;加入500μL Washing solution,10000转/分钟离心30秒,倒掉废液;空转1分钟;换收集管,加入100μL Elution solution洗脱;加入10μL醋酸铵(Ambion,Life Technologies,AM1908)混匀;加入275μL 无水乙醇混匀;-20℃放置30分钟,同时准备70%乙醇放置-20℃;4℃环境下13000转/分钟离心15分钟。弃上清,加入500μL 70%乙醇;离心5分钟,吸走废液,晾干5分钟;加入 20μL的水溶解;取1μL测浓度。
1.3XBE3的体外转录
XBE3酶切回收。本步骤是将不同类型的XBE3进行线性化。体系如下:XBE3 10μg;buffer I(NEB:R0539L)10μL;BbsI 4μL(NEB:R0539L);H2O补齐到100μL。混匀之后,37℃酶切过夜。
线性化质粒的回收。酶切产物中加入4μL RNAsecure(Life:AM7005),60℃反应10分钟;利用回收试剂盒(QIAGEN:28004)进行操作其余步骤,加入5倍体积buffer PB,过柱;加入750μL buffer PE离心;空转1分钟;用10μL水洗脱,测定浓度。
体外转录。按照试剂盒(Invitrogen:AM1345)的要求依次加入体系:1入g线性化载体;10μL2XNTP/ARCA;补齐到20μL水;2μL T7ezyme mix;2μL 10xreaction buffer。混合之后37℃反应2小时。加入1μL DNasea反应15分钟。
加尾。转录产物进行加尾处理保证转录mRNA的稳定性。具体体系如下:20μL反应产物;36μL H2O;20μL 5xE-PAP buffer;10μL 25mM MnCl2;10μL ATP solution;4μL PEP。反应体系混匀后37℃反应30分钟。
回收。利用回收试剂盒进行(QIAGEN:74104)。步骤如下:上步反应产物加入350μLbuffer RLT;加入250μL无水乙醇,过柱,离心;加入500μL RPE,离心,加入500μL RPE,离心;空转;加入30μL水洗脱。测定浓度后-80℃保存。
1.4细胞的培养与电转
(1)以HEK293T细胞(购自ATCC)为例,本发明进行真核生物细胞的培养与转染:HEK293T细胞接种培养于添加10%FBS的DMEM高糖培养液中(HyClone,SH30022.01B), 其中含penicillin(100U/ml)和streptomycin(100μg/ml)。
(2)转染前两个小时换成无抗生素的培养基,利用LONZA转染试剂(SF KIT)按照说明书转染,细胞通过计数得1X106个。将XBE3的mRNA两种成分的质量分别为3μg, 1.5μg。电转程序采用DS150,电转后的细胞在6cm的平皿中培养两天。
(3)高通量检测突变效率,图2C所示,三种不同的XBE3对致病位点的编辑有不同的效率。其中EE-XBE3和YE2-XBE3对致病位点C5具有较高的编辑效率,达到50%以上,而YEE-XBE3对致病位点的编辑效率较低。同时,致病位点附近存在其他C,EE-XBE3对 C4有10%的编辑效率,YE2-XBE3有大约20%的编辑效率。因此EE-XBE3对本致病位点具有较好的编辑效率,同时具有较低的非致病位点的编辑效率。
实施例3
本实施例中,对获得的纯合突变细胞株,利用碱基编辑系统对MAN2B1C2248T实现修复。本实施例将利用XABE和ABE质粒结合修复sgRNA进行突变位点的修复,参见图3A-3D。
1.1质粒构建
在突变位点附近,设计修复re-sgRNA,合成oligos,上游序列为:5’-accgCTCCCAGCCATTGCTGTCTG-3’(SEQ ID NO.(17)),下游序列为:5’-aaacCAGACAGCAATGGCtGGGAG-3’(SEQ ID NO.(18)),上下游序列通过程序(95℃,5min; 95℃-85℃at-2℃/s;85℃-25℃at-0.1℃/s;hold at 4℃)退火,连接到经过BsaI(NEB: R0539L)线性化的PGL3-U6sgRNA载体上SEQ ID NO.(19)(addgene:51133)上。线性化体系如下所示:PGL3-U6sgRNA 2μg;buffer(NEB:R0539L)6μL;BsaI 2μL;ddH2O补齐到60μL。37℃酶切过夜。连接体系如下:T4连接buffer(NEB:M0202L)1μL,线性化载体20ng,退火的oligo片段(10μM)5μL,T4连接酶(NEB:M0202L)0.5μL,ddH2O 补齐到10μL.16℃连接过夜。连接的载体通过转化,挑菌,鉴定,鉴定引物上游序列:5’- TTTCCCATGATTCCTTCATA-3’(SEQ ID NO.20),下游序列为相应oligo的下游序列。对阳性克隆摇菌提取质粒(Axygene:AP-MN-P-250G)测定浓度备用。获得的突变质粒命名为re-U6-sgRNA。
1.3细胞的培养与电转
(1)获得的纯合突变细胞株接种培养于添加10%FBS的DMEM高糖培养液中(HyClone,SH30022.01B),其中含penicillin(100U/ml)和streptomycin(100μg/ml)。
(2)转染前两个小时换成无抗生素的培养基,利用LONZA转染试剂(SF KIT)按照说明书转染,细胞通过计数得1X106个。将质粒ABE,sgRNA按照3μg,1.5μg的质量混合。电转程序采用DS150,电转后的细胞在6cm的平皿中培养两天。
1.4修复效率的检测
通过对转染后的细胞进行高通量测序,分析碱基编辑的修复效率。如图3B所示,ABE 因为识别的PAM序列的问题,对致病位点(T6)的编辑效率较低。XABE可以识别NG序列,所以对致病位点具有较高的修复效率。XABE对MAN2B1C2248T是最佳的修复系统。
实施例4
本实施例中,对获得的纯合突变细胞株,利用碱基编辑系统对MAN2B1C2248T实现修复。本实施例将利用XABE的mRNA、蛋白结合修复sgRNA进行突变位点的修复,参见图3A-3D以及图4A和图4B。
1.1蛋白纯化
将XABE质粒的编码区构建到载体pET28a(SEQ ID NO.21优宝生物,VT1207)中。摇菌,8L LB,Kana,大约摇菌4小时(OD=0.8)后,加入IPTG1Mm,,16度摇菌48h。沉菌,离心5000g,20min。重悬,将所有沉菌用bufferA重悬,菌必须完全打散,防止后面破碎的时候堵塞仪器。破碎,将菌液过仪器破碎,直到溶液清亮,一般至少破碎两次。仪器准备,需要清洗3-4遍,高压力部分金属管需要冰浴,仪器使用完毕后需要清洗3-4次。收取10μL全细胞裂解产物,后续western检测。将裂解产物置于50ml离心管中,80000g,40min。收集上清,重复上述步骤,直到颗粒杂质去除干净。0.45um滤器过滤上清,取10μL用于后续western检测,准备开始固相金属亲和层析(Immobilized Metal Affinity Chromatography (IMAC)(钴柱)。钴柱需要用ddH2O洗一遍后,用bufferA润洗几遍。将蛋白样品过钴柱(此次用两个柱子),并收集流出液。重复上述步骤并取10μL样品用于后续western。去杂质,用40mL添加有5mM咪唑的bufferA过柱子,以去除亲和力较低的杂质。收集流出液,并取10μL样品用于后续western。
洗脱,用30ml添加有500mM咪唑的bufferA过柱子,置换出目的蛋白。收集目的蛋白,并取10μL样品用于后续western。洗脱后的钴柱需要用ddH2O清洗,以除去咪唑,之后再用bufferA平衡。western,目的蛋白约160KD,根据蛋白大小配置合适的SDS-PAGE胶, 210V电泳。电泳结束后,将胶割下,置于考马斯亮蓝中,微波炉高温加热1min。之后,用 ddH2O清洗,微波炉加热20min。用水冲洗后,拍照。蛋白浓缩:将洗脱下来的目的蛋白加入到蛋白浓缩柱中,3900rpm,20min。浓缩后的蛋白进行离子交换层析(Ion exchange chromatography(IEC)),以除去与蛋白结合的核酸。离子交换层析的原理即高盐溶液下,这种离子键就会被破坏,从而释放出目的蛋白。层析收集的目的蛋白,经浓缩后进行酶切,以除去His-tag。
1.2质粒构建
在突变位点附近,设计修复re-sgRNA,合成oligos,上游序列为:5’-taggCTCCCAGCCATTGCTGTCTG-3’(SEQ ID NO.(15)),下游序列为:5’-aaacCAGACAGCAATGGCtGGGAG-3’(SEQ ID NO.(16)),上下游序列通过程序(95℃,5min; 95℃-85℃at-2℃/s;85℃-25℃at-0.1℃/s;hold at 4℃)退火,连接到经过BsaI(NEB: R0539L)线性化的PUC57-T7sgRNA载体上(addgene:51132)上。线性化体系如下所示: PUC57-T7sgRNA2μg;buffer(NEB:R0539L)6μL;BsaI 2μL;ddH2O补齐到60μL。37℃酶切过夜。连接体系如下:T4连接buffer(NEB:M0202L)1μL,线性化载体20ng,退火的oligo片段(10μM)5μL,T4连接酶(NEB:M0202L)0.5μL,ddH2O补齐到10μL. 16℃连接过夜。连接的载体通过转化,挑菌,鉴定,鉴定引物上游序列: 5’-CGCCAGGGTTTTCCCAGTCACGAC-3’(SEQ ID NO.12),下游序列为相应oligo的下游序列。对阳性克隆摇菌提取质粒(Axygene:AP-MN-P-250G)测定浓度备用。获得的突变质粒命名为re-T7-sgRNA。
1.2sgRNA的体外转录
以构建的re-T7sgRNA为模板,扩增含有sgRNA的片段,所用引物为:F: 5’-TCTCGCGCGTTTCGGTGATGACGG-3’,(SEQ ID NO.8)R: 5’-AAAAAAAGCACCGACTCGGTGCCACTTTTTC-3’(SEQ ID NO.9)。扩增体系如下: 2Xbuffer(诺唯赞:P505)25μL;dNTP 1μL;F(10pmol/μL)2μL;R(10pmol/μL)2μL;模板1ng;DNA聚合酶(诺唯赞:P505)0.5μL;ddH2O补齐到50μL。扩增出来的PCR 产物经过下述步骤纯化:每100μL体积加4μL RNAsecure(Life:AM7005);60℃15分钟;加入三倍体积的PCR-A(Axygen:AP-PCR-250G)过柱,离心,12000转/分钟离心1分钟;加入500μL W2,离心1分钟;空转1分钟;加入20μL无RNAase水洗脱。
利用体外转录试剂盒(Ambion,Life Technologies,AM1354)转录,步骤如下:
反应体系为:reaction buffer 1μL;enzyme mix 1μL;A 1μL;T 1μL;G1μL;C 1μL;模板800ng;H2O补齐到10μL。上述体系混匀后37℃反应5个小时。加入1μL DNase,37℃反应15分钟。利用回收试剂盒(Ambion,Life Technologies,AM1908)回收转录的sgRNA,步骤如下:上步反应体积加入90μL Elution solution移植1.5mlEP管;加入350μL Bindingsolution 混匀;加入250μL无水乙醇混匀;上柱;10000转/分钟离心30秒,倒掉废液;加入500μL Washing solution,10000转/分钟离心30秒,倒掉废液;空转1分钟;换收集管,加入100μL Elution solution洗脱;加入10μL醋酸铵(Ambion,Life Technologies,AM1908)混匀;加入275μL 无水乙醇混匀;-20℃放置30分钟,同时准备70%乙醇放置-20℃;4℃环境下13000转/分钟离心15分钟。弃上清,加入500μL 70%乙醇;离心5分钟,吸走废液,晾干5分钟;加入 20μL的水溶解;取1μL测浓度。
1.3XABE的体外转录
XABE酶切回收。本步骤是将质粒XABE进行线性化。体系如下:XABE 10μg;buffer I(NEB:R0539L)10μL;BbsI 4μL(NEB:R0539L);H2O补齐到100μL。混匀之后, 37℃酶切过夜。
线性化质粒的回收。酶切产物中加入4μL RNAsecure(Life:AM7005),60℃反应10分钟;利用回收试剂盒(QIAGEN:28004)进行操作其余步骤,加入5倍体积buffer PB,过柱;加入750μL buffer PE离心;空转1分钟;用10μL水洗脱,测定浓度。
体外转录。按照试剂盒(Invitrogen:AM1345)的要求依次加入体系:1入g线性化载体;10μL2XNTP/ARCA;补齐到20μL水;2μL T7ezyme mix;2μL 10xreaction buffer。混合之后37℃反应2小时。加入1μL DNasea反应15分钟。
加尾。转录产物进行加尾处理保证转录mRNA的稳定性。具体体系如下:20μL反应产物;36μL H2O;20μL 5xE-PAP buffer;10μL 25mM MnCl2;10μL ATP solution;4μL PEP。反应体系混匀后37℃反应30分钟。
回收。利用回收试剂盒进行(QIAGEN:74104)。步骤如下:上步反应产物加入350μLbuffer RLT;加入250μL无水乙醇,过柱,离心;加入500μL RPE,离心,加入500μL RPE,离心;空转;加入30μL水洗脱。测定浓度后-80℃保存。
1.4细胞的培养与电转
(1)获得的纯合突变细胞株接种培养于添加10%FBS的DMEM高糖培养液中(HyClone,SH30022.01B),其中含penicillin(100U/ml)和streptomycin(100μg/ml)。
(2)转染前两个小时换成无抗生素的培养基,利用LONZA转染试剂(SF KIT)按照说明书转染,细胞通过计数得1X106个。将XABE的mRNA或蛋白,转录出来的sgRNA 按照3μg,1.5μg的质量混合。电转程序采用DS150,电转后的细胞在6cm的平皿中培养两天。
(3)一部分转染后的细胞分选为单细胞培养,等两周以后,通过裂解鉴定基因型,裂解液的成分为50mM KCl,1.5mM MgCl2,10mM Tris pH 8.0,0.5%Nonidet P-40,0.5%Tween 20,100μg/ml protease K。挑选纯合突变的细胞株扩大培养。另一部分用来鉴定修复效率。通过对转染后的细胞进行高通量测序,分析碱基编辑的修复效率。如图3C所示,XABE 在蛋白、RNA和质粒等三种形式方面,菌可以对致病位点进行修复,而XABE的蛋白的修复效率最高。对分选出来的单细胞,我们也获得了纯合子修复的细胞株(图3D)。
1.6脱靶检测
进一步地,我们对获得的修复克隆所使用的sgRNA潜在的脱靶位点的检测,通过高通量测序发现,在靶位点中没有发现indel,在潜在的脱靶位点中没有发现明显的脱靶(图4A 和图4B)。上述结果证明利用碱基编辑系统可以安全高效的实现甘露糖苷贮积症相关的MAN2B1C2248T突变的修复。
实施例5
本实施例中,我们将对提取出来的HSC细胞,首先利用EE-XBE3制作MAN2B1C2248T突变,再利用XABE结合修复sgRNA对突变的细胞群体进行修复。
1.1HSC细胞的分离与培养
骨髓样本由医院赠送,捐献者已签署知情书。
将获得的大约50mL骨髓标本转移到500mL无菌空瓶中,向其中加入PBS稀释至终体积为280ml,混合均匀。用移液管向8支50mL离心管中各加入15mL淋巴细胞分离液。用移液管向其中每离心管小心、缓慢地加入35mL稀释后的骨髓标本。400g离心30 分钟。用吸痰器吸出第一层液体至略高于淋巴细胞层处,用移液管吸取第二层液体放入新50mL离心管内并补培养基到50mL。300g离心10分钟,去上清。用5mL0.5%BSA PBS重悬细胞,再补至50mL。裂解红细胞,加入人红细胞裂解液1x H-Lyse buffer 10mL,震荡、吹吸混匀室温静置10min。加1x H-wash buffer 10mL颠倒混匀。加0.5%BSA PBS到 50mL,300g离心10分钟,弃上清。对于1x108细胞,将0.3mL 0.5%BSA PBS加入淋巴细胞沉淀中,吹吸重悬,再分别加入0.1mLFcr Blocking Ab和0.1mLCD34-beads。放置于4℃冰箱30分钟,加入20mL0.5%BSAPBS,重悬混匀。300g离心10分钟,去上清。加入0.5% BSA PBS至终体积为4mL。将柱子放在架子上并加入0.5mL 0.5%BSA PBS将柱子润湿,在柱子下方放置离心管用于收集流下液体,待液体停止滴落后向柱子中加入0.5mL细胞悬液。滴完后向柱子中加入0.5mL 0.5%BSAPBS清洗被柱子吸附的细胞,重复5次。将柱子从架子上取下并加入1mL 0.5%BSA PBS,用活塞将液体推至50mL离心管中,重复3次。向离心管中加入0.5%BSA PBS至20mL体积,加到细胞计数器上,显微镜下计数第一遍过柱子的细胞数。300g离心10分钟。根据计数结果,按1x107细胞重悬于4mL 0.5%BSA PBS、过1个柱子计算,重悬至适当体积的0.5%BSA PBS中并过适当数量的柱子。提高CD34+ 细胞比例,重复分选步骤。计数第二遍过柱子的细胞数。根据计数结果,取5x105CD34+ 细胞,0.5%BSA PBS稀释至100μL(实验组),取流过柱子的废液100μl(对照组),CD34、 CD3抗体各加3μl,4℃放半个小时,200μLPBS洗2遍,3500rpm离心3min。重悬于 200μL PBS,进行流式检测,结果发现我们获得的纯度高达97%的CD34+细胞,参见图5A。
1.2蛋白纯化
将EE-XBE3,XABE和ABE质粒的编码区构建到载体pET28a(SEQ ID NO.9优宝生物,VT1207)中。摇菌,8L LB,Kana,大约摇菌4小时(OD=0.8)后,加入IPTG1Mm,,16度摇菌48h。沉菌,离心5000g,20min。重悬,将所有沉菌用bufferA重悬,菌必须完全打散,防止后面破碎的时候堵塞仪器。破碎,将菌液过仪器破碎,直到溶液清亮,一般至少破碎两次。仪器准备,需要清洗3-4遍,高压力部分金属管需要冰浴,仪器使用完毕后需要清洗3-4次。收取10μL全细胞裂解产物,后续western检测。将裂解产物置于50ml离心管中,80000g,40min。收集上清,重复上述步骤,直到颗粒杂质去除干净。0.45um滤器过滤上清,取10μL用于后续western检测,准备开始固相金属亲和层析(Immobilized Metal Affinity Chromatography(IMAC)(钴柱)。钴柱需要用ddH2O洗一遍后,用bufferA润洗几遍。将蛋白样品过钴柱(此次用两个柱子),并收集流出液。重复上述步骤并取10μL样品用于后续 western。去杂质,用40mL添加有5mM咪唑的bufferA过柱子,以去除亲和力较低的杂质。收集流出液,并取10μL样品用于后续western。
洗脱,用30ml添加有500mM咪唑的bufferA过柱子,置换出目的蛋白。收集目的蛋白,并取10μL样品用于后续western。洗脱后的钴柱需要用ddH2O清洗,以除去咪唑,之后再用bufferA平衡。western,目的蛋白约160KD,根据蛋白大小配置合适的SDS-PAGE胶, 210V电泳。电泳结束后,将胶割下,置于考马斯亮蓝中,微波炉高温加热1min。之后,用 ddH2O清洗,微波炉加热20min。用水冲洗后,拍照。蛋白浓缩:将洗脱下来的目的蛋白加入到蛋白浓缩柱中,3900rpm,20min。浓缩后的蛋白进行离子交换层析(Ion exchange chromatography(IEC)),以除去与蛋白结合的核酸。离子交换层析的原理即高盐溶液下,这种离子键就会被破坏,从而释放出目的蛋白。层析收集的目的蛋白,经浓缩后进行酶切,以除去His-tag。
1.3造血干细胞的培养与转染
分离得到的CD34+造血干细胞培养在StemSpan SFEM培养基中,培养基中添加如下细胞因子:100ng/mL的SCF,100ng/mL的Tpo,100ng/mL的Flt3ligand and smallmolecules,1 μM Stem-Regenin 1,和0.5μM MU729。细胞培养在37℃温箱,5%CO2条件,预刺激24小时。将相应的EE-XBE3的蛋白与相应的sgRNA在37℃孵育10分钟,利用LONZA 4Dneclefector进行电转,程序选择EO-100。电转后的细胞继续培养2天。
1.4突变HSC细胞群体的修复
通过对突变的HSC细胞进行检测,发现有大约60%的细胞在致病位点发生了突变,参见图5B。接着,我们利用XABE,ABE和相应的修复sgRNA,在体外形成RNP电转上述获得的突变细胞群体。细胞培养两天后,对修复效率进行检测,发现经过XABE修复后的HSC细胞群体的突变效率讲到了28%左右,而对于ABE,因为其所使用的PAM序列的限制,突变效率只是降到了55%左右(图5B)。上述实验证实,在人的原代细胞中,我们可以利用碱基编辑对致病突变进行修复,本研究将为临床上治疗相应突变引起的甘露糖苷贮积症的患者提供了潜在的治疗方案。
序列表
<110> 国家卫生计生委科学技术研究所
<120> 碱基编辑模拟、修复与甘露糖苷贮积症相关的MAN2B1C2248T突变的试剂和方法
<160> 21
<170> SIPOSequenceListing 1.0
<210> 1
<211> 23
<212> DNA
<213> 人工序列()
<400> 1
ctacacagac agcaatggcc ggg 23
<210> 2
<211> 113
<212> DNA
<213> 人工序列()
<400> 2
ccgttttgac acaccgctgg agacaaaggg acgcttctac acagacagca atggctggga 60
gatcctggag aggaggtggg gggtgactga gagcactgag ggggtggtct gtg 113
<210> 3
<211> 23
<212> DNA
<213> 人工序列()
<400> 3
tggccgggag atcctggaga gga 23
<210> 4
<211> 23
<212> DNA
<213> 人工序列()
<400> 4
acacagacag caatggccgg gag 23
<210> 5
<211> 24
<212> DNA
<213> 人工序列()
<400> 5
taggctacac agacagcaat ggcc 24
<210> 6
<211> 24
<212> DNA
<213> 人工序列()
<400> 6
aaacggccat tgctgtctgt gtag 24
<210> 7
<211> 24
<212> DNA
<213> 人工序列()
<400> 7
cgccagggtt ttcccagtca cgac 24
<210> 8
<211> 24
<212> DNA
<213> 人工序列()
<400> 8
tctcgcgcgt ttcggtgatg acgg 24
<210> 9
<211> 31
<212> DNA
<213> 人工序列()
<400> 9
aaaaaaagca ccgactcggt gccacttttt c 31
<210> 10
<211> 24
<212> DNA
<213> 人工序列()
<400> 10
taggtggccg ggagatcctg gaga 24
<210> 11
<211> 24
<212> DNA
<213> 人工序列()
<400> 11
aaactctcca ggatctcccg gcca 24
<210> 12
<211> 24
<212> DNA
<213> 人工序列()
<400> 12
cgccagggtt ttcccagtca cgac 24
<210> 13
<211> 8532
<212> DNA
<213> 人工序列()
<400> 13
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gagctcagag 420
actggcccag tggctgtgga ccccacattg agacggcgga tcgagcccca tgagtttgag 480
gtattcttcg atccgagaga gctccgcaag gagacctgcc tgctttacga aattaattgg 540
gggggccggc actccatttg gcgacataca tcacagaaca ctaacaagca cgtcgaagtc 600
aacttcatcg agaagttcac gacagaaaga tatttctgtc cgaacacaag gtgcagcatt 660
acctggtttc tcagctggag cccatgcggc gaatgtagta gggccatcac tgaattcctg 720
tcaaggtatc cccacgtcac tctgtttatt tacatcgcaa ggctgtacca ccacgctgac 780
cccgagaatc gacaaggcct ggaagatttg atctcttcag gtgtgactat ccaaattatg 840
actgagcagg agtcaggata ctgctggaga aactttgtga attatagccc gagtaatgaa 900
gcccactggc ctaggtatcc ccatctgtgg gtacgactgt acgttcttga actgtactgc 960
atcatactgg gcctgcctcc ttgtctcaac attctgagaa ggaagcagcc acagctgaca 1020
ttctttacca tcgctcttca gtcttgtcat taccagcgac tgcccccaca cattctctgg 1080
gccaccgggt tgaaaagcgg cagcgagact cccgggacct cagagtccgc cacacccgaa 1140
agtgacaaga agtactccat tgggctcgct atcggcacaa acagcgtcgg ctgggccgtc 1200
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 1260
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 1320
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 1380
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 1440
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 1500
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 1560
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 1620
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 1680
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 1740
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 1800
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 1860
cttatcgccc tgtccctcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 1920
gataccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 1980
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 2040
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 2100
atgatcaagc tctatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 2160
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 2220
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 2280
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 2340
aaacagcgca ctttcgacaa tggaatcatc ccccaccaga ttcacctggg cgaactgcac 2400
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 2460
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 2520
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 2580
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 2640
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 2700
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 2760
tctggagatc agaagaaagc tattgtggac ctcctcttca agacgaaccg gaaagttacc 2820
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 2880
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 2940
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 3000
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 3060
catctcttcg acgacaaagt catgaagcag ctcaagaggc gccgatatac aggatggggg 3120
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 3180
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 3240
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 3300
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 3360
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 3420
atcgagatgg cccgagagaa ccaaaccacc cagaagggac agaagaacag tagggaaagg 3480
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 3540
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 3600
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 3660
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 3720
gataaaaaca gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 3780
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 3840
actaaggctg aacgaggtgg cctgtctgag ttggataaag ccggtttcat caaaaggcag 3900
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 3960
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 4020
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 4080
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 4140
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 4200
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 4260
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 4320
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 4380
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 4440
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 4500
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 4560
tacagtgtac tggttgtggc taaagtggag aaagggaagt ctaaaaaact caaaagcgtc 4620
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 4680
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 4740
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgtgctg 4800
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 4860
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 4920
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 4980
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 5040
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 5100
cctgcagcct tcaagtactt cgacactacc atagacagaa agcggtacac ctctacaaag 5160
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 5220
gacctctctc agctcggtgg agactctggt ggttctacta atctgtcaga tattattgaa 5280
aaggagaccg gtaagcaact ggttatccag gaatccatcc tcatgctccc agaggaggtg 5340
gaagaagtca ttgggaacaa gccggaaagc gatatactcg tgcacaccgc ctacgacgag 5400
agcaccgacg agaatgtcat gcttctgact agcgacgccc ctgaatacaa gccttgggct 5460
ctggtcatac aggatagcaa cggtgagaac aagattaaga tgctctctgg tggttctccc 5520
aagaagaaga ggaaagtcta accggtcatc atcaccatca ccattgagtt taaacccgct 5580
gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 5640
cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 5700
catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca 5760
agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatggctt 5820
ctgaggcgga aagaaccagc tggggctcga taccgtcgac ctctagctag agcttggcgt 5880
aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 5940
tacgagccgg aagcataaag tgtaaagcct agggtgccta atgagtgagc taactcacat 6000
taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 6060
aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 6120
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 6180
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 6240
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 6300
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 6360
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 6420
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 6480
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 6540
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 6600
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 6660
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 6720
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 6780
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 6840
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 6900
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 6960
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 7020
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 7080
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 7140
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 7200
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 7260
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 7320
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 7380
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 7440
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 7500
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 7560
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 7620
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 7680
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 7740
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 7800
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 7860
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 7920
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 7980
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 8040
acgtcgacgg atcgggagat cgatctcccg atcccctagg gtcgactctc agtacaatct 8100
gctctgatgc cgcatagtta agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg 8160
agtagtgcgc gagcaaaatt taagctacaa caaggcaagg cttgaccgac aattgcatga 8220
agaatctgct tagggttagg cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc 8280
gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 8340
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 8400
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 8460
ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 8520
atcaagtgta tc 8532
<210> 14
<211> 8532
<212> DNA
<213> 人工序列()
<400> 14
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gagctcagag 420
actggcccag tggctgtgga ccccacattg agacggcgga tcgagcccca tgagtttgag 480
gtattcttcg atccgagaga gctccgcaag gagacctgcc tgctttacga aattaattgg 540
gggggccggc actccatttg gcgacataca tcacagaaca ctaacaagca cgtcgaagtc 600
aacttcatcg agaagttcac gacagaaaga tatttctgtc cgaacacaag gtgcagcatt 660
acctggtttc tcagctatag cccatgcggc gaatgtagta gggccatcac tgaattcctg 720
tcaaggtatc cccacgtcac tctgtttatt tacatcgcaa ggctgtacca ccacgctgac 780
ccccgcaatc gacaaggcct ggaagatttg atctcttcag gtgtgactat ccaaattatg 840
actgagcagg agtcaggata ctgctggaga aactttgtga attatagccc gagtaatgaa 900
gcccactggc ctaggtatcc ccatctgtgg gtacgactgt acgttcttga actgtactgc 960
atcatactgg gcctgcctcc ttgtctcaac attctgagaa ggaagcagcc acagctgaca 1020
ttctttacca tcgctcttca gtcttgtcat taccagcgac tgcccccaca cattctctgg 1080
gccaccgggt tgaaaagcgg cagcgagact cccgggacct cagagtccgc cacacccgaa 1140
agtgacaaga agtactccat tgggctcgct atcggcacaa acagcgtcgg ctgggccgtc 1200
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 1260
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 1320
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 1380
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 1440
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 1500
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 1560
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 1620
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 1680
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 1740
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 1800
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 1860
cttatcgccc tgtccctcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 1920
gataccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 1980
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 2040
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 2100
atgatcaagc tctatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 2160
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 2220
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 2280
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 2340
aaacagcgca ctttcgacaa tggaatcatc ccccaccaga ttcacctggg cgaactgcac 2400
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 2460
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 2520
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 2580
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 2640
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 2700
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 2760
tctggagatc agaagaaagc tattgtggac ctcctcttca agacgaaccg gaaagttacc 2820
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 2880
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 2940
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 3000
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 3060
catctcttcg acgacaaagt catgaagcag ctcaagaggc gccgatatac aggatggggg 3120
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 3180
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 3240
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 3300
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 3360
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 3420
atcgagatgg cccgagagaa ccaaaccacc cagaagggac agaagaacag tagggaaagg 3480
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 3540
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 3600
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 3660
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 3720
gataaaaaca gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 3780
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 3840
actaaggctg aacgaggtgg cctgtctgag ttggataaag ccggtttcat caaaaggcag 3900
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 3960
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 4020
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 4080
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 4140
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 4200
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 4260
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 4320
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 4380
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 4440
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 4500
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 4560
tacagtgtac tggttgtggc taaagtggag aaagggaagt ctaaaaaact caaaagcgtc 4620
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 4680
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 4740
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgtgctg 4800
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 4860
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 4920
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 4980
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 5040
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 5100
cctgcagcct tcaagtactt cgacactacc atagacagaa agcggtacac ctctacaaag 5160
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 5220
gacctctctc agctcggtgg agactctggt ggttctacta atctgtcaga tattattgaa 5280
aaggagaccg gtaagcaact ggttatccag gaatccatcc tcatgctccc agaggaggtg 5340
gaagaagtca ttgggaacaa gccggaaagc gatatactcg tgcacaccgc ctacgacgag 5400
agcaccgacg agaatgtcat gcttctgact agcgacgccc ctgaatacaa gccttgggct 5460
ctggtcatac aggatagcaa cggtgagaac aagattaaga tgctctctgg tggttctccc 5520
aagaagaaga ggaaagtcta accggtcatc atcaccatca ccattgagtt taaacccgct 5580
gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 5640
cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 5700
catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca 5760
agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatggctt 5820
ctgaggcgga aagaaccagc tggggctcga taccgtcgac ctctagctag agcttggcgt 5880
aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 5940
tacgagccgg aagcataaag tgtaaagcct agggtgccta atgagtgagc taactcacat 6000
taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 6060
aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 6120
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 6180
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 6240
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 6300
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 6360
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 6420
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 6480
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 6540
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 6600
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 6660
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 6720
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 6780
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 6840
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 6900
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 6960
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 7020
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 7080
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 7140
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 7200
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 7260
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 7320
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 7380
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 7440
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 7500
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 7560
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 7620
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 7680
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 7740
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 7800
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 7860
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 7920
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 7980
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 8040
acgtcgacgg atcgggagat cgatctcccg atcccctagg gtcgactctc agtacaatct 8100
gctctgatgc cgcatagtta agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg 8160
agtagtgcgc gagcaaaatt taagctacaa caaggcaagg cttgaccgac aattgcatga 8220
agaatctgct tagggttagg cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc 8280
gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 8340
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 8400
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 8460
ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 8520
atcaagtgta tc 8532
<210> 15
<211> 8532
<212> DNA
<213> 人工序列()
<400> 15
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gagctcagag 420
actggcccag tggctgtgga ccccacattg agacggcgga tcgagcccca tgagtttgag 480
gtattcttcg atccgagaga gctccgcaag gagacctgcc tgctttacga aattaattgg 540
gggggccggc actccatttg gcgacataca tcacagaaca ctaacaagca cgtcgaagtc 600
aacttcatcg agaagttcac gacagaaaga tatttctgtc cgaacacaag gtgcagcatt 660
acctggtttc tcagctacag cccatgcggc gaatgtagta gggccatcac tgaattcctg 720
tcaaggtatc cccacgtcac tctgtttatt tacatcgcaa ggctgtacca ccacgctgac 780
cccgagaatc gacaaggcct ggaggatttg atctcttcag gtgtgactat ccaaattatg 840
actgagcagg agtcaggata ctgctggaga aactttgtga attatagccc gagtaatgaa 900
gcccactggc ctaggtatcc ccatctgtgg gtacgactgt acgttcttga actgtactgc 960
atcatactgg gcctgcctcc ttgtctcaac attctgagaa ggaagcagcc acagctgaca 1020
ttctttacca tcgctcttca gtcttgtcat taccagcgac tgcccccaca cattctctgg 1080
gccaccgggt tgaaaagcgg cagcgagact cccgggacct cagagtccgc cacacccgaa 1140
agtgacaaga agtactccat tgggctcgct atcggcacaa acagcgtcgg ctgggccgtc 1200
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 1260
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 1320
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 1380
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 1440
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 1500
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 1560
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 1620
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 1680
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 1740
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 1800
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 1860
cttatcgccc tgtccctcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 1920
gataccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 1980
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 2040
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 2100
atgatcaagc tctatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 2160
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 2220
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 2280
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 2340
aaacagcgca ctttcgacaa tggaatcatc ccccaccaga ttcacctggg cgaactgcac 2400
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 2460
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 2520
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 2580
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 2640
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 2700
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 2760
tctggagatc agaagaaagc tattgtggac ctcctcttca agacgaaccg gaaagttacc 2820
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 2880
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 2940
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 3000
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 3060
catctcttcg acgacaaagt catgaagcag ctcaagaggc gccgatatac aggatggggg 3120
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 3180
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 3240
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 3300
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 3360
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 3420
atcgagatgg cccgagagaa ccaaaccacc cagaagggac agaagaacag tagggaaagg 3480
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 3540
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 3600
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 3660
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 3720
gataaaaaca gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 3780
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 3840
actaaggctg aacgaggtgg cctgtctgag ttggataaag ccggtttcat caaaaggcag 3900
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 3960
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 4020
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 4080
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 4140
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 4200
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 4260
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 4320
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 4380
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 4440
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 4500
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 4560
tacagtgtac tggttgtggc taaagtggag aaagggaagt ctaaaaaact caaaagcgtc 4620
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 4680
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 4740
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgtgctg 4800
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 4860
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 4920
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 4980
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 5040
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 5100
cctgcagcct tcaagtactt cgacactacc atagacagaa agcggtacac ctctacaaag 5160
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 5220
gacctctctc agctcggtgg agactctggt ggttctacta atctgtcaga tattattgaa 5280
aaggagaccg gtaagcaact ggttatccag gaatccatcc tcatgctccc agaggaggtg 5340
gaagaagtca ttgggaacaa gccggaaagc gatatactcg tgcacaccgc ctacgacgag 5400
agcaccgacg agaatgtcat gcttctgact agcgacgccc ctgaatacaa gccttgggct 5460
ctggtcatac aggatagcaa cggtgagaac aagattaaga tgctctctgg tggttctccc 5520
aagaagaaga ggaaagtcta accggtcatc atcaccatca ccattgagtt taaacccgct 5580
gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 5640
cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 5700
catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca 5760
agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatggctt 5820
ctgaggcgga aagaaccagc tggggctcga taccgtcgac ctctagctag agcttggcgt 5880
aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 5940
tacgagccgg aagcataaag tgtaaagcct agggtgccta atgagtgagc taactcacat 6000
taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 6060
aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 6120
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 6180
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 6240
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 6300
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 6360
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 6420
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 6480
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 6540
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 6600
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 6660
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 6720
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 6780
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 6840
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 6900
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 6960
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 7020
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 7080
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 7140
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 7200
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 7260
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 7320
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 7380
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 7440
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 7500
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 7560
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 7620
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 7680
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 7740
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 7800
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 7860
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 7920
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 7980
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 8040
acgtcgacgg atcgggagat cgatctcccg atcccctagg gtcgactctc agtacaatct 8100
gctctgatgc cgcatagtta agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg 8160
agtagtgcgc gagcaaaatt taagctacaa caaggcaagg cttgaccgac aattgcatga 8220
agaatctgct tagggttagg cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc 8280
gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 8340
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 8400
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 8460
ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 8520
atcaagtgta tc 8532
<210> 16
<211> 8532
<212> DNA
<213> 人工序列()
<400> 16
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gagctcagag 420
actggcccag tggctgtgga ccccacattg agacggcgga tcgagcccca tgagtttgag 480
gtattcttcg atccgagaga gctccgcaag gagacctgcc tgctttacga aattaattgg 540
gggggccggc actccatttg gcgacataca tcacagaaca ctaacaagca cgtcgaagtc 600
aacttcatcg agaagttcac gacagaaaga tatttctgtc cgaacacaag gtgcagcatt 660
acctggtttc tcagctggag cccatgcggc gaatgtagta gggccatcac tgaattcctg 720
tcaaggtatc cccacgtcac tctgtttatt tacatcgcaa ggctgtacca ccacgctgac 780
ccccgcaatc gacaaggcct gcgggatttg atctcttcag gtgtgactat ccaaattatg 840
actgagcagg agtcaggata ctgctggaga aactttgtga attatagccc gagtaatgaa 900
gcccactggc ctaggtatcc ccatctgtgg gtacgactgt acgttcttga actgtactgc 960
atcatactgg gcctgcctcc ttgtctcaac attctgagaa ggaagcagcc acagctgaca 1020
ttctttacca tcgctcttca gtcttgtcat taccagcgac tgcccccaca cattctctgg 1080
gccaccgggt tgaaaagcgg cagcgagact cccgggacct cagagtccgc cacacccgaa 1140
agtgacaaga agtactccat tgggctcgct atcggcacaa acagcgtcgg ctgggccgtc 1200
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 1260
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 1320
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 1380
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 1440
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 1500
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 1560
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 1620
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 1680
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 1740
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 1800
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 1860
cttatcgccc tgtccctcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 1920
gataccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 1980
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 2040
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 2100
atgatcaagc tctatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 2160
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 2220
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 2280
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 2340
aaacagcgca ctttcgacaa tggaatcatc ccccaccaga ttcacctggg cgaactgcac 2400
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 2460
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 2520
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 2580
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 2640
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 2700
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 2760
tctggagatc agaagaaagc tattgtggac ctcctcttca agacgaaccg gaaagttacc 2820
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 2880
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 2940
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 3000
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 3060
catctcttcg acgacaaagt catgaagcag ctcaagaggc gccgatatac aggatggggg 3120
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 3180
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 3240
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 3300
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 3360
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 3420
atcgagatgg cccgagagaa ccaaaccacc cagaagggac agaagaacag tagggaaagg 3480
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 3540
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 3600
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 3660
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 3720
gataaaaaca gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 3780
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 3840
actaaggctg aacgaggtgg cctgtctgag ttggataaag ccggtttcat caaaaggcag 3900
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 3960
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 4020
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 4080
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 4140
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 4200
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 4260
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 4320
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 4380
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 4440
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 4500
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 4560
tacagtgtac tggttgtggc taaagtggag aaagggaagt ctaaaaaact caaaagcgtc 4620
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 4680
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 4740
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgtgctg 4800
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 4860
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 4920
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 4980
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 5040
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 5100
cctgcagcct tcaagtactt cgacactacc atagacagaa agcggtacac ctctacaaag 5160
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 5220
gacctctctc agctcggtgg agactctggt ggttctacta atctgtcaga tattattgaa 5280
aaggagaccg gtaagcaact ggttatccag gaatccatcc tcatgctccc agaggaggtg 5340
gaagaagtca ttgggaacaa gccggaaagc gatatactcg tgcacaccgc ctacgacgag 5400
agcaccgacg agaatgtcat gcttctgact agcgacgccc ctgaatacaa gccttgggct 5460
ctggtcatac aggatagcaa cggtgagaac aagattaaga tgctctctgg tggttctccc 5520
aagaagaaga ggaaagtcta accggtcatc atcaccatca ccattgagtt taaacccgct 5580
gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 5640
cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 5700
catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca 5760
agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatggctt 5820
ctgaggcgga aagaaccagc tggggctcga taccgtcgac ctctagctag agcttggcgt 5880
aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 5940
tacgagccgg aagcataaag tgtaaagcct agggtgccta atgagtgagc taactcacat 6000
taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 6060
aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 6120
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 6180
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 6240
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 6300
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 6360
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 6420
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 6480
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 6540
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 6600
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 6660
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 6720
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 6780
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 6840
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 6900
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 6960
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 7020
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 7080
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 7140
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 7200
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 7260
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 7320
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 7380
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 7440
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 7500
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 7560
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 7620
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 7680
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 7740
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 7800
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 7860
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 7920
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 7980
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 8040
acgtcgacgg atcgggagat cgatctcccg atcccctagg gtcgactctc agtacaatct 8100
gctctgatgc cgcatagtta agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg 8160
agtagtgcgc gagcaaaatt taagctacaa caaggcaagg cttgaccgac aattgcatga 8220
agaatctgct tagggttagg cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc 8280
gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 8340
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 8400
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 8460
ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 8520
atcaagtgta tc 8532
<210> 17
<211> 24
<212> DNA
<213> 人工序列()
<400> 17
accgctccca gccattgctg tctg 24
<210> 18
<211> 24
<212> DNA
<213> 人工序列()
<400> 18
aaaccagaca gcaatggctg ggag 24
<210> 19
<211> 4951
<212> DNA
<213> 人工序列()
<400> 19
ggtaccgatt agtgaacgga tctcgacggt atcgatcacg agactagcct cgagcggccg 60
cccccttcac cgagggccta tttcccatga ttccttcata tttgcatata cgatacaagg 120
ctgttagaga gataattgga attaatttga ctgtaaacac aaagatatta gtacaaaata 180
cgtgacgtag aaagtaataa tttcttgggt agtttgcagt tttaaaatta tgttttaaaa 240
tggactatca tatgcttacc gtaacttgaa agtatttcga tttcttggct ttatatatct 300
tgtggaaagg acgaaacacc gtgagaccga gagagggtct cagttttaga gctagaaata 360
gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga gtcggtgctt 420
tttttaaaga attctcgacc tcgagacaaa tggcagtatt catccacaat tttaaaagaa 480
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 540
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 600
gggacagcag agatccactt tggccgcggc tcgagggggt tggggttgcg ccttttccaa 660
ggcagccctg ggtttgcgca gggacgcggc tgctctgggc gtggttccgg gaaacgcagc 720
ggcgccgacc ctgggactcg cacattcttc acgtccgttc gcagcgtcac ccggatcttc 780
gccgctaccc ttgtgggccc cccggcgacg cttcctgctc cgcccctaag tcgggaaggt 840
tccttgcggt tcgcggcgtg ccggacgtga caaacggaag ccgcacgtct cactagtacc 900
ctcgcagacg gacagcgcca gggagcaatg gcagcgcgcc gaccgcgatg ggctgtggcc 960
aatagcggct gctcagcagg gcgcgccgag agcagcggcc gggaaggggc ggtgcgggag 1020
gcggggtgtg gggcggtagt gtgggccctg ttcctgcccg cgcggtgttc cgcattctgc 1080
aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct 1140
ccccaggggg atccaccgga gcttaccatg accgagtaca agcccacggt gcgcctcgcc 1200
acccgcgacg acgtccccag ggccgtacgc accctcgccg ccgcgttcgc cgactacccc 1260
gccacgcgcc acaccgtcga tccggaccgc cacatcgagc gggtcaccga gctgcaagaa 1320
ctcttcctca cgcgcgtcgg gctcgacatc ggcaaggtgt gggtcgcgga cgacggcgcc 1380
gcggtggcgg tctggaccac gccggagagc gtcgaagcgg gggcggtgtt cgccgagatc 1440
ggcccgcgca tggccgagtt gagcggttcc cggctggccg cgcagcaaca gatggaaggc 1500
ctcctggcgc cgcaccggcc caaggagccc gcgtggttcc tggccaccgt cggcgtctcg 1560
cccgaccacc agggcaaggg tctgggcagc gccgtcgtgc tccccggagt ggaggcggcc 1620
gagcgcgccg gggtgcccgc cttcctggaa acctccgcgc cccgcaacct ccccttctac 1680
gagcggctcg gcttcaccgt caccgccgac gtcgaggtgc ccgaaggacc gcgcacctgg 1740
tgcatgaccc gcaagcccgg tgcctgacgc ccgccccacg acccgcagcg cccgaccgaa 1800
aggagcgcac gaccccatgc atcggtacct ttaagaccaa tgacttacaa ggcagctgta 1860
gatcttagcc actttctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata 1920
cattgatgag tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga 1980
aatttgtgat gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa 2040
caacaattgc attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag 2100
caagtaaaac ctctacaaat gtggtaaaat cgataaggat ccgtcgaccg atgcccttga 2160
gagccttcaa cccagtcagc tccttccggt gggcgcgggg catgactatc gtcgccgcac 2220
ttatgactgt cttctttatc atgcaactcg taggacaggt gccggcagcg ctcttccgct 2280
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 2340
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 2400
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 2460
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 2520
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 2580
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 2640
ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 2700
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 2760
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 2820
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 2880
ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 2940
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 3000
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 3060
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 3120
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 3180
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 3240
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata 3300
actacgatac gggagggctt accatctggc cccagtgctg caatgatacc gcgggaccca 3360
cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga 3420
agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga 3480
gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg 3540
gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga 3600
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt 3660
gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct 3720
cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca 3780
ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat 3840
accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga 3900
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc 3960
aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg 4020
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc 4080
ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 4140
gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 4200
cctgacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 4260
accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 4320
gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 4380
tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 4440
gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 4500
agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 4560
ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 4620
tttaacgcga attttaacaa aatattaacg tttacaattt cccattcgcc attcaggctg 4680
cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca gcccaagcta 4740
ccatgataag taagtaatat taaggtacgg gaggtacttg gagcggccgc aataaaatat 4800
ctttattttc attacatctg tgtgttggtt ttttgtgtga atcgatagta ctaacatacg 4860
ctctccatca aaacaaaacg aaacaaaaca aactagcaaa ataggctgtc cccagtgcaa 4920
gtgcaggtgc cagaacattt ctctatcgat a 4951
<210> 20
<211> 20
<212> DNA
<213> 人工序列()
<400> 20
tttcccatga ttccttcata 20
<210> 21
<211> 5369
<212> DNA
<213> 人工序列()
<400> 21
atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60
ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120
tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttgt 180
cgacggagct cgaattcgga tccgcgaccc atttgctgtc caccagtcat gctagccata 240
tggctgccgc gcggcaccag gccgctgctg tgatgatgat gatgatggct gctgcccatg 300
gtatatctcc ttcttaaagt taaacaaaat tatttctaga ggggaattgt tatccgctca 360
caattcccct atagtgagtc gtattaattt cgcgggatcg agatctcgat cctctacgcc 420
ggacgcatcg tggccggcat caccggcgcc acaggtgcgg ttgctggcgc ctatatcgcc 480
gacatcaccg atggggaaga tcgggctcgc cacttcgggc tcatgagcgc ttgtttcggc 540
gtgggtatgg tggcaggccc cgtggccggg ggactgttgg gcgccatctc cttgcatgca 600
ccattccttg cggcggcggt gctcaacggc ctcaacctac tactgggctg cttcctaatg 660
caggagtcgc ataagggaga gcgtcgagat cccggacacc atcgaatggc gcaaaacctt 720
tcgcggtatg gcatgatagc gcccggaaga gagtcaattc agggtggtga atgtgaaacc 780
agtaacgtta tacgatgtcg cagagtatgc cggtgtctct tatcagaccg tttcccgcgt 840
ggtgaaccag gccagccacg tttctgcgaa aacgcgggaa aaagtggaag cggcgatggc 900
ggagctgaat tacattccca accgcgtggc acaacaactg gcgggcaaac agtcgttgct 960
gattggcgtt gccacctcca gtctggccct gcacgcgccg tcgcaaattg tcgcggcgat 1020
taaatctcgc gccgatcaac tgggtgccag cgtggtggtg tcgatggtag aacgaagcgg 1080
cgtcgaagcc tgtaaagcgg cggtgcacaa tcttctcgcg caacgcgtca gtgggctgat 1140
cattaactat ccgctggatg accaggatgc cattgctgtg gaagctgcct gcactaatgt 1200
tccggcgtta tttcttgatg tctctgacca gacacccatc aacagtatta ttttctccca 1260
tgaagacggt acgcgactgg gcgtggagca tctggtcgca ttgggtcacc agcaaatcgc 1320
gctgttagcg ggcccattaa gttctgtctc ggcgcgtctg cgtctggctg gctggcataa 1380
atatctcact cgcaatcaaa ttcagccgat agcggaacgg gaaggcgact ggagtgccat 1440
gtccggtttt caacaaacca tgcaaatgct gaatgagggc atcgttccca ctgcgatgct 1500
ggttgccaac gatcagatgg cgctgggcgc aatgcgcgcc attaccgagt ccgggctgcg 1560
cgttggtgcg gatatctcgg tagtgggata cgacgatacc gaagacagct catgttatat 1620
cccgccgtta accaccatca aacaggattt tcgcctgctg gggcaaacca gcgtggaccg 1680
cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc ccgtctcact 1740
ggtgaaaaga aaaaccaccc tggcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 1800
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 1860
acgcaattaa tgtaagttag ctcactcatt aggcaccggg atctcgaccg atgcccttga 1920
gagccttcaa cccagtcagc tccttccggt gggcgcgggg catgactatc gtcgccgcac 1980
ttatgactgt cttctttatc atgcaactcg taggacaggt gccggcagcg ctctgggtca 2040
ttttcggcga ggaccgcttt cgctggagcg cgacgatgat cggcctgtcg cttgcggtat 2100
tcggaatctt gcacgccctc gctcaagcct tcgtcactgg tcccgccacc aaacgtttcg 2160
gcgagaagca ggccattatc gccggcatgg cggccccacg ggtgcgcatg atcgtgctcc 2220
tgtcgttgag gacccggcta ggctggcggg gttgccttac tggttagcag aatgaatcac 2280
cgatacgcga gcgaacgtga agcgactgct gctgcaaaac gtctgcgacc tgagcaacaa 2340
catgaatggt cttcggtttc cgtgtttcgt aaagtctgga aacgcggaag tcagcgccct 2400
gcaccattat gttccggatc tgcatcgcag gatgctgctg gctaccctgt ggaacaccta 2460
catctgtatt aacgaagcgc tggcattgac cctgagtgat ttttctctgg tcccgccgca 2520
tccataccgc cagttgttta ccctcacaac gttccagtaa ccgggcatgt tcatcatcag 2580
taacccgtat cgtgagcatc ctctctcgtt tcatcggtat cattaccccc atgaacagaa 2640
atccccctta cacggaggca tcagtgacca aacaggaaaa aaccgccctt aacatggccc 2700
gctttatcag aagccagaca ttaacgcttc tggagaaact caacgagctg gacgcggatg 2760
aacaggcaga catctgtgaa tcgcttcacg accacgctga tgagctttac cgcagctgcc 2820
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 2880
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 2940
ttggcgggtg tcggggcgca gccatgaccc agtcacgtag cgatagcgga gtgtatactg 3000
gcttaactat gcggcatcag agcagattgt actgagagtg caccatatat gcggtgtgaa 3060
ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc gctcttccgc ttcctcgctc 3120
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 3180
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 3240
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 3300
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 3360
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 3420
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 3480
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 3540
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 3600
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 3660
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 3720
agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 3780
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 3840
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 3900
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgaa caataaaact 3960
gtctgcttac ataaacagta atacaagggg tgttatgagc catattcaac gggaaacgtc 4020
ttgctctagg ccgcgattaa attccaacat ggatgctgat ttatatgggt ataaatgggc 4080
tcgcgataat gtcgggcaat caggtgcgac aatctatcga ttgtatggga agcccgatgc 4140
gccagagttg tttctgaaac atggcaaagg tagcgttgcc aatgatgtta cagatgagat 4200
ggtcagacta aactggctga cggaatttat gcctcttccg accatcaagc attttatccg 4260
tactcctgat gatgcatggt tactcaccac tgcgatcccc gggaaaacag cattccaggt 4320
attagaagaa tatcctgatt caggtgaaaa tattgttgat gcgctggcag tgttcctgcg 4380
ccggttgcat tcgattcctg tttgtaattg tccttttaac agcgatcgcg tatttcgtct 4440
cgctcaggcg caatcacgaa tgaataacgg tttggttgat gcgagtgatt ttgatgacga 4500
gcgtaatggc tggcctgttg aacaagtctg gaaagaaatg cataaacttt tgccattctc 4560
accggattca gtcgtcactc atggtgattt ctcacttgat aaccttattt ttgacgaggg 4620
gaaattaata ggttgtattg atgttggacg agtcggaatc gcagaccgat accaggatct 4680
tgccatccta tggaactgcc tcggtgagtt ttctccttca ttacagaaac ggctttttca 4740
aaaatatggt attgataatc ctgatatgaa taaattgcag tttcatttga tgctcgatga 4800
gtttttctaa gaattaattc atgagcggat acatatttga atgtatttag aaaaataaac 4860
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgaaattgta aacgttaata 4920
ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac caataggccg 4980
aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg agtgttgttc 5040
cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa 5100
ccgtctatca gggcgatggc ccactacgtg aaccatcacc ctaatcaagt tttttggggt 5160
cgaggtgccg taaagcacta aatcggaacc ctaaagggag cccccgattt agagcttgac 5220
ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa agcgaaagga gcgggcgcta 5280
gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc gcgcttaatg 5340
cgccgctaca gggcgcgtcc cattcgcca 5369
Claims (2)
1.一种高效模拟、修复MAN2B1C2248T突变的试剂盒,其特征在于,包括构建MAN2B1C2248T突变的碱基编辑系统BE及相应的mt-sgRNA以及针对MAN2B1C2248T突变修复的碱基编辑系统ABE及相应的re-sgRNA,
所述的针对MAN2B1C2248T位点的突变mt-sgRNA的序列为SEQ ID NO.1或SEQ ID NO.3,针对该突变位点的修复re-sgRNA的序列为SEQ ID NO.4。
2.如权利要求1所述的高效模拟、修复MAN2B1C2248T突变的试剂盒,其特征在于,所述的碱基编辑系统为EE-XBE3和XABE。
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910105320.6A CN109777832B (zh) | 2019-02-01 | 2019-02-01 | 碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910105320.6A CN109777832B (zh) | 2019-02-01 | 2019-02-01 | 碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109777832A CN109777832A (zh) | 2019-05-21 |
| CN109777832B true CN109777832B (zh) | 2020-08-04 |
Family
ID=66503958
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910105320.6A Active CN109777832B (zh) | 2019-02-01 | 2019-02-01 | 碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109777832B (zh) |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2016094880A1 (en) * | 2014-12-12 | 2016-06-16 | The Broad Institute Inc. | Delivery, use and therapeutic applications of crispr systems and compositions for genome editing as to hematopoietic stem cells (hscs) |
| EP3247808B1 (en) * | 2015-01-21 | 2021-05-05 | Fred Hutchinson Cancer Research Center | Point-of-care and/or portable platform for gene therapy |
| HK1253403A1 (zh) * | 2015-05-28 | 2019-06-14 | Coda Biotherapeutics | 基因組編輯載體 |
| US20210222164A1 (en) * | 2016-06-29 | 2021-07-22 | The Broad Institute, Inc. | Crispr-cas systems having destabilization domain |
| CN109803977B (zh) * | 2016-08-17 | 2023-03-17 | 菲克特生物科学股份有限公司 | 核酸产品及其施用方法 |
-
2019
- 2019-02-01 CN CN201910105320.6A patent/CN109777832B/zh active Active
Also Published As
| Publication number | Publication date |
|---|---|
| CN109777832A (zh) | 2019-05-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111295391B (zh) | 腺病毒及其用途 | |
| AU2017355507B2 (en) | Novel plant cells, plants, and seeds | |
| US20220333137A1 (en) | Methods and products for producing engineered mammalian cell lines with amplified transgenes | |
| KR102147005B1 (ko) | Fad2 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질 | |
| CN107164377A (zh) | 基于碱基编辑的基因敲除方法及其应用 | |
| JP2024083457A (ja) | レンチウイルスベクターのバイオ生産法 | |
| KR20160029124A (ko) | Pd-1 항원 또는 pd-1 리간드 항원을 포함하는 바이러스 유사 입자 | |
| CN113874501A (zh) | 使用碱基编辑器进行靶向诱变 | |
| CN113831394A (zh) | 一种非洲猪瘟病毒asfv基因的重组病毒组合及由其制备的疫苗 | |
| CN112639109A (zh) | 无血清培养基中的载体生产 | |
| CN111394268B (zh) | 基因工程菌及其构建方法、应用,生产nad+的方法 | |
| US7339030B2 (en) | Human semaphorin L (H-SemaL) and corresponding semaphorins in other species | |
| CN110199028A (zh) | 用于ii型黏多糖贮积症的基因治疗 | |
| WO1992017581A1 (en) | Mammalian expression vector | |
| KR20140004744A (ko) | 시클로클라빈의 생합성을 위한 유전자 클러스터 | |
| CN106834334A (zh) | 一种用于酵母表面展示抗体Fab片段的质粒 | |
| CN109777832B (zh) | 碱基编辑模拟、修复与甘露糖苷贮积症相关的man2b1c2248t突变的试剂和方法 | |
| CN108138162A (zh) | 重组细胞,重组细胞的制造方法以及有机化合物的生产方法 | |
| CN114426986B (zh) | 一种在赤霉菌中建立的基于CRISPR技术整合的URA-Blaster方法 | |
| CN107034233B (zh) | 一种内源性启动子驱动外源基因表达的方法 | |
| CN109652325B (zh) | δ整合并分泌表达纤维素酶的酿酒酵母工业菌株及应用 | |
| CN116940374A (zh) | 用于疫苗生产以防范冠状病毒的全合成的长链核酸 | |
| RU2823437C2 (ru) | Лечение и/или профилактика заболевания или синдрома, связанного с вирусной инфекцией | |
| CN111560392B (zh) | miRNA表达载体及其应用 | |
| Hung | Structural and functional studies of the Drosophila melanogaster snRNA activating protein complex (DmSNAPc) |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| CB02 | Change of applicant information |
Address after: 100081 Beijing city Haidian District Dahui Temple Road, No. 12 Applicant after: Institute of Science and Technology, National Health Commission Address before: 100081 Beijing city Haidian District Dahui Temple Road, No. 12 Applicant before: RESEARCH INSTITUTE OF PRC NATIONAL HEALTH AND FAMILY PLANNING COMMISSION |
|
| CB02 | Change of applicant information | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |