CN114561406A - 一种参与阳离子肽化合物合成的基因及其用途 - Google Patents
一种参与阳离子肽化合物合成的基因及其用途 Download PDFInfo
- Publication number
- CN114561406A CN114561406A CN202210442081.5A CN202210442081A CN114561406A CN 114561406 A CN114561406 A CN 114561406A CN 202210442081 A CN202210442081 A CN 202210442081A CN 114561406 A CN114561406 A CN 114561406A
- Authority
- CN
- China
- Prior art keywords
- leu
- amino acid
- acid sequence
- glu
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/08—Linear peptides containing only normal peptide links having 12 to 20 amino acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y603/00—Ligases forming carbon-nitrogen bonds (6.3)
- C12Y603/02—Acid—amino-acid ligases (peptide synthases)(6.3.2)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明涉及微生物及分子生物学技术,具体涉及一种参与阳离子肽化合物合成的基因及其用途,能够构建一种由芽孢杆菌属(Bacillus Cohn)细菌产生的阳离子肽化合物的合成系统,从而可以高效地制造该阳离子肽化合物。
Description
技术领域
本发明涉及微生物及分子生物学技术,具体涉及一种参与阳离子肽化合物合成的基因及其用途。
背景技术
耐药细菌已经成为威胁全球公共卫生的重要问题。世界卫生组织(WHO)倡议建立了全球抗生素研究和发展伙伴关系(GARDP),以加速开发新的和改进的抗生素来应对耐药性细菌感染。
目前市场上的常用抗生素为喹诺酮类、β-内酰胺类、大环内酯类和氨基糖苷类等,但耐药细菌已经对上述药物具有耐受性。后来开发了糖肽和脂肽类化合物,如万古霉素、多粘菌素、达托霉素等,以优异的抗菌性受到广泛关注。但已有研究发现万古霉素及多粘菌素的耐受基因,在全球引起高度关注,因此急需寻找新型抗菌剂。
阳离子肽化合物是一类由脂肪酸链及氨基酸衍生单体组成的抗菌肽,主要通过非核糖体肽合成酶(Non-ribosomal peptide synthase,NRPS)途径生物合成。阳离子肽化合物独有的两亲性质及带电性决定了其主要通过破坏细胞膜完整性造成细菌死亡的抗菌机制,具有不易引起细菌耐受性的优点,是极具良好应用前景的抗菌剂产品。
发明内容
因此,本发明要解决的技术问题在于提供一种参与阳离子肽化合物合成的基因及其用途。
一种参与阳离子肽化合物合成的基因,所述基因编码的蛋白质具有由芽孢杆菌属(Bacillus Cohn)细菌产生的阳离子肽化合物的非核糖体肽合成活性及聚合酮酶合成活性,所述蛋白质从N末端侧起依次包含如下的模块:
第一模块:其从N末端侧起依次包含:第一腺苷酰化结构域,其包含如SEQ ID NO:1所示的氨基酸序列或与SEQ ID NO:1所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一肽基载体蛋白结构域,其包含如SEQ ID NO:2所示的氨基酸序列或与SEQ ID NO:2所示的氨基酸序列具有70%以上同一性的氨基酸序列;酮还原酶结构域,其包含如SEQ ID NO:3所示的氨基酸序列或与SEQ ID NO:3所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一缩合结构域,其包含如SEQ ID NO:4所示的氨基酸序列或与SEQ ID NO:4所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第二模块:其从N末端侧起依次包含:第二缩合结构域,其包含如SEQ ID NO:5所示的氨基酸序列或与SEQ ID NO:5所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二腺苷酰化结构域,其包含如SEQ ID NO:6所示的氨基酸序列或与SEQ ID NO:6所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二肽基载体蛋白结构域,其包含如SEQ ID NO:7所示的氨基酸序列或与SEQ ID NO:7所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第三模块:其从N末端侧起依次包含:第三缩合结构域,其包含如SEQ ID NO:8所示的氨基酸序列或与SEQ ID NO:8所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三腺苷酰化结构域,其包含如SEQ ID NO:9所示的氨基酸序列或与SEQ ID NO:9所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三肽基载体蛋白结构域,其包含如SEQ ID NO:10所示的氨基酸序列或与SEQ ID NO:10所示的氨基酸序列具有70%以上同一性的氨基酸序列;终端还原结构域,其包含如SEQ ID NO:11所示的氨基酸序列或与SEQ ID NO:11所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第四模块:其从N末端侧起依次包含:第四缩合结构域,其包含如SEQ ID NO:12所示的氨基酸序列或与SEQ ID NO:12所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四腺苷酰化结构域,其包含如SEQ ID NO:13所示的氨基酸序列或与SEQ ID NO:13所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四肽基载体蛋白结构域,其包含如SEQ IDNO:14所示的氨基酸序列或与SEQ ID NO:14所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第五模块:其从N末端侧起依次包含:第五缩合结构域,其包含如SEQ ID NO:15所示的氨基酸序列或与SEQ ID NO:15所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五腺苷酰化结构域,其包含如SEQ ID NO:16所示的氨基酸序列或与SEQ ID NO:16所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五肽基载体蛋白结构域,其包含如SEQ IDNO:17所示的氨基酸序列或与SEQ ID NO:17所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一差向异构酶结构域,其包含如SEQ ID NO:18所示的氨基酸序列或与SEQ IDNO:18所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第六模块:其从N末端侧起依次包含:第六缩合结构域,其包含如SEQ ID NO:19所示的氨基酸序列或与SEQ ID NO:19所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六腺苷酰化结构域,其包含如SEQ ID NO:20所示的氨基酸序列或与SEQ ID NO:20所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六肽基载体蛋白结构域,其包含如SEQ IDNO:21所示的氨基酸序列或与SEQ ID NO:21所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第七模块:其从N末端侧起依次包含:第七缩合结构域,其包含如SEQ ID NO:22所示的氨基酸序列或与SEQ ID NO:22所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七腺苷酰化结构域,其包含如SEQ ID NO:23所示的氨基酸序列或与SEQ ID NO:23所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七肽基载体蛋白结构域,其包含如SEQ IDNO:24所示的氨基酸序列或与SEQ ID NO:24所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二差向异构酶结构域,其包含如SEQ ID NO:25所示的氨基酸序列或与SEQ IDNO:25所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第八模块:其从N末端侧起依次包含:第八缩合结构域,其包含如SEQ ID NO:26所示的氨基酸序列或与SEQ ID NO:26所示的氨基酸序列具有70%以上同一性的氨基酸序列;第八腺苷酰化结构域,其包含如SEQ ID NO:27所示的氨基酸序列或与SEQ ID NO:27所示的氨基酸序列具有70%以上同一性的氨基酸序列;第八肽基载体蛋白结构域,其包含如SEQ IDNO:28所示的氨基酸序列或由SEQ ID NO:28所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第九模块:其从N末端侧起依次包含:第九缩合结构域,其包含如SEQ ID NO:29所示的氨基酸序列或与SEQ ID NO:29所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九腺苷酰化结构域,其包含如SEQ ID NO:30所示的氨基酸序列或与SEQ ID NO:30所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九肽基载体蛋白结构域,其包含如SEQ IDNO:31所示的氨基酸序列或与SEQ ID NO:31所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十模块:其从N末端侧起依次包含:第十缩合结构域,其包含如SEQ ID NO:32所示的氨基酸序列或与SEQ ID NO:32所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十腺苷酰化结构域,其包含如SEQ ID NO:33所示的氨基酸序列或与SEQ ID NO:33所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十肽基载体蛋白结构域,其包含如SEQ IDNO:34所示的氨基酸序列或与SEQ ID NO:34所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十一模块:其从N末端侧起依次包含:第十一缩合结构域,其包含如SEQ ID NO:35所示的氨基酸序列或与SEQ ID NO:35所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一腺苷酰化结构域,其包含如SEQ ID NO:36所示的氨基酸序列或与SEQ ID NO:36所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一肽基载体蛋白结构域,其包含如SEQ ID NO:37所示的氨基酸序列或与SEQ ID NO:37所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三差向异构酶结构域,其包含如SEQ ID NO:38所示的氨基酸序列或与SEQ ID NO:38所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十二模块:其从N末端侧起依次包含:第十二缩合结构域,其包含如SEQ ID NO:39所示的氨基酸序列或与SEQ ID NO:39所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二腺苷酰化结构域,其包含如SEQ ID NO:40所示的氨基酸序列或与SEQ ID NO:40所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二肽基载体蛋白结构域,其包含如SEQ ID NO:41所示的氨基酸序列或与SEQ ID NO:41所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十三模块:其从N末端侧起依次包含:第十三缩合结构域,其包含如SEQ ID NO:42所示的氨基酸序列或与SEQ ID NO:42所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三腺苷酰化结构域,其包含如SEQ ID NO:43所示的氨基酸序列或与SEQ ID NO:43所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三肽基载体蛋白结构域,其包含如SEQ ID NO:44所示的氨基酸序列或与SEQ ID NO:44所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四差向异构酶结构域,其包含如SEQ ID NO:45所示的氨基酸序列或与SEQ ID NO:45所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十四模块:其从N末端侧起依次包含:第十四腺苷酰化结构域,其包含如SEQ IDNO:46所示的氨基酸序列或与SEQ ID NO:46所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十四肽基载体蛋白结构域,其包含如SEQ ID NO:47所示的氨基酸序列或与SEQID NO:47所示的氨基酸序列具有70%以上同一性的氨基酸序列。
在本发明中,腺苷酰化结构域(adenylation domain) 简称为A结构域,肽基载体蛋白结构域(peptidyl carrier protein domain)简称为PCP结构域,缩合结构域(condesation domain)简称为C结构域,酮还原酶结构域(ketoreductase domain, KRdomain)简称为KR结构域,终端还原酶结构域(Terminal reductase domain, TD domain)简称为TD结构域,差向异构酶结构域(epimerization domain, E domain)简称为E结构域。
其中,第一模块中,如果分别作为第一A结构域、第一PCP结构域、KR结构域和第一C结构域发挥功能,也可以是与SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:2和SEQ ID NO:4所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第一A结构域更优选为和第一A结构域中与识别底物相关的关键位点残基(又称特异性代码:specificity-conferring code)相同的氨基酸序列。
第二模块中,如果分别作为第二C结构域、第二A结构域和第二PCP结构域发挥功能,则也可以是与SEQ ID NO:5、SEQ ID NO:6和SEQ ID NO:7所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第二A结构域更优选为和第二A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第三模块中,如果分别作为第三C结构域、第三A结构域和第三PCP结构域以及TD结构域发挥功能,则也可以是与SEQ ID NO:8、SEQ ID NO:9、SEQ ID NO:10和SEQ IDNO:11所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第三A结构域更优选为和第三A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第四模块中,如果分别作为第四C结构域、第四A结构域和第四PCP结构域发挥功能,则也可以是与SEQ ID NO:12、SEQ ID NO:13和SEQ ID NO:14所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,A结构域更优选为和第四A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第五模块中,如果分别作为第五C结构域、第五A结构域和第五PCP结构域以及第一E结构域发挥功能,则也可以是与SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17和SEQID NO:18所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第五A结构域更优选为和第五A结构域中与识别底物相关特异性代码相同的氨基酸序列。
其中,第六模块中,如果分别作为第六C结构域、第六A结构域和第六PCP结构域发挥功能,则也可以是与SEQ ID NO:19、SEQ ID NO:20和SEQ ID NO:21所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第六A结构域更优选为和第六A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第七模块中,如果分别作为第七C结构域、第七A结构域和第七PCP结构域以及第二E结构域发挥功能,则也可以是与SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24和SEQID NO:25所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第七A结构域更甚优选与第七A结构域中与识别底物相关特异性代码相同的氨基酸序列。
其中,第八模块中,如果分别作为第八C结构域、第八A结构域和第八PCP结构域发挥功能,则也可以是与SEQ ID NO:26、SEQ ID NO:27和SEQ ID NO:28所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第八A结构域更甚优选与第八A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第九模块中,如果分别作为第九C结构域、第九A结构域和第九PCP结构域发挥功能,则也可以是与SEQ ID NO:29、SEQ ID NO:30和SEQ ID NO:31所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第九A结构域更甚优选与第九A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第十模块中,如果分别作为第十C结构域、第十A结构域和第十PCP结构域发挥功能,则也可以是与SEQ ID NO:32、SEQ ID NO:33和SEQ ID NO:34所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十A结构域更甚优选与第十A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第十一模块中的第十一C结构域、第十一A结构域和第十一PCP结构域以及第三E结构域各自并不限于SEQ ID NO:35、SEQ ID NO:36、SEQ ID NO:37和SEQ ID NO:38所示的氨基酸序列,如果分别作为第十一C结构域、第十一A结构域和第十一PCP结构域以及第三E结构域发挥功能,则也可以是与SEQ ID NO:35、SEQ ID NO:36、SEQ ID NO:37和SEQ IDNO:38所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十一A结构域更甚优选与第十一A结构域中与识别底物相关特异性代码相同的氨基酸序列。
其中,第十二模块,如果分别作第十二C结构域、第十二A结构域和第十二PCP结构域发挥功能,则也可以是与SEQ ID NO:39、SEQ ID NO:40和SEQ ID NO:41所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十二A结构域更甚优选与第十二A结构域中与识别底物相关的特异性代码相同的氨基酸序列。
其中,第十三模块中,如果分别作为第十三C结构域、第十三A结构域和第十三PCP结构域以及第四E结构域发挥功能,则也可以是与SEQ ID NO:42、SEQ ID NO:43、SEQ IDNO:44和SEQ ID NO:45所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十三A结构域更甚优选与第十三A结构域中与识别底物相关特异性代码相同的氨基酸序列。
其中,第十四模块,如果分别作为第十四A结构域和第十四PCP结构域发挥功能,则也可以是与SEQ ID NO:46和SEQ ID NO:47所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十四A结构域更甚优选与第十四A结构域中与识别底物相关特异性代码相同的氨基酸序列。
可选的,所述蛋白质为如下(a)-(n)中的任意一种:
(a)包含如SEQ ID NO:48所示的氨基酸序列或包含与SEQ ID NO:48所示的氨基酸序列具有70%以上同一性的氨基酸序列;
(b)包含如SEQ ID NO:49所示的氨基酸序列或包含与SEQ ID NO:49所示的氨基酸序列具有70%以上同一性的氨基酸序列;
(c)包含如SEQ ID NO:50所示的氨基酸序列或包含与SEQ ID NO:50所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(d)包含如SEQ ID NO:51所示的氨基酸序列或包含与SEQ ID NO:51所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(e)包含如SEQ ID NO:52所示的氨基酸序列或包含与SEQ ID NO:52所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(f)包含如SEQ ID NO:53所示的氨基酸序列或包含与SEQ ID NO:53所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(g)包含如SEQ ID NO:54所示的氨基酸序列或包含与SEQ ID NO:54所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(h)包含如SEQ ID NO:55所示的氨基酸序列或包含与SEQ ID NO:55所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(i)包含如SEQ ID NO:56所示的氨基酸序列或包含与SEQ ID NO:56所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(j)包含如SEQ ID NO:57所示的氨基酸序列或包含与SEQ ID NO:57所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(k)包含如SEQ ID NO:58所示的氨基酸序列或包含与SEQ ID NO:58所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(l)包含如SEQ ID NO:59所示的氨基酸序列或包含与SEQ ID NO:59所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(m)包含如SEQ ID NO:60所示的氨基酸序列或包含与SEQ ID NO:60所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(n)包含有多核苷酸编码的蛋白质,所述多核苷酸在严格条件下与如SEQ ID NO:61-SEQ ID NO:67任一所示的核苷酸序列的互补链杂交,且具有由芽孢杆菌属细菌产生的阳离子肽化合物的非核糖体肽合成活性及聚合酮酶合成活性。
可选的,所述基因来源于芽孢杆菌属细菌,包括但不限于侧孢短芽孢杆菌(Brevibacillus laterosporus)、短小芽孢杆菌(Bacillus pumilus Meyer and Gottheil)、枯草芽孢杆菌(Bacillus subtilis)、蜡样芽孢杆菌(Bacillus cereus)或苏云金芽孢杆菌(Bacillus thuringiensis)。
可选的,所述基因来源于侧孢短芽孢杆菌Brevibacillus laterosporus S62-9,所述侧孢短芽孢杆菌Brevibacillus laterosporus S62-9的保藏编号为CGMCC No.18629。
可选的,所述阳离子肽化合物是指由芽孢杆菌属细菌产生的阳离子肽化合物。
可选的,所述阳离子肽化合物是指侧孢短芽孢杆菌产生的阳离子肽化合物。
可选的,所述阳离子肽化合物的化学式(1)如下:
(1)
其中,R3选自甲硫氨酸残基、缬氨酸残基、亮氨酸残基或异亮氨酸侧链基团,R5选自缬氨酸残基、亮氨酸残基或异亮氨酸侧链基团。
一种生物材料,包括如下任意一种:
(1)多肽,由所述的基因编码;
(2)含有所述的基因的表达盒;
(3)含有所述的基因或(2)中所述的表达盒的重组载体;
(4)含有所述的基因、(2)中所述的表达盒或(3)中所述的重组载体的宿主细胞。
可选的,所述宿主细胞包括但不限于芽孢杆菌属细菌,大肠杆菌(Escherichia coli),或毕赤酵母(Pichia pastoris)。
可选的,所述宿主细胞为侧孢短芽孢杆菌Brevibacillus laterosporus S62-9,所述侧孢短芽孢杆菌Brevibacillus laterosporus S62-9的保藏编号为CGMCC No.18629,保藏日期为2019年9月27日,保藏单位为中国微生物菌种保藏管理委员会普通微生物中心(CGMCC)。
一种阳离子肽化合物的制备方法,将所述的基因导入宿主细胞中,培养,从获得的培养液中分离。
所述的基因或所述的生物材料在制备阳离子肽化合物的用途。
本发明技术方案,具有如下优点:
1.本发明提供的参与阳离子肽化合物合成的基因,能够构建一种由芽孢杆菌属(Bacillus Cohn)细菌产生的阳离子肽化合物的合成系统,从而可以高效地制造该阳离子肽化合物。
本发明中侧孢短芽孢杆菌Brevibacillus laterosporus S62-9,所述侧孢短芽孢杆菌Brevibacillus laterosporus S62-9的保藏编号为CGMCC No.18629,侧孢短芽孢杆菌Brevibacillus laterosporus S62-9于2019年9月27日在中国微生物菌种保藏管理委员会普通微生物中心保藏,保藏地址:北京市朝阳区北辰西路1号院3号,中国科学院微生物研究所,邮编100101。
附图说明
为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1 是本发明的参与阳离子肽合成的基因中的模块结构和结构域结构示意图;
图2 是本发明实施例3中使用antiSMASH程序并通过antiSMASH对实施例中推定的NRPS结构域结构进行分析的结果图;图中椭圆为PCP结构域,如Gene0851中的C结构域左侧的椭圆即为PCP结构域;
图3是本发明实施例4中表示本发明的参与阳离子肽化合物合成基因(推定的阳离子脂肽化合物合成簇)的上游和下游的基因blastp检索结果图;
图4是本发明实施例5中表示本发明的参与阳离子肽化合物生物合成的基因区域的图;
图5是本发明实施例6中第一模块A结构域检测结果;
图6是本发明实施例6中第二模块A结构域检测结果;
图7是本发明实施例6中第三模块A结构域检测结果;
图8是本发明实施例6中第四模块A结构域检测结果;
图9是本发明实施例6中第五模块A结构域检测结果;
图10是本发明实施例6中第六模块A结构域检测结果;
图11是本发明实施例6中第七模块A结构域检测结果;
图12是本发明实施例6中第八模块A结构域检测结果;
图13是本发明实施例6中第九模块A结构域检测结果;
图14是本发明实施例6中第十模块A结构域检测结果;
图15是本发明实施例6中第十一模块A结构域检测结果;
图16是本发明实施例6中第十二模块A结构域检测结果;
图17是本发明实施例6中第十三模块A结构域检测结果;
图18是本发明实施例6中第十四模块A结构域检测结果;
图19是本发明实施例6中第一模块合成残基(1)2-羟基-3甲基戊酸的过程;
图20是本发明实施例6中第十四模块和L-苏氨酸-3-脱氢酶协作合成残基(2)2-氨基-2烯丁酸的过程;
图21是本发明实施例6中阳离子肽化合物结构示意图。
具体实施方式
提供下述实施例是为了更好地进一步理解本发明,并不局限于所述最佳实施方式,不对本发明的内容和保护范围构成限制,任何人在本发明的启示下或是将本发明与其他现有技术的特征进行组合而得出的任何与本发明相同或相近似的产品,均落在本发明的保护范围之内。
实施例中未注明具体实验步骤或条件者,按照本领域内的文献所描述的常规实验步骤的操作或条件即可进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规试剂产品。
本发明所述的参与阳离子肽化合物合成的基因,是指参与阳离子肽化合物合成的一组基因(基因簇)中所含有的各个基因,所述阳离子肽化合物是由芽孢杆菌属(Bacillus Cohn)细菌产生的。该阳离子肽化合物如专利ZL202010248666.4中的活性肽A、活性肽B、活性肽C、活性肽D或活性肽E,其通式如下式(1)表示:
(1)
其中,R3选自甲硫氨酸残基、缬氨酸残基、亮氨酸残基或异亮氨酸侧链基团,R5选自缬氨酸残基、亮氨酸残基或异亮氨酸侧链基团。
产生上述阳离子肽化合物的芽孢杆菌属(Bacillus Cohn),代表性地可举出侧孢短芽孢杆菌(Brevibacillus laterosporus),此外还可举出:短小芽孢杆菌(Bacillus pumilus Meyer and Gottheil)、枯草芽孢杆菌(Bacillus subtilis)、蜡样芽孢杆菌(Bacillus cereus)、苏云金芽孢杆菌(Bacillus thuringiensis)等。此外,所谓侧孢短芽孢杆菌,可以举出从中国江苏无锡土壤中分离出的侧孢短芽孢杆菌Brevibacillus laterosporus S62-9株。侧孢短芽孢杆菌Brevibacillus laterosporus S62-9于2019年9月27日在中国微生物菌种保藏管理委员会普通微生物中心(CGMCC)保藏,保藏编号为CGMCCNo.18629,保藏地址:北京市朝阳区北辰西路1号院3号,中国科学院微生物研究所,邮编100101。
实施例1 参与阳离子肽化合物合成的基因的获取
参与阳离子肽化合物合成的一组基因,可以定义为含有11种基因。这11种基因为硫酯酶蛋白基因、聚酮合酶基因、酮还原酶基因、非核糖体肽合成酶基因(NRPS基因)(6种)、L-苏氨酸-3-脱氢酶基因、2-氨基-3-酮丁酸CoA连接酶基因。
在上述的一组基因中,NRPS基因编码的酶具有形成该阳离子肽化合物的基本骨架的功能。即该酶形成由13个氨基酸、一个脂肪酸和一个氨基醇组成的肽骨架:[3-甲基-2-氧代戊酸(α-KIL)-脱氢苏氨酸(Thr)-甲硫氨酸(Met)-鸟氨酸(Orn)-异亮氨酸(Ile)-缬氨酸(Val)-缬氨酸(Val)-赖氨酸(Lys)-缬氨酸(Val)-亮氨酸(Leu)-赖氨酸(Lys)-酪氨酸(Tyr)-亮氨酸(Leu)-缬氨酸(Val)]。该NRPS基因编码的酶具有以下活性:在3-甲基-2-氧代戊酸的羧基与脱氢苏氨酸的氨基之间形成肽键;在脱氢苏氨酸的羧基与甲硫氨酸的氨基之间形成肽键;在甲硫氨酸的羧基与鸟氨酸的氨基之间形成肽键;在鸟氨酸的羧基与异亮氨酸的氨基之间形成肽键;在异亮氨酸的羧基与缬氨酸的氨基之间形成肽键;在缬氨酸的羧基与缬氨酸的氨基之间形成肽键;在缬氨酸的羧基与赖氨酸的氨基之间形成肽键;在赖氨酸的羧基与缬氨酸的氨基之间形成肽键;在缬氨酸的羧基与亮氨酸的氨基之间形成肽键;在亮氨酸的羧基与赖氨酸的氨基之间形成肽键;在赖氨酸的羧基与酪氨酸的氨基之间形成肽键;在酪氨酸的羧基与亮氨酸的氨基之间形成肽键;在亮氨酸的羧基与缬氨醇的氨基之间形成肽键。此外,该酶具有以下活性:将3-甲基-2-氧代戊酸中的酮基还原成羟基;将鸟氨酸差向异构化为D构型;将第一个赖氨酸差向异构化为D构型;将第一个亮氨酸差向异构化为D构型,将酪氨酸差向异构化为D构型;将最后一个缬氨酸的羧基还原成醛基,形成缬氨醛。
上述一组基因中,L-苏氨酸-3-脱氢酶基因编码的酶具有修饰该阳离子肽化合物的基本骨架的功能。即该酶具有以下活性:将骨架中的苏氨酸脱氢形成2-氨基-2烯丁酸。
上述一组基因中,酮还原酶基因编码的酶具有修饰该阳离子肽化合物的基本骨架的功能。即该酶具有以下活性:将骨架由NRPS基因编码的酶还原形成的缬氨醛再次还原,最终形成缬氨醇。
该聚酮合酶基因和NRPS基因编码的合成酶具有14个模块,对应于上述基本肽骨架:[3-甲基-2-氧代戊酸(α-KIL)-脱氢苏氨酸(Thr)-甲硫氨酸(Met)-鸟氨酸(Orn)-异亮氨酸(Ile)-缬氨酸(Val)-缬氨酸(Val)-赖氨酸(Lys)-缬氨酸(Val)-亮氨酸(Leu)-赖氨酸(Lys)-酪氨酸(Tyr)-亮氨酸(Leu)-缬氨酸(Val)]。各模块具有A结构域(腺苷酰化结构域:adenylation domain),其识别选择目的氨基酸并在ATP存在下使目的氨基酸和AMP(单磷酸腺苷)结合生成氨酰AMP。此外,各模块具有PCP结构域(肽基载体蛋白结构域:peptidylcarrier protein domain),其连接磷酸泛酰巯基乙胺臂后具有活性,通过此臂的丝氨酸部位以硫酯的形式与氨酰AMP结合。另外,具有C结构域(缩合结构域:condesation domain),其具有将相邻的PCP结构域结合的氨酰AMP之间形成肽键。除以上结构域之外,有的模块还具有形成羟基的酮还原酶结构域(ketoreductase domain, KR domain),形成醛基的终端还原酶结构域(Terminal reductase domain, TD domain),可构成D构型的差向异构酶结构域(epimerization domain, E domain)。
在上述基础上,本实施例提供了一种参与阳离子肽化合物合成的基因,所述基因编码的蛋白质具有由芽孢杆菌属(Bacillus Cohn)细菌产生的阳离子肽化合物的非核糖体肽合成活性及聚合酮酶合成活性,所述蛋白质从N末端侧起依次包含如下的模块:
第一模块:其从N末端侧起依次包含:第一腺苷酰化结构域,其包含如SEQ ID NO:1所示的氨基酸序列或与SEQ ID NO:1所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一肽基载体蛋白结构域,其包含如SEQ ID NO:2所示的氨基酸序列或与SEQ ID NO:2所示的氨基酸序列具有70%以上同一性的氨基酸序列;酮还原酶结构域,其包含如SEQ ID NO:3所示的氨基酸序列或与SEQ ID NO:3所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一缩合结构域,其包含如SEQ ID NO:4所示的氨基酸序列或与SEQ ID NO:4所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第二模块:其从N末端侧起依次包含:第二缩合结构域,其包含如SEQ ID NO:5所示的氨基酸序列或与SEQ ID NO:5所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二腺苷酰化结构域,其包含如SEQ ID NO:6所示的氨基酸序列或与SEQ ID NO:6所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二肽基载体蛋白结构域,其包含如SEQ ID NO:7所示的氨基酸序列或与SEQ ID NO:7所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第三模块:其从N末端侧起依次包含:第三缩合结构域,其包含如SEQ ID NO:8所示的氨基酸序列或与SEQ ID NO:8所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三腺苷酰化结构域,其包含如SEQ ID NO:9所示的氨基酸序列或与SEQ ID NO:9所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三肽基载体蛋白结构域,其包含如SEQ ID NO:10所示的氨基酸序列或与SEQ ID NO:10所示的氨基酸序列具有70%以上同一性的氨基酸序列;终端还原结构域,其包含如SEQ ID NO:11所示的氨基酸序列或与SEQ ID NO:11所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第四模块:其从N末端侧起依次包含:第四缩合结构域,其包含如SEQ ID NO:12所示的氨基酸序列或与SEQ ID NO:12所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四腺苷酰化结构域,其包含如SEQ ID NO:13所示的氨基酸序列或与SEQ ID NO:13所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四肽基载体蛋白结构域,其包含如SEQ IDNO:14所示的氨基酸序列或与SEQ ID NO:14所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第五模块:其从N末端侧起依次包含:第五缩合结构域,其包含如SEQ ID NO:15所示的氨基酸序列或与SEQ ID NO:15所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五腺苷酰化结构域,其包含如SEQ ID NO:16所示的氨基酸序列或与SEQ ID NO:16所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五肽基载体蛋白结构域,其包含如SEQ IDNO:17所示的氨基酸序列或与SEQ ID NO:17所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一差向异构酶结构域,其包含如SEQ ID NO:18所示的氨基酸序列或与SEQ IDNO:18所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第六模块:其从N末端侧起依次包含:第六缩合结构域,其包含如SEQ ID NO:19所示的氨基酸序列或与SEQ ID NO:19所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六腺苷酰化结构域,其包含如SEQ ID NO:20所示的氨基酸序列或与SEQ ID NO:20所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六肽基载体蛋白结构域,其包含如SEQ IDNO:21所示的氨基酸序列或与SEQ ID NO:21所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第七模块:其从N末端侧起依次包含:第七缩合结构域,其包含如SEQ ID NO:22所示的氨基酸序列或与SEQ ID NO:22所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七腺苷酰化结构域,其包含如SEQ ID NO:23所示的氨基酸序列或与SEQ ID NO:23所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七肽基载体蛋白结构域,其包含如SEQ IDNO:24所示的氨基酸序列或与SEQ ID NO:24所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二差向异构酶结构域,其包含如SEQ ID NO:25所示的氨基酸序列或与SEQ IDNO:25所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第八模块:其从N末端侧起依次包含:第八缩合结构域,其包含如SEQ ID NO:26所示的氨基酸序列或与SEQ ID NO:26所示的氨基酸序列具有70%以上同一性的氨基酸序列;第八腺苷酰化结构域,其包含如SEQ ID NO:27所示的氨基酸序列或与SEQ ID NO:27所示的氨基酸序列具有70%以上同一性的氨基酸序列;第八肽基载体蛋白结构域,其包含如SEQ IDNO:28所示的氨基酸序列或由SEQ ID NO:28所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第九模块:其从N末端侧起依次包含:第九缩合结构域,其包含如SEQ ID NO:29所示的氨基酸序列或与SEQ ID NO:29所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九腺苷酰化结构域,其包含如SEQ ID NO:30所示的氨基酸序列或与SEQ ID NO:30所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九肽基载体蛋白结构域,其包含如SEQ IDNO:31所示的氨基酸序列或与SEQ ID NO:31所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十模块:其从N末端侧起依次包含:第十缩合结构域,其包含如SEQ ID NO:32所示的氨基酸序列或与SEQ ID NO:32所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十腺苷酰化结构域,其包含如SEQ ID NO:33所示的氨基酸序列或与SEQ ID NO:33所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十肽基载体蛋白结构域,其包含如SEQ IDNO:34所示的氨基酸序列或与SEQ ID NO:34所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十一模块:其从N末端侧起依次包含:第十一缩合结构域,其包含如SEQ ID NO:35所示的氨基酸序列或与SEQ ID NO:35所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一腺苷酰化结构域,其包含如SEQ ID NO:36所示的氨基酸序列或与SEQ ID NO:36所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一肽基载体蛋白结构域,其包含如SEQ ID NO:37所示的氨基酸序列或与SEQ ID NO:37所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三差向异构酶结构域,其包含如SEQ ID NO:38所示的氨基酸序列或与SEQ ID NO:38所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十二模块:其从N末端侧起依次包含:第十二缩合结构域,其包含如SEQ ID NO:39所示的氨基酸序列或与SEQ ID NO:39所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二腺苷酰化结构域,其包含如SEQ ID NO:40所示的氨基酸序列或与SEQ ID NO:40所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二肽基载体蛋白结构域,其包含如SEQ ID NO:41所示的氨基酸序列或与SEQ ID NO:41所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十三模块:其从N末端侧起依次包含:第十三缩合结构域,其包含如SEQ ID NO:42所示的氨基酸序列或与SEQ ID NO:42所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三腺苷酰化结构域,其包含如SEQ ID NO:43所示的氨基酸序列或与SEQ ID NO:43所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三肽基载体蛋白结构域,其包含如SEQ ID NO:44所示的氨基酸序列或与SEQ ID NO:44所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四差向异构酶结构域,其包含如SEQ ID NO:45所示的氨基酸序列或与SEQ ID NO:45所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十四模块:其从N末端侧起依次包含:第十四腺苷酰化结构域,其包含如SEQ IDNO:46所示的氨基酸序列或与SEQ ID NO:46所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十四肽基载体蛋白结构域,其包含如SEQ ID NO:47所示的氨基酸序列或与SEQID NO:47所示的氨基酸序列具有70%以上同一性的氨基酸序列。
其中,第一模块中,如果分别作为第一A结构域、第一PCP结构域、KR结构域和第一C结构域发挥功能,则也可以是与SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:2和SEQ ID NO:4所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第一A结构域更优选为和第一A结构域中与识别底物相关的关键位点残基(又称特异性代码:specificity-conferring code,见表1)相同的氨基酸序列。
第二模块中,如果分别作为第二C结构域、第二A结构域和第二PCP结构域发挥功能,则也可以是与SEQ ID NO:5、SEQ ID NO:6和SEQ ID NO:7所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第二A结构域更优选为和第二A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第三模块中,如果分别作为第三C结构域、第三A结构域和第三PCP结构域以及TD结构域发挥功能,则也可以是与SEQ ID NO:8、SEQ ID NO:9、SEQ ID NO:10和SEQ IDNO:11所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第三A结构域更优选为和第三A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第四模块中,如果分别作第四C结构域、第四A结构域和第四PCP结构域发挥功能,则也可以是与SEQ ID NO:12、SEQ ID NO:13和SEQ ID NO:14所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第四A结构域更优选为和第四A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第五模块中,如果分别作为第五C结构域、第五A结构域和第五PCP结构域以及第一E结构域发挥功能,则也可以是与SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17和SEQID NO:18所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第五A结构域更优选为和第五A结构域中与识别底物相关特异性代码(见表1)相同的氨基酸序列。
其中,第六模块中,如果分别作为第六C结构域、第六A结构域和第六PCP结构域发挥功能,则也可以是与SEQ ID NO:19、SEQ ID NO:20和SEQ ID NO:21所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第六A结构域更优选为和第六A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第七模块中,如果分别作为第七C结构域、第七A结构域和第七PCP结构域以及第二E结构域发挥功能,则也可以是与SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24和SEQID NO:25所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第七A结构域更甚优选与第七A结构域中与识别底物相关特异性代码(见表1)相同的氨基酸序列。
其中,第八模块中,如果分别作为第八C结构域、第八A结构域和第八PCP结构域发挥功能,则也可以是与SEQ ID NO:26、SEQ ID NO:27和SEQ ID NO:28所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第八A结构域更甚优选与第八A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第九模块中,如果分别作为第九C结构域、第九A结构域和第九PCP结构域发挥功能,则也可以是与SEQ ID NO:29、SEQ ID NO:30和SEQ ID NO:31所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第九A结构域更甚优选与第九A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第十模块中,如果分别作为第十C结构域、第十A结构域和第十PCP结构域发挥功能,则也可以是与SEQ ID NO:32、SEQ ID NO:33和SEQ ID NO:34所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十A结构域更甚优选与第十A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第十一模块,如果分别作为第十一C结构域、第十一A结构域和第十一PCP结构域以及第三E结构域发挥功能,则也可以是与SEQ ID NO:35、SEQ ID NO:36、SEQ ID NO:37和SEQ ID NO:38所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十一A结构域更甚优选与第十一A结构域中与识别底物相关特异性代码(见表1)相同的氨基酸序列。
其中,第十二模块,如果分别作第十二C结构域、第十二A结构域和第十二PCP结构域发挥功能,则也可以是与SEQ ID NO:39、SEQ ID NO:40和SEQ ID NO:41所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十二A结构域更甚优选与第十二A结构域中与识别底物相关的特异性代码(见表1)相同的氨基酸序列。
其中,第十三模块,如果分别作为第十三C结构域、第十三A结构域和第十三PCP结构域以及第四E结构域发挥功能,则也可以是与SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44和SEQ ID NO:45所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十三A结构域更甚优选与第十三A结构域中与识别底物相关特异性代码(见表1)相同的氨基酸序列。
其中,第十四模块,如果分别作为第十四A结构域和第十四PCP结构域发挥功能,则也可以是与SEQ ID NO:46和SEQ ID NO:47所示的氨基酸序列具有优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列。另外,第十四A结构域更甚优选与第十四A结构域中与识别底物相关特异性代码(见表1)相同的氨基酸序列。表1中的D、S、Y、W、L、G、K、V、N、F、A、D、I、H、C、Y、M等均为氨基酸缩写。A1、A2、A3……A10表示编号。
表1、A结构域代码
上述的各模块的位置与结构域组成及其合成的肽骨架的氨基酸位置如图1所示。图中A结构域表示腺苷酰化结构域,PCP结构域表示肽基载体蛋白结构域,C结构域表示缩合结构域,KR结构域表示酮还原酶结构域,TD结构域表示终端还原酶结构域,E结构域表示差向异构酶结构域。
实施例2 各模块中的结构域发挥功能评价
当第一A结构域与SEQ ID NO:1的氨基酸序列不同时,其是否可以作为对应于3-甲基-2-氧代戊酸的A结构域发挥功能可以如下进行评价。
首先,设计突变体基因以编码第一突变体A结构域,所述第一突变体A结构域被设计为与SEQ ID NO:1的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否合成具有上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体A结构域可以被评价为作为与3-甲基-2-氧代戊酸相对应的A结构域发挥了功能。同样地,当第二A结构域至第十四A结构域与SEQ ID NO:6、SEQ ID NO:9、SEQ ID NO:13、SEQ ID NO:16、SEQ IDNO:20、SEQ ID NO:23、SEQ ID NO:27、SEQ ID NO: 30、SEQ ID NO:33、SEQ ID NO:36、SEQID NO:40、SEQ ID NO:43和SEQ ID NO:46的氨基酸序列不同时,也可以用此方式评价其是否可以作为A结构域发挥功能。
当第一PCP结构域与SEQ ID NO:2的氨基酸序列不同时,其是否可以作为PCP结构域发挥功能可以如下进行评价。首先设计突变体基因以编码第一突变体PCP结构域,所述第一突变体PCP结构域被设计为与SEQ ID NO:2的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否具有合成上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体PCP结构域可以被评价为作为PCP结构域发挥了功能。同样地,当第二PCP结构域至第十四PCP结构域与SEQ ID NO:7、SEQ ID NO:10、SEQ ID NO:14、SEQ ID NO:17、SEQID NO:21、SEQ ID NO:24、SEQ ID NO:28、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:37、SEQID NO:41、SEQ ID NO:44和SEQ ID NO:47的氨基酸序列不同时,也可以用此方式评价其是否可以作为PCP结构域发挥功能。
当第一C结构域与SEQ ID NO:4的氨基酸序列不同时,其是否可以作为C结构域发挥功能可以如下进行评价。首先设计突变体基因以编码第一突变体C结构域,所述第一突变体C结构域被设计为与SEQ ID NO:4的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否具有合成上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体C结构域可以被评价为作为C结构域发挥了功能。同样地,当第二C结构域至第十三C结构域与SEQ ID NO:5、SEQ ID NO:8、SEQ ID NO:12、SEQ ID NO:15、SEQ ID NO:19、SEQID NO:22、SEQ ID NO:26、SEQ ID NO:29、SEQ ID NO:32、SEQ ID NO:35、SEQ ID NO:39和SEQ ID NO:42的氨基酸序列不同时,也可以用此方式评价其是否可以作为C结构域发挥功能。
当第KR结构域与SEQ ID NO:3的氨基酸序列不同时,其是否可以作为KR结构域发挥功能可以如下进行评价。首先设计突变体基因以编码突变体KR结构域,所述第一突变体KR结构域被设计为与SEQ ID NO:3的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否具有合成上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体KR结构域可以被评价为作为KR结构域发挥了功能。
当TD结构域与SEQ ID NO:11的氨基酸序列不同时,其是否可以作为TD结构域发挥功能可以如下进行评价。首先设计突变体基因以编码突变体TD结构域,所述突变体TD结构域被设计为与SEQ ID NO:11的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否具有合成上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体TD结构域可以被评价为作为TD结构域发挥了功能。
当第一E结构域与SEQ ID NO:18的氨基酸序列不同时,其是否可以作为E结构域发挥功能可以如下进行评价。首先设计突变体NRPS基因以编码第一突变体E结构域,所述第一突变体E结构域被设计为与SEQ ID NO:18的氨基酸序列不同。在适当宿主内表达该突变体基因,确认在宿主内和培养上清液中的代谢物中是否具有合成上述阳离子肽化合物基本肽骨架的化合物。如果代谢物中具有合成上述阳离子肽化合物基本肽骨架的化合物,则设计的第一突变体E结构域可以被评价为作为E结构域发挥了功能。同样地,当第二E结构域至第四E结构域与SEQ ID NO:25、SEQ ID NO:38和SEQ ID NO:45的氨基酸序列不同时,也可以用此方式评价其是否可以作为E结构域发挥功能。
如上所述,合成上述阳离子肽化合物基本肽骨架的聚酮合酶和NRPS可以由第一模块至第十四模块定义。将具有聚酮合酶活性和NRPS活性的氨基酸序列示于SEQ ID NO:48-SEQ ID NO:60,所述聚酮合酶或NRPS来源于侧孢短芽孢杆菌Brevibacillus laterosporusS62-9且具有合成上述阳离子肽化合物基本肽骨架的活性;将SEQ ID NO:48- SEQ ID NO:60所示的氨基酸序列相对应的编码区域示于SEQ ID NO:61- SEQ ID NO:67。
因此本发明的参与阳离子肽化合物合成的基因可以是编码如下蛋白质的基因,所述蛋白质具有上述SEQ ID NO:1- SEQ ID NO:47所示氨基酸序列定义的第一模块至第十四模块,并且与SEQ ID NO:48- SEQ ID NO:60所示的氨基酸序列相对应的编码区域示于SEQID NO:61-SEQ ID NO:67。或是与SEQ ID NO:48- SEQ ID NO:60所示氨基酸序列具有70%以上同一性、优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列且具有合成上述阳离子肽化合物基本肽骨架的活性。氨基酸序列之间的同一性数值可以通过安装BLAST算法的BLASTN和BLASTX程序来计算(默认设置)。同一性数值的计算是在对一对氨基酸序列进行双序列对比分析时计算完全一致的氨基酸残基,作为比较的全部氨基酸残基中的比例。
另外本发明的参与阳离子肽化合物合成的基因可以是编码如下蛋白质的基因,所述蛋白质具有上述SEQ ID NO:1- SEQ ID NO:47所示氨基酸序列定义的第一模块至第十四模块,并且具有相对于SEQ ID NO:48- SEQ ID NO:60的氨基酸序列置换、缺失、插入或增添1个或更多个氨基酸而成的氨基酸序列,且具有合成上述阳离子肽化合物基本肽骨架的活性。这里“更多个”是指2-1300个,优选为2-700个,进一步优选为2-250个,进一步优选为2-100个,进一步优选为2-50个。
然而本发明的参与阳离子肽化合物合成的基因并不限于编码如下蛋白质的基因,所述蛋白质具有上述SEQ ID NO:1- SEQ ID NO:47所示的氨基酸序列定义的第一模块至第十四模块。如上所述,将具有聚酮合酶活性和NRPS活性的氨基酸序列示于SEQ ID NO:48-SEQ ID NO:60,所述聚酮合酶和NRPS来源于侧孢短芽孢杆菌且具有合成上述阳离子肽化合物基本肽骨架的活性;将与该氨基酸序列相对应的编码区碱基序列所示于SEQ ID NO:61-SEQ ID NO:67,本发明的聚酮合酶和NRPS基因也可以由上述SEQ ID NO:48- SEQ ID NO:60和SEQ ID NO:61- SEQ ID NO:67定义。
即,本发明的参与阳离子肽化合物合成的基因可以作为编码如下蛋白质的基因,所述蛋白质由SEQ ID NO:48- SEQ ID NO:60所示氨基酸序列。
另外,本发明的参与阳离子肽化合物合成的基因也可以是编码如下蛋白质的基因,所述蛋白质与SEQ ID NO:48- SEQ ID NO:60所示氨基酸序列具有70%以上同一性、优选80%以上同一性、更优选90%以上同一性、进一步优选95%以上同一性、最优选97%以上同一性的氨基酸序列且具有上述阳离子肽化合物基本肽骨架的合成活性。与上述相同,氨基酸序列之间的同一性数值可以通过安装BLAST算法的BLASTN和BLASTX程序来计算(默认设置)。同一性数值的计算是在对一对氨基酸序列进行双序列对比分析时计算完全一致的氨基酸残基,作为比较的全部氨基酸残基中的比例。
此外,关于本发明的参与阳离子肽化合物合成的基因,可以利用存储有碱基序列信息的公开数据库来鉴定满足相对于SEQ ID NO:61- SEQ ID NO:67所示碱基序列覆盖率高、E值低、同一性数值高的条件的基因。作为鉴定的基因的条件,可以将覆盖率设置为90%以上,优选设为95%以上,更优选设为99%以上。此外,作为鉴定的基因的条件,可以将同一性数值设为70%以上,优选设为75%以上,更优选设为80%以上。被鉴定为满足这些条件的基因,可以被鉴定为是由SEQ ID NO:61- SEQ ID NO:67的碱基序列组成的基因的同源基因概率非常高,且与有SEQ ID NO:61-SEQ ID NO:67的碱基序列组成的基因同样编码上述阳离子肽化合物基本肽骨架的合成活性的蛋白质的基因。
鉴定的基因是否编码具有上述阳离子肽化合物的基本肽骨架的合成活性的蛋白质可以通过获得具有该基因的微生物,并对该微生物的上述阳离子肽化合物合成能力可以通过如下方式进行验证:培养该微生物,确认培养的菌体内或培养上清液中是否含有上述阳离子肽化合物。
然而,本发明的参与阳离子肽化合物合成的基因如果确定了其碱基序列,则可以通过如下方法制作:化学合成或者以基因组DNA为模板的PCR,或者将具有该碱基序列的DNA片段作为探针进行杂交。此外由SEQ ID NO:61- SEQ ID NO:67不同的碱基序列组成的基因或者编码与SEQ ID NO:48- SEQ ID NO:60不同的氨基酸序列的基因也可以通过对由SEQID NO:61- SEQ ID NO:67所示的碱基序列组成的多聚核苷酸进行定点诱变等来进行合成。
特别是,本发明的参与阳离子肽化合物合成的基因可以从已知产生上述阳离子肽化合物的微生物中分离,示例,可以从侧孢短芽孢杆菌中分离基因编码SEQ ID NO:48- SEQID NO:60所示的氨基酸序列。
本发明的参与阳离子肽化合物合成的基因也可以利用SEQ ID NO:61- SEQ IDNO:67所示的碱基序列,概率很高的从侧孢短芽孢杆菌以外的短芽胞杆菌中分离出。即通过如下多聚核苷酸作为探针的杂交反应,所述多聚核苷酸由选自SEQ ID NO:61- SEQ ID NO:67所示的碱基序列的连续一部分碱基序列组成,可以从侧孢短芽孢杆菌以外的芽孢杆菌属细菌的基因组或者转录产物来源的cDNA中,分离本发明的参与阳离子肽化合物合成的基因。侧孢短芽孢杆菌以外的芽孢杆菌属细菌既可以是不产生上述阳离子肽化合物的芽孢杆菌属细菌,也可以是产生上述阳离子肽化合物的芽孢杆菌属细菌。其原因在于,即使是不产生上述阳离子肽化合物的芽孢杆菌属细菌,也有可能具有本发明的参与阳离子肽化合物合成的基因。
实施例3 基因结构域结构进行分析
本实施例使用antiSMASH程序对本发明的参与阳离子肽化合物合成的基因编码的蛋白质(SEQ ID NO:48-60)中结构域结构进行分析的结果见图2,从图2中可看出,antiSMASH程序中的数据库中含有与本发明的参与阳离子肽化合物合成的基因编码的蛋白质中的结构域。
实施例4 基因的上游和下游的基因blastp检索
对本发明的参与阳离子肽化合物合成的基因(推定的阳离子脂肽化合物合成簇)的上游和下游的基因blastp检索结果见图3,以确定本发明的参与阳离子肽化合物合成的基因的位置。
实施例5参与阳离子肽化合物合成的基因区域
本实施例对本发明的参与阳离子肽化合物合成的基因区域的图见图4,说明本发明的参与阳离子肽化合物合成的基因编码的蛋白质主要是图4中注释的蛋白。
实施例6 参与阳离子肽化合物合成的基因的功能验证
利用PCR技术将本发明十四个模块中参与底物选择的腺苷酰化酶(A)结构域(SEQID NO:1、SEQ ID NO:6、SEQ ID NO:9、SEQ ID NO:13、SEQ ID NO:16、SEQ ID NO:20、SEQ IDNO:23、SEQ ID NO:27、SEQ ID NO:1、30、SEQ ID NO:33、SEQ ID NO:36、SEQ ID NO:40、SEQID NO:43和SEQ ID NO:46的编码基因)克隆至大肠杆菌表达质粒PET-21b中,通过添加异丙基-β-D-硫代半乳糖苷(IPTG)的方式对其进行异源表达,通过亲和层析的纯化方式获得此蛋白。通过EnzChek™焦磷酸检测试剂盒(厂家:Invitrogen™公司,货号E6645)对其进行底物特异性检测。结果如图5-18所示,此结果中14个模块中的A结构域均对应所述的阳离子肽化合物结构中的14个残基,如图21所示,具体详述为第一模块中A对应特殊残基(1)(2-羟基-3-甲基戊酸)的酮酸形式,即2-氧代-3-甲基戊酸(α-KIL);第二模块中A对应残基(13)(亮氨酸);第三模块中A对应特殊残基(14)(缬氨醇)的氨基酸形式,即缬氨酸;第四模块中A对应残基(11)(赖氨酸);第五模块中A对应残基(12)(酪氨酸);第六模块中A对应残基(9)(缬氨酸);第七模块A对应残基(10)(亮氨酸);第八模块A对应残基(5)(异亮氨酸或缬氨酸);第九模块中A对应残基(6)(缬氨酸);第十模块中A对应残基(7)(缬氨酸);第十一模块中A对应残基(8)(赖氨酸);第十二模块中A对应残基(3)(R3对应的氨基酸);第十三模块中A对应残基(4)(鸟氨酸);第十四模块中A对应特殊残基(2)(2-氨基-2烯丁酸)的未脱氢形式,即苏氨酸。特殊残基(1)及(2)的形成可详述为以下过程,通过第一模块中A对残基(1)的酮酸形式具有活性,表明其通过第一模块中的酮还原酶结构域进行还原作用,并最终表现为残基(1),具体见图19表示。通过第十四模块中A仅可特异性活化苏氨酸,苏氨酸为特殊残基(2)的未脱氢形式,其由基因中的L-苏氨酸-3-脱氢酶作用得来,具体如图20所示。
显然,上述实施例仅仅是为清楚地说明所作的举例,而并非对实施方式的限定。对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。而由此所引伸出的显而易见的变化或变动仍处于本发明创造的保护范围之中。
序列表
<110> 北京工商大学
<120> 一种参与阳离子肽化合物合成的基因及其用途
<130> HA202109099-YS
<160> 67
<170> SIPOSequenceListing 1.0
<210> 1
<211> 585
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 1
Ile Glu Ser Ser Leu Pro Leu Ala Ile Ser His Gly Asp Glu Leu Lys
1 5 10 15
Arg Thr Ala Ser Ser Pro Thr Thr Leu Ile Asp Val Leu Leu Asn Ala
20 25 30
Ala Lys Leu Glu Asp Lys Gly Ile Ile Tyr Val Leu Asp Asn Gly Asp
35 40 45
Glu Ile Glu Thr Ser Tyr Arg Gln Leu Phe Glu Gln Ala Lys Arg Leu
50 55 60
Leu Lys Gly Leu Arg Asn Met Gly Ile Lys Pro Gln Asp Arg Val Ile
65 70 75 80
Phe Gln Phe Gln Lys Asn Gln Asn Tyr Val Pro Met Phe Trp Ala Cys
85 90 95
Val Leu Gly Gly Phe Ile Thr Val Pro Ile Ala Thr Ser Thr Ser Tyr
100 105 110
Ala Glu Lys Asn Ala Asp Thr Met Lys Leu Tyr Asn Ile Trp Asn Met
115 120 125
Leu Asp Lys Pro Val Ile Ile Thr Asp Glu Asp Leu Leu Glu Glu Val
130 135 140
Asn Lys Leu Ser Ala Val Trp Glu Thr Glu Glu Phe Val Ile Ser Thr
145 150 155 160
Ala Glu Gln Leu Met Asn Asn Asp Glu Glu Gln Glu Phe Tyr Ala Tyr
165 170 175
Gln Glu Glu Asp Phe Val Leu Tyr Leu Leu Thr Ser Gly Ser Thr Gly
180 185 190
Leu Pro Lys Cys Val Gln His Arg Asn Ala Ser Leu Val Ala Arg Ser
195 200 205
Met Ala Thr Ala Gln Phe Asn Gln Phe Asp Ser Glu Glu Val Leu Leu
210 215 220
Cys Trp Met Pro Leu Glu His Val Gly Gly Leu Val Met Tyr His Ile
225 230 235 240
Leu Gly Val Tyr Leu Ala Cys Gln Gln Ile Leu Pro Arg Val Asp Ser
245 250 255
Phe Ile Ser Gln Pro Leu Asn Trp Leu His Trp Met Asp Lys Tyr Lys
260 265 270
Ala Thr Leu Thr Trp Ala Pro Asn Phe Ala Phe Ser Leu Val Asn Asp
275 280 285
Gln Glu Lys Ala Ile Glu Glu Gly Ser Trp Asp Leu Ser Ser Val Lys
290 295 300
His Ile Leu Asn Gly Gly Glu Ala Ile Val Ala Lys Thr Ala Lys Arg
305 310 315 320
Phe Leu Ser Ile Leu Arg Lys His Gln Leu Arg Asp Asp Cys Met Tyr
325 330 335
Pro Ser Phe Gly Met Ser Glu Thr Ser Ser Gly Ile Val Phe Ser Lys
340 345 350
Thr Phe Arg Ser Lys Pro Gln Ser Gly Val His Tyr Val Asp Lys Ile
355 360 365
Ser Phe Glu Thr Leu Leu Arg Phe Val Asp Asp Thr Glu Lys Glu His
370 375 380
Ile Cys Phe Thr Glu Val Gly Gly Pro Ile Pro Gly Val Ser Val Arg
385 390 395 400
Ile Val Asp Asp Asn Asn Gln Leu Leu His Glu Asn His Ile Gly Arg
405 410 415
Val Gln Val Thr Gly Pro Thr Ile Met Ala Gly Tyr Tyr Gln Asn Gln
420 425 430
Glu Ala Asn Gln Glu Ala Phe Ile Gly Asp Gly Trp Phe Phe Thr Gly
435 440 445
Asp Leu Gly Phe Leu Asn Glu Gly Arg Leu Thr Val Thr Gly Arg Gln
450 455 460
Lys Asp Ile Ile Ile Met Asn Gly Lys Asn Tyr Tyr Asn Tyr Glu Ile
465 470 475 480
Glu Ser Leu Val Glu Ser Val Tyr Gly Val Gln Ser Thr Phe Val Ala
485 490 495
Ala Ala Ala Phe Ile Asp Thr Ser Lys Ser Val Glu Glu Leu Ala Ile
500 505 510
Phe Phe Thr Pro Ile Asn Asp Ile Asp Glu Arg Phe Leu Gly Phe Ile
515 520 525
Ile Thr Glu Ile Arg Arg Thr Val Ser Arg Asn Leu Gly Ile Thr Pro
530 535 540
Lys His Ile Ile Pro Ile Lys Lys Glu Thr Phe Ser Lys Thr Glu Ser
545 550 555 560
Gly Lys Ile Gln Arg Asn Lys Phe Thr Glu Arg Leu Tyr Ala Gly Glu
565 570 575
Phe Asp Glu Ile Met Lys Gln Ile Glu
580 585
<210> 2
<211> 66
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 2
Glu Glu Glu Ile Lys Gln Leu Trp Ser Glu Ile Leu Gln Val Asp Gln
1 5 10 15
Leu Gln Ala Phe Asp Ser Phe Phe Asp Leu Gly Gly Ser Ser Leu Lys
20 25 30
Ala Thr Gln Leu Leu Ser Ser Val Gln Ser Arg Phe Gly Val Ala Ile
35 40 45
Pro Leu Lys Val Phe Phe Glu Asn Pro Thr Ile Lys Gly Leu Ile His
50 55 60
Arg Met
65
<210> 3
<211> 194
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 3
Gly Phe Tyr Val Val Thr Gly Gly Leu Gly Gly Val Gly Lys Gln Ile
1 5 10 15
Ala His Tyr Leu Ile Gln Lys Leu Asp Ala Asn Ile Leu Leu Leu Gly
20 25 30
Lys Thr Ala Leu Pro Ser Ala Thr Asp Lys Asn Thr Gly Thr Gln Ser
35 40 45
Glu Arg Tyr Arg Ala Leu Arg Glu Leu Glu Lys Leu Thr Thr Arg Glu
50 55 60
Asn Gln Val Ile Tyr Lys Ser Thr Asp Leu Ser Asp Val Thr Val Leu
65 70 75 80
Glu Glu Ala Ile His Glu Ser Met Leu Gly Met Asn Ala Lys Lys Val
85 90 95
Asp Gly Ile Ile His Leu Ala Gly Val Tyr Tyr Glu Lys Pro Leu Leu
100 105 110
Glu Glu Ser Ile Glu Ser Leu Glu His Met Tyr Glu Ser Lys Val Ala
115 120 125
Gly Thr Tyr Leu Leu Gly Lys Leu Val Glu Lys Tyr Pro Glu Ala Ile
130 135 140
Leu Val Thr Ser Ser Ser Ser Ser Gly Leu Leu Ser Gly Tyr Gly Val
145 150 155 160
Gly Ala Tyr Thr Ser Ala Asn Met Phe Val Glu Thr Phe Thr Asp Tyr
165 170 175
Val Ala Gln Lys Arg Asp Gly Val Gln Cys Phe Ala Trp Ser Leu Trp
180 185 190
Asn Asp
<210> 4
<211> 300
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 4
Ile Asp Leu Ser Pro Ser Gln Arg Gly Gln Trp Tyr Leu Tyr Gln Thr
1 5 10 15
Arg Pro Asp Ser Pro Phe Tyr Asn Ile Thr Phe Thr Leu Thr Phe Glu
20 25 30
Glu Asn Val Asn Arg Gln Ala Leu Leu Lys Ser Ile Gln Ala Val Ile
35 40 45
Gln Ser Gln Glu Ala Leu Arg Ser Glu Ile Arg Met Ile Glu Asp Asp
50 55 60
Pro Lys Leu Phe Ile His Asp Thr Tyr Val Val Gly Ile Pro Ile Ile
65 70 75 80
Asp Leu Ser Ser Tyr Pro Leu Glu Gln Lys Glu Gln Lys Leu Ala Ala
85 90 95
Leu Ile Asp Thr Glu Ala Asn Thr Pro Phe Gln Met Met Gln Glu Ser
100 105 110
Leu Ser Arg Phe Thr Leu Ile His Val Gly Asp Asn Val His Ile Leu
115 120 125
Leu Gly Ser Met His His Ile Ile Ser Asp Gly Trp Ser Ile His Val
130 135 140
Phe Thr Lys Glu Leu Leu Arg Phe Tyr Lys Glu Ile Gln Glu Asn Gly
145 150 155 160
Val Ile Asp Arg Lys Glu Leu Asp Tyr Thr Tyr Thr Ser Tyr Leu Leu
165 170 175
Asp Gln Lys Glu Trp Leu Thr Lys Glu Asn Pro Glu Tyr Ala Asp Gln
180 185 190
Leu Tyr Tyr Trp Lys Glu Asn Leu Ala Thr Glu Thr Ala Pro Leu Thr
195 200 205
Leu Pro Ile Gln Leu Ser Arg Pro Ala Ile Gln Ser Tyr Gln Gly Met
210 215 220
Thr Ile Glu Gln Leu Val Thr Gln Glu Leu Thr Gln Asn Ile Met Glu
225 230 235 240
Ala Ser Lys His Met Lys Ser Thr Pro Tyr Met Phe Leu Leu Ser Thr
245 250 255
Phe Ala Ala Leu Leu Phe Arg Tyr Ser Gly Gln Asp Thr Phe Arg Leu
260 265 270
Gly Thr Val Leu Ala Asn Arg Asn Met Pro His Thr Glu Lys Ile Ile
275 280 285
Gly Tyr Leu Ala Asn Thr Val Thr Leu Ser Phe Asp
290 295 300
<210> 5
<211> 299
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 5
Lys Asp Ile Tyr Ser Leu Ser Pro Met Gln Arg Gly Met Leu Phe His
1 5 10 15
Thr Leu Lys Asp Lys Glu Asn Leu Ala Tyr Phe Asp Gln Thr Thr Phe
20 25 30
Gln Ile Glu Gly Asp Ile Cys Val Glu Ser Leu Glu Lys Ser Phe Asn
35 40 45
Glu Leu Ile Arg Lys Tyr Asp Val Leu Arg Thr Ile Phe Leu Tyr Gln
50 55 60
Lys Leu Lys Glu Pro Met Gln Val Val Leu Lys Glu Arg Thr Ala Asn
65 70 75 80
Ile His Tyr Glu Asp Phe Ser Met Lys Ser Glu Ser Asp Lys Ala Lys
85 90 95
Ala Leu Arg Val Ala Lys Gln Arg Asp Arg Asp Glu Gly Phe Asp Leu
100 105 110
Ser Arg Asp Ile Leu Met Arg Leu Ser Leu Leu Lys Val Ala Pro Asn
115 120 125
Gln Tyr Glu Leu Val Ile Ser Ser His His Ile Ile Ile Asp Gly Trp
130 135 140
Cys Thr Gly Ile Leu Tyr Gln Glu Leu Phe Tyr Phe Tyr Gln Cys Phe
145 150 155 160
Val Ala Asn Gln Pro Ile Pro Ala Glu Lys Ser Ile Pro Tyr Ser Arg
165 170 175
Tyr Ile Arg Trp Leu Glu Glu Gln Asp Glu Glu Glu Gly Lys Ala Tyr
180 185 190
Trp Gly Glu Tyr Leu Gln Asp Phe Glu Gly Ala Ser Val Ile Pro Lys
195 200 205
Gln Asn Ala Lys Gly Glu Lys Glu Val Cys Ser Ile Asp Lys Val Thr
210 215 220
Phe His Phe Asp Lys Lys Leu Thr Glu Glu Leu Val Gln Val Ala Lys
225 230 235 240
Ala Cys Gln Val Thr Ile Ser Thr Leu Phe Gln Thr Met Trp Gly Ile
245 250 255
Leu Leu Gln Lys Tyr Asn Asn Ser Gln Glu Ala Ile Phe Gly Ser Val
260 265 270
Ile Ser Gly Arg Ser Pro Glu Ile Pro Asp Val Glu Lys Ile Val Gly
275 280 285
Ile Phe Ile Asn Thr Ile Pro Val Arg Ile Arg
290 295
<210> 6
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 6
Ala Lys Ser Asp Phe Pro Met Asp Lys Thr Ile His Gln Leu Phe Glu
1 5 10 15
Glu Gln Val Ser Arg Thr Pro Glu Gln Ile Ala Val Val Phe Lys Gly
20 25 30
Glu Ser Phe Thr Tyr Arg Glu Leu Asn Glu Lys Ala Asn Gln Leu Ala
35 40 45
Trp Val Leu Arg Lys Arg Glu Val Arg Pro Asn Glu Ile Val Ala Ile
50 55 60
Met Ala Glu His Ser Leu Glu Met Leu Val Gly Val Ile Gly Thr Leu
65 70 75 80
Lys Ala Gly Ala Ala Tyr Leu Pro Ile Asp Pro Ser Tyr Pro Glu Lys
85 90 95
Arg Ile Ala His Met Leu Gln Asp Ser Lys Ala Glu Gln Leu Leu Ile
100 105 110
Gln Pro His Leu Asn Met Pro Gln Asp Phe Lys Gly Ser Val Leu Trp
115 120 125
Leu Thr Glu Glu Ser Trp Ala Lys Glu Ser Thr Thr Asp Leu Pro Leu
130 135 140
Ala Thr Ser Ala Asn Asp Leu Ala Tyr Met Ile Tyr Thr Ser Gly Ser
145 150 155 160
Thr Gly Leu Pro Lys Gly Val Met Val Glu His Gln Ala Leu Val Asn
165 170 175
Leu Val Met Trp His Asn Glu Ala Phe Gly Val Thr Met Thr Asp Gln
180 185 190
Cys Thr Lys Leu Ala Gly Phe Gly Phe Asp Ala Ser Val Trp Glu Thr
195 200 205
Phe Pro Pro Leu Ile Gln Gly Ala Thr Leu His Val Leu Glu Glu Ser
210 215 220
Arg Arg Gly Asp Ile Tyr Ala Leu His Glu Tyr Phe Glu Lys Asn Ala
225 230 235 240
Ile Thr Ile Ser Phe Leu Pro Thr Gln Leu Ala Glu Gln Phe Met Glu
245 250 255
Leu Thr Ser Ser Thr Leu Arg Val Leu Leu Ile Gly Gly Asp Arg Ala
260 265 270
Gln Lys Val Lys Lys Thr Thr Tyr Gln Ile Ile Asn Asn Tyr Gly Pro
275 280 285
Thr Glu Asn Thr Val Val Thr Thr Ser Gly Gln Leu His Pro Glu Gln
290 295 300
Asp Val Phe Pro Ile Gly Lys Pro Ile Thr Asn His Ser Val Tyr Ile
305 310 315 320
Leu Asp Gln Asn Arg His Pro Gln Pro Ile Gly Ile Pro Gly Glu Leu
325 330 335
Cys Val Ser Gly Ala Gly Leu Ala Arg Gly Tyr Leu Asn Gln Pro Glu
340 345 350
Leu Thr Val Glu Arg Phe Val Asp Asn Pro Phe Val Pro Gly Glu Arg
355 360 365
Ile Tyr Arg Thr Gly Asp Leu Val Arg Trp Arg Ile Asp Gly Ser Ile
370 375 380
Glu Tyr Leu Gly Arg Ile Asp Glu Gln Val Lys Ile Arg Gly Tyr Arg
385 390 395 400
Ile Glu Leu Gly Glu Ile Glu Thr Lys Leu Leu Glu His Pro Ser Ile
405 410 415
Ser Glu Ala Leu Val Val Ala Arg Asn Asp Glu Gln Gly Tyr Thr Tyr
420 425 430
Leu Cys Ala Tyr Val Val Ala Thr Gly Ala Trp Ser Val Ser Ser Leu
435 440 445
Arg Glu His Leu Ile Glu Thr Leu Pro Glu Tyr Met Ile Pro Ala Tyr
450 455 460
Met Met Glu Val Glu Lys Met Pro Leu Thr Ala Asn Gly Lys Val Asp
465 470 475 480
Lys Arg Ala Leu Pro Val Pro Asp Arg Gln Arg Met Asn Glu Tyr Val
485 490 495
Ala Pro Ala Thr
500
<210> 7
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 7
Glu Glu Lys Leu Val Leu Leu Phe Gln Glu Ile Leu Gly Leu Glu Arg
1 5 10 15
Ile Gly Thr Lys Asp His Phe Phe Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Met Leu Val Ser Arg Met His Lys Glu Leu Gly Val Asp Val
35 40 45
Gln Leu Asn Glu Met Phe Ala Arg Pro Thr Val Lys Asp Leu Ser Ala
50 55 60
Tyr Ile Asp Gln
65
<210> 8
<211> 285
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 8
Tyr Pro Val Ser Phe Ala Gln Arg Arg Met Tyr Val Val Gln Gln Met
1 5 10 15
Arg Asp Ser Glu Thr Thr Ser Tyr Asn Ile Pro Val Thr Phe Glu Leu
20 25 30
Lys Gly Lys Leu His Leu Asp Lys Leu Arg Glu Ala Leu Gln Ile Leu
35 40 45
Val Leu Arg His Glu Ser Leu Arg Thr Ser Phe His Met Ile Asp Glu
50 55 60
Asn Leu Val Gln Lys Val Asn Lys Asp Ile Ser Trp Asp Leu Glu Val
65 70 75 80
Ile Glu Ala Gln Glu Ser Glu Ile Glu Val Lys Leu Lys Glu Phe Ile
85 90 95
Arg Pro Phe His Leu Ser Glu Ala Pro Leu Phe Arg Ala Arg Leu Ile
100 105 110
Cys Leu Asn Pro Gln His His Leu Leu Ser Leu Asp Met His His Ile
115 120 125
Ile Ser Asp Gly Val Ser Met Asn Leu Phe Leu Gln Glu Phe Met Thr
130 135 140
Leu Tyr Gln Gly Glu Ala Leu Pro Ala Leu Ser Ile Gln Tyr Lys Asp
145 150 155 160
Tyr Ala Val Trp Gln Gln Ser Asp Lys Gln Arg Ala Arg Leu Lys Glu
165 170 175
Gln Glu Lys Tyr Trp Leu His His Phe Ser Gly Glu Leu Pro Thr Leu
180 185 190
Glu Leu Pro Thr Asp Phe Pro Arg Pro Ala Ile Gln Gln Phe Asp Gly
195 200 205
Asp Glu Trp Ser Phe Glu Met Asn Ala Asp Leu Leu Ala Lys Val Lys
210 215 220
Gln Ile Cys Ser Ser Gln Gly Thr Thr Leu Tyr Met Thr Leu Leu Ala
225 230 235 240
Ala Tyr Gln Val Phe Leu Ala Arg Tyr Thr Gly Gln Glu Asp Ile Ile
245 250 255
Val Gly Ser Pro Ile Ala Gly Arg Ser His Ala Asp Leu Glu Asn Met
260 265 270
Ile Gly Met Phe Val Asn Thr Leu Ala Leu Arg Gly Lys
275 280 285
<210> 9
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 9
Arg Glu Lys Thr Ile Gln Glu Leu Phe Glu Glu Gln Val Asp Lys Asn
1 5 10 15
Pro Asp Gln Ile Ala Leu Ile Cys Gly Glu Gln Gln Phe Thr Tyr Glu
20 25 30
Gln Leu Asn Val Lys Phe Asn Gln Leu Ala His Val Leu Arg Arg Glu
35 40 45
Gly Val Gln Pro Asn Gln Val Ile Gly Leu Ile Thr Asp Arg Ser Leu
50 55 60
Ser Met Ile Val Gly Ile Phe Gly Ile Ile Lys Ala Gly Gly Gly Tyr
65 70 75 80
Leu Pro Ile Asp Pro Thr Tyr Pro Thr Glu Arg Ile Glu Tyr Met Leu
85 90 95
Glu Asp Ser Gln Thr His Leu Leu Leu Val Gln His Arg Asp Met Val
100 105 110
Pro Ala Gly Tyr Gln Gly Glu Val Leu Ile Ile Glu Asp Glu Ile Ser
115 120 125
Arg Asp Glu Gln Val Ala Asn Ile Glu Leu Ile Asn Gln Pro Gln Asp
130 135 140
Leu Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly
145 150 155 160
Asn Leu Thr Thr His Arg Asn Ile Ile Lys Thr Val Cys Asn Asn Gly
165 170 175
Tyr Ile Glu Ile Thr Thr Glu Asp Arg Leu Leu Gln Leu Ser Asn Tyr
180 185 190
Ala Phe Asp Gly Ser Thr Phe Asp Ile Phe Ser Ser Leu Leu His Gly
195 200 205
Ala Thr Leu Val Leu Val Pro Lys Glu Val Ile Leu Asn Pro Thr Asp
210 215 220
Leu Val Thr Leu Ile Arg Glu Gln Gln Ile Thr Val Ser Phe Met Thr
225 230 235 240
Thr Ser Leu Phe Asn Ala Leu Val Glu Leu Asp Val Ser Ser Phe Gln
245 250 255
Asn Met Arg Lys Ile Ala Phe Gly Gly Glu Lys Ala Ser Phe Lys His
260 265 270
Val Glu Lys Ala Leu Asp Phe Leu Gly Asn Gly Arg Leu Val Asn Gly
275 280 285
Tyr Gly Pro Thr Glu Thr Thr Val Phe Ala Thr Thr Tyr Thr Val Asp
290 295 300
Glu Arg Ile Lys Glu Trp Gly Ile Ile Pro Ile Gly Arg Pro Leu His
305 310 315 320
Asn Thr Thr Val His Ile Leu Ser Ala Asp Asp Lys Leu Gln Pro Ile
325 330 335
Gly Val Ile Gly Glu Leu Cys Val Ser Gly Glu Gly Leu Ala Arg Gly
340 345 350
Tyr Leu Asn Leu Pro Glu Leu Thr Met Glu Arg Phe Val Glu Asn Pro
355 360 365
Phe Arg Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Leu Ala Arg Trp
370 375 380
Leu Pro Asp Gly Val Leu Glu Tyr Val Gly Arg Lys Asp Glu Gln Val
385 390 395 400
Lys Ile Arg Gly His Arg Ile Glu Leu Ser Glu Ile Glu Thr Arg Ile
405 410 415
Leu Glu His Pro Ala Ile Ser Glu Thr Val Leu Leu Ala Lys Arg Asn
420 425 430
Glu Gln Gly Ser Ser Tyr Leu Cys Ala Tyr Ile Val Ala His Gly Gln
435 440 445
Trp Asn Val Gln Glu Leu Arg Lys His Val Arg Asp Ala Leu Pro Glu
450 455 460
His Met Val Pro Ser Tyr Phe Ile Gly Leu Asp Lys Leu Pro Leu Thr
465 470 475 480
Ser Asn Gly Lys Val Asp Lys Arg Ala Leu Pro Glu Pro Glu Gly Ser
485 490 495
Leu Gln Leu Thr
500
<210> 10
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 10
Glu Lys Gln Leu Val Glu Ile Val Ala Glu Val Leu Gly Leu Glu Ala
1 5 10 15
Ser Glu Ile Ser Ile Thr Asp Asn Leu Phe Glu Leu Gly Gly His Ser
20 25 30
Leu Thr Ile Leu Arg Ile Leu Ala Lys Val His Thr Cys Asn Trp Lys
35 40 45
Leu Glu Met Lys Asp Phe Tyr Asn Cys Lys Asn Leu Glu Glu Ile Ala
50 55 60
Ser Lys Ala Ser
65
<210> 11
<211> 253
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 11
Val Leu Leu Leu Gly Ser Thr Gly Phe Leu Gly Ile His Leu Leu His
1 5 10 15
Glu Leu Leu Gln Lys Thr Glu Ala Thr Ile Leu Cys Val Ile Arg Ala
20 25 30
Glu Asn Asp Glu Ala Ala Met Gln Arg Leu Arg Lys Lys Ile Asp Phe
35 40 45
Tyr Phe Thr Ser Gln Tyr Ser Ser Ser Gln Ile Asp Glu Trp Phe Thr
50 55 60
Arg Ile Gln Ile Ile His Gly Asp Ile Thr Gln Asp Asn Phe Gly Leu
65 70 75 80
Glu Ala Lys His Tyr Glu Ser Leu Gly Ala Ile Val Asp Thr Val Ile
85 90 95
His Thr Ala Ala Leu Val Lys His Tyr Gly His Tyr Glu Glu Phe Glu
100 105 110
Arg Ala Asn Val His Gly Thr Gln Gln Val Val Thr Phe Cys Leu Asn
115 120 125
Asn Lys Leu Pro Met His Tyr Val Ser Thr Leu Ser Val Ser Gly Thr
130 135 140
Thr Val Glu Glu Ala Thr Glu Leu Val Glu Phe Thr Glu Lys Asp Phe
145 150 155 160
Tyr Val Gly Gln Asn Tyr Glu Ser Asn Val Tyr Leu Arg Ser Lys Phe
165 170 175
Glu Ala Glu Ala Val Leu Val Gly Gly Met Glu Asn Gly Leu Asp Ala
180 185 190
Arg Ile Tyr Arg Val Gly Asn Leu Thr Gly Arg Phe Gln Asp Gly Trp
195 200 205
Phe Gln Glu Asn Ile Asn Glu Asn Met Phe Tyr Leu Leu Ser Lys Ala
210 215 220
Phe Leu Glu Leu Gly Gly Phe Asp Gln Glu Ile Met Gln Gly Met Val
225 230 235 240
Asp Leu Thr Pro Ile Asp Ile Cys Ala Gln Ala Ile Ile
245 250
<210> 12
<211> 298
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 12
Lys Asp Ile Tyr Thr Leu Ser Pro Leu Gln Lys Gly Met Leu Phe Gln
1 5 10 15
His Leu Lys Glu Glu Ser Thr Ala Tyr Phe Glu Gln Leu His Phe Thr
20 25 30
Ile Lys Gly Gln Leu Tyr Val Asp Ser Phe Glu Ala Ser Phe Gln His
35 40 45
Leu Ile Asn Lys Tyr Asp Val Leu Arg Thr Val Phe Leu Tyr Lys Asn
50 55 60
Met Thr Gln Pro Met Gln Met Val Leu Lys Glu Arg Lys Thr Ser Val
65 70 75 80
His Phe Glu Asp Ile Ser His Leu Asp Ser Lys Ala Val Ser Glu Tyr
85 90 95
Val Glu Glu Phe Lys Asn Gln Asp Arg Glu Lys Gly Phe Glu Leu Ser
100 105 110
Lys Asp Ile Leu Met Arg Phe Ala Ile Leu Lys Ala Gly Ala Glu Ser
115 120 125
Tyr His Leu Ile Trp Ser Phe His His Ile Leu Met Asp Gly Trp Cys
130 135 140
Met Gly Ile Val Leu Gln Asp Leu Phe Arg Met Tyr Gln Gln His Arg
145 150 155 160
Gln Asn Ile Pro Ile Thr Val Glu Ser Val Pro Ala Tyr Ser Glu Tyr
165 170 175
Ile Arg Trp Leu Glu Lys Gln Asn Val Thr Lys Ala Arg Asp Tyr Trp
180 185 190
Lys Asn Tyr Leu Glu Gly Tyr Glu Glu Leu Thr Gly Ile Ile His Leu
195 200 205
Asp Thr Lys His Thr Ser His Asn Asn Glu Val Gln Glu Cys Ala Phe
210 215 220
Thr Leu Asp Lys Asp Ile Thr Glu Gly Leu Thr Gln Leu Ala Arg His
225 230 235 240
Tyr Ser Val Thr Val Asn Thr Leu Phe Gln Thr Ile Trp Gly Met Leu
245 250 255
Leu Gln Lys Tyr Asn Asn Lys Asp Asp Val Val Phe Gly Ala Val Val
260 265 270
Ser Gly Arg Pro Ser Glu Ile His Gly Val Glu Asn Met Val Gly Leu
275 280 285
Phe Ile Asn Thr Val Pro Ile Arg Ile Gln
290 295
<210> 13
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 13
Thr Glu Lys Thr Leu Pro Glu Leu Phe Glu Asp Gln Val Lys Arg Thr
1 5 10 15
Pro Glu Ala Ile Ala Leu Arg Phe Glu Asp Gln Gln Leu Thr Tyr Gln
20 25 30
Glu Leu Asn Gln Arg Val Asn Gln Leu Ala Trp Thr Leu Arg Met Lys
35 40 45
Gly Leu Gln Gln Glu Glu Leu Val Gly Ile Met Val Gln Arg Ser Leu
50 55 60
Glu Met Ile Val Gly Val Leu Ala Val Ile Lys Ala Gly Gly Ala Tyr
65 70 75 80
Val Pro Ile Asp Pro Glu Tyr Pro Leu Asp Arg Ile Gln Tyr Met Leu
85 90 95
Glu Asp Ser Gly Thr Asn Trp Leu Leu Thr Thr Lys Gln Ser Glu Ile
100 105 110
Pro Ser Ile Tyr Gln Gly His Val Leu Tyr Leu Glu Glu Asp Thr Val
115 120 125
Tyr His Glu Arg Ser Ser Asp Val Glu Ile Val Asn Gln Ser Ser Asp
130 135 140
Leu Ala Tyr Ile Ile Tyr Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly
145 150 155 160
Val Met Ile Asp His Arg Ala Val His Asn Leu His Leu Ser Ala Gly
165 170 175
Ile Tyr Gly Ile Ala Gln Gly Ser Gln Val Leu Gln Phe Ala Ser Leu
180 185 190
Ser Phe Asp Ala Ser Val Gly Asp Ile Phe His Ser Leu Leu Thr Gly
195 200 205
Ala Thr Leu His Leu Val Lys Lys Glu Gln Leu Leu Ser Gly His Ala
210 215 220
Phe Met Glu Trp Leu Asp Glu Ala Gly Ile Thr Thr Ile Pro Phe Ile
225 230 235 240
Pro Pro Ser Val Leu Lys Glu Leu Pro Tyr Ala Lys Leu Pro Lys Leu
245 250 255
Lys Thr Ile Ser Thr Gly Gly Glu Glu Leu Pro Ala Asp Leu Val Arg
260 265 270
Ile Trp Gly Ala Asn Arg Thr Phe Leu Asn Ala Tyr Gly Pro Thr Glu
275 280 285
Thr Thr Val Asp Ala Ser Ile Gly Asn Cys Val Glu Met Thr Asp Lys
290 295 300
Pro Ser Ile Gly Thr Pro Thr Val Asn Lys Arg Ala Tyr Ile Leu Asp
305 310 315 320
Gln Tyr Gly His Ile Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Val
325 330 335
Gly Gly Glu Gly Val Ala Arg Gly Tyr Leu His Arg Pro Glu Leu Thr
340 345 350
Asp Glu Lys Phe Val Asn Asp Pro Tyr Val Pro Asn Gly Arg Met Tyr
355 360 365
Lys Thr Gly Asp Leu Ala Arg Trp Leu Pro Asp Gly Thr Ile Glu Phe
370 375 380
Leu Gly Arg Met Asp Gly Gln Val Lys Ile Arg Gly Phe Arg Ile Glu
385 390 395 400
Leu Gly Glu Ile Glu Ala Arg Leu Asn Gln Ala Pro Ser Val Lys Gln
405 410 415
Ala Val Val Leu Ala Arg Ser Gly Glu Gln Lys Gln Val Tyr Leu Cys
420 425 430
Ala Tyr Leu Val Thr Asp Asn Asp Leu Lys Val Ser Ala Leu Arg Lys
435 440 445
Glu Leu Ser Gln Thr Leu Pro Asp Tyr Met Ile Pro Ser Phe Phe Ile
450 455 460
Lys Val Asp Lys Ile Pro Val Thr Val Asn Gly Lys Ile Asp Lys Lys
465 470 475 480
Ala Leu Pro Glu Pro Glu Lys Glu Val Glu Leu Gln Thr Glu Tyr Val
485 490 495
Ala Pro Thr Asn
500
<210> 14
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 14
Glu Glu Ile Leu Val Gln Ile Trp Gln Lys Val Leu Gly Met Glu Arg
1 5 10 15
Val Gly Ile Glu Asp Asn Phe Phe Glu Leu Gly Gly His Ser Ile Lys
20 25 30
Ala Met Met Leu Ala Ser Asn Ile Tyr Lys Glu Leu Lys Ile Asp Leu
35 40 45
Pro Leu Arg Glu Ile Phe Lys His Thr Thr Val Lys Glu Met Ala Arg
50 55 60
Phe Ile Asp Gly
65
<210> 15
<211> 287
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 15
Glu Tyr Tyr Pro Val Ser Ser Ala Gln Lys Arg Met Tyr Val Ile Gln
1 5 10 15
Ser Leu Glu Asp Lys Ala Gln Gly Thr Ser Tyr Asn Met Pro Ser Phe
20 25 30
Tyr Lys Met Lys Gly Ser Val Asp Ala Glu Lys Leu Glu Lys Val Phe
35 40 45
Gln Thr Leu Met Asp Arg His Glu Ser Leu Arg Thr Ser Phe His Met
50 55 60
Ile Glu Glu Gln Leu Val Gln Lys Val His Glu Gln Val Ser Trp Lys
65 70 75 80
Met Asp Met Lys Thr Val Ser Ala Asn Asp Val Ser Arg Leu Lys Asp
85 90 95
Ser Phe Val Gln Pro Phe Asp Ile Ser Thr Ala Pro Leu Phe Arg Ala
100 105 110
Ser Leu Leu Thr Ile Gln Lys Asp Glu His Ile Leu Met Met Asp Val
115 120 125
His His Ile Val Gly Asp Gly Val Ser Thr Thr Ile Leu Phe Gln Glu
130 135 140
Leu Ile Gln Leu Tyr Gln Gly Gln Ala Leu Pro Glu Val Lys Val His
145 150 155 160
Tyr Lys Asp Tyr Ala Val Trp Gln Leu Ser Gln Gln Asp Arg Leu Lys
165 170 175
Glu Ser Glu Asn Phe Trp Leu Gln Gln Phe Ser Gly Glu Leu Pro Val
180 185 190
Leu Glu Leu Pro Thr Asp Tyr Ser Arg Pro Pro Ile Arg Arg Leu Glu
195 200 205
Gly Glu Tyr Val Ser Gln Ser Leu Arg Gly Glu Leu His Glu Ser Val
210 215 220
Lys Ala Phe Met Lys Asn His Glu Val Thr Leu Tyr Met Val Leu Leu
225 230 235 240
Ala Thr Tyr Asn Val Leu Leu His Lys Tyr Thr Asn Gln Arg Asp Ile
245 250 255
Ile Val Gly Thr Pro Val Ser Asp Arg Pro His Pro Asp Val Met Ser
260 265 270
Thr Val Gly Met Phe Val Asn Thr Leu Ala Val Arg Asn Gln Leu
275 280 285
<210> 16
<211> 501
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 16
Pro Thr Lys Ile Thr Tyr Asp Gln Ala Ile Gln Asn Arg Phe Glu Glu
1 5 10 15
Gln Ala Met Lys Thr Pro Asp Ala Val Ala Leu Val Tyr Lys Gly Gln
20 25 30
Glu Leu Thr Tyr Arg Glu Leu Asn Gln Arg Ser Asn Gln Met Ala Arg
35 40 45
Thr Leu Arg Glu His Gly Val Gly Arg Asp Gln Ile Ile Ala Val Met
50 55 60
Ile Asn Arg Ser His Glu Leu Ile Ile Ser Ile Leu Ala Val Leu Lys
65 70 75 80
Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Thr Tyr Pro Leu Asp Arg
85 90 95
Ile Glu His Met Leu Glu Asp Ser Gln Thr Ala Met Leu Leu Thr Gln
100 105 110
Lys Glu Ile Gln Ile Pro Thr Gly Tyr Ser Gly Glu Val Leu Phe Val
115 120 125
Asp Gln Ala Asp Ile Tyr His Glu Asp Ala Thr Asp Leu Ser Ser Met
130 135 140
Asn Gln Pro Ala Asp Leu Ala Tyr Ile Ile Tyr Thr Ser Gly Ser Thr
145 150 155 160
Gly Lys Ser Lys Gly Val Met Ile Glu His Arg Ser Leu His Asn Leu
165 170 175
Ile His Ile Ser His Pro Tyr Lys Met Gly Ala Gly Ser Arg Val Leu
180 185 190
Gln Phe Ala Ser Ser Ser Phe Asp Ala Ser Val Ala Glu Ile Phe Pro
195 200 205
Ala Leu Leu Thr Gly Ser Thr Leu Tyr Ile Glu Glu Lys Glu Glu Leu
210 215 220
Leu Thr Asn Leu Val Pro Tyr Leu Leu Glu Asn Gln Ile Thr Thr Val
225 230 235 240
Ala Leu Pro Pro Ser Leu Leu Arg Ser Val Pro Tyr Arg Glu Leu Pro
245 250 255
Ala Leu Glu Cys Ile Val Ser Val Gly Glu Ala Cys Thr Phe Asp Ile
260 265 270
Val Gln Thr Trp Gly Gln Asn Arg Thr Phe Ile Asn Gly Tyr Gly Pro
275 280 285
Thr Glu Ser Thr Val Cys Ser Thr Phe Gly Val Val Thr Ala Glu Asp
290 295 300
Lys Arg Ile Thr Ile Gly Lys Pro Phe Pro Asn Gln Lys Ala Tyr Ile
305 310 315 320
Ile Asn Glu Asn Gln Gln Leu Gln Pro Ile Gly Val Pro Gly Glu Leu
325 330 335
Cys Ile Ala Gly Ala Gly Leu Ser Arg Gly Tyr Leu Asn Arg Pro Glu
340 345 350
Leu Thr Gln Glu Lys Phe Val Asn Asn Pro Phe Ala Pro Gly Glu Arg
355 360 365
Met Tyr Lys Thr Gly Asp Val Ala Arg Trp Leu Pro Asp Gly Asn Ile
370 375 380
Glu Tyr Val Gly Arg Met Asp Asp Gln Val Lys Val Arg Gly Asn Arg
385 390 395 400
Val Glu Leu Gly Glu Val Thr Ser Gln Leu Leu Thr His Pro Ser Ile
405 410 415
Thr Glu Ala Val Val Val Pro Ile Val Asp Thr His Gly Ala Thr Thr
420 425 430
Leu Cys Ala Tyr Phe Ile Glu Asp Lys Glu Val Lys Val Asn Asp Leu
435 440 445
Arg His His Leu Ala Lys Ala Leu Pro Glu Phe Met Ile Pro Ala Tyr
450 455 460
Phe Ile Lys Val Asp His Ile Pro Leu Thr Gly Asn Gly Lys Val Asn
465 470 475 480
Lys Gln Ala Leu Pro Asp Pro Ser Glu Phe Ile Ser Ala Gln Thr Gly
485 490 495
His Glu Ile Val Ala
500
<210> 17
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 17
Glu Glu Ile Leu Val Gln Val Trp Glu Glu Val Leu Gln Ile Lys Pro
1 5 10 15
Ile Gly Val Glu Asp Asn Phe Phe Glu Arg Gly Gly Asp Ser Ile Lys
20 25 30
Ala Leu Gln Ile Val Ala Arg Leu Ser Lys Tyr Asn Arg Lys Leu Asp
35 40 45
Ser Arg His Ile Phe Lys Asn Pro Thr Ile Ser Met Leu Ala Pro Tyr
50 55 60
Leu Glu Gln Arg
65
<210> 18
<211> 296
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 18
Glu Gly Glu Val Pro Leu Thr Pro Ile Gln Ser Trp Phe Phe Glu Gln
1 5 10 15
Pro Phe Val His Pro His His Phe Asn Gln Ser Met Leu Leu Pro Asn
20 25 30
Glu Gln Gly Trp Asp Arg Gln Arg Ile Glu Gln Ala Phe Thr Thr Ile
35 40 45
Val Lys His His Asp Ala Leu Arg Met Lys Tyr Gln Phe Arg Glu Lys
50 55 60
Ile Ile Gln Glu Asn Gln Gly Ile Glu Gly Glu Phe Phe Thr Leu His
65 70 75 80
Glu Val Asp Val Thr Lys Glu Arg Asp Trp Gln Met Arg Ile Glu Arg
85 90 95
Glu Ala Asn Gln Leu Gln Ala Ser Phe Asp Leu Thr Thr Gly Pro Leu
100 105 110
Val Lys Leu Gly Leu Tyr His Thr Ala Tyr Gly Asp Tyr Leu Leu Ile
115 120 125
Val Val His His Leu Leu Ile Asp Gly Val Ser Trp Arg Ile Leu Leu
130 135 140
Glu Asp Phe Gln Thr Leu Tyr Glu Gln Lys Gly Glu Leu Pro Ala Lys
145 150 155 160
Thr Thr Ser Phe Lys Ala Trp Ala Val Gln Leu Glu Gly Tyr Ala Arg
165 170 175
Ser Lys Lys Leu Gln Asp Glu Ala Ser Tyr Trp Lys Gly Leu Leu Asn
180 185 190
Lys Ser Val Arg Glu Leu Pro Ala Asp Lys Glu Ser Ser Asp Thr Phe
195 200 205
Leu Phe Gly Asp Thr Lys Glu Val Gln Leu Thr Phe Asp Ile Asn Glu
210 215 220
Thr Gln Asp Leu Leu Thr Asp Ala His His Ala Tyr Lys Thr Lys Ala
225 230 235 240
Asp Asp Leu Leu Leu Ala Ala Leu Val Leu Ser Ile Asn Glu Trp Thr
245 250 255
Lys Gln Ser Asp Ile Ile Val Asn Leu Glu Gly His Gly Arg Glu Thr
260 265 270
Ile Val Glu Gly Ile Asp Leu Ser Arg Thr Ile Gly Trp Phe Thr Thr
275 280 285
Ile Tyr Pro Val Leu Phe Glu Val
290 295
<210> 19
<211> 296
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 19
Lys Asp Ile Tyr Ser Leu Ser Pro Leu Gln Lys Gly Met Leu Phe His
1 5 10 15
Ser Met Lys Asp Pro Gln Ser Asp Ala Tyr Phe Glu Gln Val Thr Leu
20 25 30
Leu Leu Glu Gly Val Val Asn Pro Thr Tyr Leu Ala Glu Ser Ile Gln
35 40 45
Gly Leu Val Gln Lys Tyr Asp Met Phe Arg Ser Val Phe Arg Tyr Lys
50 55 60
Lys Val Asp Pro Val Gln Val Val Leu Ser Glu Arg Lys Ile Asp Leu
65 70 75 80
Gln Ile Glu Asp Leu Thr Gln Ile Asn Glu Glu Glu Gln Arg Lys Phe
85 90 95
Ile Glu Glu Tyr Arg Lys Lys Asp Arg Glu Arg Gly Phe Asp Leu Ser
100 105 110
Arg Asp Ile Leu Leu Arg Phe Thr Leu Phe Gln Thr Ala Ala Asn Arg
115 120 125
Tyr Glu Leu Leu Trp Ser His His His Ile Leu Met Asp Gly Trp Cys
130 135 140
Thr Gly Ile Val Phe Gln Asp Leu Phe Gln Met Tyr Gln Arg Arg Leu
145 150 155 160
Ser Gly Gln Ala Leu Leu Pro Glu Val Ala Pro Gln Tyr Ser Glu Tyr
165 170 175
Ile Arg Trp Leu Lys Lys Gln Asp Asp Gln Gln Ala Leu Ala Phe Trp
180 185 190
Lys Glu Tyr Leu Gln Gly Phe Glu Asn Leu Thr Gly Ile Pro Arg Leu
195 200 205
Arg Ser Gly Asn His Pro Tyr Lys Gln Glu Glu Phe Ile Phe Ser Leu
210 215 220
Gly Glu Glu Ala Thr Gln Lys Leu Thr Gln Thr Ala Gln Lys Tyr Gln
225 230 235 240
Val Thr Leu Asn Thr Val Val Gln Thr Ile Trp Gly Ala Leu Leu Gln
245 250 255
Lys Tyr Asn Asn Thr Asn Asp Ala Ala Tyr Gly Val Val Val Ser Gly
260 265 270
Arg Pro Ala Glu Val Pro Asn Val Glu Gln Met Val Gly Leu Phe Ser
275 280 285
Asn Thr Ile Pro Ile Arg Ile Lys
290 295
<210> 20
<211> 504
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 20
Lys Tyr Pro Arg Glu Lys Thr Ile His Glu Leu Phe Gln Glu Gln Val
1 5 10 15
Asp Lys Asn Pro Asp Gln Val Ala Leu Val Phe Gly Glu Ala Gln Leu
20 25 30
Thr Tyr Arg Glu Leu Asn Glu Lys Ala Asn Gln Met Ala Arg Gly Leu
35 40 45
Arg Lys Gln Gly Val Leu Pro Asp Gln Val Ile Gly Leu Leu Thr Asp
50 55 60
Arg Ser Leu Glu Met Ile Ile Ala Ile Leu Ala Ile Phe Lys Ala Gly
65 70 75 80
Gly Ala Tyr Met Pro Ile Asp Pro Ser Tyr Pro Ser Glu Arg Ile Gln
85 90 95
Tyr Met Leu Ala Asp Ser Arg Thr His Leu Leu Leu Val Gln Lys Ala
100 105 110
Glu Met Ile Pro Ala Asn Tyr Gln Gly Glu Val Leu Leu Leu Thr Glu
115 120 125
Asp Ser Trp Met Asp Glu Asn Thr Asp Asn Leu Asp Leu Val Asn Gln
130 135 140
Ala Gln Asp Leu Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Lys
145 150 155 160
Pro Lys Gly Asn Leu Thr Thr His Gln Asn Ile Val Lys Thr Ile Met
165 170 175
Asn Asn Gly Tyr Met Glu Ile Thr Pro Asn Asp Arg Leu Leu Gln Leu
180 185 190
Ser Asn Tyr Ala Phe Asp Gly Ser Thr Phe Asp Ile Tyr Ser Ala Leu
195 200 205
Leu Asn Gly Ala Ser Leu Ile Leu Val Pro Thr His Val Leu Met Asn
210 215 220
Pro Thr Asp Leu Ala Ser Val Ile Gln Asp Gln His Ile Thr Val Ser
225 230 235 240
Phe Met Thr Thr Ser Leu Phe Asn Thr Leu Val Glu Leu Asp Val Thr
245 250 255
Ser Leu Lys His Met Arg Lys Val Val Phe Gly Gly Glu Lys Ala Ser
260 265 270
Ile Lys His Val Glu Lys Ala Leu Asp Tyr Leu Gly Ala Gly Arg Leu
275 280 285
Val Asn Gly Tyr Gly Pro Thr Glu Thr Thr Val Phe Ala Thr Thr Tyr
290 295 300
Thr Val Asp His Thr Ile Lys Glu Thr Gly Ile Met Pro Ile Gly Arg
305 310 315 320
Pro Leu Asn Asn Thr Lys Val Phe Ile Leu Gly Ala Asp Asn Gln Leu
325 330 335
Gln Pro Ile Gly Ala Leu Gly Glu Leu Cys Val Ser Gly Glu Gly Leu
340 345 350
Ala Arg Gly Tyr Leu Asn Leu Pro Glu Leu Thr Ala Asp Arg Phe Val
355 360 365
Glu Asn Pro Phe Met Gln Gly Glu Arg Met Tyr Arg Thr Gly Asp Leu
370 375 380
Ala Arg Trp Leu Pro Asp Gly Ser Ile Glu Tyr Val Gly Arg Ile Asp
385 390 395 400
Glu Gln Val Lys Ile Arg Gly His Arg Ile Glu Leu Gly Glu Ile Glu
405 410 415
Ala Arg Leu Leu Glu His Pro Ala Ile Ser Glu Thr Val Leu Leu Ala
420 425 430
Lys Gln Asp Glu Gln Gly His Ser Phe Leu Cys Ala Tyr Leu Val Thr
435 440 445
Asn Gly Ala Trp Ser Val Ala Glu Leu Arg Lys His Ile Lys Glu Thr
450 455 460
Leu Pro Asp Ser Met Val Pro Ser Tyr Phe Ile Glu Ile Asp Lys Met
465 470 475 480
Pro Leu Thr Ser Asn Gly Lys Ala Asp Lys Arg Ala Leu Pro Glu Pro
485 490 495
Asp Val Gln Gln Val Ser Ser Tyr
500
<210> 21
<211> 69
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 21
Glu Glu Lys Leu Val Gln Leu Phe Gln Glu Ile Leu Ser Val Glu Gln
1 5 10 15
Val Gly Thr Gln Asp Asn Phe Phe Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Met Leu Val Ser Arg Met His Lys Glu Leu Asp Ile Glu Val
35 40 45
Pro Leu Lys Asp Val Phe Ala Arg Pro Ser Val Lys Glu Leu Ala Ala
50 55 60
Phe Leu Thr Asn Thr
65
<210> 22
<211> 286
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 22
Phe Tyr Pro Val Ser Ser Ala Gln Arg Arg Met Tyr Val Val Glu Gln
1 5 10 15
Ile Gly Ser Ser Asn Thr Thr Ser Tyr Asn Met Pro Phe Leu Leu Glu
20 25 30
Ile Gly Gly Ala Leu Asp Val Val Gly Leu Gln Lys Ala Leu Lys Lys
35 40 45
Leu Val Ile Arg His Glu Ser Leu Arg Thr Ser Phe His Met Val Asp
50 55 60
Glu Val Leu Met Gln Lys Ile His Pro Asp Val Glu Trp Asp Leu Met
65 70 75 80
Val Met Glu Ala Lys Asp Glu Asp Leu Pro Gln Ile Ile Asp Gly Phe
85 90 95
Ile Gln Pro Phe Asp Leu Ser Asp Ala Ser Leu Phe Arg Ala Gly Leu
100 105 110
Val Arg Met Glu Ala Asp Arg His Leu Leu Met Leu Asp Met His His
115 120 125
Ile Ile Ser Asp Gly Val Ser Thr Asn Val Leu Phe Gln Asp Leu Met
130 135 140
Gln Ile Tyr Gln Gly Lys Glu Leu Pro Ser Leu Arg Ile Gln Tyr Lys
145 150 155 160
Asp Tyr Ala Val Trp Gln Gln Ala Glu Ala Gln Val Asn Arg Leu Arg
165 170 175
Glu Gln Glu Gln Tyr Trp Leu Asn Gln Phe Ser Gly Glu Leu Pro Val
180 185 190
Leu Glu Met Pro Thr Asp Tyr Thr Arg Pro Ser Ile Gln Gln Ser Glu
195 200 205
Gly Asp Ile Trp Ser Phe Glu Ile Ser Ala Glu Ile Ile Asn Lys Val
210 215 220
Lys Lys Leu Ser Ser Ser Gln Gly Thr Thr Leu Tyr Met Thr Leu Leu
225 230 235 240
Ala Ala Tyr Gln Val Leu Leu Ser Lys Tyr Thr Gly Gln Glu Asp Val
245 250 255
Ile Val Gly Ser Pro Ile Ala Gly Arg Pro His Ala Asp Val Glu Lys
260 265 270
Ile Val Gly Met Phe Val Asn Thr Leu Ala Phe Arg Gly Gln
275 280 285
<210> 23
<211> 501
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 23
Gln Ser Asp Tyr Pro Val Asn Lys Thr Val His Gln Leu Phe Glu Glu
1 5 10 15
Gln Val Gln Asn Met Pro Asp Gln Lys Ala Ile Val Phe Gly Glu Glu
20 25 30
Gln Val Thr Tyr Lys Glu Leu Asn Ala Lys Ala Asn His Leu Ala Thr
35 40 45
Leu Leu Lys Gln Lys Gly Ile Thr Asn Glu Gln Leu Val Ala Val Met
50 55 60
Ile Glu Pro Ser Ile Glu Phe Phe Val Gly Ile Leu Ala Val Leu Lys
65 70 75 80
Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Thr Tyr Pro Ala Glu Arg
85 90 95
Ile Ala Tyr Ile Leu Glu Asp Ser Gln Ser Lys Val Leu Leu Val Arg
100 105 110
Gly His Glu Gln Val Gln Thr Gln Phe Ala Gly Glu Ile Leu Glu Thr
115 120 125
Asp Ser Lys Lys Leu Ser Thr Glu Glu Leu Lys Asp Val Pro Met Asn
130 135 140
Asn Lys Val Thr Asp Leu Ala Tyr Val Ile Tyr Thr Ser Gly Ser Thr
145 150 155 160
Gly Gln Pro Lys Gly Val Met Val Glu His Arg Ser Leu Met Asn Leu
165 170 175
Ser Ala Trp His Val Gln Tyr Phe Gly Ile Thr Lys Asp Asp Arg Ser
180 185 190
Ile Lys Tyr Ala Gly Val Gly Phe Asp Ala Ser Val Trp Glu Val Phe
195 200 205
Pro Tyr Leu Ile Ala Gly Ala Thr Ile Tyr Val Ile Asp Gln Glu Thr
210 215 220
Arg Tyr Asp Val Glu Lys Leu Asn Gln Tyr Val Thr Asp Gln Gly Ile
225 230 235 240
Thr Ile Ser Phe Leu Pro Thr Gln Phe Ala Glu Gln Phe Met Leu Thr
245 250 255
Asp His Thr Asp His Thr Ala Leu Arg Trp Leu Leu Ile Gly Gly Asp
260 265 270
Lys Ala Gln Gln Ala Val Gln Gln Lys Lys Tyr Gln Ile Val Asn Asn
275 280 285
Tyr Gly Pro Thr Glu Asn Thr Val Val Thr Thr Ser Tyr Ile Val Ser
290 295 300
Pro Glu Asp Lys Lys Ile Pro Ile Gly Arg Pro Ile Ala Asn Asn Gln
305 310 315 320
Val Phe Ile Leu Asn Lys Glu Asn Gln Leu Gln Pro Val Gly Ile Pro
325 330 335
Gly Glu Leu Cys Val Ser Gly Asp Ser Leu Ala Arg Gly Tyr Leu His
340 345 350
Arg Pro Glu Leu Thr Ser Glu Arg Phe Val Ala Asn Pro Phe Val Pro
355 360 365
Gly Glu Arg Met Tyr Lys Thr Gly Asp Ile Ala Arg Trp Leu Pro Asp
370 375 380
Gly Asn Ile Glu Tyr Leu Gly Arg Leu Asp Asp Gln Ile Lys Ile Arg
385 390 395 400
Gly Tyr Arg Val Glu Leu Gly Glu Ile Glu Ser Ala Ile Leu Glu His
405 410 415
Glu Ala Ile Gln Glu Thr Val Val Leu Ala Arg Gln Asp Asp Gln Asn
420 425 430
Gln Thr Tyr Leu Cys Ala Tyr Val Val Pro Lys Lys Ser Phe Asp Val
435 440 445
Ala Glu Leu Arg Gln Tyr Leu Gly Arg Lys Leu Pro His Phe Met Ile
450 455 460
Pro Ala Phe Phe Thr Glu Met Thr Glu Phe Pro Ile Thr Ser Asn Gly
465 470 475 480
Lys Val Asp Lys Lys Ala Leu Pro Leu Pro Asp Leu Ser Lys Gln Ser
485 490 495
Glu Ile Asp Tyr Val
500
<210> 24
<211> 67
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 24
Glu Glu Thr Leu Ala Glu Leu Trp Thr Glu Val Leu Gly Val Ser Gln
1 5 10 15
Val Gly Ile His Asp Asn Phe Phe Lys Leu Gly Gly Asp Ser Ile Lys
20 25 30
Ala Ile Gln Ile Ala Ala Arg Leu Asn Thr Lys Gln Leu Lys Leu Glu
35 40 45
Val Lys Asp Leu Phe Gln Ala Gln Thr Ile Ala Gln Val Ile Pro Tyr
50 55 60
Ile Lys Thr
65
<210> 25
<211> 301
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 25
Gly Lys Val Glu Leu Thr Pro Ile Gln Glu Trp Phe Phe Gln Gln Ser
1 5 10 15
Phe Asp Ile Pro His His Trp Asn Gln Ser Met Met Phe Tyr Arg Lys
20 25 30
Glu Gly Trp Asp Gln His Val Val Gln Arg Val Phe Gln Lys Ile Ala
35 40 45
Glu His His Asp Ala Leu Arg Met Ala Tyr Gln Gln Glu Asn Gly Lys
50 55 60
Thr Ile Gln Ile Asn Arg Gly Val Glu Gly Lys Leu Phe Glu Leu Ser
65 70 75 80
Ile Phe Asp Phe Arg Gln Gln Ala Asn Val Pro Glu Leu Ile Glu Gln
85 90 95
Ala Ala Asn Arg Leu Gln Ser Ala Met Asn Leu Gln Asp Gly Pro Leu
100 105 110
Val Gln Leu Gly Leu Phe Gln Thr Ser Glu Gly Asp His Leu Leu Ile
115 120 125
Ala Ile His His Leu Val Val Asp Ala Val Ser Trp Arg Ile Ile Thr
130 135 140
Glu Asp Phe Met Asn Gly Tyr Gln Gln Asp Leu Gln Gly Glu Pro Ile
145 150 155 160
Ala Phe Thr Ser Lys Thr Asp Ser Tyr Gln Lys Trp Ala Lys Ser Leu
165 170 175
Leu Glu Tyr Ala Thr Ser Glu Glu Ile Gln Ser Glu Leu Lys Tyr Trp
180 185 190
Gln Ser Met Ile Ala Lys Gly Leu Pro Ala Leu Pro Arg Asp Ser Lys
195 200 205
Gly Gly Ala Pro Tyr Leu Leu Lys Asp Ile Gln Glu Val Ala Ile Gln
210 215 220
Leu Thr Lys Glu Gln Thr Asn Lys Leu Leu Thr Asp Ala His Asn Ala
225 230 235 240
Tyr Asn Thr Gln Ile Asn Asp Leu Leu Leu Thr Ala Leu Ala Leu Thr
245 250 255
Ile Gln Glu Trp Ala Gln Thr Asn Ser Ile Ala Ile Thr Leu Glu Gly
260 265 270
His Gly Arg Glu Asp Ile Gly Val Asp Ile Asp Ile Asn Arg Thr Val
275 280 285
Gly Trp Phe Thr Ser Met Tyr Pro Val Val Phe Asp Leu
290 295 300
<210> 26
<211> 289
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 26
Asn Leu Tyr Arg Leu Ser Pro Met Gln Lys Gly Ile Leu Phe His Ser
1 5 10 15
Leu Lys Asp Lys Glu Asn His Ala Tyr Phe Asp Gln Leu Ile Phe Thr
20 25 30
Leu Glu Gly Lys Val Glu Leu Glu Tyr Leu Glu Glu Ala Phe Asn Gln
35 40 45
Leu Ile Lys Lys His Asp Ile Leu Arg Thr Val Phe Arg Tyr Lys Lys
50 55 60
Val Lys Glu Pro Val Gln Met Val Leu Lys Glu Arg Ser Ser Thr Ile
65 70 75 80
Tyr Phe Glu Asp Ile Ser His Leu Glu Pro Glu Glu Lys Val Asn Tyr
85 90 95
Ile Lys Gln Phe Lys Met Arg Asp Arg Glu Lys Gly Phe Asp Leu Ser
100 105 110
Arg Asp Leu Leu Ile Arg Met Ser Leu Phe Lys Leu Asp Gln Glu Gln
115 120 125
Tyr Gln Leu Ile Met Ser Asn His His Ile Ile Met Asp Gly Trp Cys
130 135 140
Leu Gly Ile Ile Leu Thr Asp Phe Leu Arg Met Tyr Lys Gly Ile Val
145 150 155 160
Asn His Thr Pro Val Pro Tyr Glu His Val Thr Pro Tyr Ser Lys His
165 170 175
Ile Gln Trp Leu Glu Lys Gln Asp His Gln Glu Ala Lys Asp Phe Tyr
180 185 190
Gln Gln Leu Leu Glu Gly Tyr Asp Lys Val Thr Gly Val Pro Gln Gln
195 200 205
Leu Val Arg Ala Asn His Glu Glu Tyr Thr His Gly Gln Cys Ile Val
210 215 220
Lys Leu Asn Gln Glu Thr Ala Asp Arg Leu Ile Ala Ile Ala Lys Thr
225 230 235 240
Tyr Gln Val Thr Val Asn Thr Val Phe Gln Thr Ile Trp Gly Ile Leu
245 250 255
Leu Gln Lys Tyr Asn Asn Thr Asp Asp Val Val Phe Gly Ser Val Val
260 265 270
Ser Gly Arg Pro Ala Glu Ile Pro Asp Val Glu Lys Met Val Gly Leu
275 280 285
Phe
<210> 27
<211> 502
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 27
Glu Lys Thr Ile His Gln Leu Phe Glu Glu Gln Val Asp Lys Asn Pro
1 5 10 15
Asn Gln Ile Ala Leu Val Phe Lys Glu Glu Lys Leu Thr Tyr Gly Glu
20 25 30
Val Asn Ala Lys Ala Asn Gln Leu Ala Tyr Val Leu Ser Lys Gln Gly
35 40 45
Val Gln Pro Asn Asp Val Ile Gly Ile Ile Thr Glu Arg Ser Pro Glu
50 55 60
Met Ile Ile Gly Ile Leu Ala Ile Phe Lys Ala Gly Gly Ala Tyr Met
65 70 75 80
Pro Ile Asp Pro Thr Tyr Pro Ala Glu Arg Ile Glu Tyr Met Leu Gln
85 90 95
Asp Asn Gln Thr Lys Leu Leu Leu Val Gln Lys Gln Glu Met Ile Pro
100 105 110
Ala Asn Tyr Gln Gly Glu Val Leu Phe Leu Thr Gln Glu Ser Trp Met
115 120 125
His Glu Glu Thr Ser Asn Pro Ala His Ile Thr Gln Ala Gln Ala Leu
130 135 140
Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Glu Pro Lys Gly Ile
145 150 155 160
Leu Thr Thr His Gln Asn Ile Met Lys Thr Val Ile His Asn Gly Tyr
165 170 175
Val Glu Ile Thr Pro Gly Asp Cys Leu Ser Gln Leu Ser Asn Tyr Ala
180 185 190
Phe Asp Gly Ser Thr Phe Glu Ile Tyr Gly Ala Leu Leu His Gly Ala
195 200 205
Thr Leu Leu Leu Val Thr Lys Glu Ala Val Leu Asn Met Asn Glu Leu
210 215 220
Ala Arg Leu Ile Lys Lys Glu Gln Val Thr Val Ser Phe Met Thr Thr
225 230 235 240
Ala Leu Phe Asn Thr Leu Val Asp Leu Asp Ile Thr Cys Phe Gln Ser
245 250 255
Val Arg Lys Val Leu Phe Gly Gly Glu Leu Ala Ser Val Lys His Val
260 265 270
Leu Lys Ala Leu Asp Tyr Leu Gly Glu His Arg Val Ile Asn Val Tyr
275 280 285
Gly Pro Thr Glu Thr Thr Val Tyr Ala Thr Tyr Tyr Ser Val Asp His
290 295 300
Ser Met Leu Thr Arg Ala Ser Val Pro Ile Gly Arg Pro Ile Asn Asn
305 310 315 320
Thr Lys Ala Tyr Ile Leu Asn Thr Asp Gly Gln Pro Gln Pro Ile Gly
325 330 335
Val Val Gly Glu Leu Cys Ile Gly Gly Glu Gly Val Ala Cys Gly Tyr
340 345 350
Leu Asn Arg Pro Glu Leu Thr Lys Lys His Phe Val Asp Asn Pro Phe
355 360 365
Val Leu Gly Glu Arg Met Tyr Cys Thr Gly Asp Leu Ala Arg Phe Leu
370 375 380
Pro Asp Gly Asn Ile Glu Tyr Ile Gly Arg Met Asp Glu Gln Val Lys
385 390 395 400
Ile Arg Gly His Arg Ile Glu Leu Gly Glu Ile Glu Lys Val Leu Leu
405 410 415
Gln His Pro Ala Ile Ser Glu Thr Val Leu Leu Ala Lys Arg Asp Glu
420 425 430
Gln Gly His Ser Tyr Leu Cys Ala Tyr Ile Val Gly Gln Ala Phe Trp
435 440 445
Thr Val Thr Glu Leu Arg Gln His Leu Met Glu Ser Leu Pro Glu Tyr
450 455 460
Met Val Pro Ser Tyr Phe Ile Glu Ile Glu Lys Leu Pro Leu Thr Ala
465 470 475 480
Asn Gly Lys Val Asp Lys Arg Ala Leu Pro Glu Pro Asp Arg Lys Met
485 490 495
Gly Ser Ala Tyr Val Ala
500
<210> 28
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 28
Glu Glu Lys Leu Val Gln Phe Phe Gln Glu Ile Leu Gly Val Glu Arg
1 5 10 15
Val Gly Thr Gln Asp Thr Phe Phe Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Met Leu Val Leu Gln Ile His Lys Glu Met Gly Ile Glu Val
35 40 45
Pro Leu Lys Glu Ile Phe Thr Arg Pro Thr Ile Lys Glu Leu Ala Ala
50 55 60
Tyr Ile His Lys
65
<210> 29
<211> 287
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 29
Glu Tyr Tyr Pro Val Ser Phe Ala Gln Arg Arg Met Phe Val Val Gln
1 5 10 15
Gln Ile Arg Asp Thr Asn Thr Thr Ser Tyr Asn Met Pro Ile Leu Leu
20 25 30
Glu Ile Glu Gly Ala Leu Asp Arg Glu Asn Val Arg Gln Thr Leu Lys
35 40 45
Lys Leu Ile Glu Arg His Glu Ser Met Arg Thr Ser Phe His Met Ile
50 55 60
Asp Glu Thr Leu Leu Gln Lys Val His Asp Asp Val Thr Trp Glu Met
65 70 75 80
Glu Glu Met Glu Ala Ser Glu Glu Glu Val Tyr Ala Leu Thr Lys Ser
85 90 95
Phe Ile Arg Pro Phe Asp Leu Gly Gln Ala Pro Leu Phe Arg Ala Gly
100 105 110
Leu Ile Arg Val Asn Ser Glu Arg His Leu Leu Leu Leu Asp Thr His
115 120 125
His Ile Ile Ser Asp Gly Val Ser Thr Asn Ile Leu Phe Gln Asp Phe
130 135 140
Thr Gln Leu Tyr Arg Gly Arg Glu Leu Pro Ala Leu Arg Ile Gln Tyr
145 150 155 160
Lys Asp Phe Ala Val Trp Gln Gln Gly Glu Ala Gln Leu Ala Arg Leu
165 170 175
Gln Glu Gln Glu Glu Tyr Trp Leu Lys Gln Phe Ser Glu Ser Val Pro
180 185 190
Val Leu Glu Leu Pro Thr Asp Phe Pro Arg Pro Ala Met Gln Gln Phe
195 200 205
Asp Gly Asp Val Leu Asp Phe Ala Leu Asn Arg Gln Val Trp Gln Glu
210 215 220
Leu Gln Gln Leu Ile Val Lys Glu Gly Cys Thr Ala Tyr Met Ile Leu
225 230 235 240
Leu Ala Ala Tyr His Val Leu Leu Ser Lys Tyr Ser Ser Gln Asn Asp
245 250 255
Ile Val Ile Gly Ser Pro Ile Ala Gly Arg Thr Asn Ala Asp Leu Gln
260 265 270
Ser Ile Val Gly Met Phe Val Asn Thr Leu Ala Ile Arg Thr Lys
275 280 285
<210> 30
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 30
Lys Thr Ile His Gln Gln Phe Glu Gln Lys Val Glu Glu Asn Pro Asp
1 5 10 15
Gln Ile Ala Leu Leu Phe Lys Asp Lys Glu Ile Thr Tyr Gly Gln Leu
20 25 30
Asn Ala Lys Ala Asn Gln Phe Ala Arg Val Leu Arg Lys His Gly Val
35 40 45
Gln Pro Asp Gln Val Val Gly Leu Ile Thr Asp Arg Ser Ile Glu Met
50 55 60
Met Ile Gly Ile Leu Ala Ile Leu Lys Ala Gly Gly Ala Tyr Leu Pro
65 70 75 80
Ile Asp Pro Ser Tyr Pro Val Glu Arg Ile Thr Tyr Met Leu Glu Asp
85 90 95
Ser Gln Ala Gln Leu Leu Ile Val Gln Glu Ala Ala Met Ile Pro Glu
100 105 110
Gly Tyr Gln Gly Lys Val Leu Leu Leu Val Glu Glu Cys Trp Ile Gln
115 120 125
Glu Glu Ala Ser Asn Leu Glu Leu Ile Asn Gly Ala Gln Asp Leu Ala
130 135 140
Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Asn Leu
145 150 155 160
Thr Thr His Gln Asn Ile Leu Arg Thr Ile Ile Asn Asn Gly Phe Ile
165 170 175
Glu Ile Val Pro Ala Asp Arg Leu Leu Gln Leu Ser Asn Tyr Ala Phe
180 185 190
Asp Gly Ser Thr Phe Asp Ile Tyr Ser Ala Leu Leu Asn Gly Ala Thr
195 200 205
Leu Val Leu Val Pro Lys Glu Val Met Leu Asn Pro Met Glu Leu Ala
210 215 220
Lys Ile Val Arg Glu Gln Asp Ile Thr Val Ser Phe Met Thr Thr Ser
225 230 235 240
Leu Phe His Thr Leu Val Glu Leu Asp Val Thr Ser Met Lys Ser Met
245 250 255
Arg Lys Val Val Phe Gly Gly Glu Lys Ala Ser Tyr Lys His Val Glu
260 265 270
Lys Ala Leu Asp Tyr Leu Gly Glu Gly Arg Leu Val Asn Gly Tyr Gly
275 280 285
Pro Thr Glu Thr Thr Val Phe Ala Thr Thr Tyr Thr Val Asp Ser Ser
290 295 300
Ile Lys Glu Thr Gly Ile Val Pro Ile Gly Arg Pro Leu Asn Asn Thr
305 310 315 320
Ser Val Tyr Ile Leu Asn Glu Asn Asn Gln Pro Gln Pro Ile Gly Val
325 330 335
Pro Gly Glu Leu Cys Val Gly Gly Ala Gly Ile Ala Arg Gly Tyr Leu
340 345 350
Asn Arg Pro Glu Leu Thr Ala Glu Arg Phe Val Asp Asn Pro Phe Leu
355 360 365
Val Gly Asp Arg Met Tyr Arg Thr Gly Asp Met Ala Arg Phe Leu Pro
370 375 380
Asp Gly Asn Ile Glu Tyr Ile Gly Arg Met Asp Glu Gln Val Lys Ile
385 390 395 400
Arg Gly His Arg Ile Glu Leu Gly Glu Ile Glu Lys Ser Leu Leu Glu
405 410 415
Tyr Pro Ala Ile Ser Glu Ala Val Leu Val Ala Lys Arg Asp Glu Gln
420 425 430
Gly His Ser Tyr Leu Cys Ala Tyr Val Val Ser Thr Asp Gln Trp Thr
435 440 445
Val Ala Gln Val Arg Gln His Leu Leu Glu Ala Leu Pro Glu Tyr Met
450 455 460
Val Pro Ser Tyr Phe Val Glu Leu Glu Lys Leu Pro Leu Thr Ser Asn
465 470 475 480
Gly Lys Val Asp Lys Arg Ala Leu Pro Glu Pro Asp Arg Val Ile Thr
485 490 495
Asn Glu Tyr Val
500
<210> 31
<211> 69
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 31
Glu Glu Lys Leu Val Gln Phe Phe Gln Glu Ile Leu Ala Val Asp Arg
1 5 10 15
Val Gly Thr Gln Asp Thr Phe Phe Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Met Leu Val Ser Arg Ile His Lys Glu Leu Glu Ile Glu Val
35 40 45
Pro Leu Lys Glu Val Phe Ala Arg Gln Thr Val Lys Glu Leu Ala Ala
50 55 60
Tyr Ile Arg Gln Ala
65
<210> 32
<211> 287
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 32
Glu Tyr Tyr Pro Val Ser Asn Ala Gln Arg Arg Met Tyr Val Val Gln
1 5 10 15
Gln Met Arg Asp Val Glu Thr Thr Gly Tyr Asn Met Pro Phe Tyr Leu
20 25 30
Glu Met Glu Gly Ala Leu Glu Val Glu Lys Leu Ser Leu Ala Leu Lys
35 40 45
Gln Leu Ile Lys Arg His Glu Ser Leu Arg Thr Ser Phe His Met Val
50 55 60
Glu Asp Glu Leu Met Gln Lys Val His Ala Glu Val Ala Trp Glu Met
65 70 75 80
Glu Met Ile His Ala Val Glu Glu Glu Val Gln Gln Leu Thr Asp Ser
85 90 95
Phe Met Arg Pro Phe Asp Leu Ala Lys Ala Pro Leu Phe Arg Ala Gly
100 105 110
Leu Ile Gln Ile Asn Pro Lys Arg His Leu Leu Met Leu Asp Met His
115 120 125
His Ile Ile Ser Asp Gly Val Ser Met Asn Val Leu Phe Gln Asp Ile
130 135 140
Thr Gln Leu Tyr Gln Gly Ile Glu Leu Ser Pro Leu Lys Ile Gln Tyr
145 150 155 160
Lys Asp Phe Ala Val Trp Gln Gln Gly Met Ala Gln Val Val Arg Phe
165 170 175
Gln Glu Gln Glu Arg Tyr Trp Leu Asn Gln Phe Ser Gly Asp Leu Pro
180 185 190
Ile Leu Glu Met Val Thr Asp Tyr Pro Arg Pro Ala Ile Gln Gln Phe
195 200 205
Asp Gly Asp Ser Trp Ser Phe Glu Ile Asp Ala Lys Val Leu Asp Ser
210 215 220
Ile Lys Gln Leu Ser Ala Lys Gln Gly Thr Thr Leu Tyr Met Thr Leu
225 230 235 240
Leu Ala Ile Tyr Gln Ile Leu Leu Ala Lys Tyr Thr Arg Gln Asp Asp
245 250 255
Ile Ile Val Gly Thr Pro Ile Ala Gly Arg Pro His Ala Asp Thr Glu
260 265 270
Ser Ile Val Gly Met Phe Val Asn Thr Leu Ala Leu Arg Gly Gln
275 280 285
<210> 33
<211> 490
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 33
Leu Phe Glu Gln Gln Val Gln Arg Phe Ser Asp Arg Pro Ala Leu Val
1 5 10 15
Phe Lys Asp Lys Gln Leu Thr Tyr Ser Glu Phe His Ala Lys Val Asn
20 25 30
Gln Leu Ala Arg Val Leu Arg Lys Lys Gly Val Gln Pro Asp Gln Ala
35 40 45
Val Gly Leu Ile Thr Asp Arg Ser Ile Glu Met Met Ile Gly Ile Phe
50 55 60
Ala Ile Leu Lys Ala Gly Gly Ala Tyr Met Pro Ile Asp Pro Ser Tyr
65 70 75 80
Pro Ile Asp Arg Ile Glu His Met Leu Glu Asp Ser Arg Thr Lys Leu
85 90 95
Leu Phe Val Gln Lys Thr Glu Met Ile Pro Ala Ser Tyr Gln Gly Glu
100 105 110
Val Leu Leu Leu Ala Glu Glu Cys Trp Met His Glu Asp Ser Ser Asn
115 120 125
Leu Glu Leu Ile Asn Lys Thr Gln Asp Leu Ala Tyr Val Met Tyr Thr
130 135 140
Ser Gly Ser Thr Gly Lys Pro Lys Gly Asn Leu Thr Thr His Gln Asn
145 150 155 160
Ile Leu Thr Thr Ile Ile Asn Asn Gly Tyr Ile Glu Ile Ala Pro Thr
165 170 175
Asp Arg Leu Leu Gln Leu Ser Asn Tyr Ala Phe Asp Gly Ser Thr Phe
180 185 190
Asp Ile Tyr Ser Ala Leu Leu Asn Gly Ala Thr Leu Val Leu Val Pro
195 200 205
Lys Glu Val Met Leu Asn Pro Met Glu Leu Ala Lys Ile Val Arg Glu
210 215 220
Gln Asp Ile Thr Val Ser Phe Met Thr Thr Ser Leu Phe His Thr Leu
225 230 235 240
Val Glu Leu Asp Val Thr Ser Met Lys Ser Met Arg Lys Val Val Phe
245 250 255
Gly Gly Glu Lys Ala Ser Tyr Lys His Val Glu Lys Ala Leu Asp Tyr
260 265 270
Leu Gly Glu Gly Arg Leu Val Asn Gly Tyr Gly Pro Thr Glu Thr Thr
275 280 285
Val Phe Ala Thr Thr Tyr Thr Val Asp Ser Ser Ile Lys Glu Met Gly
290 295 300
Ile Val Pro Ile Gly Arg Pro Leu Asn Asn Thr Ser Val Tyr Val Leu
305 310 315 320
Asn Glu Asn Asn Gln Leu Gln Pro Ile Gly Val Pro Gly Glu Leu Cys
325 330 335
Val Gly Gly Ala Gly Ile Ala Arg Gly Tyr Leu Asn Arg Pro Glu Leu
340 345 350
Thr Ala Glu Arg Phe Val Glu Asn Pro Phe Val Ser Gly Asp Arg Met
355 360 365
Tyr Arg Thr Gly Asp Leu Ala Arg Trp Leu Pro Asp Gly Ser Met Glu
370 375 380
Tyr Leu Gly Arg Met Asp Glu Gln Val Lys Val Arg Gly Tyr Arg Ile
385 390 395 400
Glu Leu Gly Glu Ile Glu Thr Arg Leu Leu Glu His Pro Ala Ile Ser
405 410 415
Ala Val Val Leu Leu Ala Lys Gln Asp Glu Gln Gly His Ser Tyr Leu
420 425 430
Cys Ala Tyr Ile Val Ala Asn Gly Val Trp Thr Val Ala Glu Leu Arg
435 440 445
Lys His Leu Ser Glu Ala Leu Pro Glu Tyr Met Val Pro Thr Tyr Phe
450 455 460
Val Glu Leu Glu Gln Ile Pro Phe Thr Ser Asn Gly Lys Val Asn Lys
465 470 475 480
Arg Ala Leu Pro Glu Pro Glu Gly Gln Ile
485 490
<210> 34
<211> 67
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 34
Glu Ala Lys Val Ala Ala Leu Phe Gln Glu Ile Leu Gly Val Glu Arg
1 5 10 15
Val Gly Thr Gln Asp Met Phe Phe Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Met Leu Val Leu Arg Met Asn Lys Glu Leu Gly Ile Glu Val
35 40 45
Pro Leu Lys Glu Val Phe Ala His Pro Thr Val Lys Glu Leu Ala Ala
50 55 60
Thr Ile Asp
65
<210> 35
<211> 286
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 35
Phe Tyr Pro Val Ser Ser Ala Gln Arg Arg Met Tyr Val Val Gln His
1 5 10 15
Leu Gly Asn Val Gln Thr Thr Ser Tyr Asn Met Pro Leu Phe Leu Glu
20 25 30
Val Glu Gly Ala Leu Glu Ile Asp Lys Leu His Leu Ala Leu Glu Lys
35 40 45
Leu Val Glu Arg His Glu Ser Leu Arg Thr Ser Phe His Met Val Asp
50 55 60
Glu Glu Leu Met Gln Gln Val His Glu Glu Val Ala Trp Asp Leu Glu
65 70 75 80
Ile Met Asp Gly Thr Glu Gly Asp Leu Ala Gly Ile Thr Ala Gly Phe
85 90 95
Ile Arg Pro Phe Asp Leu Ser Gln Ala Pro Leu Phe Arg Ala Gly Ile
100 105 110
Val Arg Ile Ser Pro Glu Arg Phe Leu Phe Met Leu Asp Met His His
115 120 125
Ile Ile Ser Asp Gly Val Ser Thr Asn Val Leu Phe Gln Asp Ile Thr
130 135 140
Gln Leu Tyr Gln Gly Lys Asp Leu Pro Pro Leu Pro Ile Gln Tyr Lys
145 150 155 160
Asp Tyr Ala Val Trp Gln Gln Ala Asp Ala Gln Val Thr Arg Leu Gln
165 170 175
Asp Gln Glu Ser Tyr Trp Leu His Gln Phe Ala Gly Glu Ala Ser Val
180 185 190
Leu Glu Met Pro Thr Asp Phe Pro Arg Pro Ala Val Gln Gln Phe Glu
195 200 205
Gly Asp Val Trp Thr Phe Glu Val Asp Ala Asp Ile Leu Ser Gln Leu
210 215 220
Lys Lys Leu Ser Val Ser Gln Gly Ser Thr Leu Tyr Met Thr Leu Leu
225 230 235 240
Ala Ala Tyr Gln Val Leu Leu Ala Lys Tyr Thr Gly Gln Asp Asp Ile
245 250 255
Ile Val Gly Ser Pro Ile Ala Gly Arg Pro His Ala Asp Val Glu Ser
260 265 270
Ile Val Gly Met Phe Val Asn Thr Leu Ala Leu Arg Gly Gln
275 280 285
<210> 36
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 36
Gln Thr Ile His Ser Leu Phe Glu Gln Gln Val Glu Arg Thr Pro Glu
1 5 10 15
Gln Ile Ala Val Val Tyr Gln Asp Gln Ser Ile Thr Tyr Arg Glu Leu
20 25 30
Asn Glu Arg Ala Asn Arg Leu Ala Arg Cys Leu Ile Asp Lys Gly Ile
35 40 45
Gln Arg Asn Gln Phe Val Ala Ile Met Ala Asp Arg Ser Ile Glu Thr
50 55 60
Val Ile Gly Met Met Gly Ile Leu Lys Ala Gly Gly Ala Tyr Val Pro
65 70 75 80
Ile Asp Pro Asp Tyr Pro Leu Asp Arg Lys Leu Tyr Ile Leu Glu Asp
85 90 95
Ser His Ala Ser Leu Leu Leu Phe Gln Gln Lys His Glu Val Pro Ser
100 105 110
Glu Phe Thr Gly Asp Arg Ile Leu Ile Glu Gln Met Gln Trp Tyr Gln
115 120 125
Ala Ala Asp Thr Asn Val Gly Ile Val Asn Thr Ala Gln Asp Leu Ala
130 135 140
Tyr Met Ile Tyr Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Met
145 150 155 160
Ile Asp His Gln Ala Val Cys Asn Leu Cys Leu Met Ala Gln Thr Tyr
165 170 175
Gly Ile Phe Ala Asn Ser Arg Val Leu Gln Phe Ala Ser Phe Ser Phe
180 185 190
Asp Ala Ser Val Gly Glu Val Phe His Thr Leu Thr Asn Gly Ala Thr
195 200 205
Leu Tyr Leu Met Asp Arg Asn Leu Val Met Ala Gly Val Glu Phe Val
210 215 220
Glu Trp Leu Arg Val Asn Glu Ile Thr Ser Ile Pro Phe Ile Ser Pro
225 230 235 240
Ser Ala Leu Arg Ala Leu Pro Tyr Glu Asp Leu Pro Ala Leu Lys Tyr
245 250 255
Ile Ser Thr Gly Gly Glu Ala Leu Pro Val Asp Leu Val Arg Leu Trp
260 265 270
Gly Thr Glu Arg Ile Phe Leu Asn Ala Tyr Gly Pro Thr Glu Thr Thr
275 280 285
Val Asp Ala Thr Ile Gly Leu Cys Thr Pro Glu Asp Lys Pro His Ile
290 295 300
Gly Lys Pro Val Leu Asn Lys Lys Ala Tyr Ile Ile Asn Pro Asn Tyr
305 310 315 320
Gln Leu Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Ile Gly Gly Val
325 330 335
Gly Ile Ala Pro Gly Tyr Trp Asn Arg Pro Glu Leu Thr Arg Glu Lys
340 345 350
Phe Val Asp Asn Pro Phe Ala Gln Gly Glu Arg Met Tyr Lys Thr Gly
355 360 365
Asp Leu Val Arg Trp Leu Pro Asp Gly Asn Ile Glu Phe Leu Gly Arg
370 375 380
Ile Asp Asp Gln Val Lys Ile Arg Gly His Arg Ile Glu Leu Gly Glu
385 390 395 400
Ile Glu Thr Arg Leu Leu Glu His Glu Gln Val Ile Glu Ala Val Val
405 410 415
Leu Ala Arg Glu Asp Glu Gln Gly Gln Ala Tyr Leu Cys Ala Tyr Leu
420 425 430
Val Ala Ala Asp Glu Trp Thr Val Ala Glu Leu Arg Lys His Leu Gly
435 440 445
Lys Thr Leu Pro Asp Tyr Met Ile Pro Ala Tyr Phe Ile Glu Leu Glu
450 455 460
Glu Phe Pro Leu Thr Pro Ser Gly Lys Val Asn Lys Lys Ala Leu Pro
465 470 475 480
Glu Pro Asp Gly Gln Ile Gln Thr Gly Val Glu Tyr Val Glu Ala Thr
485 490 495
Thr Glu Ser Gln
500
<210> 37
<211> 66
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 37
Lys Ile Leu Val Glu Leu Trp Gln Glu Val Leu Arg Val Glu Arg Ile
1 5 10 15
Gly Ile Tyr Asp Asn Phe Phe Glu Leu Gly Gly Asp Ser Ile Lys Ala
20 25 30
Ile Gln Ile Thr Ala Arg Leu Arg Arg Tyr His Arg Lys Leu Glu Ile
35 40 45
Ser His Leu Phe Lys His Pro Thr Ile Ala Glu Leu Ala Pro Trp Met
50 55 60
Gln Thr
65
<210> 38
<211> 305
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 38
Glu Gly Glu Val Met Leu Thr Pro Ile Gln Lys Ala Phe Phe Glu Glu
1 5 10 15
Asn Gln Glu Gln Pro Gln His Phe Asn Gln Asp Ser Leu Leu Tyr Ser
20 25 30
Ser Asn Gly Trp Asn Gln Asp Ala Ile Glu Gln Val Phe Glu Lys Ile
35 40 45
Thr Glu His His Asp Ala Leu Arg Met Val Tyr Pro His Thr Glu Gly
50 55 60
Lys Val Thr Gln Ile Asn Arg Gly Leu Glu Asp Lys Ala Phe Thr Leu
65 70 75 80
Gln Val Phe Asp Phe Thr Gln Glu Pro Thr Asp Thr Gln Ala Thr Lys
85 90 95
Ile Glu Gln Ile Ala Thr Gln Leu Gln Ala Ser Phe Asp Leu Glu Lys
100 105 110
Gly Pro Leu Val Arg Leu Gly Leu Phe Thr Thr Lys Ala Gly Asp Tyr
115 120 125
Leu Leu Ile Val Ile His His Leu Val Ile Asp Gly Val Ser Trp Arg
130 135 140
Ile Leu Leu Glu Asp Phe His Asn Ala Tyr Gln Gln Val Ile Gln Gly
145 150 155 160
Gln Ala Ile Val Leu Pro Glu Lys Thr Thr Ser Phe Lys Thr Trp Ser
165 170 175
Glu Arg Leu Asn Glu Tyr Ala Asn Ser His Ala Leu Leu His Glu Ile
180 185 190
Pro Tyr Trp Lys Gln Met Glu Glu Ile Ser Ile Ala Pro Leu Pro Lys
195 200 205
Lys Gly Asn Asn Asp Gly Arg Tyr Tyr Val Lys Asp Ser Glu Tyr Ala
210 215 220
Thr Met Ser Leu Thr Glu Glu Glu Thr Gln Asn Leu Leu Thr Arg Val
225 230 235 240
His Arg Ala Tyr Arg Thr Glu Ile Asn Asp Leu Leu Leu Ala Ala Leu
245 250 255
Gly Leu Ala Ser Lys Glu Trp Thr Lys Glu Asn Arg Val Ala Ile His
260 265 270
Leu Glu Gly His Gly Arg Glu Glu Ile Gly Glu Gly Val Asp Val Asn
275 280 285
Arg Thr Val Gly Trp Phe Thr Ser Leu Phe Pro Val Val Ile Asp Leu
290 295 300
Glu
305
<210> 39
<211> 294
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 39
Lys Asp Leu Phe Pro Met Ser Asp Ile Gln Leu Gly Met Val Tyr His
1 5 10 15
Ser Leu Lys His Val His Glu Ala Val Tyr His Asp Gln Phe Val Tyr
20 25 30
Gln Val Asp Asp Asp Ser Phe Asp Val His Val Leu Glu Gln Ala Met
35 40 45
Arg Met Met Val Asp Lys His Asp Ile Leu Lys Thr Ser Phe His Ile
50 55 60
Glu Glu Phe Ser Thr Pro Val Gln Val Val His Gln Glu Val Ser Val
65 70 75 80
Arg Ile Asp Glu Thr Asp Ile Thr His Leu Gly Glu Lys Gln Lys Glu
85 90 95
Tyr Ile His Gln Tyr Leu Ala Gln Asp Arg Gln Ser Pro Phe Asp Val
100 105 110
Thr Thr Ala Pro Leu Trp Arg Met Ser Val Phe Lys Leu Asn Ala Ser
115 120 125
Gln Val Ala Leu Val Trp Ile Phe His His Ala Ile Leu Asp Gly Trp
130 135 140
Ser Val Ala Ser Phe Ile Thr Glu Leu Ile Asp Val Tyr Phe Lys Leu
145 150 155 160
Lys His Lys Thr Cys Thr Leu Glu His Leu Asn Thr Thr Tyr Lys Asp
165 170 175
Tyr Val Ile Asp Gln Met Leu Leu Ser Glu Gln Asn Glu Leu Arg Glu
180 185 190
Tyr Trp Lys Glu Glu Leu Lys Asp Tyr Lys Arg Leu Gln Leu Pro Val
195 200 205
Lys Val Asp Glu Asn Gly Gly Val His Val Thr Val Val Glu Lys Leu
210 215 220
Asp Pro Asp Ile Ile Asn Lys Cys Arg Glu Ile Ala Gln Ala His His
225 230 235 240
Ile Pro Leu Lys Thr Val Cys Leu Thr Ala Phe Leu Ser Met Met His
245 250 255
Met Ile Ser Tyr Glu Arg Asp Leu Thr Val Gly Leu Ile Glu Asn Asn
260 265 270
Arg Pro Ile Ile Glu Asp Ala Glu Lys Val Leu Gly Cys Phe Leu Asn
275 280 285
Ser Val Pro Phe Arg Ala
290
<210> 40
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 40
Leu His Lys Arg Phe Glu Glu Gln Ala Ala Lys Ser Glu Asp Gln Ile
1 5 10 15
Ala Leu Glu Tyr Glu Asp Lys Gln Leu Thr Tyr Arg Glu Leu Asn Ala
20 25 30
Lys Ala Asn Gln Leu Ala Arg Val Leu Gln Lys His Asn Thr Leu Pro
35 40 45
Thr Gln Val Val Gly Leu Met Ala Glu Arg Ser Leu Glu Met Ile Ile
50 55 60
Gly Ile Leu Gly Ile Leu Lys Ala Gly Gly Ala Tyr Met Pro Ile Asp
65 70 75 80
Pro Thr Tyr Pro Ala Glu Arg Ile Gln Tyr Met Leu Glu Asp Ser Arg
85 90 95
Ser Tyr Leu Leu Leu Val Gln Lys Ala Glu Met Ile Pro Ala Asn Tyr
100 105 110
Gln Gly Glu Val Leu Ile Leu Thr Glu Glu Leu Trp Ala Asp Glu Asn
115 120 125
Thr Glu Asn Leu Asp Leu Val Asn Gln Pro Gln Asp Val Ala Asn Ile
130 135 140
Met Tyr Thr Ser Gly Thr Thr Gly Lys Pro Lys Gly Ile Leu Ile Thr
145 150 155 160
His Arg Asn Ile Met Thr Thr Ile Ile Asn Asn Gly Tyr Leu Asp Ile
165 170 175
Phe Ser Thr Asp Arg Ile Leu Gln Met Ser Asn Tyr Ala Phe Asp Gly
180 185 190
Ser Thr Phe Asp Ile Tyr Ser Ala Leu Leu Asn Gly Ala Thr Leu Val
195 200 205
Leu Val Pro Lys Gln Thr Leu Met Asn Thr Thr Asp Leu Leu Ala Val
210 215 220
Ile Lys Asp Ser Asn Ile Thr Val Ala Leu Leu Thr Thr Ser Leu Phe
225 230 235 240
Asn Thr Leu Val Asp Leu Asp Val Thr Ser Phe Gln His Thr Arg Lys
245 250 255
Val Leu Phe Gly Gly Glu Lys Ala Ser Cys Lys His Val Glu Lys Ala
260 265 270
Leu Asp Tyr Leu Gly Glu Gly Arg Leu Val Asn Gly Tyr Gly Pro Thr
275 280 285
Glu Thr Thr Val Phe Ala Thr Thr Tyr Thr Val Asp Asn Thr Ile Lys
290 295 300
Lys Leu Gly Ser Val Pro Ile Gly Arg Pro Leu Ser Asn Thr Ser Val
305 310 315 320
Tyr Ile Phe Gly Leu Asp Asp Gln Leu Gln Pro Leu Gly Val Pro Gly
325 330 335
Glu Leu Cys Val Ala Gly Glu Cys Ile Ser Pro Gly Tyr Leu Asn Arg
340 345 350
Pro Asp Leu Thr Ala Asp Lys Phe Ile Asp Asn Pro Leu Lys Pro Gly
355 360 365
Glu Arg Met Tyr Arg Thr Gly Asp Leu Val Arg Trp Leu Pro Glu Gly
370 375 380
Val Met Glu Tyr Met Gly Arg Ile Asp Glu Gln Val Lys Ile Arg Gly
385 390 395 400
His Arg Ile Glu Leu Gly Glu Ile Glu Ala Lys Leu Leu Glu His Pro
405 410 415
Ser Ile Arg Glu Thr Val Leu Val Ala Lys Gln Asp Ala Asn Gly His
420 425 430
Ser Phe Leu Gly Ala Tyr Leu Val Thr Asp Asn Phe Cys Pro Val Thr
435 440 445
Glu Leu Arg Asn Tyr Leu Met Glu Thr Leu Pro Glu Tyr Met Val Pro
450 455 460
Ser Tyr Phe Ile Glu Leu Asp Ser Leu Pro Leu Thr Ser Asn Gly Lys
465 470 475 480
Val Asp Lys Arg Ala Leu Pro Glu Pro Glu Ser Gln Ala Leu His Ala
485 490 495
Tyr Thr Met Pro
500
<210> 41
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 41
Glu Glu Lys Leu Val Gln Leu Phe Gln Glu Val Met Asp Val Glu Arg
1 5 10 15
Val Gly Thr Gln Asp Ser Phe Tyr Glu Leu Gly Gly His Ser Leu Lys
20 25 30
Ala Met Leu Leu Val Ser Arg Ile His Lys Asp Phe Gly Ile Lys Ile
35 40 45
Pro Leu Lys Glu Val Phe Ser Arg Pro Thr Val Lys Glu Leu Ala Ala
50 55 60
Tyr Leu Thr Gly
65
<210> 42
<211> 290
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 42
Tyr Pro Val Thr Ala Ala Gln Lys Arg Met Tyr Ile Ala Gln Gln Trp
1 5 10 15
Glu Asp Gly Glu Ala Thr Ser Ser Tyr His Met Pro Phe Met Met Glu
20 25 30
Ile Thr Gly Pro Leu Gln Val Glu Lys Leu Gln Gln Thr Val Lys Ser
35 40 45
Leu Val Ala Arg His Glu Ser Leu Arg Thr Ser Phe His Met Ile Asn
50 55 60
Glu Val Leu Met Gln Lys Ile His Ala Asp Val Leu Trp Asp Leu Asp
65 70 75 80
Ile Asp Leu Glu Ser Val Val Ala Ser Glu Gln Glu Ile Asp Glu Lys
85 90 95
Met Phe Gln Phe Leu Arg Lys Phe Asp Leu Ser Gln Ala Pro Leu Phe
100 105 110
Arg Ala Lys Leu Ile Arg Val Asn Ala Ser Arg His Val Leu Leu Leu
115 120 125
Asp Met His His Ile Ile Ser Asp Gly Phe Ser Tyr Gln Ile Phe Phe
130 135 140
Asp Glu Leu Thr Lys Leu Tyr Gln Gly Glu Glu Leu Pro Ser Leu Lys
145 150 155 160
Ile Gln Tyr Lys Asp Tyr Ala Val Trp Gln His Ser Glu Glu Gln Gln
165 170 175
Lys Arg Leu Gln Gln Gln Glu Asp Tyr Trp Leu Gly Gln Phe Gln Gly
180 185 190
Glu Ile Pro Val Leu Glu Leu Pro Thr Asp Tyr Gln Arg Pro Val Asp
195 200 205
Lys Gln Phe Ala Gly Ala Leu Phe Thr His Gly Leu Ser Ala Gly Leu
210 215 220
Thr Glu Lys Leu Arg Lys Leu Ala Ile Lys Glu Lys Thr Thr Leu Tyr
225 230 235 240
Thr Val Leu Leu Thr Val Tyr Asn Ile Leu Leu Thr Lys Tyr Thr Ser
245 250 255
Gln Glu Asp Leu Ile Val Gly Thr Pro Ile Ala Gly Arg Pro His Ala
260 265 270
Asp Leu Asp Arg Val Phe Gly Met Phe Val Asn Thr Leu Ala Ile Arg
275 280 285
Thr Ala
290
<210> 43
<211> 500
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 43
Asp Lys Ser Phe Asp Ser Glu Lys Thr Ile Gln Glu Gln Phe Glu Glu
1 5 10 15
Trp Ala Glu Lys Ala Pro His Ser Ile Ala Leu Val Phe Lys Asp Lys
20 25 30
Gln Met Thr Tyr Gln Glu Leu Asn Gln Arg Ala Asn Gln Val Ala His
35 40 45
Leu Leu Arg Gly Asn Gly Ile Ser Ala Asn Asp Phe Ile Gly Leu Met
50 55 60
Val Asp Arg Ser Phe Glu Met Ile Ile Ser Met Leu Gly Ile Leu Lys
65 70 75 80
Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Asp Tyr Pro Glu Asp Arg
85 90 95
Ile Asp Tyr Met Leu Ser Asp Ser Lys Ala Lys Ile Leu Leu Lys Gln
100 105 110
Ser Asp Gln Thr Ala Pro Ala Ser Phe Glu Gly Lys Val Ile Ala Ile
115 120 125
Asp Thr Pro Glu Leu Leu Glu Met Asp Ile Glu Asn Ile Pro Lys Val
130 135 140
Asn Asn Ser Ser Asp Leu Ala Tyr Ile Ile Tyr Thr Ser Gly Ser Thr
145 150 155 160
Gly Lys Pro Lys Gly Val Leu Ile Asn His Arg Cys Val Ile Asn Met
165 170 175
Gln Leu Thr Ala Glu Thr Phe Gly Ile Tyr Pro Ser Ser Arg Ile Leu
180 185 190
Gln Phe Ala Ser Phe Ser Phe Asp Ser Ser Val Gly Glu Ile Phe Tyr
195 200 205
Thr Leu Leu Asn Gly Ala Cys Leu Tyr Leu Val Glu Lys Asp Leu Leu
210 215 220
Leu Ser Gly Asn Glu Phe Val Ala Trp Leu Lys Lys Asn Arg Ile Ser
225 230 235 240
Ser Ile Pro Phe Ile Ser Pro Ser Ala Leu Arg Met Leu Pro Tyr Glu
245 250 255
Asp Leu Pro Asp Leu Ala Tyr Ile Ser Thr Gly Gly Glu Thr Leu Pro
260 265 270
Ala Asp Leu Val Lys Ala Trp Gly Glu Asn Arg Val Phe Leu Asn Ala
275 280 285
Tyr Gly Pro Thr Glu Thr Thr Val Asp Ala Thr Val Gly Val Cys Thr
290 295 300
Pro Glu Gly Lys Pro His Ile Gly Arg Pro Val Thr Asn Lys Lys Val
305 310 315 320
Tyr Val Val Asn Ser Asn Asn Gln Leu Gln Pro Ile Gly Val Pro Gly
325 330 335
Glu Leu Cys Ile Gly Gly Glu Gly Val Ala Leu Gly Tyr Leu Asn Arg
340 345 350
Pro Asp Leu Thr Gln Glu Lys Phe Val Ser Asn Pro Phe Ala Pro Gly
355 360 365
Glu Arg Met Tyr Arg Ser Gly Asp Leu Val Arg Trp Leu Pro Asp Gly
370 375 380
Thr Ile Glu Tyr Phe Gly Arg Leu Asp Asp Gln Val Lys Ile Arg Gly
385 390 395 400
His Arg Ile Glu Leu Gly Glu Ile Glu Thr Arg Leu Leu Glu His Pro
405 410 415
Cys Ile Lys Glu Ala Ile Val Ile Pro Arg Ser Asp Glu Ser Glu Ala
420 425 430
Thr Tyr Leu Cys Ser Tyr Leu Ile Ala Glu Gly Ser Trp Asn Ala Ala
435 440 445
Asp Leu Arg Lys Tyr Leu Lys Ala Ser Leu Pro Glu Tyr Met Ile Pro
450 455 460
Ser Tyr Phe Val Glu Leu His Glu Leu Pro Leu Thr Pro Asn Gly Lys
465 470 475 480
Val Asn Lys Lys Ala Leu Pro Lys Pro Glu Lys Gln Met Gln Arg Gly
485 490 495
Gln Asp Tyr Val
500
<210> 44
<211> 65
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 44
Ser Ile Leu Ser His Ile Trp Thr Asp Val Leu Gly Val Glu Asn Ile
1 5 10 15
Gly Ile His Asp Asn Phe Phe Glu Leu Gly Gly Asp Ser Ile Lys Ala
20 25 30
Ile Gln Ile Ser Ala Arg Leu Asn Lys His Asn Leu Lys Val Lys Met
35 40 45
Arg Glu Leu Phe Lys Asn Pro Thr Ile Ala Glu Leu Ser Leu Leu Val
50 55 60
Gln
65
<210> 45
<211> 300
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 45
Glu Gly Asn Ile Pro Leu Thr Pro Ile Gln His Trp Phe Phe Thr Gln
1 5 10 15
Ser Phe Pro Gln Val Asn His Tyr Asn Gln Ser Val Leu Leu Phe Asn
20 25 30
Ala Glu Gly Trp Asp Glu Gln Lys Val Asp Lys Ala Phe Glu Met Leu
35 40 45
Thr Gln His His Asp Ala Leu Arg Ile Val Tyr Ser Leu Asp Glu Gln
50 55 60
Gly Val Val Gln His Asn Arg Gly Leu Glu Gly Ser Asn Tyr His Phe
65 70 75 80
Glu Ile Ile Asp Val Arg Gln Asp Gly Glu Asp Gln Ser Asn Trp Lys
85 90 95
Ala Ala Ala Asn Arg Met Gln Ala Ser Met Asp Ile Val Glu Gly Pro
100 105 110
Leu Val Gln Ile Gly Leu Phe Arg Ala Asn Glu Gly Ala Tyr Leu Leu
115 120 125
Ile Ala Ile His His Leu Val Val Asp Gly Val Ser Trp Arg Ile Leu
130 135 140
Leu Glu Asp Phe Tyr His Leu Tyr Asn Gly Asn Asp Ser Leu Pro Leu
145 150 155 160
Lys Thr Thr Ser Phe Gln Ala Trp Ser Gln Lys Leu Gln Glu Tyr Ala
165 170 175
Gln Ser Lys Glu Leu Glu His Glu Leu Ser Tyr Trp Arg His Leu Asp
180 185 190
Glu Ala Ile Thr Asp Tyr Thr Leu His Lys Asp Ile Glu Ala Ala Thr
195 200 205
Ser Asn Lys Thr Thr Tyr Glu Glu Phe Leu Thr Val Ser Met Ser Leu
210 215 220
Ser Thr Glu Glu Thr Gln Gln Leu Val Thr Glu Ala His Lys Ala Tyr
225 230 235 240
Gln Thr Glu Ile Asn Asp Leu Leu Leu Thr Ala Leu Ala Leu Ala Leu
245 250 255
Lys Glu Trp Thr Asn Lys Glu Gln Leu Leu Val Ser Met Glu Gly His
260 265 270
Gly Arg Glu Glu Ile Leu Asp Asn Val Asp Ile Ser Arg Thr Val Gly
275 280 285
Trp Phe Thr Ser Glu Tyr Pro Val Ala Ile His Leu
290 295 300
<210> 46
<211> 511
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 46
Gln Glu Met Thr Ile His Gly Leu Phe Glu Lys Gln Val Gln Glu Arg
1 5 10 15
Pro Asn Gln Thr Ala Val Ile Phe Asn Glu Gln Ser Met Thr Tyr Lys
20 25 30
Glu Met Asn Glu Arg Ala Asn Gln Val Ala His Ser Leu Arg Lys His
35 40 45
Gly Val Ala Pro Asp Glu Ile Val Gly Ile Leu Ala Asp Arg Asn Met
50 55 60
Asp Met Leu Ile Ser Ile Leu Gly Val Leu Lys Ala Gly Ala Ala Tyr
65 70 75 80
Met Pro Ile Asp Pro Thr Tyr Pro Thr Glu Arg Ile Leu Tyr Met Ile
85 90 95
Asn Asp Ser Gln Thr Lys Ile Val Leu Ala Glu His Arg Glu Met Val
100 105 110
Pro Glu Gly Cys Asn Ala Glu Leu Ile Leu Leu Gln Asp Ser Ser Leu
115 120 125
Leu Asn Glu Glu Thr Ser Asp Leu Glu His Val Asn Lys Pro Glu Asp
130 135 140
Leu Ala Tyr Ile Ile Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly
145 150 155 160
Val Met Ile Glu His Arg Asn Val Ile Arg Leu Leu Phe Asn Asp Arg
165 170 175
Asn Leu Phe Asp Phe Thr Ser Asp Asp Val Trp Thr Val Phe His Ser
180 185 190
Phe Cys Phe Asp Phe Ser Val Trp Glu Met Tyr Gly Ala Leu Leu Tyr
195 200 205
Gly Gly Lys Ile Val Leu Val Ser Phe Glu Ile Ala Arg Asp Pro Gln
210 215 220
Ala Phe Arg Asn Leu Leu Gln Glu Gln Lys Val Ser Ile Leu Asn Gln
225 230 235 240
Thr Pro Thr Ala Phe Tyr Gln Leu Ser Ser Glu Glu Met Gln His Ser
245 250 255
Asp Ser Asn Leu Ser Ile Arg Lys Ile Ile Phe Gly Gly Glu Ala Leu
260 265 270
Thr Pro Ser Gln Leu Lys Ala Trp Lys Gln Lys Tyr Pro Asn Thr Ala
275 280 285
Leu Ile Asn Met Tyr Gly Ile Thr Glu Thr Thr Val His Val Thr Tyr
290 295 300
Lys Glu Phe Gln Leu His Asp Met Asp Ser Thr Val Ser Asn Ile Gly
305 310 315 320
Lys Pro Ile Pro Thr Leu Arg Thr Tyr Val Leu Asp Ser Lys Arg Asn
325 330 335
Leu Ala Pro Ile Gly Val Lys Gly Glu Leu Tyr Val Ser Gly Lys Gly
340 345 350
Val Ala Arg Gly Tyr Leu Asn Lys Pro Glu Leu Thr Glu Glu Arg Phe
355 360 365
Met Asp Asn Pro Phe Val Ser Glu Glu Arg Met Tyr Arg Thr Gly Asp
370 375 380
Leu Ala Arg Trp Leu Pro Glu Gly Glu Leu Glu Tyr Leu Gly Arg Ile
385 390 395 400
Asp His Gln Val Lys Ile Arg Gly Tyr Arg Ile Glu Leu Gly Glu Ile
405 410 415
Glu Ala Glu Leu Leu Lys Gln Lys Gly Ile Lys Glu Ala Val Val Leu
420 425 430
Val Thr Asn Asp Lys Asp Ala Gln Pro Gln Leu His Ala Tyr Leu Thr
435 440 445
Ser Thr Glu Asp Leu Ala Ala Ala Asp Leu Arg Asn Gln Leu Thr Thr
450 455 460
Thr Leu Pro Ser Tyr Met Ile Pro Ala His Phe Ile Phe Val Ser Gln
465 470 475 480
Met Pro Val Thr Pro Asn Gly Lys Ile Asp Lys Glu Ser Leu Arg Lys
485 490 495
Ile Glu Pro Ser Leu Gln Glu Ser Pro Thr Glu Ala Tyr Val Ala
500 505 510
<210> 47
<211> 68
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 47
Glu Lys Gln Leu Val His Ile Trp Glu Glu Asn Ile Gly Met Gln Pro
1 5 10 15
Ile Ser Ile Asp Asp Asn Tyr Phe Ala Leu Gly Gly Asp Ser Ile Lys
20 25 30
Ala Ile Lys Leu Leu His Ala Ile Asn Lys Glu Phe Gln Ile Ser Phe
35 40 45
Gln Ile Gly Asp Leu Tyr Lys His Gly Thr Ile Arg Glu Met Gly Gln
50 55 60
Gln Ile Gly Glu
65
<210> 48
<211> 337
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 48
Met Asp Tyr Val Gln Leu Gly Ser Ser Glu Leu Lys Cys Ser Arg Ile
1 5 10 15
Gly Leu Gly Thr Trp Ala Ile Gly Gly Trp Ala Trp Gly Gly Thr Asn
20 25 30
Glu Lys Asp Ala Ile Gln Ala Ile His Arg Ala Phe Asp Ile Gly Ile
35 40 45
Asn Thr Leu Asp Thr Ala Pro Ile Tyr Gly Phe Gly Leu Ser Glu Glu
50 55 60
Ile Cys Gly Lys Ala Ile Lys Gln Tyr Gly Gln Arg Asp Lys Ile His
65 70 75 80
Ile Ala Thr Lys Ala Gly Val Glu Trp Val Gly Glu Glu Gly Val Tyr
85 90 95
Trp Cys Asn Ser Ser Lys Glu Arg Ile Leu Gln Glu Phe Glu Asp Ser
100 105 110
Leu Arg Arg Leu Gln Val Asp Tyr Ile Asp Leu Tyr Gln Leu His Trp
115 120 125
Pro Asp Pro Gly Leu Pro Leu Glu Glu Thr Ala Glu Ile Phe Ala Gln
130 135 140
Leu His Lys Glu Gly Lys Ile Arg Ala Ile Gly Val Ser Asn Leu Ser
145 150 155 160
Ile Glu Gln Ile Glu Gln Trp Gln Lys Val Ala Pro Leu His Ser Ser
165 170 175
Gln Asn Arg Leu Asn Leu Leu Gln Thr Glu His Lys Asp Thr Phe Leu
180 185 190
Tyr Cys Asp Lys Asn His Ile Asn Thr Ile Thr Trp Gly Thr Leu Ala
195 200 205
Gln Gly Leu Leu Thr Gly Lys Phe Thr Lys Asp Ser Thr Phe Ala Glu
210 215 220
Asp Asp Leu Arg His Gly Tyr Pro Leu Phe Ala Pro Glu Tyr Phe Asp
225 230 235 240
Gln Tyr Leu Gln Ala Val Asp Arg Leu Lys Glu Ile Ala Gln Glu Lys
245 250 255
Gly Lys Thr Met Ala Gln Leu Ala Val Arg Trp Thr Leu Asp His Pro
260 265 270
Gly Val Ser Ile Ser Leu Trp Gly Ala Arg Lys Pro Ser Gln Leu Asp
275 280 285
Glu Val Ser Gly Val Met Gly Trp Ser Leu Ser Ala Gln Asp Val Glu
290 295 300
Gln Ile Asp Gln Ile Ile Lys Glu Ser Leu His Asp Pro Phe His Asp
305 310 315 320
Pro Thr Leu Gly Pro Pro Thr Lys Glu Glu Trp Gln Glu Met Gly Val
325 330 335
Leu
<210> 49
<211> 317
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 49
Met Lys Arg Ile Met Val Thr Gly Ala Leu Gly Gln Ile Gly Ser Glu
1 5 10 15
Leu Val Thr Lys Leu Arg Ser Val Tyr Gly Thr Glu Ser Val Ile Ala
20 25 30
Thr Asp Ile Arg Gln Gly Glu Gly Gln Glu Gly Pro Phe Glu Ile Leu
35 40 45
Asp Val Thr Asp Ala Lys Ala Met His Asp Ile Ala Ala Lys Tyr Lys
50 55 60
Val Asp Thr Ile Met His Leu Ala Ala Leu Leu Ser Ala Thr Ala Glu
65 70 75 80
Ala Lys Pro Leu Leu Ala Trp Asn Leu Asn Met Gly Gly Leu Val Asn
85 90 95
Ala Leu Glu Val Ala Arg Glu Leu Asn Cys Gln Phe Phe Thr Pro Ser
100 105 110
Ser Ile Gly Ala Phe Gly Pro Thr Thr Pro Lys Asp Asn Thr Pro Gln
115 120 125
Asp Thr Ile Gln Arg Pro Val Thr Met Tyr Gly Val Asn Lys Val Ser
130 135 140
Gly Glu Leu Leu Cys Asp Tyr Tyr Tyr Gln Lys Phe Gly Val Asp Thr
145 150 155 160
Arg Gly Val Arg Phe Pro Gly Leu Ile Ser Tyr Val Thr Pro Pro Gly
165 170 175
Gly Gly Thr Thr Asp Tyr Ala Val Glu Ile Tyr Tyr Glu Ala Ile Lys
180 185 190
Asn Gly Arg Tyr Thr Ser Tyr Ile Gln Lys Gly Thr Tyr Met Asp Met
195 200 205
Met Tyr Met Pro Asp Ala Leu Asn Ala Ile Ile Asn Leu Met Glu Ala
210 215 220
Asp Ala Ser Lys Leu Gln His Arg Asn Ala Phe Asn Val Thr Ala Met
225 230 235 240
Ser Phe Glu Pro Glu Gln Ile Ala Arg Glu Ile Lys Lys His Ile Pro
245 250 255
Thr Phe Glu Met Asp Tyr Gln Val Asp Pro Ile Arg Gln Gly Ile Ala
260 265 270
Asp Ser Trp Pro Asn Ser Ile Asp Ala Thr Ala Ala Lys Ala Glu Trp
275 280 285
Gly Phe Gln Ala Glu Tyr Asp Leu Glu Lys Met Thr Thr Asp Met Leu
290 295 300
Ala Lys Leu Arg Asp Lys Leu Gln Val Gly Ala Gly Ile
305 310 315
<210> 50
<211> 239
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 50
Val Ser Thr Lys Met Lys Leu Phe Cys Ile Pro Tyr Ala Gly Gly Ser
1 5 10 15
Ala Ser Val Tyr Thr Ser Trp Lys Lys Ala Phe Leu Pro Asn Ile Glu
20 25 30
Leu Ile Pro Ile Glu Leu Ala Gly Lys Gly Ser Arg Phe Asn Gln Pro
35 40 45
Leu Tyr Glu Asn Ile Glu Gln Ala Val Ser Asp Ile Leu Ser Phe Leu
50 55 60
Gln Ser Lys Leu Asp Ser Ser Pro Tyr Ala Leu Phe Gly His Ser Met
65 70 75 80
Gly Ala Leu Leu Val Phe Glu Val Leu His Ala Leu Lys Glu Gln Asn
85 90 95
Ala Pro Met Pro Ile Ala Thr Phe Phe Ser Gly Lys Asn Pro Pro His
100 105 110
Ile Ala Pro Thr Thr Asn Arg His Lys Leu Glu Gly Lys Ala Phe Trp
115 120 125
Glu Glu Val Arg Ser Met Gly Gly Thr Pro Ala Glu Leu Met Thr Ser
130 135 140
Thr Glu Leu Met Asp Ile Phe Thr Pro Ile Leu Lys Arg Asp Phe Lys
145 150 155 160
Leu Val Glu Thr Tyr Ala Pro Asn Leu Asn Arg Glu Leu Ile Thr Thr
165 170 175
Pro Val Ile Val Leu Tyr Gly Thr Lys Asp Asp Thr Val Ser Val Asp
180 185 190
Lys Met Thr Glu Trp Ser Arg His Thr Ser Glu Asp Ile Ser Phe Ala
195 200 205
Thr Phe Asp Gly Gly His Phe Phe Ile Gln Glu His Glu Gln Ala Val
210 215 220
Ile Ser Leu Val Asn Gln Thr Leu Leu Ser Thr Met Leu Lys Val
225 230 235
<210> 51
<211> 1747
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 51
Met Ser Ile Gln Lys Gln Ser Gln Ile Val Leu Asp Met Glu Lys Ser
1 5 10 15
Leu Lys Gly Leu Glu Tyr Ile Asp Glu Val Arg Val Val Gln Glu Glu
20 25 30
Lys Asn Ile Gly Asn Arg Ile Leu His Ala Lys Asp Leu Ile Pro Thr
35 40 45
Met Met Lys Thr Asp Arg Val Lys Val Asp Thr Pro Lys Ser Glu Leu
50 55 60
Leu Arg Gln Ser Glu Arg Ile Glu Ser Ser Leu Pro Leu Ala Ile Ser
65 70 75 80
His Gly Asp Glu Leu Lys Arg Thr Ala Ser Ser Pro Thr Thr Leu Ile
85 90 95
Asp Val Leu Leu Asn Ala Ala Lys Leu Glu Asp Lys Gly Ile Ile Tyr
100 105 110
Val Leu Asp Asn Gly Asp Glu Ile Glu Thr Ser Tyr Arg Gln Leu Phe
115 120 125
Glu Gln Ala Lys Arg Leu Leu Lys Gly Leu Arg Asn Met Gly Ile Lys
130 135 140
Pro Gln Asp Arg Val Ile Phe Gln Phe Gln Lys Asn Gln Asn Tyr Val
145 150 155 160
Pro Met Phe Trp Ala Cys Val Leu Gly Gly Phe Ile Thr Val Pro Ile
165 170 175
Ala Thr Ser Thr Ser Tyr Ala Glu Lys Asn Ala Asp Thr Met Lys Leu
180 185 190
Tyr Asn Ile Trp Asn Met Leu Asp Lys Pro Val Ile Ile Thr Asp Glu
195 200 205
Asp Leu Leu Glu Glu Val Asn Lys Leu Ser Ala Val Trp Glu Thr Glu
210 215 220
Glu Phe Val Ile Ser Thr Ala Glu Gln Leu Met Asn Asn Asp Glu Glu
225 230 235 240
Gln Glu Phe Tyr Ala Tyr Gln Glu Glu Asp Phe Val Leu Tyr Leu Leu
245 250 255
Thr Ser Gly Ser Thr Gly Leu Pro Lys Cys Val Gln His Arg Asn Ala
260 265 270
Ser Leu Val Ala Arg Ser Met Ala Thr Ala Gln Phe Asn Gln Phe Asp
275 280 285
Ser Glu Glu Val Leu Leu Cys Trp Met Pro Leu Glu His Val Gly Gly
290 295 300
Leu Val Met Tyr His Ile Leu Gly Val Tyr Leu Ala Cys Gln Gln Ile
305 310 315 320
Leu Pro Arg Val Asp Ser Phe Ile Ser Gln Pro Leu Asn Trp Leu His
325 330 335
Trp Met Asp Lys Tyr Lys Ala Thr Leu Thr Trp Ala Pro Asn Phe Ala
340 345 350
Phe Ser Leu Val Asn Asp Gln Glu Lys Ala Ile Glu Glu Gly Ser Trp
355 360 365
Asp Leu Ser Ser Val Lys His Ile Leu Asn Gly Gly Glu Ala Ile Val
370 375 380
Ala Lys Thr Ala Lys Arg Phe Leu Ser Ile Leu Arg Lys His Gln Leu
385 390 395 400
Arg Asp Asp Cys Met Tyr Pro Ser Phe Gly Met Ser Glu Thr Ser Ser
405 410 415
Gly Ile Val Phe Ser Lys Thr Phe Arg Ser Lys Pro Gln Ser Gly Val
420 425 430
His Tyr Val Asp Lys Ile Ser Phe Glu Thr Leu Leu Arg Phe Val Asp
435 440 445
Asp Thr Glu Lys Glu His Ile Cys Phe Thr Glu Val Gly Gly Pro Ile
450 455 460
Pro Gly Val Ser Val Arg Ile Val Asp Asp Asn Asn Gln Leu Leu His
465 470 475 480
Glu Asn His Ile Gly Arg Val Gln Val Thr Gly Pro Thr Ile Met Ala
485 490 495
Gly Tyr Tyr Gln Asn Gln Glu Ala Asn Gln Glu Ala Phe Ile Gly Asp
500 505 510
Gly Trp Phe Phe Thr Gly Asp Leu Gly Phe Leu Asn Glu Gly Arg Leu
515 520 525
Thr Val Thr Gly Arg Gln Lys Asp Ile Ile Ile Met Asn Gly Lys Asn
530 535 540
Tyr Tyr Asn Tyr Glu Ile Glu Ser Leu Val Glu Ser Val Tyr Gly Val
545 550 555 560
Gln Ser Thr Phe Val Ala Ala Ala Ala Phe Ile Asp Thr Ser Lys Ser
565 570 575
Val Glu Glu Leu Ala Ile Phe Phe Thr Pro Ile Asn Asp Ile Asp Glu
580 585 590
Arg Phe Leu Gly Phe Ile Ile Thr Glu Ile Arg Arg Thr Val Ser Arg
595 600 605
Asn Leu Gly Ile Thr Pro Lys His Ile Ile Pro Ile Lys Lys Glu Thr
610 615 620
Phe Ser Lys Thr Glu Ser Gly Lys Ile Gln Arg Asn Lys Phe Thr Glu
625 630 635 640
Arg Leu Tyr Ala Gly Glu Phe Asp Glu Ile Met Lys Gln Ile Glu Thr
645 650 655
Gln Asn Glu Asp Pro His Gln Thr Phe Pro Glu Trp Met Tyr Ala Arg
660 665 670
Lys Trp Glu Glu Arg Pro Ile Ser Val Leu Ala Ser Pro Leu Pro Gln
675 680 685
Glu Thr Tyr Leu Ile Phe Arg Asp Lys Leu Gly Leu Gly Asp Gln Trp
690 695 700
Ser Ala Leu Ser Gln Asn Gly Ser Ser Thr Met Ile Ile Val Asp Gln
705 710 715 720
Gly Glu Ser Phe Lys Lys Arg Gly Glu Phe His Tyr Glu Ile Asn Gln
725 730 735
His Glu Gln Lys Asp Tyr Gln Leu Leu Cys Glu Glu Leu Lys Lys Glu
740 745 750
Ser Val Thr Ile His His Val Leu His Leu Trp Thr Tyr Gln Glu Asp
755 760 765
Leu Asp Glu Ala Lys Lys Glu Leu Ser Tyr Leu Lys Glu Ser Gln Phe
770 775 780
Leu Gly Ser Phe Ser Leu Ile Phe Leu Leu Gln Ala Leu His Lys Glu
785 790 795 800
Val Gly Met Pro Lys Thr Leu Thr Phe Val Ser Ser Asn Val Phe Ser
805 810 815
Leu Glu Asn Pro Glu His Ser Ala Tyr Glu Lys Ala Thr Ile Pro Gly
820 825 830
Ile Ile Lys Thr Met Lys His Glu Phe Pro Asn Thr Ile Ala Lys Phe
835 840 845
Val Asp Ile Asp Thr Asn His Pro Pro Tyr His Pro Leu Gln Ala Leu
850 855 860
Gln Ala Glu Gln Gly Leu Pro Asp Glu Glu Ser Val Val Ile Tyr Ser
865 870 875 880
Lys Gly Lys Arg Phe Val Pro Lys Leu Gln Gln Val Asn Leu Gln Glu
885 890 895
Ala Asn Lys Lys Pro Leu Pro Leu Val Glu Lys Gly Phe Tyr Val Val
900 905 910
Thr Gly Gly Leu Gly Gly Val Gly Lys Gln Ile Ala His Tyr Leu Ile
915 920 925
Gln Lys Leu Asp Ala Asn Ile Leu Leu Leu Gly Lys Thr Ala Leu Pro
930 935 940
Ser Ala Thr Asp Lys Asn Thr Gly Thr Gln Ser Glu Arg Tyr Arg Ala
945 950 955 960
Leu Arg Glu Leu Glu Lys Leu Thr Thr Arg Glu Asn Gln Val Ile Tyr
965 970 975
Lys Ser Thr Asp Leu Ser Asp Val Thr Val Leu Glu Glu Ala Ile His
980 985 990
Glu Ser Met Leu Gly Met Asn Ala Lys Lys Val Asp Gly Ile Ile His
995 1000 1005
Leu Ala Gly Val Tyr Tyr Glu Lys Pro Leu Leu Glu Glu Ser Ile Glu
1010 1015 1020
Ser Leu Glu His Met Tyr Glu Ser Lys Val Ala Gly Thr Tyr Leu Leu
1025 1030 1035 1040
Gly Lys Leu Val Glu Lys Tyr Pro Glu Ala Ile Leu Val Thr Ser Ser
1045 1050 1055
Ser Ser Ser Gly Leu Leu Ser Gly Tyr Gly Val Gly Ala Tyr Thr Ser
1060 1065 1070
Ala Asn Met Phe Val Glu Thr Phe Thr Asp Tyr Val Ala Gln Lys Arg
1075 1080 1085
Asp Gly Val Gln Cys Phe Ala Trp Ser Leu Trp Asn Asp Ile Gly Leu
1090 1095 1100
Gly Gln Gln Phe Thr Asp Met Lys Asp Leu Leu Ile Arg Lys Gly His
1105 1110 1115 1120
Gln Pro Ile Ser Pro Gln Lys Gly Ile Phe Ser Phe Glu Leu Gly Leu
1125 1130 1135
Met Leu Glu His Asn Met Leu Tyr Ile Gly Leu Asn Lys Gly Val Ser
1140 1145 1150
Glu Ile Ala Thr Leu Cys Ser Glu Val Val Glu Lys Glu Leu Val Thr
1155 1160 1165
Thr Val Tyr Phe Ser Thr Ala His Asn Gln Phe Ser Leu Ser Glu Met
1170 1175 1180
Tyr Asn Thr Ile Arg Leu His Thr Ile Asp Glu Lys Gly His Ser Thr
1185 1190 1195 1200
Lys Val Ile Phe Arg Gln Val Asp Gln Leu His Ser Asp Gln Ala Glu
1205 1210 1215
Asp His Leu Val Ser Asn Asn Arg Arg Val Glu Glu Glu Ile Lys Gln
1220 1225 1230
Leu Trp Ser Glu Ile Leu Gln Val Asp Gln Leu Gln Ala Phe Asp Ser
1235 1240 1245
Phe Phe Asp Leu Gly Gly Ser Ser Leu Lys Ala Thr Gln Leu Leu Ser
1250 1255 1260
Ser Val Gln Ser Arg Phe Gly Val Ala Ile Pro Leu Lys Val Phe Phe
1265 1270 1275 1280
Glu Asn Pro Thr Ile Lys Gly Leu Ile His Arg Met Gly Asn Val Ala
1285 1290 1295
Gly Thr Lys Ala Ser Val Met Arg Ser Tyr Glu Pro Ala Lys Lys Arg
1300 1305 1310
Glu Lys Ile Ile Asp Leu Ser Pro Ser Gln Arg Gly Gln Trp Tyr Leu
1315 1320 1325
Tyr Gln Thr Arg Pro Asp Ser Pro Phe Tyr Asn Ile Thr Phe Thr Leu
1330 1335 1340
Thr Phe Glu Glu Asn Val Asn Arg Gln Ala Leu Leu Lys Ser Ile Gln
1345 1350 1355 1360
Ala Val Ile Gln Ser Gln Glu Ala Leu Arg Ser Glu Ile Arg Met Ile
1365 1370 1375
Glu Asp Asp Pro Lys Leu Phe Ile His Asp Thr Tyr Val Val Gly Ile
1380 1385 1390
Pro Ile Ile Asp Leu Ser Ser Tyr Pro Leu Glu Gln Lys Glu Gln Lys
1395 1400 1405
Leu Ala Ala Leu Ile Asp Thr Glu Ala Asn Thr Pro Phe Gln Met Met
1410 1415 1420
Gln Glu Ser Leu Ser Arg Phe Thr Leu Ile His Val Gly Asp Asn Val
1425 1430 1435 1440
His Ile Leu Leu Gly Ser Met His His Ile Ile Ser Asp Gly Trp Ser
1445 1450 1455
Ile His Val Phe Thr Lys Glu Leu Leu Arg Phe Tyr Lys Glu Ile Gln
1460 1465 1470
Glu Asn Gly Val Ile Asp Arg Lys Glu Leu Asp Tyr Thr Tyr Thr Ser
1475 1480 1485
Tyr Leu Leu Asp Gln Lys Glu Trp Leu Thr Lys Glu Asn Pro Glu Tyr
1490 1495 1500
Ala Asp Gln Leu Tyr Tyr Trp Lys Glu Asn Leu Ala Thr Glu Thr Ala
1505 1510 1515 1520
Pro Leu Thr Leu Pro Ile Gln Leu Ser Arg Pro Ala Ile Gln Ser Tyr
1525 1530 1535
Gln Gly Met Thr Ile Glu Gln Leu Val Thr Gln Glu Leu Thr Gln Asn
1540 1545 1550
Ile Met Glu Ala Ser Lys His Met Lys Ser Thr Pro Tyr Met Phe Leu
1555 1560 1565
Leu Ser Thr Phe Ala Ala Leu Leu Phe Arg Tyr Ser Gly Gln Asp Thr
1570 1575 1580
Phe Arg Leu Gly Thr Val Leu Ala Asn Arg Asn Met Pro His Thr Glu
1585 1590 1595 1600
Lys Ile Ile Gly Tyr Leu Ala Asn Thr Val Thr Leu Ser Phe Asp Phe
1605 1610 1615
Lys Glu Glu Leu Thr Phe Asp Gln Leu Leu Asp Lys Val Arg Gly Met
1620 1625 1630
Ala Leu Glu Ala His Asp Asn Gln Asn Val Pro Leu Glu Thr Val Leu
1635 1640 1645
Arg Glu Leu Gln Val Arg His Thr Ala Asp Met Ala Pro Leu Phe Gln
1650 1655 1660
Ile Val Phe Thr Met Gln Asn Ala Val Gln His Leu Tyr Gln Asn Glu
1665 1670 1675 1680
Gln Ile Lys Thr Tyr Leu Asp Ile Val Ser Asn Lys Thr Ser Lys Tyr
1685 1690 1695
Asp Leu Ser Leu His Val Tyr Glu Glu Glu His Gln Leu Arg Leu Lys
1700 1705 1710
Leu Glu Phe Asn Thr Asp Leu Phe Glu Glu Glu Ala Met Lys Thr Leu
1715 1720 1725
Leu Gly His Tyr Leu Asn Phe Ile Gln Val Ile Thr Asn Asp Lys Val
1730 1735 1740
His Thr Gly
1745
<210> 52
<211> 337
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 52
Met Asp Tyr Val Gln Leu Gly Ser Ser Glu Leu Lys Cys Ser Arg Ile
1 5 10 15
Gly Leu Gly Thr Trp Ala Ile Gly Gly Trp Ala Trp Gly Gly Thr Asn
20 25 30
Glu Lys Asp Ala Ile Gln Ala Ile His Arg Ala Phe Asp Ile Gly Ile
35 40 45
Asn Thr Leu Asp Thr Ala Pro Ile Tyr Gly Phe Gly Leu Ser Glu Glu
50 55 60
Ile Cys Gly Lys Ala Ile Lys Gln Tyr Gly Gln Arg Asp Lys Ile His
65 70 75 80
Ile Ala Thr Lys Ala Gly Val Glu Trp Val Gly Glu Glu Gly Val Tyr
85 90 95
Trp Cys Asn Ser Ser Lys Glu Arg Ile Leu Gln Glu Phe Glu Asp Ser
100 105 110
Leu Arg Arg Leu Gln Val Asp Tyr Ile Asp Leu Tyr Gln Leu His Trp
115 120 125
Pro Asp Pro Gly Leu Pro Leu Glu Glu Thr Ala Glu Ile Phe Ala Gln
130 135 140
Leu His Lys Glu Gly Lys Ile Arg Ala Ile Gly Val Ser Asn Leu Ser
145 150 155 160
Ile Glu Gln Ile Glu Gln Trp Gln Lys Val Ala Pro Leu His Ser Ser
165 170 175
Gln Asn Arg Leu Asn Leu Leu Gln Thr Glu His Lys Asp Thr Phe Leu
180 185 190
Tyr Cys Asp Lys Asn His Ile Asn Thr Ile Thr Trp Gly Thr Leu Ala
195 200 205
Gln Gly Leu Leu Thr Gly Lys Phe Thr Lys Asp Ser Thr Phe Ala Glu
210 215 220
Asp Asp Leu Arg His Gly Tyr Pro Leu Phe Ala Pro Glu Tyr Phe Asp
225 230 235 240
Gln Tyr Leu Gln Ala Val Asp Arg Leu Lys Glu Ile Ala Gln Glu Lys
245 250 255
Gly Lys Thr Met Ala Gln Leu Ala Val Arg Trp Thr Leu Asp His Pro
260 265 270
Gly Val Ser Ile Ser Leu Trp Gly Ala Arg Lys Pro Ser Gln Leu Asp
275 280 285
Glu Val Ser Gly Val Met Gly Trp Ser Leu Ser Ala Gln Asp Val Glu
290 295 300
Gln Ile Asp Gln Ile Ile Lys Glu Ser Leu His Asp Pro Phe His Asp
305 310 315 320
Pro Thr Leu Gly Pro Pro Thr Lys Glu Glu Trp Gln Glu Met Gly Val
325 330 335
Leu
<210> 53
<211> 2491
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 53
Leu Ile Asn Thr Ser Asp Val Lys Asp Ile Tyr Ser Leu Ser Pro Met
1 5 10 15
Gln Arg Gly Met Leu Phe His Thr Leu Lys Asp Lys Glu Asn Leu Ala
20 25 30
Tyr Phe Asp Gln Thr Thr Phe Gln Ile Glu Gly Asp Ile Cys Val Glu
35 40 45
Ser Leu Glu Lys Ser Phe Asn Glu Leu Ile Arg Lys Tyr Asp Val Leu
50 55 60
Arg Thr Ile Phe Leu Tyr Gln Lys Leu Lys Glu Pro Met Gln Val Val
65 70 75 80
Leu Lys Glu Arg Thr Ala Asn Ile His Tyr Glu Asp Phe Ser Met Lys
85 90 95
Ser Glu Ser Asp Lys Ala Lys Ala Leu Arg Val Ala Lys Gln Arg Asp
100 105 110
Arg Asp Glu Gly Phe Asp Leu Ser Arg Asp Ile Leu Met Arg Leu Ser
115 120 125
Leu Leu Lys Val Ala Pro Asn Gln Tyr Glu Leu Val Ile Ser Ser His
130 135 140
His Ile Ile Ile Asp Gly Trp Cys Thr Gly Ile Leu Tyr Gln Glu Leu
145 150 155 160
Phe Tyr Phe Tyr Gln Cys Phe Val Ala Asn Gln Pro Ile Pro Ala Glu
165 170 175
Lys Ser Ile Pro Tyr Ser Arg Tyr Ile Arg Trp Leu Glu Glu Gln Asp
180 185 190
Glu Glu Glu Gly Lys Ala Tyr Trp Gly Glu Tyr Leu Gln Asp Phe Glu
195 200 205
Gly Ala Ser Val Ile Pro Lys Gln Asn Ala Lys Gly Glu Lys Glu Val
210 215 220
Cys Ser Ile Asp Lys Val Thr Phe His Phe Asp Lys Lys Leu Thr Glu
225 230 235 240
Glu Leu Val Gln Val Ala Lys Ala Cys Gln Val Thr Ile Ser Thr Leu
245 250 255
Phe Gln Thr Met Trp Gly Ile Leu Leu Gln Lys Tyr Asn Asn Ser Gln
260 265 270
Glu Ala Ile Phe Gly Ser Val Ile Ser Gly Arg Ser Pro Glu Ile Pro
275 280 285
Asp Val Glu Lys Ile Val Gly Ile Phe Ile Asn Thr Ile Pro Val Arg
290 295 300
Ile Arg Thr Leu Asp Lys Gln Thr Phe Lys Glu Leu Leu Ile Gln Val
305 310 315 320
Gln Glu Ala Ser Val Asn Ser Glu Lys Tyr Asn Tyr Leu Thr Leu Ala
325 330 335
Asp Ile Gln Ala Val Thr Gly Ser Asn His Ala Leu Ile His His Ile
340 345 350
Val Ala Phe Glu Asn Phe Pro Ile Ala Ser Asp Ser Phe Val Asp Ser
355 360 365
Ser Glu Ser Asp Ser Glu Glu Leu Lys Val Val Asn Val Ile Asp Asp
370 375 380
His Glu Lys Thr Asn Phe Asp Phe Ser Val Gln Val Gln Leu Asp Thr
385 390 395 400
Glu Leu Leu Val Lys Ile Ser Tyr Asn Gln His Leu Tyr His Arg Ser
405 410 415
Phe Ile Glu Asn Ile Phe His His Leu Gln Gln Ile Ala Gly Ser Ile
420 425 430
Ile His Asn Leu Asp Ile Gln Ile Asn Glu Ile Asp Ile Val Ser Lys
435 440 445
Glu Glu Lys Lys Gln Leu Leu Arg Tyr Ser Thr Pro Ala Lys Ser Asp
450 455 460
Phe Pro Met Asp Lys Thr Ile His Gln Leu Phe Glu Glu Gln Val Ser
465 470 475 480
Arg Thr Pro Glu Gln Ile Ala Val Val Phe Lys Gly Glu Ser Phe Thr
485 490 495
Tyr Arg Glu Leu Asn Glu Lys Ala Asn Gln Leu Ala Trp Val Leu Arg
500 505 510
Lys Arg Glu Val Arg Pro Asn Glu Ile Val Ala Ile Met Ala Glu His
515 520 525
Ser Leu Glu Met Leu Val Gly Val Ile Gly Thr Leu Lys Ala Gly Ala
530 535 540
Ala Tyr Leu Pro Ile Asp Pro Ser Tyr Pro Glu Lys Arg Ile Ala His
545 550 555 560
Met Leu Gln Asp Ser Lys Ala Glu Gln Leu Leu Ile Gln Pro His Leu
565 570 575
Asn Met Pro Gln Asp Phe Lys Gly Ser Val Leu Trp Leu Thr Glu Glu
580 585 590
Ser Trp Ala Lys Glu Ser Thr Thr Asp Leu Pro Leu Ala Thr Ser Ala
595 600 605
Asn Asp Leu Ala Tyr Met Ile Tyr Thr Ser Gly Ser Thr Gly Leu Pro
610 615 620
Lys Gly Val Met Val Glu His Gln Ala Leu Val Asn Leu Val Met Trp
625 630 635 640
His Asn Glu Ala Phe Gly Val Thr Met Thr Asp Gln Cys Thr Lys Leu
645 650 655
Ala Gly Phe Gly Phe Asp Ala Ser Val Trp Glu Thr Phe Pro Pro Leu
660 665 670
Ile Gln Gly Ala Thr Leu His Val Leu Glu Glu Ser Arg Arg Gly Asp
675 680 685
Ile Tyr Ala Leu His Glu Tyr Phe Glu Lys Asn Ala Ile Thr Ile Ser
690 695 700
Phe Leu Pro Thr Gln Leu Ala Glu Gln Phe Met Glu Leu Thr Ser Ser
705 710 715 720
Thr Leu Arg Val Leu Leu Ile Gly Gly Asp Arg Ala Gln Lys Val Lys
725 730 735
Lys Thr Thr Tyr Gln Ile Ile Asn Asn Tyr Gly Pro Thr Glu Asn Thr
740 745 750
Val Val Thr Thr Ser Gly Gln Leu His Pro Glu Gln Asp Val Phe Pro
755 760 765
Ile Gly Lys Pro Ile Thr Asn His Ser Val Tyr Ile Leu Asp Gln Asn
770 775 780
Arg His Pro Gln Pro Ile Gly Ile Pro Gly Glu Leu Cys Val Ser Gly
785 790 795 800
Ala Gly Leu Ala Arg Gly Tyr Leu Asn Gln Pro Glu Leu Thr Val Glu
805 810 815
Arg Phe Val Asp Asn Pro Phe Val Pro Gly Glu Arg Ile Tyr Arg Thr
820 825 830
Gly Asp Leu Val Arg Trp Arg Ile Asp Gly Ser Ile Glu Tyr Leu Gly
835 840 845
Arg Ile Asp Glu Gln Val Lys Ile Arg Gly Tyr Arg Ile Glu Leu Gly
850 855 860
Glu Ile Glu Thr Lys Leu Leu Glu His Pro Ser Ile Ser Glu Ala Leu
865 870 875 880
Val Val Ala Arg Asn Asp Glu Gln Gly Tyr Thr Tyr Leu Cys Ala Tyr
885 890 895
Val Val Ala Thr Gly Ala Trp Ser Val Ser Ser Leu Arg Glu His Leu
900 905 910
Ile Glu Thr Leu Pro Glu Tyr Met Ile Pro Ala Tyr Met Met Glu Val
915 920 925
Glu Lys Met Pro Leu Thr Ala Asn Gly Lys Val Asp Lys Arg Ala Leu
930 935 940
Pro Val Pro Asp Arg Gln Arg Met Asn Glu Tyr Val Ala Pro Ala Thr
945 950 955 960
Glu Thr Glu Glu Lys Leu Val Leu Leu Phe Gln Glu Ile Leu Gly Leu
965 970 975
Glu Arg Ile Gly Thr Lys Asp His Phe Phe Glu Leu Gly Gly His Ser
980 985 990
Leu Lys Ala Met Met Leu Val Ser Arg Met His Lys Glu Leu Gly Val
995 1000 1005
Asp Val Gln Leu Asn Glu Met Phe Ala Arg Pro Thr Val Lys Asp Leu
1010 1015 1020
Ser Ala Tyr Ile Asp Gln Met Asn Gly Ser Ala Tyr Thr Ala Ile Gln
1025 1030 1035 1040
Pro Val Glu Glu Gln Pro Tyr Tyr Pro Val Ser Phe Ala Gln Arg Arg
1045 1050 1055
Met Tyr Val Val Gln Gln Met Arg Asp Ser Glu Thr Thr Ser Tyr Asn
1060 1065 1070
Ile Pro Val Thr Phe Glu Leu Lys Gly Lys Leu His Leu Asp Lys Leu
1075 1080 1085
Arg Glu Ala Leu Gln Ile Leu Val Leu Arg His Glu Ser Leu Arg Thr
1090 1095 1100
Ser Phe His Met Ile Asp Glu Asn Leu Val Gln Lys Val Asn Lys Asp
1105 1110 1115 1120
Ile Ser Trp Asp Leu Glu Val Ile Glu Ala Gln Glu Ser Glu Ile Glu
1125 1130 1135
Val Lys Leu Lys Glu Phe Ile Arg Pro Phe His Leu Ser Glu Ala Pro
1140 1145 1150
Leu Phe Arg Ala Arg Leu Ile Cys Leu Asn Pro Gln His His Leu Leu
1155 1160 1165
Ser Leu Asp Met His His Ile Ile Ser Asp Gly Val Ser Met Asn Leu
1170 1175 1180
Phe Leu Gln Glu Phe Met Thr Leu Tyr Gln Gly Glu Ala Leu Pro Ala
1185 1190 1195 1200
Leu Ser Ile Gln Tyr Lys Asp Tyr Ala Val Trp Gln Gln Ser Asp Lys
1205 1210 1215
Gln Arg Ala Arg Leu Lys Glu Gln Glu Lys Tyr Trp Leu His His Phe
1220 1225 1230
Ser Gly Glu Leu Pro Thr Leu Glu Leu Pro Thr Asp Phe Pro Arg Pro
1235 1240 1245
Ala Ile Gln Gln Phe Asp Gly Asp Glu Trp Ser Phe Glu Met Asn Ala
1250 1255 1260
Asp Leu Leu Ala Lys Val Lys Gln Ile Cys Ser Ser Gln Gly Thr Thr
1265 1270 1275 1280
Leu Tyr Met Thr Leu Leu Ala Ala Tyr Gln Val Phe Leu Ala Arg Tyr
1285 1290 1295
Thr Gly Gln Glu Asp Ile Ile Val Gly Ser Pro Ile Ala Gly Arg Ser
1300 1305 1310
His Ala Asp Leu Glu Asn Met Ile Gly Met Phe Val Asn Thr Leu Ala
1315 1320 1325
Leu Arg Gly Lys Pro Lys Ala Asp Gln Ser Phe Leu Ser Tyr Leu Lys
1330 1335 1340
Gln Val Lys Glu Thr Val Phe Gln Ala Tyr Ala Asn Ala Glu Tyr Pro
1345 1350 1355 1360
Phe Glu Glu Leu Ile Glu Lys Leu Asp Leu Glu Arg Asp Met Ser Arg
1365 1370 1375
His Pro Leu Phe Asp Thr Leu Phe Ser Leu Gln Asn Met Glu Ile Ser
1380 1385 1390
Glu Phe Gln Met Asn Asn Leu Glu Ile Phe Pro Tyr Glu Thr Gly Gln
1395 1400 1405
Lys Asn Ala Lys Phe Ala Leu Ser Trp Leu Ile Ala Glu Gly Glu Ser
1410 1415 1420
Leu Tyr Val Thr Ile Glu Tyr Ser Thr Lys Cys Phe Lys Arg Glu Thr
1425 1430 1435 1440
Ile Ile Arg Met Ala Ser His Phe Glu Gln Leu Leu Ala Gln Ile Val
1445 1450 1455
Glu Gln Pro Glu Ala Arg Ile Gly His Leu Glu Leu Val Ala Asp Ala
1460 1465 1470
Glu Arg Lys Met Leu Leu Glu Asp Phe Asn Leu Thr Lys Val Asp Tyr
1475 1480 1485
Pro Arg Glu Lys Thr Ile Gln Glu Leu Phe Glu Glu Gln Val Asp Lys
1490 1495 1500
Asn Pro Asp Gln Ile Ala Leu Ile Cys Gly Glu Gln Gln Phe Thr Tyr
1505 1510 1515 1520
Glu Gln Leu Asn Val Lys Phe Asn Gln Leu Ala His Val Leu Arg Arg
1525 1530 1535
Glu Gly Val Gln Pro Asn Gln Val Ile Gly Leu Ile Thr Asp Arg Ser
1540 1545 1550
Leu Ser Met Ile Val Gly Ile Phe Gly Ile Ile Lys Ala Gly Gly Gly
1555 1560 1565
Tyr Leu Pro Ile Asp Pro Thr Tyr Pro Thr Glu Arg Ile Glu Tyr Met
1570 1575 1580
Leu Glu Asp Ser Gln Thr His Leu Leu Leu Val Gln His Arg Asp Met
1585 1590 1595 1600
Val Pro Ala Gly Tyr Gln Gly Glu Val Leu Ile Ile Glu Asp Glu Ile
1605 1610 1615
Ser Arg Asp Glu Gln Val Ala Asn Ile Glu Leu Ile Asn Gln Pro Gln
1620 1625 1630
Asp Leu Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys
1635 1640 1645
Gly Asn Leu Thr Thr His Arg Asn Ile Ile Lys Thr Val Cys Asn Asn
1650 1655 1660
Gly Tyr Ile Glu Ile Thr Thr Glu Asp Arg Leu Leu Gln Leu Ser Asn
1665 1670 1675 1680
Tyr Ala Phe Asp Gly Ser Thr Phe Asp Ile Phe Ser Ser Leu Leu His
1685 1690 1695
Gly Ala Thr Leu Val Leu Val Pro Lys Glu Val Ile Leu Asn Pro Thr
1700 1705 1710
Asp Leu Val Thr Leu Ile Arg Glu Gln Gln Ile Thr Val Ser Phe Met
1715 1720 1725
Thr Thr Ser Leu Phe Asn Ala Leu Val Glu Leu Asp Val Ser Ser Phe
1730 1735 1740
Gln Asn Met Arg Lys Ile Ala Phe Gly Gly Glu Lys Ala Ser Phe Lys
1745 1750 1755 1760
His Val Glu Lys Ala Leu Asp Phe Leu Gly Asn Gly Arg Leu Val Asn
1765 1770 1775
Gly Tyr Gly Pro Thr Glu Thr Thr Val Phe Ala Thr Thr Tyr Thr Val
1780 1785 1790
Asp Glu Arg Ile Lys Glu Trp Gly Ile Ile Pro Ile Gly Arg Pro Leu
1795 1800 1805
His Asn Thr Thr Val His Ile Leu Ser Ala Asp Asp Lys Leu Gln Pro
1810 1815 1820
Ile Gly Val Ile Gly Glu Leu Cys Val Ser Gly Glu Gly Leu Ala Arg
1825 1830 1835 1840
Gly Tyr Leu Asn Leu Pro Glu Leu Thr Met Glu Arg Phe Val Glu Asn
1845 1850 1855
Pro Phe Arg Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Leu Ala Arg
1860 1865 1870
Trp Leu Pro Asp Gly Val Leu Glu Tyr Val Gly Arg Lys Asp Glu Gln
1875 1880 1885
Val Lys Ile Arg Gly His Arg Ile Glu Leu Ser Glu Ile Glu Thr Arg
1890 1895 1900
Ile Leu Glu His Pro Ala Ile Ser Glu Thr Val Leu Leu Ala Lys Arg
1905 1910 1915 1920
Asn Glu Gln Gly Ser Ser Tyr Leu Cys Ala Tyr Ile Val Ala His Gly
1925 1930 1935
Gln Trp Asn Val Gln Glu Leu Arg Lys His Val Arg Asp Ala Leu Pro
1940 1945 1950
Glu His Met Val Pro Ser Tyr Phe Ile Gly Leu Asp Lys Leu Pro Leu
1955 1960 1965
Thr Ser Asn Gly Lys Val Asp Lys Arg Ala Leu Pro Glu Pro Glu Gly
1970 1975 1980
Ser Leu Gln Leu Thr Arg Glu Ile Val Ala Pro Arg Asn Glu Thr Glu
1985 1990 1995 2000
Lys Gln Leu Val Glu Ile Val Ala Glu Val Leu Gly Leu Glu Ala Ser
2005 2010 2015
Glu Ile Ser Ile Thr Asp Asn Leu Phe Glu Leu Gly Gly His Ser Leu
2020 2025 2030
Thr Ile Leu Arg Ile Leu Ala Lys Val His Thr Cys Asn Trp Lys Leu
2035 2040 2045
Glu Met Lys Asp Phe Tyr Asn Cys Lys Asn Leu Glu Glu Ile Ala Ser
2050 2055 2060
Lys Ala Ser Asp Met Gln Glu Asn Gln Asn Leu Ser Gly Ser Gly Ser
2065 2070 2075 2080
Val Phe Lys Lys Gly Gly Lys Lys Ser Ile Pro Val Val Pro Val Cys
2085 2090 2095
Asp Arg Gln Lys Glu Met Glu His Val Leu Leu Leu Gly Ser Thr Gly
2100 2105 2110
Phe Leu Gly Ile His Leu Leu His Glu Leu Leu Gln Lys Thr Glu Ala
2115 2120 2125
Thr Ile Leu Cys Val Ile Arg Ala Glu Asn Asp Glu Ala Ala Met Gln
2130 2135 2140
Arg Leu Arg Lys Lys Ile Asp Phe Tyr Phe Thr Ser Gln Tyr Ser Ser
2145 2150 2155 2160
Ser Gln Ile Asp Glu Trp Phe Thr Arg Ile Gln Ile Ile His Gly Asp
2165 2170 2175
Ile Thr Gln Asp Asn Phe Gly Leu Glu Ala Lys His Tyr Glu Ser Leu
2180 2185 2190
Gly Ala Ile Val Asp Thr Val Ile His Thr Ala Ala Leu Val Lys His
2195 2200 2205
Tyr Gly His Tyr Glu Glu Phe Glu Arg Ala Asn Val His Gly Thr Gln
2210 2215 2220
Gln Val Val Thr Phe Cys Leu Asn Asn Lys Leu Pro Met His Tyr Val
2225 2230 2235 2240
Ser Thr Leu Ser Val Ser Gly Thr Thr Val Glu Glu Ala Thr Glu Leu
2245 2250 2255
Val Glu Phe Thr Glu Lys Asp Phe Tyr Val Gly Gln Asn Tyr Glu Ser
2260 2265 2270
Asn Val Tyr Leu Arg Ser Lys Phe Glu Ala Glu Ala Val Leu Val Gly
2275 2280 2285
Gly Met Glu Asn Gly Leu Asp Ala Arg Ile Tyr Arg Val Gly Asn Leu
2290 2295 2300
Thr Gly Arg Phe Gln Asp Gly Trp Phe Gln Glu Asn Ile Asn Glu Asn
2305 2310 2315 2320
Met Phe Tyr Leu Leu Ser Lys Ala Phe Leu Glu Leu Gly Gly Phe Asp
2325 2330 2335
Gln Glu Ile Met Gln Gly Met Val Asp Leu Thr Pro Ile Asp Ile Cys
2340 2345 2350
Ala Gln Ala Ile Ile His Ile Ile Asn Ser Lys Gly Ile Glu Glu Arg
2355 2360 2365
Val Phe His Leu Gln Asn Pro His Leu Val Thr Tyr Asp Asp Met Tyr
2370 2375 2380
Arg Val Phe Glu Gly Leu Gly Phe Ser Arg Arg Val Gln Ser Arg Glu
2385 2390 2395 2400
Asp Val Thr Arg Glu Leu Asp Val Met Met Ser Gln Gly Asn Glu Lys
2405 2410 2415
Leu Phe Leu Ala Gly Ile Leu Thr Thr Met Leu Asp Asp Val Glu Arg
2420 2425 2430
Ala Glu Gln Phe Asn Val Ala Val Asp Ser Ser Arg Thr Met Gln Leu
2435 2440 2445
Leu Glu Asp Thr Ser Phe Thr Tyr Pro Val Pro Asp Asp Glu Tyr Leu
2450 2455 2460
Arg Lys Leu Ala Met His Met Ile Lys Val Gly Phe Val Thr Pro Asn
2465 2470 2475 2480
His Thr Val Ala Glu Lys Ile Gly Thr Ser Arg
2485 2490
<210> 54
<211> 2526
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 54
Met Leu Ser Lys Ala Asn Ile Lys Asp Ile Tyr Thr Leu Ser Pro Leu
1 5 10 15
Gln Lys Gly Met Leu Phe Gln His Leu Lys Glu Glu Ser Thr Ala Tyr
20 25 30
Phe Glu Gln Leu His Phe Thr Ile Lys Gly Gln Leu Tyr Val Asp Ser
35 40 45
Phe Glu Ala Ser Phe Gln His Leu Ile Asn Lys Tyr Asp Val Leu Arg
50 55 60
Thr Val Phe Leu Tyr Lys Asn Met Thr Gln Pro Met Gln Met Val Leu
65 70 75 80
Lys Glu Arg Lys Thr Ser Val His Phe Glu Asp Ile Ser His Leu Asp
85 90 95
Ser Lys Ala Val Ser Glu Tyr Val Glu Glu Phe Lys Asn Gln Asp Arg
100 105 110
Glu Lys Gly Phe Glu Leu Ser Lys Asp Ile Leu Met Arg Phe Ala Ile
115 120 125
Leu Lys Ala Gly Ala Glu Ser Tyr His Leu Ile Trp Ser Phe His His
130 135 140
Ile Leu Met Asp Gly Trp Cys Met Gly Ile Val Leu Gln Asp Leu Phe
145 150 155 160
Arg Met Tyr Gln Gln His Arg Gln Asn Ile Pro Ile Thr Val Glu Ser
165 170 175
Val Pro Ala Tyr Ser Glu Tyr Ile Arg Trp Leu Glu Lys Gln Asn Val
180 185 190
Thr Lys Ala Arg Asp Tyr Trp Lys Asn Tyr Leu Glu Gly Tyr Glu Glu
195 200 205
Leu Thr Gly Ile Ile His Leu Asp Thr Lys His Thr Ser His Asn Asn
210 215 220
Glu Val Gln Glu Cys Ala Phe Thr Leu Asp Lys Asp Ile Thr Glu Gly
225 230 235 240
Leu Thr Gln Leu Ala Arg His Tyr Ser Val Thr Val Asn Thr Leu Phe
245 250 255
Gln Thr Ile Trp Gly Met Leu Leu Gln Lys Tyr Asn Asn Lys Asp Asp
260 265 270
Val Val Phe Gly Ala Val Val Ser Gly Arg Pro Ser Glu Ile His Gly
275 280 285
Val Glu Asn Met Val Gly Leu Phe Ile Asn Thr Val Pro Ile Arg Ile
290 295 300
Gln Lys Gln Thr Asn Asp Thr Phe Ser His Leu Leu Lys Arg Val His
305 310 315 320
Glu Ser Thr Leu Leu Ser Lys Gln Tyr Glu Phe Val Ser Leu Ala Asp
325 330 335
Ile Gln Thr Asp Ala Gly Phe Ser Gly Gln Leu Leu Asp His Ile Leu
340 345 350
Val Phe Glu Asn Tyr Pro Ile Ser Glu Gly Ser Phe Glu Glu Glu Glu
355 360 365
Phe Thr Met Asp Ser Ile Lys Thr Tyr Glu Lys Thr Ser Tyr Asp Leu
370 375 380
Asn Val Met Ile Arg Pro Asn Glu Asp Gln Leu Asp Ile Ala Phe Gln
385 390 395 400
Phe Asn Asp Asp Val Tyr Ser Ser Glu Asn Val Lys Arg Leu Phe Gln
405 410 415
His Met Lys Gln Leu Ala Leu Ala Val Ile Lys Asn Pro Asp Val Arg
420 425 430
Leu Glu Glu Ile Ala Met Ile Thr Glu Glu Glu Arg Tyr Gln Ile Leu
435 440 445
His Asp Phe Gln Gly Glu Ile Val Asp Phe Val Thr Glu Lys Thr Leu
450 455 460
Pro Glu Leu Phe Glu Asp Gln Val Lys Arg Thr Pro Glu Ala Ile Ala
465 470 475 480
Leu Arg Phe Glu Asp Gln Gln Leu Thr Tyr Gln Glu Leu Asn Gln Arg
485 490 495
Val Asn Gln Leu Ala Trp Thr Leu Arg Met Lys Gly Leu Gln Gln Glu
500 505 510
Glu Leu Val Gly Ile Met Val Gln Arg Ser Leu Glu Met Ile Val Gly
515 520 525
Val Leu Ala Val Ile Lys Ala Gly Gly Ala Tyr Val Pro Ile Asp Pro
530 535 540
Glu Tyr Pro Leu Asp Arg Ile Gln Tyr Met Leu Glu Asp Ser Gly Thr
545 550 555 560
Asn Trp Leu Leu Thr Thr Lys Gln Ser Glu Ile Pro Ser Ile Tyr Gln
565 570 575
Gly His Val Leu Tyr Leu Glu Glu Asp Thr Val Tyr His Glu Arg Ser
580 585 590
Ser Asp Val Glu Ile Val Asn Gln Ser Ser Asp Leu Ala Tyr Ile Ile
595 600 605
Tyr Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Met Ile Asp His
610 615 620
Arg Ala Val His Asn Leu His Leu Ser Ala Gly Ile Tyr Gly Ile Ala
625 630 635 640
Gln Gly Ser Gln Val Leu Gln Phe Ala Ser Leu Ser Phe Asp Ala Ser
645 650 655
Val Gly Asp Ile Phe His Ser Leu Leu Thr Gly Ala Thr Leu His Leu
660 665 670
Val Lys Lys Glu Gln Leu Leu Ser Gly His Ala Phe Met Glu Trp Leu
675 680 685
Asp Glu Ala Gly Ile Thr Thr Ile Pro Phe Ile Pro Pro Ser Val Leu
690 695 700
Lys Glu Leu Pro Tyr Ala Lys Leu Pro Lys Leu Lys Thr Ile Ser Thr
705 710 715 720
Gly Gly Glu Glu Leu Pro Ala Asp Leu Val Arg Ile Trp Gly Ala Asn
725 730 735
Arg Thr Phe Leu Asn Ala Tyr Gly Pro Thr Glu Thr Thr Val Asp Ala
740 745 750
Ser Ile Gly Asn Cys Val Glu Met Thr Asp Lys Pro Ser Ile Gly Thr
755 760 765
Pro Thr Val Asn Lys Arg Ala Tyr Ile Leu Asp Gln Tyr Gly His Ile
770 775 780
Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Val Gly Gly Glu Gly Val
785 790 795 800
Ala Arg Gly Tyr Leu His Arg Pro Glu Leu Thr Asp Glu Lys Phe Val
805 810 815
Asn Asp Pro Tyr Val Pro Asn Gly Arg Met Tyr Lys Thr Gly Asp Leu
820 825 830
Ala Arg Trp Leu Pro Asp Gly Thr Ile Glu Phe Leu Gly Arg Met Asp
835 840 845
Gly Gln Val Lys Ile Arg Gly Phe Arg Ile Glu Leu Gly Glu Ile Glu
850 855 860
Ala Arg Leu Asn Gln Ala Pro Ser Val Lys Gln Ala Val Val Leu Ala
865 870 875 880
Arg Ser Gly Glu Gln Lys Gln Val Tyr Leu Cys Ala Tyr Leu Val Thr
885 890 895
Asp Asn Asp Leu Lys Val Ser Ala Leu Arg Lys Glu Leu Ser Gln Thr
900 905 910
Leu Pro Asp Tyr Met Ile Pro Ser Phe Phe Ile Lys Val Asp Lys Ile
915 920 925
Pro Val Thr Val Asn Gly Lys Ile Asp Lys Lys Ala Leu Pro Glu Pro
930 935 940
Glu Lys Glu Val Glu Leu Gln Thr Glu Tyr Val Ala Pro Thr Asn Pro
945 950 955 960
Thr Glu Glu Ile Leu Val Gln Ile Trp Gln Lys Val Leu Gly Met Glu
965 970 975
Arg Val Gly Ile Glu Asp Asn Phe Phe Glu Leu Gly Gly His Ser Ile
980 985 990
Lys Ala Met Met Leu Ala Ser Asn Ile Tyr Lys Glu Leu Lys Ile Asp
995 1000 1005
Leu Pro Leu Arg Glu Ile Phe Lys His Thr Thr Val Lys Glu Met Ala
1010 1015 1020
Arg Phe Ile Asp Gly Arg Asp Glu Glu Glu Tyr Val Gly Ile Gln Pro
1025 1030 1035 1040
Ala Ala Lys Lys Glu Tyr Tyr Pro Val Ser Ser Ala Gln Lys Arg Met
1045 1050 1055
Tyr Val Ile Gln Ser Leu Glu Asp Lys Ala Gln Gly Thr Ser Tyr Asn
1060 1065 1070
Met Pro Ser Phe Tyr Lys Met Lys Gly Ser Val Asp Ala Glu Lys Leu
1075 1080 1085
Glu Lys Val Phe Gln Thr Leu Met Asp Arg His Glu Ser Leu Arg Thr
1090 1095 1100
Ser Phe His Met Ile Glu Glu Gln Leu Val Gln Lys Val His Glu Gln
1105 1110 1115 1120
Val Ser Trp Lys Met Asp Met Lys Thr Val Ser Ala Asn Asp Val Ser
1125 1130 1135
Arg Leu Lys Asp Ser Phe Val Gln Pro Phe Asp Ile Ser Thr Ala Pro
1140 1145 1150
Leu Phe Arg Ala Ser Leu Leu Thr Ile Gln Lys Asp Glu His Ile Leu
1155 1160 1165
Met Met Asp Val His His Ile Val Gly Asp Gly Val Ser Thr Thr Ile
1170 1175 1180
Leu Phe Gln Glu Leu Ile Gln Leu Tyr Gln Gly Gln Ala Leu Pro Glu
1185 1190 1195 1200
Val Lys Val His Tyr Lys Asp Tyr Ala Val Trp Gln Leu Ser Gln Gln
1205 1210 1215
Asp Arg Leu Lys Glu Ser Glu Asn Phe Trp Leu Gln Gln Phe Ser Gly
1220 1225 1230
Glu Leu Pro Val Leu Glu Leu Pro Thr Asp Tyr Ser Arg Pro Pro Ile
1235 1240 1245
Arg Arg Leu Glu Gly Glu Tyr Val Ser Gln Ser Leu Arg Gly Glu Leu
1250 1255 1260
His Glu Ser Val Lys Ala Phe Met Lys Asn His Glu Val Thr Leu Tyr
1265 1270 1275 1280
Met Val Leu Leu Ala Thr Tyr Asn Val Leu Leu His Lys Tyr Thr Asn
1285 1290 1295
Gln Arg Asp Ile Ile Val Gly Thr Pro Val Ser Asp Arg Pro His Pro
1300 1305 1310
Asp Val Met Ser Thr Val Gly Met Phe Val Asn Thr Leu Ala Val Arg
1315 1320 1325
Asn Gln Leu Lys Ser Glu Gln Thr Phe Glu Glu Phe Leu Ala Asn Val
1330 1335 1340
Lys Asn Lys Met Leu Glu Val Tyr Gly His Gln Glu Tyr Pro Phe Glu
1345 1350 1355 1360
Asp Val Ile Glu Lys Val Lys Val Gln Arg Asp Thr Ser Arg His Pro
1365 1370 1375
Leu Phe Asp Thr Met Phe Gly Val Gln Asn Leu Glu Ile Ser His Val
1380 1385 1390
Glu Leu Pro Asp Trp Gly Ile Glu Ala Leu Asp Ile Asp Trp Thr Asn
1395 1400 1405
Ser Lys Phe Asp Met Ser Trp Met Val Phe Glu Ala Asp Gly Leu Glu
1410 1415 1420
Ile Gly Val Glu Tyr Ser Thr Ser Leu Phe Glu Arg Asn Thr Ile Gln
1425 1430 1435 1440
Arg Met Ile Gly His Phe Glu His Ile Ile Glu Gln Ile Met Glu Asn
1445 1450 1455
Pro Gln Ile Arg Leu Ala Asp Ile Gln Leu Thr Thr Glu Asp Glu Arg
1460 1465 1470
Ile Gln Ile Leu Glu Glu Phe Asn His Gln Pro Thr Lys Ile Thr Tyr
1475 1480 1485
Asp Gln Ala Ile Gln Asn Arg Phe Glu Glu Gln Ala Met Lys Thr Pro
1490 1495 1500
Asp Ala Val Ala Leu Val Tyr Lys Gly Gln Glu Leu Thr Tyr Arg Glu
1505 1510 1515 1520
Leu Asn Gln Arg Ser Asn Gln Met Ala Arg Thr Leu Arg Glu His Gly
1525 1530 1535
Val Gly Arg Asp Gln Ile Ile Ala Val Met Ile Asn Arg Ser His Glu
1540 1545 1550
Leu Ile Ile Ser Ile Leu Ala Val Leu Lys Ala Gly Gly Ala Tyr Leu
1555 1560 1565
Pro Ile Asp Pro Thr Tyr Pro Leu Asp Arg Ile Glu His Met Leu Glu
1570 1575 1580
Asp Ser Gln Thr Ala Met Leu Leu Thr Gln Lys Glu Ile Gln Ile Pro
1585 1590 1595 1600
Thr Gly Tyr Ser Gly Glu Val Leu Phe Val Asp Gln Ala Asp Ile Tyr
1605 1610 1615
His Glu Asp Ala Thr Asp Leu Ser Ser Met Asn Gln Pro Ala Asp Leu
1620 1625 1630
Ala Tyr Ile Ile Tyr Thr Ser Gly Ser Thr Gly Lys Ser Lys Gly Val
1635 1640 1645
Met Ile Glu His Arg Ser Leu His Asn Leu Ile His Ile Ser His Pro
1650 1655 1660
Tyr Lys Met Gly Ala Gly Ser Arg Val Leu Gln Phe Ala Ser Ser Ser
1665 1670 1675 1680
Phe Asp Ala Ser Val Ala Glu Ile Phe Pro Ala Leu Leu Thr Gly Ser
1685 1690 1695
Thr Leu Tyr Ile Glu Glu Lys Glu Glu Leu Leu Thr Asn Leu Val Pro
1700 1705 1710
Tyr Leu Leu Glu Asn Gln Ile Thr Thr Val Ala Leu Pro Pro Ser Leu
1715 1720 1725
Leu Arg Ser Val Pro Tyr Arg Glu Leu Pro Ala Leu Glu Cys Ile Val
1730 1735 1740
Ser Val Gly Glu Ala Cys Thr Phe Asp Ile Val Gln Thr Trp Gly Gln
1745 1750 1755 1760
Asn Arg Thr Phe Ile Asn Gly Tyr Gly Pro Thr Glu Ser Thr Val Cys
1765 1770 1775
Ser Thr Phe Gly Val Val Thr Ala Glu Asp Lys Arg Ile Thr Ile Gly
1780 1785 1790
Lys Pro Phe Pro Asn Gln Lys Ala Tyr Ile Ile Asn Glu Asn Gln Gln
1795 1800 1805
Leu Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Ile Ala Gly Ala Gly
1810 1815 1820
Leu Ser Arg Gly Tyr Leu Asn Arg Pro Glu Leu Thr Gln Glu Lys Phe
1825 1830 1835 1840
Val Asn Asn Pro Phe Ala Pro Gly Glu Arg Met Tyr Lys Thr Gly Asp
1845 1850 1855
Val Ala Arg Trp Leu Pro Asp Gly Asn Ile Glu Tyr Val Gly Arg Met
1860 1865 1870
Asp Asp Gln Val Lys Val Arg Gly Asn Arg Val Glu Leu Gly Glu Val
1875 1880 1885
Thr Ser Gln Leu Leu Thr His Pro Ser Ile Thr Glu Ala Val Val Val
1890 1895 1900
Pro Ile Val Asp Thr His Gly Ala Thr Thr Leu Cys Ala Tyr Phe Ile
1905 1910 1915 1920
Glu Asp Lys Glu Val Lys Val Asn Asp Leu Arg His His Leu Ala Lys
1925 1930 1935
Ala Leu Pro Glu Phe Met Ile Pro Ala Tyr Phe Ile Lys Val Asp His
1940 1945 1950
Ile Pro Leu Thr Gly Asn Gly Lys Val Asn Lys Gln Ala Leu Pro Asp
1955 1960 1965
Pro Ser Glu Phe Ile Ser Ala Gln Thr Gly His Glu Ile Val Ala Pro
1970 1975 1980
Ser Ser Gln Asp Glu Glu Ile Leu Val Gln Val Trp Glu Glu Val Leu
1985 1990 1995 2000
Gln Ile Lys Pro Ile Gly Val Glu Asp Asn Phe Phe Glu Arg Gly Gly
2005 2010 2015
Asp Ser Ile Lys Ala Leu Gln Ile Val Ala Arg Leu Ser Lys Tyr Asn
2020 2025 2030
Arg Lys Leu Asp Ser Arg His Ile Phe Lys Asn Pro Thr Ile Ser Met
2035 2040 2045
Leu Ala Pro Tyr Leu Glu Gln Arg Gly Ala Leu Ile Glu Gln Asp Ser
2050 2055 2060
Ile Glu Gly Glu Val Pro Leu Thr Pro Ile Gln Ser Trp Phe Phe Glu
2065 2070 2075 2080
Gln Pro Phe Val His Pro His His Phe Asn Gln Ser Met Leu Leu Pro
2085 2090 2095
Asn Glu Gln Gly Trp Asp Arg Gln Arg Ile Glu Gln Ala Phe Thr Thr
2100 2105 2110
Ile Val Lys His His Asp Ala Leu Arg Met Lys Tyr Gln Phe Arg Glu
2115 2120 2125
Lys Ile Ile Gln Glu Asn Gln Gly Ile Glu Gly Glu Phe Phe Thr Leu
2130 2135 2140
His Glu Val Asp Val Thr Lys Glu Arg Asp Trp Gln Met Arg Ile Glu
2145 2150 2155 2160
Arg Glu Ala Asn Gln Leu Gln Ala Ser Phe Asp Leu Thr Thr Gly Pro
2165 2170 2175
Leu Val Lys Leu Gly Leu Tyr His Thr Ala Tyr Gly Asp Tyr Leu Leu
2180 2185 2190
Ile Val Val His His Leu Leu Ile Asp Gly Val Ser Trp Arg Ile Leu
2195 2200 2205
Leu Glu Asp Phe Gln Thr Leu Tyr Glu Gln Lys Gly Glu Leu Pro Ala
2210 2215 2220
Lys Thr Thr Ser Phe Lys Ala Trp Ala Val Gln Leu Glu Gly Tyr Ala
2225 2230 2235 2240
Arg Ser Lys Lys Leu Gln Asp Glu Ala Ser Tyr Trp Lys Gly Leu Leu
2245 2250 2255
Asn Lys Ser Val Arg Glu Leu Pro Ala Asp Lys Glu Ser Ser Asp Thr
2260 2265 2270
Phe Leu Phe Gly Asp Thr Lys Glu Val Gln Leu Thr Phe Asp Ile Asn
2275 2280 2285
Glu Thr Gln Asp Leu Leu Thr Asp Ala His His Ala Tyr Lys Thr Lys
2290 2295 2300
Ala Asp Asp Leu Leu Leu Ala Ala Leu Val Leu Ser Ile Asn Glu Trp
2305 2310 2315 2320
Thr Lys Gln Ser Asp Ile Ile Val Asn Leu Glu Gly His Gly Arg Glu
2325 2330 2335
Thr Ile Val Glu Gly Ile Asp Leu Ser Arg Thr Ile Gly Trp Phe Thr
2340 2345 2350
Thr Ile Tyr Pro Val Leu Phe Glu Val Glu Asn His Gln Leu Ser Ser
2355 2360 2365
Val Ile Lys His Val Lys Glu Thr Leu Arg Asn Val Pro Asn Asn Gly
2370 2375 2380
Ile Cys Phe Gly Ile Leu Gln His Met Ser His Phe Asp Val Ser Gln
2385 2390 2395 2400
Ser Gln Leu Ser Ser His His Ile Ser Phe Asn Tyr Leu Gly Gln Met
2405 2410 2415
Gly Glu Asp Ser Ala Ser Gln Ser Glu Thr Asp Asn Gly Val Leu Ile
2420 2425 2430
Asn Thr Gly Asp Gln Ile Ser Pro Met Asn Ala Asn Pro Gly Ser Leu
2435 2440 2445
Asn Met Thr Cys Leu Val Met Asn Asn Thr Leu Leu Val Thr Phe Asp
2450 2455 2460
Tyr Asn Pro Gln Arg Tyr Glu Gln Glu Thr Ile Gln Arg Leu Ala Asp
2465 2470 2475 2480
Arg Tyr Lys Ser Asn Leu Lys Ala Val Leu Asp His Cys Val Gln Arg
2485 2490 2495
Glu Gln Thr Glu Arg Thr Pro Ser Asp Phe Ser Thr Lys Lys Leu Ser
2500 2505 2510
Leu Glu Asp Leu Asp Asp Val Phe Ala Thr Leu Lys Asn Leu
2515 2520 2525
<210> 55
<211> 2541
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 55
Val Gln Lys Lys Asp Lys Ile Lys Asp Ile Tyr Ser Leu Ser Pro Leu
1 5 10 15
Gln Lys Gly Met Leu Phe His Ser Met Lys Asp Pro Gln Ser Asp Ala
20 25 30
Tyr Phe Glu Gln Val Thr Leu Leu Leu Glu Gly Val Val Asn Pro Thr
35 40 45
Tyr Leu Ala Glu Ser Ile Gln Gly Leu Val Gln Lys Tyr Asp Met Phe
50 55 60
Arg Ser Val Phe Arg Tyr Lys Lys Val Asp Pro Val Gln Val Val Leu
65 70 75 80
Ser Glu Arg Lys Ile Asp Leu Gln Ile Glu Asp Leu Thr Gln Ile Asn
85 90 95
Glu Glu Glu Gln Arg Lys Phe Ile Glu Glu Tyr Arg Lys Lys Asp Arg
100 105 110
Glu Arg Gly Phe Asp Leu Ser Arg Asp Ile Leu Leu Arg Phe Thr Leu
115 120 125
Phe Gln Thr Ala Ala Asn Arg Tyr Glu Leu Leu Trp Ser His His His
130 135 140
Ile Leu Met Asp Gly Trp Cys Thr Gly Ile Val Phe Gln Asp Leu Phe
145 150 155 160
Gln Met Tyr Gln Arg Arg Leu Ser Gly Gln Ala Leu Leu Pro Glu Val
165 170 175
Ala Pro Gln Tyr Ser Glu Tyr Ile Arg Trp Leu Lys Lys Gln Asp Asp
180 185 190
Gln Gln Ala Leu Ala Phe Trp Lys Glu Tyr Leu Gln Gly Phe Glu Asn
195 200 205
Leu Thr Gly Ile Pro Arg Leu Arg Ser Gly Asn His Pro Tyr Lys Gln
210 215 220
Glu Glu Phe Ile Phe Ser Leu Gly Glu Glu Ala Thr Gln Lys Leu Thr
225 230 235 240
Gln Thr Ala Gln Lys Tyr Gln Val Thr Leu Asn Thr Val Val Gln Thr
245 250 255
Ile Trp Gly Ala Leu Leu Gln Lys Tyr Asn Asn Thr Asn Asp Ala Ala
260 265 270
Tyr Gly Val Val Val Ser Gly Arg Pro Ala Glu Val Pro Asn Val Glu
275 280 285
Gln Met Val Gly Leu Phe Ser Asn Thr Ile Pro Ile Arg Ile Lys Lys
290 295 300
Glu Ala Gly Lys Thr Phe Gly Glu Val Leu Lys Asn Val Gln Gln Thr
305 310 315 320
Ala Leu Glu Ala Glu Lys Tyr Gly Tyr Leu Ser Leu Ala Asp Ile Gln
325 330 335
Ala Ser Ala Ala Tyr Thr His Gln Leu Leu Asp His Ile Leu Ala Phe
340 345 350
Glu Asn Phe Pro Met Asp Gln Glu Thr Phe Asn Gln Glu Asn Val Leu
355 360 365
Gly Phe Ala Val Lys Asp Ala His Thr Phe Glu Gln Thr His Tyr Asp
370 375 380
Leu Thr Val Leu Val Ile Pro Gly Lys Glu Leu Ile Phe Lys Phe Met
385 390 395 400
Tyr Asn Glu Ser Val Tyr Ser Lys Glu Tyr Leu Asn Leu Leu Glu Leu
405 410 415
Asn Met Lys Lys Leu Val Ser Leu Val Ile Glu Gln Gln Asp Ile Phe
420 425 430
Asp Pro Ala Thr Glu Phe Val Ser Asp Leu Glu Lys Asp Lys Leu Leu
435 440 445
Thr Ile Phe Asn Arg Thr Asp Ala Lys Tyr Pro Arg Glu Lys Thr Ile
450 455 460
His Glu Leu Phe Gln Glu Gln Val Asp Lys Asn Pro Asp Gln Val Ala
465 470 475 480
Leu Val Phe Gly Glu Ala Gln Leu Thr Tyr Arg Glu Leu Asn Glu Lys
485 490 495
Ala Asn Gln Met Ala Arg Gly Leu Arg Lys Gln Gly Val Leu Pro Asp
500 505 510
Gln Val Ile Gly Leu Leu Thr Asp Arg Ser Leu Glu Met Ile Ile Ala
515 520 525
Ile Leu Ala Ile Phe Lys Ala Gly Gly Ala Tyr Met Pro Ile Asp Pro
530 535 540
Ser Tyr Pro Ser Glu Arg Ile Gln Tyr Met Leu Ala Asp Ser Arg Thr
545 550 555 560
His Leu Leu Leu Val Gln Lys Ala Glu Met Ile Pro Ala Asn Tyr Gln
565 570 575
Gly Glu Val Leu Leu Leu Thr Glu Asp Ser Trp Met Asp Glu Asn Thr
580 585 590
Asp Asn Leu Asp Leu Val Asn Gln Ala Gln Asp Leu Ala Tyr Val Met
595 600 605
Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Asn Leu Thr Thr His
610 615 620
Gln Asn Ile Val Lys Thr Ile Met Asn Asn Gly Tyr Met Glu Ile Thr
625 630 635 640
Pro Asn Asp Arg Leu Leu Gln Leu Ser Asn Tyr Ala Phe Asp Gly Ser
645 650 655
Thr Phe Asp Ile Tyr Ser Ala Leu Leu Asn Gly Ala Ser Leu Ile Leu
660 665 670
Val Pro Thr His Val Leu Met Asn Pro Thr Asp Leu Ala Ser Val Ile
675 680 685
Gln Asp Gln His Ile Thr Val Ser Phe Met Thr Thr Ser Leu Phe Asn
690 695 700
Thr Leu Val Glu Leu Asp Val Thr Ser Leu Lys His Met Arg Lys Val
705 710 715 720
Val Phe Gly Gly Glu Lys Ala Ser Ile Lys His Val Glu Lys Ala Leu
725 730 735
Asp Tyr Leu Gly Ala Gly Arg Leu Val Asn Gly Tyr Gly Pro Thr Glu
740 745 750
Thr Thr Val Phe Ala Thr Thr Tyr Thr Val Asp His Thr Ile Lys Glu
755 760 765
Thr Gly Ile Met Pro Ile Gly Arg Pro Leu Asn Asn Thr Lys Val Phe
770 775 780
Ile Leu Gly Ala Asp Asn Gln Leu Gln Pro Ile Gly Ala Leu Gly Glu
785 790 795 800
Leu Cys Val Ser Gly Glu Gly Leu Ala Arg Gly Tyr Leu Asn Leu Pro
805 810 815
Glu Leu Thr Ala Asp Arg Phe Val Glu Asn Pro Phe Met Gln Gly Glu
820 825 830
Arg Met Tyr Arg Thr Gly Asp Leu Ala Arg Trp Leu Pro Asp Gly Ser
835 840 845
Ile Glu Tyr Val Gly Arg Ile Asp Glu Gln Val Lys Ile Arg Gly His
850 855 860
Arg Ile Glu Leu Gly Glu Ile Glu Ala Arg Leu Leu Glu His Pro Ala
865 870 875 880
Ile Ser Glu Thr Val Leu Leu Ala Lys Gln Asp Glu Gln Gly His Ser
885 890 895
Phe Leu Cys Ala Tyr Leu Val Thr Asn Gly Ala Trp Ser Val Ala Glu
900 905 910
Leu Arg Lys His Ile Lys Glu Thr Leu Pro Asp Ser Met Val Pro Ser
915 920 925
Tyr Phe Ile Glu Ile Asp Lys Met Pro Leu Thr Ser Asn Gly Lys Ala
930 935 940
Asp Lys Arg Ala Leu Pro Glu Pro Asp Val Gln Gln Val Ser Ser Tyr
945 950 955 960
Ile Ala Pro Glu Thr Glu Thr Glu Glu Lys Leu Val Gln Leu Phe Gln
965 970 975
Glu Ile Leu Ser Val Glu Gln Val Gly Thr Gln Asp Asn Phe Phe Glu
980 985 990
Leu Gly Gly His Ser Leu Lys Ala Met Met Leu Val Ser Arg Met His
995 1000 1005
Lys Glu Leu Asp Ile Glu Val Pro Leu Lys Asp Val Phe Ala Arg Pro
1010 1015 1020
Ser Val Lys Glu Leu Ala Ala Phe Leu Thr Asn Thr Glu Val Ser Asp
1025 1030 1035 1040
Tyr Ile Ala Ile Glu Pro Ala Ala Lys Gln Glu Phe Tyr Pro Val Ser
1045 1050 1055
Ser Ala Gln Arg Arg Met Tyr Val Val Glu Gln Ile Gly Ser Ser Asn
1060 1065 1070
Thr Thr Ser Tyr Asn Met Pro Phe Leu Leu Glu Ile Gly Gly Ala Leu
1075 1080 1085
Asp Val Val Gly Leu Gln Lys Ala Leu Lys Lys Leu Val Ile Arg His
1090 1095 1100
Glu Ser Leu Arg Thr Ser Phe His Met Val Asp Glu Val Leu Met Gln
1105 1110 1115 1120
Lys Ile His Pro Asp Val Glu Trp Asp Leu Met Val Met Glu Ala Lys
1125 1130 1135
Asp Glu Asp Leu Pro Gln Ile Ile Asp Gly Phe Ile Gln Pro Phe Asp
1140 1145 1150
Leu Ser Asp Ala Ser Leu Phe Arg Ala Gly Leu Val Arg Met Glu Ala
1155 1160 1165
Asp Arg His Leu Leu Met Leu Asp Met His His Ile Ile Ser Asp Gly
1170 1175 1180
Val Ser Thr Asn Val Leu Phe Gln Asp Leu Met Gln Ile Tyr Gln Gly
1185 1190 1195 1200
Lys Glu Leu Pro Ser Leu Arg Ile Gln Tyr Lys Asp Tyr Ala Val Trp
1205 1210 1215
Gln Gln Ala Glu Ala Gln Val Asn Arg Leu Arg Glu Gln Glu Gln Tyr
1220 1225 1230
Trp Leu Asn Gln Phe Ser Gly Glu Leu Pro Val Leu Glu Met Pro Thr
1235 1240 1245
Asp Tyr Thr Arg Pro Ser Ile Gln Gln Ser Glu Gly Asp Ile Trp Ser
1250 1255 1260
Phe Glu Ile Ser Ala Glu Ile Ile Asn Lys Val Lys Lys Leu Ser Ser
1265 1270 1275 1280
Ser Gln Gly Thr Thr Leu Tyr Met Thr Leu Leu Ala Ala Tyr Gln Val
1285 1290 1295
Leu Leu Ser Lys Tyr Thr Gly Gln Glu Asp Val Ile Val Gly Ser Pro
1300 1305 1310
Ile Ala Gly Arg Pro His Ala Asp Val Glu Lys Ile Val Gly Met Phe
1315 1320 1325
Val Asn Thr Leu Ala Phe Arg Gly Gln Pro Lys Ser Thr Gln Thr Phe
1330 1335 1340
Ser Thr Tyr Leu Ser Glu Val Lys Glu Gln Val Leu His Ala Tyr Asp
1345 1350 1355 1360
Asn Ala Glu Tyr Pro Phe Glu Glu Leu Leu Glu Lys Leu Asp Leu Glu
1365 1370 1375
Arg Asp Leu Ser Arg His Pro Leu Phe Asp Thr Met Phe Ala Leu Gln
1380 1385 1390
Asn Met Glu Met Ala Glu Ile Asn Ile Met Asp Leu Ser Phe Gln Pro
1395 1400 1405
Arg Asp Leu Thr Trp Lys Asn Ala Lys Phe Asp Leu Thr Trp Met Met
1410 1415 1420
Ala Glu Ala Glu Asn Leu Tyr Val Thr Ile Glu Tyr Ser Thr Ser Leu
1425 1430 1435 1440
Phe Lys Pro Glu Thr Ile Glu Arg Leu Gly Lys Arg Phe Thr His Leu
1445 1450 1455
Leu Lys Gln Ile Gly Asp Ala Pro Glu Arg Leu Ile Ala Asp Leu Glu
1460 1465 1470
Val Ala Thr Glu Asp Glu Lys His Gln Ile Leu Ser Val Phe Asn Leu
1475 1480 1485
Thr Gln Ser Asp Tyr Pro Val Asn Lys Thr Val His Gln Leu Phe Glu
1490 1495 1500
Glu Gln Val Gln Asn Met Pro Asp Gln Lys Ala Ile Val Phe Gly Glu
1505 1510 1515 1520
Glu Gln Val Thr Tyr Lys Glu Leu Asn Ala Lys Ala Asn His Leu Ala
1525 1530 1535
Thr Leu Leu Lys Gln Lys Gly Ile Thr Asn Glu Gln Leu Val Ala Val
1540 1545 1550
Met Ile Glu Pro Ser Ile Glu Phe Phe Val Gly Ile Leu Ala Val Leu
1555 1560 1565
Lys Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Thr Tyr Pro Ala Glu
1570 1575 1580
Arg Ile Ala Tyr Ile Leu Glu Asp Ser Gln Ser Lys Val Leu Leu Val
1585 1590 1595 1600
Arg Gly His Glu Gln Val Gln Thr Gln Phe Ala Gly Glu Ile Leu Glu
1605 1610 1615
Thr Asp Ser Lys Lys Leu Ser Thr Glu Glu Leu Lys Asp Val Pro Met
1620 1625 1630
Asn Asn Lys Val Thr Asp Leu Ala Tyr Val Ile Tyr Thr Ser Gly Ser
1635 1640 1645
Thr Gly Gln Pro Lys Gly Val Met Val Glu His Arg Ser Leu Met Asn
1650 1655 1660
Leu Ser Ala Trp His Val Gln Tyr Phe Gly Ile Thr Lys Asp Asp Arg
1665 1670 1675 1680
Ser Ile Lys Tyr Ala Gly Val Gly Phe Asp Ala Ser Val Trp Glu Val
1685 1690 1695
Phe Pro Tyr Leu Ile Ala Gly Ala Thr Ile Tyr Val Ile Asp Gln Glu
1700 1705 1710
Thr Arg Tyr Asp Val Glu Lys Leu Asn Gln Tyr Val Thr Asp Gln Gly
1715 1720 1725
Ile Thr Ile Ser Phe Leu Pro Thr Gln Phe Ala Glu Gln Phe Met Leu
1730 1735 1740
Thr Asp His Thr Asp His Thr Ala Leu Arg Trp Leu Leu Ile Gly Gly
1745 1750 1755 1760
Asp Lys Ala Gln Gln Ala Val Gln Gln Lys Lys Tyr Gln Ile Val Asn
1765 1770 1775
Asn Tyr Gly Pro Thr Glu Asn Thr Val Val Thr Thr Ser Tyr Ile Val
1780 1785 1790
Ser Pro Glu Asp Lys Lys Ile Pro Ile Gly Arg Pro Ile Ala Asn Asn
1795 1800 1805
Gln Val Phe Ile Leu Asn Lys Glu Asn Gln Leu Gln Pro Val Gly Ile
1810 1815 1820
Pro Gly Glu Leu Cys Val Ser Gly Asp Ser Leu Ala Arg Gly Tyr Leu
1825 1830 1835 1840
His Arg Pro Glu Leu Thr Ser Glu Arg Phe Val Ala Asn Pro Phe Val
1845 1850 1855
Pro Gly Glu Arg Met Tyr Lys Thr Gly Asp Ile Ala Arg Trp Leu Pro
1860 1865 1870
Asp Gly Asn Ile Glu Tyr Leu Gly Arg Leu Asp Asp Gln Ile Lys Ile
1875 1880 1885
Arg Gly Tyr Arg Val Glu Leu Gly Glu Ile Glu Ser Ala Ile Leu Glu
1890 1895 1900
His Glu Ala Ile Gln Glu Thr Val Val Leu Ala Arg Gln Asp Asp Gln
1905 1910 1915 1920
Asn Gln Thr Tyr Leu Cys Ala Tyr Val Val Pro Lys Lys Ser Phe Asp
1925 1930 1935
Val Ala Glu Leu Arg Gln Tyr Leu Gly Arg Lys Leu Pro His Phe Met
1940 1945 1950
Ile Pro Ala Phe Phe Thr Glu Met Thr Glu Phe Pro Ile Thr Ser Asn
1955 1960 1965
Gly Lys Val Asp Lys Lys Ala Leu Pro Leu Pro Asp Leu Ser Lys Gln
1970 1975 1980
Ser Glu Ile Asp Tyr Val Ala Pro Thr Thr Thr Leu Glu Glu Thr Leu
1985 1990 1995 2000
Ala Glu Leu Trp Thr Glu Val Leu Gly Val Ser Gln Val Gly Ile His
2005 2010 2015
Asp Asn Phe Phe Lys Leu Gly Gly Asp Ser Ile Lys Ala Ile Gln Ile
2020 2025 2030
Ala Ala Arg Leu Asn Thr Lys Gln Leu Lys Leu Glu Val Lys Asp Leu
2035 2040 2045
Phe Gln Ala Gln Thr Ile Ala Gln Val Ile Pro Tyr Ile Lys Thr Lys
2050 2055 2060
Asp Ser Lys Ala Glu Gln Gly Ile Val Gln Gly Lys Val Glu Leu Thr
2065 2070 2075 2080
Pro Ile Gln Glu Trp Phe Phe Gln Gln Ser Phe Asp Ile Pro His His
2085 2090 2095
Trp Asn Gln Ser Met Met Phe Tyr Arg Lys Glu Gly Trp Asp Gln His
2100 2105 2110
Val Val Gln Arg Val Phe Gln Lys Ile Ala Glu His His Asp Ala Leu
2115 2120 2125
Arg Met Ala Tyr Gln Gln Glu Asn Gly Lys Thr Ile Gln Ile Asn Arg
2130 2135 2140
Gly Val Glu Gly Lys Leu Phe Glu Leu Ser Ile Phe Asp Phe Arg Gln
2145 2150 2155 2160
Gln Ala Asn Val Pro Glu Leu Ile Glu Gln Ala Ala Asn Arg Leu Gln
2165 2170 2175
Ser Ala Met Asn Leu Gln Asp Gly Pro Leu Val Gln Leu Gly Leu Phe
2180 2185 2190
Gln Thr Ser Glu Gly Asp His Leu Leu Ile Ala Ile His His Leu Val
2195 2200 2205
Val Asp Ala Val Ser Trp Arg Ile Ile Thr Glu Asp Phe Met Asn Gly
2210 2215 2220
Tyr Gln Gln Asp Leu Gln Gly Glu Pro Ile Ala Phe Thr Ser Lys Thr
2225 2230 2235 2240
Asp Ser Tyr Gln Lys Trp Ala Lys Ser Leu Leu Glu Tyr Ala Thr Ser
2245 2250 2255
Glu Glu Ile Gln Ser Glu Leu Lys Tyr Trp Gln Ser Met Ile Ala Lys
2260 2265 2270
Gly Leu Pro Ala Leu Pro Arg Asp Ser Lys Gly Gly Ala Pro Tyr Leu
2275 2280 2285
Leu Lys Asp Ile Gln Glu Val Ala Ile Gln Leu Thr Lys Glu Gln Thr
2290 2295 2300
Asn Lys Leu Leu Thr Asp Ala His Asn Ala Tyr Asn Thr Gln Ile Asn
2305 2310 2315 2320
Asp Leu Leu Leu Thr Ala Leu Ala Leu Thr Ile Gln Glu Trp Ala Gln
2325 2330 2335
Thr Asn Ser Ile Ala Ile Thr Leu Glu Gly His Gly Arg Glu Asp Ile
2340 2345 2350
Gly Val Asp Ile Asp Ile Asn Arg Thr Val Gly Trp Phe Thr Ser Met
2355 2360 2365
Tyr Pro Val Val Phe Asp Leu Gln Lys Gln Gly Ile Ala Asn Thr Val
2370 2375 2380
Lys Gln Val Lys Glu Glu Leu Arg Gln Ile Pro Asn Lys Gly Ile Gly
2385 2390 2395 2400
Tyr Gly Val Val Arg Tyr Leu Ser Asn Gln Gly Ser Thr Glu Leu Asp
2405 2410 2415
Ile Ser Ser His Ala Ile Asn Pro Glu Ile Ser Phe Asn Tyr Leu Gly
2420 2425 2430
Gln Met Asp Gln Ser Gly Gln Glu Glu Glu Tyr Gln Leu Ser Pro Leu
2435 2440 2445
Ser Ser Gly Gln Gln Ile Ser Gln Met Asn Gln Gly Leu Phe Pro Ile
2450 2455 2460
Asn Val Ser Gly Ile Val Val Glu Asn Gln Leu Ser Ile Gln Ile Ser
2465 2470 2475 2480
Tyr Asp Ser Gln Ala Tyr His Asp Ser Thr Met Glu Lys Leu Ile Gln
2485 2490 2495
Arg Tyr Gln Tyr His Leu Leu Glu Ile Ile Asn His Cys Val Gln Gln
2500 2505 2510
Thr Glu Thr Glu Leu Thr Pro Ser Asp Phe Ser Thr Lys Glu Leu Ser
2515 2520 2525
Met Glu Asp Leu Glu Ser Val Phe Glu Leu Leu Asp Glu
2530 2535 2540
<210> 56
<211> 4617
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 56
Met Phe Ser Arg Ser Asn Val Gln Asn Leu Tyr Arg Leu Ser Pro Met
1 5 10 15
Gln Lys Gly Ile Leu Phe His Ser Leu Lys Asp Lys Glu Asn His Ala
20 25 30
Tyr Phe Asp Gln Leu Ile Phe Thr Leu Glu Gly Lys Val Glu Leu Glu
35 40 45
Tyr Leu Glu Glu Ala Phe Asn Gln Leu Ile Lys Lys His Asp Ile Leu
50 55 60
Arg Thr Val Phe Arg Tyr Lys Lys Val Lys Glu Pro Val Gln Met Val
65 70 75 80
Leu Lys Glu Arg Ser Ser Thr Ile Tyr Phe Glu Asp Ile Ser His Leu
85 90 95
Glu Pro Glu Glu Lys Val Asn Tyr Ile Lys Gln Phe Lys Met Arg Asp
100 105 110
Arg Glu Lys Gly Phe Asp Leu Ser Arg Asp Leu Leu Ile Arg Met Ser
115 120 125
Leu Phe Lys Leu Asp Gln Glu Gln Tyr Gln Leu Ile Met Ser Asn His
130 135 140
His Ile Ile Met Asp Gly Trp Cys Leu Gly Ile Ile Leu Thr Asp Phe
145 150 155 160
Leu Arg Met Tyr Lys Gly Ile Val Asn His Thr Pro Val Pro Tyr Glu
165 170 175
His Val Thr Pro Tyr Ser Lys His Ile Gln Trp Leu Glu Lys Gln Asp
180 185 190
His Gln Glu Ala Lys Asp Phe Tyr Gln Gln Leu Leu Glu Gly Tyr Asp
195 200 205
Lys Val Thr Gly Val Pro Gln Gln Leu Val Arg Ala Asn His Glu Glu
210 215 220
Tyr Thr His Gly Gln Cys Ile Val Lys Leu Asn Gln Glu Thr Ala Asp
225 230 235 240
Arg Leu Ile Ala Ile Ala Lys Thr Tyr Gln Val Thr Val Asn Thr Val
245 250 255
Phe Gln Thr Ile Trp Gly Ile Leu Leu Gln Lys Tyr Asn Asn Thr Asp
260 265 270
Asp Val Val Phe Gly Ser Val Val Ser Gly Arg Pro Ala Glu Ile Pro
275 280 285
Asp Val Glu Lys Met Val Gly Leu Phe Ile Asn Thr Ile Pro Val Arg
290 295 300
Ile Lys Ala Asp Gln Gln Glu Arg Phe Asp Thr Leu Val Ala Lys Val
305 310 315 320
Gln Glu Met Ala Leu Ala Ser Glu Ser Tyr Asp Tyr Leu Ser Leu Ala
325 330 335
Asp Ile His Pro Glu Ala Gly Asp Phe Ile Asn His Ile Ile Ala Phe
340 345 350
Glu Asn Phe Tyr Ile Asp Met Asp Ser Phe Asn Gln Leu Ala Asp Lys
355 360 365
Lys Glu Leu Gly Phe Ser Leu Ala Phe Ala Thr Asp His His Glu Gln
370 375 380
Thr Asn Tyr Asp Leu Ser Val Gln Ala Gln Ile Gly Asp Glu Ser Ser
385 390 395 400
Ile Lys Ile Leu Tyr Asn Ser Lys Leu Tyr Thr Ser Glu Tyr Ile Ala
405 410 415
Asn Val Ile Asp His Phe Val Thr Val Ala Asp Ile Val Ala Ala Asp
420 425 430
Pro Ser Ile Pro Val Lys Glu Ile Asp Ile Leu Thr Lys Asp Lys Lys
435 440 445
Asp Gln Ile Leu Tyr Gly Phe Asn Asn Thr Tyr Ala Asp Tyr Pro Arg
450 455 460
Glu Lys Thr Ile His Gln Leu Phe Glu Glu Gln Val Asp Lys Asn Pro
465 470 475 480
Asn Gln Ile Ala Leu Val Phe Lys Glu Glu Lys Leu Thr Tyr Gly Glu
485 490 495
Val Asn Ala Lys Ala Asn Gln Leu Ala Tyr Val Leu Ser Lys Gln Gly
500 505 510
Val Gln Pro Asn Asp Val Ile Gly Ile Ile Thr Glu Arg Ser Pro Glu
515 520 525
Met Ile Ile Gly Ile Leu Ala Ile Phe Lys Ala Gly Gly Ala Tyr Met
530 535 540
Pro Ile Asp Pro Thr Tyr Pro Ala Glu Arg Ile Glu Tyr Met Leu Gln
545 550 555 560
Asp Asn Gln Thr Lys Leu Leu Leu Val Gln Lys Gln Glu Met Ile Pro
565 570 575
Ala Asn Tyr Gln Gly Glu Val Leu Phe Leu Thr Gln Glu Ser Trp Met
580 585 590
His Glu Glu Thr Ser Asn Pro Ala His Ile Thr Gln Ala Gln Ala Leu
595 600 605
Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr Gly Glu Pro Lys Gly Ile
610 615 620
Leu Thr Thr His Gln Asn Ile Met Lys Thr Val Ile His Asn Gly Tyr
625 630 635 640
Val Glu Ile Thr Pro Gly Asp Cys Leu Ser Gln Leu Ser Asn Tyr Ala
645 650 655
Phe Asp Gly Ser Thr Phe Glu Ile Tyr Gly Ala Leu Leu His Gly Ala
660 665 670
Thr Leu Leu Leu Val Thr Lys Glu Ala Val Leu Asn Met Asn Glu Leu
675 680 685
Ala Arg Leu Ile Lys Lys Glu Gln Val Thr Val Ser Phe Met Thr Thr
690 695 700
Ala Leu Phe Asn Thr Leu Val Asp Leu Asp Ile Thr Cys Phe Gln Ser
705 710 715 720
Val Arg Lys Val Leu Phe Gly Gly Glu Leu Ala Ser Val Lys His Val
725 730 735
Leu Lys Ala Leu Asp Tyr Leu Gly Glu His Arg Val Ile Asn Val Tyr
740 745 750
Gly Pro Thr Glu Thr Thr Val Tyr Ala Thr Tyr Tyr Ser Val Asp His
755 760 765
Ser Met Leu Thr Arg Ala Ser Val Pro Ile Gly Arg Pro Ile Asn Asn
770 775 780
Thr Lys Ala Tyr Ile Leu Asn Thr Asp Gly Gln Pro Gln Pro Ile Gly
785 790 795 800
Val Val Gly Glu Leu Cys Ile Gly Gly Glu Gly Val Ala Cys Gly Tyr
805 810 815
Leu Asn Arg Pro Glu Leu Thr Lys Lys His Phe Val Asp Asn Pro Phe
820 825 830
Val Leu Gly Glu Arg Met Tyr Cys Thr Gly Asp Leu Ala Arg Phe Leu
835 840 845
Pro Asp Gly Asn Ile Glu Tyr Ile Gly Arg Met Asp Glu Gln Val Lys
850 855 860
Ile Arg Gly His Arg Ile Glu Leu Gly Glu Ile Glu Lys Val Leu Leu
865 870 875 880
Gln His Pro Ala Ile Ser Glu Thr Val Leu Leu Ala Lys Arg Asp Glu
885 890 895
Gln Gly His Ser Tyr Leu Cys Ala Tyr Ile Val Gly Gln Ala Phe Trp
900 905 910
Thr Val Thr Glu Leu Arg Gln His Leu Met Glu Ser Leu Pro Glu Tyr
915 920 925
Met Val Pro Ser Tyr Phe Ile Glu Ile Glu Lys Leu Pro Leu Thr Ala
930 935 940
Asn Gly Lys Val Asp Lys Arg Ala Leu Pro Glu Pro Asp Arg Lys Met
945 950 955 960
Gly Ser Ala Tyr Val Ala Pro Glu Asn Glu Thr Glu Glu Lys Leu Val
965 970 975
Gln Phe Phe Gln Glu Ile Leu Gly Val Glu Arg Val Gly Thr Gln Asp
980 985 990
Thr Phe Phe Glu Leu Gly Gly His Ser Leu Lys Ala Met Met Leu Val
995 1000 1005
Leu Gln Ile His Lys Glu Met Gly Ile Glu Val Pro Leu Lys Glu Ile
1010 1015 1020
Phe Thr Arg Pro Thr Ile Lys Glu Leu Ala Ala Tyr Ile His Lys Met
1025 1030 1035 1040
Asp Arg Ser Ala Tyr Ser Met Ile Glu Pro Thr Ala Lys Gln Glu Tyr
1045 1050 1055
Tyr Pro Val Ser Phe Ala Gln Arg Arg Met Phe Val Val Gln Gln Ile
1060 1065 1070
Arg Asp Thr Asn Thr Thr Ser Tyr Asn Met Pro Ile Leu Leu Glu Ile
1075 1080 1085
Glu Gly Ala Leu Asp Arg Glu Asn Val Arg Gln Thr Leu Lys Lys Leu
1090 1095 1100
Ile Glu Arg His Glu Ser Met Arg Thr Ser Phe His Met Ile Asp Glu
1105 1110 1115 1120
Thr Leu Leu Gln Lys Val His Asp Asp Val Thr Trp Glu Met Glu Glu
1125 1130 1135
Met Glu Ala Ser Glu Glu Glu Val Tyr Ala Leu Thr Lys Ser Phe Ile
1140 1145 1150
Arg Pro Phe Asp Leu Gly Gln Ala Pro Leu Phe Arg Ala Gly Leu Ile
1155 1160 1165
Arg Val Asn Ser Glu Arg His Leu Leu Leu Leu Asp Thr His His Ile
1170 1175 1180
Ile Ser Asp Gly Val Ser Thr Asn Ile Leu Phe Gln Asp Phe Thr Gln
1185 1190 1195 1200
Leu Tyr Arg Gly Arg Glu Leu Pro Ala Leu Arg Ile Gln Tyr Lys Asp
1205 1210 1215
Phe Ala Val Trp Gln Gln Gly Glu Ala Gln Leu Ala Arg Leu Gln Glu
1220 1225 1230
Gln Glu Glu Tyr Trp Leu Lys Gln Phe Ser Glu Ser Val Pro Val Leu
1235 1240 1245
Glu Leu Pro Thr Asp Phe Pro Arg Pro Ala Met Gln Gln Phe Asp Gly
1250 1255 1260
Asp Val Leu Asp Phe Ala Leu Asn Arg Gln Val Trp Gln Glu Leu Gln
1265 1270 1275 1280
Gln Leu Ile Val Lys Glu Gly Cys Thr Ala Tyr Met Ile Leu Leu Ala
1285 1290 1295
Ala Tyr His Val Leu Leu Ser Lys Tyr Ser Ser Gln Asn Asp Ile Val
1300 1305 1310
Ile Gly Ser Pro Ile Ala Gly Arg Thr Asn Ala Asp Leu Gln Ser Ile
1315 1320 1325
Val Gly Met Phe Val Asn Thr Leu Ala Ile Arg Thr Lys Ser Glu Gly
1330 1335 1340
Thr Gln Thr Phe Arg Glu Phe Leu Ser Thr Ile Lys Gln Leu Val Leu
1345 1350 1355 1360
Gln Ala Gln Ser Asn Ala Glu Tyr Pro Phe Glu Glu Leu Val Asp Lys
1365 1370 1375
Val Asn Pro Ser Arg Asp Leu Ser Arg Gln Pro Leu Phe Asp Thr Ile
1380 1385 1390
Phe Val Met Gln Asn Met Asp Ile Thr Glu Val Ala Ile Gln Gly Leu
1395 1400 1405
Ser Ile Val Thr Lys Asp Met Glu Trp Lys His Ser Lys Phe Asp Leu
1410 1415 1420
Thr Trp Ala Ala Val Glu Lys Glu Ser Leu His Phe Ser Val Glu Tyr
1425 1430 1435 1440
Ser Thr Arg Leu Phe Lys Gln Glu Thr Ile Glu Arg Met Ala Lys His
1445 1450 1455
Phe Ala His Leu Leu Asn Gln Val Ala Glu Asn Pro Asp Leu Ser Leu
1460 1465 1470
Ser Asp Met Glu Leu Ala Thr Asp Glu Glu Val Tyr Gln Leu Leu Glu
1475 1480 1485
Glu Phe Asn Asn Thr Glu Ala Asp Tyr Pro Ser Asp Lys Thr Ile His
1490 1495 1500
Gln Gln Phe Glu Gln Lys Val Glu Glu Asn Pro Asp Gln Ile Ala Leu
1505 1510 1515 1520
Leu Phe Lys Asp Lys Glu Ile Thr Tyr Gly Gln Leu Asn Ala Lys Ala
1525 1530 1535
Asn Gln Phe Ala Arg Val Leu Arg Lys His Gly Val Gln Pro Asp Gln
1540 1545 1550
Val Val Gly Leu Ile Thr Asp Arg Ser Ile Glu Met Met Ile Gly Ile
1555 1560 1565
Leu Ala Ile Leu Lys Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Ser
1570 1575 1580
Tyr Pro Val Glu Arg Ile Thr Tyr Met Leu Glu Asp Ser Gln Ala Gln
1585 1590 1595 1600
Leu Leu Ile Val Gln Glu Ala Ala Met Ile Pro Glu Gly Tyr Gln Gly
1605 1610 1615
Lys Val Leu Leu Leu Val Glu Glu Cys Trp Ile Gln Glu Glu Ala Ser
1620 1625 1630
Asn Leu Glu Leu Ile Asn Gly Ala Gln Asp Leu Ala Tyr Val Met Tyr
1635 1640 1645
Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Asn Leu Thr Thr His Gln
1650 1655 1660
Asn Ile Leu Arg Thr Ile Ile Asn Asn Gly Phe Ile Glu Ile Val Pro
1665 1670 1675 1680
Ala Asp Arg Leu Leu Gln Leu Ser Asn Tyr Ala Phe Asp Gly Ser Thr
1685 1690 1695
Phe Asp Ile Tyr Ser Ala Leu Leu Asn Gly Ala Thr Leu Val Leu Val
1700 1705 1710
Pro Lys Glu Val Met Leu Asn Pro Met Glu Leu Ala Lys Ile Val Arg
1715 1720 1725
Glu Gln Asp Ile Thr Val Ser Phe Met Thr Thr Ser Leu Phe His Thr
1730 1735 1740
Leu Val Glu Leu Asp Val Thr Ser Met Lys Ser Met Arg Lys Val Val
1745 1750 1755 1760
Phe Gly Gly Glu Lys Ala Ser Tyr Lys His Val Glu Lys Ala Leu Asp
1765 1770 1775
Tyr Leu Gly Glu Gly Arg Leu Val Asn Gly Tyr Gly Pro Thr Glu Thr
1780 1785 1790
Thr Val Phe Ala Thr Thr Tyr Thr Val Asp Ser Ser Ile Lys Glu Thr
1795 1800 1805
Gly Ile Val Pro Ile Gly Arg Pro Leu Asn Asn Thr Ser Val Tyr Ile
1810 1815 1820
Leu Asn Glu Asn Asn Gln Pro Gln Pro Ile Gly Val Pro Gly Glu Leu
1825 1830 1835 1840
Cys Val Gly Gly Ala Gly Ile Ala Arg Gly Tyr Leu Asn Arg Pro Glu
1845 1850 1855
Leu Thr Ala Glu Arg Phe Val Asp Asn Pro Phe Leu Val Gly Asp Arg
1860 1865 1870
Met Tyr Arg Thr Gly Asp Met Ala Arg Phe Leu Pro Asp Gly Asn Ile
1875 1880 1885
Glu Tyr Ile Gly Arg Met Asp Glu Gln Val Lys Ile Arg Gly His Arg
1890 1895 1900
Ile Glu Leu Gly Glu Ile Glu Lys Ser Leu Leu Glu Tyr Pro Ala Ile
1905 1910 1915 1920
Ser Glu Ala Val Leu Val Ala Lys Arg Asp Glu Gln Gly His Ser Tyr
1925 1930 1935
Leu Cys Ala Tyr Val Val Ser Thr Asp Gln Trp Thr Val Ala Gln Val
1940 1945 1950
Arg Gln His Leu Leu Glu Ala Leu Pro Glu Tyr Met Val Pro Ser Tyr
1955 1960 1965
Phe Val Glu Leu Glu Lys Leu Pro Leu Thr Ser Asn Gly Lys Val Asp
1970 1975 1980
Lys Arg Ala Leu Pro Glu Pro Asp Arg Val Ile Thr Asn Glu Tyr Val
1985 1990 1995 2000
Ala Ala Val Asn Glu Thr Glu Glu Lys Leu Val Gln Phe Phe Gln Glu
2005 2010 2015
Ile Leu Ala Val Asp Arg Val Gly Thr Gln Asp Thr Phe Phe Glu Leu
2020 2025 2030
Gly Gly His Ser Leu Lys Ala Met Met Leu Val Ser Arg Ile His Lys
2035 2040 2045
Glu Leu Glu Ile Glu Val Pro Leu Lys Glu Val Phe Ala Arg Gln Thr
2050 2055 2060
Val Lys Glu Leu Ala Ala Tyr Ile Arg Gln Ala Glu Gln Ser Asp Tyr
2065 2070 2075 2080
Ser Glu Ile Gln Pro Ala Met Glu Gln Glu Tyr Tyr Pro Val Ser Asn
2085 2090 2095
Ala Gln Arg Arg Met Tyr Val Val Gln Gln Met Arg Asp Val Glu Thr
2100 2105 2110
Thr Gly Tyr Asn Met Pro Phe Tyr Leu Glu Met Glu Gly Ala Leu Glu
2115 2120 2125
Val Glu Lys Leu Ser Leu Ala Leu Lys Gln Leu Ile Lys Arg His Glu
2130 2135 2140
Ser Leu Arg Thr Ser Phe His Met Val Glu Asp Glu Leu Met Gln Lys
2145 2150 2155 2160
Val His Ala Glu Val Ala Trp Glu Met Glu Met Ile His Ala Val Glu
2165 2170 2175
Glu Glu Val Gln Gln Leu Thr Asp Ser Phe Met Arg Pro Phe Asp Leu
2180 2185 2190
Ala Lys Ala Pro Leu Phe Arg Ala Gly Leu Ile Gln Ile Asn Pro Lys
2195 2200 2205
Arg His Leu Leu Met Leu Asp Met His His Ile Ile Ser Asp Gly Val
2210 2215 2220
Ser Met Asn Val Leu Phe Gln Asp Ile Thr Gln Leu Tyr Gln Gly Ile
2225 2230 2235 2240
Glu Leu Ser Pro Leu Lys Ile Gln Tyr Lys Asp Phe Ala Val Trp Gln
2245 2250 2255
Gln Gly Met Ala Gln Val Val Arg Phe Gln Glu Gln Glu Arg Tyr Trp
2260 2265 2270
Leu Asn Gln Phe Ser Gly Asp Leu Pro Ile Leu Glu Met Val Thr Asp
2275 2280 2285
Tyr Pro Arg Pro Ala Ile Gln Gln Phe Asp Gly Asp Ser Trp Ser Phe
2290 2295 2300
Glu Ile Asp Ala Lys Val Leu Asp Ser Ile Lys Gln Leu Ser Ala Lys
2305 2310 2315 2320
Gln Gly Thr Thr Leu Tyr Met Thr Leu Leu Ala Ile Tyr Gln Ile Leu
2325 2330 2335
Leu Ala Lys Tyr Thr Arg Gln Asp Asp Ile Ile Val Gly Thr Pro Ile
2340 2345 2350
Ala Gly Arg Pro His Ala Asp Thr Glu Ser Ile Val Gly Met Phe Val
2355 2360 2365
Asn Thr Leu Ala Leu Arg Gly Gln Pro Lys Glu Glu Gln Ser Phe Ile
2370 2375 2380
Ser Tyr Leu Ser Glu Val Lys Glu Asn Val Leu Gln Ala Tyr Ala Asn
2385 2390 2395 2400
Ala Asp Tyr Pro Phe Glu Glu Leu Ile Glu Lys Leu His Leu Gln Arg
2405 2410 2415
Asp Met Ser Arg His Pro Leu Phe Asp Thr Met Phe Val Leu Gln Asn
2420 2425 2430
Met Asp Met Ser Asp Ile Asn Ile Ser Gly Leu Lys Leu His Ser Arg
2435 2440 2445
Asp Leu Asn Trp Lys Asn Ala Lys Phe Asp Met Thr Trp Met Ile Ala
2450 2455 2460
Glu Gln Asn Asn Leu Leu Ile Ser Val Glu Tyr Ser Thr Ser Leu Phe
2465 2470 2475 2480
Lys His Glu Thr Ile Gln Arg Leu Glu Lys His Phe Thr Tyr Leu Val
2485 2490 2495
Glu Gln Val Ala Lys His Pro Asp Cys Leu Leu Arg Asp Leu Glu Leu
2500 2505 2510
Thr Thr Asp Glu Glu Lys Gln Gln Ile Leu Thr Val Phe Asn Asp Thr
2515 2520 2525
Ala Thr Asp Asp Leu Gln Asp Leu Ser Ile Cys His Leu Phe Glu Gln
2530 2535 2540
Gln Val Gln Arg Phe Ser Asp Arg Pro Ala Leu Val Phe Lys Asp Lys
2545 2550 2555 2560
Gln Leu Thr Tyr Ser Glu Phe His Ala Lys Val Asn Gln Leu Ala Arg
2565 2570 2575
Val Leu Arg Lys Lys Gly Val Gln Pro Asp Gln Ala Val Gly Leu Ile
2580 2585 2590
Thr Asp Arg Ser Ile Glu Met Met Ile Gly Ile Phe Ala Ile Leu Lys
2595 2600 2605
Ala Gly Gly Ala Tyr Met Pro Ile Asp Pro Ser Tyr Pro Ile Asp Arg
2610 2615 2620
Ile Glu His Met Leu Glu Asp Ser Arg Thr Lys Leu Leu Phe Val Gln
2625 2630 2635 2640
Lys Thr Glu Met Ile Pro Ala Ser Tyr Gln Gly Glu Val Leu Leu Leu
2645 2650 2655
Ala Glu Glu Cys Trp Met His Glu Asp Ser Ser Asn Leu Glu Leu Ile
2660 2665 2670
Asn Lys Thr Gln Asp Leu Ala Tyr Val Met Tyr Thr Ser Gly Ser Thr
2675 2680 2685
Gly Lys Pro Lys Gly Asn Leu Thr Thr His Gln Asn Ile Leu Thr Thr
2690 2695 2700
Ile Ile Asn Asn Gly Tyr Ile Glu Ile Ala Pro Thr Asp Arg Leu Leu
2705 2710 2715 2720
Gln Leu Ser Asn Tyr Ala Phe Asp Gly Ser Thr Phe Asp Ile Tyr Ser
2725 2730 2735
Ala Leu Leu Asn Gly Ala Thr Leu Val Leu Val Pro Lys Glu Val Met
2740 2745 2750
Leu Asn Pro Met Glu Leu Ala Lys Ile Val Arg Glu Gln Asp Ile Thr
2755 2760 2765
Val Ser Phe Met Thr Thr Ser Leu Phe His Thr Leu Val Glu Leu Asp
2770 2775 2780
Val Thr Ser Met Lys Ser Met Arg Lys Val Val Phe Gly Gly Glu Lys
2785 2790 2795 2800
Ala Ser Tyr Lys His Val Glu Lys Ala Leu Asp Tyr Leu Gly Glu Gly
2805 2810 2815
Arg Leu Val Asn Gly Tyr Gly Pro Thr Glu Thr Thr Val Phe Ala Thr
2820 2825 2830
Thr Tyr Thr Val Asp Ser Ser Ile Lys Glu Met Gly Ile Val Pro Ile
2835 2840 2845
Gly Arg Pro Leu Asn Asn Thr Ser Val Tyr Val Leu Asn Glu Asn Asn
2850 2855 2860
Gln Leu Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Val Gly Gly Ala
2865 2870 2875 2880
Gly Ile Ala Arg Gly Tyr Leu Asn Arg Pro Glu Leu Thr Ala Glu Arg
2885 2890 2895
Phe Val Glu Asn Pro Phe Val Ser Gly Asp Arg Met Tyr Arg Thr Gly
2900 2905 2910
Asp Leu Ala Arg Trp Leu Pro Asp Gly Ser Met Glu Tyr Leu Gly Arg
2915 2920 2925
Met Asp Glu Gln Val Lys Val Arg Gly Tyr Arg Ile Glu Leu Gly Glu
2930 2935 2940
Ile Glu Thr Arg Leu Leu Glu His Pro Ala Ile Ser Ala Val Val Leu
2945 2950 2955 2960
Leu Ala Lys Gln Asp Glu Gln Gly His Ser Tyr Leu Cys Ala Tyr Ile
2965 2970 2975
Val Ala Asn Gly Val Trp Thr Val Ala Glu Leu Arg Lys His Leu Ser
2980 2985 2990
Glu Ala Leu Pro Glu Tyr Met Val Pro Thr Tyr Phe Val Glu Leu Glu
2995 3000 3005
Gln Ile Pro Phe Thr Ser Asn Gly Lys Val Asn Lys Arg Ala Leu Pro
3010 3015 3020
Glu Pro Glu Gly Gln Ile Thr Ser Val Tyr Val Ala Pro Glu Thr Glu
3025 3030 3035 3040
Thr Glu Ala Lys Val Ala Ala Leu Phe Gln Glu Ile Leu Gly Val Glu
3045 3050 3055
Arg Val Gly Thr Gln Asp Met Phe Phe Glu Leu Gly Gly His Ser Leu
3060 3065 3070
Lys Ala Met Met Leu Val Leu Arg Met Asn Lys Glu Leu Gly Ile Glu
3075 3080 3085
Val Pro Leu Lys Glu Val Phe Ala His Pro Thr Val Lys Glu Leu Ala
3090 3095 3100
Ala Thr Ile Asp Leu Leu Asp Arg Ser Gly His Ser Glu Ile Glu Pro
3105 3110 3115 3120
Ala Pro Arg Gln Glu Phe Tyr Pro Val Ser Ser Ala Gln Arg Arg Met
3125 3130 3135
Tyr Val Val Gln His Leu Gly Asn Val Gln Thr Thr Ser Tyr Asn Met
3140 3145 3150
Pro Leu Phe Leu Glu Val Glu Gly Ala Leu Glu Ile Asp Lys Leu His
3155 3160 3165
Leu Ala Leu Glu Lys Leu Val Glu Arg His Glu Ser Leu Arg Thr Ser
3170 3175 3180
Phe His Met Val Asp Glu Glu Leu Met Gln Gln Val His Glu Glu Val
3185 3190 3195 3200
Ala Trp Asp Leu Glu Ile Met Asp Gly Thr Glu Gly Asp Leu Ala Gly
3205 3210 3215
Ile Thr Ala Gly Phe Ile Arg Pro Phe Asp Leu Ser Gln Ala Pro Leu
3220 3225 3230
Phe Arg Ala Gly Ile Val Arg Ile Ser Pro Glu Arg Phe Leu Phe Met
3235 3240 3245
Leu Asp Met His His Ile Ile Ser Asp Gly Val Ser Thr Asn Val Leu
3250 3255 3260
Phe Gln Asp Ile Thr Gln Leu Tyr Gln Gly Lys Asp Leu Pro Pro Leu
3265 3270 3275 3280
Pro Ile Gln Tyr Lys Asp Tyr Ala Val Trp Gln Gln Ala Asp Ala Gln
3285 3290 3295
Val Thr Arg Leu Gln Asp Gln Glu Ser Tyr Trp Leu His Gln Phe Ala
3300 3305 3310
Gly Glu Ala Ser Val Leu Glu Met Pro Thr Asp Phe Pro Arg Pro Ala
3315 3320 3325
Val Gln Gln Phe Glu Gly Asp Val Trp Thr Phe Glu Val Asp Ala Asp
3330 3335 3340
Ile Leu Ser Gln Leu Lys Lys Leu Ser Val Ser Gln Gly Ser Thr Leu
3345 3350 3355 3360
Tyr Met Thr Leu Leu Ala Ala Tyr Gln Val Leu Leu Ala Lys Tyr Thr
3365 3370 3375
Gly Gln Asp Asp Ile Ile Val Gly Ser Pro Ile Ala Gly Arg Pro His
3380 3385 3390
Ala Asp Val Glu Ser Ile Val Gly Met Phe Val Asn Thr Leu Ala Leu
3395 3400 3405
Arg Gly Gln Pro Val Gly Glu Gln Thr Phe Ile Ser Tyr Leu Ala Gln
3410 3415 3420
Val Lys Glu Gln Val Leu Gln Ala Tyr Ala Asn Ala Glu Tyr Pro Phe
3425 3430 3435 3440
Glu Lys Leu Val Glu Lys Leu Asp Leu Gln Arg Asp Met Ser Arg His
3445 3450 3455
Pro Leu Phe Asp Thr Met Phe Thr Leu Gln Asn Met Glu Met Ser Asp
3460 3465 3470
Ile Asp Leu Ala Gly Leu Thr Phe Lys Pro Phe Asp Phe Glu Trp Lys
3475 3480 3485
Asn Ala Lys Phe Asp Met Asp Trp Thr Met Leu Glu Glu Glu Thr Leu
3490 3495 3500
Lys Val Ala Ile Glu Tyr Ser Thr Ser Leu Tyr Thr Lys Glu Thr Ile
3505 3510 3515 3520
Ser Arg Met Ala Gln His Phe Thr Tyr Val Leu Gln Gln Ile Ile Glu
3525 3530 3535
His Pro Ala Ile Arg Leu Ala Glu Ile Lys Ile Ala Thr Leu Pro Glu
3540 3545 3550
Ile Glu Gln Ile Leu Thr Gln Phe Asn Asp Thr Arg Ala Ser Tyr Pro
3555 3560 3565
Asp Asn Gln Thr Ile His Ser Leu Phe Glu Gln Gln Val Glu Arg Thr
3570 3575 3580
Pro Glu Gln Ile Ala Val Val Tyr Gln Asp Gln Ser Ile Thr Tyr Arg
3585 3590 3595 3600
Glu Leu Asn Glu Arg Ala Asn Arg Leu Ala Arg Cys Leu Ile Asp Lys
3605 3610 3615
Gly Ile Gln Arg Asn Gln Phe Val Ala Ile Met Ala Asp Arg Ser Ile
3620 3625 3630
Glu Thr Val Ile Gly Met Met Gly Ile Leu Lys Ala Gly Gly Ala Tyr
3635 3640 3645
Val Pro Ile Asp Pro Asp Tyr Pro Leu Asp Arg Lys Leu Tyr Ile Leu
3650 3655 3660
Glu Asp Ser His Ala Ser Leu Leu Leu Phe Gln Gln Lys His Glu Val
3665 3670 3675 3680
Pro Ser Glu Phe Thr Gly Asp Arg Ile Leu Ile Glu Gln Met Gln Trp
3685 3690 3695
Tyr Gln Ala Ala Asp Thr Asn Val Gly Ile Val Asn Thr Ala Gln Asp
3700 3705 3710
Leu Ala Tyr Met Ile Tyr Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly
3715 3720 3725
Val Met Ile Asp His Gln Ala Val Cys Asn Leu Cys Leu Met Ala Gln
3730 3735 3740
Thr Tyr Gly Ile Phe Ala Asn Ser Arg Val Leu Gln Phe Ala Ser Phe
3745 3750 3755 3760
Ser Phe Asp Ala Ser Val Gly Glu Val Phe His Thr Leu Thr Asn Gly
3765 3770 3775
Ala Thr Leu Tyr Leu Met Asp Arg Asn Leu Val Met Ala Gly Val Glu
3780 3785 3790
Phe Val Glu Trp Leu Arg Val Asn Glu Ile Thr Ser Ile Pro Phe Ile
3795 3800 3805
Ser Pro Ser Ala Leu Arg Ala Leu Pro Tyr Glu Asp Leu Pro Ala Leu
3810 3815 3820
Lys Tyr Ile Ser Thr Gly Gly Glu Ala Leu Pro Val Asp Leu Val Arg
3825 3830 3835 3840
Leu Trp Gly Thr Glu Arg Ile Phe Leu Asn Ala Tyr Gly Pro Thr Glu
3845 3850 3855
Thr Thr Val Asp Ala Thr Ile Gly Leu Cys Thr Pro Glu Asp Lys Pro
3860 3865 3870
His Ile Gly Lys Pro Val Leu Asn Lys Lys Ala Tyr Ile Ile Asn Pro
3875 3880 3885
Asn Tyr Gln Leu Gln Pro Ile Gly Val Pro Gly Glu Leu Cys Ile Gly
3890 3895 3900
Gly Val Gly Ile Ala Pro Gly Tyr Trp Asn Arg Pro Glu Leu Thr Arg
3905 3910 3915 3920
Glu Lys Phe Val Asp Asn Pro Phe Ala Gln Gly Glu Arg Met Tyr Lys
3925 3930 3935
Thr Gly Asp Leu Val Arg Trp Leu Pro Asp Gly Asn Ile Glu Phe Leu
3940 3945 3950
Gly Arg Ile Asp Asp Gln Val Lys Ile Arg Gly His Arg Ile Glu Leu
3955 3960 3965
Gly Glu Ile Glu Thr Arg Leu Leu Glu His Glu Gln Val Ile Glu Ala
3970 3975 3980
Val Val Leu Ala Arg Glu Asp Glu Gln Gly Gln Ala Tyr Leu Cys Ala
3985 3990 3995 4000
Tyr Leu Val Ala Ala Asp Glu Trp Thr Val Ala Glu Leu Arg Lys His
4005 4010 4015
Leu Gly Lys Thr Leu Pro Asp Tyr Met Ile Pro Ala Tyr Phe Ile Glu
4020 4025 4030
Leu Glu Glu Phe Pro Leu Thr Pro Ser Gly Lys Val Asn Lys Lys Ala
4035 4040 4045
Leu Pro Glu Pro Asp Gly Gln Ile Gln Thr Gly Val Glu Tyr Val Glu
4050 4055 4060
Ala Thr Thr Glu Ser Gln Lys Ile Leu Val Glu Leu Trp Gln Glu Val
4065 4070 4075 4080
Leu Arg Val Glu Arg Ile Gly Ile Tyr Asp Asn Phe Phe Glu Leu Gly
4085 4090 4095
Gly Asp Ser Ile Lys Ala Ile Gln Ile Thr Ala Arg Leu Arg Arg Tyr
4100 4105 4110
His Arg Lys Leu Glu Ile Ser His Leu Phe Lys His Pro Thr Ile Ala
4115 4120 4125
Glu Leu Ala Pro Trp Met Gln Thr Ser Gln Ala Leu Leu Glu Gln Gly
4130 4135 4140
Thr Val Glu Gly Glu Val Met Leu Thr Pro Ile Gln Lys Ala Phe Phe
4145 4150 4155 4160
Glu Glu Asn Gln Glu Gln Pro Gln His Phe Asn Gln Asp Ser Leu Leu
4165 4170 4175
Tyr Ser Ser Asn Gly Trp Asn Gln Asp Ala Ile Glu Gln Val Phe Glu
4180 4185 4190
Lys Ile Thr Glu His His Asp Ala Leu Arg Met Val Tyr Pro His Thr
4195 4200 4205
Glu Gly Lys Val Thr Gln Ile Asn Arg Gly Leu Glu Asp Lys Ala Phe
4210 4215 4220
Thr Leu Gln Val Phe Asp Phe Thr Gln Glu Pro Thr Asp Thr Gln Ala
4225 4230 4235 4240
Thr Lys Ile Glu Gln Ile Ala Thr Gln Leu Gln Ala Ser Phe Asp Leu
4245 4250 4255
Glu Lys Gly Pro Leu Val Arg Leu Gly Leu Phe Thr Thr Lys Ala Gly
4260 4265 4270
Asp Tyr Leu Leu Ile Val Ile His His Leu Val Ile Asp Gly Val Ser
4275 4280 4285
Trp Arg Ile Leu Leu Glu Asp Phe His Asn Ala Tyr Gln Gln Val Ile
4290 4295 4300
Gln Gly Gln Ala Ile Val Leu Pro Glu Lys Thr Thr Ser Phe Lys Thr
4305 4310 4315 4320
Trp Ser Glu Arg Leu Asn Glu Tyr Ala Asn Ser His Ala Leu Leu His
4325 4330 4335
Glu Ile Pro Tyr Trp Lys Gln Met Glu Glu Ile Ser Ile Ala Pro Leu
4340 4345 4350
Pro Lys Lys Gly Asn Asn Asp Gly Arg Tyr Tyr Val Lys Asp Ser Glu
4355 4360 4365
Tyr Ala Thr Met Ser Leu Thr Glu Glu Glu Thr Gln Asn Leu Leu Thr
4370 4375 4380
Arg Val His Arg Ala Tyr Arg Thr Glu Ile Asn Asp Leu Leu Leu Ala
4385 4390 4395 4400
Ala Leu Gly Leu Ala Ser Lys Glu Trp Thr Lys Glu Asn Arg Val Ala
4405 4410 4415
Ile His Leu Glu Gly His Gly Arg Glu Glu Ile Gly Glu Gly Val Asp
4420 4425 4430
Val Asn Arg Thr Val Gly Trp Phe Thr Ser Leu Phe Pro Val Val Ile
4435 4440 4445
Asp Leu Glu Asn Asp Glu Leu Pro Leu Ile Ile Lys Ser Val Lys Glu
4450 4455 4460
Thr Leu Arg Arg Val Pro Asn Lys Gly Met Gly Tyr Gly Ile Leu Lys
4465 4470 4475 4480
His Leu Thr Ser Asp Ala Asn Lys Gln Glu Ile Thr Phe Ser Leu Arg
4485 4490 4495
Pro Glu Ile Ser Phe Asn Tyr Leu Gly Val Phe Asp Gln Gln Glu Glu
4500 4505 4510
Glu Ser Glu Ser Ala Gly Ile Pro Thr Gly Gln Pro Ile Ser Pro Gln
4515 4520 4525
Tyr Tyr Asp Thr His Leu Leu Glu Phe Asn Gly Ala Val Ser Asn Asn
4530 4535 4540
Gln Leu His Val Asn Cys Arg Phe Ala Pro Thr Ala Val Asp Arg Ala
4545 4550 4555 4560
Ile Val Glu Ile Leu Met Glu Arg Phe Lys His His Leu Leu Leu Ile
4565 4570 4575
Ser Lys His Cys Leu Glu Lys Asp Thr Val Glu Phe Thr Pro Thr Asp
4580 4585 4590
Phe Thr Glu Lys Glu Leu Ser Gln Glu Gln Leu Asp Asp Leu Leu Asp
4595 4600 4605
Asp Leu Phe Glu Asp Ile Asp Asp Leu
4610 4615
<210> 57
<211> 2530
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 57
Met Ser Asp Lys Leu Ser Asn Ala Lys Asp Leu Phe Pro Met Ser Asp
1 5 10 15
Ile Gln Leu Gly Met Val Tyr His Ser Leu Lys His Val His Glu Ala
20 25 30
Val Tyr His Asp Gln Phe Val Tyr Gln Val Asp Asp Asp Ser Phe Asp
35 40 45
Val His Val Leu Glu Gln Ala Met Arg Met Met Val Asp Lys His Asp
50 55 60
Ile Leu Lys Thr Ser Phe His Ile Glu Glu Phe Ser Thr Pro Val Gln
65 70 75 80
Val Val His Gln Glu Val Ser Val Arg Ile Asp Glu Thr Asp Ile Thr
85 90 95
His Leu Gly Glu Lys Gln Lys Glu Tyr Ile His Gln Tyr Leu Ala Gln
100 105 110
Asp Arg Gln Ser Pro Phe Asp Val Thr Thr Ala Pro Leu Trp Arg Met
115 120 125
Ser Val Phe Lys Leu Asn Ala Ser Gln Val Ala Leu Val Trp Ile Phe
130 135 140
His His Ala Ile Leu Asp Gly Trp Ser Val Ala Ser Phe Ile Thr Glu
145 150 155 160
Leu Ile Asp Val Tyr Phe Lys Leu Lys His Lys Thr Cys Thr Leu Glu
165 170 175
His Leu Asn Thr Thr Tyr Lys Asp Tyr Val Ile Asp Gln Met Leu Leu
180 185 190
Ser Glu Gln Asn Glu Leu Arg Glu Tyr Trp Lys Glu Glu Leu Lys Asp
195 200 205
Tyr Lys Arg Leu Gln Leu Pro Val Lys Val Asp Glu Asn Gly Gly Val
210 215 220
His Val Thr Val Val Glu Lys Leu Asp Pro Asp Ile Ile Asn Lys Cys
225 230 235 240
Arg Glu Ile Ala Gln Ala His His Ile Pro Leu Lys Thr Val Cys Leu
245 250 255
Thr Ala Phe Leu Ser Met Met His Met Ile Ser Tyr Glu Arg Asp Leu
260 265 270
Thr Val Gly Leu Ile Glu Asn Asn Arg Pro Ile Ile Glu Asp Ala Glu
275 280 285
Lys Val Leu Gly Cys Phe Leu Asn Ser Val Pro Phe Arg Ala Ile Ile
290 295 300
Gln Lys Asp Met Ser Tyr Arg Glu Leu Leu Glu Gln Thr Gln Gln Lys
305 310 315 320
Leu Val Glu Ile Lys Thr Tyr Gly Arg Leu Ser Phe Ala Lys Ile Ile
325 330 335
Glu Val Ile Gly Asp Thr Gly Ser Glu Arg Asn Pro Val Phe Asp Cys
340 345 350
Leu Phe Asn Phe Val Asp Phe His Val Phe Lys Gly Ile Lys Asp His
355 360 365
Lys Val Lys Phe Trp Leu Asp Gly Tyr Glu Lys Thr Asn Thr Met Phe
370 375 380
Asp Phe Ser Val Ser Thr Thr Met Asp Asp Tyr Phe Val Arg Val Val
385 390 395 400
Ser Ala Leu Pro Glu Glu Asp Thr Ile Lys Leu Ile Asn Tyr Tyr Gln
405 410 415
Arg Ile Leu Glu Lys Ile Ala Leu His Ile Asp Glu Lys Ile Asp Lys
420 425 430
Gln Ala Asn Leu Asp Glu Lys Glu Ser His Leu Leu Leu Glu Glu Trp
435 440 445
Asn Gln Thr Ser Val Asp Tyr Pro Asp Lys Gln Thr Leu His Lys Arg
450 455 460
Phe Glu Glu Gln Ala Ala Lys Ser Glu Asp Gln Ile Ala Leu Glu Tyr
465 470 475 480
Glu Asp Lys Gln Leu Thr Tyr Arg Glu Leu Asn Ala Lys Ala Asn Gln
485 490 495
Leu Ala Arg Val Leu Gln Lys His Asn Thr Leu Pro Thr Gln Val Val
500 505 510
Gly Leu Met Ala Glu Arg Ser Leu Glu Met Ile Ile Gly Ile Leu Gly
515 520 525
Ile Leu Lys Ala Gly Gly Ala Tyr Met Pro Ile Asp Pro Thr Tyr Pro
530 535 540
Ala Glu Arg Ile Gln Tyr Met Leu Glu Asp Ser Arg Ser Tyr Leu Leu
545 550 555 560
Leu Val Gln Lys Ala Glu Met Ile Pro Ala Asn Tyr Gln Gly Glu Val
565 570 575
Leu Ile Leu Thr Glu Glu Leu Trp Ala Asp Glu Asn Thr Glu Asn Leu
580 585 590
Asp Leu Val Asn Gln Pro Gln Asp Val Ala Asn Ile Met Tyr Thr Ser
595 600 605
Gly Thr Thr Gly Lys Pro Lys Gly Ile Leu Ile Thr His Arg Asn Ile
610 615 620
Met Thr Thr Ile Ile Asn Asn Gly Tyr Leu Asp Ile Phe Ser Thr Asp
625 630 635 640
Arg Ile Leu Gln Met Ser Asn Tyr Ala Phe Asp Gly Ser Thr Phe Asp
645 650 655
Ile Tyr Ser Ala Leu Leu Asn Gly Ala Thr Leu Val Leu Val Pro Lys
660 665 670
Gln Thr Leu Met Asn Thr Thr Asp Leu Leu Ala Val Ile Lys Asp Ser
675 680 685
Asn Ile Thr Val Ala Leu Leu Thr Thr Ser Leu Phe Asn Thr Leu Val
690 695 700
Asp Leu Asp Val Thr Ser Phe Gln His Thr Arg Lys Val Leu Phe Gly
705 710 715 720
Gly Glu Lys Ala Ser Cys Lys His Val Glu Lys Ala Leu Asp Tyr Leu
725 730 735
Gly Glu Gly Arg Leu Val Asn Gly Tyr Gly Pro Thr Glu Thr Thr Val
740 745 750
Phe Ala Thr Thr Tyr Thr Val Asp Asn Thr Ile Lys Lys Leu Gly Ser
755 760 765
Val Pro Ile Gly Arg Pro Leu Ser Asn Thr Ser Val Tyr Ile Phe Gly
770 775 780
Leu Asp Asp Gln Leu Gln Pro Leu Gly Val Pro Gly Glu Leu Cys Val
785 790 795 800
Ala Gly Glu Cys Ile Ser Pro Gly Tyr Leu Asn Arg Pro Asp Leu Thr
805 810 815
Ala Asp Lys Phe Ile Asp Asn Pro Leu Lys Pro Gly Glu Arg Met Tyr
820 825 830
Arg Thr Gly Asp Leu Val Arg Trp Leu Pro Glu Gly Val Met Glu Tyr
835 840 845
Met Gly Arg Ile Asp Glu Gln Val Lys Ile Arg Gly His Arg Ile Glu
850 855 860
Leu Gly Glu Ile Glu Ala Lys Leu Leu Glu His Pro Ser Ile Arg Glu
865 870 875 880
Thr Val Leu Val Ala Lys Gln Asp Ala Asn Gly His Ser Phe Leu Gly
885 890 895
Ala Tyr Leu Val Thr Asp Asn Phe Cys Pro Val Thr Glu Leu Arg Asn
900 905 910
Tyr Leu Met Glu Thr Leu Pro Glu Tyr Met Val Pro Ser Tyr Phe Ile
915 920 925
Glu Leu Asp Ser Leu Pro Leu Thr Ser Asn Gly Lys Val Asp Lys Arg
930 935 940
Ala Leu Pro Glu Pro Glu Ser Gln Ala Leu His Ala Tyr Thr Met Pro
945 950 955 960
Glu Asn Glu Met Glu Glu Lys Leu Val Gln Leu Phe Gln Glu Val Met
965 970 975
Asp Val Glu Arg Val Gly Thr Gln Asp Ser Phe Tyr Glu Leu Gly Gly
980 985 990
His Ser Leu Lys Ala Met Leu Leu Val Ser Arg Ile His Lys Asp Phe
995 1000 1005
Gly Ile Lys Ile Pro Leu Lys Glu Val Phe Ser Arg Pro Thr Val Lys
1010 1015 1020
Glu Leu Ala Ala Tyr Leu Thr Gly Ser Glu Glu Ala Asn Tyr Ile Glu
1025 1030 1035 1040
Ile Glu Ala Ala Glu Glu Lys Pro Tyr Tyr Pro Val Thr Ala Ala Gln
1045 1050 1055
Lys Arg Met Tyr Ile Ala Gln Gln Trp Glu Asp Gly Glu Ala Thr Ser
1060 1065 1070
Ser Tyr His Met Pro Phe Met Met Glu Ile Thr Gly Pro Leu Gln Val
1075 1080 1085
Glu Lys Leu Gln Gln Thr Val Lys Ser Leu Val Ala Arg His Glu Ser
1090 1095 1100
Leu Arg Thr Ser Phe His Met Ile Asn Glu Val Leu Met Gln Lys Ile
1105 1110 1115 1120
His Ala Asp Val Leu Trp Asp Leu Asp Ile Asp Leu Glu Ser Val Val
1125 1130 1135
Ala Ser Glu Gln Glu Ile Asp Glu Lys Met Phe Gln Phe Leu Arg Lys
1140 1145 1150
Phe Asp Leu Ser Gln Ala Pro Leu Phe Arg Ala Lys Leu Ile Arg Val
1155 1160 1165
Asn Ala Ser Arg His Val Leu Leu Leu Asp Met His His Ile Ile Ser
1170 1175 1180
Asp Gly Phe Ser Tyr Gln Ile Phe Phe Asp Glu Leu Thr Lys Leu Tyr
1185 1190 1195 1200
Gln Gly Glu Glu Leu Pro Ser Leu Lys Ile Gln Tyr Lys Asp Tyr Ala
1205 1210 1215
Val Trp Gln His Ser Glu Glu Gln Gln Lys Arg Leu Gln Gln Gln Glu
1220 1225 1230
Asp Tyr Trp Leu Gly Gln Phe Gln Gly Glu Ile Pro Val Leu Glu Leu
1235 1240 1245
Pro Thr Asp Tyr Gln Arg Pro Val Asp Lys Gln Phe Ala Gly Ala Leu
1250 1255 1260
Phe Thr His Gly Leu Ser Ala Gly Leu Thr Glu Lys Leu Arg Lys Leu
1265 1270 1275 1280
Ala Ile Lys Glu Lys Thr Thr Leu Tyr Thr Val Leu Leu Thr Val Tyr
1285 1290 1295
Asn Ile Leu Leu Thr Lys Tyr Thr Ser Gln Glu Asp Leu Ile Val Gly
1300 1305 1310
Thr Pro Ile Ala Gly Arg Pro His Ala Asp Leu Asp Arg Val Phe Gly
1315 1320 1325
Met Phe Val Asn Thr Leu Ala Ile Arg Thr Ala Pro Lys Val Glu Tyr
1330 1335 1340
Ser Phe Leu Ala Tyr Leu Ser Glu Val Lys Glu Thr Val Leu Gly Ala
1345 1350 1355 1360
Tyr Gln Asn Pro Asp Tyr Pro Phe Glu Glu Leu Val Glu Lys Thr Leu
1365 1370 1375
Val Gln Arg Asp Val Ser Arg Asn Pro Leu Phe Asp Val Met Phe Ser
1380 1385 1390
Val Glu Lys Leu Pro Ser Ala Val Gln Phe Asp Asp Leu Arg Phe Cys
1395 1400 1405
Pro Arg Leu Phe Asp Trp Lys Lys Ala Lys Phe Asp Leu Asp Trp Thr
1410 1415 1420
Val Val Glu Gly Glu Ser Leu Glu Val Leu Val Glu Tyr Ser Thr Ser
1425 1430 1435 1440
Leu Phe Asp Arg Ala Thr Ile Glu Arg Met Ala Lys His Phe Glu His
1445 1450 1455
Ile Leu Glu Gln Ile Leu Asp Gln Pro Asp Leu Ser Ile Ser Glu Ile
1460 1465 1470
Glu Leu Leu Thr Glu Ala Glu Lys Gln Gln Ile Leu Ile Glu Phe Asn
1475 1480 1485
Gln Ser Asp Lys Ser Phe Asp Ser Glu Lys Thr Ile Gln Glu Gln Phe
1490 1495 1500
Glu Glu Trp Ala Glu Lys Ala Pro His Ser Ile Ala Leu Val Phe Lys
1505 1510 1515 1520
Asp Lys Gln Met Thr Tyr Gln Glu Leu Asn Gln Arg Ala Asn Gln Val
1525 1530 1535
Ala His Leu Leu Arg Gly Asn Gly Ile Ser Ala Asn Asp Phe Ile Gly
1540 1545 1550
Leu Met Val Asp Arg Ser Phe Glu Met Ile Ile Ser Met Leu Gly Ile
1555 1560 1565
Leu Lys Ala Gly Gly Ala Tyr Leu Pro Ile Asp Pro Asp Tyr Pro Glu
1570 1575 1580
Asp Arg Ile Asp Tyr Met Leu Ser Asp Ser Lys Ala Lys Ile Leu Leu
1585 1590 1595 1600
Lys Gln Ser Asp Gln Thr Ala Pro Ala Ser Phe Glu Gly Lys Val Ile
1605 1610 1615
Ala Ile Asp Thr Pro Glu Leu Leu Glu Met Asp Ile Glu Asn Ile Pro
1620 1625 1630
Lys Val Asn Asn Ser Ser Asp Leu Ala Tyr Ile Ile Tyr Thr Ser Gly
1635 1640 1645
Ser Thr Gly Lys Pro Lys Gly Val Leu Ile Asn His Arg Cys Val Ile
1650 1655 1660
Asn Met Gln Leu Thr Ala Glu Thr Phe Gly Ile Tyr Pro Ser Ser Arg
1665 1670 1675 1680
Ile Leu Gln Phe Ala Ser Phe Ser Phe Asp Ser Ser Val Gly Glu Ile
1685 1690 1695
Phe Tyr Thr Leu Leu Asn Gly Ala Cys Leu Tyr Leu Val Glu Lys Asp
1700 1705 1710
Leu Leu Leu Ser Gly Asn Glu Phe Val Ala Trp Leu Lys Lys Asn Arg
1715 1720 1725
Ile Ser Ser Ile Pro Phe Ile Ser Pro Ser Ala Leu Arg Met Leu Pro
1730 1735 1740
Tyr Glu Asp Leu Pro Asp Leu Ala Tyr Ile Ser Thr Gly Gly Glu Thr
1745 1750 1755 1760
Leu Pro Ala Asp Leu Val Lys Ala Trp Gly Glu Asn Arg Val Phe Leu
1765 1770 1775
Asn Ala Tyr Gly Pro Thr Glu Thr Thr Val Asp Ala Thr Val Gly Val
1780 1785 1790
Cys Thr Pro Glu Gly Lys Pro His Ile Gly Arg Pro Val Thr Asn Lys
1795 1800 1805
Lys Val Tyr Val Val Asn Ser Asn Asn Gln Leu Gln Pro Ile Gly Val
1810 1815 1820
Pro Gly Glu Leu Cys Ile Gly Gly Glu Gly Val Ala Leu Gly Tyr Leu
1825 1830 1835 1840
Asn Arg Pro Asp Leu Thr Gln Glu Lys Phe Val Ser Asn Pro Phe Ala
1845 1850 1855
Pro Gly Glu Arg Met Tyr Arg Ser Gly Asp Leu Val Arg Trp Leu Pro
1860 1865 1870
Asp Gly Thr Ile Glu Tyr Phe Gly Arg Leu Asp Asp Gln Val Lys Ile
1875 1880 1885
Arg Gly His Arg Ile Glu Leu Gly Glu Ile Glu Thr Arg Leu Leu Glu
1890 1895 1900
His Pro Cys Ile Lys Glu Ala Ile Val Ile Pro Arg Ser Asp Glu Ser
1905 1910 1915 1920
Glu Ala Thr Tyr Leu Cys Ser Tyr Leu Ile Ala Glu Gly Ser Trp Asn
1925 1930 1935
Ala Ala Asp Leu Arg Lys Tyr Leu Lys Ala Ser Leu Pro Glu Tyr Met
1940 1945 1950
Ile Pro Ser Tyr Phe Val Glu Leu His Glu Leu Pro Leu Thr Pro Asn
1955 1960 1965
Gly Lys Val Asn Lys Lys Ala Leu Pro Lys Pro Glu Lys Gln Met Gln
1970 1975 1980
Arg Gly Gln Asp Tyr Val Ala Pro Thr Asn Pro Ile Gln Ser Ile Leu
1985 1990 1995 2000
Ser His Ile Trp Thr Asp Val Leu Gly Val Glu Asn Ile Gly Ile His
2005 2010 2015
Asp Asn Phe Phe Glu Leu Gly Gly Asp Ser Ile Lys Ala Ile Gln Ile
2020 2025 2030
Ser Ala Arg Leu Asn Lys His Asn Leu Lys Val Lys Met Arg Glu Leu
2035 2040 2045
Phe Lys Asn Pro Thr Ile Ala Glu Leu Ser Leu Leu Val Gln Gln Ile
2050 2055 2060
Val Gln Glu Ile Asp Gln Gly Val Val Glu Gly Asn Ile Pro Leu Thr
2065 2070 2075 2080
Pro Ile Gln His Trp Phe Phe Thr Gln Ser Phe Pro Gln Val Asn His
2085 2090 2095
Tyr Asn Gln Ser Val Leu Leu Phe Asn Ala Glu Gly Trp Asp Glu Gln
2100 2105 2110
Lys Val Asp Lys Ala Phe Glu Met Leu Thr Gln His His Asp Ala Leu
2115 2120 2125
Arg Ile Val Tyr Ser Leu Asp Glu Gln Gly Val Val Gln His Asn Arg
2130 2135 2140
Gly Leu Glu Gly Ser Asn Tyr His Phe Glu Ile Ile Asp Val Arg Gln
2145 2150 2155 2160
Asp Gly Glu Asp Gln Ser Asn Trp Lys Ala Ala Ala Asn Arg Met Gln
2165 2170 2175
Ala Ser Met Asp Ile Val Glu Gly Pro Leu Val Gln Ile Gly Leu Phe
2180 2185 2190
Arg Ala Asn Glu Gly Ala Tyr Leu Leu Ile Ala Ile His His Leu Val
2195 2200 2205
Val Asp Gly Val Ser Trp Arg Ile Leu Leu Glu Asp Phe Tyr His Leu
2210 2215 2220
Tyr Asn Gly Asn Asp Ser Leu Pro Leu Lys Thr Thr Ser Phe Gln Ala
2225 2230 2235 2240
Trp Ser Gln Lys Leu Gln Glu Tyr Ala Gln Ser Lys Glu Leu Glu His
2245 2250 2255
Glu Leu Ser Tyr Trp Arg His Leu Asp Glu Ala Ile Thr Asp Tyr Thr
2260 2265 2270
Leu His Lys Asp Ile Glu Ala Ala Thr Ser Asn Lys Thr Thr Tyr Glu
2275 2280 2285
Glu Phe Leu Thr Val Ser Met Ser Leu Ser Thr Glu Glu Thr Gln Gln
2290 2295 2300
Leu Val Thr Glu Ala His Lys Ala Tyr Gln Thr Glu Ile Asn Asp Leu
2305 2310 2315 2320
Leu Leu Thr Ala Leu Ala Leu Ala Leu Lys Glu Trp Thr Asn Lys Glu
2325 2330 2335
Gln Leu Leu Val Ser Met Glu Gly His Gly Arg Glu Glu Ile Leu Asp
2340 2345 2350
Asn Val Asp Ile Ser Arg Thr Val Gly Trp Phe Thr Ser Glu Tyr Pro
2355 2360 2365
Val Ala Ile His Leu Thr Lys Thr Asp Ile Ser Phe Ala Ile Lys Gln
2370 2375 2380
Val Lys Glu Thr Leu Arg Arg Val Pro Asn Lys Gly Phe Gly Tyr Gly
2385 2390 2395 2400
Ile Leu Lys Tyr Leu Ala Lys Glu Thr Phe Lys Leu Lys Pro Glu Ile
2405 2410 2415
Ser Phe Asn Tyr Leu Gly Gln Phe Thr Asp Lys Glu Glu Gly Asn Ser
2420 2425 2430
Ser Leu Met Gly Asp Leu Ile Ser Pro Ala Asn Thr Ser Glu Leu Ser
2435 2440 2445
Leu Asp Ile Asn Gly Ser Ile Glu Ala Asp Arg Leu Gln Met His Phe
2450 2455 2460
Ser Tyr Asn Ser Arg Ala Tyr Tyr Pro Glu Thr Ile Ala Ala Leu Val
2465 2470 2475 2480
Gln Asn Phe Lys Ser Tyr Leu Leu Glu Ile Ile Asn His Cys Arg Ala
2485 2490 2495
Lys Glu Gly Val Glu His Thr Pro Ser Asp Phe Asp Ile Asn Asp Leu
2500 2505 2510
Thr Met Glu Glu Leu Asp Asp Ile Phe Asp Asp Leu Glu Glu Glu Val
2515 2520 2525
Tyr Lys
2530
<210> 58
<211> 641
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 58
Met Asp Leu Ser Thr Leu Asn Phe Leu Gly Glu Thr Glu Lys His Lys
1 5 10 15
Leu Leu Asn Gln Phe Asn Asp Thr Asp Ala Asn Phe Pro Gln Glu Met
20 25 30
Thr Ile His Gly Leu Phe Glu Lys Gln Val Gln Glu Arg Pro Asn Gln
35 40 45
Thr Ala Val Ile Phe Asn Glu Gln Ser Met Thr Tyr Lys Glu Met Asn
50 55 60
Glu Arg Ala Asn Gln Val Ala His Ser Leu Arg Lys His Gly Val Ala
65 70 75 80
Pro Asp Glu Ile Val Gly Ile Leu Ala Asp Arg Asn Met Asp Met Leu
85 90 95
Ile Ser Ile Leu Gly Val Leu Lys Ala Gly Ala Ala Tyr Met Pro Ile
100 105 110
Asp Pro Thr Tyr Pro Thr Glu Arg Ile Leu Tyr Met Ile Asn Asp Ser
115 120 125
Gln Thr Lys Ile Val Leu Ala Glu His Arg Glu Met Val Pro Glu Gly
130 135 140
Cys Asn Ala Glu Leu Ile Leu Leu Gln Asp Ser Ser Leu Leu Asn Glu
145 150 155 160
Glu Thr Ser Asp Leu Glu His Val Asn Lys Pro Glu Asp Leu Ala Tyr
165 170 175
Ile Ile Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Val Met Ile
180 185 190
Glu His Arg Asn Val Ile Arg Leu Leu Phe Asn Asp Arg Asn Leu Phe
195 200 205
Asp Phe Thr Ser Asp Asp Val Trp Thr Val Phe His Ser Phe Cys Phe
210 215 220
Asp Phe Ser Val Trp Glu Met Tyr Gly Ala Leu Leu Tyr Gly Gly Lys
225 230 235 240
Ile Val Leu Val Ser Phe Glu Ile Ala Arg Asp Pro Gln Ala Phe Arg
245 250 255
Asn Leu Leu Gln Glu Gln Lys Val Ser Ile Leu Asn Gln Thr Pro Thr
260 265 270
Ala Phe Tyr Gln Leu Ser Ser Glu Glu Met Gln His Ser Asp Ser Asn
275 280 285
Leu Ser Ile Arg Lys Ile Ile Phe Gly Gly Glu Ala Leu Thr Pro Ser
290 295 300
Gln Leu Lys Ala Trp Lys Gln Lys Tyr Pro Asn Thr Ala Leu Ile Asn
305 310 315 320
Met Tyr Gly Ile Thr Glu Thr Thr Val His Val Thr Tyr Lys Glu Phe
325 330 335
Gln Leu His Asp Met Asp Ser Thr Val Ser Asn Ile Gly Lys Pro Ile
340 345 350
Pro Thr Leu Arg Thr Tyr Val Leu Asp Ser Lys Arg Asn Leu Ala Pro
355 360 365
Ile Gly Val Lys Gly Glu Leu Tyr Val Ser Gly Lys Gly Val Ala Arg
370 375 380
Gly Tyr Leu Asn Lys Pro Glu Leu Thr Glu Glu Arg Phe Met Asp Asn
385 390 395 400
Pro Phe Val Ser Glu Glu Arg Met Tyr Arg Thr Gly Asp Leu Ala Arg
405 410 415
Trp Leu Pro Glu Gly Glu Leu Glu Tyr Leu Gly Arg Ile Asp His Gln
420 425 430
Val Lys Ile Arg Gly Tyr Arg Ile Glu Leu Gly Glu Ile Glu Ala Glu
435 440 445
Leu Leu Lys Gln Lys Gly Ile Lys Glu Ala Val Val Leu Val Thr Asn
450 455 460
Asp Lys Asp Ala Gln Pro Gln Leu His Ala Tyr Leu Thr Ser Thr Glu
465 470 475 480
Asp Leu Ala Ala Ala Asp Leu Arg Asn Gln Leu Thr Thr Thr Leu Pro
485 490 495
Ser Tyr Met Ile Pro Ala His Phe Ile Phe Val Ser Gln Met Pro Val
500 505 510
Thr Pro Asn Gly Lys Ile Asp Lys Glu Ser Leu Arg Lys Ile Glu Pro
515 520 525
Ser Leu Gln Glu Ser Pro Thr Glu Ala Tyr Val Ala Pro Gln Thr Pro
530 535 540
Thr Glu Lys Gln Leu Val His Ile Trp Glu Glu Asn Ile Gly Met Gln
545 550 555 560
Pro Ile Ser Ile Asp Asp Asn Tyr Phe Ala Leu Gly Gly Asp Ser Ile
565 570 575
Lys Ala Ile Lys Leu Leu His Ala Ile Asn Lys Glu Phe Gln Ile Ser
580 585 590
Phe Gln Ile Gly Asp Leu Tyr Lys His Gly Thr Ile Arg Glu Met Gly
595 600 605
Gln Gln Ile Gly Glu Gln Gly Lys Gln Ser Ser Asn Gln Lys Leu Leu
610 615 620
Lys Leu Gln Glu Leu Asp Arg Leu Lys Glu Lys Ile Leu Gly Ser Glu
625 630 635 640
Lys
<210> 59
<211> 317
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 59
Met Lys Arg Ile Met Val Thr Gly Ala Leu Gly Gln Ile Gly Ser Glu
1 5 10 15
Leu Val Thr Lys Leu Arg Ser Val Tyr Gly Thr Glu Ser Val Ile Ala
20 25 30
Thr Asp Ile Arg Gln Gly Glu Gly Gln Glu Gly Pro Phe Glu Ile Leu
35 40 45
Asp Val Thr Asp Ala Lys Ala Met His Asp Ile Ala Ala Lys Tyr Lys
50 55 60
Val Asp Thr Ile Met His Leu Ala Ala Leu Leu Ser Ala Thr Ala Glu
65 70 75 80
Ala Lys Pro Leu Leu Ala Trp Asn Leu Asn Met Gly Gly Leu Val Asn
85 90 95
Ala Leu Glu Val Ala Arg Glu Leu Asn Cys Gln Phe Phe Thr Pro Ser
100 105 110
Ser Ile Gly Ala Phe Gly Pro Thr Thr Pro Lys Asp Asn Thr Pro Gln
115 120 125
Asp Thr Ile Gln Arg Pro Val Thr Met Tyr Gly Val Asn Lys Val Ser
130 135 140
Gly Glu Leu Leu Cys Asp Tyr Tyr Tyr Gln Lys Phe Gly Val Asp Thr
145 150 155 160
Arg Gly Val Arg Phe Pro Gly Leu Ile Ser Tyr Val Thr Pro Pro Gly
165 170 175
Gly Gly Thr Thr Asp Tyr Ala Val Glu Ile Tyr Tyr Glu Ala Ile Lys
180 185 190
Asn Gly Arg Tyr Thr Ser Tyr Ile Gln Lys Gly Thr Tyr Met Asp Met
195 200 205
Met Tyr Met Pro Asp Ala Leu Asn Ala Ile Ile Asn Leu Met Glu Ala
210 215 220
Asp Ala Ser Lys Leu Gln His Arg Asn Ala Phe Asn Val Thr Ala Met
225 230 235 240
Ser Phe Glu Pro Glu Gln Ile Ala Arg Glu Ile Lys Lys His Ile Pro
245 250 255
Thr Phe Glu Met Asp Tyr Gln Val Asp Pro Ile Arg Gln Gly Ile Ala
260 265 270
Asp Ser Trp Pro Asn Ser Ile Asp Ala Thr Ala Ala Lys Ala Glu Trp
275 280 285
Gly Phe Gln Ala Glu Tyr Asp Leu Glu Lys Met Thr Thr Asp Met Leu
290 295 300
Ala Lys Leu Arg Asp Lys Leu Gln Val Gly Ala Gly Ile
305 310 315
<210> 60
<211> 396
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 60
Met Ser Ser Arg Gly Leu Gln Glu Phe Leu Gln Glu Asn Leu Ala Glu
1 5 10 15
Leu Lys Ser Lys Gly Leu Tyr Asn Val Ile Asp Pro Leu Glu Ser Ala
20 25 30
Asn Gly Pro Val Ile Thr Ile Lys Gly Lys Asn Leu Val Asn Leu Ser
35 40 45
Ser Asn Asn Tyr Leu Gly Leu Ala Thr Asp Gln Arg Leu Lys Asp Ala
50 55 60
Ala Tyr Gln Ala Ile Glu Lys Tyr Gly Val Gly Ala Gly Ala Val Arg
65 70 75 80
Thr Ile Asn Gly Thr Leu Asp Ile His Ile Thr Leu Glu Lys Thr Leu
85 90 95
Ala Glu Phe Lys His Thr Glu Ala Ala Ile Ala Tyr Gln Ser Gly Phe
100 105 110
Asn Cys Asn Met Ala Ala Ile Ser Ala Val Met Asp Lys Asn Asp Ala
115 120 125
Ile Leu Ser Asp Glu Leu Asn His Ala Ser Ile Ile Asp Gly Cys Arg
130 135 140
Leu Ser Lys Ala Lys Ile Ile Arg Val Asn His Ser Asp Met Asp Asp
145 150 155 160
Leu Arg Gln Lys Ala Lys Glu Ala Arg Glu Ser Gly Leu Tyr Asn Lys
165 170 175
Ile Met Val Ile Thr Asp Gly Val Phe Ser Met Asp Gly Asp Ile Ala
180 185 190
Lys Leu Pro Glu Ile Val Glu Ile Ala Glu Glu Phe Asp Leu Ile Thr
195 200 205
Tyr Val Asp Asp Ala His Gly Ser Gly Val Leu Gly Lys Gly Ala Gly
210 215 220
Thr Val Lys His Phe Gly Leu Ser Asp Lys Val Asp Phe Gln Ile Gly
225 230 235 240
Thr Leu Ser Lys Ala Ile Gly Val Val Gly Gly Tyr Val Ala Gly Lys
245 250 255
Gln Glu Leu Ile Asp Trp Leu Lys Val Arg Ser Arg Pro Phe Leu Phe
260 265 270
Ser Thr Ala Leu Thr Pro Gly Asp Val Gly Ala Ile Thr Lys Ala Ile
275 280 285
Asn Ile Leu Thr Glu Ser Thr Glu Leu His Asp Arg Leu Trp Glu Asn
290 295 300
Gly Asn Tyr Leu Lys Lys Gly Leu Lys Glu Leu Gly Phe Asn Ile Gly
305 310 315 320
Asp Ser Glu Thr Pro Ile Thr Pro Cys Ile Ile Gly Asp Glu Ile Lys
325 330 335
Thr Gln Glu Phe Ser Lys Arg Leu Tyr Glu Glu Gly Val Tyr Ala Lys
340 345 350
Ala Ile Val Phe Pro Thr Val Ala Lys Gly Thr Gly Arg Val Arg Asn
355 360 365
Met Pro Thr Ala Ala His Thr Lys Glu Met Leu Asp Gln Ala Leu Ala
370 375 380
Thr Tyr Lys Lys Val Gly Lys Glu Met Gly Ile Ile
385 390 395
<210> 61
<211> 5244
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 61
atgagtatac aaaagcaatc tcagatcgtt ttggatatgg agaagagtct caaaggccta 60
gaatatatcg atgaagttcg tgtggtacag gaagaaaaaa atataggtaa ccgaatattg 120
catgcaaaag atttaatccc aaccatgatg aaaacggaca gggtaaaagt agatacacca 180
aaatcagagc tactgcggca gagcgaacgg attgaaagct cccttccctt agcaattagt 240
catggggatg aattaaagag aacggctagc agtcctacta cgttaattga tgtattactc 300
aatgctgcta agctggagga taagggaatc atctatgttt tggataacgg agatgaaatt 360
gaaacgagct atcgacagtt atttgagcaa gctaagcgcc tattaaaggg actgcggaac 420
atggggatca agcctcaaga tcgggttatt tttcaatttc aaaaaaatca aaactatgtt 480
cctatgtttt gggcttgcgt attaggtggg tttattactg taccgattgc tacgtctacc 540
tcgtatgcag aaaagaatgc tgatactatg aagctataca atatctggaa catgctagat 600
aagccagtaa tcataacgga tgaagatttg ctggaggaag ttaacaagtt atcggcagtt 660
tgggaaacag aagagtttgt tatcagcacg gctgagcagt taatgaataa tgatgaggaa 720
caagaatttt atgcgtatca agaggaagat ttcgtcctgt atttgttaac atcgggtagt 780
actggtttac ctaaatgcgt gcagcataga aatgccagtc tggtggcgcg atcaatggcc 840
acagcacaat ttaaccaatt tgacagcgaa gaggttcttt tatgctggat gccattagag 900
catgttggtg gattggtcat gtatcatatt ttaggtgttt atttagcctg ccaacagata 960
ttaccacgag tagattcatt tatttctcag ccgctgaatt ggttgcattg gatggacaag 1020
tataaggcta ccttaacatg ggctcctaat ttcgcctttt ctctggtaaa tgaccaagaa 1080
aaggctattg aagaaggtag ctgggattta tccagtgtca aacatatctt aaatggtgga 1140
gaagcaattg ttgctaagac agctaagcga tttttatcca tattgcgtaa acatcaactc 1200
cgtgacgatt gcatgtaccc ctcttttgga atgtctgaaa cctcttccgg tattgtattt 1260
tccaaaacgt tccgcagtaa gcctcaatcc ggcgttcatt atgtagataa gatttcattt 1320
gaaaccctac ttcgatttgt agatgatacg gaaaaagaac atatttgctt tacagaggta 1380
ggaggaccaa tcccgggtgt aagtgttcga attgttgatg ataataacca gttgctacat 1440
gaaaatcata ttggccgcgt gcaagtcaca ggccctacaa tcatggcagg ctattatcaa 1500
aatcaagagg ccaatcagga ggcctttatt ggagatggct ggtttttcac aggggaccta 1560
ggattcctca atgaaggtcg attaaccgtt acaggtcgtc aaaaagatat cattattatg 1620
aacggcaaaa attattataa ttacgaaatt gaatcattag tagaaagtgt ttacggggtt 1680
caatctacgt tcgtagccgc cgcggcgttt attgatacct ccaaaagcgt tgaggagcta 1740
gcgatctttt ttactcctat aaatgatata gatgaaagat tcctaggttt tatcattaca 1800
gaaattagac gaacagtaag taggaattta ggtataacac cgaaacacat cattccgatt 1860
aagaaggaaa ccttttcgaa aacggaatct ggaaaaattc agcgaaataa attcactgag 1920
agactatatg ctggggaatt tgatgagatt atgaagcaaa tagagacaca aaatgaggac 1980
ccgcatcaaa cctttccaga atggatgtat gcaagaaagt gggaagaacg ccctatttct 2040
gtactagcct ccccgcttcc gcaagaaacg tatcttattt tccgcgataa attagggctg 2100
ggagatcaat ggtcagcctt gtctcagaat ggttctagca cgatgattat agtggatcag 2160
ggagaaagct ttaaaaagcg aggagaattc cactacgaaa taaaccagca tgagcaaaag 2220
gattatcagt tactatgtga ggaactaaaa aaagaatcag taaccattca tcatgttcta 2280
catctatgga catatcagga ggacttggat gaagcaaaaa aagagctgtc ctatttaaaa 2340
gaaagtcaat ttttaggtag ctttagcctt attttcttgt tacaagcttt acataaagaa 2400
gtaggtatgc caaaaacgtt gacattcgtt tcttccaatg tattctcgtt agaaaatcct 2460
gaacattcag cttacgagaa agcaacaatt ccaggaatca ttaagacaat gaagcatgaa 2520
tttcctaata caatcgctaa atttgtagat atagacacga accatcctcc ttatcatcca 2580
ttgcaagcgt tacaagcaga gcaggggctg ccggacgagg aaagcgtggt tatttattca 2640
aagggcaaac gatttgtccc aaaactgcaa caggtaaatc ttcaagaagc taacaaaaaa 2700
ccattgccgc tggtagaaaa aggattttat gtcgttacag gtggtcttgg cggggtcggc 2760
aagcaaattg ctcactattt gattcaaaag ttagatgcaa atattctact actcgggaag 2820
actgcgcttc caagtgcaac tgacaagaat actggtacac agagtgagcg ctatcgagca 2880
cttcgcgaat tagagaagct gaccactaga gaaaatcaag tcatctataa aagcacggat 2940
ttatcagatg taacagtatt ggaagaggca attcatgaat caatgttagg tatgaatgct 3000
aaaaaagtgg atggcattat tcatttagct ggcgtctatt atgaaaaacc attattggaa 3060
gaatctattg aatcactgga acatatgtat gaatctaaag tagctggaac ctatctatta 3120
gggaaattgg ttgaaaaata tcctgaagct attttggtta cctcctcttc ctccagtggc 3180
ttgttatcag gctatggagt gggagcctat acttccgcca atatgtttgt ggagacattt 3240
acggattatg tagcacaaaa acgtgatggc gtacagtgct ttgcatggag cttatggaat 3300
gatatcggtt tagggcaaca atttacagat atgaaagatt tattgataag aaaaggacat 3360
caaccgatct ctccacaaaa aggaatcttt tcatttgaat taggcttgat gcttgaacat 3420
aacatgctgt atatcgggct gaataaaggg gtttccgaaa ttgcaacgct atgttctgag 3480
gtggtagaaa aggaactagt aacgacggtt tatttctcca ctgcccataa tcaattctcc 3540
ttatctgaaa tgtacaacac catcagatta cacaccattg atgaaaaggg ccattccaca 3600
aaggttattt ttagacaggt cgatcagctc cactcagatc aggctgagga tcaccttgta 3660
agtaataacc gaagagtgga agaggaaatc aagcagctat ggtctgaaat cctacaggtc 3720
gaccagcttc aggcttttga tagtttcttt gatttgggag gaagctcact aaaggcaacg 3780
caattactct cctctgtaca gtcacgtttt ggtgtagcca ttcctctaaa ggtatttttt 3840
gaaaatccta caattaaggg attaatccat cgcatgggaa atgtagctgg gaccaaggcc 3900
agcgtcatgc gaagctatga gccagctaaa aagagggaaa agatcatcga cttatcccca 3960
tcgcaaagag ggcaatggta cttatatcaa acgcggccag acagtccttt ttataacatc 4020
accttcactc ttacctttga ggaaaatgtg aatagacaag ctttattaaa aagcatacag 4080
gcggttattc aatcacagga agcactacgt tccgaaatca gaatgatcga ggatgatcca 4140
aagctattta tccacgatac ctatgtagtg gggataccga ttatcgattt atcctcctat 4200
cctttggaac aaaaagaaca gaaactggca gccttgattg atacggaagc aaacactcct 4260
tttcaaatga tgcaggagtc tctgtcaaga tttaccctta ttcatgttgg cgacaatgtg 4320
cacatcctat taggaagcat gcatcatatt atttcagacg gctggtcaat acatgtattt 4380
accaaagagc tattacgttt ttataaggaa atacaagaaa acggtgtgat tgataggaaa 4440
gagctagatt atacctatac aagctatctg ctagaccaaa aggaatggct gacaaaagag 4500
aacccagaat atgctgatca actatattat tggaaagaga acttggcaac cgaaacagca 4560
ccattaaccc tgcctattca actctcacgc ccagctattc aaagctatca agggatgacc 4620
attgaacagc ttgtgaccca ggagttaacg caaaacatca tggaagcaag taaacatatg 4680
aaatctacac cgtatatgtt tttactttct acttttgcgg cgctattatt ccgctattcc 4740
gggcaggata ccttccgtct tgggacggtt ttagcgaatc ggaacatgcc gcatacagaa 4800
aagattatcg gctatctagc caatacggtg acactatcct ttgatttcaa ggaagagctc 4860
acttttgacc aattacttga taaggtaaga ggaatggcgt tagaagcgca tgataatcag 4920
aatgtcccac tagaaactgt tttacgtgag ttacaagtaa ggcatactgc cgatatggcg 4980
cctttgttcc aaattgtctt cacaatgcaa aatgcggtac agcatctcta tcaaaatgaa 5040
caaataaaaa cctacttgga cattgtttcc aataaaacct ctaagtacga tctaagctta 5100
catgtttatg aagaggagca ccaattgcga cttaagctgg aattcaatac cgacttattt 5160
gaagaagaag caatgaaaac cttgttggga cattacctga attttattca agtgatcacc 5220
aacgataagg tacacacagg ataa 5244
<210> 62
<211> 7476
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 62
ttgattaata cctcagacgt caaagacatt tatagtttat ccccgatgca acgaggaatg 60
ttgtttcata cattaaaaga caaagaaaac cttgcctatt ttgatcagac aacttttcaa 120
atagaaggtg acatatgtgt cgaatccctt gagaaaagtt ttaacgagct gattcgtaag 180
tatgatgttc tgcgtacgat ctttttatat cagaaattaa aagagccgat gcaggttgtg 240
ttaaaggaga gaacagcaaa cattcattat gaggatttct ctatgaagag cgagtcggat 300
aaagcaaagg ctcttcgtgt agcaaaacag agggaccggg acgagggctt tgacctctcc 360
cgggacatcc tcatgcggtt atctttatta aaagtcgccc ctaaccaata cgaattagtg 420
atcagtagcc accatattat cattgatggt tggtgcacag ggattttgta ccaggagctg 480
ttctattttt atcaatgctt cgtagcaaat caacctatcc ctgctgagaa atcgattccg 540
tatagcagat atattcgttg gcttgaagaa caggatgaag aggaaggaaa agcctattgg 600
ggtgaatatc tacaagattt cgagggggca tctgttattc ctaagcaaaa cgctaaggga 660
gagaaggaag tatgctccat tgataaggta accttccact ttgataaaaa gctgacggag 720
gaactggtgc aggtagcaaa agcttgccaa gtaacaataa gtaccttgtt tcaaacaatg 780
tggggcatcc tgctccaaaa gtataataac tcgcaggaag ctatatttgg atcggttatt 840
tcaggaagat caccagagat tcctgatgtg gaaaaaatag ttggaatttt tattaatacc 900
attcctgttc gcattcgtac attggacaag caaaccttca aggaattgct gatccaggtt 960
caggaggcat ctgtcaactc tgaaaaatat aattatctaa cattggctga tattcaagcg 1020
gttaccggat cgaatcatgc acttatccat catattgtgg catttgaaaa tttcccgatt 1080
gcctcggaca gcttcgtaga ttcgagcgag tccgattcag aagaattgaa agttgtgaac 1140
gtcatagacg atcatgaaaa gaccaacttt gattttagcg tgcaagttca gcttgataca 1200
gagttactag taaaaatctc ttataatcaa catctttatc atagaagctt tattgaaaat 1260
atctttcatc acctgcaaca gattgccggg tctatcattc ataacctaga tattcaaata 1320
aatgagatag atattgtttc taaggaagag aagaagcagc tattacgcta ttccactcca 1380
gccaagtcag attttccaat ggataaaacc attcatcagc tatttgagga gcaggtatca 1440
cggacaccag agcagatcgc ggtcgttttt aaaggggagt ccttcaccta tcgcgagtta 1500
aatgaaaagg caaatcaatt ggcatgggtg ctaagaaaac gggaggtaag acctaacgag 1560
atcgttgcga tcatggcaga gcactcccta gagatgttgg ttggggtgat tgggacttta 1620
aaggcaggtg cggcctatct tcctattgac ccatcctacc cagaaaaaag aatcgctcat 1680
atgctacaag atagcaaagc ggagcaacta cttatccagc ctcatttgaa tatgccacag 1740
gactttaagg gaagtgtctt atggttaaca gaagagagct gggcgaagga gagtacgacc 1800
gatctgccgc ttgcaacgag tgcaaatgat ctagcataca tgatttatac ctcaggctca 1860
acaggactgc cgaagggagt tatggttgag catcaagctc tggttaattt agttatgtgg 1920
cataacgagg catttggtgt aaccatgact gatcaatgca cgaaattggc gggatttgga 1980
ttcgatgcgt cggtgtggga gaccttccct ccgcttatac agggagcgac gcttcatgtg 2040
ttagaggaat cgagacgtgg agatatttat gctctgcatg aatactttga aaagaatgcg 2100
atcaccatta gcttcttgcc tactcaatta gccgaacaat ttatggagct tacaagcagt 2160
acattacgtg tgttactcat tggcggtgac cgagctcaaa aggttaaaaa gacaacgtat 2220
caaatcataa acaactacgg tccaaccgaa aatacagtag tcacgacgag cggccaactg 2280
catcctgagc aagatgtctt ccctattgga aagccgatca ccaatcacag cgtttatatt 2340
ttagatcaga acagacatcc tcagccgatc ggaatacccg gcgagctgtg cgtcagtggt 2400
gcagggcttg ctagaggcta cctcaatcag cctgaactca ccgtagaacg ctttgttgat 2460
aatccttttg tacctggaga gagaatctat cgcacagggg acttagttcg ttggagaatc 2520
gatggcagca tcgaatatct gggaaggatt gacgagcaag tcaagattcg aggatacaga 2580
attgagttag gtgagatcga aacaaagctt cttgagcatc cttccattag tgaggcgctc 2640
gtcgtggctc gaaatgacga gcaaggttat acctatctat gcgcttatgt ggtggcgact 2700
ggggcctgga gcgtatcttc attacgtgag catttaatcg aaacattgcc cgaatatatg 2760
attccagctt acatgatgga agtggaaaaa atgccgctta cggcaaacgg aaaggtagat 2820
aagcgagcgt taccagtgcc tgataggcaa agaatgaacg aatatgtggc acctgcaaca 2880
gagacagagg aaaagctagt tctactgttc caagagattt taggacttga gcgtattggt 2940
actaaagatc acttctttga attaggggga cattcgctga aggcgatgat gcttgtgtct 3000
cgtatgcaca aggagctagg tgtggatgtg cagttaaatg agatgtttgc tcgtccaacg 3060
gttaaagatc tatctgctta catagatcag atgaacggct ctgcttacac agcaattcag 3120
ccagtggagg aacagcctta ttatcctgtt tcttttgccc aaagaagaat gtatgttgta 3180
cagcaaatga gagatagtga aacaacgagc tataacatac cggttacgtt tgagctaaaa 3240
ggaaagctac atctggacaa gctgagagaa gcgttacaga ttctggttct acgacatgaa 3300
agtctgcgta catcctttca tatgattgat gaaaatcttg ttcaaaaagt gaataaagat 3360
atttcatggg atttagaagt aatagaagct caggagtcag agatagaagt aaaactgaag 3420
gaatttatca gaccgttcca tttaagtgag gctccgcttt tcagagctcg tttaatttgc 3480
ttgaatccac agcatcatct tttgagcttg gacatgcatc atattatttc agatggagta 3540
tctatgaacc tgttcctaca ggaattcatg acactctatc agggagaagc attgccagcg 3600
ctctctattc aatacaagga ttacgccgta tggcaacaat cagacaagca gcgagctaga 3660
ttaaaagagc aggaaaaata ttggttacat catttttctg gagagctgcc taccttagaa 3720
ttgccaacag attttccacg acctgctata cagcaatttg acggagatga atggtcgttt 3780
gaaatgaatg ctgatctttt agcgaaggtc aaacagatct gctctagcca aggcacgacg 3840
ttatatatga cgcttctcgc tgcttatcag gtattcttag ccagatatac cgggcaggag 3900
gatatcattg taggttctcc aattgctgga cgttctcatg ctgatttgga aaacatgata 3960
ggtatgtttg tcaatacatt agctttgcgc ggtaagccaa aggcagatca atccttcctc 4020
tcctatttaa aacaggtaaa agagaccgta ttccaagcat acgcgaacgc agaatatcca 4080
tttgaagagt tgattgaaaa actcgattta gaacgagata tgagccgtca tccgctattt 4140
gataccttgt tctctttgca aaatatggaa atatctgagt tccaaatgaa taatctagag 4200
atttttcctt atgaaacggg acaaaagaat gcaaaattcg ctcttagctg gttaatagca 4260
gaaggagagt ccctttatgt aacaatcgaa tacagcacca aatgctttaa gcgagaaacc 4320
attatacgca tggcaagtca ttttgaacaa ctgctagccc aaattgttga gcaaccggaa 4380
gcgcgcattg gccatctgga gttagtagca gatgcggaaa gaaaaatgtt actggaagac 4440
tttaatctga caaaagtcga ctatccacga gaaaaaacaa ttcaagaatt atttgaagag 4500
caggtggaca aaaaccctga tcaaatcgcg cttatatgtg gagagcaaca gtttacctac 4560
gaacaattaa atgtgaaatt taaccaatta gctcacgtat taagaagaga aggcgttcaa 4620
cccaatcagg taataggtct aattacggat cgatcgctgt cgatgattgt aggcattttt 4680
ggaattataa aagcaggtgg gggctatctg ccaatcgatc cgacctatcc aaccgaaaga 4740
attgaataca tgcttgaaga tagtcaaact cacctattgt tggtacaaca cagagacatg 4800
gttccagcag gttatcaggg agaggttttg ataatagagg atgagataag tcgagatgaa 4860
caagtagcta acatagaatt gatcaatcag ccgcaagact tggcttatgt catgtacaca 4920
tctggctcta caggtaaacc aaaggggaac ctgactactc atcgaaacat tatcaaaacg 4980
gtatgcaata acggatatat tgagataacg actgaggatc gtcttttgca gttatctaat 5040
tatgcttttg acggctctac ctttgatata ttcagctcgt tattacacgg agcaacgctg 5100
gtactggtac caaaagaagt gatcttgaat ccaacagact tggttacatt gatacgcgaa 5160
cagcagatca ctgtatcgtt tatgactacc tcattgttta atgcattagt ggaactggat 5220
gtaagcagtt tccaaaacat gcgcaagatc gcatttggag gagaaaaggc ttcctttaag 5280
catgtggaaa aggcattgga tttcctcgga aatggacgat tggtgaatgg atatggtccc 5340
acagaaacaa ccgtttttgc tacaacctac actgtagatg agcgtataaa ggaatggggg 5400
attataccga ttggtcgacc gctacataat accacggtcc acattttaag cgctgatgac 5460
aagctacagc caattggagt cattggagag ctgtgcgtaa gcggtgaagg attagcacgc 5520
ggttacctta atctaccaga gttgacgatg gagcgatttg ttgaaaatcc atttagacct 5580
ggtgaaagaa tgtaccgcac aggggacttg gctcgttggt taccagatgg ggttcttgaa 5640
tatgtaggac gcaaggatga acaagtgaaa attcgcggac atcgcattga gcttagtgaa 5700
attgaaacaa ggatattgga gcatcctgcg atcagtgaaa cggttctgct agccaagcga 5760
aatgagcaag gtagctcata cctgtgcgct tatattgttg cccatggcca atggaatgtc 5820
caagaattgc gcaagcatgt aagagatgct ttgccagaac acatggtgcc ttcttatttt 5880
attggcttag acaaacttcc acttacctcc aatggtaaag tcgacaaacg agcattgcca 5940
gaaccagagg gcagcttgca actgactaga gaaattgtcg ctccacgcaa tgaaactgaa 6000
aaacagttag ttgaaattgt tgctgaggtt ctgggactag aagctagtga aataagtatt 6060
accgataatc tttttgagct aggtggacat tccctaacga ttctaagaat ccttgctaag 6120
gtccatacat gtaactggaa gcttgaaatg aaagacttct ataattgcaa gaatcttgag 6180
gaaatagcaa gcaaggcatc tgatatgcag gaaaatcaaa atctgtctgg cagtggctcg 6240
gtctttaaaa agggtgggaa gaaatcaatc ccggtagtac ccgtctgcga tagacaaaaa 6300
gagatggagc atgttttatt gctcggctcc actggtttct taggtattca tttgctacat 6360
gagctactac agaaaacaga agcgacaatt ctttgcgtca ttcgtgcaga aaatgatgag 6420
gctgctatgc aacgactacg caaaaaaatt gatttttact ttacctcaca gtacagtagc 6480
tcccaaattg atgagtggtt tacccgcatc caaatcattc acggtgatat tacgcaagac 6540
aattttggat tagaggcaaa acattacgag tcgctaggag ctatcgttga cactgtcatt 6600
catacggctg cattggtgaa gcactacggg cactatgaag agtttgaaag agcaaatgta 6660
catggaactc agcaagtagt taccttttgc ttgaacaata aattaccaat gcactatgtt 6720
tcaaccctga gcgtttcggg aaccaccgtt gaagaagcaa cagagcttgt agaatttacc 6780
gagaaggact tttatgttgg tcaaaactat gagtcaaatg tatatctgag aagtaaattt 6840
gaagccgaag ccgtacttgt tggcggaatg gaaaacggac tcgatgcacg tatctaccgg 6900
gttggcaatt taacaggacg ctttcaggac ggatggttcc aggaaaatat caatgaaaat 6960
atgttttatc tcctatcgaa agccttcctt gagcttggag gttttgatca ggaaattatg 7020
cagggtatgg ttgatttaac ccctattgat atatgtgcac aagctattat acacatcatc 7080
aacagcaaag gaattgagga aagagtcttc catttacaga atccgcactt ggtaacatac 7140
gatgatatgt atcgtgtatt tgaagggctt ggcttttcta gacgggtaca aagtcgagaa 7200
gatgttacac gtgaactaga tgtaatgatg tctcagggta atgaaaagct atttttggct 7260
gggattctga ccacgatgtt ggatgatgta gagcgtgctg aacaatttaa tgttgcagtc 7320
gattcaagta ggacaatgca gctattagag gatacctcgt ttacctatcc tgttcctgat 7380
gatgagtatt tgcgcaagct ggctatgcat atgatcaaag ttgggtttgt tactcctaat 7440
catactgttg ctgaaaagat aggaactagt cgttag 7476
<210> 63
<211> 7581
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 63
atgttaagta aagcaaatat taaagacatc tatacattat ctccgctaca aaaaggcatg 60
ttatttcagc atttaaaaga agaaagcacg gcttattttg agcaattaca ctttacgatt 120
aagggacaac tatatgtaga tagcttcgaa gcaagctttc agcatctcat aaacaaatat 180
gatgtgctac gaaccgtttt tctgtataaa aatatgaccc agcccatgca aatggtttta 240
aaagaaagaa aaacaagtgt gcattttgaa gatatctccc acctagattc taaagccgtg 300
agtgaatatg ttgaagagtt taaaaatcag gatcgggaga agggatttga actctcgaag 360
gacattctca tgcgttttgc tattttgaag gctggtgctg agtcctatca tttaatttgg 420
agcttccatc atattttaat ggacggctgg tgcatgggca ttgtgttaca ggatttgttc 480
agaatgtatc agcagcatcg tcaaaatata ccgattaccg ttgagagcgt tcctgcctat 540
agcgagtata tccgttggct tgagaagcag aatgtaacaa aggcaaggga ttactggaaa 600
aattacttag agggctatga ggaattaaca ggtatcattc atctcgatac gaagcataca 660
agtcataaca acgaggtaca ggaatgcgcc tttacactgg ataaggacat aacggaagga 720
cttactcagc ttgctcgtca ttattcagtg acagtaaata cgctttttca aacaatttgg 780
ggcatgctgc tgcaaaagta taacaataag gatgatgttg tgtttggtgc ggtcgtatct 840
ggccgcccct ctgaaatcca tggcgtagaa aacatggttg gcttgtttat caacactgtc 900
cctattcgta ttcaaaaaca aacgaatgat acctttagcc atttattaaa aagagttcac 960
gaatctacgc tattgtcgaa acagtatgag tttgtatcct tggcagatat tcaaaccgat 1020
gcgggatttt ctggtcaatt gctagatcac atcttagttt ttgaaaacta tccgataagt 1080
gaaggttctt ttgaggaaga agaatttacg atggatagta taaaaaccta tgagaaaaca 1140
agctatgacc taaacgtgat gattcggcct aatgaggatc agcttgatat tgccttccaa 1200
ttcaacgatg acgtgtactc aagcgaaaat gtaaaaagac tgttccagca tatgaagcaa 1260
ctggctttag ctgtaatcaa gaatccggat gtgcgcttgg aagaaatagc aatgatcaca 1320
gaagaggaac gctatcaaat cttgcacgat ttccaagggg agatagttga ttttgtaaca 1380
gaaaaaacgc tccctgaact gtttgaagac caggtgaaac gaactccaga agcaattgca 1440
cttcgatttg aagatcaaca attgacctat caggagctaa atcagcgagt aaatcaatta 1500
gcttggacac taagaatgaa gggcttgcag caagaagaac tcgttggaat tatggtgcag 1560
cgctcattag aaatgatcgt tggtgtgcta gccgttataa aagcaggcgg tgcatacgta 1620
ccaattgatc cggaatatcc gctcgaccga atccaatata tgctggaaga cagtggaacc 1680
aattggctgt taaccacgaa acagagcgaa attccttcca tctatcaagg gcatgtcctg 1740
tatcttgagg aagatacggt gtatcacgag cggtcttcag atgtagagat tgtaaatcaa 1800
tccagcgact tagcttatat tatctacacg tccggttcta ctggtcagcc taagggtgtc 1860
atgattgatc atcgtgctgt tcataatttg catttgtcag caggaatcta tggaatcgca 1920
cagggaagcc aggttttgca gtttgcctct ttaagctttg atgcttcggt gggtgatatc 1980
ttccacagct tattaacggg agctaccttg catcttgtaa aaaaagagca attgctatcc 2040
ggacacgcct ttatggagtg gttagacgaa gctggcatta cgactattcc gtttattcca 2100
ccaagcgtcc taaaagaatt accatatgca aaactgccta agctcaaaac aatcagtact 2160
ggcggggaag aattaccggc cgatttagta aggatttggg gagcaaaccg cacattttta 2220
aatgcgtatg gtccgacaga aacgacggtt gatgcttcga ttggtaattg tgtagagatg 2280
acggataagc cttcgattgg tacgccaacc gttaataagc gagcgtatat tttggatcaa 2340
tacggtcata ttcagccaat cggtgttccc ggggaattat gcgtaggtgg agaaggcgta 2400
gctcgtggat atttacatag acctgagctt acagatgaaa agttcgtgaa cgatccttat 2460
gtaccaaacg ggagaatgta taaaacggga gacttagcta gatggttgcc ggatggaaca 2520
atcgaatttt taggccgtat ggatggccaa gtaaaaattc gtggatttag gattgagctt 2580
ggagaaattg aagctcggct aaaccaagcc ccatctgtaa agcaagctgt ggttctagct 2640
cgttcaggag aacaaaagca ggtataccta tgcgcatatt tggtgacgga caacgattta 2700
aaggtttcag ccctacgtaa ggaattaagt caaacgttac cagactatat gattccatcg 2760
ttttttataa aagtcgataa gattccagtc acagtaaacg gcaagataga caagaaagcc 2820
ttgccagaac cagaaaaaga agtagagctg caaaccgaat atgtagctcc aacgaaccca 2880
acagaggaga ttcttgtaca gatttggcaa aaggtgctgg gaatggagcg agtagggata 2940
gaggataact tctttgagct aggtggtcac tctatcaagg caatgatgct tgcttccaat 3000
atttataagg aattaaagat tgatctgcct ttgcgtgaga tttttaagca tacgacagta 3060
aaagaaatgg cgcgttttat cgacggtcgg gatgaagaag aatacgtcgg aattcaaccc 3120
gcagccaaaa aagaatacta ccctgtctct tctgcacaaa aaaggatgta tgtcattcaa 3180
tcattggaag ataaggctca aggcacgagc tataacatgc cgtctttcta taaaatgaag 3240
ggctcggtag atgcagagaa attagagaag gtattccaaa cattaatgga tcggcacgaa 3300
tcattacgaa cctcctttca tatgatcgag gagcagctag ttcaaaaggt tcacgaacag 3360
gtttcatgga aaatggacat gaaaaccgtc agcgccaatg atgtttcaag attaaaggat 3420
tcgtttgtcc aaccgtttga catcagtaca gctcctttgt tccgagccag tcttcttacg 3480
attcagaaag atgagcacat tcttatgatg gatgtacacc atattgtagg agacggtgtt 3540
tcgaccacga tcttgttcca agagcttatc cagttgtatc aagggcaagc gctacctgaa 3600
gtgaaggtac actataaaga ttacgctgtg tggcaattgt cccagcagga tcgtttgaaa 3660
gaaagcgaaa atttctggtt gcagcaattt tctggagagt tgccggtatt ggagctacct 3720
actgattatt cacgtccccc aattcgccga ttggaaggag aatatgtaag ccaaagccta 3780
cgtggtgaac tccatgaaag cgtaaaagcc ttcatgaaaa atcacgaagt aacgctatat 3840
atggtactgc ttgcgacata taacgttctt ctgcacaaat acacgaatca gcgcgacatt 3900
attgttggta cgcctgtttc ggaccgaccg catccagatg tcatgtccac tgtcggtatg 3960
tttgtaaata cgctggcagt ccgaaatcag ttgaagtctg agcaaacctt cgaagagttt 4020
ttagcaaatg tgaaaaataa aatgctagag gtctatggtc atcaggagta tccgtttgaa 4080
gatgtaattg aaaaagtaaa ggttcaaagg gatacaagca gacatccgct atttgacaca 4140
atgtttggtg tacaaaatct ggagatatcc cacgtggagc tacccgattg gggtatagaa 4200
gcattggata ttgactggac taactccaag tttgatatga gctggatggt atttgaagca 4260
gacggtctag aaattggcgt ggagtatagc acaagcctat ttgagcgcaa tacgattcag 4320
cgaatgatcg gacactttga acatatcatc gagcagatta tggaaaatcc tcaaattcgt 4380
ttagctgata ttcagttgac gacagaagat gagagaatcc aaatcttaga ggaattcaat 4440
catcaaccaa caaaaataac ctacgatcag gcaatccaaa acagatttga agaacaggct 4500
atgaagacac ctgatgcagt ggcacttgta tataaaggtc aggagttaac ctatcgtgag 4560
cttaaccaaa gatcaaatca gatggctcgt acattaagag agcatggggt cgggcgtgat 4620
caaataatag cggtcatgat taatcgttca catgagctga tcattagtat cctagccgta 4680
ttaaaggcag gaggagcata tctgccaatt gatccaacgt acccgcttga tcggattgaa 4740
cacatgctag aggatagcca gactgcaatg ctgttaactc aaaaagaaat ccaaatacct 4800
acaggatatt cgggggaagt tctcttcgtt gatcaagctg atatttatca tgaggatgct 4860
acggatttat ctagtatgaa tcagcctgcg gatttggcct atattattta cacatcaggc 4920
tctactggaa agtccaaggg agtaatgatc gagcatcgtt cattacataa tctgattcat 4980
atttctcacc cctataaaat gggagcagga agcagagtcc ttcaatttgc ctctagcagc 5040
tttgatgcct cggtagcaga gatctttcca gctcttttaa ctggatcaac tttatatata 5100
gaagagaaag aggagctatt aaccaattta gttccctact tacttgagaa tcaaataaca 5160
acagtagcat tgccgccatc tttattaaga tccgttcctt atagggaact gccagcttta 5220
gagtgcatag ttagtgtcgg agaagcttgc acatttgaca ttgtacaaac ttgggggcaa 5280
aaccgcacct ttataaacgg atacggccct acagaatcaa ctgtttgcag tacctttggt 5340
gtggttacag cagaggacaa gcgtatcacg attggtaaac cgttccctaa tcaaaaggcc 5400
tatatcatca atgaaaatca acagctacaa ccaatcgggg ttccaggtga gctttgcata 5460
gcaggggctg gattaagccg tgggtacttg aatcgtccag agctgacaca agaaaaattt 5520
gtaaacaacc cctttgcacc tggtgagcgt atgtataaaa caggagacgt agctcgctgg 5580
ttgcctgatg gcaatatcga atatgtcggt cgtatggatg atcaggttaa agtacgcgga 5640
aatcgggtcg agcttgggga ggttaccagc caattactta cgcatccttc gattacagaa 5700
gctgttgttg taccaatagt cgatacacat ggagcaacga cactatgcgc ctatttcatc 5760
gaggataaag aagtgaaggt caacgatttg cgccatcatt tggctaaagc tctacctgag 5820
tttatgattc ctgcttactt tattaaagta gatcatattc cattgacagg aaacggaaag 5880
gtaaataaac aagcattacc tgacccttcc gaattcattt cagcacaaac aggccatgaa 5940
atcgttgccc cttcttctca ggacgaggaa atactggttc aggtatggga agaagtcctc 6000
cagatcaaac cgattggggt agaggacaac ttctttgaac gaggcggaga ctccattaag 6060
gcattgcaaa tcgtagctag acttagcaaa tataatcgga aattggatag tagacatatt 6120
tttaaaaatc caacgatttc catgctggct ccttaccttg aacaaagagg tgctttgatt 6180
gaacaagatt caattgaagg cgaagtgccg cttacaccga ttcaatcctg gttctttgaa 6240
caaccctttg tgcatccaca ccactttaat caatctatgc ttctgccaaa tgaacaaggc 6300
tgggatcgtc aacgaataga acaagcattt acaaccattg ttaaacacca tgatgcctta 6360
agaatgaagt accagtttag agagaagatc attcaagaaa atcagggtat cgagggagag 6420
ttttttaccc tgcatgaggt ggatgtaacc aaggaaagag actggcaaat gcgcatcgaa 6480
cgagaagcga atcaactcca agcaagcttt gatttgacaa caggccctct tgtaaagctt 6540
ggcttatacc atacggcata tggcgattat cttctgattg tcgtacatca tctcttaatt 6600
gatggtgtct catggcgcat cctgctggag gatttccaga cgctttatga gcaaaagggt 6660
gaactgccgg cgaaaaccac ttcctttaag gcgtgggctg tacaactgga ggggtatgct 6720
cgcagcaaaa agctacaaga cgaggcaagc tactggaaag ggttgttgaa taaatcggta 6780
agagagctgc ctgcggataa ggaatcaagc gatacattcc tctttggaga tacaaaagaa 6840
gtacagctta cctttgatat aaatgaaacc caagacctgc ttacggatgc ccaccatgct 6900
tataagacaa aagcggatga tttattgctg gcagcgttgg ttcttagcat aaatgagtgg 6960
acgaagcaaa gcgatatcat agtgaatttg gaaggtcatg gccgtgagac gatcgtcgaa 7020
ggcattgatt tgagccgtac aattggctgg tttactacaa tttatccagt tctatttgaa 7080
gtagagaacc atcaactttc cagcgtgatt aaacatgtaa aagaaacgct gcgcaatgta 7140
ccgaataatg gtatttgttt tgggatctta caacacatgt ctcattttga tgtaagccag 7200
agccaattaa gttctcatca cataagcttc aactacctag gtcagatggg agaagattcc 7260
gctagtcagt ctgagacgga taatggagtc cttatcaata caggagacca gataagccca 7320
atgaacgcaa atccgggctc gcttaatatg acttgccttg taatgaataa tacgttgctt 7380
gttacttttg attataatcc gcaacgttac gaacaggaga caattcaacg tctggcagat 7440
cgttataaga gcaatttaaa agcagtcctc gatcattgtg ttcaacgaga gcagacagag 7500
cgaacaccta gtgattttag tacgaagaag ctttctttag aggacttaga cgacgtgttt 7560
gcaacactta aaaatctata a 7581
<210> 64
<211> 7626
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 64
gtgcaaaaaa aagacaagat caaagatatc tattcacttt ctccgttgca aaagggtatg 60
ctatttcatt ccatgaaaga cccgcaaagc gatgcctatt tcgagcaggt tacccttttg 120
ctggaggggg ttgtaaaccc aacctatttg gctgaaagta ttcagggact cgtacaaaaa 180
tacgacatgt tccgaagtgt gttccgctat aaaaaagtag accctgttca ggttgtgctt 240
agtgaacgaa aaatagattt acagattgag gaccttactc aaatcaatga agaagagcaa 300
cggaaattca ttgaggaata tagaaaaaaa gaccgggaaa gaggcttcga cctttcccgg 360
gatatcctgc tacgttttac attgtttcaa acagccgcca atcggtatga attactgtgg 420
agtcatcatc atatcctgat ggatggctgg tgtacgggta tcgtttttca ggatttattt 480
caaatgtacc aacgtcgctt gtcaggacag gccttacttc cagaggtggc ccctcaatat 540
agcgaatata tacgctggtt aaagaaacaa gatgaccaac aagcattggc attttggaag 600
gagtatctac aggggtttga aaaccttacg ggaatcccgc gtctaaggtc aggcaatcat 660
ccctacaagc aagaggaatt cattttctcc ttgggagagg aagctacaca aaaactaacg 720
caaacggctc aaaagtatca ggtgacctta aatactgttg tgcaaacaat ttggggagcg 780
ttattgcaaa aatacaataa cacgaatgac gcggcctacg gtgtggttgt ctccggacga 840
cccgccgagg tgccaaatgt tgaacaaatg gtggggttat tcagtaatac gattccaatc 900
cgtattaaaa aagaagcagg aaaaacgttt ggggaagtgc tgaaaaacgt acagcaaaca 960
gcgctggagg cagaaaaata cggatatctt tctttagccg atattcaggc gagcgcagct 1020
tatacgcatc aattgcttga tcatatttta gcgtttgaaa atttcccgat ggatcaagaa 1080
acatttaatc aagaaaacgt actcggattt gccgtgaagg atgcccacac gtttgagcag 1140
acgcactatg atctgaccgt gctagtcatt cctggcaagg aattaatctt taagtttatg 1200
tataacgaaa gtgtttattc aaaagagtac ctcaatcttt tagagctgaa tatgaaaaag 1260
ctggtctctt tggttattga gcagcaggat atctttgacc cagctaccga gtttgtatct 1320
gatttggaaa aggataagct tttaaccatt tttaatcgta cggatgcaaa gtacccaaga 1380
gaaaaaacga ttcatgagct gtttcaagag caggttgaca agaaccctga tcaagtggca 1440
ctcgtatttg gcgaggctca actaacatac cgcgagctga acgaaaaggc gaatcaaatg 1500
gcccgcggtt tgcgcaaaca aggggtttta cctgatcagg tgatagggtt acttacggat 1560
cgttccttag agatgatcat agccattcta gcgatcttta aagctggtgg cgcttatatg 1620
cctatcgatc catcttatcc gagtgaacgt attcaataca tgctagcaga tagtcgtacc 1680
catttgctat tggtgcaaaa agctgaaatg atcccagcta attatcaggg tgaggtacta 1740
ctgttaacag aagatagctg gatggatgag aatacagata atttagattt ggtcaaccaa 1800
gcacaagacc ttgcttatgt catgtatacc tcaggttcaa caggtaaacc aaagggaaat 1860
ctgacaaccc atcaaaatat cgtcaagacc atcatgaaca atggttacat ggagattacg 1920
ccaaatgatc gtcttctcca gttgtccaat tacgcgtttg atggatctac ctttgatata 1980
tacagcgcat tgttaaacgg agcttctctt attttagtac caacgcatgt actgatgaat 2040
ccgactgatt tggcatcggt cattcaagac cagcatatta ccgtgtcctt tatgacaaca 2100
tctctattta acactctggt tgagctggat gtgactagtc tcaaacacat gcgtaaggtg 2160
gtgtttggag gagaaaaggc ttcgatcaag cacgtagaaa aagcgctgga ttatttggga 2220
gctggacgtt tggtcaatgg gtatggacca acagaaacta ctgtttttgc cactacctat 2280
acggtggacc atacgatcaa ggagacgggg attatgccta taggtcgccc gttgaacaat 2340
acgaaggtgt ttattttagg agcagacaat caactacagc cgataggtgc attaggcgag 2400
ctatgtgtga gcggggaagg gcttgcccgc gggtatctca atcttccaga gctgactgct 2460
gatcgtttcg ttgagaatcc ttttatgcag ggagagagaa tgtatcgcac aggggattta 2520
gcgcgttggt taccggatgg aagcattgag tacgtaggta gaatagatga acaagttaag 2580
attcggggac atcggatcga attaggggaa attgaagcta gattactaga gcatcctgct 2640
attagcgaga ctgttttgct ggcaaagcag gatgagcagg gtcattcctt cctatgtgcc 2700
tatctggtga caaatggtgc ttggtcagtc gcagagcttc gcaagcatat caaggaaaca 2760
ttgccggact ctatggtgcc atcttatttt atcgagatag ataaaatgcc gctcacttca 2820
aatggcaagg cagacaagcg tgcattgcca gagccagatg ttcaacaagt aagctcttat 2880
attgctcctg agaccgaaac agaggaaaag ctggttcaat tatttcaaga aatcctaagt 2940
gttgaacaag tcggtacgca ggataatttt ttcgagctgg gcggacattc gttaaaagcg 3000
atgatgctgg tttcaagaat gcacaaggaa ttagatatag aggtaccgct caaggacgtg 3060
tttgctcgac cttcagtaaa agaattggcc gcatttctta caaacacaga agtgtcggat 3120
tatatagcga ttgaaccggc ggcaaaacag gaattttatc cggtttcttc tgcacagcgc 3180
cgaatgtatg tagtagagca aatcggtagc agtaatacaa ccagctacaa tatgcctttt 3240
ttgcttgaaa taggaggagc cctcgatgta gtagggttac aaaaggcatt aaagaaactg 3300
gtcataagac atgaatcgtt gagaacgtcc tttcacatgg ttgatgaggt attaatgcag 3360
aagatccatc ctgacgtgga atgggattta atggtcatgg aagcaaaaga cgaggacctt 3420
ccccaaatca ttgatggttt tatccagccg tttgatttaa gtgacgcttc tttatttaga 3480
gcgggactcg tacgaatgga agctgatcga catctactga tgcttgatat gcaccatatt 3540
atttcagatg gggtatcaac caatgtatta ttccaagacc tgatgcaaat ctatcagggc 3600
aaggagctcc cttctcttag aattcaatac aaggattatg ctgtttggca acaggcagaa 3660
gcccaggtta atcgtttacg agaacaggag cagtattggc ttaaccaatt ttcgggagag 3720
ttacctgtac tggaaatgcc taccgattac actcgtccat ctattcagca gtcagaaggg 3780
gatatatggt catttgaaat tagtgccgag atcataaaca aagtaaagaa actgtcctcc 3840
tcgcagggta caacgttgta tatgacattg ctggccgcct accaagtatt attgtcaaaa 3900
tatacggggc aagaggacgt tattgtgggt tctcctattg ctggccgacc tcatgcggat 3960
gtagaaaaga ttgttggtat gttcgtgaac acgttagcct tcagagggca gccaaaatca 4020
actcaaacct ttagtacata tctgtccgag gttaaggagc aggtattgca cgcctatgac 4080
aatgcagaat atccgtttga ggaattactt gaaaagcttg atttagaaag agatctaagt 4140
cgtcatccac tgtttgatac catgtttgct ttgcagaata tggaaatggc tgaaatcaat 4200
atcatggatc tctcctttca gccgcgggat ttaacatgga aaaatgcaaa attcgacctg 4260
acatggatga tggcggaagc ggaaaatttg tatgtcacca ttgagtatag tacctcgctc 4320
tttaagccag aaacgattga gcgattaggc aaacgattca cccatttact aaaacagatc 4380
ggggatgctc ctgaacgttt gattgctgac ttagaagtag cgacggagga tgaaaaacat 4440
cagattttat cggtatttaa tttgactcaa tcggattatc cagtaaataa aacagttcat 4500
cagctctttg aggagcaagt gcaaaatatg cctgatcaaa aggcgatagt atttggtgaa 4560
gagcaagtaa catacaaaga attaaacgcc aaagccaacc atctagctac cctcttaaaa 4620
caaaaaggca taacaaacga gcaacttgtg gctgttatga ttgagccttc catcgagttt 4680
tttgtaggca ttctagctgt tctaaaagca ggaggggctt atctaccaat tgacccaact 4740
tatccggcgg aacgaattgc ctatatttta gaggatagtc aatcgaaggt tctgttagtg 4800
agaggtcatg aacaggtaca gacacaattt gctggggaaa tcttggaaac tgatagcaag 4860
aagttgtcta ccgaagagct gaaagacgta cctatgaata acaaagtaac cgatctagcc 4920
tatgtcattt atacatcagg gtccactggg caaccaaaag gtgtcatggt ggagcataga 4980
tcgttaatga atctttcagc ttggcacgtt cagtattttg gcatcacaaa ggatgatcga 5040
agcatcaaat acgcaggggt tggatttgat gcatctgtat gggaggtctt cccttactta 5100
atagctggtg caacgattta cgtcatcgat caagagacaa gatacgatgt agaaaaactg 5160
aatcagtacg taacagatca agggattaca atcagctttt tacctacgca atttgctgaa 5220
cagtttatgc tgacagatca tacggatcat actgccctac gctggttgct tatcggcggt 5280
gataaagccc agcaagccgt tcagcagaag aagtatcaga ttgtaaataa ctatgggcct 5340
actgagaaca cggttgtaac aaccagctat atagtgagtc ctgaggataa aaaaattccg 5400
atagggcgtc caattgctaa taatcaggta tttatcctga ataaagagaa tcaattacag 5460
ccagtaggga ttccaggtga actatgcgtt agcggcgaca gcctagcacg cggctatctg 5520
catcgtccag agttaacgag tgagcgtttt gtagctaatc cgtttgtccc tggtgaacgc 5580
atgtataaaa ccggagatat tgcccgctgg ttaccagatg gaaatattga gtatctaggt 5640
agattggatg atcaaattaa gatcagagga taccgggttg aattaggtga gatcgaatcc 5700
gctattttgg agcatgaagc aattcaggag acagtagtgc tcgcaagaca agacgatcag 5760
aatcagacat atctatgtgc ttatgttgta ccgaaaaaat cttttgatgt agccgagctt 5820
cgtcaatatc taggcagaaa gctacctcat tttatgattc cggccttttt tacggaaatg 5880
acagagttcc caattacatc gaatgggaaa gtagataaaa aagcactccc actaccggat 5940
ttgtccaagc aatcagagat cgattacgtt gccccaacca ccacgttaga agaaacgctg 6000
gcggaactat ggacagaagt gctaggagtt tcccaagtgg gaatccatga taacttcttt 6060
aaactgggtg gggattcaat caaggctatt cagattgcag caagattaaa tacaaagcaa 6120
ttaaaattgg aagttaagga tttattccag gcacaaacga ttgctcaggt tattccatac 6180
atcaaaacca aggacagtaa agctgagcaa ggaattgttc aaggaaaggt agagctaacc 6240
cctatacagg aatggttttt ccagcaatcc ttcgatattc cacatcattg gaatcagtcc 6300
atgatgtttt atcgaaagga agggtgggat cagcacgttg tacaaagggt gttccaaaaa 6360
attgcagaac accatgatgc cttgcgaatg gcttatcagc aggaaaatgg caaaacgatt 6420
cagatcaatc gcggagtgga aggcaagttg tttgagctaa gcatttttga ctttagacaa 6480
caggcgaatg tgccagagct gatcgagcaa gcagctaatc gtctacaatc cgcaatgaac 6540
ttgcaggacg gtccattggt tcaactggga ctctttcaga catctgaggg ggatcatctt 6600
ttgatagcaa ttcatcactt agtggttgat gccgtttcat ggcgaatcat tacggaggat 6660
ttcatgaatg gctatcaaca agatttgcag ggcgagccga ttgcatttac gagcaaaaca 6720
gactcctacc aaaaatgggc caagagcctg ctagagtacg ctactagtga agaaattcaa 6780
tcagagctga aatactggca aagcatgatt gcaaaagggt tacctgcatt gccaagagat 6840
tcaaaaggag gtgccccgta tctactcaag gatatacaag aggtcgctat ccaattgaca 6900
aaagagcaaa cgaataaact attaacggat gcccataacg cgtacaacac acagattaac 6960
gatcttttgt tgacagcatt agctctaact attcaggaat gggcacaaac caattcaatc 7020
gcaattacac tagaaggcca cggacgcgag gatattgggg tggacattga cattaaccgt 7080
acagttggtt ggtttacgtc catgtatcca gtggtatttg atttgcagaa gcaagggatt 7140
gcaaatacgg ttaagcaagt aaaagaagag ctgcgacaaa taccgaataa agggattggc 7200
tatggggttg ttagatacct atcgaatcaa ggaagtacag agctggatat aagctcccat 7260
gcgataaatc cagagattag cttcaattac cttgggcaaa tggatcaatc tggacaggaa 7320
gaggagtatc aattgtcccc attgtcttcc ggtcaacaga ttagtcagat gaatcaaggc 7380
ttgttcccga taaatgtgag tggaattgta gtggaaaatc agttgtccat tcaaatatct 7440
tatgatagcc aagcttatca tgattctact atggaaaagc tgattcaacg ttatcaatat 7500
cacttgttgg agattattaa tcattgtgtt cagcaaacag aaacagaatt aaccccgagt 7560
gatttttcca ccaaagagct ttcgatggag gatttagaat cagtatttga gttactagat 7620
gaataa 7626
<210> 65
<211> 13854
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 65
atgtttagca gaagtaatgt gcaaaatttg tatcgcttat ctcctatgca aaaagggatc 60
ttatttcatt ccttaaaaga taaagaaaat catgcctatt ttgatcaact gatctttact 120
ttggaaggta aggtagagct tgaatatttg gaagaagcct ttaaccaatt aatcaaaaag 180
catgatattt tacgaactgt ttttcgctac aaaaaagtaa aagaacctgt acaaatggta 240
ttaaaggaaa gaagctccac tatttatttt gaagatattt ctcatctgga gccagaagaa 300
aaagtgaatt acattaagca gtttaaaatg agggatcggg agaaggggtt tgacctctcc 360
cgggaccttc tcatccgaat gtcattattt aagcttgatc aggagcagta tcagttaata 420
atgagtaatc accatatcat tatggatggt tggtgccttg gcattatcct tactgatttc 480
ttacgtatgt ataaaggaat cgtgaatcat acccctgttc catacgagca tgtgacacct 540
tacagtaagc atattcaatg gctagaaaaa caggatcatc aggaagcaaa ggatttttat 600
caacagctat tagagggata cgacaaagta acaggtgttc cacagcaatt agtacgggcg 660
aatcacgaag aatatactca cggacaatgc atcgtgaaat taaatcaaga aactgccgac 720
cgattgattg ccatagccaa aacctaccag gttacagtca ataccgtttt ccaaacgatt 780
tgggggatat tattacaaaa atataataat acggatgacg tagtatttgg atcagttgtc 840
tcggggagac cggcagagat tcctgatgtt gaaaaaatgg ttgggctatt tatcaataca 900
attcctgtgc gaatcaaagc tgatcaacaa gagcgatttg acacgctagt agccaaagta 960
caggaaatgg ccttggcttc agaatcatat gattatcttt cgttggcaga tattcatcca 1020
gaagctggcg attttatcaa tcatattatt gcgtttgaaa atttttatat cgatatggac 1080
agctttaatc agctagcaga taaaaaagag cttggattct cgctcgcatt cgccacagat 1140
catcacgagc aaaccaatta tgatctaagt gtgcaggcgc agattggtga tgaatcttcc 1200
attaaaattt tatataattc caagctttat acatcggaat acatagcaaa tgtaattgat 1260
cattttgtta ctgtggctga catagtggct gctgatccta gcatccctgt aaaggaaatc 1320
gatattttaa caaaagataa aaaagatcag attctctatg gttttaacaa tacctatgca 1380
gattatccaa gagagaagac catccatcag ctatttgaag aacaagtaga taaaaatccg 1440
aatcagatcg cacttgtgtt taaagaagag aagctgactt acggtgaggt aaatgcgaaa 1500
gcaaatcagt tggcatacgt gttaagcaag caaggtgtac agcctaatga tgtaatcggc 1560
atcatcaccg aacgctcccc agaaatgatt ataggcattt tggcgatttt taaagcaggc 1620
ggagcttata tgccaattga tcctacttat ccggctgaac gcattgaata tatgctacag 1680
gataatcaaa cgaagctatt attggtgcaa aaacaagaaa tgataccagc caattatcaa 1740
ggagaggtat tgttcttaac ccaagagagc tggatgcatg aggaaacatc taatccggct 1800
catattactc aagcacaggc tttagcatat gtgatgtata cctctggttc tacaggagag 1860
cctaagggca ttttgacaac acatcaaaat attatgaaga ccgtcattca taacggttat 1920
gttgagatta ccccaggaga ttgcttgtcg cagctctcca attatgcctt tgacggctct 1980
acctttgaaa tctatggggc attattgcat ggagctacat tacttttagt aacaaaagag 2040
gctgtactca atatgaatga gctggcacgt cttattaaga aggagcaagt gacggtttcc 2100
ttcatgacga ctgctctgtt taatacactg gtggatttgg atataacgtg ctttcaatcg 2160
gtacgaaagg tgttgttcgg aggagagctt gcttcggtta agcatgtcct gaaagccctt 2220
gattatttag gcgagcaccg ggttatcaat gtgtatggac caacggaaac taccgtgtat 2280
gctacctatt actctgtaga tcactccatg ctgacgaggg catctgttcc tattggaaga 2340
ccgattaata acacgaaagc ctacatttta aatacagatg gacagcctca gccaatagga 2400
gtagtcggtg agctatgcat tggcggtgag ggggtagcat gtggttatct taaccgtcca 2460
gagctgacaa agaaacattt cgtggataat ccgtttgtct tgggtgagcg aatgtattgt 2520
accggagatt tagcccgctt tttaccggac ggcaacatcg aatacattgg gcggatggat 2580
gaacaggtaa agattcgtgg tcaccggatt gagctgggcg aaatcgaaaa ggttctttta 2640
cagcacccag ctatcagcga gacagtgctt ttagcaaaac gagatgagca aggtcattcc 2700
tacctgtgtg cgtatatagt aggtcaggca ttttggactg ttacagagct gcgtcaacac 2760
ttgatggaat ccttgccaga atacatggtg ccttcctact ttatcgagat tgagaaacta 2820
ccgcttacgg caaacgggaa ggtagataag cgagcgttgc ctgaaccaga cagaaaaatg 2880
ggcagtgctt acgttgctcc agagaacgaa acagaggaga agctggttca atttttccaa 2940
gagattttgg gtgttgagcg agttggtacg caggatacat ttttcgagct tggtggtcac 3000
tcccttaagg ccatgatgct cgttctacag attcataaag aaatgggcat tgaagtcccg 3060
ttaaaggaga tatttacacg tcctaccatc aaagaattag cggcgtatat tcataagatg 3120
gatcgctctg cctacagcat gattgagcca accgccaaac aagagtatta tccagtctcc 3180
tttgcccaaa gacgaatgtt tgtagtgcag caaattagag atacgaatac aaccagctac 3240
aatatgccga ttttgctaga aatagaaggg gctcttgata gggaaaatgt gagacaaaca 3300
ctaaagaaat tgatagagcg tcatgaatca atgagaacgt cattccatat gattgacgag 3360
accttgctgc aaaaggtgca tgatgatgtg acatgggaaa tggaggagat ggaagcgtcc 3420
gaggaagagg tttatgcttt gacaaaatcc ttcattcgtc cttttgatct cggtcaagct 3480
ccattgttta gagcaggatt aattcgtgtt aattctgagc gtcatttgct actactagat 3540
acgcatcaca ttatctcaga tggcgtatct actaacatac tctttcaaga ttttacgcaa 3600
ttatatcgtg gacgagagct gcctgcccta cgaattcaat acaaggattt cgccgtctgg 3660
caacaaggag aggctcagct tgctcgtctg caagaacaag aagaatactg gctgaaacaa 3720
ttttcagaga gtgtgcctgt actagagctt cctactgatt ttccacgtcc agcgatgcag 3780
cagtttgatg gtgacgtatt ggactttgca ttaaatcggc aagtatggca ggaattacaa 3840
cagctcattg ttaaagaggg ctgtacggct tacatgatat tgctggcggc ttatcatgtc 3900
ttgctttcca agtattcgtc gcaaaacgat attgtgatag gttccccgat agcaggccgg 3960
acaaatgctg atttgcaatc gattgtcggg atgtttgtta acacgctggc tatccgcacc 4020
aaatcagagg gaactcagac attccgcgag tttctctcta cgatcaaaca actggttctt 4080
caagctcaat ccaatgcaga gtatccattt gaagagctgg ttgataaggt aaatccaagt 4140
cgcgatctaa gtcgccagcc tttatttgac acaatctttg tcatgcaaaa catggatatt 4200
accgaggttg cgatacaagg tctttcaatc gtaacgaaag atatggaatg gaagcattca 4260
aaatttgatc ttacatgggc ggctgtagag aaagaatcct tacatttttc agttgaatat 4320
agtacccgct tatttaagca agaaacaatc gagcggatgg cgaagcattt tgcccatttg 4380
ctaaatcaag tggcggaaaa tcctgacttg agcctttcag atatggaatt ggcaacggat 4440
gaagaagtgt accagctttt ggaggagttt aataatacag aagctgatta tccgagtgat 4500
aaaacgattc accagcagtt tgagcagaag gtagaggaaa accctgatca gatagcgttg 4560
ttatttaaag ataaggaaat tacttacgga cagttgaatg caaaagcaaa tcaatttgct 4620
cgcgtattaa gaaagcatgg agtacagccg gatcaagtgg ttggattaat cactgatcgt 4680
tccattgaaa tgatgatagg cattttggca atcttaaaag ctggcggagc ctatttgcca 4740
attgatcctt cttatccagt agaacggatt acctacatgc tagaggatag tcaggcacag 4800
cttttgattg tgcaggaagc tgctatgatt ccagaggggt atcagggcaa agtgttgctt 4860
ctagtagaag agtgctggat acaggaggaa gcgtccaact tagagttgat taatggtgcc 4920
caggatttgg cgtatgtgat gtatacctca gggtctactg gtaagccaaa gggtaatctg 4980
acgactcacc aaaatatttt gagaaccatc atcaacaatg gatttatcga gattgtacca 5040
gcagaccgtc tattacagct atcgaactac gcctttgatg gctctacctt cgatatctac 5100
agcgcgctat taaatggagc gactcttgta ctggtgccaa aagaggtcat gctaaatcca 5160
atggagctgg cgaagatcgt ccgcgagcag gatattacgg tttcgtttat gaccacgtcc 5220
ctgttccata cgctagtgga gcttgacgtg actagtatga aatcaatgcg caaggttgta 5280
tttggcgggg aaaaggcttc atacaagcat gtagaaaagg ctctggatta tctcggagaa 5340
ggccgtttag taaatggata cggccctaca gaaacaaccg tttttgctac cacatacacg 5400
gtggattcta gtatcaagga aacgggaatc gtaccgattg gccgtccgtt aaacaatacg 5460
agtgtctata ttttgaatga gaataatcaa ccacagccga ttggagtacc aggggaattg 5520
tgcgttggcg gagcaggaat tgcacgtgga tatttaaacc gtccagagct gacagcagag 5580
cgctttgtgg ataacccgtt tcttgtcgga gatagaatgt atcggacggg agacatggct 5640
agattcttac cagatggcaa cattgagtac atcggacgaa tggatgaaca agtgaagatt 5700
cgcggacatc gaattgaact gggcgaaatt gaaaaaagtc tcctggagta ccctgctatc 5760
agtgaagcag tacttgtcgc aaaacgtgat gaacaaggtc attcctatct gtgcgcttat 5820
gttgtaagca cggatcaatg gacggtggct caggtacgtc aacacctact ggaggctctg 5880
ccagagtaca tggtaccatc ctatttcgtt gagcttgaaa agctacctct tacttccaat 5940
ggcaaggtag acaagcgtgc attgcctgaa ccagatcgag tgattaccaa tgagtatgtg 6000
gcggcagtaa atgagacaga ggagaagcta gttcagtttt tccaagagat cttagctgta 6060
gaccgagtgg gaacgcagga tacattcttt gaattgggtg gtcattccct aaaagcaatg 6120
atgctggttt caagaataca caaggaatta gaaatagagg ttccgttaaa agaagtattc 6180
gccagacaaa ccgttaaaga attagcagcc tatatcagac aggctgaaca gtcggattac 6240
agcgaaatcc aaccggccat ggagcaagaa tactacccag tatctaatgc gcagcgacga 6300
atgtatgtgg ttcaacaaat gagggatgta gaaacaacag gctacaatat gccgttctat 6360
ttagaaatgg agggtgctct tgaggtagaa aagctatctc tagctttgaa acaactaatt 6420
aagcgtcatg agtcattgcg aacctccttc catatggttg aggatgaact gatgcaaaag 6480
gtacatgcag aagtcgcatg ggagatggaa atgattcatg ccgtagagga agaagttcaa 6540
cagctgaccg attcctttat gcgtcctttc gatcttgcca aggcgccact attccgtgcg 6600
ggactcattc aaatcaatcc gaagcgacat ttattgatgc tggatatgca tcatatcatc 6660
tcagatgggg tatcgatgaa tgtattgttt caggatataa cgcagttgta tcaagggata 6720
gagctgagtc ctctcaagat tcaatacaag gattttgcgg tgtggcaaca agggatggct 6780
caggttgtcc gttttcagga gcaggaaagg tattggttaa accaattctc tggtgaccta 6840
ccaattttgg aaatggtaac agattatcca cgaccagcca tacagcagtt cgatggagat 6900
tcctggtcat ttgaaattga tgccaaagta ttggacagca taaagcaatt gtcagctaag 6960
caaggcacta cgttgtatat gactctgttg gcgatttatc aaatcctgtt agccaagtat 7020
acccgtcaag atgacatcat tgtcggaact ccgattgcag gaagacctca tgcagacaca 7080
gagagcattg tggggatgtt tgttaataca ctagccctac gcggtcaacc aaaagaagag 7140
caatctttca tctcttactt atcagaagtg aaagaaaacg tactacaagc ttatgccaac 7200
gctgattatc catttgaaga gttgatagag aagctgcatt tacaaagaga tatgagtcgt 7260
catccattgt ttgatacgat gtttgtttta caaaacatgg atatgtccga tataaatatt 7320
tctggtctaa agcttcattc gcgtgattta aactggaaaa atgcaaaatt tgatatgacc 7380
tggatgatag ccgaacaaaa taatctattg atttcggttg agtacagcac cagcctgttt 7440
aaacatgaaa ccattcaaag gctagaaaag catttcactt atttagtaga acaagtggct 7500
aagcatccgg attgcttact cagagattta gaactcacaa cagacgaaga aaaacaacaa 7560
atactgacgg tctttaacga cactgctact gatgatttac aggatttatc catctgccat 7620
ctattcgaac aacaagtaca gcgtttttca gatcggccgg cacttgtgtt taaagacaag 7680
cagctcacat acagtgagtt ccatgcaaaa gtaaatcaat tagcccgggt actcagaaag 7740
aaaggtgtgc agccggatca agcggttgga ttaatcaccg atcgttccat tgagatgatg 7800
atagggattt tcgccatcct aaaagcaggc ggagcttata tgccaattga tccttcctat 7860
ccaatcgatc ggatcgagca catgctagag gacagccgga ctaagttgtt attcgtgcaa 7920
aaaacagaaa tgatccctgc tagctatcag ggggaggtat tactcctagc ggaagagtgc 7980
tggatgcatg aagattcatc gaatttggag ctgatcaata aaacacagga tttggcgtat 8040
gtcatgtata cctcagggtc tactggtaaa ccaaagggta acctgacgac tcaccaaaac 8100
attttgacca ccatcatcaa caatggctat atcgagatcg cgccaacaga ccgtctatta 8160
cagctatcta actatgcttt tgatggctct accttcgata tctacagtgc gttattaaat 8220
ggagccactc ttgtactggt gccaaaagag gtcatgttaa atccaatgga gctggcgaag 8280
atcgtccgcg agcaggatat tacggtttcg tttatgacca cgtccctgtt ccatacgcta 8340
gtggagcttg acgtgactag tatgaaatca atgcgcaagg ttgtatttgg cggggaaaag 8400
gcttcataca agcatgtaga aaaggctctg gattatctcg gagaaggccg tttagtaaat 8460
ggatacggcc ctacagaaac aaccgttttc gctaccacat acacggtgga ttctagtatc 8520
aaggaaatgg gaatcgtacc gattggacgt ccgttaaaca atacgagtgt ctatgtctta 8580
aatgagaata atcagcttca gccgattgga gtaccagggg aattgtgcgt tggcggagca 8640
ggaattgcac ggggctattt aaatcgtcca gagctgacag cagagcgctt tgtggaaaat 8700
cctttcgtgt caggagatag aatgtatcgt accggtgatt tagcacgttg gttgccggat 8760
ggaagcatgg agtatttagg acggatggat gagcaggtta aggtacgcgg ttaccgaatt 8820
gagctgggtg aaatagagac aagattattg gagcatcctg ctataagcgc agtggtttta 8880
ctagcaaagc aagatgagca agggcattcg tacctatgtg cttacatcgt tgcaaatggg 8940
gtatggacgg ttgcggaact acgtaagcat ctaagcgagg ctttgccaga atacatggtg 9000
cctacttatt ttgtagaact agagcagata ccgttcactt ctaatgggaa ggtgaacaaa 9060
cgcgctttac cagagccgga aggacaaatt actagtgtat atgtggcccc agaaacggag 9120
acagaagcaa aagtagcagc gttattccaa gagattttgg gtgtcgagag agttggtaca 9180
caggacatgt tctttgagct gggtggtcat tcgctaaaag cgatgatgct cgttttacga 9240
atgaataaag aactgggcat cgaggtgcct ttgaaagagg tatttgcaca tcctactgtc 9300
aaggagttgg cagcaacgat cgaccttctt gatcgatcag gccactcaga gattgagcct 9360
gctccaaggc aggaattcta tccggtatcc tccgcgcaga gacggatgta tgtggtgcag 9420
catttaggaa atgtccaaac aaccagctac aatatgccgc ttttccttga agtggaggga 9480
gctttagaaa ttgataagct tcatctagca cttgagaaat tggtcgaaag acacgagtcg 9540
ctacgaacct cctttcatat ggttgacgaa gagctgatgc agcaggtgca tgaagaggtg 9600
gcctgggatt tagagatcat ggatggaacg gaaggagacc ttgcaggcat cacagcagga 9660
tttatacgtc cgtttgatct tagccaagct ccattgttcc gtgcaggcat cgtgcggatt 9720
agccctgaga gattcctttt catgctagat atgcaccata tcatctcaga cggagtttct 9780
accaatgtat tgttccagga tataacgcag ctctatcaag gaaaggacct gccccctctt 9840
ccgatacagt acaaggatta cgctgtgtgg caacaagctg atgctcaagt gactcgctta 9900
caagatcagg aaagctattg gctacatcaa tttgctggag aagcttctgt cttggaaatg 9960
ccgacagatt tcccgcgtcc tgcagtccag cagttcgaag gagatgtatg gacctttgag 10020
gttgatgctg acattctcag ccagttgaaa aaattatcag tgagtcaggg ttctactcta 10080
tatatgactt tattggcggc ttatcaggtg ttgctggcta agtataccgg tcaagatgat 10140
attattgtcg gttcaccaat tgccggacgc cctcatgcgg atgtagagag catcgtcggt 10200
atgttcgtca acacgctagc tttacgtgga cagcctgtag gagagcagac gtttatttcc 10260
tatctggcac aagttaagga acaggtttta caagcttatg ccaatgcaga gtatccattt 10320
gagaaattgg tagagaagct tgatttacaa cgagatatga gtcgccatcc actcttcgat 10380
acgatgttta ctttgcaaaa catggagatg tctgatattg atttggcagg cttgaccttc 10440
aagccatttg attttgaatg gaaaaatgcc aagtttgaca tggattggac aatgcttgag 10500
gaagaaacac tcaaggtagc tattgaatac agtacaagcc tgtatacaaa agaaaccatt 10560
agcagaatgg ctcaacattt cacctatgtt ttacaacaaa ttattgagca tccagccatt 10620
cgtttggctg aaatcaaaat tgctactcta ccagaaattg aacagatttt aacgcaattt 10680
aatgatacta gggccagtta ccctgataac caaaccattc atagcctatt cgagcaacaa 10740
gtggagcgta caccagaaca gatagctgtt gtctatcagg atcaatccat cacgtatcgt 10800
gagcttaatg aacgtgcaaa tagattggca cgttgcttga tcgataaagg gatacagaga 10860
aatcaatttg ttgcaatcat ggcggatcgt tccatagaaa ccgttattgg aatgatggga 10920
attctcaaag caggaggagc ttatgttcca attgatcctg attaccctct agatcgaaag 10980
ctgtatattc ttgaagacag ccatgcatca ctattattgt tccagcaaaa gcatgaggtc 11040
ccctcagaat tcacaggtga tcggatatta attgagcaga tgcagtggta ccaagcggct 11100
gatacgaatg tggggatcgt caatacagct caagatttgg cgtatatgat ctatacctca 11160
ggttctacag gtcaaccaaa aggggtaatg attgatcatc aggcagtatg taacctatgc 11220
ttaatggccc aaacctatgg aatctttgcg aatagtcgcg ttctacagtt tgcctccttt 11280
agctttgacg cttccgtagg agaggttttc cataccctta caaatggagc cactctctat 11340
ctgatggatc gcaatctggt catggctggc gttgagtttg ttgaatggtt acgagtaaat 11400
gaaataactt ctattccgtt tatctcgcct tctgcattgc gtgcattgcc gtatgaggat 11460
ttaccagcat tgaaatatat cagtacaggt ggggaagcat tacctgtaga tttagtcaga 11520
ctatggggaa ctgagcgaat cttcttaaat gcatatggtc cgactgaaac aacagtagat 11580
gcaacgattg gcttatgtac gccagaggat aagccacata ttggtaagcc tgtgttgaat 11640
aaaaaagcct acattattaa tccaaattat caacttcagc caattggggt accgggtgag 11700
ttatgcatcg gtggagtagg gattgctcct ggatattgga atcgccctga attaactaga 11760
gagaaatttg tggataatcc atttgcccaa ggcgaaagaa tgtataagac gggggactta 11820
gtacgttggc ttccggatgg aaatattgag tttttaggac gtattgatga tcaggtgaaa 11880
attcgtggac accgaattga attgggtgaa attgagacgc ggcttcttga gcatgagcag 11940
gtaatagagg cggttgtgct ggcgcgtgaa gatgaacaag gtcaagctta tctgtgtgct 12000
tatctggtag cagcagatga atggacggta gcagaactgc gcaaacatct aggaaaaaca 12060
ctgcccgatt atatgattcc tgcttatttt atcgagcttg aggagtttcc tttgacacca 12120
agcgggaagg tgaataaaaa agctttacca gagcctgatg gacaaataca aacgggagtg 12180
gagtacgtag aggctactac cgaaagccaa aaaatccttg ttgagctttg gcaagaggtg 12240
ttacgtgtcg agcggatcgg tatttacgat aacttctttg agctgggcgg tgactccatc 12300
aaagcaattc aaatcacagc aagattgcgt cgttaccacc gcaagctgga aatcagccat 12360
ctgtttaagc acccaacgat tgcagagctt gctccatgga tgcaaaccag tcaggcatta 12420
cttgaacaag gaactgttga aggcgaagtt atgctcacgc caattcaaaa agcattcttt 12480
gaagaaaatc aggaacagcc gcagcatttt aatcaggatt cgttactgta cagctcgaat 12540
ggctggaacc aagatgcgat cgagcaggta tttgaaaaaa taacggagca tcacgatgcc 12600
ctgcgaatgg tgtatccgca taccgagggc aaggtgactc agatcaacag gggacttgag 12660
gacaaggcgt tcacattgca ggtgttcgat tttacccaag aaccaactga tacgcaggca 12720
acgaaaattg agcaaatcgc tactcaattg caagcgagct ttgatttaga aaagggacct 12780
ctggtacgac ttggcttatt taccaccaag gctggggatt atttactgat cgtgatccat 12840
cacctagtga ttgacggcgt ctcttggcgt atattgcttg aggattttca taatgcttat 12900
cagcaagtca ttcaaggtca agcaattgta cttcctgaaa aaacgacctc ctttaaaaca 12960
tggagtgagc gcttgaatga atatgcaaat agtcatgctc ttttacatga gattccatat 13020
tggaagcaga tggaagaaat atcgatcgcc cctcttccta aaaaaggaaa caatgacggt 13080
agatattatg tgaaggacag cgaatatgcc acgatgagtc taacagaaga agaaacccaa 13140
aatcttctta ctcgtgtaca tcgagcttat cgaacggaga ttaatgatct gttgcttgct 13200
gcattaggat tagcaagtaa ggaatggaca aaagagaatc gagtggctat ccacttagag 13260
ggtcatggtc gtgaggaaat aggtgaaggg gtagatgtca accgcactgt tggatggttt 13320
acctcactgt tcccagtcgt gattgattta gaaaatgacg aattgcctct catcattaaa 13380
tcggtaaaag aaaccctgcg ccgagttcct aataaaggca tgggctacgg catactcaag 13440
catctgacaa gcgatgcgaa caaacaggag ataaccttct cgcttcgtcc agagatcagc 13500
tttaactatc tgggagtatt tgatcaacaa gaggaggaga gcgaatctgc tgggattcct 13560
actggtcagc cgatcagccc gcaatattat gacacgcacc tgctggagtt taatggagcg 13620
gtctcgaata accagttgca tgtaaattgc cgatttgctc ctacagccgt tgatcgagcg 13680
attgttgaaa ttttgatgga gcgcttcaag caccatttac ttctaattag taagcattgc 13740
ttggaaaagg ataccgtaga atttacacct actgatttta cagaaaagga attaagccag 13800
gaacagcttg acgatctatt agatgatttg tttgaagaca tagatgatct gtaa 13854
<210> 66
<211> 7593
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 66
atgtcggata agctaagcaa cgctaaagac ctatttccaa tgagcgatat acagctaggg 60
atggtctacc attcgttaaa acatgtacac gaagctgtat accatgatca atttgtttat 120
caagtagatg atgattcatt tgatgttcat gtactagagc aagcgatgag aatgatggtt 180
gataagcacg acatcttaaa aaccagcttt catattgagg aattttccac tccagttcaa 240
gtagtgcacc aggaggtttc tgttcgaatt gatgagacag acattacgca tctgggagaa 300
aaacaaaaag agtatatcca tcagtatttg gcgcaggatc gtcaatcccc ttttgatgta 360
acaaccgctc ctctatggag aatgagcgtt tttaaactga atgcaagcca agttgcttta 420
gtttggatct ttcatcatgc tattttggat ggatggagtg ttgcatcttt tattacggaa 480
ttaattgatg tttatttcaa attaaagcac aaaacttgca ctttggagca tttgaacacg 540
acctataagg attatgtgat tgatcagatg ctattatccg agcaaaatga gctgcgtgaa 600
tattggaaag aagaattaaa agattacaaa cggctacagc tcccagtaaa agtggatgaa 660
aatggcggtg ttcacgttac cgttgttgag aagctagacc ctgacattat aaataaatgc 720
agagaaattg cacaagctca tcacattcca ctaaagaccg tatgcctcac agcctttctt 780
tctatgatgc atatgatttc atatgagaga gacctgactg tgggattgat tgagaacaac 840
cgaccaatta tagaagatgc tgaaaaggtg ttgggatgtt ttcttaactc agttccattc 900
cgcgccatta tacagaaaga tatgagctac agagagctat tagagcagac acagcaaaag 960
cttgttgaga ttaagacata tggaaggctt tcctttgcta agattattga agtaattggc 1020
gatacgggaa gcgaacgtaa tccagttttt gactgtcttt ttaactttgt cgacttccat 1080
gtatttaaag ggataaagga tcataaagta aagttttggt tagatggata tgaaaagacg 1140
aacaccatgt ttgacttttc tgtttcgacc acaatggatg actattttgt tcgggttgta 1200
tctgcactgc cagaagaaga taccataaaa ctaattaact attatcaacg aattttagaa 1260
aagattgctc ttcacataga tgaaaaaata gataaacaag ccaatcttga tgaaaaggaa 1320
agccacttgc tgctagagga atggaatcaa acgtcagttg attatccaga caagcaaaca 1380
ttgcataaac ggtttgaaga gcaagcagcc aaaagtgaag atcagatagc gctggaatat 1440
gaggataagc agcttaccta tagggaattg aacgctaaag ccaatcaatt ggcacgtgtt 1500
ttacagaagc ataatacgct gccaactcag gtagttggtc taatggcaga gcgttcacta 1560
gagatgataa taggcattct tgggatatta aaagccggcg gagcctatat gcctattgac 1620
cctacgtatc ctgcggagcg tatccaatat atgctcgaag atagtcgatc ctatctctta 1680
cttgtgcaaa aagcggaaat gattccagcc aattatcagg gtgaagtact tatcctcaca 1740
gaggaacttt gggcagatga gaatacagag aacctggatt tagtcaatca gccacaggat 1800
gttgccaata tcatgtatac atctgggact acaggaaagc caaaaggtat cctgattact 1860
catcgaaaca ttatgactac cataatcaat aatggctatc ttgatatttt ttcaacagat 1920
cgaatattgc aaatgtctaa ttatgctttt gatggttcta cctttgatat atacagtgct 1980
ttgctaaacg gagctaccct cgtgctagtt cccaagcaaa cactcatgaa tacgaccgat 2040
ctgttagcag tcatcaaaga tagcaatatc acggtagctt tactgacaac ctctctattc 2100
aatacgttgg ttgatcttga tgtaaccagc ttccaacata cacgtaaggt tttatttggc 2160
ggggaaaagg cttcatgtaa gcatgtagaa aaagcattgg attatctggg tgaggggcgc 2220
ctagtaaatg gatatggtcc gacagaaaca acggtgttcg ctactaccta tacagtcgat 2280
aacacgatta aaaagctggg aagtgtcccg atcggacgtc ctttgagcaa tacttcggta 2340
tatatttttg gattagatga tcaattacaa ccacttggag taccagggga gttatgtgta 2400
gcaggagaat gcatttcgcc tggatatctg aatcgtcccg acttaacggc agacaaattt 2460
attgataatc cacttaaacc aggtgagaga atgtaccgta caggtgacct ggttcgttgg 2520
ctacctgaag gtgtcatgga atacatgggg cggattgatg aacaagtcaa gattcgtgga 2580
catcgtatcg agctagggga gattgaggca aaactgcttg agcatccttc gattcgagaa 2640
acagtgctgg tggctaaaca ggatgcaaat ggccattctt ttttaggtgc gtatcttgtt 2700
acagacaact tctgccctgt aacggaatta cggaattatc tgatggaaac cttgccagaa 2760
tatatggttc cttcttattt tatcgagctg gatagcctac cgcttacttc aaatggaaaa 2820
gtagataagc gagcattgcc cgaaccggaa tctcaggctt tacacgcata taccatgccg 2880
gagaatgaga tggaagaaaa attggttcag ctattccagg aagtgatgga tgtagagcgt 2940
gttggtaccc aagatagctt ttatgaatta ggcggtcatt ccttaaaagc aatgcttttg 3000
gtttcacgaa ttcataagga ttttggaata aagataccgt tgaaggaagt attcagtcgt 3060
ccgaccgtga aggaattggc tgcctatctg actgggtcag aagaagcaaa ctatattgaa 3120
attgaagcag cagaagagaa accatactat ccagttactg ccgctcaaaa acggatgtat 3180
atcgcccagc aatgggagga tggggaagcc actagcagtt atcacatgcc gtttatgatg 3240
gaaatcacag ggcctcttca agtagaaaag ctacaacaaa cagtaaagag tcttgtcgca 3300
aggcacgagt cgttgcggac atcatttcac atgatcaatg aagtattgat gcaaaagata 3360
catgcagatg tattgtggga tttagacatt gatctagagt cagttgtcgc ttcagagcaa 3420
gaaattgatg aaaaaatgtt ccaattcctc cgcaaatttg atttgagtca agctccttta 3480
tttagagcta agctgattcg tgtcaacgct agtcggcatg tattgttatt agatatgcat 3540
catattattt cggatggatt ttcataccag atattttttg atgagcttac caagctgtat 3600
cagggcgagg aattgccatc tctcaaaata caatataagg attatgccgt ttggcagcat 3660
tcggaagaac aacagaagcg tttgcaacag caagaggatt attggttagg tcaattccaa 3720
ggggaaattc ctgttctgga attgcctacg gattaccagc gcccggttga taaacagttt 3780
gctggagcat tattcacaca cgggttatct gctggtctaa cagagaagct gagaaaatta 3840
gcgattaagg aaaaaacgac gttatacacc gtactgctga cggtctataa cattctattg 3900
acgaaatata caagtcaaga ggacctcatt gtaggtacac cgattgctgg acgtccacac 3960
gcagatttag acagagtatt tgggatgttt gtaaacacgc tggccatcag aacagctcca 4020
aaagtagagt attccttctt agcgtatctc tctgaggtca aagaaacagt actaggtgct 4080
tatcaaaatc cagactatcc atttgaggag ctggttgaaa aaacgctagt tcagcgggat 4140
gtaagccgta atcctttatt cgatgtaatg ttctccgtag agaaattacc atctgctgta 4200
cagttcgatg atttacgttt ctgcccacgc ttatttgatt ggaagaaggc aaaatttgac 4260
ttggattgga cagtggtgga aggtgaatca ttggaggttt tggttgaata tagcacgagc 4320
ttgttcgatc gggcgaccat tgagcgcatg gctaagcatt ttgagcatat tttggagcaa 4380
atccttgatc agccagacct gtctatttct gagattgaac tgctgaccga ggcagaaaaa 4440
caacaaattt tgattgagtt taatcaatcg gataaatcct ttgacagcga aaaaacgatt 4500
caggagcaat ttgaagaatg ggcagaaaaa gccccgcaca gcattgcctt agtctttaaa 4560
gacaagcaaa tgacctatca ggaattaaat caacgtgcta accaagttgc gcatttatta 4620
cgtggcaatg ggatttccgc aaatgatttt atcggtttaa tggtggatcg atcgtttgag 4680
atgatcatta gtatgctagg tattttgaag gcgggtggag cctacctacc tattgatcct 4740
gattatcctg aggaccgtat cgattatatg ttatctgaca gcaaagcgaa gattctctta 4800
aagcaaagtg accaaactgc accagcttcc tttgaaggta aagtcatcgc tattgatact 4860
ccagaattgc tagagatgga tatagaaaat attcctaagg tgaataactc atccgacttg 4920
gcttatatca tttatacatc tggatcaacc ggaaaaccaa aaggagtatt gattaatcat 4980
cgatgcgtga tcaatatgca gcttacagct gaaacctttg gtatctatcc ttcgagtcgt 5040
attctacagt ttgcatcctt tagttttgat tcatctgtgg gcgagatttt ttatacatta 5100
ttaaacggag catgcctgta tttggtagaa aaggatttgc ttttatccgg taatgaattc 5160
gtggcatggc taaagaaaaa taggattagc tcgattccat ttatttcacc gtcggctctg 5220
cggatgcttc cttatgagga tttacctgat ctcgcatata taagtacggg tggggagaca 5280
ttgccggctg accttgttaa agcgtgggga gaaaatcgtg tcttcctaaa tgcatatggc 5340
ccgacggaaa caactgtaga tgccactgtc ggtgtatgta caccagaagg gaaaccgcat 5400
atcggaagac ccgttacgaa taaaaaggtg tatgtagtaa atagcaacaa tcaattacag 5460
ccgattggtg ttcctggcga gctttgcatt ggcggggaag gggttgcact tggctatcta 5520
aacagacctg atctaaccca agaaaaattc gtttccaatc cgtttgcccc gggtgaaaga 5580
atgtaccgct ccggagactt agtcagatgg ctacctgatg gaacaattga gtacttcgga 5640
agattagacg atcaagtaaa aattagaggt caccgtattg aattaggaga gattgaaaca 5700
aggctactag agcatccatg cattaaagaa gccattgtca ttccacgttc tgatgagtca 5760
gaggctacat atttatgcag ctatttgatt gcagaaggat catggaatgc ggctgactta 5820
cgtaagtatt tgaaggcttc tctgccggaa tatatgatac cttcgtattt tgtggagctg 5880
cacgagctac cgctaacacc taatggaaaa gttaataaaa aagcattacc aaaaccagaa 5940
aagcaaatgc agagagggca ggattatgta gcccctacta accctattca atccatttta 6000
tctcatattt ggactgatgt gcttggtgtt gaaaatatag gaattcacga caatttcttt 6060
gaattaggtg gagattcaat taaagccatc caaatttcag ctcgacttaa taagcataat 6120
ctcaaggtta aaatgcggga attgtttaag aacccaacga ttgctgagct aagtctgctt 6180
gtacaacaga tcgttcagga gatcgatcaa ggagtagtag aaggaaatat tccgcttaca 6240
ccgatccagc attggttctt tacccaatca ttcccgcagg tcaaccatta caatcaatcg 6300
gttctcctct ttaatgcgga gggctgggat gagcaaaaag tagacaaagc ttttgagatg 6360
ctaactcagc accatgatgc actgcgaatc gtatatagcc tcgacgagca aggggttgta 6420
cagcataacc ggggattgga aggctcgaac tatcatttcg aaatcattga tgtaagacaa 6480
gatggagaag atcagtcgaa ctggaaagca gcggcgaatc ggatgcaggc aagtatggat 6540
atcgtagaag gacctttagt gcagatcgga ttgttccgtg ctaatgaagg agcttatttg 6600
ttaattgcca ttcatcactt agtggtagat ggggtgtctt ggcgtatcct actagaagac 6660
ttctatcatt tatataacgg aaacgactct ctgccattaa aaacgacctc tttccaagca 6720
tggtctcaaa agctccaaga gtacgcccaa agcaaggagc tagaacatga gctttcctat 6780
tggcgccatt tagatgaagc tatcacggac tataccttac acaaagatat agaagccgca 6840
acctcaaata agacaaccta tgaggaattt ttaactgtat cgatgtcttt atcaactgag 6900
gaaacccaac agctagttac agaggctcat aaggcgtacc agacggaaat aaatgatctg 6960
ctgctcacgg cactggcttt agctttgaag gaatggacga ataaagagca gttgctagtt 7020
agtatggagg ggcatggacg tgaagaaatt ctagataacg tagatatctc ccgtacagtt 7080
gggtggttta catcagagta tccggttgct attcatctga cgaaaacaga catttcgttt 7140
gccattaaac aagtaaagga aacgttgcgt cgtgtaccta acaaagggtt tggctatggg 7200
attcttaaat atttggccaa agagacgttc aagcttaagc cagaaatcag ttttaactat 7260
ctaggccaat ttacagataa ggaagagggg aactcctctt taatgggtga tctgattagc 7320
ccggcaaata ccagtgagct gtccctagat atcaatggaa gtatagaagc tgacagactg 7380
caaatgcact ttagttataa ctctcgtgcg tactatccag agacaatcgc agcccttgtt 7440
caaaacttca aatcctactt gcttgagatt atcaatcatt gccgggcgaa agaaggagta 7500
gagcatacac caagcgactt tgatatcaat gatctcacca tggaagaact agatgatatt 7560
tttgatgacc tggaagaaga ggtatacaaa taa 7593
<210> 67
<211> 1926
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 67
atggatttat ctacattaaa ttttttgggt gaaacagaaa agcataagtt attgaatcaa 60
ttcaatgata cggacgctaa ttttcctcag gagatgacca ttcatgggct gtttgaaaag 120
caagtccaag aaagaccgaa tcaaactgcg gtaattttta atgaacaaag tatgacgtat 180
aaagaaatga atgaacgagc caatcaagta gcacatagct tacggaagca tggagttgct 240
ccagatgaga tcgttggaat tctagcagat cgcaacatgg acatgcttat ttccattctc 300
ggcgtattaa aggctggggc tgcttatatg cctattgatc ctacataccc tacagaacgt 360
attctttata tgatcaatga tagccagacc aaaattgtct tagcagaaca tagagagatg 420
gttccggaag gctgtaatgc agagctgatc ctcttgcaag atagctccct tttaaacgaa 480
gagacatctg atctagagca tgtaaataag cctgaagatt tggcctatat tatctataca 540
tcaggttcta ctggtaaacc aaaaggggtt atgattgaac atcgaaatgt cattcgcttg 600
ctatttaatg acagaaacct atttgatttt actagtgatg atgtctggac cgttttccat 660
tcgttctgtt ttgacttctc tgtttgggag atgtatgggg ctttactgta tggaggaaaa 720
atcgttctcg tctcttttga gatagctaga gatcctcagg ccttccgaaa tttacttcag 780
gagcaaaagg tttcgatttt aaatcaaacc cctacagctt tttatcagct ctcgtctgaa 840
gagatgcagc actcagacag caatctatcg attcgtaaaa tcatttttgg tggagaagcg 900
ttgacgccat cacagttgaa ggcatggaaa caaaaatatc caaatacagc cttgattaat 960
atgtacggta ttacagaaac aactgttcat gtgacttata aggagtttca attacatgat 1020
atggacagca cagttagcaa tatcggaaag cctatcccaa cgcttagaac ctatgtttta 1080
gattccaaga gaaacctagc tccaattgga gtgaaaggtg aactgtatgt gagcggcaag 1140
ggagtagccc gcggttattt aaacaaacct gaattgacgg aagaacgatt tatggataac 1200
ccgtttgtct ctgaagaaag aatgtatcgc acaggagacc tagctagatg gctacctgaa 1260
ggagagctag aatacttagg caggattgac catcaggtaa aaatcagagg ctatcgcatt 1320
gaactcggag aaatagaagc cgagctattg aagcaaaaag ggattaaaga agcagtcgtt 1380
ttagttacaa atgataaaga tgcacaacca caattacatg cctatttaac atctacggaa 1440
gatttggcag cagcagatct tcgtaatcaa cttactacaa cattaccctc ttatatgatt 1500
ccggctcatt tcatttttgt gtcgcaaatg cctgttacgc caaatggaaa aattgataaa 1560
gaatcacttc gtaaaataga accatcactt caagaaagcc ctacagaagc ttatgtagct 1620
ccacaaacac ctacagaaaa gcaattagtc cacatatggg aagaaaatat tggaatgcaa 1680
ccgatcagca tagacgataa ttattttgct ctaggtggcg attccatcaa agcgattaag 1740
ctattgcatg ctataaataa agagtttcag attagtttcc aaattggaga tttgtataag 1800
catggaacca ttagagaaat gggacagcaa atcggtgaac agggcaagca atctagcaat 1860
caaaaactgt tgaaacttca ggaattggac cgtttaaaag agaaaatttt gggaagtgag 1920
aaatag 1926
Claims (9)
1.一种参与阳离子肽化合物合成的基因,其特征在于,所述基因编码的蛋白质具有阳离子肽化合物的非核糖体肽合成活性及聚合酮酶合成活性,所述阳离子肽化合物是由芽孢杆菌属(Bacillus Cohn)细菌产生的,所述蛋白质从N末端侧起依次包含如下的模块:
第一模块:其从N末端侧起依次包含:第一腺苷酰化结构域,其包含如SEQ ID NO:1所示的氨基酸序列或与SEQ ID NO:1所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一肽基载体蛋白结构域,其包含如SEQ ID NO:2所示的氨基酸序列或与SEQ ID NO:2所示的氨基酸序列具有70%以上同一性的氨基酸序列;酮还原酶结构域,其包含如SEQ ID NO:3所示的氨基酸序列或与SEQ ID NO:3所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一缩合结构域,其包含如SEQ ID NO:4所示的氨基酸序列或与SEQ ID NO:4所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第二模块:其从N末端侧起依次包含:第二缩合结构域,其包含如SEQ ID NO:5所示的氨基酸序列或与SEQ ID NO:5所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二腺苷酰化结构域,其包含如SEQ ID NO:6所示的氨基酸序列或与SEQ ID NO:6所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二肽基载体蛋白结构域,其包含如SEQ ID NO:7所示的氨基酸序列或与SEQ ID NO:7所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第三模块:其从N末端侧起依次包含:第三缩合结构域,其包含如SEQ ID NO:8所示的氨基酸序列或与SEQ ID NO:8所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三腺苷酰化结构域,其包含如SEQ ID NO:9所示的氨基酸序列或与SEQ ID NO:9所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三肽基载体蛋白结构域,其包含如SEQ ID NO:10所示的氨基酸序列或与SEQ ID NO:10所示的氨基酸序列具有70%以上同一性的氨基酸序列;终端还原结构域,其包含如SEQ ID NO:11所示的氨基酸序列或与SEQ ID NO:11所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第四模块:其从N末端侧起依次包含:第四缩合结构域,其包含如SEQ ID NO:12所示的氨基酸序列或与SEQ ID NO:12所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四腺苷酰化结构域,其包含如SEQ ID NO:13所示的氨基酸序列或与SEQ ID NO:13所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四肽基载体蛋白结构域,其包含如SEQ ID NO:14所示的氨基酸序列或与SEQ ID NO:14所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第五模块:其从N末端侧起依次包含:第五缩合结构域,其包含如SEQ ID NO:15所示的氨基酸序列或与SEQ ID NO:15所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五腺苷酰化结构域,其包含如SEQ ID NO:16所示的氨基酸序列或与SEQ ID NO:16所示的氨基酸序列具有70%以上同一性的氨基酸序列;第五肽基载体蛋白结构域,其包含如SEQ ID NO:17所示的氨基酸序列或与SEQ ID NO:17所示的氨基酸序列具有70%以上同一性的氨基酸序列;第一差向异构酶结构域,其包含如SEQ ID NO:18所示的氨基酸序列或与SEQ ID NO:18所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第六模块:其从N末端侧起依次包含:第六缩合结构域,其包含如SEQ ID NO:19所示的氨基酸序列或与SEQ ID NO:19所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六腺苷酰化结构域,其包含如SEQ ID NO:20所示的氨基酸序列或与SEQ ID NO:20所示的氨基酸序列具有70%以上同一性的氨基酸序列;第六肽基载体蛋白结构域,其包含如SEQ ID NO:21所示的氨基酸序列或与SEQ ID NO:21所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第七模块:其从N末端侧起依次包含:第七缩合结构域,其包含如SEQ ID NO:22所示的氨基酸序列或与SEQ ID NO:22所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七腺苷酰化结构域,其包含如SEQ ID NO:23所示的氨基酸序列或与SEQ ID NO:23所示的氨基酸序列具有70%以上同一性的氨基酸序列;第七肽基载体蛋白结构域,其包含如SEQ ID NO:24所示的氨基酸序列或与SEQ ID NO:24所示的氨基酸序列具有70%以上同一性的氨基酸序列;第二差向异构酶结构域,其包含如SEQ ID NO:25所示的氨基酸序列或与SEQ ID NO:25所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第八模块:其从N末端侧起依次包含:第八缩合结构域,其包含如SEQ ID NO:26所示的氨基酸序列或与SEQ ID NO:26所示的氨基酸序列具有70%以上同一性的氨基酸序列;第八腺苷酰化结构域,其包含如SEQ ID NO:27所示的氨基酸序列或与SEQ ID NO:27所示的氨基酸序列具有70%以上同一性的氨基酸序列组成的;第八肽基载体蛋白结构域,其包含如SEQID NO:28所示的氨基酸序列或由SEQ ID NO:28所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第九模块:其从N末端侧起依次包含:第九缩合结构域,其包含如SEQ ID NO:29所示的氨基酸序列或与SEQ ID NO:29所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九腺苷酰化结构域,其包含如SEQ ID NO:30所示的氨基酸序列或与SEQ ID NO:30所示的氨基酸序列具有70%以上同一性的氨基酸序列;第九肽基载体蛋白结构域,其包含如SEQ ID NO:31所示的氨基酸序列或与SEQ ID NO:31所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十模块:其从N末端侧起依次包含:第十缩合结构域,其包含如SEQ ID NO:32所示的氨基酸序列或与SEQ ID NO:32所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十腺苷酰化结构域,其包含如SEQ ID NO:33所示的氨基酸序列或与SEQ ID NO:33所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十肽基载体蛋白结构域,其包含如SEQ ID NO:34所示的氨基酸序列或与SEQ ID NO:34所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十一模块:其从N末端侧起依次包含:第十一缩合结构域,其包含如SEQ ID NO:35所示的氨基酸序列或与SEQ ID NO:35所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一腺苷酰化结构域,其包含如SEQ ID NO:36所示的氨基酸序列或与SEQ ID NO:36所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十一肽基载体蛋白结构域,其包含如SEQ ID NO:37所示的氨基酸序列或与SEQ ID NO:37所示的氨基酸序列具有70%以上同一性的氨基酸序列;第三差向异构酶结构域,其包含如SEQ ID NO:38所示的氨基酸序列或与SEQID NO:38所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十二模块:其从N末端侧起依次包含:第十二缩合结构域,其包含如SEQ ID NO:39所示的氨基酸序列或与SEQ ID NO:39所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二腺苷酰化结构域,其包含如SEQ ID NO:40所示的氨基酸序列或与SEQ ID NO:40所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十二肽基载体蛋白结构域,其包含如SEQ ID NO:41所示的氨基酸序列或与SEQ ID NO:41所示的氨基酸序列具有70%以上同一性的氨基酸序列;
第十三模块:其从N末端侧起依次包含:第十三缩合结构域,其包含如SEQ ID NO:42所示的氨基酸序列或与SEQ ID NO:42所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三腺苷酰化结构域,其包含如SEQ ID NO:43所示的氨基酸序列或与SEQ ID NO:43所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十三肽基载体蛋白结构域,其包含如SEQ ID NO:44所示的氨基酸序列或与SEQ ID NO:44所示的氨基酸序列具有70%以上同一性的氨基酸序列;第四差向异构酶结构域,其包含如SEQ ID NO:45所示的氨基酸序列或与SEQID NO:45所示的氨基酸序列具有70%以上同一性的氨基酸序列组成的;
第十四模块:其从N末端侧起依次包含:第十四腺苷酰化结构域,其包含如SEQ ID NO:46所示的氨基酸序列或与SEQ ID NO:46所示的氨基酸序列具有70%以上同一性的氨基酸序列;第十四肽基载体蛋白结构域,其包含如SEQ ID NO:47所示的氨基酸序列或与SEQ IDNO:47所示的氨基酸序列具有70%以上同一性的氨基酸序列。
2.根据权利要求1所述的参与阳离子肽化合物合成的基因,其特征在于,所述蛋白质为如下(a)-(n)中的任意一种:
(a)包含如SEQ ID NO:48所示的氨基酸序列或包含与SEQ ID NO:48所示的氨基酸序列具有70%以上同一性的氨基酸序列;
(b)包含如SEQ ID NO:49所示的氨基酸序列或包含与SEQ ID NO:49所示的氨基酸序列具有70%以上同一性的氨基酸序列;
(c)包含如SEQ ID NO:50所示的氨基酸序列或包含与SEQ ID NO:50所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(d)包含如SEQ ID NO:51所示的氨基酸序列或包含与SEQ ID NO:51所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(e)包含如SEQ ID NO:52所示的氨基酸序列或包含与SEQ ID NO:52所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(f)包含如SEQ ID NO:53所示的氨基酸序列或包含与SEQ ID NO:53所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(g)包含如SEQ ID NO:54所示的氨基酸序列或包含与SEQ ID NO:54所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(h)包含如SEQ ID NO:55所示的氨基酸序列或包含与SEQ ID NO:55所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(i)包含如SEQ ID NO:56所示的氨基酸序列或包含与SEQ ID NO:56所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(j)包含如SEQ ID NO:57所示的氨基酸序列或包含与SEQ ID NO:57所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(k)包含如SEQ ID NO:58所示的氨基酸序列或包含与SEQ ID NO:58所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(l)包含如SEQ ID NO:59所示的氨基酸序列或包含与SEQ ID NO:59所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(m)包含如SEQ ID NO:60所示的氨基酸序列或包含与SEQ ID NO:60所示的氨基酸序列具有70%以上同一性的氨基酸序列,且具有转录因子活性的蛋白质;
(n)包含有多核苷酸编码的蛋白质,所述多核苷酸在严格条件下与如SEQ ID NO:61-SEQ ID NO:67任一所示的核苷酸序列的互补链杂交,且具有由芽孢杆菌属细菌产生的阳离子肽化合物的非核糖体肽合成活性及聚合酮酶合成活性。
3.根据权利要求1或2所述的参与阳离子肽化合物合成的基因,其特征在于,所述基因来源于芽孢杆菌属细菌,包括但不限于侧孢短芽孢杆菌(Brevibacillus laterosporus)、短小芽孢杆菌(Bacillus pumilus Meyer and Gottheil)、枯草芽孢杆菌(Bacillus subtilis)、蜡样芽孢杆菌(Bacillus cereus)或苏云金芽孢杆菌(Bacillus thuringiensis)。
4.根据权利要求3所述的参与阳离子肽化合物合成的基因,其特征在于,所述基因来源于侧孢短芽孢杆菌Brevibacillus laterosporus S62-9,所述侧孢短芽孢杆菌Brevibacillus laterosporus S62-9的保藏编号为CGMCC No.18629。
6.一种生物材料,其特征在于,包括如下任意一种:
(1)多肽,由权利要求1-5任一项所述的参与阳离子肽化合物合成的基因编码;
(2)含有权利要求1-5任一项所述的参与阳离子肽化合物合成的基因的表达盒;
(3)含有权利要求1-5任一项所述的参与阳离子肽化合物合成的基因或(2)中所述的表达盒的重组载体;
(4)含有权利要求1-5任一项所述的参与阳离子肽化合物合成的基因、(2)中所述的表达盒或(3)中所述的重组载体的宿主细胞。
7.根据权利要求6所述的生物材料,其特征在于,所述宿主细胞包括但不限于芽孢杆菌属细菌,大肠杆菌(Escherichia coli),或毕赤酵母(Pichia pastoris)。
8.一种阳离子肽化合物的制备方法,其特征在于,将权利要求1-5任一项所述的参与阳离子肽化合物合成的基因导入宿主细胞中,培养,从获得的培养液中分离。
9.权利要求1-5任一项所述的参与阳离子肽化合物合成的基因或权利要求6-7任一项所述的生物材料在制备阳离子肽化合物的用途。
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210442081.5A CN114561406A (zh) | 2022-04-26 | 2022-04-26 | 一种参与阳离子肽化合物合成的基因及其用途 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210442081.5A CN114561406A (zh) | 2022-04-26 | 2022-04-26 | 一种参与阳离子肽化合物合成的基因及其用途 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN114561406A true CN114561406A (zh) | 2022-05-31 |
Family
ID=81721232
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202210442081.5A Pending CN114561406A (zh) | 2022-04-26 | 2022-04-26 | 一种参与阳离子肽化合物合成的基因及其用途 |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114561406A (zh) |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101809030A (zh) * | 2007-08-09 | 2010-08-18 | 诺瓦提斯公司 | 硫肽前体蛋白质、编码该蛋白质的基因及其用途 |
| US20190264189A1 (en) * | 2017-09-07 | 2019-08-29 | The Regents Of The University Of California | Compositions and methods for the treatment or prevention of heart failure |
| CN111280182A (zh) * | 2020-04-01 | 2020-06-16 | 北京工商大学 | 一种活性肽组合物及其应用 |
-
2022
- 2022-04-26 CN CN202210442081.5A patent/CN114561406A/zh active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101809030A (zh) * | 2007-08-09 | 2010-08-18 | 诺瓦提斯公司 | 硫肽前体蛋白质、编码该蛋白质的基因及其用途 |
| US20190264189A1 (en) * | 2017-09-07 | 2019-08-29 | The Regents Of The University Of California | Compositions and methods for the treatment or prevention of heart failure |
| CN111280182A (zh) * | 2020-04-01 | 2020-06-16 | 北京工商大学 | 一种活性肽组合物及其应用 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2022202248B2 (en) | Nucleic acid-guided nucleases | |
| AU2014335915B2 (en) | Modified helicases | |
| JP7027334B2 (ja) | アルファ溶血素バリアントおよびその使用 | |
| CN100516220C (zh) | 双岐杆菌的丝氨酸蛋白酶抑制剂 | |
| KR102531695B1 (ko) | 프로바이오틱으로서 사용하기 위한 락토바실러스, 및 프로바이오틱을 비롯한 제제에 대한 면역 반응을 평가하기 위해 사용되는 혈액 세포 집단 | |
| KR20190082318A (ko) | Crispr/cpf1 시스템 및 방법 | |
| KR20220029676A (ko) | 희토류 원소(ree) 결합 단백질 | |
| KR20200111172A (ko) | 네페탈락톨 산화 환원 효소, 네페탈락톨 합성 효소, 및 네페탈락톤을 생산할 수 있는 미생물 | |
| KR20230111399A (ko) | 신규 CRISPR/Cas12a 시스템 및 그 용도 | |
| KR20230079107A (ko) | 개선된 특성을 갖는 유전자 변형된 메틸로바실러스 세균 | |
| CA2258337A1 (en) | Peptidoglycan biosynthetic gene murd from streptococcus pneumoniae | |
| JP3404533B2 (ja) | 特にグラム陽性バクテリアのグリコペプタイドに対する抵抗力の発現に係るポリペプタイド、ポリペプタイドをコードするヌクレオチドシーケンスおよび診断への使用 | |
| WO1995027057A1 (en) | GENE CODING FOR IgG-Fc-BINDING PROTEIN | |
| JPH11243977A (ja) | 1,3−β−グルカンシンターゼのエキノカンジン結合領域 | |
| CN114561406A (zh) | 一种参与阳离子肽化合物合成的基因及其用途 | |
| JP2000509605A (ja) | テイコ酸酵素および検定法 | |
| CN110804596B (zh) | 一种新型的脯氨酸3-羟化酶及其应用 | |
| CN113402596A (zh) | 双叉犀金龟rr-2家族表皮蛋白,编码核苷酸序列及其应用 | |
| JP2000503525A (ja) | 新規dna分子 | |
| CN115362370A (zh) | 新颖的杀昆虫毒素受体和使用方法 | |
| KR101492561B1 (ko) | 항생제 내성균 검출용 마이크로어레이 및 이를 이용한 항생제 내성균 분석 방법 | |
| CN1952130A (zh) | 肝癌高表达基因peg10、其编码蛋白及其应用 | |
| CN103012585B (zh) | 一种整合膜蛋白的多克隆抗体及其制备方法 | |
| RU2639246C1 (ru) | Применение домена белка S-слоя из Lactobacillus brevis в качестве компонента системы для экспонирования слитых белков на поверхности клеток молочнокислых бактерий | |
| Tan et al. | Molecular cloning and functional characterisation of VanX, a D-alanyl-D-alanine dipeptidase from Streptomyces coelicolor A3 (2) |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20220531 |
|
| RJ01 | Rejection of invention patent application after publication |