CN1311075C - System for expressing hyperthermostable protein - Google Patents
System for expressing hyperthermostable protein
- Publication number
- CN1311075C CNB200410101968XA CN200410101968A
- Authority
- CN
- China
- Prior art keywords
- gly
- ala
- val
- ser
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 123
- 102000004169 proteins and genes Human genes 0.000 title claims description 28
- 108091005804 Peptidases Proteins 0.000 claims abstract description 221
- 239000004365 Protease Substances 0.000 claims abstract description 127
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 68
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 14
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims abstract 5
- 238000012217 deletion Methods 0.000 claims abstract 2
- 230000037430 deletion Effects 0.000 claims abstract 2
- 102000035195 Peptidases Human genes 0.000 claims description 198
- 239000013612 plasmid Substances 0.000 claims description 132
- 108090000790 Enzymes Proteins 0.000 claims description 64
- 102000004190 Enzymes Human genes 0.000 claims description 63
- 241000894006 Bacteria Species 0.000 claims description 49
- 238000000034 method Methods 0.000 claims description 43
- 230000000694 effects Effects 0.000 claims description 38
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 22
- 108090000787 Subtilisin Proteins 0.000 claims description 22
- 235000018102 proteins Nutrition 0.000 claims description 18
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 13
- 230000008676 import Effects 0.000 claims description 12
- 239000013600 plasmid vector Substances 0.000 claims description 7
- 210000004899 c-terminal region Anatomy 0.000 claims 1
- 238000003780 insertion Methods 0.000 abstract description 5
- 230000037431 insertion Effects 0.000 abstract description 5
- 238000004519 manufacturing process Methods 0.000 abstract description 3
- 238000007792 addition Methods 0.000 abstract 1
- 238000010353 genetic engineering Methods 0.000 abstract 1
- 238000006467 substitution reaction Methods 0.000 abstract 1
- 235000019419 proteases Nutrition 0.000 description 121
- 108020004414 DNA Proteins 0.000 description 93
- 238000006062 fragmentation reaction Methods 0.000 description 59
- 238000013467 fragmentation Methods 0.000 description 57
- 108020004707 nucleic acids Proteins 0.000 description 43
- 102000039446 nucleic acids Human genes 0.000 description 43
- 150000007523 nucleic acids Chemical class 0.000 description 43
- 230000014509 gene expression Effects 0.000 description 28
- 101150101373 fus gene Proteins 0.000 description 26
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 239000011541 reaction mixture Substances 0.000 description 23
- 108010056079 Subtilisins Proteins 0.000 description 22
- 239000012634 fragment Substances 0.000 description 22
- 239000013611 chromosomal DNA Substances 0.000 description 18
- 238000002360 preparation method Methods 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 235000001014 amino acid Nutrition 0.000 description 15
- 150000001413 amino acids Chemical class 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- 239000012228 culture supernatant Substances 0.000 description 13
- 108010017391 lysylvaline Proteins 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 11
- 102000004196 processed proteins & peptides Human genes 0.000 description 11
- 108091008146 restriction endonucleases Proteins 0.000 description 11
- 102000005158 Subtilisins Human genes 0.000 description 10
- 230000008859 change Effects 0.000 description 10
- 230000035800 maturation Effects 0.000 description 10
- 238000011144 upstream manufacturing Methods 0.000 description 10
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 7
- 241000589516 Pseudomonas Species 0.000 description 7
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 7
- 108020005038 Terminator Codon Proteins 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 230000029087 digestion Effects 0.000 description 7
- 239000013613 expression plasmid Substances 0.000 description 7
- 239000000284 extract Substances 0.000 description 7
- XUWPJKDMEZSVTP-LTYMHZPRSA-N kalafungina Chemical compound O=C1C2=C(O)C=CC=C2C(=O)C2=C1[C@@H](C)O[C@H]1[C@@H]2OC(=O)C1 XUWPJKDMEZSVTP-LTYMHZPRSA-N 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 6
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 6
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 6
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 6
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 6
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 6
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 6
- 108010070783 alanyltyrosine Proteins 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000007062 hydrolysis Effects 0.000 description 6
- 238000006460 hydrolysis reaction Methods 0.000 description 6
- 244000005700 microbiome Species 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- LKDMKWNDBAVNQZ-WJNSRDFLSA-N 4-[[(2s)-1-[[(2s)-1-[(2s)-2-[[(2s)-1-(4-nitroanilino)-1-oxo-3-phenylpropan-2-yl]carbamoyl]pyrrolidin-1-yl]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-4-oxobutanoic acid Chemical compound OC(=O)CCC(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)NC=1C=CC(=CC=1)[N+]([O-])=O)CC1=CC=CC=C1 LKDMKWNDBAVNQZ-WJNSRDFLSA-N 0.000 description 5
- TYMLOMAKGOJONV-UHFFFAOYSA-N 4-nitroaniline Chemical compound NC1=CC=C([N+]([O-])=O)C=C1 TYMLOMAKGOJONV-UHFFFAOYSA-N 0.000 description 5
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 5
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 5
- 244000063299 Bacillus subtilis Species 0.000 description 5
- 240000009212 Coccoloba uvifera Species 0.000 description 5
- 235000003913 Coccoloba uvifera Nutrition 0.000 description 5
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 5
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 5
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 5
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 5
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 235000013336 milk Nutrition 0.000 description 5
- 239000008267 milk Substances 0.000 description 5
- 210000004080 milk Anatomy 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 108010015796 prolylisoleucine Proteins 0.000 description 5
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 5
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 4
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 4
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 4
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 4
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 4
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 4
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 4
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 4
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 4
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 4
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 4
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 4
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- 108091027305 Heteroduplex Proteins 0.000 description 4
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 4
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 4
- 108010047562 NGR peptide Proteins 0.000 description 4
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 4
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 4
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 4
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 4
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 4
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 4
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 4
- 101150080287 SUB3 gene Proteins 0.000 description 4
- 101150023658 SUB4 gene Proteins 0.000 description 4
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 4
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 4
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 4
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 4
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 4
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 4
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000004321 preservation Methods 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 3
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 3
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 3
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 3
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 3
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 3
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 3
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 3
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 3
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 3
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 3
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- 101800001415 Bri23 peptide Proteins 0.000 description 3
- 101800000655 C-terminal peptide Proteins 0.000 description 3
- 102400000107 C-terminal peptide Human genes 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 3
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 3
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 3
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 3
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 3
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 3
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 3
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 3
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 3
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 3
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 3
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 3
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 3
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 3
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 239000001888 Peptone Substances 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 3
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 3
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 3
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 3
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 3
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 3
- ULHJJQYGMWONTD-HKUYNNGSSA-N Tyr-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ULHJJQYGMWONTD-HKUYNNGSSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 239000005018 casein Substances 0.000 description 3
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 3
- 235000021240 caseins Nutrition 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- GLNDAGDHSLMOKX-UHFFFAOYSA-N coumarin 120 Chemical compound C1=C(N)C=CC2=C1OC(=O)C=C2C GLNDAGDHSLMOKX-UHFFFAOYSA-N 0.000 description 3
- 238000013016 damping Methods 0.000 description 3
- 230000008034 disappearance Effects 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 239000011259 mixed solution Substances 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- UVFAEQZFLBGVRM-MSMWPWNWSA-N succinyl-Leu-Leu-Val-Tyr-7-amino-4-methylcoumarin Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CCC(O)=O)CC(C)C)C(=O)NC=1C=C2OC(=O)C=C(C)C2=CC=1)C1=CC=C(O)C=C1 UVFAEQZFLBGVRM-MSMWPWNWSA-N 0.000 description 3
- 108010090544 succinyl-leucyl-leucyl-valyl-tyrosyl-methylcoumarinamide Proteins 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 2
- 101710184263 Alkaline serine protease Proteins 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 102000008186 Collagen Human genes 0.000 description 2
- 108010035532 Collagen Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- 102000000588 Interleukin-2 Human genes 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 2
- 108010064696 N,O-diacetylmuramidase Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 2
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 241000219095 Vitis Species 0.000 description 2
- 235000009754 Vitis X bourquina Nutrition 0.000 description 2
- 235000012333 Vitis X labruscana Nutrition 0.000 description 2
- 235000014787 Vitis vinifera Nutrition 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- ROOXNKNUYICQNP-UHFFFAOYSA-N ammonium persulfate Chemical compound [NH4+].[NH4+].[O-]S(=O)(=O)OOS([O-])(=O)=O ROOXNKNUYICQNP-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 229920001436 collagen Polymers 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 238000007669 thermal treatment Methods 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 239000004160 Ammonium persulphate Substances 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 101710089384 Extracellular protease Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- 101001002657 Homo sapiens Interleukin-2 Proteins 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- IFQSXNOEEPCSLW-DKWTVANSSA-N L-cysteine hydrochloride Chemical compound Cl.SC[C@H](N)C(O)=O IFQSXNOEEPCSLW-DKWTVANSSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 1
- BGGWNVWMHNTRDU-BZSNNMDCSA-N Pro-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 BGGWNVWMHNTRDU-BZSNNMDCSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- 102100023152 Scinderin Human genes 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 230000006035 T cell-directed cellular cytotoxicity Effects 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- NDLHSJWPCXKOGG-VLCNGCBASA-N Thr-Trp-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N)O NDLHSJWPCXKOGG-VLCNGCBASA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/75—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
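A hyperthermostable protease having the amino acid sequence represented by SEQ ID NO:1 of the Sequence Listing, or a sequence derived from it by deletion, substitution, insertion or addition of one to several amino acid residues; a gene encoding the hyperthermostable protease; and a method for producing the protease. The aim is to provide, by genetic engineering techniques, a hyperthermostable protease that is advantageous for industrial use.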
Description
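This application is a divisional application. The parent application has application number 98805990.8 (PCT/JP98/02465), a filing date of June 4, 1998, and the title "System for expressing hyperthermostable protease".
Technical Field
The present invention relates to a hyperthermostable protease useful as an enzyme in industrial applications, a gene encoding the enzyme, and a method for producing the enzyme by genetic engineering techniques.
Background Art
A protease is an enzyme that cleaves peptide bonds in proteins. Many such enzymes have been found in animals, plants and microorganisms. Proteases are used as laboratory reagents and as pharmaceuticals, and in industrial fields, for example as additives to detergents, for food processing, and for chemical synthesis using the reverse reaction. Proteases are therefore extremely important enzymes for industry. Because proteases used in industry need high physical and chemical stability, thermostable enzymes are preferred. Proteases produced by bacteria of the genus Bacillus show relatively high thermostability, so they are the main proteases in industrial use. In the search for better enzymes, however, attempts have been made to obtain enzymes from microorganisms that grow at high temperatures, for example thermophilic bacteria of the genus Bacillus or hyperthermophiles.
For example, the well-known hyperthermophile Pyrococcus furiosus produces a protease (Appl. Environ. Microbiol., 56:1992-1998 (1990); FEMS Microbiol. Lett., 71:17-20 (1990); J. Gen. Microbiol., 137:1193-1199 (1991)).
In addition, the hyperthermophile Pyrococcus sp. strain KOD1 has been reported to produce a thiol protease (cysteine protease) (Appl. Environ. Microbiol., 60:4559-4566 (1994)). Hyperthermophiles of the genera Thermococcus, Staphylothermus and Thermobacteroides are also known to produce proteases (Appl. Microbiol. Biotechnol., 34:715-719 (1991)).
Proteases from hyperthermophiles such as those described above are highly thermostable. They can therefore be expected to replace the thermostable proteases now in use, or to be used in fields where the use of proteases has not yet been considered.
However, most microorganisms that produce these enzymes grow only at high temperatures. For example, Pyrococcus furiosus must be cultured at 90-100°C. Culturing at such temperatures is disadvantageous in terms of energy consumption. Moreover, hyperthermophiles produce less protease than conventional protease-producing microorganisms. Industrial production of proteases from hyperthermophiles is therefore problematic.
On the other hand, producing an enzyme by genetic engineering, by isolating the gene for the enzyme of interest and introducing it into a host microorganism that is easy to culture, is now routine in the art. However, a gene introduced into a host is not always expressed as efficiently as expected. The main reason is believed to be that the GC content or codon usage of the introduced gene differs from that of the host's genes.
The expression method therefore has to be optimized for each introduced gene and/or each host in order to reach a productivity suited to the intended use of the enzyme.
Objects of the Invention
To solve the problems described above, the objects of the present invention are to provide a protease from a hyperthermophile that is advantageous for industrial use, to isolate the gene encoding the protease from the hyperthermophile, and to provide a method for producing the hyperthermostable protease by genetic engineering using the gene.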
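Summary of the Invention
Among the proteases produced by hyperthermophiles, some are classified as subtilisin-type alkaline proteases on the basis of amino acid sequence homology. When a gene encoding such a protease is introduced into Bacillus subtilis, a host commonly used for production by genetic engineering, the productivity of the enzyme is lower than that of proteins that Bacillus subtilis produces endogenously.
After intensive study, the present inventors found that the gene of interest is expressed efficiently in Bacillus subtilis when a gene encoding a signal peptide (signal sequence) derived from subtilisin is placed upstream of the hyperthermophile protease gene to be expressed and the amino acid sequence around the cleavage site is modified. They further found that the expression level of the enzyme can be increased by deleting from the hyperthermophile protease gene a portion that is not essential for enzymatic activity. The present invention was completed on this basis.
The present invention is summarized as follows. The first aspect of the invention is a thermostable protease having the amino acid sequence given in SEQ ID NO:1 of the Sequence Listing, and a protease that has thermostable protease activity and an amino acid sequence in which one or several amino acid residues are deleted from, substituted in, inserted into or added to the amino acid sequence given in SEQ ID NO:1.
The second aspect of the invention is a gene encoding the thermostable protease of the first aspect, and a thermostable protease gene that hybridizes with that gene.
The third aspect of the invention is a gene, derived from a hyperthermophile, for use in producing a thermostable protease by genetic engineering techniques, characterized in that the gene encodes an amino acid sequence represented by general formula I:
SIG-Ala-Gly-Gly-Asn-PRO [I]
where SIG represents the amino acid sequence of a signal peptide derived from subtilisin and PRO represents the amino acid sequence of the protein to be expressed. SIG is preferably the amino acid sequence given in SEQ ID NO:3 of the Sequence Listing. PRO is preferably the amino acid sequence of a hyperthermostable protease derived from a hyperthermophile, more preferably that of a protease derived from Pyrococcus furiosus.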
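The layout of general formula I can be illustrated with a minimal Python sketch. It only concatenates a signal peptide, the fixed Ala-Gly-Gly-Asn linker and the protein to be expressed, in one-letter code. The function name and the two short sequences are hypothetical placeholders; the real SIG (SEQ ID NO:3) and PRO sequences are in the Sequence Listing and are not reproduced here.

```python
# One-letter codes for the four linker residues named in formula I.
THREE_TO_ONE = {"Ala": "A", "Gly": "G", "Asn": "N"}
LINKER = ("Ala", "Gly", "Gly", "Asn")


def build_fusion(sig: str, pro: str) -> str:
    """Return SIG-Ala-Gly-Gly-Asn-PRO as a one-letter amino acid string."""
    linker = "".join(THREE_TO_ONE[residue] for residue in LINKER)
    return sig + linker + pro


# Placeholder sequences for illustration only (not SEQ ID NO:3 or a real protease).
example_fusion = build_fusion("MKKLLS", "TESTPEP")
print(example_fusion)
```

The linker is kept as data rather than hard-coded into the string so that the residues around the cleavage site are easy to see and to vary.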
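The fourth aspect of the invention relates to a method for producing a protein by genetic engineering, characterized in that the method comprises culturing a bacterium of the genus Bacillus into which the gene of the third aspect has been introduced, and collecting the protein of interest from the culture.
The fifth aspect of the invention is a plasmid for producing a protein by genetic engineering, characterized in that the gene of the third aspect is inserted into the plasmid.
Mutations such as deletion, substitution, insertion or addition of one to several amino acid residues can occur in the amino acid sequences of naturally occurring proteins, including the proteins disclosed in the present invention. Such mutations may arise from polymorphism or mutation of the gene encoding the protein, or from modification of the protein in vivo or during purification after synthesis. Nevertheless, such mutant proteins are known to be able to show physiological and biological activities equivalent to those of the unmutated protein. The same applies to proteins into whose amino acid sequences such mutations have been introduced artificially, in which case a still wider variety of mutants can be made. For example, it is well known that a polypeptide in which a cysteine residue in the amino acid sequence of human interleukin-2 (IL-2) is replaced with serine retains interleukin-2 activity (Science, 224:1431 (1984)). Accordingly, a protease that has an amino acid sequence in which one or several amino acid residues are deleted from, substituted in, inserted into or added to an amino acid sequence disclosed in the present invention, and that has protease activity equivalent to that of the protease of the present invention, is also within the scope of the invention.
As used herein, a "gene that hybridizes (with a particular gene)" is a gene whose base sequence is similar to that of the particular gene. A gene with a similar base sequence is likely to encode a protein whose amino acid sequence, and hence function, is similar to that of the protein encoded by the particular gene. Similarity between base sequences can be examined by determining whether the genes, or portions of them, form hybrids with each other (hybridize) under stringent conditions. In this way, a gene encoding a protein with a function similar to that of the protein encoded by the particular gene can be obtained. That is, a gene with a base sequence similar to that of the gene of the present invention can be obtained by hybridization according to known methods, using the gene obtained in the present invention or a portion of it as a probe. Hybridization can be carried out, for example, by the method described in T. Maniatis et al. (eds.), Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, 1989. More specifically, hybridization is carried out under the following conditions. Briefly, a membrane on which DNA is immobilized is incubated with the probe at 50°C for 12-20 hours in 6×SSC (1×SSC is 0.15 M NaCl, 0.015 M sodium citrate, pH 7.0) containing 0.5% SDS, 0.1% bovine serum albumin (BSA), 0.1% polyvinylpyrrolidone, 0.1% Ficoll 400 and 0.01% denatured salmon sperm DNA. After incubation, the membrane is washed until the signal from the immobilized DNA can be distinguished from the background, starting with 2×SSC containing 0.5% SDS at 37°C and then lowering the SSC concentration to 0.1× and raising the temperature to 50°C.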
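The SSC strengths named in the hybridization and wash conditions can be converted into salt concentrations with a few lines of Python. This is a minimal sketch that takes only the 1×SSC definition from the text (0.15 M NaCl, 0.015 M sodium citrate); the function name and the use of millimolar units are illustrative choices.

```python
# 1x SSC as defined in the text, expressed in millimolar.
NACL_MM_1X = 150
CITRATE_MM_1X = 15


def ssc_concentrations_mm(strength: float) -> tuple:
    """Return (NaCl mM, sodium citrate mM) for a given SSC strength, e.g. 6 for 6x SSC."""
    # Rounding removes floating-point noise such as 0.1 * 150 -> 15.000000000000002.
    return (round(strength * NACL_MM_1X, 6), round(strength * CITRATE_MM_1X, 6))


for strength in (6, 2, 0.1):  # hybridization, first wash, final wash
    nacl, citrate = ssc_concentrations_mm(strength)
    print(f"{strength}x SSC: {nacl} mM NaCl, {citrate} mM sodium citrate")
```

Lowering the SSC strength from 2× to 0.1× therefore drops NaCl from 300 mM to 15 mM, which together with the higher temperature makes the final wash the most stringent step.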
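Alternatively, instead of hybridization, a gene amplification method (for example, PCR) that uses part of the base sequence of the gene obtained in the present invention as primers can be used. Whether a gene obtained in this way encodes a protein with the function of interest can be determined by expressing the gene in a suitable host with a suitable expression system and examining the activity of the resulting protein.
Brief Description of the Drawings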
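Figure 1 is a restriction map of plasmid pSTC3.
Figure 2 compares the amino acid sequences of protease PFUL, protease TCES and subtilisin.
Figure 3 compares the amino acid sequences of protease PFUL, protease TCES and subtilisin.
Figure 4 compares the amino acid sequences of protease PFUL, protease TCES and subtilisin.
Figure 5 compares the amino acid sequences of protease PFUL, protease TCES and subtilisin.
Figure 6 is a restriction map of plasmid pSNP1.
Figure 7 is a restriction map of plasmid pPS1.
Figure 8 is a restriction map of plasmid pNAPS1.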
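Detailed Description of the Invention
The hyperthermostable proteases of the present invention include proteases from various hyperthermophiles, for example the proteases from Pyrococcus furiosus and Thermococcus celer described in WO 95/34645.
The protease gene from Pyrococcus furiosus DSM 3638 was isolated from a genomic DNA library of the strain on the basis of expression of thermostable protease activity. The plasmid containing the gene was designated plasmid pTPR12. Escherichia coli JM109 transformed with this plasmid was designated Escherichia coli JM109/pTPR12 and deposited under the Budapest Treaty on May 24, 1994 (date of original deposit) at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry, 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan, under accession number FERM BP-5103.
This protease is hereinafter referred to as protease PFUL. Protease PFUL is highly thermostable and shows protease activity even at 95°C.
The base sequence of the Pyrococcus furiosus DNA fragment inserted in plasmid pTPR12 has been determined. The base sequence of the roughly 4.8 kb portion of this fragment bounded by two DraI sites is shown in SEQ ID NO:5 of the Sequence Listing, and the amino acid sequence of the gene product deduced from that base sequence is shown in SEQ ID NO:6. In other words, the amino acid sequence shown in SEQ ID NO:6 is that of protease PFUL. As the sequence shows, protease PFUL consists of 1398 amino acid residues and is a high-molecular-weight protease with a molecular weight above 150,000.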
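The stated size of protease PFUL can be sanity-checked with a rough calculation. The sketch below multiplies the residue count from the text (1398) by an average residue mass of 110 Da. That average is a common rule of thumb and an assumption here, not a figure given in the specification; an exact mass would have to be computed from SEQ ID NO:6.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein (assumption).
AVERAGE_RESIDUE_MASS_DA = 110


def approx_molecular_weight(residue_count: int) -> int:
    """Rough molecular weight in daltons from the number of residues."""
    return residue_count * AVERAGE_RESIDUE_MASS_DA


pful_estimate = approx_molecular_weight(1398)
print(pful_estimate)  # rough estimate only
```

Under this assumption the estimate is on the order of 150 kDa, consistent with the description of protease PFUL as a protease with a molecular weight above 150,000.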
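Comparison of the amino acid sequence of protease PFUL shown in SEQ ID NO:6 with those of known microbial proteases shows that the first half of protease PFUL is homologous to a series of alkaline serine proteases typified by subtilisin (Protein Engineering, 4:719-737 (1991)). The homology is especially high around the four amino acid residues that are believed to be important for the catalytic activity of the protease.
As described above, regions common to proteases from mesophiles were found to be conserved in the amino acid sequence of protease PFUL produced by the hyperthermophile Pyrococcus furiosus. Homologous proteases produced by hyperthermophiles other than Pyrococcus furiosus can therefore be expected to have these regions as well.
For example, a gene encoding a hyperthermostable protease can be screened for by PCR using chromosomal DNA from various hyperthermophiles as the template and combinations of the oligonucleotides PRO-1F, PRO-2F, PRO-2R and PRO-4R as primers. These oligonucleotides were synthesized on the basis of base sequences in the protease PFUL gene that encode regions of the protease PFUL amino acid sequence showing high homology to subtilisin and related enzymes. The base sequences of PRO-1F, PRO-2F, PRO-2R and PRO-4R are shown in SEQ ID NOS:7, 8, 9 and 10 of the Sequence Listing, respectively.
Because the protease of the present invention is derived from a hyperthermophile, bacteria belonging to the genera Pyrococcus, Thermococcus, Staphylothermus, Thermobacteroides and the like can be used. As a bacterium belonging to the genus Thermococcus, Thermococcus celer DSM 2476 can be used, for example. This strain is available from Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH. When PCR is carried out using chromosomal DNA from Thermococcus celer DSM 2476 as the template and the combination of oligonucleotides PRO-1F and PRO-2R, or PRO-2F and PRO-4R, as primers, specific DNA fragments are amplified, indicating the presence of a protease gene. Furthermore, the amino acid sequences encoded by these fragments can be deduced by inserting the fragments into a suitable plasmid vector to create recombinant plasmids and determining the base sequences of the inserts by the dideoxy method. The results showed that the DNA fragments encode amino acid sequences homologous to those of protease PFUL and of alkaline serine proteases from various microorganisms, and that the PCR-amplified fragments were amplified from a protease gene as the template.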
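The idea of using a forward and a reverse primer to pull out the region between two conserved sites can be sketched in Python as a toy "in-silico PCR". This is only an illustration under simplifying assumptions: exact matching on a single strand, no degenerate bases and no mismatches. The template and primer sequences are invented placeholders, not the PRO-1F, PRO-2F, PRO-2R or PRO-4R sequences of SEQ ID NOS:7-10.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def amplicon(template: str, fwd_primer: str, rev_primer: str):
    """Return the amplified region of the template top strand, or None if a primer has no exact site."""
    start = template.find(fwd_primer)
    if start == -1:
        return None
    # The reverse primer anneals to the top strand as its reverse complement.
    rev_site = reverse_complement(rev_primer)
    end = template.find(rev_site, start + len(fwd_primer))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


# Hypothetical template and primers for illustration only.
toy_template = "TTTACGTAAGGGCCCTTTGCAAT"
print(amplicon(toy_template, "ACGTAA", "TGCAAA"))
```

A `None` result corresponds to the case where no specific fragment is amplified, which in the screening described above would argue against the presence of a homologous protease gene.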
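Next, a gene encoding a hyperthermostable protease (for example, the gene encoding the hyperthermostable protease produced by Thermococcus celer) can be obtained by screening a gene library from a hyperthermophile using the PCR-amplified DNA fragment or the oligonucleotides described above as a probe.
For example, a phage clone containing the gene of interest can be obtained by plaque hybridization of a library with the PCR-amplified DNA fragment as the probe. The library is produced by ligating the λGEM-11 vector (Promega) to DNA fragments obtained by partial digestion of chromosomal DNA from Thermococcus celer DSM 2476 with the restriction enzyme Sau3AI, and then packaging the ligated DNA into λ phage particles by in vitro packaging.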
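The Sau3AI digestion step can be illustrated with a short Python sketch that locates recognition sites and splits a linear sequence at them. The recognition sequence GATC, cut immediately before the G, is a known property of Sau3AI rather than something stated in the text. The function names and the toy sequence are hypothetical, and the sketch models a complete digest; a partial digest, as used for the library, cuts only a subset of these sites and so yields longer fragments.

```python
SAU3AI_SITE = "GATC"  # Sau3AI cuts immediately 5' of the G


def cut_positions(seq: str, site: str = SAU3AI_SITE) -> list:
    """Return the indices at which the enzyme cuts the top strand."""
    positions = []
    index = seq.find(site)
    while index != -1:
        positions.append(index)
        index = seq.find(site, index + 1)
    return positions


def complete_digest(seq: str, site: str = SAU3AI_SITE) -> list:
    """Split a linear sequence at every site, dropping empty fragments."""
    bounds = [0] + cut_positions(seq, site) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:]) if a != b]


# Toy sequence for illustration only.
print(complete_digest("AAGATCTTGATCCC"))
```

Every internal fragment starts with GATC, which is what makes Sau3AI fragments compatible with BamHI-type cohesive ends on a cloning vector.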
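Analysis of the DNA fragment contained in one phage clone obtained in this way showed that the protease gene lies on a SacI fragment of about 1.9 kb. Determination of its base sequence further showed that this fragment lacks the 5' region of the protease gene. The 5' region can be obtained by PCR using a cassette and cassette primers (Takara Shuzo Genetic Engineering Products Guide, 1994-1995, pp. 250-251). A DNA fragment covering the 5' region of the hyperthermostable protease gene, which is missing from plasmid pTCS6, can thus be obtained. The base sequence of the entire hyperthermostable protease gene from Thermococcus celer can then be determined from the base sequences of the two DNA fragments.
The base sequence of the open reading frame found in the determined base sequence is shown in SEQ ID NO:11 of the Sequence Listing, and the amino acid sequence deduced from it is shown in SEQ ID NO:12. The base sequence of the gene encoding the hyperthermostable protease from Thermococcus celer and the amino acid sequence of the protease were thus determined. This protease was designated protease TCES.
An expression vector in which the entire protease TCES gene is reconstituted by joining the two DNA fragments can be constructed. When Escherichia coli was used as the host, however, no transformant carrying the intended expression plasmid was obtained, possibly because production of the gene product in the cell is harmful or lethal to Escherichia coli. In such a case, Bacillus subtilis, for example, can be used as the host so that the protease is secreted outside the cell, and its activity can then be measured.
As the Bacillus subtilis strain, Bacillus subtilis DB104 can be used; it is a known strain described in Gene, 83:215-233 (1989). As the cloning vector, plasmid pUB18-P43, a gift from Dr. Sui-Lam Wong of the University of Calgary, is used. This plasmid carries a kanamycin resistance gene as a selectable marker.
The recombinant plasmid formed by inserting the protease TCES gene downstream of the P43 promoter in plasmid vector pUB18-P43 was designated plasmid pSTC3. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pSTC3 and deposited under the Budapest Treaty on December 1, 1995 (date of original deposit) at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry, 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan, under accession number FERM BP-5635.
A restriction map of plasmid pSTC3 is shown in Figure 1. In Figure 1, the bold line represents the DNA fragment inserted into plasmid vector pUB18-P43.
Thermostable protease activity is found in the culture supernatant and in the cell extract of Bacillus subtilis DB104/pSTC3.
The main properties of the crude protease preparation obtained from the culture of the transformant are as follows:
(1) Action:
Degrades casein and gelatin to produce short-chain polypeptides.
Hydrolyzes succinyl-L-leucyl-L-leucyl-L-valyl-L-tyrosine-4-methylcoumarin-7-amide (Suc-Leu-Leu-Val-Tyr-MCA) to produce a fluorescent substance (7-amino-4-methylcoumarin).
Hydrolyzes succinyl-L-alanyl-L-alanyl-L-prolyl-L-phenylalanine-p-nitroanilide (Suc-Ala-Ala-Pro-Phe-p-NA) to produce a yellow substance (p-nitroaniline).
(2) Optimum temperature:
Shows enzymatic activity at 37-95°C, with an optimum temperature of 70-80°C.
(3) Optimum pH:
Shows enzymatic activity at pH 5.5-9, with an optimum of pH 7-8.
(4) Thermostability:
Retains 90% or more of its enzymatic activity after treatment at 80°C for 3 hours.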
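When the amino acid sequences of protease PFUL, protease TCES and subtilisin (subtilisin BPN′; Nucl. Acids Res. 11:7911-7925 (1983)) are aligned so that their homologous regions match, as shown in Figures 2-5, sequences are found between the homologous regions and at the C-terminus of protease PFUL that are not found in protease TCES or subtilisin. These results suggest that Pyrococcus furiosus may contain, in addition to protease PFUL, a protease that has a lower molecular weight than protease PFUL and resembles protease TCES or subtilisin.
Therefore, Southern hybridization was carried out against chromosomal DNA prepared from Pyrococcus furiosus using a DNA probe derived from the homologous regions. A signal other than that of the protease PFUL gene was observed, indicating the presence of another protease gene.
This novel protease gene is isolated by the following procedure.
For example, a DNA fragment containing the gene encoding the novel protease can be obtained by digesting chromosomal DNA from Pyrococcus furiosus with a suitable restriction enzyme and subjecting the digested DNA to Southern hybridization as described above. The nucleotide sequence of the DNA fragment is determined to confirm that the amino acid sequence it encodes is homologous to the proteases mentioned above. If the DNA fragment does not contain the entire gene of interest, the remaining portion can be obtained by inverse PCR or a similar method.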
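For example, when chromosomal DNA from Pyrococcus furiosus is digested with the restriction enzymes SacI and SpeI (Takara Shuzo) and then subjected to Southern hybridization, a signal of about 0.6 kb is observed. DNA fragments of this size are recovered and inserted between the SpeI and SacI sites of the plasmid vector pBluescript SK(-) (Stratagene), and Escherichia coli JM109 is transformed with the resulting recombinant plasmids. Clones carrying the fragment of interest can be obtained from the transformants by colony hybridization with the same probe as used for the Southern hybridization. Whether the plasmid contained in an obtained clone carries a sequence encoding the protease can be determined by sequencing the DNA fragment inserted in the plasmid. The presence of a protease gene in the plasmid was thus confirmed. The plasmid was designated plasmid pSS3.
The amino acid sequence deduced from the nucleotide sequence of the DNA fragment inserted in plasmid pSS3 was found to be homologous to those of subtilisin, protease PFUL, protease TCES and the like. The product of this protease gene, which is distinct from the protease PFUL gene and part of which was newly obtained from Pyrococcus furiosus as described above, was designated protease PFUS. The regions encoding the N-terminal and C-terminal portions of the protease can be obtained by inverse PCR.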
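Primers for inverse PCR can be prepared based on the nucleotide sequence of the DNA fragment inserted in plasmid pSS3. Chromosomal DNA from Pyrococcus furiosus is digested with a suitable restriction enzyme, and the resulting DNA fragments are subjected to intramolecular ligation. PCR with the reaction mixture as the template and the primers mentioned above yields DNA fragments corresponding to the regions flanking the protease gene fragment contained in plasmid pSS3. The amino acid sequence of the enzyme protein encoded by these regions can be deduced by analyzing the nucleotide sequences of the DNA fragments obtained. Furthermore, primers capable of amplifying the entire protease PFUS gene from Pyrococcus furiosus chromosomal DNA as the template can be prepared. Primers NPF-4 and NPR-4 can be designed for this purpose. Primer NPF-4 has the nucleotide sequence immediately upstream of the initiation codon of the protease PFUS gene and can introduce a BamHI site on the 5′ side of that sequence. Primer NPR-4 has a sequence complementary to the 3′ portion of the protease PFUS gene and can introduce an SphI site on the 5′ side of that sequence.
The nucleotide sequences of primers NPF-4 and NPR-4 are shown in SEQ ID NOS:13 and 14 of the Sequence Listing. With chromosomal DNA from Pyrococcus furiosus as the template, these two primers can be used to amplify the entire protease PFUS gene.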
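Like protease TCES, protease PFUS can be expressed in Bacillus subtilis as the host. A plasmid for expressing protease PFUS can be constructed from plasmid pSTC3, the expression plasmid for protease TCES. Specifically, a plasmid for expressing protease PFUS can be constructed by replacing the protease TCES gene in plasmid pSTC3 with a DNA fragment containing the entire protease PFUS gene, obtained by PCR amplification with the primers described above. The expression plasmid thus constructed was designated plasmid pSNP1. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pSNP1 and was deposited on December 1, 1995 (date of the original deposit) under the Budapest Treaty at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry, 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan, under accession number FERM BP-5634. The restriction map of plasmid pSNP1 is shown in Figure 6.
The nucleotide sequence corresponding to the open reading frame of the gene encoding protease PFUS and the amino acid sequence of protease PFUS deduced from it are shown in SEQ ID NOS:15 and 16 of the Sequence Listing, respectively.
Thermostable protease activity is found in both the culture supernatant and the cell extract of Bacillus subtilis DB104/pSNP1. That is, part of the expressed protease PFUS is secreted into the culture supernatant.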
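The main properties of the protease obtained from a culture of the transformant are as follows:
(1) Action:
Degrades casein and collagen to produce short-chain polypeptides.
Hydrolyzes succinyl-L-leucyl-L-leucyl-L-valyl-L-tyrosine-4-methylcoumarin-7-amide (Suc-Leu-Leu-Val-Tyr-MCA) to produce a fluorescent substance (7-amino-4-methylcoumarin).
Hydrolyzes succinyl-L-alanyl-L-alanyl-L-prolyl-L-phenylalanine-p-nitroanilide (Suc-Ala-Ala-Pro-Phe-p-NA) to produce a yellow substance (p-nitroaniline).
(2) Optimal temperature:
Exhibits enzymatic activity at 41-110°C, with an optimum at 80-95°C.
(3) Optimal pH:
Exhibits enzymatic activity at pH 5-10, with an optimum at pH 6-8.
(4) Thermostability:
Retains 90% or more of its enzymatic activity after treatment at 95°C for 8 hours.
(5) pH stability:
Retains 95% or more of its enzymatic activity after treatment at pH 5-11 and 95°C for 60 minutes.
(6) Molecular weight:
Shows a molecular weight of about 45 kDa on SDS-PAGE.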
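By procedures similar to those used to obtain the protease TCES gene and the protease PFUS gene, protease genes homologous to them can be obtained from hyperthermophiles other than Pyrococcus furiosus and Thermococcus celer.
A DNA fragment of about 1 kb encoding residues 323 to 650 of the amino acid sequence of protease PFUL shown in SEQ ID NO:6 of the Sequence Listing can be prepared and used as a probe for genomic Southern hybridization against chromosomal DNAs from Staphylothermus marinus and Thermobacteroides proteolyticus DSM 5265. As a result, signals were observed at about 4.8 kb in Staphylothermus marinus chromosomal DNA digested with PstI (Takara Shuzo) and at about 3.5 kb in Thermobacteroides proteolyticus chromosomal DNA digested with XbaI.
These results show that the chromosomal DNAs of Staphylothermus marinus and Thermobacteroides proteolyticus contain sequences homologous to the genes encoding protease PFUL, protease PFUS, protease TCES and the like. By examining the detected DNA fragments with procedures similar to those used to isolate and identify the genes encoding protease TCES and protease PFUS, the genes encoding hyperthermostable proteases in Staphylothermus marinus and Thermobacteroides proteolyticus can be isolated and identified.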
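In general, to produce a protein in large quantities by genetic engineering, it is considered advantageous to use a promoter that functions efficiently in the host rather than the promoter associated with the gene encoding the protein of interest. Although the P43 promoter used to construct the expression systems for protease TCES and protease PFUS is derived from Bacillus subtilis, it did not express the two proteases efficiently enough.
Therefore, to increase the expression level, a gene that is expressed at a high level in Bacillus subtilis, in particular a gene encoding a secreted protein, can be exploited. Genes encoding α-amylase or various extracellular proteases can be used. For example, use of the promoter and signal-peptide-coding region of the subtilisin gene is expected to increase the expression level of protease PFUS.
Specifically, by placing the entire protease PFUS gene downstream of the region of the subtilisin gene that encodes the signal peptide and includes the promoter region, such that the translation frames of the two genes match, protease PFUS can be expressed as a fusion protein under the control of the subtilisin promoter.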
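For example, the gene encoding subtilisin E can be used as the subtilisin gene in the present invention. The promoter and signal-peptide-coding region of the subtilisin E gene inserted in plasmid pKWZ, described in J. Bacteriol. 171:2657-2665 (1989), can be used. The nucleotide sequence of the 5′ upstream region including the promoter sequence is described in that reference, and the nucleotide sequence of the region encoding subtilisin is described in J. Bacteriol. 158:411-418 (1984).
Based on these sequences, primer SUB4, which introduces an EcoRI site upstream of the promoter sequence of the gene, and primer BmR1, which introduces a BamHI site downstream of the region encoding the subtilisin E signal peptide, are synthesized. The nucleotide sequences of primers SUB4 and BmR1 are shown in SEQ ID NOS:17 and 18 of the Sequence Listing, respectively. PCR with plasmid pKWZ as the template and primers SUB4 and BmR1 amplifies a DNA fragment of about 0.3 kb containing the promoter and signal-peptide-coding region of the subtilisin E gene.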
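The protease PFUS gene to be placed downstream of this DNA fragment can be obtained from Pyrococcus furiosus chromosomal DNA by PCR. Primer NPF-4 can be used as the primer that hybridizes to the 5′ region of the gene. Primer NPM-1, which is designed from the nucleotide sequence downstream of the stop codon of the gene and carries an SphI site, can be used as the primer that hybridizes to the 3′ region of the gene. The sequence of primer NPM-1 is shown in SEQ ID NO:19 of the Sequence Listing.
A BamHI site present within the gene poses a problem for this procedure, in which a BamHI site is used to join the protease PFUS gene to the 0.3 kb DNA fragment. Based on the nucleotide sequence of the protease PFUS gene shown in SEQ ID NO:15 of the Sequence Listing, primers mutRR and mutFR can be prepared to eliminate the BamHI site by PCR mutagenesis. The nucleotide sequences of primers mutRR and mutFR are shown in SEQ ID NOS:20 and 21 of the Sequence Listing, respectively. When the BamHI site is eliminated with these primers, the amino acid residue encoded at the site, glycine at position 560 of the amino acid sequence of protease PFUS shown in SEQ ID NO:16, is replaced by valine as a result of the base substitution introduced into the site.
With these primers, a protease PFUS gene that can be joined to the promoter and signal-peptide-coding region of subtilisin E is obtained. Specifically, two PCRs are carried out with chromosomal DNA from Pyrococcus furiosus as the template and either the primer pair mutRR and NPF-4 or the primer pair mutFR and NPM-1. The DNA fragments amplified in the two PCRs are then mixed to form a heteroduplex, which serves as the template for a second round of PCR with primers NPF-4 and NPM-1. In this way, the entire protease PFUS gene of about 2.4 kb, free of the internal BamHI site, is amplified.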
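The two-step PCR mutagenesis described above can be illustrated with a small string-level sketch. Everything in it is hypothetical: the 40-nt template is invented for illustration (it is not the protease PFUS gene), and the specific base change (GGA to GTA) is only one substitution that would turn Gly into Val while destroying a BamHI site; the actual primers mutRR and mutFR (SEQ ID NOS:20 and 21) are not reproduced here.

```python
BAMHI = "GGATCC"  # BamHI recognition sequence


def join_by_overlap(left: str, right: str, min_overlap: int = 8) -> str:
    """Merge two fragments whose ends overlap, mimicking heteroduplex extension."""
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:]
    raise ValueError("fragments do not share a sufficient overlap")


# Hypothetical in-frame template with a single internal BamHI site (GGA TCC = Gly Ser).
template = "ATGGCTGAACTGGTTGGATCCGACCATTACTATTACGATA"
site = template.index(BAMHI)

# Both mutagenic primers carry the same altered core, so both products contain it.
mutant_core = template[site - 6:site] + "GTATCC" + template[site + 6:site + 12]
left_fragment = template[:site - 6] + mutant_core    # outer 5' primer + reverse mutagenic primer
right_fragment = mutant_core + template[site + 12:]  # forward mutagenic primer + outer 3' primer

# Second round: the overlapping products anneal and are extended to full length.
full_length = join_by_overlap(left_fragment, right_fragment)
```

The point of the design is that the mutation sits inside the overlap shared by the two first-round products, so the full-length product of the second round carries it without any restriction step.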
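The PCR-amplified DNA fragment is digested with BamHI and SphI, and the resulting DNA fragment of about 2.4 kb is isolated and used to replace the BamHI-SphI fragment containing the protease PFUS gene in plasmid pSNP1. The expression vector thus constructed was designated plasmid pPS1. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pPS1. Protease activity similar to that observed for the transformant carrying plasmid pSNP1 was found in the culture supernatant and cell extract of this transformant, indicating that the amino acid substitution does not affect the enzymatic activity. The restriction map of plasmid pPS1 is shown in Figure 7.
The DNA fragment of about 0.3 kb containing the promoter and signal-peptide-coding region of the subtilisin E gene is digested with EcoRI and BamHI and used to replace the EcoRI-BamHI fragment of plasmid pPS1 that contains the P43 promoter and ribosome-binding site. The expression plasmid thus constructed was designated pNAPS1. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pNAPS1. Thermostable protease activity is found in the culture supernatant and cell extract of this transformant, and the expression level is increased compared with Bacillus subtilis DB104/pSNP1. The restriction map of plasmid pNAPS1 is shown in Figure 8.
The protease expressed by this transformant shows enzymological properties equivalent to those of the protease expressed by Bacillus subtilis DB104/pSNP1 described above. The protease expressed by this transformant was purified. The N-terminal amino acid sequence determined for the purified protease is shown in SEQ ID NO:22 of the Sequence Listing. This sequence is identical to residues 133 to 144 of the amino acid sequence of protease PFUS shown in SEQ ID NO:15, indicating that mature protease PFUS is an enzyme consisting of a polypeptide that starts at this position. The amino acid sequence of mature protease PFUS inferred from these results is shown in SEQ ID NO:4 of the Sequence Listing.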
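Although the amount of protease produced by Bacillus subtilis DB104/pNAPS1 is greater than that produced by Bacillus subtilis DB104/pSNP1 (FERM BP-5634), still higher productivity is desired. The expression level of the protease is expected to increase if the junction of the fusion peptide encoded by pNAPS1, between the subtilisin signal peptide and protease PFUS, is modified so that the signal peptide is removed more efficiently. In plasmid pNAPS1, a peptide consisting of the three amino acid residues Ala-Gly-Ser is inserted between the C-terminal amino acid residue (Ala) of the subtilisin E signal peptide shown in SEQ ID NO:3 of the Sequence Listing and the N-terminal amino acid residue (Met) of protease PFUS. By introducing mutations into the DNA encoding this peptide in plasmid pNAPS1 and then examining the protease productivity of transformants carrying the mutated plasmids, a transformant with an increased protease expression level can be obtained.
First, mutant plasmids are prepared in which, in the gene encoding the subtilisin E-protease PFUS fusion protein in plasmid pNAPS1, the portion encoding Ser of the three-amino-acid peptide is modified so that it encodes two random amino acid residues. Such mutant plasmids can be produced by PCR. For example, primers SPOFO and SPORO, which have a sequence in which the codon encoding Ser (TCC) is replaced by six random bases (their nucleotide sequences are shown in SEQ ID NOS:24 and 25 of the Sequence Listing, respectively), are used for PCR together with primers SUB3 and NPR-10, which are prepared based on the nucleotide sequences surrounding this region. This yields a DNA fragment in which the intended mutation has been introduced at the site corresponding to the Ser codon (TCC). Replacing the corresponding region of plasmid pNAPS1 with this fragment gives mutant plasmids containing a protease gene that carries the mutation.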
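By introducing the mutant plasmids thus obtained into a suitable host such as Bacillus subtilis DB104 and then measuring the level of protease expressed by the transformants, a transformant with an increased expression level can be obtained. The protease expression level can be determined by measuring the activity in independent cultures of the isolated transformants. Alternatively, transformants with an increased expression level can be selected conveniently on agar plates containing a substrate.
Specifically, transformants carrying the mutant plasmids are grown on agar plates containing skim milk. The plates are then incubated at a temperature at which protease PFUS exhibits its activity, for example 70°C. The skim milk around colonies of transformants that express the protease is degraded and becomes clear. The protease expression level can be estimated from the size of the clear zone.
One transformant obtained in this way, which expresses a higher level of protease activity than Bacillus subtilis DB104/pNAPS1, was designated Bacillus subtilis DB104/pSPO124. The plasmid contained in this transformant was prepared and designated pSPO124. Analysis of the nucleotide sequence of the plasmid showed that the portion encoding Ser had been changed to the nucleotide sequence GGGAAT; that is, the plasmid encodes a protein in which Ser has been changed to Gly-Asn.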
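The codon-level reasoning in the preceding paragraphs can be checked with a short sketch. It builds the standard genetic code table and translates the original Ser codon and the six-base replacement found in pSPO124. The function and variable names are illustrative only, and the library size is simply the number of possible six-base sequences, not a figure stated in the text.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


original_linker = "AG" + translate("TCC")      # Ala-Gly-Ser in pNAPS1
selected_linker = "AG" + translate("GGGAAT")   # Ala-Gly-Gly-Asn in pSPO124
random_hexamers = len(BASES) ** 6              # distinct six-base replacements
```

Here `original_linker` evaluates to `"AGS"` and `selected_linker` to `"AGGN"`, matching the Ser to Gly-Asn change described above.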
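It was thus demonstrated that the expression level of a protein of interest can be increased in cells of the genus Bacillus as the host by placing a peptide consisting of the four amino acid residues Ala-Gly-Gly-Asn downstream of the subtilisin signal peptide, fusing it to the N-terminus of the protein of interest, and expressing the fusion protein. In addition to subtilisin E (from Bacillus subtilis) used in the present invention, subtilisin BPN′ from Bacillus amyloliquefaciens (Nucl. Acids Res. 11:7911-7925 (1983)), subtilisin Carlsberg from Bacillus licheniformis (Nucl. Acids Res. 13:8913-8926 (1985)) and the like are well-known subtilisins produced by bacteria of the genus Bacillus. Their signal peptides can preferably be used in the present invention, although their amino acid sequences differ slightly from one another. Various promoters that function in bacteria of the genus Bacillus can be used in place of the promoter from the subtilisin E gene that is used in the present invention to control expression.
There is no limitation on the protein to be expressed. As long as a gene encoding the protein is available, the present invention can be applied to express the protein at a high level by genetic engineering. Given that a protein from Pyrococcus furiosus, which is taxonomically distinct from bacteria of the genus Bacillus, can be expressed at a high level, it is clear that the present invention can be used to express proteins derived from organisms other than the host. The present invention is preferably used for the production, by genetic engineering, of protease PFUL, protease TCES, and proteases structurally similar to protease PFUS that are derived from Staphylothermus marinus and Thermobacteroides proteolyticus.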
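Based on its homology to subtilisin, protease PFUS is considered to be expressed as a precursor protein having a signal peptide and a propeptide, which is then processed to yield the mature enzyme. Moreover, from the results of N-terminal amino acid sequence analysis of mature protease PFUS, the mature enzyme can be assumed to consist of the amino acid sequence shown in SEQ ID NO:4 of the Sequence Listing. However, the molecular weight of purified mature protease PFUS is about 45 kDa, which is smaller than the molecular weight calculated from that amino acid sequence. This indicates that protease PFUS expressed as a precursor is converted into the mature protease after its C-terminal peptide has also been processed.
If the C-terminal peptide removed by processing is not essential for enzymatic activity or for folding of the enzyme protein into its proper structure, the expression level of protease PFUS is also expected to increase when the region encoding that portion is deleted from the gene before the protease is expressed.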
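The molecular weight of mature protease PFUS obtained from Bacillus subtilis DB104/pNAPS1 can be determined precisely, for example, with a mass spectrometer. From the measured molecular weight and the N-terminal amino acid sequence of mature protease PFUS determined as described above, the protease was found to be a peptide corresponding to Ala at position 133 through Thr at position 552 of the amino acid sequence shown in SEQ ID NO:15 of the Sequence Listing. Furthermore, by introducing a stop codon near the portion of the protease PFUS gene in plasmid pNAPS1 that encodes Thr at position 552, a plasmid can be constructed that expresses protease PFUS lacking the polypeptide not essential for its enzymatic activity. Specifically, a DNA fragment whose nucleotide sequence carries the designed stop codon is obtained by PCR with primers NPR544 and NPFE81. Primer NPR544 introduces a stop codon (TGA) on the C-terminal side of the codon encoding the 544th amino acid residue (Ser), counted from the initiation codon of the protease PFUS gene in plasmid pNAPS1 (the nucleotide sequence of primer NPR544 is shown in SEQ ID NO:28 of the Sequence Listing). Primer NPFE81 has the nucleotide sequence of a region upstream of the NspV site in the gene (the nucleotide sequence of primer NPFE81 is shown in SEQ ID NO:29 of the Sequence Listing). Replacing the corresponding region of plasmid pNAPS1 with this fragment then gives a mutant plasmid containing the protease gene with the intended mutation. This plasmid was designated plasmid pNAPSΔC. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pNAPSΔC.
This transformant expresses protease activity with the same properties as protease PFUS, at a higher expression level than Bacillus subtilis DB104/pNAPS1.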
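It was thus found that the protease PFUS gene contained in plasmid pNAPSΔC has a region sufficient for expression of the enzymatic activity. The nucleotide sequence of the region encoding protease PFUS present in the plasmid is shown in SEQ ID NO:2 of the Sequence Listing. The amino acid sequence encoded by this nucleotide sequence is shown in SEQ ID NO:1 of the Sequence Listing.
Furthermore, by introducing a mutation similar to that in plasmid pNAPSΔC into the protease PFUS gene of plasmid pSPO124, protease PFUS lacking its C-terminal peptide can be expressed.
Specifically, the plasmid of interest is constructed by mixing and ligating a DNA fragment of about 1.3 kb, obtained by digesting plasmid pNAPSΔC with NspV and SphI, with plasmid pSPO124 that has been digested with NspV and SphI. This plasmid was designated plasmid pSPO124ΔC. Bacillus subtilis DB104 transformed with this plasmid was designated and indicated as Bacillus subtilis DB104/pSPO124ΔC, and was deposited on May 16, 1997 (date of the original deposit) under the Budapest Treaty at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry, 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan, under accession number FERM BP-6294. The protease expression level of this transformant is increased compared with Bacillus subtilis DB104/pNAPS1.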
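The enzymological and physicochemical properties of the proteases produced by the transformants Bacillus subtilis DB104/pNAPSΔC and Bacillus subtilis DB104/pSPO124ΔC appear to be identical to those of the protease produced by Bacillus subtilis DB104/pSNP1. The main properties of the proteases obtained from cultures of these two transformants are as follows:
(1) Action:
Degrades casein and collagen to produce short-chain polypeptides.
Hydrolyzes succinyl-L-leucyl-L-leucyl-L-valyl-L-tyrosine-4-methylcoumarin-7-amide (Suc-Leu-Leu-Val-Tyr-MCA) to produce a fluorescent substance (7-amino-4-methylcoumarin).
Hydrolyzes succinyl-L-alanyl-L-alanyl-L-prolyl-L-phenylalanine-p-nitroanilide (Suc-Ala-Ala-Pro-Phe-p-NA) to produce a yellow substance (p-nitroaniline).
(2) Optimal temperature:
Exhibits enzymatic activity at 40-110°C, with an optimum at 80-95°C.
(3) Optimal pH:
Exhibits enzymatic activity at pH 5-10, with an optimum at pH 6-8.
(4) Thermostability:
Retains 90% or more of its enzymatic activity after treatment at 95°C for 8 hours.
(5) pH stability:
Retains 95% or more of its enzymatic activity after treatment at 95°C and pH 5-11 for 60 minutes.
(6) Molecular weight:
Shows a molecular weight of about 45 kDa on SDS-PAGE.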
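Thus, a protease with high thermostability and its gene are provided. In addition, the present invention discloses a novel system for expressing proteins, which allows the protease to be expressed in large quantities. The expression system is used to produce the protease of the present invention and various other proteins by genetic engineering.
The following examples illustrate the present invention in more detail, but are not to be construed as limiting its scope.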
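Example 1
(1) Preparation of chromosomal DNA from Pyrococcus furiosus
Pyrococcus furiosus DSM 3638 was cultured under the following conditions.
A medium containing 1% tryptone, 0.5% yeast extract, 1% soluble starch, 3.5% Jamarine S Solid (Jamarine Laboratory), 3.5% Jamarine S Liquid (Jamarine Laboratory), 0.003% MgSO4, 0.001% NaCl, 0.0001% FeSO4·7H2O, 0.0001% CoSO4, 0.0001% CaCl2·7H2O, 0.0001% ZnSO4, 0.1 ppm CuSO4·5H2O, 0.1 ppm H3BO3, 0.1 ppm KAl(SO4)2, 0.1 ppm Na2MoO4·2H2O and 0.25 ppm NiCl2·H2O was placed in a 2 L medium bottle, sterilized at 120°C for 20 minutes, and sparged with nitrogen gas to remove dissolved oxygen. The strain was then inoculated into the medium and cultured at 95°C for 16 hours without shaking. After cultivation, the cells were collected by centrifugation.
The cells were then suspended in 4 ml of 50 mM Tris-HCl (pH 8.0) containing 25% sucrose. To the suspension were added 2 ml of 0.2 M EDTA and 0.8 ml of lysozyme (5 mg/ml). The mixture was incubated at 20°C for 1 hour. Then 24 ml of SET solution (150 mM NaCl, 1 mM EDTA, 20 mM Tris-HCl, pH 8.0), 4 ml of 5% SDS and 400 μl of proteinase K (10 mg/ml) were added to the mixture, and incubation was continued at 37°C for 1 hour. The reaction was terminated by extracting the mixture with phenol-chloroform. Ethanol precipitation was then carried out to obtain about 3.2 mg of chromosomal DNA.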
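Example 2
(1) Synthesis of primers for constructing plasmid pNSP1
To synthesize primers for amplifying the entire protease PFUS gene, plasmid pSNP1 containing the entire gene was isolated from Bacillus subtilis DB104/pSNP1 (FERM BP-5634), and the nucleotide sequence of the required region was determined. Based on this nucleotide sequence, primer NPF-4, which introduces a BamHI site immediately upstream of the initiation codon of the protease PFUS gene, and primer NPM-1, which hybridizes to the 3′ region of the gene and contains an SphI recognition site, were synthesized. The nucleotide sequences of primers NPF-4 and NPM-1 are shown in SEQ ID NOS:13 and 19 of the Sequence Listing, respectively.
Primers mutRR and mutFR were also synthesized to remove the BamHI site located about 1.7 kb downstream of the initiation codon in the protease PFUS gene. The nucleotide sequences of primers mutRR and mutFR are shown in SEQ ID NOS:20 and 21 of the Sequence Listing, respectively.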
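(2) Preparation of plasmid pPS1
Two LA-PCR reaction mixtures were prepared, each containing chromosomal DNA from Pyrococcus furiosus as the template and either primers NPF-4 and mutRR or primers mutFR and NPM-1, and were subjected to 30 cycles of 94°C for 30 seconds, 55°C for 1 minute and 68°C for 3 minutes. The LA-PCR reaction mixtures were prepared with the LA PCR Kit Ver. 2 (Takara Shuzo). Aliquots of the reaction mixtures were subjected to agarose gel electrophoresis; a DNA fragment of about 1.8 kb was observed with primers NPF-4 and mutRR, and a DNA fragment of about 0.6 kb with primers mutFR and NPM-1.
The primers were removed from the two PCR reaction mixtures with SUPREC-02 (Takara Shuzo) to prepare the amplified DNA fragments. An LA-PCR reaction mixture containing these two amplified DNA fragments but no primers or LA Taq was prepared, heat-denatured at 94°C for 10 minutes, cooled to 30°C over 30 minutes, and then incubated at 30°C for 15 minutes to form a heteroduplex. LA Taq (Takara Shuzo) was then added to the reaction mixture, which was reacted at 72°C for 30 minutes. Primers NPF-4 and NPM-1 were then added, and the reaction mixture was subjected to 25 cycles of 94°C for 30 seconds, 55°C for 1 minute and 68°C for 3 minutes. Amplification of a DNA fragment of about 2.4 kb was observed in the reaction mixture.
The DNA fragment of about 2.4 kb was digested with BamHI and SphI (both from Takara Shuzo). The fragment was mixed and ligated with plasmid pSNP1 from which the entire protease PFUS gene had been removed by digestion with BamHI and SphI, and the ligation product was introduced into Bacillus subtilis DB104. Plasmids were prepared from the resulting kanamycin-resistant transformants, and a plasmid carrying a single copy of the approximately 2.4 kb fragment was selected and designated plasmid pPS1. Bacillus subtilis DB104 transformed with plasmid pPS1 was designated Bacillus subtilis DB104/pPS1.
The restriction map of plasmid pPS1 is shown in Figure 7.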
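(3) Amplification of a DNA fragment containing the promoter and signal-peptide-coding region of the subtilisin E gene
Primers were synthesized to obtain the promoter and signal-peptide-coding region of the subtilisin E gene. First, based on the nucleotide sequence of the promoter region of the subtilisin E gene described in J. Bacteriol. 171:2657-2665 (1989), primer SUB4 was synthesized; it hybridizes to a sequence upstream of this region and contains an EcoRI site (the nucleotide sequence of primer SUB4 is shown in SEQ ID NO:17 of the Sequence Listing). Based on the nucleotide sequence of the subtilisin E gene described in J. Bacteriol. 158:411-418 (1984), primer BmR1 was synthesized; it introduces a BamHI site immediately downstream of the signal-peptide-coding region (the nucleotide sequence of primer BmR1 is shown in SEQ ID NO:18 of the Sequence Listing).
A PCR reaction mixture containing plasmid pKWZ, which carries the subtilisin E gene described in J. Bacteriol. 171:2657-2665, as the template and primers SUB4 and BmR1 was prepared and subjected to 30 cycles of 94°C for 30 seconds, 55°C for 1 minute and 68°C for 2 minutes. An aliquot of the reaction mixture was subjected to agarose gel electrophoresis, and amplification of a DNA fragment of about 0.3 kb was observed.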
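(4) Construction of the protease expression plasmid pNAPS1
The DNA fragment of about 0.3 kb described above was digested with EcoRI (Takara Shuzo) and BamHI. The fragment was mixed and ligated with plasmid pPS1, described in Example 3, that had been digested with EcoRI and BamHI, and the ligation product was introduced into Bacillus subtilis DB104. Plasmids were prepared from the resulting kanamycin-resistant transformants, and a plasmid carrying a single copy of the approximately 0.3 kb fragment was selected and designated plasmid pNAPS1. Bacillus subtilis DB104 transformed with plasmid pNAPS1 was designated Bacillus subtilis DB104/pNAPS1.
The restriction map of plasmid pNAPS1 is shown in Figure 8.
(5) Construction of plasmid pSNP2
Primer SUB17R was synthesized to introduce a BamHI site upstream of the signal-peptide-coding region of the subtilisin E gene in plasmid pNAPS1 described above (the nucleotide sequence of primer SUB17R is shown in SEQ ID NO:23 of the Sequence Listing). A PCR reaction mixture containing plasmid pNAPS1 as the template and primers SUB17R and SUB4 was prepared and subjected to 25 cycles of 94°C for 30 seconds, 55°C for 1 minute and 72°C for 1 minute. The amplified DNA fragment of about 0.21 kb was digested with EcoRI and BamHI to obtain a DNA fragment of about 0.2 kb containing the promoter and SD sequence of the subtilisin E gene. This fragment was mixed and ligated with plasmid pNAPS1 that had been digested with EcoRI and BamHI. Bacillus subtilis DB104 was transformed with the reaction mixture. Plasmids were prepared from the resulting kanamycin-resistant transformants, and a plasmid carrying the approximately 0.2 kb DNA fragment was selected and designated plasmid pSNP2.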
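(6) Generation of mutant plasmids that express the protease at high levels
Primers SPOFO and SPORO were synthesized to replace the sequence encoding the amino acid residue Ser (nucleotide sequence: TCC), located between the signal-peptide-coding region of the subtilisin E gene and the initiation codon of the protease PFUS gene in plasmid pNAPS1, with a sequence encoding two random amino acid residues (the nucleotide sequences of primers SPOFO and SPORO are shown in SEQ ID NOS:24 and 25 of the Sequence Listing, respectively). Primer SUB3, which introduces a BamHI site immediately upstream of the signal-peptide-coding region of the subtilisin E gene in plasmid pNAPS1, and primer NPR-10, which contains the SpeI site within the protease PFUS coding region, were also synthesized (the nucleotide sequences of primers SUB3 and NPR-10 are shown in SEQ ID NOS:26 and 27 of the Sequence Listing, respectively).
PCR reaction mixtures were prepared, each containing plasmid pNAPS1 as the template and either primers SPOFO and NPR-10 or primers SUB3 and SPORO, and were subjected to 20 cycles of 94°C for 30 seconds, 50°C for 1 minute and 72°C for 1 minute. The DNA fragments of about 0.13 kb and about 0.35 kb amplified in the two reaction mixtures were mixed, denatured at 94°C for 10 minutes, and cooled gradually to 37°C to form a heteroduplex. Double-stranded DNA was then generated from the heteroduplex with Taq polymerase (Takara Shuzo). A PCR reaction mixture containing this double-stranded DNA as the template and primers SUB3 and NPR-10 was prepared and subjected to 25 cycles of 94°C for 30 seconds, 50°C for 1 minute and 72°C for 1 minute. The amplified DNA fragment of about 0.43 kb was digested with BamHI and SpeI, and the resulting DNA fragment was mixed and ligated with plasmid pSNP2 that had been digested with BamHI and SpeI. Bacillus subtilis DB104 was transformed with the reaction mixture.
The resulting kanamycin-resistant transformants were inoculated onto skim milk plates (LB-agar medium for high-temperature cultivation containing 10 μg/ml kanamycin and 1% skim milk) to form colonies. The plates were then incubated at 70°C, and the protease activity expressed by each transformant was examined from the degree of degradation of the skim milk around the colony. As a result, one clone expressing particularly high activity was isolated, and the plasmid prepared from it was designated plasmid pSPO124. Bacillus subtilis DB104 transformed with this plasmid was designated Bacillus subtilis DB104/pSPO124. Analysis of the nucleotide sequence of plasmid pSPO124 showed that the nucleotide sequence encoding Ser in plasmid pNAPS1 had been replaced by the nucleotide sequence GGGAAT; that is, in the encoded protein, Ser had been converted into the two amino acid residues Gly-Asn. In addition, the 25th codon counted from the initiation codon of the protease PFUS gene, corresponding to Pro (CCA), was found to have changed to a codon encoding Leu (CTA); this change occurred together with the mutation described above.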
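(7) Construction of the protease expression plasmid pNAPSΔC
A stop codon was introduced on the C-terminal side of the 544th amino acid residue, counted from the initiation codon of the protease PFUS gene in plasmid pNAPS1, to construct a plasmid expressing a protease that lacks the region downstream of this position. Primer NPR544 was synthesized; it introduces a stop codon (nucleotide sequence: TGA) on the C-terminal side of the codon encoding the 544th amino acid residue in the gene and carries an SphI site (the nucleotide sequence of primer NPR544 is shown in SEQ ID NO:28 of the Sequence Listing). In addition, primer NPFE81 was synthesized based on the nucleotide sequence of the region upstream of the NspV site in the gene (the nucleotide sequence of primer NPFE81 is shown in SEQ ID NO:29 of the Sequence Listing).
A PCR reaction mixture containing plasmid pNAPS1 as the template and primers NPFE81 and NPR544 was prepared and subjected to 20 cycles of 94°C for 30 seconds, 50°C for 1 minute and 72°C for 1 minute. The amplified DNA fragment of about 0.61 kb was digested with NspV (Takara Shuzo) and SphI to obtain a DNA fragment of about 0.13 kb containing the stop codon. This DNA fragment was mixed and ligated with plasmid pNAPS1 that had been digested with the restriction enzymes NspV and SphI. Bacillus subtilis DB104 was transformed with the reaction mixture. Plasmids were prepared from the resulting kanamycin-resistant transformants, and a plasmid carrying the approximately 0.13 kb DNA fragment was selected and designated plasmid pNAPSΔC. Bacillus subtilis DB104 transformed with plasmid pNAPSΔC was designated Bacillus subtilis DB104/pNAPSΔC.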
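(8) Construction of the protease expression plasmid pSPO124ΔC
Plasmid pNAPSΔC was digested with NspV and SphI, and a DNA fragment of about 1.3 kb was isolated. The fragment was mixed and ligated with plasmid pSPO124 that had been digested with NspV and SphI. Bacillus subtilis DB104 was transformed with the reaction mixture. Plasmids were prepared from the resulting kanamycin-resistant transformants, and a plasmid carrying the approximately 1.3 kb DNA fragment was selected and designated plasmid pSPO124ΔC. Bacillus subtilis DB104 transformed with plasmid pSPO124ΔC was designated Bacillus subtilis DB104/pSPO124ΔC.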
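Example 3
(1) Cultivation of Bacillus subtilis transformed with plasmids containing the protease PFUS gene and preparation of crude enzyme solutions
Bacillus subtilis DB104/pNAPS1, obtained by introducing plasmid pNAPS1 containing the protease PFUS gene into Bacillus subtilis DB104 as described in Example 2, was cultured at 37°C for 24 hours in 2 ml of LB medium (tryptone 10 g/L, yeast extract 5 g/L, NaCl 5 g/L, pH 7.2) containing 10 μg/ml kanamycin. The culture was centrifuged to obtain the culture supernatant (preparation 1-S) and the cells.
The cells were suspended in 100 μl of 50 mM Tris-HCl, pH 7.5, and digested at 37°C for 45 minutes after addition of 2 mg of lysozyme (Sigma). The digested sample was heat-treated at 95°C for 10 minutes, and the supernatant was collected by centrifugation to obtain a cell-free extract (preparation 1-L).
Culture supernatants and cell-free extracts were obtained in the same way from Bacillus subtilis DB104/pSPO124 carrying plasmid pSPO124, Bacillus subtilis DB104/pNAPSΔC carrying plasmid pNAPSΔC, and Bacillus subtilis DB104/pSPO124ΔC carrying plasmid pSPO124ΔC. The culture supernatant and cell-free extract from Bacillus subtilis DB104/pSPO124 were designated 124-S and 124-L, respectively. Those from Bacillus subtilis DB104/pNAPSΔC were designated ΔC-S and ΔC-L, respectively. Those from Bacillus subtilis DB104/pSPO124ΔC were designated 124ΔC-S and 124ΔC-L, respectively. These preparations were used to measure protease activity and to determine the protease concentration in each preparation.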
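(2) Comparison of protease productivity
The activity of protease PFUS was determined by spectrophotometric measurement of the amount of p-nitroaniline produced in the enzymatic hydrolysis reaction with Suc-Ala-Ala-Pro-Phe-p-NA (Sigma) as the substrate. Briefly, the enzyme preparation to be assayed was diluted appropriately. To 50 μl of the diluted sample solution was added 50 μl of 1 mM Suc-Ala-Ala-Pro-Phe-p-NA in 100 mM phosphate buffer, pH 7.0. The reaction was then allowed to proceed at 95°C for 30 minutes. After the reaction was stopped by cooling on ice, the absorbance at 405 nm was measured to calculate the amount of p-nitroaniline produced. One unit of the enzyme was defined as the amount of enzyme that produces 1 μmol of p-nitroaniline per minute at 95°C. The amount of enzyme protein expressed in the culture supernatant or in the cells was calculated from the measured enzymatic activity, assuming a specific activity of 9.5 units/mg of protease PFUS protein.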
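The calculation implied by this assay can be sketched as follows. This is a minimal illustration, not the authors' procedure: the text does not give the molar extinction coefficient of p-nitroaniline or the cuvette path length, so both are parameters the caller must supply, and the function names are invented for the example. The reaction volume (100 μl), sample volume (50 μl), reaction time (30 minutes) and specific activity (9.5 units/mg) are taken from the text.

```python
def pna_units(a405: float, epsilon_per_m_cm: float, path_cm: float = 1.0,
              reaction_ul: float = 100.0, minutes: float = 30.0) -> float:
    """Units in one reaction: micromoles of p-nitroaniline formed per minute."""
    molar = a405 / (epsilon_per_m_cm * path_cm)    # Beer-Lambert law, mol/L
    micromol = molar * reaction_ul * 1e-6 * 1e6    # mol/L * L gives mol, then µmol
    return micromol / minutes


def protease_mg_per_litre(units: float, sample_ul: float = 50.0,
                          dilution: float = 1.0,
                          specific_activity: float = 9.5) -> float:
    """Convert units measured in one assay into mg of enzyme per litre of sample."""
    units_per_litre = units / (sample_ul * 1e-6) * dilution
    return units_per_litre / specific_activity


# Placeholder numbers only: the absorbance and extinction coefficient are assumptions.
example_units = pna_units(a405=0.5, epsilon_per_m_cm=10_000.0)
example_mg_per_litre = protease_mg_per_litre(example_units)
```

The `dilution` argument is the factor by which the preparation was diluted before the assay, so that the result refers to the undiluted supernatant or extract.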
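The protease activity of each enzyme preparation prepared in Example 3-(1) was measured. The productivity of protease PFUS per litre of culture for each transformant, calculated from the measurements, is shown in Table 1.
In Bacillus subtilis DB104/pSPO124, the productivity of protease PFUS in the cells was 3.6 times that in Bacillus subtilis DB104/pNAPS1. In Bacillus subtilis DB104/pNAPSΔC, the productivity of protease PFUS was 2.4 times higher in the culture supernatant and 2.2 times higher in the cells. In Bacillus subtilis DB104/pSPO124ΔC, the productivity of protease PFUS was 2 times higher in the culture supernatant and 2.4 times higher in the cells. Productivity per cell also increased.
Compared with Bacillus subtilis DB104/pNAPS1, the total amount of protease PFUS produced in the culture supernatant and cells was 2.1 times higher in Bacillus subtilis DB104/pSPO124, 2.3 times higher in Bacillus subtilis DB104/pNAPSΔC, and 2.2 times higher in Bacillus subtilis DB104/pSPO124ΔC.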
Table 1
Productivity of protease PFUS (mg/L of culture)
| Transformant (plasmid) | Culture supernatant | Cells | Culture supernatant + cells |
|---|---|---|---|
| pNAPS1 | 15.1 | 12.5 | 27.6 |
| pSPO124 | 13.1 | 45.4 | 58.5 |
| pNAPSΔC | 35.5 | 28.1 | 63.6 |
| pSPO124ΔC | 30.5 | 30.1 | 60.6 |
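
The fold changes relative to pNAPS1 follow directly from the values in Table 1. The short sketch below recomputes them; the dictionary layout and function name are illustrative choices, and the numbers are copied from the table.

```python
# Productivity of protease PFUS in mg/L of culture: (culture supernatant, cells).
TABLE_1 = {
    "pNAPS1": (15.1, 12.5),
    "pSPO124": (13.1, 45.4),
    "pNAPSΔC": (35.5, 28.1),
    "pSPO124ΔC": (30.5, 30.1),
}


def fold_changes(plasmid: str, reference: str = "pNAPS1") -> dict:
    """Ratios of supernatant, cell and total productivity to the reference plasmid."""
    sup, cells = TABLE_1[plasmid]
    ref_sup, ref_cells = TABLE_1[reference]
    return {
        "supernatant": round(sup / ref_sup, 1),
        "cells": round(cells / ref_cells, 1),
        "total": round((sup + cells) / (ref_sup + ref_cells), 1),
    }


summary = {name: fold_changes(name) for name in TABLE_1 if name != "pNAPS1"}
```

Computing the ratios from the table rather than reading them off the prose makes it easy to check each stated fold increase against the underlying measurements.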
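Example 4
(1) Preparation of purified enzyme preparations of mature protease PFUS
Two strains obtained by introducing the gene encoding the hyperthermostable protease of the present invention into Bacillus subtilis DB104 as described in Example 2, Bacillus subtilis DB104/pNAPS1 and Bacillus subtilis DB104/pSPO124ΔC, were each inoculated into 5 ml of LB medium containing 10 μg/ml kanamycin and cultured with shaking at 37°C for 7 hours. The 5 ml culture was inoculated into 500 ml of TM medium (soybean flour 5 g/L, polypeptone 10 g/L, meat extract 5 g/L, yeast extract 2 g/L, glucose 10 g/L, FeSO4·7H2O 10 mg/L, MnSO4·4H2O 10 mg/L, ZnSO4·7H2O 1 mg/L, pH 7.0) containing 10 μg/ml kanamycin in a 5 L Erlenmeyer flask and cultured with shaking at 30°C for 3 days. The resulting culture was sonicated, heat-treated at 95°C for 30 minutes, and centrifuged to collect the supernatant. Ammonium sulfate was added to the supernatant to 25% saturation, and the supernatant obtained by a further centrifugation was loaded onto a Macro-Prep Methyl HIC column (Bio-Rad) equilibrated with 25 mM Tris-HCl buffer containing 25%-saturated ammonium sulfate. After the gel was washed with the same buffer, protease PFUS adsorbed to the column was eluted stepwise with 25 mM Tris-HCl buffer (pH 7.6) containing 40% ethanol. The fractions containing protease PFUS were subjected to gel filtration on a NAP-25 column pre-equilibrated with 0.05% trifluoroacetic acid containing 20% acetonitrile, which desalted protease PFUS while denaturing it, to give purified preparations of protease PFUS. The preparations obtained from Bacillus subtilis DB104/pNAPS1 and Bacillus subtilis DB104/pSPO124ΔC were designated NAPS-1 and SPO-124ΔC, respectively.
The two purified enzyme preparations were electrophoresed on a 0.1% SDS-10% polyacrylamide gel and stained with Coomassie Brilliant Blue R-250. Both purified enzyme preparations, NAPS-1 and SPO-124ΔC, showed a single band with an estimated molecular weight of about 45 kDa.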
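(2) Analysis of the N-terminal amino acid sequence of mature protease PFUS
The N-terminal amino acid sequences of the purified enzyme preparations NAPS-1 and SPO-124ΔC were analyzed by automated Edman degradation with a G1000A protein sequencer (Hewlett-Packard). The N-terminal amino acid sequence of both purified enzyme preparations is shown in SEQ ID NO:22 of the Sequence Listing. This sequence coincides with residues 133 to 144 of the amino acid sequence of protease PFUS shown in SEQ ID NO:15 of the Sequence Listing, indicating that both NAPS-1 and SPO-124ΔC are enzymes consisting of a polypeptide that starts at this position.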
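(3) Mass spectrometric analysis of mature protease PFUS
Mass spectrometric analysis of the purified enzyme preparations NAPS-1 and SPO-124ΔC was carried out with an API 300 triple quadrupole mass spectrometer (Perkin-Elmer Sciex). The estimated molecular weight of NAPS-1, 43,744 Da, indicates that mature protease PFUS produced by Bacillus subtilis DB104/pNAPS1 is an enzyme consisting of the polypeptide from Ala at position 133 to Thr at position 552 of the amino acid sequence of protease PFUS shown in SEQ ID NO:15 of the Sequence Listing. Likewise, the estimated molecular weight of SPO-124ΔC, 42,906 Da, indicates that mature protease PFUS produced by Bacillus subtilis DB104/pSPO124ΔC is an enzyme consisting of the polypeptide from Ala at position 133 to Ser at position 544 of the amino acid sequence of protease PFUS shown in SEQ ID NO:15, that is, the amino acid sequence shown in SEQ ID NO:1 of the Sequence Listing.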
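The residue numbering used here can be cross-checked with simple arithmetic. The sketch below is only a bookkeeping aid with invented helper names: it converts precursor positions (numbered from the initiation Met) into mature-enzyme positions (numbered from Ala 133) and compares the two measured masses. The per-residue figure it derives is an average, not a sequence-based mass calculation.

```python
MATURE_START = 133  # first residue of the mature enzyme in precursor numbering


def span_length(first: int, last: int) -> int:
    """Number of residues in an inclusive range of positions."""
    return last - first + 1


def mature_position(precursor_position: int) -> int:
    """Position in mature-enzyme numbering for a precursor position."""
    return precursor_position - MATURE_START + 1


naps1_length = span_length(133, 552)        # mature enzyme from pNAPS1
delta_c_length = span_length(133, 544)      # truncated enzyme from pSPO124ΔC
residues_removed = naps1_length - delta_c_length

mass_difference_da = 43_744 - 42_906        # measured masses from the text
average_residue_mass = mass_difference_da / residues_removed
```

The truncated span of 412 residues agrees with the stated length of SEQ ID NO:1 and with the 1236-nucleotide length of SEQ ID NO:2 (1236 / 3 = 412), and precursor position 560 maps to position 428 of the mature sequence, the position annotated as Gly or Val in SEQ ID NO:4.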
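Sequence Listing
Applicant: TAKARA SHUZO CO., LTD.
Title of invention: System for expressing hyperthermostable protein
Reference/docket number: 660782
Current application number:
Current filing date:
Prior application number: 151969/1997
Prior application date: June 10, 1997
Number of sequences: 29
Information for SEQ ID NO:1:
Length: 412
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description: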
Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gln Val Met Ala
5 10 15
Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly Ile Thr Ile
20 25 30
Gly Ile Ile Asp Thr Gly Ile Asp Ala Ser His Pro Asp Leu Gln
35 40 45
Gly Lys Val Ile Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr
50 55 60
Pro Tyr Asp Asp His Gly His Gly Thr His Val Ala Ser Ile Ala
65 70 75
Ala Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala
80 85 90
Pro Gly Ala Lys Leu Ala Gly Ile Lys Val Leu Gly Ala Asp Gly
95 100 105
Ser Gly Ser Ile Ser Thr Ile Ile Lys Gly Val Glu Trp Ala Val
110 115 120
Asp Asn Lys Asp Lys Tyr Gly Ile Lys Val Ile Asn Leu Ser Leu
125 130 135
Gly Ser Ser Gln Ser Ser Asp Gly Thr Asp Ala Leu Ser Gln Ala
140 145 150
Val Asn Ala Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala
155 160 165
Gly Asn Ser Gly Pro Asn Lys Tyr Thr Ile Gly Ser Pro Ala Ala
170 175 180
Ala Ser Lys Val Ile Thr Val Gly Ala Val Asp Lys Tyr Asp Val
185 190 195
Ile Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu
200 205 210
Lys Pro Glu Val Val Ala Pro Gly Asn Trp Ile Ile Ala Ala Arg
215 220 225
Ala Ser Gly Thr Ser Met Gly Gln Pro Ile Asn Asp Tyr Tyr Thr
230 235 240
Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ile
245 250 255
Ala Ala Leu Leu Leu Gln Ala His Pro Ser Trp Thr Pro Asp Lys
260 265 270
Val Lys Thr Ala Leu Ile Glu Thr Ala Asp Ile Val Lys Pro Asp
275 280 285
Glu Ile Ala Asp Ile Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr
290 295 300
Lys Ala Ile Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly
305 310 315
Tyr Val Ala Asn Lys Gly Ser Gln Thr His Gln Phe Val Ile Ser
320 325 330
Gly Ala Ser Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn
335 340 345
Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gln Val
350 355 360
Asp Tyr Ser Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr
365 370 375
Tyr Asn Pro Thr Asp Gly Thr Trp Thr Ile Lys Val Val Ser Tyr
380 385 390
Ser Gly Ser Ala Asn Tyr Gln Val Asp Val Val Ser Asp Gly Ser
395 400 405
Leu Ser Gln Pro Gly Ser Ser
410
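Information for SEQ ID NO:2:
Length: 1236
Type: nucleic acid
Strandedness: double
Topology: linear
Molecule type: genomic DNA
Original source:
Organism: Pyrococcus furiosus
Strain: DSM 3638
Sequence description: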
GCAGAATTAG AAGGACTGGA TGAGTCTGCA GCTCAAGTTA TGGCAACTTA CGTTTGGAAC 60
TTGGGATATG ATGGTTCTGG AATCACAATA GGAATAATTG ACACTGGAAT TGACGCTTCT 120
CATCCAGATC TCCAAGGAAA AGTAATTGGG TGGGTAGATT TTGTCAATGG TAGGAGTTAT 180
CCATACGATG ACCATGGACA TGGAACTCAT GTAGCTTCAA TAGCAGCTGG TACTGGAGCA 240
GCAAGTAATG GCAAGTACAA GGGAATGGCT CCAGGAGCTA AGCTGGCGGG AATTAAGGTT 300
CTAGGTGCCG ATGGTTCTGG AAGCATATCT ACTATAATTA AGGGAGTTGA GTGGGCCGTT 360
GATAACAAAG ATAAGTACGG AATTAAGGTC ATTAATCTTT CTCTTGGTTC AAGCCAGAGC 420
TCAGATGGTA CTGACGCTCT AAGTCAGGCT GTTAATGCAG CGTGGGATGC TGGATTAGTT 480
GTTGTGGTTG CCGCTGGAAA CAGTGGACCT AACAAGTATA CAATCGGTTC TCCAGCAGCT 540
GCAAGCAAAG TTATTACAGT TGGAGCCGTT GACAAGTATG ATGTTATAAC AAGCTTCTCA 600
AGCAGAGGGC CAACTGCAGA CGGCAGGCTT AAGCCTGAGG TTGTTGCTCC AGGAAACTGG 660
ATAATTGCTG CCAGAGCAAG TGGAACTAGC ATGGGTCAAC CAATTAATGA CTATTACACA 720
GCAGCTCCTG GGACATCAAT GGCAACTCCT CACGTAGCTG GTATTGCAGC CCTCTTGCTC 780
CAAGCACACC CGAGCTGGAC TCCAGACAAA GTAAAAACAG CCCTCATAGA AACTGCTGAT 840
ATCGTAAAGC CAGATGAAAT AGCCGATATA GCCTACGGTG CAGGTAGGGT TAATGCATAC 900
AAGGCTATAA ACTACGATAA CTATGCAAAG CTAGTGTTCA CTGGATATGT TGCCAACAAA 960
GGCAGCCAAA CTCACCAGTT CGTTATTAGC GGAGCTTCGT TCGTAACTGC CACATTATAC 1020
TGGGACAATG CCAATAGCGA CCTTGATCTT TACCTCTACG ATCCCAATGG AAACCAGGTT 1080
GACTACTCTT ACACCGCCTA CTATGGATTC GAAAAGGTTG GTTATTACAA CCCAACTGAT 1140
GGAACATGGA CAATTAAGGT TGTAAGCTAC AGCGGAAGTG CAAACTATCA AGTAGATGTG 1200
GTAAGTGATG GTTCCCTTTC ACAGCCTGGA AGTTCA 1236
Information for SEQ ID NO:3:
Length: 29
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description:
Met Arg Ser Lys Lys Leu Trp Ile Ser Leu Leu Phe Ala Leu Thr
5 10 15
Leu Ile Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gln Ala
20 25
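Information for SEQ ID NO:4:
Length: 522
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Feature:
Other information: Xaa at position 428 is Gly or Val
Sequence description: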
Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gln Val Met Ala
5 10 15
Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly Ile Thr Ile
20 25 30
Gly Ile Ile Asp Thr Gly Ile Asp Ala Ser His Pro Asp Leu Gln
35 40 45
Gly Lys Val Ile Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr
50 55 60
Pro Tyr Asp Asp His Gly His Gly Thr His Val Ala Ser Ile Ala
65 70 75
Ala Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala
80 85 90
Pro Gly Ala Lys Leu Ala Gly Ile Lys Val Leu Gly Ala Asp Gly
95 100 105
Ser Gly Ser Ile Ser Thr Ile Ile Lys Gly Val Glu Trp Ala Val
110 115 120
Asp Asn Lys Asp Lys Tyr Gly Ile Lys Val Ile Asn Leu Ser Leu
125 130 135
Gly Ser Ser Gln Ser Ser Asp Gly Thr Asp Ala Leu Ser Gln Ala
140 145 150
Val Asn Ala Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala
155 160 165
Gly Asn Ser Gly Pro Asn Lys Tyr Thr Ile Gly Ser Pro Ala Ala
170 175 180
Ala Ser Lys Val Ile Thr Val Gly Ala Val Asp Lys Tyr Asp Val
185 190 195
Ile Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu
200 205 210
Lys Pro Glu Val Val Ala Pro Gly Asn Trp Ile Ile Ala Ala Arg
215 220 225
Ala Ser Gly Thr Ser Met Gly Gln Pro Ile Asn Asp Tyr Tyr Thr
230 235 240
Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ile
245 250 255
Ala Ala Leu Leu Leu Gln Ala His Pro Ser Trp Thr Pro Asp Lys
260 265 270
Val Lys Thr Ala Leu Ile Glu Thr Ala Asp Ile Val Lys Pro Asp
275 280 285
Glu Ile Ala Asp Ile Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr
290 295 300
Lys Ala Ile Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly
305 310 315
Tyr Val Ala Asn Lys Gly Ser Gln Thr His Gln Phe Val Ile Ser
320 325 330
Gly Ala Ser Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn
335 340 345
Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gln Val
350 355 360
Asp Tyr Ser Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr
365 370 375
Tyr Asn Pro Thr Asp Gly Thr Trp Thr Ile Lys Val Val Ser Tyr
380 385 390
Ser Gly Ser Ala Asn Tyr Gln Val Asp Val Val Ser Asp Gly Ser
395 400 405
Leu Ser Gln Pro Gly Ser Ser Pro Ser Pro Gln Pro Glu Pro Thr
410 415 420
Val Asp Ala Lys Thr Phe Gln Xaa Ser Asp His Tyr Tyr Tyr Asp
425 430 435
Arg Ser Asp Thr Phe Thr Met Thr Val Asn Ser Gly Ala Thr Lys
440 445 450
Ile Thr Gly Asp Leu Val Phe Asp Thr Ser Tyr His Asp Leu Asp
455 460 465
Leu Tyr Leu Tyr Asp Pro Asn Gln Lys Leu Val Asp Arg Ser Glu
470 475 480
Ser Pro Asn Ser Tyr Glu His Val Glu Tyr Leu Thr Pro Ala Pro
485 490 495
Gly Thr Trp Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr Gly Trp
500 505 510
Ala Tyr Tyr Glu Leu Thr Ala Lys Val Tyr Tyr Gly
515 520
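Information for SEQ ID NO:5:
Length: 4765
Type: nucleic acid
Strandedness: double
Topology: linear
Molecule type: genomic DNA
Original source:
Organism: Pyrococcus furiosus
Strain: DSM 3638
Sequence description: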
TTTAAATTAT AAGATATAAT CACTCCGAGT GATGAGTAAG ATACATCATT ACAGTCCCAA 60
AATGTTTATA ATTGGAACGC AGTGAATATA CAAAATGAAT ATAACCTCGG AGGTGACTGT 120
AGAATGAATA AGAAGGGACT TACTGTGCTA TTTATAGCGA TAATGCTCCT TTCAGTAGTT 180
CCAGTGCACT TTGTGTCCGC AGAAACACCA CCGGTTAGTT CAGAAAATTC AACAACTTCT 240
ATACTCCCTA ACCAACAAGT TGTGACAAAA GAAGTTTCAC AAGCGGCGCT TAATGCTATA 300
ATGAAAGGAC AACCCAACAT GGTTCTTATA ATCAAGACTA AGGAAGGCAA ACTTGAAGAG 360
GCAAAAACCG AGCTTGAAAA GCTAGGTGCA GAGATTCTTG ACGAAAATAG AGTTCTTAAC 420
ATGTTGCTAG TTAAGATTAA GCCTGAGAAA GTTAAAGAGC TCAACTATAT CTCATCTCTT 480
GAAAAAGCCT GGCTTAACAG AGAAGTTAAG CTTTCCCCTC CAATTGTCGA AAAGGACGTC 540
AAGACTAAGC AGCCCTCCCT AGAACCAAAA ATGTATAACA GCACCTGGGT AATTAATGCT 600
CTCCAGTTCA TCCAGGAATT TGGATATGAT GGTAGTGGTG TTGTTGTTGC AGTACTTGAC 660
ACGGGAGTTG ATCCGAACCA TCCTTTCTTG AGCATAACTC CAGATGGACG CAGGAAAATT 720
ATAGAATGGA AGGATTTTAC AGACGAGGGA TTCGTGGATA CATCATTCAG CTTTAGCAAG 780
GTTGTAAATG GGACTCTTAT AATTAACACA ACATTCCAAG TGGCCTCAGG TCTCACGCTG 840
AATGAATCGA CAGGACTTAT GGAATACGTT GTTAAGACTG TTTACGTGAG CAATGTGACC 900
ATTGGAAATA TCACTTCTGC TAATGGCATC TATCACTTCG GCCTGCTCCC AGAAAGATAC 960
TTCGACTTAA ACTTCGATGG TGATCAAGAG GACTTCTATC CTGTCTTATT AGTTAACTCC 1020
ACTGGCAATG GTTATGACAT TGCATATGTG GATACTGACC TTGACTACGA CTTCACCGAC 1080
GAAGTTCCAC TTGGCCAGTA CAACGTTACT TATGATGTTG CTGTTTTTAG CTACTACTAC 1140
GGTCCTCTCA ACTACGTGCT TGCAGAAATA GATCCGAACG GAGAATATGC AGTATTTGGG 1200
TGGGATGGTC ACGGTCACGG AACTCACGTA GCTGGAACTG TTGCTGGTTA CGACAGCAAC 1260
AATGATGCTT GGGATTGGCT CAGTATGTAC TCTGGTGAAT GGGAAGTGTT CTCAAGACTC 1320
TATGGTTGGG ATTATACGAA CGTTACCACA GACACCGTGC AGGGTGTTGC TCCAGGTGCC 1380
CAAATAATGG CAATAAGAGT TCTTAGGAGT GATGGACGGG GTAGCATGTG GGATATTATA 1440
GAAGGTATGA CATACGCAGC AACCCATGGT GCAGACGTTA TAAGCATGAG TCTCGGTGGA 1500
AATGCTCCAT ACTTAGATGG TACTGATCCA GAAAGCGTTG CTGTGGATGA GCTTACCGAA 1560
AAGTACGGTG TTGTATTCGT AATAGCTGCA GGAAATGAAG GTCCTGGCAT TAACATCGTT 1620
GGAAGTCCTG GTGTTGCAAC AAAGGCAATA ACTGTTGGAG CTGCTGCAGT GCCCATTAAC 1680
GTTGGAGTTT ATGTTTCCCA AGCACTTGGA TATCCTGATT ACTATGGATT CTATTACTTC 1740
CCCGCCTACA CAAACGTTAG AATAGCATTC TTCTCAAGCA GAGGGCCGAG AATAGATGGT 1800
GAAATAAAAC CCAATGTAGT GGCTCCAGGT TACGGAATTT ACTCATCCCT GCCGATGTGG 1860
ATTGGCGGAG CTGACTTCAT GTCTGGAACT TCGATGGCTA CTCCACATGT CAGCGGTGTC 1920
GTTGCACTCC TCATAAGCGG GGCAAAGGCC GAGGGAATAT ACTACAATCC AGATATAATT 1980
AAGAAGGTTC TTGAGAGCGG TGCAACCTGG CTTGAGGGAG ATCCATATAC TGGGCAGAAG 2040
TACACTGAGC TTGACCAAGG TCATGGTCTT GTTAACGTTA CCAAGTCCTG GGAAATCCTT 2100
AAGGCTATAA ACGGCACCAC TCTCCCAATT GTTGATCACT GGGCAGACAA GTCCTACAGC 2160
GACTTTGCGG AGTACTTGGG TGTGGACGTT ATAAGAGGTC TCTACGCAAG GAACTCTATA 2220
CCTGACATTG TCGAGTGGCA CATTAAGTAC GTAGGGGACA CGGAGTACAG AACTTTTGAG 2280
ATCTATGCAA CTGAGCCATG GATTAAGCCT TTTGTCAGTG GAAGTGTAAT TCTAGAGAAC 2340
AATACCGAGT TTGTCCTTAG GGTGAAATAT GATGTAGAGG GTCTTGAGCC AGGTCTCTAT 2400
GTTGGAAGGA TAATCATTGA TGATCCAACA ACGCCAGTTA TTGAAGACGA GATCTTGAAC 2460
ACAATTGTTA TTCCCGAGAA GTTCACTCCT GAGAACAATT ACACCCTCAC CTGGTATGAT 2520
ATTAATGGTC CAGAAATGGT GACTCACCAC TTCTTCACTG TGCCTGAGGG AGTGGACGTT 2580
CTCTACGCGA TGACCACATA CTGGGACTAC GGTCTGTACA GACCAGATGG AATGTTTGTG 2640
TTCCCATACC AGCTAGATTA TCTTCCCGCT GCAGTCTCAA ATCCAATGCC TGGAAACTGG 2700
GAGCTAGTAT GGACTGGATT TAACTTTGCA CCCCTCTATG AGTCGGGCTT CCTTGTAAGG 2760
ATTTACGGAG TAGAGATAAC TCCAAGCGTT TGGTACATTA ACAGGACATA CCTTGACACT 2820
AACACTGAAT TCTCAATTGA ATTCAATATT ACTAACATCT ATGCCCCAAT TAATGCAACT 2880
CTAATCCCCA TTGGCCTTGG AACCTACAAT GCGAGCGTTG AAAGCGTTGG TGATGGAGAG 2940
TTCTTCATAA AGGGCATTGA AGTTCCTGAA GGCACCGCAG AGTTGAAGAT TAGGATAGGC 3000
AACCCAAGTG TTCCGAATTC AGATCTAGAC TTGTACCTTT ATGACAGTAA AGGCAATTTA 3060
GTGGCCTTAG ATGGAAACCC AACAGCAGAA GAAGAGGTTG TAGTTGAGTA TCCTAAGCCT 3120
GGAGTTTATT CAATAGTAGT ACATGGTTAC AGCGTCAGGG ACGAAAATGG TAATCCAACG 3180
ACAACCACCT TTGACTTAGT TGTTCAAATG ACCCTTGATA ATGGAAACAT AAAGCTTGAC 3240
AAAGACTCGA TTATTCTTGG AAGCAATGAA AGCGTAGTTG TAACTGCAAA CATAACAATT 3300
GATAGAGATC ATCCTACAGG AGTATACTCT GGTATCATAG AGATTAGAGA TAATGAGGTC 3360
TACCAGGATA CAAATACTTC AATTGCGAAA ATACCCATAA CTTTGGTAAT TGACAAGGCG 3420
GACTTTGCCG TTGGTCTCAC ACCAGCAGAG GGAGTACTTG GAGAGGCTAG AAATTACACT 3480
CTAATTGTAA AGCATGCCCT AACACTAGAG CCTGTGCCAA ATGCTACAGT GATTATAGGA 3540
AACTACACCT ACCTCACAGA CGAAAACGGT ACAGTGACAT TCACGTATGC TCCAACTAAG 3600
TTAGGCAGTG ATGAAATCAC AGTCATAGTT AAGAAAGAGA ACTTCAACAC ATTAGAGAAG 3660
ACCTTCCAAA TCACAGTATC AGAGCCTGAA ATAACTGAAG AGGACATAAA TGAGCCCAAG 3720
CTTGCAATGT CATCACCAGA AGCAAATGCT ACCATAGTAT CAGTTGAGAT GGAGAGTGAG 3780
GGTGGCGTTA AAAAGACAGT GACAGTGGAA ATAACTATAA ACGGAACCGC TAATGAGACT 3840
GCAACAATAG TGGTTCCTGT TCCTAAGAAG GCCGAAAACA TCGAGGTAAG TGGAGACCAC 3900
GTAATTTCCT ATAGTATAGA GGAAGGAGAG TACGCCAAGT ACGTTATAAT TACAGTGAAG 3960
TTTGCATCAC CTGTAACAGT AACTGTTACT TACACTATCT ATGCTGGCCC AAGAGTCTCA 4020
ATCTTGACAC TTAACTTCCT TGGCTACTCA TGGTACAGAC TATATTCACA GAAGTTTGAC 4080
GAATTGTACC AAAAGGCCCT TGAATTGGGA GTGGACAACG AGACATTAGC TTTAGCCCTC 4140
AGCTACCATG AAAAAGCCAA AGAGTACTAC GAAAAGGCCC TTGAGCTTAG CCAGGGTAAC 4200
ATAATCCAAT ACCTTGGAGA CATAAGACTA TTACCTCCAT TAAGACAGGC ATACATCAAT 4260
GAAATGAAGG CAGTTAAGAT ACTGGAAAAG GCCATAGAAG AATTAGAGGG TGAAGAGTAA 4320
TCTCCAATTT TTCCCACTTT TTCTTTTATA ACATTCCAAG CCTTTTCTTA GCTTCTTCGC 4380
TCATTCTATC AGGAGTCCAT GGAGGATCAA AGGTAAGTTC AACCTCCACA TCTCTTACTC 4440
CTGGGATTTC GAGTACTTTC TCCTCTACAG CTCTAAGAAG CCAGAGAGTT AAAGGACACC 4500
CAGGAGTTGT CATTGTCATC TTTATATATA CCGTTTTGTC AGGATTAATC TTTAGCTCAT 4560
AAATTAATCC AAGGTTTACA ACATCCATCC CAATTTCTGG GTCGATAACC TCCTTTAGCT 4620
TTTCCAGAAT CATTTCTTCA GTAATTTCAA GGTTCTCATC TTTGGTTTCT CTCACAAACC 4680
CAATTTCAAC CTGCCTGATA CCTTCTAACT CCCTAAGCTT GTTATATATC TCCAAAAGAG 4740
TGGCATCATC AATTTTCTCT TTAAA 4765
Information for SEQ ID NO:6:
Length: 1398
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description:
Met Asn Lys Lys Gly Leu Thr Val Leu Phe Ile Ala Ile Met Leu
5 10 15
Leu Ser Val Val Pro Val His Phe Val Ser Ala Glu Thr Pro Pro
20 25 30
Val Ser Ser Glu Asn Ser Thr Thr Ser Ile Leu Pro Asn Gln Gln
35 40 45
Val Val Thr Lys Glu Val Ser Gln Ala Ala Leu Asn Ala Ile Met
50 55 60
Lys Gly Gln Pro Asn Met Val Leu Ile Ile Lys Thr Lys Glu Gly
65 70 75
Lys Leu Glu Glu Ala Lys Thr Glu Leu Glu Lys Leu Gly Ala Glu
80 85 90
Ile Leu Asp Glu Asn Arg Val Leu Asn Met Leu Leu Val Lys Ile
95 100 105
Lys Pro Glu Lys Val Lys Glu Leu Asn Tyr Ile Ser Ser Leu Glu
110 115 120
Lys Ala Trp Leu Asn Arg Glu Val Lys Leu Ser Pro Pro Ile Val
125 130 135
Glu Lys Asp Val Lys Thr Lys Glu Pro Ser Leu Glu Pro Lys Met
140 145 150
Tyr Asn Ser Thr Trp Val Ile Asn Ala Leu Gln Phe Ile Gln Glu
155 160 165
Phe Gly Tyr Asp Gly Ser Gly Val Val Val Ala Val Leu Asp Thr
170 175 180
Gly Val Asp Pro Asn His Pro Phe Leu Ser Ile Thr Pro Asp Gly
185 190 195
Arg Arg Lys Ile Ile Glu Trp Lys Asp Phe Thr Asp Glu Gly Phe
200 205 210
Val Asp Thr Ser Phe Ser Phe Ser Lys Val Val Asn Gly Thr Leu
215 220 225
Ile Ile Asn Thr Thr Phe Gln Val Ala Ser Gly Leu Thr Leu Asn
230 235 240
Glu Ser Thr Gly Leu Met Glu Tyr Val Val Lys Thr Val Tyr Val
245 250 255
Ser Asn Val Thr Ile Gly Asn Ile Thr Ser Ala Asn Gly Ile Tyr
260 265 270
His Phe Gly Leu Leu Pro Glu Arg Tyr Phe Asp Leu Asn Phe Asp
275 280 285
Gly Asp Gln Glu Asp Phe Tyr Pro Val Leu Leu Val Asn Ser Thr
290 295 300
Gly Asn Gly Tyr Asp Ile Ala Tyr Val Asp Thr Asp Leu Asp Tyr
305 310 315
Asp Phe Thr Asp Glu Val Pro Leu Gly Gln Tyr Asn Val Thr Tyr
320 325 330
Asp Val Ala Val Phe Ser Tyr Tyr Tyr Gly Pro Leu Asn Tyr Val
335 340 345
Leu Ala Glu Ile Asp Pro Asn Gly Glu Tyr Ala Val Phe Gly Trp
350 355 360
Asp Gly His Gly His Gly Thr His Val Ala Gly Thr Val Ala Gly
365 370 375
Tyr Asp Ser Asn Asn Asp Ala Trp Asp Trp Leu Ser Met Tyr Ser
380 385 390
Gly Glu Trp Glu Val Phe Ser Arg Leu Tyr Gly Trp Asp Tyr Thr
395 400 405
Asn Val Thr Thr Asp Thr Val Gln Gly Val Ala Pro Gly Ala Gln
410 415 420
Ile Met Ala Ile Arg Val Leu Arg Ser Asp Gly Arg Gly Ser Met
425 430 435
Trp Asp Ile Ile Glu Gly Met Thr Tyr Ala Ala Thr His Gly Ala
440 445 450
Asp Val Ile Ser Met Ser Leu Gly Gly Asn Ala Pro Tyr Leu Asp
455 460 465
Gly Thr Asp Pro Glu Ser Val Ala Val Asp Glu Leu Thr Glu Lys
470 475 480
Tyr Gly Val Val Phe Val Ile Ala Ala Gly Asn Glu Gly Pro Gly
485 490 495
Ile Asn Ile Val Gly Ser Pro Gly Val Ala Thr Lys Ala Ile Thr
500 505 510
Val Gly Ala Ala Ala Val Pro Ile Asn Val Gly Val Tyr Val Ser
515 520 525
Gln Ala Leu Gly Tyr Pro Asp Tyr Tyr Gly Phe Tyr Tyr Phe Pro
530 535 540
Ala Tyr Thr Asn Val Arg Ile Ala Phe Phe Ser Ser Arg Gly Pro
545 550 555
Arg Ile Asp Gly Glu Ile Lys Pro Asn Val Val Ala Pro Gly Tyr
560 565 570
Gly Ile Tyr Ser Ser Leu Pro Met Trp Ile Gly Gly Ala Asp Phe
575 580 585
Met Ser Gly Thr Ser Met Ala Thr Pro His Val Ser Gly Val Val
590 595 600
Ala Leu Leu Ile Ser Gly Ala Lys Ala Glu Gly Ile Tyr Tyr Asn
605 610 615
Pro Asp Ile Ile Lys Lys Val Leu Glu Ser Gly Ala Thr Trp Leu
620 625 630
Glu Gly Asp Pro Tyr Thr Gly Gln Lys Tyr Thr Glu Leu Asp Gln
635 640 645
Gly His Gly Leu Val Asn Val Thr Lys Ser Trp Glu Ile Leu Lys
650 655 660
Ala Ile Asn Gly Thr Thr Leu Pro Ile Val Asp His Trp Ala Asp
665 670 675
Lys Ser Tyr Ser Asp Phe Ala Glu Tyr Leu Gly Val Asp Val Ile
680 685 690
Arg Gly Leu Tyr Ala Arg Asn Ser Ile Pro Asp Ile Val Glu Trp
695 700 705
His Ile Lys Tyr Val Gly Asp Thr Glu Tyr Arg Thr Phe Glu Ile
710 715 720
Tyr Ala Thr Glu Pro Trp Ile Lys Pro Phe Val Ser Gly Ser Val
725 730 735
Ile Leu Glu Asn Asn Thr Glu Phe Val Leu Arg Val Lys Tyr Asp
740 745 750
Val Glu Gly Leu Glu Pro Gly Leu Tyr Val Gly Arg Ile Ile Ile
755 760 765
Asp Asp Pro Thr Thr Pro Val Ile Glu Asp Glu Ile Leu Asn Thr
770 775 780
Ile Val Ile Pro Glu Lys Phe Thr Pro Glu Asn Asn Tyr Thr Leu
785 790 795
Thr Trp Tyr Asp Ile Asn Gly Pro Glu Met Val Thr His His Phe
800 805 810
Phe Thr Val Pro Glu Gly Val Asp Val Leu Tyr Ala Met Thr Thr
815 820 825
Tyr Trp Asp Tyr Gly Leu Tyr Arg Pro Asp Gly Met Phe Val Phe
830 835 840
Pro Tyr Gln Leu Asp Tyr Leu Pro Ala Ala Val Ser Asn Pro Met
845 850 855
Pro Gly Asn Trp Glu Leu Val Trp Thr Gly Phe Asn Phe Ala Pro
860 865 870
Leu Tyr Glu Ser Gly Phe Leu Val Arg Ile Tyr Gly Val Glu Ile
875 880 885
Thr Pro Ser Val Trp Tyr Ile Asn Arg Thr Tyr Leu Asp Thr Asn
890 895 900
Thr Glu Phe Ser Ile Glu Phe Asn Ile Thr Asn Ile Tyr Ala Pro
905 910 915
Ile Asn Ala Thr Leu Ile Pro Ile Gly Leu Gly Thr Tyr Asn Ala
920 925 930
Ser Val Glu Ser Val Gly Asp Gly Glu Phe Phe Ile Lys Gly Ile
935 940 945
Glu Val Pro Glu Gly Thr Ala Glu Leu Lys Ile Arg Ile Gly Asn
950 955 960
Pro Ser Val Pro Asn Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Ser
965 970 975
Lys Gly Asn Leu Val Ala Leu Asp Gly Asn Pro Thr Ala Glu Glu
980 985 990
Glu Val Val Val Glu Tyr Pro Lys Pro Gly Val Tyr Ser Ile Val
995 1000 1005
Val His Gly Tyr Ser Val Arg Asp Glu Asn Gly Asn Pro Thr Thr
1010 1015 1020
Thr Thr Phe Asp Leu Val Val Gln Met Thr Leu Asp Asn Gly Asn
1025 1030 1035
Ile Lys Leu Asp Lys Asp Ser Ile Ile Leu Gly Ser Asn Glu Ser
1040 1045 1050
Val Val Val Thr Ala Asn Ile Thr Ile Asp Arg Asp His Pro Thr
1055 1060 1065
Gly Val Tyr Ser Gly Ile Ile Glu Ile Arg Asp Asn Glu Val Tyr
1070 1075 1080
Gln Asp Thr Asn Thr Ser Ile Ala Lys Ile Pro Ile Thr Leu Val
1085 1090 1095
Ile Asp Lys Ala Asp Phe Ala Val Gly Leu Thr Pro Ala Glu Gly
1100 1105 1110
Val Leu Gly Glu Ala Arg Asn Tyr Thr Leu Ile Val Lys His Ala
1115 1120 1125
Leu Thr Leu Glu Pro Val Pro Asn Ala Thr Val Ile Ile Gly Asn
1130 1135 1140
Tyr Thr Tyr Leu Thr Asp Glu Asn Gly Thr Val Thr Phe Thr Tyr
1145 1150 1155
Ala Pro Thr Lys Leu Gly Ser Asp Glu Ile Thr Val Ile Val Lys
1160 1165 1170
Lys Glu Asn Phe Asn Thr Leu Glu Lys Thr Phe Gln Ile Thr Val
1175 1180 1185
Ser Glu Pro Glu Ile Thr Glu Glu Asp Ile Asn Glu Pro Lys Leu
1190 1195 1200
Ala Met Ser Ser Pro Glu Ala Asn Ala Thr Ile Val Ser Val Glu
1205 1210 1215
Met Glu Ser Glu Gly Gly Val Lys Lys Thr Val Thr Val Glu Ile
1220 1225 1230
Thr Ile Asn Gly Thr Ala Asn Glu Thr Ala Thr Ile Val Val Pro
1235 1240 1245
Val Pro Lys Lys Ala Glu Asn Ile Glu Val Ser Gly Asp His Val
1250 1255 1260
Ile Ser Tyr Ser Ile Glu Glu Gly Glu Tyr Ala Lys Tyr Val Ile
1265 1270 1275
Ile Thr Val Lys Phe Ala Ser Pro Val Thr Val Thr Val Thr Tyr
1280 1285 1290
Thr Ile Tyr Ala Gly Pro Arg Val Ser Ile Leu Thr Leu Asn Phe
1295 1300 1305
Leu Gly Tyr Ser Trp Tyr Arg Leu Tyr Ser Gln Lys Phe Asp Glu
1310 1315 1320
Leu Tyr Gln Lys Ala Leu Glu Leu Gly Val Asp Asn Glu Thr Leu
1325 1330 1335
Ala Leu Ala Leu Ser Tyr His Glu Lys Ala Lys Glu Tyr Tyr Glu
1340 1345 1350
Lys Ala Leu Glu Leu Ser Glu Gly Asn Ile Ile Gln Tyr Leu Gly
1355 1360 1365
Asp Ile Arg Leu Leu Pro Pro Leu Arg Gln Ala Tyr Ile Asn Glu
1370 1375 1380
Met Lys Ala Val Lys Ile Leu Glu Lys Ala Ile Glu Glu Leu Glu
1385 1390 1395
Gly Glu Glu
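Information for SEQ ID NO:7:
Length: 35
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: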
GGWWSDRRTG TTRRHGTHGC DGTDMTYGAC ACBGG 35
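Information for SEQ ID NO:8:
Length: 32
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: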
KSTCACGGAA CTCACGTDGC BGGHACDGTT GC 32
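Information for SEQ ID NO:9:
Length: 33
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: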
ASCMGCAACH GTKCCVGCHA CGTGAGTTCC GTG 33
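Information for SEQ ID NO:10:
Length: 34
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: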
CHCCGSYVAC RTGBGGAGWD GCCATBGAVG TDCC 34
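SEQ ID NOs: 7 to 10 are written with IUPAC ambiguity codes (W, S, D, R, H, M, Y, B, K, V), so each entry stands for a pool of oligonucleotides rather than a single sequence. The following Python sketch is not part of the original listing; it only illustrates how the size of such a pool could be counted. The helper name `degeneracy` is hypothetical.

```python
# IUPAC nucleotide codes mapped to the bases each one can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Return the number of distinct sequences a degenerate primer encodes."""
    count = 1
    for base in primer.replace(" ", "").upper():
        # Each position multiplies the pool size by its number of options.
        count *= len(IUPAC[base])
    return count


# SEQ ID NO:8 as printed in the listing (spaces are ignored).
seq8 = "KSTCACGGAA CTCACGTDGC BGGHACDGTT GC"
print(degeneracy(seq8))
```

The same function can be applied to the other degenerate entries by pasting in their sequence strings.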
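Information for SEQ ID NO:11:
Length: 1977
Type: nucleic acid
Strandedness: double
Topology: linear
Molecule type: genomic DNA
Original source:
Organism: Thermococcus celer
Strain: DSM 2476
Sequence description: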
ATGAAGAGGT TAGGTGCTGT GGTGCTGGCA CTGGTGCTCG TGGGTCTTCT GGCCGGAACG 60
GCCCTTGCGG CACCCGTAAA ACCGGTTGTC AGGAACAACG CGGTTCAGCA GAAGAACTAC 120
GGACTGCTGA CCCCGGGACT GTTCAAGAAA GTCCAGAGGA TGAACTGGAA CCAGGAAGTG 180
GACACCGTCA TAATGTTCGG GAGCTACGGA GACAGGGACA GGGCGGTTAA GGTACTGAGG 240
CTCATGGGCG CCCAGGTCAA GTACTCCTAC AAGATAATCC CTGCTGTCGC GGTTAAAATA 300
AAGGCCAGGG ACCTTCTGCT GATCGCGGGC ATGATAGACA CGGGTTACTT CGGTAACACA 360
AGGGTCTCGG GCATAAAGTT CATACAGGAG GATTACAAGG TTCAGGTTGA CGACGCCACT 420
TCCGTCTCCC AGATAGGGGC CGATACCGTC TGGAACTCCC TCGGCTACGA CGGAAGCGGT 480
GTGGTGGTTG CCATCGTCGA TACGGGTATA GACGCGAACC ACCCCGATCT GAAGGGCAAG 540
GTCATAGGCT GGTACGACGC CGTCAACGGC AGGTCGACCC CCTACGATGA CCAGGGACAC 600
GGAACCCACG TTGCGGGTAT CGTTGCCGGA ACCGGCAGCG TTAACTCCCA GTACATAGGC 660
GTCGCCCCCG GCGCGAAGCT CGTCGGCGTC AAGGTTCTCG GTGCCGACGG TTCGGGAAGC 720
GTCTCCACCA TCATCGCGGG TGTTGACTGG GTCGTCCAGA ACAAGGACAA GTACGGGATA 780
AGGGTCATCA ACCTCTCCCT CGGCTCCTCC CAGAGCTCCG ACGGAACCGA CTCCCTCAGT 840
CAGGCCGTCA ACAACGCCTG GGACGCCGGT ATAGTAGTCT GCGTCGCCGC CGGCAACAGC 900
GGGCCGAACA CCTACACCGT CGGCTCACCC GCCGCCGCGA GCAAGGTCAT AACCGTCGGT 960
GCAGTTGACA GCAACGACAA CATCGCCAGC TTCTCCAGCA GGGGACCGAC CGCGGACGGA 1020
AGGCTCAAGC CGGAAGTCGT CGCCCCCGGC GTTGACATCA TAGCCCCGCG CGCCAGCGGA 1080
ACCAGCATGG GCACCCCGAT AAACGACTAC TACACCAAGG CCTCTGGAAC CAGCATGGCC 1140
ACCCCGCACG TTTCGGGCGT TGGCGCGCTC ATCCACCAGG CCCACCCGAG CTGGACCCCG 1200
GACAAGGTGA AGACCGCCCT CATCGAGACC GCCGACATAG TCGCCCCCAA GGAGATAGCG 1260
GACATCGCCT ACGGTGCGGG TAGGGTGAAC GTCTACAAGG CCATCAAGTA CGACGACTAC 1320
GCCAAGCTCA CCTTCACCGG CTCCGTCGCC GACAAGGGAA GCGCCACCCA CACCTTCGAC 1380
GTCAGCGGCG CCACCTTCGT GACCGCCACC CTCTACTGGG ACACGGGCTC GAGCGACATC 1440
GACCTCTACC TCTACGACCC CAACGGGAAC GAGGTTGACT ACTCCTACAC CGCCTACTAC 1500
GGCTTCGAGA AGGTCGGCTA CTACAACCCG ACCGCCGGAA CCTGGACGGT CAAGGTCGTC 1560
AGCTACAAGG GCGCGGCGAA CTACCAGGTC GACGTCGTCA GCGACGGGAG CCTCAGCCAG 1620
TCCGGCGGCG GCAACCCGAA TCCAAACCCC AACCCGAACC CAACCCCGAC CACCGACACC 1680
CAGACCTTCA CCGGTTCCGT TAACGACTAC TGGGACACCA GCGACACCTT CACCATGAAC 1740
GTCAACAGCG GTGCCACCAA GATAACCGGT GACCTGACCT TCGATACTTC CTACAACGAC 1800
CTCGACCTCT ACCTCTACGA CCCCAACGGC AACCTCGTTG ACAGGTCCAC GTCGAGCAAC 1860
AGCTACGAGC ACGTCGAGTA CGCCAACCCC GCCCCGGGAA CCTGGACGTT CCTCGTCTAC 1920
GCCTACAGCA CCTACGGCTG GGCGGACTAC CAGCTCAAGG CCGTCGTCTA CTACGGG 1977
Information for SEQ ID NO:12:
Length: 659
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description:
Met Lys Arg Leu Gly Ala Val Val Leu Ala Leu Val Leu Val Gly
5 10 15
Leu Leu Ala Gly Thr Ala Leu Ala Ala Pro Val Lys Pro Val Val
20 25 30
Arg Asn Asn Ala Val Gln Gln Lys Asn Tyr Gly Leu Leu Thr Pro
35 40 45
Gly Leu Phe Lys Lys Val Gln Arg Met Asn Trp Asn Gln Glu Val
50 55 60
Asp Thr Val Ile Met Phe Gly Ser Tyr Gly Asp Arg Asp Arg Ala
65 70 75
Val Lys Val Leu Arg Leu Met Gly Ala Gln Val Lys Tyr Ser Tyr
80 85 90
Lys Ile Ile Pro Ala Val Ala Val Lys Ile Lys Ala Arg Asp Leu
95 100 105
Leu Leu Ile Ala Gly Met Ile Asp Thr Gly Tyr Phe Gly Asn Thr
110 115 120
Arg Val Ser Gly Ile Lys Phe Ile Gln Glu Asp Tyr Lys Val Gln
125 130 135
Val Asp Asp Ala Thr Ser Val Ser Gln Ile Gly Ala Asp Thr Val
140 145 150
Trp Asn Ser Leu Gly Tyr Asp Gly Ser Gly Val Val Val Ala Ile
155 160 165
Val Asp Thr Gly Ile Asp Ala Asn His Pro Asp Leu Lys Gly Lys
170 175 180
Val Ile Gly Trp Tyr Asp Ala Val Asn Gly Arg Ser Thr Pro Tyr
185 190 195
Asp Asp Gln Gly His Gly Thr His Val Ala Gly Ile Val Ala Gly
200 205 210
Thr Gly Ser Val Asn Ser Gln Tyr Ile Gly Val Ala Pro Gly Ala
215 220 225
Lys Leu Val Gly Val Lys Val Leu Gly Ala Asp Gly Ser Gly Ser
230 235 240
Val Ser Thr Ile Ile Ala Gly Val Asp Trp Val Val Gln Asn Lys
245 250 255
Asp Lys Tyr Gly Ile Arg Val Ile Asn Leu Ser Leu Gly Ser Ser
260 265 270
Gln Ser Ser Asp Gly Thr Asp Ser Leu Ser Gln Ala Val Asn Asn
275 280 285
Ala Trp Asp Ala Gly Ile Val Val Cys Val Ala Ala Gly Asn Ser
290 295 300
Gly Pro Asn Thr Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys
305 310 315
Val Ile Thr Val Gly Ala Val Asp Ser Asn Asp Asn Ile Ala Ser
320 325 330
Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu
335 340 345
Val Val Ala Pro Gly Val Asp Ile Ile Ala Pro Arg Ala Ser Gly
350 355 360
Thr Ser Met Gly Thr Pro Ile Asn Asp Tyr Tyr Thr Lys Ala Ser
365 370 375
Gly Thr Ser Met Ala Thr Pro His Val Ser Gly Val Gly Ala Leu
380 385 390
Ile Leu Gln Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys Thr
395 400 405
Ala Leu Ile Glu Thr Ala Asp Ile Val Ala Pro Lys Glu Ile Ala
410 415 420
Asp Ile Ala Tyr Gly Ala Gly Arg Val Asn Val Tyr Lys Ala Ile
425 430 435
Lys Tyr Asp Asp Tyr Ala Lys Leu Thr Phe Thr Gly Ser Val Ala
440 445 450
Asp Lys Gly Ser Ala Thr His Thr Phe Asp Val Ser Gly Ala Thr
455 460 465
Phe Val Thr Ala Thr Leu Tyr Trp Asp Thr Gly Ser Ser Asp Ile
470 475 480
Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Glu Val Asp Tyr Ser
485 490 495
Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro
500 505 510
Thr Ala Gly Thr Trp Thr Val Lys Val Val Ser Tyr Lys Gly Ala
515 520 525
Ala Asn Tyr Gln Val Asp Val Val Ser Asp Gly Ser Leu Ser Gln
530 535 540
Ser Gly Gly Gly Asn Pro Asn Pro Asn Pro Asn Pro Asn Pro Thr
545 550 555
Pro Thr Thr Asp Thr Gln Thr Phe Thr Gly Ser Val Asn Asp Tyr
560 565 570
Trp Asp Thr Ser Asp Thr Phe Thr Met Asn Val Asn Ser Gly Ala
575 580 585
Thr Lys Ile Thr Gly Asp Leu Thr Phe Asp Thr Ser Tyr Asn Asp
590 595 600
Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Leu Val Asp Arg
605 610 615
Ser Thr Ser Ser Asn Ser Tyr Glu His Val Glu Tyr Ala Asn Pro
620 625 630
Ala Pro Gly Thr Trp Thr Phe Leu Val Tyr Ala Tyr Ser Thr Tyr
635 640 645
Gly Trp Ala Asp Tyr Gln Leu Lys Ala Val Val Tyr Tyr Gly
650 655
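Information for SEQ ID NO:13:
Length: 28
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: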
AGAGGGATCC ATGAAGGGGC TGAAAGCT 28
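Information for SEQ ID NO:14:
Length: 30
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description:
AGAGGCATGC GCTCTAGACT GTGGGAGAGT 30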
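Information for SEQ ID NO:15:
Length: 1962
Type: nucleic acid
Strandedness: double
Topology: linear
Molecule type: genomic DNA
Original source:
Organism: Pyrococcus furiosus
Strain: DSM 3638
Sequence description: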
ATGAAGGGGC TGAAAGCTCT CATATTAGTG ATTTTAGTTC TAGGTTTGGT AGTAGGGAGC 60
GTAGCGGCAG CTCCAGAGAA GAAAGTTGAA CAAGTAAGAA ATGTTGAGAA GAACTATGGT 120
CTGCTAACGC CAGGACTGTT CAGAAAAATT CAAAAATTGA ATCCTAACGA GGAAATCAGC 180
ACAGTAATTG TATTTGAAAA CCATAGGGAA AAAGAAATTG CAGTAAGAGT TCTTGAGTTA 240
ATGGGTGCAA AAGTTAGGTA TGTGTACCAT ATTATACCCG CAATAGCTGC CGATCTTAAG 300
GTTAGAGACT TACTAGTCAT CTCAGGTTTA ACAGGGGGTA AAGCTAAGCT TTCAGGTGTT 360
AGGTTTATCC AGGAAGACTA CAAAGTTACA GTTTCAGCAG AATTAGAAGG ACTGGATGAG 420
TCTGCAGCTC AAGTTATGGC AACTTACGTT TGGAACTTGG GATATGATGG TTCTGGAATC 480
ACAATAGGAA TAATTGACAC TGGAATTGAC GCTTCTCATC CAGATCTCCA AGGAAAAGTA 540
ATTGGGTGGG TAGATTTTGT CAATGGTAGG AGTTATCCAT ACGATGACCA TGGACATGGA 600
ACTCATGTAG CTTCAATAGC AGCTGGTACT GGAGCAGCAA GTAATGGCAA GTACAAGGGA 660
ATGGCTCCAG GAGCTATGCT GGCGGGAATT AAGGTTCTAG GTGCCGATGG TTCTGGAAGC 720
ATATCTACTA TAATTAAGGG AGTTGAGTGG GCCGTTGATA ACAAAGATAA GTACGGAATT 780
AAGGTCATTA ATCTTTCTCT TGGTTCAAGC CAGAGCTCAG ATGGTACTGA CGCTCTAAGT 840
CAGGCTGTTA ATGCAGCGTG GGATGCTGGA TTAGTTGTTG TGGTTGCCGC TGGAAACAGT 900
GGACCTAACA AGTATACAAT CGGTTCTCCA GCAGCTGCAA GCAAAGTTAT TACAGTTGGA 960
GCCGTTGACA AGTATGATGT TATAACAAGC TTCTCAAGCA GAGGGCCAAC TGCAGACGGC 1020
AGGCTTAAGC CTGAGGTTGT TGCTCCAGGA AACTGGATAA TTGCTGCCAG AGCAAGTGGA 1080
ACTAGCATGG GTCAACCAAT TAATGACTAT TACACAGCAG CTCCTGGGAC ATCAATGGCA 1140
ACTCCTCACG TAGCTGGTAT TGCAGCCCTC TTGCTCCAAG CACACCCGAG CTGGACTCCA 1200
GACAAAGTAA AAACAGCCCT CATAGAAACT GCTGATATCG TAAAGCCAGA TGAAATAGCC 1260
GATATAGCCT ACGGTGCAGG TAGGGTTAAT GCATACAAGG CTATAAACTA CGATAACTAT 1320
GCAAAGCTAG TGTTCACTGG ATATGTTGCC AACAAAGGCA GCCAAACTCA CCAGTTCGTT 1380
ATTAGCGGAG CTTCGTTCGT AACTGCCACA TTATACTGGG ACAATGCCAA TAGCGACCTT 1440
GATCTTTACC TCTACGATCC CAATGGAAAC CAGGTTGACT ACTCTTACAC CGCCTACTAT 1500
GGATTCGAAA AGGTTGGTTA TTACAACCCA ACTGATGGAA CATGGACAAT TAAGGTTGTA 1560
AGCTACAGCG GAAGTGCAAA CTATCAAGTA GATGTGGTAA GTGATGGTTC CCTTTCACAG 1620
CCTGGAAGTT CACCATCTCC ACAACCAGAA CCAACAGTAG ACGCAAAGAC GTTCCAAGGA 1680
TCCGATCACT ACTACTATGA CAGGAGCGAC ACCTTTACAA TGACCGTTAA CTCTGGGGCT 1740
ACAAAGATTA CTGGAGACCT AGTGTTTGAC ACAAGCTACC ATGATCTTGA CCTTTACCTC 1800
TACGATCCTA ACCAGAAGCT TGTAGATAGA TCGGAGAGTC CCAACAGCTA CGAACACGTA 1860
GAATACTTAA CCCCCGCCCC AGGAACCTGG TACTTCCTAG TATATGCCTA CTACACTTAC 1920
GGTTGGGCTT ACTACGAGCT GACGGCTAAA GTTTATTATG GC 1962
Information for SEQ ID NO:16:
Length: 654
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description:
Met Lys Gly Leu Lys Ala Leu Ile Leu Val Ile Leu Val Leu Gly
5 10 15
Leu Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Glu
20 25 30
Gln Val Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly
35 40 45
Leu Phe Arg Lys Ile Gln Lys Leu Asn Pro Asn Glu Glu Ile Ser
50 55 60
Thr Val Ile Val Phe Glu Asn His Arg Glu Lys Glu Ile Ala Val
65 70 75
Arg Val Leu Glu Leu Met Gly Ala Lys Val Arg Tyr Val Tyr His
80 85 90
Ile Ile Pro Ala Ile Ala Ala Asp Leu Lys Val Arg Asp Leu Leu
95 100 105
Val Ile Ser Gly Leu Thr Gly Gly Lys Ala Lys Leu Ser Gly Val
110 115 120
Arg Phe Ile Gln Glu Asp Tyr Lys Val Thr Val Ser Ala Glu Leu
125 130 135
Glu Gly Leu Asp Glu Ser Ala Ala Gln Val Met Ala Thr Tyr Val
140 145 150
Trp Asn Leu Gly Tyr Asp Gly Ser Gly Ile Thr Ile Gly Ile Ile
155 160 165
Asp Thr Gly Ile Asp Ala Ser His Pro Asp Leu Gln Gly Lys Val
170 175 180
Ile Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp
185 190 195
Asp His Gly His Gly Thr His Val Ala Ser Ile Ala Ala Gly Thr
200 205 210
Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly Ala
215 220 225
Lys Leu Ala Gly Ile Lys Val Leu Gly Ala Asp Gly Ser Gly Ser
230 235 240
Ile Ser Thr Ile Ile Lys Gly Val Glu Trp Ala Val Asp Asn Lys
245 250 255
Asp Lys Tyr Gly Ile Lys Val Ile Asn Leu Ser Leu Gly Ser Ser
260 265 270
Gln Ser Ser Asp Gly Thr Asp Ala Leu Ser Gln Ala Val Asn Ala
275 280 285
Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala Gly Asn Ser
290 295 300
Gly Pro Asn Lys Tyr Thr Ile Gly Ser Pro Ala Ala Ala Ser Lys
305 310 315
Val Ile Thr Val Gly Ala Val Asp Lys Tyr Asp Val Ile Thr Ser
320 325 330
Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu
335 340 345
Val Val Ala Pro Gly Asn Trp Ile Ile Ala Ala Arg Ala Ser Gly
350 355 360
Thr Ser Met Gly Gln Pro Ile Asn Asp Tyr Tyr Thr Ala Ala Pro
365 370 375
Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ile Ala Ala Leu
380 385 390
Leu Leu Gln Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys Thr
395 400 405
Ala Leu Ile Glu Thr Ala Asp Ile Val Lys Pro Asp Glu Ile Ala
410 415 420
Asp Ile Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr Lys Ala Ile
425 430 435
Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly Tyr Val Ala
440 445 450
Asn Lys Gly Ser Gln Thr His Gln Phe Val Ile Ser Gly Ala Ser
455 460 465
Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn Ser Asp Leu
470 475 480
Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gln Val Asp Tyr Ser
485 490 495
Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro
500 505 510
Thr Asp Gly Thr Trp Thr Ile Lys Val Val Ser Tyr Ser Gly Ser
515 520 525
Ala Asn Tyr Gln Val Asp Val Val Ser Asp Gly Ser Leu Ser Gln
530 535 540
Pro Gly Ser Ser Pro Ser Pro Gln Pro Glu Pro Thr Val Asp Ala
545 550 555
Lys Thr Phe Gln Gly Ser Asp His Tyr Tyr Tyr Asp Arg Ser Asp
560 565 570
Thr Phe Thr Met Thr Val Asn Ser Gly Ala Thr Lys Ile Thr Gly
575 580 585
Asp Leu Val Phe Asp Thr Ser Tyr His Asp Leu Asp Leu Tyr Leu
590 595 600
Tyr Asp Pro Asn Gln Lys Leu Val Asp Arg Ser Glu Ser Pro Asn
605 610 615
Ser Tyr Glu His Val Glu Tyr Leu Thr Pro Ala Pro Gly Thr Trp
620 625 630
Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr Gly Trp Ala Tyr Tyr
635 640 645
Glu Leu Thr Ala Lys Val Tyr Tyr Gly
650
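The amino acid sequence of SEQ ID NO:16 corresponds to the nucleotide sequence of SEQ ID NO:15 (1962 nucleotides, 654 residues). As a rough consistency check, the sketch below translates the first codons of SEQ ID NO:15 and compares them with the start of SEQ ID NO:16. It is an illustration only, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and not taken from the patent.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces ignored) into one-letter amino acids."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 18 nucleotides of SEQ ID NO:15 as printed in the listing.
start_of_seq15 = "ATGAAGGGGC TGAAAGCT"
print(translate(start_of_seq15))  # compare with Met Lys Gly Leu Lys Ala

# Length check: the nucleotide length should be three times the residue count.
print(1962 // 3)
```

Running the full nucleotide sequence through `translate` and comparing the result residue by residue with the printed amino acid sequence is one way to spot transcription errors in either entry.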
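Information for SEQ ID NO:17:
Length: 25
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: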
TCTGAATTCG TTCTTTTCTG TATGG 25
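Information for SEQ ID NO:18:
Length: 20
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: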
TGTACTGCTG GATCCGGCAG 20
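Information for SEQ ID NO:19:
Length: 30
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description:
AGAGGCATGC GTATCCATCA GATTTTTGAG 30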
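Information for SEQ ID NO:20:
Length: 20
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: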
AGTGAACGGA TACTTGGAAC 20
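Information for SEQ ID NO:21:
Length: 20
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: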
GTTCCAAGTA TCCGTTCACT 20
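As printed, SEQ ID NO:20 and SEQ ID NO:21 are reverse complements of each other. The short Python sketch below, which is an illustration and not part of the original listing, shows one way to confirm this; the function name `reverse_complement` is hypothetical.

```python
# Translation table that swaps each base for its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (spaces ignored)."""
    seq = dna.replace(" ", "").upper()
    return seq.translate(COMPLEMENT)[::-1]


seq20 = "AGTGAACGGA TACTTGGAAC"
seq21 = "GTTCCAAGTA TCCGTTCACT"

# True when the two primers anneal to each other over their full length.
print(reverse_complement(seq20) == seq21.replace(" ", ""))
```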
Information for SEQ ID NO:22:
Length: 12
Type: amino acid
Strandedness: single
Topology: linear
Molecule type: peptide
Sequence description:
Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gln
5 10
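Information for SEQ ID NO:23:
Length: 24
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: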
TCATGGATCC ACCCTCTCCT TTTA 24
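Information for SEQ ID NO:24:
Length: 46
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description:
GTCTGCGCAG GCTGCCGGAN NNNNNATGAA GGGGCTGAAA GCTCTC 46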
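Information for SEQ ID NO:25:
Length: 49
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description:
GAGAGCTTTC AGCCCCTTCA TNNNNNNTCC GGCAGCCTGC GCAGACATG 49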
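Information for SEQ ID NO:26:
Length: 27
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: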
AGAGGGGGAT CCGTGAGAAG CAAAAAA 27
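Information for SEQ ID NO:27:
Length: 20
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: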
GATGACTAGT AAGTCTCTAA 20
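Information for SEQ ID NO:28:
Length: 20
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: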
AAGCCTGAGG TTGTTGCTCC 20
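Information for SEQ ID NO:29:
Length: 29
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid (synthetic DNA)
Sequence description: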
GGGCATGCTC ATGAACTTCC AGGCTGTGA 29
Claims (13)
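1. A gene encoding an amino acid sequence represented by general formula I:
SIG-Ala-Gly-Gly-Asn-PRO [I]
wherein SIG represents the amino acid sequence of the subtilisin-derived signal peptide shown in SEQ ID NO:3 of the Sequence Listing, and PRO represents the amino acid sequence of the protease to be expressed.
2. The gene according to claim 1, wherein PRO is the amino acid sequence of a hyperthermostable protease derived from a hyperthermophile.
3. The gene according to claim 2, wherein PRO is the amino acid sequence of a protease derived from Pyrococcus furiosus.
4. The gene according to claim 3, wherein PRO comprises the amino acid sequence of a protease that has thermostable protease activity and consists of an amino acid sequence in which one or more amino acid residues are deleted from the C-terminus of the amino acid sequence represented by SEQ ID NO:4 of the Sequence Listing, wherein said protease comprises the amino acid sequence represented by SEQ ID NO:1 of the Sequence Listing.
5. The gene according to claim 4, which is contained in a plasmid selected from pSPO124 and pSPO124ΔC.
6. The gene according to claim 4, wherein PRO comprises the amino acid sequence of a protease consisting of the amino acid sequence represented by SEQ ID NO:1 of the Sequence Listing.
7. A method for producing a protease, comprising culturing a bacterium of the genus Bacillus into which the gene according to any one of claims 1 to 6 has been introduced, and collecting the protein of interest from the culture.
8. The method for producing a protein according to claim 7, wherein the bacterium of the genus Bacillus is Bacillus subtilis.
9. The method for producing a protein according to claim 7, wherein the gene according to any one of claims 1 to 6 is introduced into the bacterium of the genus Bacillus by means of a plasmid vector.
10. The method for producing a protein according to claim 9, wherein a plasmid selected from pSPO124 and pSPO124ΔC is introduced into the bacterium of the genus Bacillus.
11. The method for producing a protein according to claim 10, comprising culturing Bacillus subtilis DB104/pSPO124ΔC FERM P-16227 and collecting the protein of interest from the culture.
12. A plasmid vector into which the gene according to any one of claims 1 to 6 has been inserted.
13. The plasmid vector according to claim 12, which is selected from pSPO124 and pSPO124ΔC.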
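The general formula of claim 1 joins a signal peptide (SIG), the four-residue linker Ala-Gly-Gly-Asn, and the protease sequence (PRO) in that order. The sketch below illustrates that layout at the amino acid level in one-letter code. It is an illustration under stated assumptions: SEQ ID NO:3 is not reproduced in this section, so the `sig` value is a made-up placeholder, the `pro` value reuses the short peptide of SEQ ID NO:22 purely as a stand-in, and the function names are hypothetical.

```python
LINKER = "AGGN"  # Ala-Gly-Gly-Asn from general formula I, one-letter code


def assemble_fusion(sig: str, pro: str) -> str:
    """Build SIG-Ala-Gly-Gly-Asn-PRO from one-letter amino acid strings."""
    return sig + LINKER + pro


def split_fusion(fusion: str, sig: str) -> str:
    """Recover PRO from a fusion, checking that SIG and the linker lead it."""
    prefix = sig + LINKER
    if not fusion.startswith(prefix):
        raise ValueError("sequence does not start with SIG plus the linker")
    return fusion[len(prefix):]


sig = "MKK"           # placeholder only; the real SIG is SEQ ID NO:3
pro = "AELEGLDESAAQ"  # SEQ ID NO:22, used here only as a short stand-in

fusion = assemble_fusion(sig, pro)
print(fusion)
print(split_fusion(fusion, sig) == pro)
```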
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP15196997 | 1997-06-10 | ||
| JP151969/1997 | 1997-06-10 | ||
| JP151969/97 | 1997-06-10 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB988059908A Division CN1190494C (zh) | System for expressing hyperthermostable protease | 1997-06-10 | 1998-06-04 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1657614A (zh) | 2005-08-24 |
| CN1311075C (zh) | 2007-04-18 |
Family
ID=15530186
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB200410101968XA Expired - Fee Related CN1311075C (zh) | System for expressing hyperthermostable protein | 1997-06-10 | 1998-06-04 |
| CNB988059908A Expired - Fee Related CN1190494C (zh) | System for expressing hyperthermostable protease | 1997-06-10 | 1998-06-04 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB988059908A Expired - Fee Related CN1190494C (zh) | System for expressing hyperthermostable protease | 1997-06-10 | 1998-06-04 |
Country Status (10)
| Country | Link |
|---|---|
| US (3) | US6358726B1 (zh) |
| EP (1) | EP0994191B1 (zh) |
| JP (1) | JP3601835B2 (zh) |
| KR (1) | KR100530598B1 (zh) |
| CN (2) | CN1311075C (zh) |
| AT (1) | ATE318913T1 (zh) |
| AU (1) | AU7550098A (zh) |
| DE (1) | DE69833652T2 (zh) |
| TW (1) | TW530088B (zh) |
| WO (1) | WO1998056926A1 (zh) |
Families Citing this family (73)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU3455000A (en) * | 1999-04-07 | 2000-11-14 | Takara Shuzo Co., Ltd. | Composition for decomposing protein |
| ATE411383T1 (de) | 2001-03-14 | 2008-10-15 | Takara Bio Inc | Promotoren |
| EP1578935A2 (en) * | 2002-10-10 | 2005-09-28 | Diversa Corporation | Proteases, nucleic acids encoding them and methods for making and using them |
| DE10315640A1 (de) * | 2003-04-04 | 2004-10-14 | Ignatov, Konstantin | Verfahren zur kontrollierten Freisetzung von Komponenten in eine Lösung |
| CN103509776B (zh) * | 2003-06-19 | 2016-10-12 | Novozymes A/S | Protease |
| GB2416352B (en) * | 2004-07-21 | 2009-01-28 | Bioline Ltd | A method for performing the hot start of enzymatic reactions |
| CA2673527A1 (en) * | 2006-12-21 | 2008-07-24 | Danisco Us Inc. | Compositions and uses for an alpha-amylase polypeptide of bacillus species 195 |
| EP2319921B1 (en) * | 2008-07-31 | 2016-04-20 | Osaka University | Novel protease and use thereof |
| US8748132B2 (en) * | 2008-10-21 | 2014-06-10 | The Chemo-Sero-Therapeutic Research Institute | Process for preparing inclusion body-forming protein |
| US9617527B2 (en) | 2010-04-14 | 2017-04-11 | Novozymes A/S | Polypeptides having glucoamylase activity and polynucleotides encoding same |
| WO2012088303A2 (en) | 2010-12-22 | 2012-06-28 | Novozymes North America, Inc. | Processes for producing fermentation products |
| MX353581B (es) | 2011-10-11 | 2018-01-19 | Novozymes North America Inc | Procesos para producir productos de fermentacion. |
| CN103930544A (zh) * | 2011-11-08 | 2014-07-16 | Novozymes A/S | Method for producing archaeal protease in yeast |
| IN2014CN04905A (zh) * | 2011-12-02 | 2015-09-18 | Novozymes As | |
| ES2935920T3 (es) | 2012-03-30 | 2023-03-13 | Novozymes North America Inc | Procesos de elaboración de productos de fermentación |
| BR112015012968A2 (pt) * | 2012-12-07 | 2017-09-12 | Danisco Us Inc | composições e métodos de uso |
| CA2891504A1 (en) * | 2012-12-07 | 2014-06-12 | Danisco Us Inc. | Compositions and methods of use |
| US11939552B2 (en) | 2013-06-24 | 2024-03-26 | Novozymes A/S | Process of recovering oil |
| ES2754203T3 (es) | 2013-06-24 | 2020-04-16 | Novozymes As | Proceso de extracción de aceite a partir de vinaza diluida |
| WO2014206829A1 (en) * | 2013-06-25 | 2014-12-31 | Novozymes A/S | Expression of natively secreted polypeptides without signal peptide |
| US9951323B2 (en) | 2013-07-17 | 2018-04-24 | Novozymes A/S | Pullulanase chimeras and polynucleotides encoding same |
| CN105431528A (zh) | 2013-08-30 | 2016-03-23 | Novozymes A/S | Enzyme composition and uses thereof |
| US9951364B2 (en) | 2013-09-11 | 2018-04-24 | Novozymes A/S | Processes for producing fermentation products |
| US10030237B2 (en) | 2014-01-22 | 2018-07-24 | Novozymes A/S | Pullulanase variants and polynucleotides encoding same |
| WO2015200659A1 (en) | 2014-06-25 | 2015-12-30 | Novozymes A/S | Xylanase variants and polynucleotides encoding same |
| EP3209788B1 (en) | 2014-10-23 | 2019-06-05 | Novozymes A/S | Glucoamylase variants and polynucleotides encoding same |
| WO2016087327A1 (en) | 2014-12-01 | 2016-06-09 | Novozymes A/S | Polypeptides having pullulanase activity comprising the x25, x45 and cbm41 domains |
| AU2016225049B2 (en) | 2015-02-27 | 2022-03-31 | Microbiogen Pty. Ltd. | Processes of producing ethanol using a fermenting organism |
| WO2016205127A1 (en) | 2015-06-18 | 2016-12-22 | Novozymes A/S | Polypeptides having trehalase activity and the use thereof in process of producing fermentation products |
| WO2017015329A1 (en) | 2015-07-23 | 2017-01-26 | Novozymes A/S | Alpha-amylase variants and polynucleotides encoding same |
| CA3249312A1 (en) | 2015-10-14 | 2025-02-24 | Novozymes A/S | GLUCOAMYLASE VARIANTS AND THE POLYNUCLEOTIDES CODING FOR THEM |
| CA3006992A1 (en) | 2015-12-22 | 2017-06-29 | Novozymes A/S | Processes for improving fermentation product yield using phospholipase c |
| WO2018075430A1 (en) | 2016-10-17 | 2018-04-26 | Novozymes A/S | Methods of reducing foam during ethanol fermentation |
| US10889836B2 (en) | 2016-11-23 | 2021-01-12 | Novozymes A/S | Yeast for ethanol production |
| EP3606936B1 (en) | 2017-04-03 | 2021-10-20 | Novozymes A/S | Recovery process |
| WO2018191215A1 (en) | 2017-04-11 | 2018-10-18 | Novozymes A/S | Glucoamylase variants and polynucleotides encoding same |
| WO2018188667A1 (en) * | 2017-04-14 | 2018-10-18 | Novozymes A/S | Processes for solubilizing municipal solid waste with enzyme compositions comprising protease and enzyme compositions thereof |
| WO2018222990A1 (en) | 2017-06-02 | 2018-12-06 | Novozymes A/S | Improved yeast for ethanol production |
| WO2019005755A1 (en) | 2017-06-28 | 2019-01-03 | Novozymes A/S | POLYPEPTIDES HAVING TREHALASE ACTIVITY AND POLYNUCLEOTIDES ENCODING SAME |
| EP3665275A1 (en) | 2017-08-08 | 2020-06-17 | Novozymes A/S | Polypeptides having trehalase activity and the use thereof in process of producing fermentation products |
| CN111164214A (zh) | 2017-09-15 | 2020-05-15 | Novozymes A/S | Enzyme blends and methods for improving the nutritional quality of animal feed |
| CA3075907A1 (en) | 2017-10-23 | 2019-05-02 | Novozymes A/S | Processes for reducing lactic acid in a biofuel fermentation system |
| DK3720955T5 (da) | 2017-12-08 | 2024-08-26 | Novozymes As | Alfa-amylasevarianter og polynukleotider, der koder for dem |
| DK3720956T5 (da) | 2017-12-08 | 2024-08-26 | Novozymes As | Alfa-amylasevarianter og polynukleotider, der koder for dem |
| MX2020007914A (es) | 2018-01-29 | 2020-10-28 | Novozymes As | Microorganismos con uso de nitrogeno mejorado para produccion de etanol. |
| AU2019222480B9 (en) | 2018-02-15 | 2024-11-28 | Microbiogen Pty. Ltd. | Improved yeast for ethanol production |
| MX2020010526A (es) | 2018-04-09 | 2021-01-08 | Novozymes As | Polipéptidos que tienen actividad alfa-amilasa ypolinucleótidos que los codifican. |
| WO2019231944A2 (en) | 2018-05-31 | 2019-12-05 | Novozymes A/S | Processes for enhancing yeast growth and productivity |
| WO2020014407A1 (en) | 2018-07-11 | 2020-01-16 | Novozymes A/S | Processes for producing fermentation products |
| EP3827088A1 (en) | 2018-07-25 | 2021-06-02 | Novozymes A/S | Enzyme-expressing yeast for ethanol production |
| WO2020076697A1 (en) | 2018-10-08 | 2020-04-16 | Novozymes A/S | Enzyme-expressing yeast for ethanol production |
| WO2020160126A1 (en) | 2019-01-31 | 2020-08-06 | Novozymes A/S | Polypeptides having xylanase activity and use thereof for improving the nutritional quality of animal feed |
| CA3128139A1 (en) | 2019-03-18 | 2020-09-24 | Novozymes A/S | Polypeptides having pullulanase activity suitable for use in liquefaction |
| US20220186266A1 (en) | 2019-04-02 | 2022-06-16 | Novozymes A/S | Process For Producing A Fermentation Product |
| EP4004029A1 (en) | 2019-07-26 | 2022-06-01 | Novozymes A/S | Microorganisms with improved nitrogen transport for ethanol production |
| BR112022002203A2 (pt) | 2019-08-05 | 2022-09-06 | Novozymes As | Misturas de enzimas e processos para produção de um ingrediente alimentar com alto teor de proteína a partir de um subproduto de vinhaça completa |
| MX2021015818A (es) | 2019-08-06 | 2022-02-03 | Novozymes As | Proteinas de fusion para mejorar la expresion de enzimas. |
| WO2021055395A1 (en) | 2019-09-16 | 2021-03-25 | Novozymes A/S | Polypeptides having beta-glucanase activity and polynucleotides encoding same |
| EP4073089A1 (en) | 2019-12-10 | 2022-10-19 | Novozymes A/S | Microorganism for improved pentose fermentation |
| CN114867860A (zh) | 2019-12-16 | 2022-08-05 | Novozymes A/S | Methods for producing fermentation products |
| WO2021163030A2 (en) | 2020-02-10 | 2021-08-19 | Novozymes A/S | Polypeptides having alpha-amylase activity and polynucleotides encoding same |
| US20230142991A1 (en) | 2020-02-10 | 2023-05-11 | Novozymes A/S | Alpha-amylase variants and polynucleotides encoding same |
| CN111876435A (zh) * | 2020-07-31 | 2020-11-03 | Shandong University | Marine thermophilic collagenase A69, its encoding gene, and use thereof |
| WO2022090564A1 (en) | 2020-11-02 | 2022-05-05 | Novozymes A/S | Glucoamylase variants and polynucleotides encoding same |
| US20240279688A1 (en) | 2021-06-07 | 2024-08-22 | Novozymes A/S | Engineered microorganism for improved ethanol fermentation |
| WO2024137250A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Carbohydrate esterase family 3 (ce3) polypeptides having acetyl xylan esterase activity and polynucleotides encoding same |
| CN120380158A (zh) | 2022-12-19 | 2025-07-25 | Novozymes A/S | Methods for reducing slurry viscosity at the back end of a fermentation product production process |
| WO2024137248A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Compositions comprising arabinofuranosidases and a xylanase, and use thereof for increasing hemicellulosic fiber solubilization |
| CN120283060A (zh) | 2022-12-19 | 2025-07-08 | Novozymes A/S | Process for producing fermentation products using fiber-degrading enzymes and engineered yeast |
| EP4638724A1 (en) | 2022-12-19 | 2025-10-29 | Novozymes A/S | Carbohydrate esterase family 1 (ce1) polypeptides having ferulic acid esterase and/or acetyl xylan esterase activity and polynucleotides encoding same |
| WO2024258820A2 (en) | 2023-06-13 | 2024-12-19 | Novozymes A/S | Processes for producing fermentation products using engineered yeast expressing a beta-xylosidase |
| WO2025128568A1 (en) | 2023-12-11 | 2025-06-19 | Novozymes A/S | Composition and use thereof for increasing hemicellulosic fiber solubilization |
| WO2026008449A2 (en) | 2024-07-04 | 2026-01-08 | Novozymes A/S | A process for producing a fermentation product and a concentrated protein co-product |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1995034645A1 (en) * | 1994-06-13 | 1995-12-21 | Takara Shuzo Co., Ltd. | Hyperthermostable protease gene |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| IT1228925B (it) * | 1987-08-07 | 1991-07-10 | Eniricerche Spa | Procedimento per la preparazione dell'ormone della crescita umano |
| ATE205257T1 (de) | 1995-12-12 | 2001-09-15 | Takara Shuzo Co | Gene kodierend für ultrathermostabile proteasen |
-
1998
- 1998-06-04 CN CNB200410101968XA patent/CN1311075C/zh not_active Expired - Fee Related
- 1998-06-04 DE DE69833652T patent/DE69833652T2/de not_active Expired - Lifetime
- 1998-06-04 AU AU75500/98A patent/AU7550098A/en not_active Abandoned
- 1998-06-04 EP EP98923114A patent/EP0994191B1/en not_active Expired - Lifetime
- 1998-06-04 JP JP50206599A patent/JP3601835B2/ja not_active Expired - Lifetime
- 1998-06-04 US US09/445,472 patent/US6358726B1/en not_active Expired - Lifetime
- 1998-06-04 CN CNB988059908A patent/CN1190494C/zh not_active Expired - Fee Related
- 1998-06-04 WO PCT/JP1998/002465 patent/WO1998056926A1/ja not_active Ceased
- 1998-06-04 KR KR10-1999-7011547A patent/KR100530598B1/ko not_active Expired - Fee Related
- 1998-06-04 AT AT98923114T patent/ATE318913T1/de not_active IP Right Cessation
- 1998-06-09 TW TW087109161A patent/TW530088B/zh not_active IP Right Cessation
-
2002
- 2002-03-06 US US10/090,624 patent/US6783970B2/en not_active Expired - Fee Related
-
2004
- 2004-07-12 US US10/888,588 patent/US7144719B2/en not_active Expired - Fee Related
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1995034645A1 (en) * | 1994-06-13 | 1995-12-21 | Takara Shuzo Co., Ltd. | Hyperthermostable protease gene |
| US5756339A (en) * | 1994-06-13 | 1998-05-26 | Takara Shuzo Co., Ltd. | Hyperthermostable protease gene |
Non-Patent Citations (1)
| Title |
|---|
| Isolation and purification of a stable protease. Journal of Jilin Agricultural University, Vol. SA, 1998 * |
Also Published As
| Publication number | Publication date |
|---|---|
| ATE318913T1 (de) | 2006-03-15 |
| DE69833652D1 (de) | 2006-04-27 |
| JP3601835B2 (ja) | 2004-12-15 |
| DE69833652T2 (de) | 2006-09-21 |
| US7144719B2 (en) | 2006-12-05 |
| US6358726B1 (en) | 2002-03-19 |
| TW530088B (en) | 2003-05-01 |
| EP0994191A1 (en) | 2000-04-19 |
| KR20010013540A (ko) | 2001-02-26 |
| WO1998056926A1 (en) | 1998-12-17 |
| US20050084934A1 (en) | 2005-04-21 |
| CN1657614A (zh) | 2005-08-24 |
| KR100530598B1 (ko) | 2005-11-23 |
| CN1260002A (zh) | 2000-07-12 |
| CN1190494C (zh) | 2005-02-23 |
| EP0994191A4 (en) | 2003-04-23 |
| US6783970B2 (en) | 2004-08-31 |
| US20020132335A1 (en) | 2002-09-19 |
| AU7550098A (en) | 1998-12-30 |
| EP0994191B1 (en) | 2006-03-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
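| CN1311075C (zh) | | System for expressing hyperthermostable protein |
| CN1263859C (zh) | | Nucleic acid encoding a polypeptide having protease activity |
| CN1262639C (zh) | | Novel host cells and methods for producing proteins |
| CN1513057A (zh) | | Host microorganism |
| CN1062169A (zh) | | Pichia pastoris acid phosphatase gene |
| CN1500145A (zh) | | Method for the secretory production of proteins |
| CN1233287A (zh) | | Artificial promoter libraries for selected organisms and promoters derived from such libraries |
| CN1151252C (zh) | | Method for producing microbial transglutaminase |
| CN1114355A (zh) | | DNA amplification |
| CN1128879C (zh) | | Hyperthermostable protease gene |
| CN1152955C (zh) | | Method for producing protease |
| CN1160465C (zh) | | Fungi in which the areA, pepC and/or pepE genes have been inactivated |
| CN1198939C (zh) | | Regulatory sequence of the cellulase cbh1 gene from Trichoderma viride and system for mass production of proteins or polypeptides using the sequence |
| CN1611601A (zh) | | Tripeptidyl aminopeptidase |
| CN1154739C (zh) | | System for mass production of proteins or peptides using microorganisms of the genus Humicola |
| CN1592785A (zh) | | Expression of recombinant proteinase K from Tritirachium album in yeast |
| CN1875106A (zh) | | Recombinant microorganism |
| CN1153831C (zh) | | Gene encoding an endoglycoceramidase activator |
| CN1550502A (zh) | | Signal sequences for preparing Leu-hirudin by secretion from Escherichia coli into the culture medium |
| CN1878859A (zh) | | Novel Brevibacillus choshinensis and method for producing protein using the microorganism as a host |
| CN1301307A (zh) | | Methods for producing polypeptides in filamentous fungal mutant cells |
| CN101045933A (zh) | | Gene mcp01 of a cold-adapted protease and preparation method thereof |
| CN1768135A (zh) | | Stable lipase variants |
| CN1228447C (zh) | | Recombinant expression vector containing the gene for mature human pancreatic tissue kallikrein protein |
| CN1125181C (zh) | | Vector for expressing N-terminally extended proteins in yeast cells |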
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | C14 | Grant of patent or utility model | |
| | GR01 | Patent grant | |
| | C17 | Cessation of patent right | |
| | CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20070418 Termination date: 20140604 |