US20040265936A1 - Recombinant expression of S-layer proteins - Google Patents
Recombinant expression of S-layer proteins Download PDFInfo
- Publication number
- US20040265936A1 US20040265936A1 US10/890,179 US89017904A US2004265936A1 US 20040265936 A1 US20040265936 A1 US 20040265936A1 US 89017904 A US89017904 A US 89017904A US 2004265936 A1 US2004265936 A1 US 2004265936A1
- Authority
- US
- United States
- Prior art keywords
- thr
- val
- ala
- lys
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010082913 S-layer proteins Proteins 0.000 title abstract description 28
- 238000003259 recombinant expression Methods 0.000 title description 3
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 99
- 239000002773 nucleotide Substances 0.000 claims abstract description 41
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 41
- 150000007523 nucleic acids Chemical class 0.000 claims description 66
- 108020004707 nucleic acids Proteins 0.000 claims description 65
- 102000039446 nucleic acids Human genes 0.000 claims description 65
- 238000003780 insertion Methods 0.000 claims description 62
- 230000037431 insertion Effects 0.000 claims description 62
- 101710168515 Cell surface glycoprotein Proteins 0.000 claims description 56
- 101710099182 S-layer protein Proteins 0.000 claims description 56
- 102000004169 proteins and genes Human genes 0.000 claims description 44
- 150000001413 amino acids Chemical class 0.000 claims description 42
- 235000018102 proteins Nutrition 0.000 claims description 42
- 235000001014 amino acid Nutrition 0.000 claims description 32
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 28
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 20
- 102000004190 Enzymes Human genes 0.000 claims description 17
- 108090000790 Enzymes Proteins 0.000 claims description 17
- 108090000695 Cytokines Proteins 0.000 claims description 13
- 102000004127 Cytokines Human genes 0.000 claims description 13
- 108010090804 Streptavidin Proteins 0.000 claims description 13
- 230000002163 immunogen Effects 0.000 claims description 12
- 230000027455 binding Effects 0.000 claims description 10
- 230000002068 genetic effect Effects 0.000 claims description 8
- 230000000890 antigenic effect Effects 0.000 claims description 7
- 102000014914 Carrier Proteins Human genes 0.000 claims description 6
- 108091008324 binding proteins Proteins 0.000 claims description 6
- 239000002184 metal Substances 0.000 claims description 6
- 229910052751 metal Inorganic materials 0.000 claims description 6
- 241001529453 unidentified herpesvirus Species 0.000 claims description 6
- 108090000363 Bacterial Luciferases Proteins 0.000 claims description 5
- 230000002009 allergenic effect Effects 0.000 claims description 5
- 230000004568 DNA-binding Effects 0.000 claims description 4
- 241000710198 Foot-and-mouth disease virus Species 0.000 claims description 4
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 4
- 102000015696 Interleukins Human genes 0.000 claims description 3
- 108010063738 Interleukins Proteins 0.000 claims description 3
- 101710120037 Toxin CcdB Proteins 0.000 claims description 3
- 206010054094 Tumour necrosis Diseases 0.000 claims description 3
- 239000002253 acid Substances 0.000 claims description 3
- 239000002158 endotoxin Substances 0.000 claims description 3
- 229940047122 interleukins Drugs 0.000 claims description 3
- 102000014150 Interferons Human genes 0.000 claims 2
- 108010050904 Interferons Proteins 0.000 claims 2
- 229940047124 interferons Drugs 0.000 claims 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 238000000034 method Methods 0.000 abstract description 27
- 238000004519 manufacturing process Methods 0.000 abstract description 12
- 230000008569 process Effects 0.000 abstract description 11
- 239000010410 layer Substances 0.000 description 78
- 210000004027 cell Anatomy 0.000 description 71
- 230000014509 gene expression Effects 0.000 description 35
- 101100149559 Geobacillus stearothermophilus sbsA gene Proteins 0.000 description 32
- 241000588724 Escherichia coli Species 0.000 description 29
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 24
- 239000013598 vector Substances 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 19
- 108020004414 DNA Proteins 0.000 description 18
- 108010076504 Protein Sorting Signals Proteins 0.000 description 18
- 238000003776 cleavage reaction Methods 0.000 description 18
- 102000037865 fusion proteins Human genes 0.000 description 18
- 108020001507 fusion proteins Proteins 0.000 description 18
- 239000013615 primer Substances 0.000 description 18
- 102000004196 processed proteins & peptides Human genes 0.000 description 18
- 230000007017 scission Effects 0.000 description 18
- 229920001184 polypeptide Polymers 0.000 description 16
- 241000894006 Bacteria Species 0.000 description 14
- 239000012634 fragment Substances 0.000 description 14
- 108010017391 lysylvaline Proteins 0.000 description 14
- 241000880493 Leptailurus serval Species 0.000 description 12
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 12
- 101150085357 sbsA gene Proteins 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 12
- 108020004705 Codon Proteins 0.000 description 10
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 10
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 230000001580 bacterial effect Effects 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 108010003700 lysyl aspartic acid Proteins 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 8
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 8
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 8
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 235000014469 Bacillus subtilis Nutrition 0.000 description 7
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- 230000006698 induction Effects 0.000 description 7
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 6
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 6
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 6
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 6
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 6
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 6
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 6
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 6
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 6
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 6
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 6
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 210000000805 cytoplasm Anatomy 0.000 description 6
- 239000003053 toxin Substances 0.000 description 6
- 231100000765 toxin Toxicity 0.000 description 6
- 108700012359 toxins Proteins 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 5
- 241000282326 Felis catus Species 0.000 description 5
- 235000004279 alanine Nutrition 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 108010078304 poly-beta-hydroxybutyrate polymerase Proteins 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 4
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 4
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 4
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 4
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 4
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 4
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 4
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 4
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 4
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 4
- 241000252867 Cupriavidus metallidurans Species 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 4
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 4
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 4
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 4
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 4
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 4
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 4
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 4
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 4
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 4
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 4
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 4
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 4
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 4
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 4
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 4
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 4
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 4
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 4
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 4
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 4
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 4
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 4
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 4
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 4
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 4
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 4
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 4
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 4
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 229960005486 vaccine Drugs 0.000 description 4
- 238000001262 western blot Methods 0.000 description 4
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical group O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 230000008105 immune reaction Effects 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010036211 5-HT-moduline Proteins 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 2
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 2
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 241000305071 Enterobacterales Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 241000192125 Firmicutes Species 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 2
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 2
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 2
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 2
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 2
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 2
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- ZFWISYLMLXFBSX-KKPKCPPISA-N Ile-Trp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N ZFWISYLMLXFBSX-KKPKCPPISA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- 108010071324 Livagen Proteins 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 101100205189 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-5 gene Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 108010013639 Peptidoglycan Proteins 0.000 description 2
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 2
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 2
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 2
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 2
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 2
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 2
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 238000001493 electron microscopy Methods 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000000877 morphologic effect Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 108010091617 pentalysine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000002344 surface layer Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- WHBMMWSBFZVSSR-UHFFFAOYSA-N 3-hydroxybutyric acid Chemical compound CC(O)CC(O)=O WHBMMWSBFZVSSR-UHFFFAOYSA-N 0.000 description 1
- 102100026105 3-ketoacyl-CoA thiolase, mitochondrial Human genes 0.000 description 1
- SJZRECIVHVDYJC-UHFFFAOYSA-N 4-hydroxybutyric acid Chemical compound OCCCC(O)=O SJZRECIVHVDYJC-UHFFFAOYSA-N 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 108010003902 Acetyl-CoA C-acyltransferase Proteins 0.000 description 1
- 208000035285 Allergic Seasonal Rhinitis Diseases 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000304886 Bacilli Species 0.000 description 1
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 1
- 235000018185 Betula X alpestris Nutrition 0.000 description 1
- 235000018212 Betula X uliginosa Nutrition 0.000 description 1
- 241000149420 Bothrometopus brevis Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000423297 Geobacillus stearothermophilus 10 Species 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108010093096 Immobilized Enzymes Proteins 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000193386 Lysinibacillus sphaericus Species 0.000 description 1
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 229920001397 Poly-beta-hydroxybutyrate Polymers 0.000 description 1
- 229920000331 Polyhydroxybutyrate Polymers 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 206010048908 Seasonal allergy Diseases 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- YCUVUDODLRLVIC-UHFFFAOYSA-N Sudan black B Chemical compound C1=CC(=C23)NC(C)(C)NC2=CC=CC3=C1N=NC(C1=CC=CC=C11)=CC=C1N=NC1=CC=CC=C1 YCUVUDODLRLVIC-UHFFFAOYSA-N 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- COQLPRJCUIATTQ-UHFFFAOYSA-N Uranyl acetate Chemical compound O.O.O=[U]=O.CC(O)=O.CC(O)=O COQLPRJCUIATTQ-UHFFFAOYSA-N 0.000 description 1
- 241000607618 Vibrio harveyi Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 108091000039 acetoacetyl-CoA reductase Proteins 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 241000385736 bacterium B Species 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 239000013575 birch pollen allergen Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 101150012763 endA gene Proteins 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000011990 functional testing Methods 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000000010 microbial pathogen Species 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 101150048611 phaC gene Proteins 0.000 description 1
- 239000013573 pollen allergen Substances 0.000 description 1
- 201000004338 pollen allergy Diseases 0.000 description 1
- 239000005014 poly(hydroxyalkanoate) Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 210000004896 polypeptide structure Anatomy 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 239000011253 protective coating Substances 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000001338 self-assembly Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000009987 spinning Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/735—Fusion polypeptide containing domain for protein-protein interaction containing a domain for self-assembly, e.g. a viral coat protein (includes phage display)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16711—Varicellovirus, e.g. human herpesvirus 3, Varicella Zoster, pseudorabies
- C12N2710/16722—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the present invention concerns processes for the recombinant production of S-layer proteins and modified S-layer proteins in gram-negative host cells.
- S-layers Crystalline bacterial cell surface layers form the outermost cell wall component in many eubacteria and most of the archaebacteria (Sleytr et al. (1988), Crystalline Bacterial Cell Surface Layers, “Springer Verlag Berlin”; Messner and Sleytr, Adv. Microb. Physiol. 33 (1992), 213-275).
- Most of the presently known S-layer proteins are composed of identical proteins or glycoproteins which have apparent molecular weights in the range of 40,000 to 220,000.
- the components of S-layers are self-assembling and most of the lattices have an oblique (p2), quadratic (p4) or hexagonal (p6) symmetry.
- the functions of bacterial S-layers are still not completely understood but due to their location on the cell surface the porous crystalline S-layers probably serve mainly as protective coatings, molecular sieves or to promote cell adhesion and surface recognition.
- the main component of the S-layer is a 128 kd protein which is the most frequent protein in the cell with a proportion of about 15% relative to the total protein components.
- Various strains of B.stearothermophilus have been characterized which differ with regard to the type of the S-layer lattice, the molecular weight and glycosilation of the S-layer components (Messner and Sleytr (1992), supra).
- the German Patent Application P 44 25 527.6 discloses the signal peptide-coding section of the S-layer gene from B.stearothermophilus and the amino acid sequence derived therefrom.
- the cleavage site between the signal peptide and the mature protein is located between position 30 and 31 of the amino acid sequence.
- the signal peptide-coding nucleic acid can be operatively linked to a protein-coding nucleic acid and can be used for the recombinant production of proteins in a process in which a transformed host cell is provided, the host cell is cultured under conditions which lead to an expression of the nucleic acid and to production and secretion of the polypeptide coded thereby and the resulting polypeptide is isolated from the culture medium.
- Prokaryotic organisms are preferably used as host cells in particular gram-positive organisms of the genus bacillus.
- S-layer proteins are not only possible in gram-positive prokaryotic host cells but also in gram-negative prokaryotic host cells.
- the S-layer protein is not formed in the interior of the host cell in the form of ordered inclusion bodies but rather unexpectedly in the form of ordered monomolecular layers.
- one subject matter of the present invention is a process for the recombinant production of S-layer proteins characterized in that (a) a gram-negative prokaryotic host cell is provided which is transformed with a nucleic acid coding for an S-layer protein selected from (i) a nucleic acid which comprises the nucleotide sequence shown in SEQ ID NO.
- nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions; (b) the host cell is cultured under conditions which lead to an expression of the nucleic acid and to production of the polypeptide coded thereby and (c) the resulting polypeptide is isolated from the host cell.
- stringent hybridization is understood within the sense of the present invention to mean that a hybridization still also occurs after washing at 55° C., preferably 60° C. in an aqueous low salt buffer (e.g. 0.2 ⁇ SSC) (see also Sambrook et al. (1989), Molecular Cloning. A Laboratory Manual).
- an aqueous low salt buffer e.g. 0.2 ⁇ SSC
- the process according to the invention is carried out in gram-negative prokaryotic host cells.
- an ordered S-layer protein structure is surprisingly obtained in the cell interior.
- Enterobacteria in particular E. coli , are preferably used as host cells.
- the E. coli strain pop2125 which was deposited on the 31.01.1996 at the “Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH”, Mascheroder Weg 1b, D 38124 Braunschweig under the file number DSM 10509 is particularly preferred.
- the process according to the invention can also be used to isolate recombinant S-layer proteins.
- a nucleic acid coding for the S-layer protein which contains one or several insertions which code for peptide or polypeptide sequences. These insertions can, on the one hand, only code for peptides with a few amino acids e.g. 1-25 amino acids.
- the insertions can also code for larger polypeptides of for example up to 1000 amino acids and preferably up to 500 amino acids without loss of the ability of the S-layer protein to form a correctly folded structure.
- the recombinant S-layer protein can also have amino acid substitutions, in particular substitutions of individual amino acids in the region of the insertion sites as well as optionally deletions of individual amino acids or short amino acid sections of up to 30 amino acids.
- Regions between the positions 1-1200 and 2200-3000 of the nucleotide sequence shown in SEQ ID NO.1 are preferred as insertion sites for polypeptide-coding sequences. Particularly preferred insertion sites are the NruI cleavage site at position 582, the PvuII cleavage site at position 878, the SnaB-I cleavage site at position 917, the PVuII cleavage site at position 2504 and the PvuII cleavage site at position 2649. It was already possible to demonstrate the insertion of a nucleic acid coding for streptavidin into the NruI cleavage site at position 581.
- the peptide or polypeptide-coding insertions are preferably selected from nucleotide sequences which code for cysteine residues, regions with several charged amino acids, e.g. Arg, Lys, Asp or Glu, or Tyr residues, DNA-binding epitopes, antigenic, allergenic or immunogenic epitopes, metal-binding epitopes, streptavidin, enzymes, cytokines or antibody-binding proteins.
- a particularly preferred example of an insertion into the nucleic acid coding for the S-layer protein is a nucleotide sequence coding for streptavidin. In this manner it is possible to obtain universal carrier molecules which are suitable for coupling biotinylated reagents and for detection in immunological or hybridization test procedures.
- a further preferred example of insertions are antigenic, allergenic or immunogenic epitopes e.g. epitopes from pathogenic microorganisms such as bacteria, fungi, parasites etc. and viruses, or epitopes from plants or epitopes against endogenous substances e.g. cytokines as well as against toxins in particular endotoxins.
- immunogenic epitopes are epitopes from herpes viruses such as the herpes virus 6 or pseudorabies virus (Lomniczi et al., J. Virol.
- immunogenic epitopes from the genes gB, gC or/and gD, or foot-and-mouth disease virus (FMDV), in particular epitopes from the gene sections which code for VP1, VP2 or/and VP3.
- the immunogenic epitopes can be selected such that they promote an antibody-mediated immune reaction or/and the production of a cellular immune reaction e.g. by stimulation of T cells.
- suitable allergenic epitopes are birch pollen allergens e.g. Bet v I (Ebner et al., J. Immunol. 150 (1993) 1047-1054).
- Antigenic epitopes are additionally particularly preferred which are able to bind and filter out endogenous or exogenous substances such as cytokines or toxins from serum or other body fluids.
- Such epitopes can include components of cytokine or toxin receptors or of antibodies against cytokines or toxins.
- the insertions can also code for enzymes.
- Preferred examples are enzymes for the synthesis of polyhydroxybutyric acid e.g. PHB synthase. Incorporation of PHB synthase into the S-layer can lead to the formation of a molecular spinning nozzle under suitable conditions when the substrate hydroxybutyric acid is provided.
- a further preferred example of an enzyme is bacterial luciferase. In this case when the enzyme substrate, an aldehyde, is supplied and O 2 is present, a molecular laser can be obtained.
- Insertions are likewise preferred which code for cytokines such as interleukins, interferones or tumour necrosis factors. These molecules can for example be used in combination with immunogenic epitopes to prepare vaccines.
- a cell which contains immobilized recombinant polypeptides in a native form e.g. active enzymes in the cytoplasm.
- immobilized recombinant polypeptides in a native form e.g. active enzymes in the cytoplasm.
- 50,000-200,000 e.g. ca. 100,000 recombinant molecules can be immobilized per m 2 recombinant S-layer.
- Up to 3000 m 2 S-layer can be obtained per kg recombinant E. coli cells.
- the nucleic acid coding for the S-layer protein is preferably used in operative linkage with a nucleic acid coding for a signal peptide of gram-positive bacteria i.e. the signal peptide-coding nucleic acid is located on the 5′ side of the S-layer protein-coding nucleic acid.
- the nucleic acid coding for the signal peptide particularly preferably comprises (a) the signal peptide-coding section of the nucleotide sequence shown in SEQ ID NO.
- nucleic acid which codes for a recombinant S-layer protein and is selected from (i) a nucleic acid which comprises the nucleotide sequence shown in SEQ ID NO.1 from position 1 to 3684 optionally without the signal peptide-coding-section (ii) a nucleic acid which comprises a nucleotide sequence corresponding to a nucleic acid from (i) within the scope of the degeneracy of the genetic code and (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes under stringent conditions with the nucleic acids from (i) or/and (ii).
- the coding nucleotide sequence of the S-layer gene sbsA from B.stearothermophilus including the signal peptide-coding section is shown in SEQ ID NO. 1.
- the signal peptide-coding section extends from position 1 to 90 of the nucleotide sequence shown in SEQ ID NO. 1.
- the section coding for the mature SbsA polypeptide extends from position 91 to 3684.
- the sbsA gene of B.stearothermophilus codes for a protein with a total of 1228 amino acids including an N-terminal signal peptide with 30 amino acids (SEQ ID NO. 2).
- the cleavage site between the signal peptide and the mature protein is located between position 30 and 31 of the amino acid sequence.
- the signal peptide has a basic amino-terminal domain followed by a hydrophobic domain.
- a further subject matter of the present invention is a recombinant vector which contains at least one copy of a nucleic acid according to the invention.
- the vector is preferably replicatable in prokaryotes.
- the vector is particularly preferably a prokaryotic plasmid.
- a further subject matter of the present invention is a host cell which is transformed with a nucleic acid or a recombinant vector according to the present invention.
- the cell is preferably a gram-negative prokaryotic organism and most preferably an E. coli cell.
- the cell according to the invention can contain a recombinant S-layer structure in its interior. Methods for the transformation of cells with nucleic acids are general state of the art (cf. Sambrook et al., supra) and therefore do not need to be elucidated.
- Yet a further subject matter of the present invention is a recombinant S-layer protein which contains at least one peptide insertion or/and polypeptide insertion within the amino acid sequence shown in SEQ ID NO. 2.
- Preferred examples of peptide insertions and polypeptide insertions have already been elucidated.
- a recombinant S-layer structure can be assembled from recombinant S-layer protein molecules according to the invention which contain at least one recombinant S-layer protein according to the invention as a subunit. Furthermore it is preferred that the S-layer structure according to the invention also contains non-modified S-layer proteins as diluent molecules.
- the non-modified S-layer proteins are preferably present in a molar proportion of 10-99% relative to the total S-layer proteins.
- the S-layer structure according to the invention can comprise several layers that are covalently linked together or by means of affinity binding.
- Covalent linkages can for example be introduced by insertions of cysteine residues and a subsequent formation of cystine bridges.
- Linkages by affinity binding comprise for example antibody-antigen, antibody-protein A or antibody-protein G or streptavidin-biotin interactions.
- S-layer structures which contain recombinant S-layer proteins can optionally also be prepared in a carrier-bound form.
- the S-layer structure can be reassembled from individual units in the presence of a peptidoglycan carrier to for example produce peptidoglycan layers which are covered on one or on both sides with an S-layer structure.
- Another method of preparing carrier-bound S-layer structures is to produce an S-layer layer at an interface between two media e.g. water/air and to immobilize this layer on a solid phase e.g. a filter membrane (cf. e.g. Pum and Sleytr (1994), Thin Solid Films 244, 882-886; kupcü et al., (1995), Biochim. Biophys. Acta 1235, 263-269).
- the recombinant S-layer proteins and S-layer structures according to the invention are suitable for a multitude of applications.
- An application as a vaccine or adjuvant is particularly preferred in which case recombinant S-layer proteins are used which contain immunogenic epitopes of pathogens and/or endogenous immunostimulatory polypeptides such as cytokines.
- cytokines endogenous immunostimulatory polypeptides
- suitable “bacterial ghosts” is described for example in the International Patent application PCT/EP91/00967 to which reference is herewith made.
- modified bacteria are disclosed which are obtainable by transformation of a gram-negative bacterium with the gene of a lytically active membrane protein from bacteriophages, with the gene of a lytically active toxin release protein or with genes which contain partial sequences thereof which code for lytic proteins, culturing the bacterium, expression of this lysis gene and isolation of the resulting bacterial ghost from the culture medium.
- a recombinant protein which is obtainable by expression of a recombinant DNA in these gram-negative bacteria, can be bound to the membrane of these bacteria as described in the European Patent 0 516 655.
- This recombinant DNA comprises a first DNA sequence which codes for a hydrophobic, non-lytically active membrane-integrating protein domain which has an ⁇ -helical structure and is composed of 14-20 amino acids which can be flanked N- and C-terminally by 2-30 arbitrary amino acids in each case.
- a second DNA sequence is in operative linkage with this first DNA sequence which codes for a desired recombinant protein.
- the gram-negative bacterium contains a third DNA sequence which is under a different control from the first and second DNA sequences and codes for a lytically active membrane protein from bacteriophages or a lytically active toxin release protein or for their lytically active components.
- So-called “bacterial ghosts” are obtained by expression and lysis of such recombinant gram-negative bacteria which contain an intact surface structure with immunogenic epitopes bound to the surface.
- vaccines and adjuvants can be produced which have particularly advantageous properties.
- a further particularly preferred application for recombinant S-layer proteins and S-layer structures is their use as an enzyme reactor.
- Such an enzyme reactor can for example be formed by a cell which contains a recombinant S-layer structure according to the invention in its interior.
- the enzyme reactor can also be formed from isolated and in vitro reassembled S-layer structures or combinations of various S-layer structures.
- the gram-positive bacterium B.stearothermophilus PV72 has an additional S-layer protein in addition to SbsA which is subsequently denoted as SbsB (Sara and Sleytr (1994), J. Bacteriol. 176, 7182-7189). It was possible to isolate and characterize the sbsB gene by amplification using suitable nucleic acid primers.
- the coding nucleotide sequence of the S-layer gene sbsB from B.stearothermophilus including the signal peptide-coding section which extends from position 1 to 93 of the nucleic acid sequence is shown in SEQ ID NO.5.
- the amino acid sequence derived therefrom is shown in SEQ ID NO.6.
- the sbsB gene codes for a protein with a total of 921 amino acids including an N-terminal signal peptide with 31 amino acids.
- One subject matter of the present invention is hence a nucleic acid which codes for an S-layer protein and is selected from
- nucleic acid which comprises the nucleotide sequence from position 1 to 2763 shown in SEQ ID NO.5 optionally without the signal peptide-coding section,
- nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code
- nucleic acid which comprises a nucleotide sequence that hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- nucleic acid insertion coding for a peptide or polypeptide into the sbsB gene within the region coding for the S-layer protein.
- insertions in the sbsB gene reference is made to the previous statements regarding the sbsA gene.
- a further subject matter of the present invention is a vector which contains at least one copy of an sbsB gene optionally containing an insertion.
- This vector can be replicated in eukaryotes, prokaryotes or in eukaryotes and prokaryotes. It can be a vector that can be integrated into the genome of the host cell or a vector which is present extrachromosomally.
- the vector according to the invention is preferably a plasmid in particular a prokaryotic plasmid.
- a further subject matter of the present invention is a host cell which is transformed with an sbsB gene wherein the sbsB gene optionally can contain an insertion.
- the host cell can be a eukaryotic as well as a prokaryotic cell.
- the cell is preferably a prokaryotic organism. Gram-positive organisms e.g. organisms of the genus bacillus as well as gram-negative organisms such as enterobacteria in particular E. coli are preferred. Methods for transforming eukaryotic and prokaryotic cells with nucleic acids are known and therefore do not need to be elucidated in detail.
- the present invention also concerns an SbsB protein i.e. an S-layer protein which is coded by a nucleic acid as defined above.
- SbsB proteins are particularly preferred which contain one or several peptide or/and polypeptide insertions within the sbsB sequence.
- the SbsB part of a polypeptide according to the invention particularly preferably has a homology of at least 80% and in particular of at least 90% to the amino acid sequence shown in SEQ ID NO.6.
- a recombinant S-layer structure can also be assembled from the recombinant SbsB-S-layer protein molecules analogous to the recombinant SbsA-S-layer structure.
- the non-modified S-layer proteins are preferably present in a molar proportion of 10-99% relative to the total S-layer proteins.
- Recombinant S-layer proteins are obtainable by a process in which
- a host cell which contains a nucleic acid coding for an S-layer protein which contains a peptide-coding or polypeptide-coding insertion within the region coding for the S-layer protein,
- a recombinant SbsA-S-layer protein is prepared i.e. the nucleic acid coding for the recombinant S-layer protein is selected from
- nucleic acid which comprises the nucleotide sequence from position 1 to 3684 shown in SEQ ID NO.1 optionally without the signal peptide-coding section,
- nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and
- nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- a recombinant SbsB-S-layer protein is prepared i.e. the nucleic acid coding for the recombinant S-layer protein is selected from
- nucleic acid which comprises the nucleotide sequence from position 1 to 2763 shown in SEQ ID NO.5 optionally without the signal peptide-coding section,
- nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and
- nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- the recombinant S-layer proteins can on the one hand be produced in a heterologous host cell i.e. in a host cell which originally contains no S-layer gene.
- heterologous host cells are gram-negative prokaryotic organisms such as E. coli.
- the heterologous expression of S-layer proteins can also take place in gram-positive prokaryotic organisms such as B. subtilis.
- integration vectors are preferably used which contain a native or/and a recombinant S-layer gene.
- native signal sequences When the native signal sequences are used the S-layer proteins are secreted into the culture supernatant.
- the recombinant S-layer proteins in homologous host cells i.e. host cells which originally contain a natural S-layer gene.
- the recombinant S-layer gene is introduced into the host cell in such a way that the host cell is still able to express a further S-layer gene which codes for a non-modified S-layer protein.
- the non-modified S-layer protein is preferably capable of forming an S-layer structure that is compatible with the recombinant S-layer protein.
- An example of this embodiment of homologous expression is a B.stearothermophilus PV72 cell which contains intact natural sbsA genes or/and sbsB genes and is transformed with a plasmid which contains a recombinant S-layer-gene.
- the homologous expression can occur in a host cell in which the intact S-layer gene originally present has been inactivated. Consequently in this embodiment no further S-layer gene is expressed in the host cell which codes for a non-modified S-layer protein which is able to form a compatible S-layer structure with the recombinant S-layer protein.
- a specific example of such a host cell is a B.stearothermophilus PV72 cell in the genome of which a gene coding for a recombinant S-layer protein has been introduced, e.g. by homologous recombination, which replaces the original S-layer gene.
- a further example of such a host cell is a B.stearothermophilus cell in which the native S-layer gene has been inactivated e.g. by site-specific mutagenesis or/and homologous recombination and is transformed with a vector containing a recombinant S-layer gene.
- Gram-positive prokaryotic organisms are usually used as host cells for the homologous expression of recombinant S-layer genes.
- B.stearothermophilus PV72 is particularly preferred as a host cell which can be cultured at a high temperature in a defined synthetic medium (Schuster et al., (1995), Biotechnol. and Bioeng. 48: 66-77).
- SEQ ID NO.1 shows the complete nucleotide sequence of the coding section of the S-layer gene sbsA of B.stearothermophilus
- SEQ ID NO.2 shows the amino acid sequence derived therefrom
- SEQ ID NO.3 shows the nucleotide sequence of the primer T5-X
- SEQ ID NO.4 shows the nucleotide sequence of the primer E
- SEQ ID NO.5 shows the complete nucleotide sequence of the coding section of the S-layer gene sbsB of B.stearothermophilus
- SEQ ID NO.6 shows the amino acid sequence derived therefrom
- SEQ ID NO.7 shows the nucleotide sequence of a partial fragment of the streptavidin gene
- SEQ ID NO.8 shows the nucleotide sequence of the primer NIS 2AG
- SEQ ID NO.9 shows the nucleotide sequence of the primer LIS C3
- FIG. 1 shows a schematic representation of the sbsA PCR fragment used to prepare the recombinant vector pBK4;
- FIG. 2 shows a schematic representation of peptide insertions in the amino acid sequence of the SbsA S-layer protein
- FIG. 3 shows a schematic representation of amino acid substitutions and amino acid insertions in recombinant S-layer proteins.
- Gram-positive bacteria of the strain Bacillus stearothermophilus PV72 were cultured at 58° C. in SVIII medium (Bartelmus and Perschak, Z. Zuckerrind. 7 (1957), 276-281).
- Bacteria of the strain E. coli pop2135 (endA, thi, hsdr, malT, cI857, ⁇ pR, malPQ) were cultured in LB medium (Sambrook et al., (1989), supra). Ampicillin was added to the medium at a final concentration of 100 ⁇ g/ml to select for transformants.
- the plasmid pPLcAT10 ( ⁇ pL, bla, colE1) (Stanssens et al., Gene 36 (1985), 211-223) was used as the cloning vector.
- Competent cells were transformed by electroporation using a Bio-Rad gene pulser (Bio-Rad Laboratories, Richmond, Calif. USA) according to the manufacturer's instructions.
- Plasmid DNA was isolated by the method of Birnboim and Doly (Nucleic Acids Res. 7 (1979), 1513-1523). Chromosomal DNA was isolated according to the method described in Ausubel et al. (Current Protocols in Molecular Biology (1987), New York, John Wiley).
- DNA sequences of the 5′ regions and the 3′ regions (including the region coding for the signal sequence) of the gene sbsA in the vector pPLcAT10 were determined by the dideoxy chain termination method of Sanger et al.
- the primers used for sequencing were constructed on the basis of the already published sbsA sequence (Kuen et al. Gene 145 (1994), 115-120).
- the PCR amplification of the sbsA gene was carried out in a reaction volume of 100 ⁇ l in which 200 ⁇ M deoxynucleotides, 1 U Pfu-polymerase (Stratagene), 1 ⁇ Pfu-reaction buffer, 0.5 ⁇ M of each oligonucleotide primer and 100 ng genomic DNA from B.stearothermophilus as a template were present.
- the amplification was carried out for 30 cycles in a thermocycler (Biomed thermocycler 60). Each cycle was composed of a denaturing step of 1.5 min at 95° C., an annealing step of 1 min at 56° C. and 1 min at 50° C. as well as an extension step of 2 min at 72° C.
- primer T5-X shown in the sequence protocol as SEQ ID NO.3 which flanks the 5′ region of sbsA and contains an XbaI site and the primer E shown in the sequence protocol in SEQ ID NO.4 which flanks the 20 nucleotide upstream region of the transcription terminator of the sbsA sequence and contains a BamHI site were used as primers.
- the sbsA gene obtained by PCR with a length of 3.79 kb was purified and cleaved with the restriction endonucleases XbaI and BamHI.
- the resulting XbaI-BamHI fragment was cloned into the corresponding restriction sites of the vector pPLcAT10 so that the sbsA gene was under transcriptional control of the pL promoter located upstream.
- the ATG start codon of the sbsA sequence was reconstructed by the cloning procedure.
- the cloned sbsA sequence contained the N-terminal signal sequence of sbsA and ended 20 nt after the transcription terminator.
- the E. coli strain pop2135 was transformed by electrotransformation.
- the resulting clones were subjected to a DNA restriction analysis.
- a positive clone was sequenced in order to verify the correct sequence transitions at the 5′ and 3′ ends. This clone was named pBK4.
- FIG. 1 A schematic representation of the 3.79 kb XbaI sbsA fragment and its location in the multiple cloning site of the plasmid pBK4 is shown in FIG. 1 (abbreviations: tT: transcription terminator; ori: origin of the DNA replication; amp: ampicillin resistance gene).
- E. coli pop2135/pBK4 cells were cultured at 28° C. until an optical density OD 600 of 0.3 was reached. Then the expression of sbsA was induced by increasing the culture temperature from 28° C. to 42° C. 1.5 ml aliquots were taken before and 1, 2, 3 and 5 hours after induction of the sbsA expression.
- E. coli pop2135/pPLcAT10 (cultured under the same conditions) and B.stearothermophilus PV72 were used as controls.
- the cytoplasm of non-induced E. coli cells exhibited the typical granular structure which did not change even when the OD of the suspensions increased.
- Longitudinal sections of E. coli cells which were harvested 1 hour after induction of the S-layer protein expression exhibited parallel, leaf-like structures in the cytoplasm. From cross sections it was apparent that these structures have a concentric arrangement.
- the sbsA protein recombinantly produced in E. coli could also be detected by immunogold labelling with sbsA-specific antibodies. An ordered structure of the recombinantly produced SbsA protein was also found with this detection method.
- a modified kanamycin cassette (1.3 kb) was used for the site-specific insertion mutagenesis of the sbsA gene which was isolated by cleavage of the plasmid pWJC3 (obtained from W. T. McAllister, New York) by SmaI.
- the cassette was ligated into five different blunt-ended restriction sites of the sbsA gene, i.e. into the NruI site at position bp 582 (pSL582), into the SnaBI site at position bp 917 (pSL917) and into each of the PvuII sites at positions bp 878 (pSL878), bp 2504 (pSL2504) and bp 2649 (pSL2649).
- the cassette was removed from the insertion site by cleavage with ApaI followed by a religation of the S-layer plasmid pBK4.
- the cutting out and religation procedure left an insertion of 6 bp CCCGGG (ApaI restriction site).
- the system of this linker insertion is shown schematically in FIG. 2.
- the resulting recombinant S-layer genes code for modified sbsA proteins elongated by 2 amino acids.
- alanine at position 293 was substituted by glycine.
- the amino acids alanine and proline were inserted between the amino acids 835 and 836 and the alanine at position 835 was replaced by glycine.
- the SbsA-streptavidin fusion proteins can be isolated as monomers and reassembled to form homogeneous SbsA-streptavidin S-layers or mixed SbsA-streptavidin/SbsA-S-layers. They can be used to bind biotinylated substances as well as to determine the binding capacity of enzymes and other bound molecules.
- the resulting fusion protein can be used for therapeutic or diagnostic purposes. Hence it can be attempted by administration of the fusion protein to convert a T H 2-directed IgE antibody reaction into a T H 1-mediated reaction against BetvI. In this manner it is possible to suppress the occurrence of symptoms of a pollen allergy. Furthermore SbsA-BetvI fusion proteins can be used to test for anti-BetvI antibody concentrations or/and to reduce high concentrations of anti-BetvI IgE.
- the fusion proteins can be used to test gB-specific immune reactions.
- a Western blot analysis using a monoclonal antibody which corresponds to the inserted sequence showed the immunological activity of the viral domain within the recombinant SbsA-SmaBB proteins.
- a regular arrangement of polypeptide structures with enzymatic activity on the surface of S-layers is an important goal in the production of immobilized enzymes within a living cell and in the case of the 590 amino acid long PHB synthase for the production of a molecular machine for biopolymer synthesis.
- the phbC gene was isolated by PCR from the plasmid p4A (Janes et al., Molecular characterisation of the poly- ⁇ -hydroxy-butyrate biosynthesis in Alcaligenes eutrophus H16. In: Novel Biodegradable Microbial Polymers (publisher Daves, E. A.), pp 175-190 (1990), Kluver, Dordrecht) as a 1770 nt long DNA fragment (corresponding to an open reading frame of 590 amino acids) and inserted into the ApaI cleavage site of the sbsA clone pSL878 to obtain the plasmid pSbsA-PhbC.
- the E. coli cells which contained the plasmid pSbsA-PhbC were co-transformed with the plasmid pUMS which contains the ⁇ -ketothiolase (PhbA) and the acetoacetyl-CoA reductase (PhbB) from A. eutrophus (Kalousek et al., Genetic engineering of PHB-synthase from Alcaligenes eutrophus H16. In: Proceedings of the International Symposium on Bacterial Polyhydroxy-alkanoates, pp 426-427 (1993), publisher Schlegel H.
- a monocistronic LuxAB gene with a length of 2,070 nt which contains the fusion protein LuxAB composed of the two subunits LuxB and LuxB of the bacterial luciferase from Vibrio harveyi was isolated from the plasmid pT7-mut3 (Boylan et al., J. Biol. Chem. 264 (1989), 1915-1918) by PCR and inserted into the ApaI site of the clone pSL878 prepared in example 8.1 to obtain the plasmid pBK878-LuxAB. It was possible to detect the expression of an SbsA-PhbC fusion protein of ca. 207 kD in an E. coli cell transformed with this plasmid. The enzymatic activity of,the fusion protein was demonstrated by the method described in Boylan et al., Supra.
- the basis for the isolation of the sbsB gene was the amino acid sequence of the N-terminus as well as the sequence of three internal peptides of the SbsB protein. Starting with these peptide sequences, degenerate oligonucleotide primers were constructed and used for the PCR. In this manner a 1076 bp long PCR fragment from the chromosomal DNA of B.stearothermophilus was amplified, cloned and sequenced (corresponding to position 100-1176 of the sequence shown in SEQ ID No.5).
- the integration vector pX (Kim, L., Mogk, A. and Schumann W., Gene 181 (1996), 71-76: A xylose-inducible Bacillus subtilis integration vector and its application) was used for the heterologous expression of sbsA and sbsB in B. subtilis.
- the S-layer genes in the resulting recombinant expression vectors are under the transcriptional control of the xyl promoter.
- Transformants of B.subtilis containing an S-layer gene integrated in the chromosome exhibited an expression of large amounts of S-layer proteins in the supernatant of the cells which was inducible by addition of xylose to the growth medium. This shows that the signal sequences of sbsA and sbsB are recognized by the B. subtilis cell.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Gastroenterology & Hepatology (AREA)
- General Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Botany (AREA)
- Communicable Diseases (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Oncology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Virology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Laminated Bodies (AREA)
Abstract
Description
- The present invention concerns processes for the recombinant production of S-layer proteins and modified S-layer proteins in gram-negative host cells.
- Crystalline bacterial cell surface layers (S-layers) form the outermost cell wall component in many eubacteria and most of the archaebacteria (Sleytr et al. (1988), Crystalline Bacterial Cell Surface Layers, “Springer Verlag Berlin”; Messner and Sleytr, Adv. Microb. Physiol. 33 (1992), 213-275). Most of the presently known S-layer proteins are composed of identical proteins or glycoproteins which have apparent molecular weights in the range of 40,000 to 220,000. The components of S-layers are self-assembling and most of the lattices have an oblique (p2), quadratic (p4) or hexagonal (p6) symmetry. The functions of bacterial S-layers are still not completely understood but due to their location on the cell surface the porous crystalline S-layers probably serve mainly as protective coatings, molecular sieves or to promote cell adhesion and surface recognition.
- Genetic data and sequence information are known for various S-layer genes from microorganisms. A review may be found in Peyret et al., Mol. Microbiol. 9 (1993), 97-109. Explicit reference is made to these data. The sequence of the sbsA gene coding for the S-layer protein of B.stearothermophilus PV72 and a process for cloning it are stated in Kuen et al. (Gene 145 (1994), 115-120). B.stearothermophilus PV72 is a gram-positive bacterium which is covered with a hexagonally arranged S-layer. The main component of the S-layer is a 128 kd protein which is the most frequent protein in the cell with a proportion of about 15% relative to the total protein components. Various strains of B.stearothermophilus have been characterized which differ with regard to the type of the S-layer lattice, the molecular weight and glycosilation of the S-layer components (Messner and Sleytr (1992), supra).
- The German Patent Application P 44 25 527.6 discloses the signal peptide-coding section of the S-layer gene from B.stearothermophilus and the amino acid sequence derived therefrom. The cleavage site between the signal peptide and the mature protein is located between position 30 and 31 of the amino acid sequence. The signal peptide-coding nucleic acid can be operatively linked to a protein-coding nucleic acid and can be used for the recombinant production of proteins in a process in which a transformed host cell is provided, the host cell is cultured under conditions which lead to an expression of the nucleic acid and to production and secretion of the polypeptide coded thereby and the resulting polypeptide is isolated from the culture medium. Prokaryotic organisms are preferably used as host cells in particular gram-positive organisms of the genus bacillus.
- Surprisingly it was found that the recombinant production of S-layer proteins is not only possible in gram-positive prokaryotic host cells but also in gram-negative prokaryotic host cells. In this case the S-layer protein is not formed in the interior of the host cell in the form of ordered inclusion bodies but rather unexpectedly in the form of ordered monomolecular layers.
- Hence one subject matter of the present invention is a process for the recombinant production of S-layer proteins characterized in that (a) a gram-negative prokaryotic host cell is provided which is transformed with a nucleic acid coding for an S-layer protein selected from (i) a nucleic acid which comprises the nucleotide sequence shown in SEQ ID NO. 1 from position 1 to 3684 optionally without the section coding for the signal peptide, (ii) a nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions; (b) the host cell is cultured under conditions which lead to an expression of the nucleic acid and to production of the polypeptide coded thereby and (c) the resulting polypeptide is isolated from the host cell.
- The term “stringent hybridization” is understood within the sense of the present invention to mean that a hybridization still also occurs after washing at 55° C., preferably 60° C. in an aqueous low salt buffer (e.g. 0.2×SSC) (see also Sambrook et al. (1989), Molecular Cloning. A Laboratory Manual).
- The process according to the invention is carried out in gram-negative prokaryotic host cells. In this process an ordered S-layer protein structure is surprisingly obtained in the cell interior. Enterobacteria, in particular E. coli, are preferably used as host cells.
- The E. coli strain pop2125 which was deposited on the 31.01.1996 at the “Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH”, Mascheroder Weg 1b, D 38124 Braunschweig under the file number DSM 10509 is particularly preferred.
- The process according to the invention can also be used to isolate recombinant S-layer proteins. For this one uses a nucleic acid coding for the S-layer protein which contains one or several insertions which code for peptide or polypeptide sequences. These insertions can, on the one hand, only code for peptides with a few amino acids e.g. 1-25 amino acids. On the other hand, the insertions can also code for larger polypeptides of for example up to 1000 amino acids and preferably up to 500 amino acids without loss of the ability of the S-layer protein to form a correctly folded structure. In addition to the insertions the recombinant S-layer protein can also have amino acid substitutions, in particular substitutions of individual amino acids in the region of the insertion sites as well as optionally deletions of individual amino acids or short amino acid sections of up to 30 amino acids.
- Regions between the positions 1-1200 and 2200-3000 of the nucleotide sequence shown in SEQ ID NO.1 are preferred as insertion sites for polypeptide-coding sequences. Particularly preferred insertion sites are the NruI cleavage site at position 582, the PvuII cleavage site at position 878, the SnaB-I cleavage site at position 917, the PVuII cleavage site at position 2504 and the PvuII cleavage site at position 2649. It was already possible to demonstrate the insertion of a nucleic acid coding for streptavidin into the NruI cleavage site at position 581.
- The peptide or polypeptide-coding insertions are preferably selected from nucleotide sequences which code for cysteine residues, regions with several charged amino acids, e.g. Arg, Lys, Asp or Glu, or Tyr residues, DNA-binding epitopes, antigenic, allergenic or immunogenic epitopes, metal-binding epitopes, streptavidin, enzymes, cytokines or antibody-binding proteins.
- A particularly preferred example of an insertion into the nucleic acid coding for the S-layer protein is a nucleotide sequence coding for streptavidin. In this manner it is possible to obtain universal carrier molecules which are suitable for coupling biotinylated reagents and for detection in immunological or hybridization test procedures.
- A further preferred example of insertions are antigenic, allergenic or immunogenic epitopes e.g. epitopes from pathogenic microorganisms such as bacteria, fungi, parasites etc. and viruses, or epitopes from plants or epitopes against endogenous substances e.g. cytokines as well as against toxins in particular endotoxins. Particularly preferred examples of immunogenic epitopes are epitopes from herpes viruses such as the herpes virus 6 or pseudorabies virus (Lomniczi et al., J. Virol. 49 (1984), 970-979), in particular epitopes from the genes gB, gC or/and gD, or foot-and-mouth disease virus (FMDV), in particular epitopes from the gene sections which code for VP1, VP2 or/and VP3. The immunogenic epitopes can be selected such that they promote an antibody-mediated immune reaction or/and the production of a cellular immune reaction e.g. by stimulation of T cells. Examples of suitable allergenic epitopes are birch pollen allergens e.g. Bet v I (Ebner et al., J. Immunol. 150 (1993) 1047-1054). Antigenic epitopes are additionally particularly preferred which are able to bind and filter out endogenous or exogenous substances such as cytokines or toxins from serum or other body fluids. Such epitopes can include components of cytokine or toxin receptors or of antibodies against cytokines or toxins.
- On the other hand the insertions can also code for enzymes. Preferred examples are enzymes for the synthesis of polyhydroxybutyric acid e.g. PHB synthase. Incorporation of PHB synthase into the S-layer can lead to the formation of a molecular spinning nozzle under suitable conditions when the substrate hydroxybutyric acid is provided. A further preferred example of an enzyme is bacterial luciferase. In this case when the enzyme substrate, an aldehyde, is supplied and O 2 is present, a molecular laser can be obtained.
- Insertions are likewise preferred which code for cytokines such as interleukins, interferones or tumour necrosis factors. These molecules can for example be used in combination with immunogenic epitopes to prepare vaccines.
- Finally insertions are also preferred which code for antibody binding proteins such as protein A or protein G or for DNA-binding or/and metal-binding epitopes such as the leucine zipper, zinc finger etc.
- Thus for the first time a cell is provided by the present invention which contains immobilized recombinant polypeptides in a native form e.g. active enzymes in the cytoplasm. In this manner 50,000-200,000 e.g. ca. 100,000 recombinant molecules can be immobilized per m 2 recombinant S-layer. Up to 3000 m2 S-layer can be obtained per kg recombinant E. coli cells.
- In the method according to the invention the nucleic acid coding for the S-layer protein is preferably used in operative linkage with a nucleic acid coding for a signal peptide of gram-positive bacteria i.e. the signal peptide-coding nucleic acid is located on the 5′ side of the S-layer protein-coding nucleic acid. Surprisingly it was found that the presence of such signal peptide sequences, which are not cleaved in the gram-negative host cells used in the invention, can improve the stability of the S-layer structures. The nucleic acid coding for the signal peptide particularly preferably comprises (a) the signal peptide-coding section of the nucleotide sequence shown in SEQ ID NO. 1, (b) a nucleotide sequence corresponding to the sequence from (a) within the scope of the degeneracy of the genetic code or/and (c) a nucleotide sequence which is at least 80% and in particular at least 90% homologous to the sequences from (a) or/and (b).
- Yet a further subject matter of the present invention is a nucleic acid which codes for a recombinant S-layer protein and is selected from (i) a nucleic acid which comprises the nucleotide sequence shown in SEQ ID NO.1 from position 1 to 3684 optionally without the signal peptide-coding-section (ii) a nucleic acid which comprises a nucleotide sequence corresponding to a nucleic acid from (i) within the scope of the degeneracy of the genetic code and (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes under stringent conditions with the nucleic acids from (i) or/and (ii).
- The coding nucleotide sequence of the S-layer gene sbsA from B.stearothermophilus including the signal peptide-coding section is shown in SEQ ID NO. 1. The signal peptide-coding section extends from position 1 to 90 of the nucleotide sequence shown in SEQ ID NO. 1. The section coding for the mature SbsA polypeptide extends from position 91 to 3684.
- The sbsA gene of B.stearothermophilus codes for a protein with a total of 1228 amino acids including an N-terminal signal peptide with 30 amino acids (SEQ ID NO. 2). The cleavage site between the signal peptide and the mature protein is located between position 30 and 31 of the amino acid sequence. The signal peptide has a basic amino-terminal domain followed by a hydrophobic domain.
- Sequence comparisons with other signal peptides indicate a certain homology to signal peptides of extracellular proteins in bacilli such as alkaline phosphatase and neutral phosphatase of B.amyloliquefaciens (Vasantha et al., J. Bacteriol. 159 (1984), 811-819) as well as with the signal peptides for the B.sphaericus gene 125 (Bowditch et al., J. Bacteriol. 171 (1989), 4178-4188) and the OWP gene of B.brevis (Tsuboi et al., J. Bacteriol. 168 (1986), 365-373).
- A further subject matter of the present invention is a recombinant vector which contains at least one copy of a nucleic acid according to the invention. The vector is preferably replicatable in prokaryotes. The vector is particularly preferably a prokaryotic plasmid.
- Yet a further subject matter of the present invention is a host cell which is transformed with a nucleic acid or a recombinant vector according to the present invention. The cell is preferably a gram-negative prokaryotic organism and most preferably an E. coli cell. The cell according to the invention can contain a recombinant S-layer structure in its interior. Methods for the transformation of cells with nucleic acids are general state of the art (cf. Sambrook et al., supra) and therefore do not need to be elucidated.
- Yet a further subject matter of the present invention is a recombinant S-layer protein which contains at least one peptide insertion or/and polypeptide insertion within the amino acid sequence shown in SEQ ID NO. 2. Preferred examples of peptide insertions and polypeptide insertions have already been elucidated.
- A recombinant S-layer structure can be assembled from recombinant S-layer protein molecules according to the invention which contain at least one recombinant S-layer protein according to the invention as a subunit. Furthermore it is preferred that the S-layer structure according to the invention also contains non-modified S-layer proteins as diluent molecules. The non-modified S-layer proteins are preferably present in a molar proportion of 10-99% relative to the total S-layer proteins.
- The S-layer structure according to the invention can comprise several layers that are covalently linked together or by means of affinity binding. Covalent linkages can for example be introduced by insertions of cysteine residues and a subsequent formation of cystine bridges. Linkages by affinity binding comprise for example antibody-antigen, antibody-protein A or antibody-protein G or streptavidin-biotin interactions.
- S-layer structures which contain recombinant S-layer proteins can optionally also be prepared in a carrier-bound form. For this the S-layer structure can be reassembled from individual units in the presence of a peptidoglycan carrier to for example produce peptidoglycan layers which are covered on one or on both sides with an S-layer structure. Another method of preparing carrier-bound S-layer structures is to produce an S-layer layer at an interface between two media e.g. water/air and to immobilize this layer on a solid phase e.g. a filter membrane (cf. e.g. Pum and Sleytr (1994), Thin Solid Films 244, 882-886; Küpcü et al., (1995), Biochim. Biophys. Acta 1235, 263-269).
- The recombinant S-layer proteins and S-layer structures according to the invention are suitable for a multitude of applications. An application as a vaccine or adjuvant is particularly preferred in which case recombinant S-layer proteins are used which contain immunogenic epitopes of pathogens and/or endogenous immunostimulatory polypeptides such as cytokines. In this application it is not absolutely necessary to purify the recombinant S-layer proteins. Instead they can for example be used in combination with a bacterial ghost which optionally contains additional immunogenic epitopes in its membrane.
- The preparation of suitable “bacterial ghosts” is described for example in the International Patent application PCT/EP91/00967 to which reference is herewith made. In this application modified bacteria are disclosed which are obtainable by transformation of a gram-negative bacterium with the gene of a lytically active membrane protein from bacteriophages, with the gene of a lytically active toxin release protein or with genes which contain partial sequences thereof which code for lytic proteins, culturing the bacterium, expression of this lysis gene and isolation of the resulting bacterial ghost from the culture medium.
- A recombinant protein, which is obtainable by expression of a recombinant DNA in these gram-negative bacteria, can be bound to the membrane of these bacteria as described in the European Patent 0 516 655. This recombinant DNA comprises a first DNA sequence which codes for a hydrophobic, non-lytically active membrane-integrating protein domain which has an α-helical structure and is composed of 14-20 amino acids which can be flanked N- and C-terminally by 2-30 arbitrary amino acids in each case. A second DNA sequence is in operative linkage with this first DNA sequence which codes for a desired recombinant protein. Furthermore the gram-negative bacterium contains a third DNA sequence which is under a different control from the first and second DNA sequences and codes for a lytically active membrane protein from bacteriophages or a lytically active toxin release protein or for their lytically active components. So-called “bacterial ghosts” are obtained by expression and lysis of such recombinant gram-negative bacteria which contain an intact surface structure with immunogenic epitopes bound to the surface.
- When these bacterial ghosts are combined with recombinant S-layers according to the invention vaccines and adjuvants can be produced which have particularly advantageous properties.
- A further particularly preferred application for recombinant S-layer proteins and S-layer structures is their use as an enzyme reactor. Such an enzyme reactor can for example be formed by a cell which contains a recombinant S-layer structure according to the invention in its interior. On the other hand the enzyme reactor can also be formed from isolated and in vitro reassembled S-layer structures or combinations of various S-layer structures.
- It was found that the gram-positive bacterium B.stearothermophilus PV72 has an additional S-layer protein in addition to SbsA which is subsequently denoted as SbsB (Sara and Sleytr (1994), J. Bacteriol. 176, 7182-7189). It was possible to isolate and characterize the sbsB gene by amplification using suitable nucleic acid primers. The coding nucleotide sequence of the S-layer gene sbsB from B.stearothermophilus including the signal peptide-coding section which extends from position 1 to 93 of the nucleic acid sequence is shown in SEQ ID NO.5. The amino acid sequence derived therefrom is shown in SEQ ID NO.6. The sbsB gene codes for a protein with a total of 921 amino acids including an N-terminal signal peptide with 31 amino acids.
- One subject matter of the present invention is hence a nucleic acid which codes for an S-layer protein and is selected from
- (i) a nucleic acid which comprises the nucleotide sequence from position 1 to 2763 shown in SEQ ID NO.5 optionally without the signal peptide-coding section,
- (ii) a nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and
- (iii) a nucleic acid which comprises a nucleotide sequence that hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- As in the case of the sbsA-gene, it is also possible to insert at least one nucleic acid insertion coding for a peptide or polypeptide into the sbsB gene within the region coding for the S-layer protein. With regard to preferred examples of insertions in the sbsB gene reference is made to the previous statements regarding the sbsA gene.
- Yet a further subject matter of the present invention is a vector which contains at least one copy of an sbsB gene optionally containing an insertion. This vector can be replicated in eukaryotes, prokaryotes or in eukaryotes and prokaryotes. It can be a vector that can be integrated into the genome of the host cell or a vector which is present extrachromosomally. The vector according to the invention is preferably a plasmid in particular a prokaryotic plasmid.
- Yet a further subject matter of the present invention is a host cell which is transformed with an sbsB gene wherein the sbsB gene optionally can contain an insertion. The host cell can be a eukaryotic as well as a prokaryotic cell. The cell is preferably a prokaryotic organism. Gram-positive organisms e.g. organisms of the genus bacillus as well as gram-negative organisms such as enterobacteria in particular E. coli are preferred. Methods for transforming eukaryotic and prokaryotic cells with nucleic acids are known and therefore do not need to be elucidated in detail.
- The present invention also concerns an SbsB protein i.e. an S-layer protein which is coded by a nucleic acid as defined above. Recombinant SbsB proteins are particularly preferred which contain one or several peptide or/and polypeptide insertions within the sbsB sequence. The SbsB part of a polypeptide according to the invention particularly preferably has a homology of at least 80% and in particular of at least 90% to the amino acid sequence shown in SEQ ID NO.6.
- A recombinant S-layer structure can also be assembled from the recombinant SbsB-S-layer protein molecules analogous to the recombinant SbsA-S-layer structure. In this structure the non-modified S-layer proteins are preferably present in a molar proportion of 10-99% relative to the total S-layer proteins.
- The applications for the recombinant SbsB-S-layer proteins and S-layer structures according to the invention also correspond to the applications for SbsA mentioned above. In this connection its use as a vaccine or adjuvant or as an enzyme reactor is noteworthy.
- Recombinant S-layer proteins are obtainable by a process in which
- (a) a host cell is provided which contains a nucleic acid coding for an S-layer protein which contains a peptide-coding or polypeptide-coding insertion within the region coding for the S-layer protein,
- (b) the host cell is cultured under conditions which lead to an expression of the nucleic acid and to production of the polypeptide coded by it and
- (c) the resulting polypeptide is isolated from the host cell or from the culture medium.
- In a first preferred embodiment of this process a recombinant SbsA-S-layer protein is prepared i.e. the nucleic acid coding for the recombinant S-layer protein is selected from
- (i) a nucleic acid which comprises the nucleotide sequence from position 1 to 3684 shown in SEQ ID NO.1 optionally without the signal peptide-coding section,
- (ii) a nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and
- (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- In a second preferred embodiment a recombinant SbsB-S-layer protein is prepared i.e. the nucleic acid coding for the recombinant S-layer protein is selected from
- (i) a nucleic acid which comprises the nucleotide sequence from position 1 to 2763 shown in SEQ ID NO.5 optionally without the signal peptide-coding section,
- (ii) a nucleic acid which comprises a nucleotide sequence corresponding to the nucleic acid from (i) within the scope of the degeneracy of the genetic code and
- (iii) a nucleic acid which comprises a nucleotide sequence which hybridizes with the nucleic acids from (i) or/and (ii) under stringent conditions.
- In addition to the recombinant SbsA and SbsB-S-layer proteins from B.stearothermophilus it is, however, also possible to prepare recombinant S-layer proteins from other organisms (cf. e.g. Peyret et al., (1993), supra).
- The recombinant S-layer proteins can on the one hand be produced in a heterologous host cell i.e. in a host cell which originally contains no S-layer gene. Examples of such heterologous host cells are gram-negative prokaryotic organisms such as E. coli.
- However, the heterologous expression of S-layer proteins can also take place in gram-positive prokaryotic organisms such as B. subtilis. For this integration vectors are preferably used which contain a native or/and a recombinant S-layer gene. When the native signal sequences are used the S-layer proteins are secreted into the culture supernatant.
- However, it is often preferable to produce the recombinant S-layer proteins in homologous host cells i.e. host cells which originally contain a natural S-layer gene. In one embodiment of this homologous expression the recombinant S-layer gene is introduced into the host cell in such a way that the host cell is still able to express a further S-layer gene which codes for a non-modified S-layer protein. The non-modified S-layer protein is preferably capable of forming an S-layer structure that is compatible with the recombinant S-layer protein. An example of this embodiment of homologous expression is a B.stearothermophilus PV72 cell which contains intact natural sbsA genes or/and sbsB genes and is transformed with a plasmid which contains a recombinant S-layer-gene.
- In a second embodiment the homologous expression can occur in a host cell in which the intact S-layer gene originally present has been inactivated. Consequently in this embodiment no further S-layer gene is expressed in the host cell which codes for a non-modified S-layer protein which is able to form a compatible S-layer structure with the recombinant S-layer protein. A specific example of such a host cell is a B.stearothermophilus PV72 cell in the genome of which a gene coding for a recombinant S-layer protein has been introduced, e.g. by homologous recombination, which replaces the original S-layer gene. A further example of such a host cell is a B.stearothermophilus cell in which the native S-layer gene has been inactivated e.g. by site-specific mutagenesis or/and homologous recombination and is transformed with a vector containing a recombinant S-layer gene.
- Gram-positive prokaryotic organisms are usually used as host cells for the homologous expression of recombinant S-layer genes. B.stearothermophilus PV72 is particularly preferred as a host cell which can be cultured at a high temperature in a defined synthetic medium (Schuster et al., (1995), Biotechnol. and Bioeng. 48: 66-77).
- The present invention is further elucidated by the following examples and figures.
- SEQ ID NO.1 shows the complete nucleotide sequence of the coding section of the S-layer gene sbsA of B.stearothermophilus;
- SEQ ID NO.2 shows the amino acid sequence derived therefrom;
- SEQ ID NO.3 shows the nucleotide sequence of the primer T5-X;
- SEQ ID NO.4 shows the nucleotide sequence of the primer E;
- SEQ ID NO.5 shows the complete nucleotide sequence of the coding section of the S-layer gene sbsB of B.stearothermophilus;
- SEQ ID NO.6 shows the amino acid sequence derived therefrom;
- SEQ ID NO.7 shows the nucleotide sequence of a partial fragment of the streptavidin gene;
- SEQ ID NO.8 shows the nucleotide sequence of the primer NIS 2AG;
- SEQ ID NO.9 shows the nucleotide sequence of the primer LIS C3;
- FIG. 1 shows a schematic representation of the sbsA PCR fragment used to prepare the recombinant vector pBK4;
- FIG. 2 shows a schematic representation of peptide insertions in the amino acid sequence of the SbsA S-layer protein and
- FIG. 3 shows a schematic representation of amino acid substitutions and amino acid insertions in recombinant S-layer proteins.
- 1. Bacterial Strains, Media and Plasmids
- Gram-positive bacteria of the strain Bacillus stearothermophilus PV72 were cultured at 58° C. in SVIII medium (Bartelmus and Perschak, Z. Zuckerrind. 7 (1957), 276-281). Bacteria of the strain E. coli pop2135 (endA, thi, hsdr, malT, cI857, λpR, malPQ) were cultured in LB medium (Sambrook et al., (1989), supra). Ampicillin was added to the medium at a final concentration of 100 μg/ml to select for transformants. The plasmid pPLcAT10 (λpL, bla, colE1) (Stanssens et al., Gene 36 (1985), 211-223) was used as the cloning vector.
- 2. Manipulation of DNA Fragments
- Restriction analysis of DNA, agarose gel electrophoresis and cloning of DNA fragments were carried out according to the standard methods described in Sambrook et al. (1989), supra.
- Competent cells were transformed by electroporation using a Bio-Rad gene pulser (Bio-Rad Laboratories, Richmond, Calif. USA) according to the manufacturer's instructions.
- Plasmid DNA was isolated by the method of Birnboim and Doly (Nucleic Acids Res. 7 (1979), 1513-1523). Chromosomal DNA was isolated according to the method described in Ausubel et al. (Current Protocols in Molecular Biology (1987), New York, John Wiley).
- Restriction endonucleases and other enzymes were obtained from Boehringer Mannheim, New England Biolabs or Stratagene and used according to the manufacturer's instructions.
- 3. DNA Sequencing
- The DNA sequences of the 5′ regions and the 3′ regions (including the region coding for the signal sequence) of the gene sbsA in the vector pPLcAT10 were determined by the dideoxy chain termination method of Sanger et al. The primers used for sequencing were constructed on the basis of the already published sbsA sequence (Kuen et al. Gene 145 (1994), 115-120).
- 4. PCR Amplification of sbsA
- The PCR amplification of the sbsA gene was carried out in a reaction volume of 100 μl in which 200 μM deoxynucleotides, 1 U Pfu-polymerase (Stratagene), 1× Pfu-reaction buffer, 0.5 μM of each oligonucleotide primer and 100 ng genomic DNA from B.stearothermophilus as a template were present. The amplification was carried out for 30 cycles in a thermocycler (Biomed thermocycler 60). Each cycle was composed of a denaturing step of 1.5 min at 95° C., an annealing step of 1 min at 56° C. and 1 min at 50° C. as well as an extension step of 2 min at 72° C.
- The primer T5-X shown in the sequence protocol as SEQ ID NO.3 which flanks the 5′ region of sbsA and contains an XbaI site and the primer E shown in the sequence protocol in SEQ ID NO.4 which flanks the 20 nucleotide upstream region of the transcription terminator of the sbsA sequence and contains a BamHI site were used as primers.
- The products amplified by PCR were electrophoretically separated on a 0.8% agarose gel and purified for cloning using the system from Gene Clean (BIO101 La Jolla, Calif. USA).
- 5. Cloning of the sbsA Gene into the Vector pPLcAT10
- The sbsA gene obtained by PCR with a length of 3.79 kb was purified and cleaved with the restriction endonucleases XbaI and BamHI. The resulting XbaI-BamHI fragment was cloned into the corresponding restriction sites of the vector pPLcAT10 so that the sbsA gene was under transcriptional control of the pL promoter located upstream. The ATG start codon of the sbsA sequence was reconstructed by the cloning procedure. The cloned sbsA sequence contained the N-terminal signal sequence of sbsA and ended 20 nt after the transcription terminator. After ligation of the vector DNA with the sbsA fragment, the E. coli strain pop2135 was transformed by electrotransformation. The resulting clones were subjected to a DNA restriction analysis. A positive clone was sequenced in order to verify the correct sequence transitions at the 5′ and 3′ ends. This clone was named pBK4.
- A schematic representation of the 3.79 kb XbaI sbsA fragment and its location in the multiple cloning site of the plasmid pBK4 is shown in FIG. 1 (abbreviations: tT: transcription terminator; ori: origin of the DNA replication; amp: ampicillin resistance gene).
- 6. Recombinant Expression of the SbsA Gene in E. coli
- E. coli pop2135/pBK4 cells were cultured at 28° C. until an optical density OD600 of 0.3 was reached. Then the expression of sbsA was induced by increasing the culture temperature from 28° C. to 42° C. 1.5 ml aliquots were taken before and 1, 2, 3 and 5 hours after induction of the sbsA expression. E. coli pop2135/pPLcAT10 (cultured under the same conditions) and B.stearothermophilus PV72 were used as controls.
- Culture supernatants and cell extracts from all samples were examined for the expression of S-layer proteins by SDS-PAGE and Western immunoblotting.
- An additional strong protein band with the same molecular weight as the wild type SbsA protein was found in extracts from E. coli cells transformed with pBK4. No degradation products of SbsA itself were found in a period of up to 5 hours after induction of expression. Thus presumably the S-layer protein sbsA is stable in E. coli and is not degraded by proteases.
- A densitometric determination of the relative amount of SbsA protein was carried out. At a time point of 4 hours after induction the sbsA protein was in a proportion of ca. 16% relative to the total cellular protein.
- The SbsA protein produced in E. coli migrated in the SDS gel slightly more slowly than the natural SbsA protein from B.stearothermophilus. Experiments to determine the N-terminal amino acid sequence of the SbsA protein by Edman degradation were not successful due to a blocking of the N-terminus. Thus presumably the signal sequence was not cleaved in E. coli.
- A Western blot analysis of total cell extracts and culture supernatants of E. coli/pBK4 also only yielded a single sbsA-specific protein band with a slightly higher molecular weight than wild type SbsA protein from stearothermophilus.
- For the Western blot the proteins were transferred onto a nitrocellulose membrane and incubated with a polyclonal antiserum against SbsA from rabbits. The preparation of this antiserum is described in Egelseer et al. (J. Bacteriol. 177 (1995), 1444-1451). A conjugate of goat anti-rabbit IgG and alkaline phosphatase was used to detect bound SbsA-specific antibodies.
- No SbsA protein could be detected from supernatants from E. coli cells transformed with pBK4 even after induction of sbsA gene expression. This shows that SbsA is not exported into the surrounding medium.
- 7. Location and Organisation of the S-layer Protein SbsA in the Cytoplasm of E. coli
- Cells of E. coli pop2135/pBK4 which were harvested from cultures 1, 2, 3 and 5 hours after induction of the S-layer protein expression were examined for the intracellular organisation of sbsA. Non-induced cells cultured at 28° C. and cells of B.stearothermophilus PV72 were examined as controls.
- For this whole cells of both organisms were fixed and embedded in detection resin according to the method of Messner et al. (Int. J.Syst.Bacteriol. 34 (1984), 202-210). Subsequently ultrathin sections of the embedded preparations were prepared and stained with uranyl acetate.
- The cytoplasm of non-induced E. coli cells exhibited the typical granular structure which did not change even when the OD of the suspensions increased. Longitudinal sections of E. coli cells which were harvested 1 hour after induction of the S-layer protein expression exhibited parallel, leaf-like structures in the cytoplasm. From cross sections it was apparent that these structures have a concentric arrangement.
- The amount of leaf-like structures considerably increased between 1 and 2 hours after induction of the sbsA expression and afterwards remained essentially constant.
- The sbsA protein recombinantly produced in E. coli could also be detected by immunogold labelling with sbsA-specific antibodies. An ordered structure of the recombinantly produced SbsA protein was also found with this detection method.
- It was clearly apparent from these morphological data that the SbsA protein did not aggregate to form irregular inclusion bodies but rather formed monomolecular S-layer crystals. A remarkable property of the SbsA-S-layer layers assembled in E. coli was the concentric arrangement at defined distances. The presence of the signal sequence did not interfere with correct assembly.
- 8. Preparation of Recombinant sbsA-S-Layer Genes
- 8.1 Insertion of a 6 bp Long DNA Sequence
- A modified kanamycin cassette (1.3 kb) was used for the site-specific insertion mutagenesis of the sbsA gene which was isolated by cleavage of the plasmid pWJC3 (obtained from W. T. McAllister, New York) by SmaI. The cassette was ligated into five different blunt-ended restriction sites of the sbsA gene, i.e. into the NruI site at position bp 582 (pSL582), into the SnaBI site at position bp 917 (pSL917) and into each of the PvuII sites at positions bp 878 (pSL878), bp 2504 (pSL2504) and bp 2649 (pSL2649). After selection of kanamycin-resistant clones, the cassette was removed from the insertion site by cleavage with ApaI followed by a religation of the S-layer plasmid pBK4. The cutting out and religation procedure left an insertion of 6 bp CCCGGG (ApaI restriction site). The system of this linker insertion is shown schematically in FIG. 2.
- The resulting recombinant S-layer genes code for modified sbsA proteins elongated by 2 amino acids.
- The specific changes in the primary structure of the sbsA proteins are shown in FIG. 3. In the clone pSL582 the insertion led to the incorporation of glycine and proline between the amino acids 194 and 195 at the N-terminus of the SbsA protein. The amino acids alanine and arginine were inserted in the clone pSL917 between the amino acids 306 and 307. In the clone pSL2649 glycine and proline were inserted between the amino acids at positions 883 and 884. An insertion of alanine and proline between the amino acids 293 and 294 was obtained in the clone pSL878. Furthermore the alanine at position 293 was substituted by glycine. In the clone pSL2504 the amino acids alanine and proline were inserted between the amino acids 835 and 836 and the alanine at position 835 was replaced by glycine.
- All clones obtained by insertion mutagenesis retained their ability to synthesise the S-layer protein.
- In order to test the ability of the modified proteins to assemble into S-layer structures, ultrathin longitudinal sections of whole cells which had been cultured for 4 hours under inductive conditions were prepared according to the procedure described in section 7. It was found that the cytoplasm of all five clones is filled with parallel, leaf-like structures which follow the curve of the cell poles. There were no morphological differences of the cytoplasm in the 5 different clones examined. Exactly the same leaf-like structures were found as in the assembly of the wild type SbsA protein in E. coli (section 7).
- 8.2 Insertion of a DNA Sequence Coding for Streptavidin
- In order to examine whether the insertion of larger protein sequences into the SbsA protein can also be tolerated, a DNA fragment coding for a part of streptavidin (160 amino acids) provided with ApaI linkers (SEQ ID NO.7) was gene inserted into the ApaI restriction site of the sbsA clones pSL582, pSL878, pSL917 and pSL2649 prepared in the example on page 1. The streptavidin sequence was inserted in SL582 in the codon 197, in pSL878 between codon 295 and 296, in pSL917 in the codon 308 and 309 and in pSL2649 in the codon 886. It was possible to detect the expression of SbsA-streptavidin fusion proteins in all constructs by SDS-PAGE and immunoblots. It was found by EM analysis that a self assembly of the S-layer structure was possible in the fusion proteins containing insertions in the codon 197 and between the codons 295 and 296.
- The SbsA-streptavidin fusion proteins can be isolated as monomers and reassembled to form homogeneous SbsA-streptavidin S-layers or mixed SbsA-streptavidin/SbsA-S-layers. They can be used to bind biotinylated substances as well as to determine the binding capacity of enzymes and other bound molecules.
- 8.3 Insertion of a DNA Sequence Coding for BetvI
- A DNA sequence coding for the open reading frame of BetvI (161 amino acids) the main pollen allergen of the birch (Ferreira et al., J. Biol. Chem. 268 (1993), 19574-19580) was inserted at the ApaI site into the sbsA clone pSL878. It was possible to detect the expression of an SbsA-BetvI fusion protein which contained an immunologically active BetvI domain.
- The resulting fusion protein can be used for therapeutic or diagnostic purposes. Hence it can be attempted by administration of the fusion protein to convert a T H2-directed IgE antibody reaction into a TH1-mediated reaction against BetvI. In this manner it is possible to suppress the occurrence of symptoms of a pollen allergy. Furthermore SbsA-BetvI fusion proteins can be used to test for anti-BetvI antibody concentrations or/and to reduce high concentrations of anti-BetvI IgE.
- 8.4 Insertion of a DNA Sequence Coding for a Pseudorabies Virus Antigen
- The DNA sequence coding for the gB epitope SmaBB (255 amino acids) (nucleotides 489-1224 corresponding to the coordinates according to the EMBL-Seq: HEHSSGP2) from the pseudorabies virus was inserted into SSPI site of the sbsA gene after nt 3484 (between codon 1161 and 1162). It was possible to detect the expression of SbsA-SmaBB fusion proteins.
- The fusion proteins can be used to test gB-specific immune reactions. A Western blot analysis using a monoclonal antibody which corresponds to the inserted sequence showed the immunological activity of the viral domain within the recombinant SbsA-SmaBB proteins.
- 8.5 Insertion of a DNA Sequence Coding for the PHB Synthase (PhbC) from Alcaligenes Eutrophus H16
- A regular arrangement of polypeptide structures with enzymatic activity on the surface of S-layers is an important goal in the production of immobilized enzymes within a living cell and in the case of the 590 amino acid long PHB synthase for the production of a molecular machine for biopolymer synthesis.
- The phbC gene was isolated by PCR from the plasmid p4A (Janes et al., Molecular characterisation of the poly-β-hydroxy-butyrate biosynthesis in Alcaligenes eutrophus H16. In: Novel Biodegradable Microbial Polymers (publisher Daves, E. A.), pp 175-190 (1990), Kluver, Dordrecht) as a 1770 nt long DNA fragment (corresponding to an open reading frame of 590 amino acids) and inserted into the ApaI cleavage site of the sbsA clone pSL878 to obtain the plasmid pSbsA-PhbC. It was possible to detect the expression of an SbsA-PhbC fusion protein of ca. 195 kD in an E. coli cell transformed with this plasmid. When two copies of the phbc gene were inserted one behind the other into the ApaI site of pSL878, it was possible to detect the expression of a fusion protein of ca. 260 kD.
- For a functional test of the enzymatic activity of the SbsA-PhbC construct, the E. coli cells which contained the plasmid pSbsA-PhbC were co-transformed with the plasmid pUMS which contains the β-ketothiolase (PhbA) and the acetoacetyl-CoA reductase (PhbB) from A. eutrophus (Kalousek et al., Genetic engineering of PHB-synthase from Alcaligenes eutrophus H16. In: Proceedings of the International Symposium on Bacterial Polyhydroxy-alkanoates, pp 426-427 (1993), publisher Schlegel H. G., Steinbüchel A. Goltze Press, Göttingen). The poly-β-hydroxybutyrate formation in the co-transformed E. coli cells was detectable by staining with Sudan black, gas chromatography and electron microscopy. These findings show that the SbsA-PhbC construct is enzymatically active and represents a successful example of the immobilization of enzymes on intracellular S-layer matrices.
- 8.6 Insertion of a DNA Sequence Coding for a Bacterial Luciferase Gene
- A monocistronic LuxAB gene with a length of 2,070 nt which contains the fusion protein LuxAB composed of the two subunits LuxB and LuxB of the bacterial luciferase from Vibrio harveyi was isolated from the plasmid pT7-mut3 (Boylan et al., J. Biol. Chem. 264 (1989), 1915-1918) by PCR and inserted into the ApaI site of the clone pSL878 prepared in example 8.1 to obtain the plasmid pBK878-LuxAB. It was possible to detect the expression of an SbsA-PhbC fusion protein of ca. 207 kD in an E. coli cell transformed with this plasmid. The enzymatic activity of,the fusion protein was demonstrated by the method described in Boylan et al., Supra.
- 9. Isolation and Characterization of the SbsB Gene
- The basis for the isolation of the sbsB gene was the amino acid sequence of the N-terminus as well as the sequence of three internal peptides of the SbsB protein. Starting with these peptide sequences, degenerate oligonucleotide primers were constructed and used for the PCR. In this manner a 1076 bp long PCR fragment from the chromosomal DNA of B.stearothermophilus was amplified, cloned and sequenced (corresponding to position 100-1176 of the sequence shown in SEQ ID No.5).
- The method of inverse PCR was used to amplify the sections on the 5′ side and 3′ side of the sbsB gene and stepwise overlapping DNA fragments were obtained with the aid of various primer combinations and sequenced.
- The primer NIS 2AG shown in the sequence protocol as SEQ ID NO.8 which contains the 5′ region of sbsB as well as the primer LIS C3 shown in the. sequence protocol of SEQ ID NO.9 which contains the 3′ region of sbsB were used as primers to amplify the complete sbsB gene.
- The PCR fragment obtained in this manner which contains the nucleotide sequence shown in SEQ ID NO.5 with 5′ and 3′ BamHI restriction cleavage sites was cloned as described in example 5 into the vector pPLcAT10 in which the expression takes place under the control of the lambda PL promoter.
- Furthermore the sbsB-PCR fragment with the 5′ side EcoRI and 3′ side BamHi cleavage site were cloned into the vector pUC18 in which the expression took place under the control of the lac promoter.
- The detection of the sbsB expression was carried out as described in examples 6 and 7 by SDS gel electrophoresis and electron microscopy.
- 10. Preparation of Recombinant SbsB-S-layer Genes
- Recombinant sbsB genes were prepared analogously to the methods described in example 8.
- Thus in accordance with the method described in example 8.1, a 6 nt long DNA sequence containing an ApaI restriction cleavage site was introduced at various positions into the sbsB-layer gene. The recombinant sbsB clones pAK407, pAK481 and pAK1582 with ApaI cleavage sites at nt 407 (codon 136), 481 (codon 161/162) and 1582 (codon 528/529) were obtained in this manner. These clones obtained by insertion mutagenesis retained their ability to synthesize the S-layer protein and form S-layer structures.
- Analogously to the method described in example 8.2, a DNA fragment coding for streptavidin was inserted into the ApaI restriction sites of the sbsB clones pAK407 and pAK481.
- Analogously to example 8.4, a DNA sequence coding for the gB epitope SmaBB was inserted into the ApaI cleavage sites of the sbsB clones pAK481 and pAK1582. It was possible to detect the expression of sbsB-SmaB fusion proteins of ca. 130 kD in the E. coli cells transformed with the resulting recombinant plasmids. When two copies of the SmaBB epitopes were inserted one behind the other into the ApaI cleavage site of pAK481 it was possible to detect the expression of a fusion protein of ca. 157 kD. The SmaBB domains of the fusion proteins were recognized by specific antibodies.
- Analogously to example 8.6 it was possible to detect the expression of a 175 kD SbsB-LuxAB fusion protein when the LuxAB sequence was inserted into the ApaI cleavage site of pAK407.
- 11. Heterologous Expression of sbsA and sbsB in Bacillus subtilis
- The integration vector pX (Kim, L., Mogk, A. and Schumann W., Gene 181 (1996), 71-76: A xylose-inducible Bacillus subtilis integration vector and its application) was used for the heterologous expression of sbsA and sbsB in B. subtilis. The S-layer genes in the resulting recombinant expression vectors are under the transcriptional control of the xyl promoter. Transformants of B.subtilis containing an S-layer gene integrated in the chromosome exhibited an expression of large amounts of S-layer proteins in the supernatant of the cells which was inducible by addition of xylose to the growth medium. This shows that the signal sequences of sbsA and sbsB are recognized by the B. subtilis cell.
- In an analogous manner it was possible to achieve a heterologous expression of recombinant sbsA and sbsB layer genes in B. subtilis.
-
1 10 1 3687 DNA Bacillus stearothermophilus CDS (1)..(3684) sig_peptide (1)..(90) mat_peptide (91)..(3684) 1 atg gat agg aaa aaa gct gtg aaa cta gca aca gca agt gct att gca 48 Met Asp Arg Lys Lys Ala Val Lys Leu Ala Thr Ala Ser Ala Ile Ala -30 -25 -20 -15 gca agt gca ttt gtc gct gca aat cca aac gct tct gaa gcg gct aca 96 Ala Ser Ala Phe Val Ala Ala Asn Pro Asn Ala Ser Glu Ala Ala Thr -10 -5 -1 1 gat gta gca aca gta gta agc caa gca aaa gca cag ttc aaa aaa gca 144 Asp Val Ala Thr Val Val Ser Gln Ala Lys Ala Gln Phe Lys Lys Ala 5 10 15 tac tat act tac agc cat aca gta acg gaa act ggt gaa ttc cca aac 192 Tyr Tyr Thr Tyr Ser His Thr Val Thr Glu Thr Gly Glu Phe Pro Asn 20 25 30 att aac gat gta tat gct gaa tac aac aaa gcg aaa aaa cga tac cgt 240 Ile Asn Asp Val Tyr Ala Glu Tyr Asn Lys Ala Lys Lys Arg Tyr Arg 35 40 45 50 gat gcg gta gca tta gtg aat aaa gca ggt ggc gcg aaa aaa gac gct 288 Asp Ala Val Ala Leu Val Asn Lys Ala Gly Gly Ala Lys Lys Asp Ala 55 60 65 tac tta gct gat tta caa aaa gaa tat gaa act tac gtt ttc aaa gca 336 Tyr Leu Ala Asp Leu Gln Lys Glu Tyr Glu Thr Tyr Val Phe Lys Ala 70 75 80 aac cct aaa tct ggc gaa gct cgt gta gca act tac atc gat gct tac 384 Asn Pro Lys Ser Gly Glu Ala Arg Val Ala Thr Tyr Ile Asp Ala Tyr 85 90 95 aac tat gca aca aaa tta gac gaa atg cgc caa gag cta gag gct gct 432 Asn Tyr Ala Thr Lys Leu Asp Glu Met Arg Gln Glu Leu Glu Ala Ala 100 105 110 gtt caa gca aaa gat tta gaa aaa gca gaa caa tac tat cac aaa att 480 Val Gln Ala Lys Asp Leu Glu Lys Ala Glu Gln Tyr Tyr His Lys Ile 115 120 125 130 cct tat gaa att aaa act cgc aca gtc att tta gat cgc gta tat ggt 528 Pro Tyr Glu Ile Lys Thr Arg Thr Val Ile Leu Asp Arg Val Tyr Gly 135 140 145 aaa aca act cgt gat tta ctt cgc tct aca ttt aaa gca aaa gca caa 576 Lys Thr Thr Arg Asp Leu Leu Arg Ser Thr Phe Lys Ala Lys Ala Gln 150 155 160 gaa ctt cgc gac agc tta att tat gat att acc gtt gca atg aaa gcg 624 Glu Leu Arg Asp Ser Leu Ile Tyr Asp Ile Thr Val Ala Met Lys Ala 165 170 175 cgc gaa gta caa gac gct gtg aaa gca ggc aat tta gac aaa gct aaa 672 Arg Glu Val Gln Asp Ala Val Lys Ala Gly Asn Leu Asp Lys Ala Lys 180 185 190 gct gct gtt gat caa atc aat caa tac tta cca aaa gta aca gat gct 720 Ala Ala Val Asp Gln Ile Asn Gln Tyr Leu Pro Lys Val Thr Asp Ala 195 200 205 210 ttc aaa act gaa cta aca gaa gta gcg aaa aaa gca tta gat gca gat 768 Phe Lys Thr Glu Leu Thr Glu Val Ala Lys Lys Ala Leu Asp Ala Asp 215 220 225 gaa gct gcg ctt act cca aaa gtt gaa agt gta agt gcg att aac act 816 Glu Ala Ala Leu Thr Pro Lys Val Glu Ser Val Ser Ala Ile Asn Thr 230 235 240 caa aac aaa gct gtt gaa tta aca gca gta cca gtg aac gga aca cta 864 Gln Asn Lys Ala Val Glu Leu Thr Ala Val Pro Val Asn Gly Thr Leu 245 250 255 aaa tta caa ctt tca gct gct gca aat gaa gat aca gta aac gta aat 912 Lys Leu Gln Leu Ser Ala Ala Ala Asn Glu Asp Thr Val Asn Val Asn 260 265 270 act gta cgt atc tat aaa gtg gac ggt aac att cca ttt gcc ctt aat 960 Thr Val Arg Ile Tyr Lys Val Asp Gly Asn Ile Pro Phe Ala Leu Asn 275 280 285 290 acg gca gat gtt tct tta tct aca gac gga aaa act atc act gtg gat 1008 Thr Ala Asp Val Ser Leu Ser Thr Asp Gly Lys Thr Ile Thr Val Asp 295 300 305 gct tca act cca ttc gaa aat aat acg gag tat aaa gta gta gtt aaa 1056 Ala Ser Thr Pro Phe Glu Asn Asn Thr Glu Tyr Lys Val Val Val Lys 310 315 320 ggt att aaa gac aaa aat ggc aaa gaa ttt aaa gaa gat gca ttc act 1104 Gly Ile Lys Asp Lys Asn Gly Lys Glu Phe Lys Glu Asp Ala Phe Thr 325 330 335 ttc aag ctt cga aat gat gct gta gtt act caa gtg ttt gga act aat 1152 Phe Lys Leu Arg Asn Asp Ala Val Val Thr Gln Val Phe Gly Thr Asn 340 345 350 gta aca aac aac act tct gta aac tta gca gca ggt act ttc gac act 1200 Val Thr Asn Asn Thr Ser Val Asn Leu Ala Ala Gly Thr Phe Asp Thr 355 360 365 370 gac gat act tta aca gta gta ttt gat aag ttg tta gca cct gaa act 1248 Asp Asp Thr Leu Thr Val Val Phe Asp Lys Leu Leu Ala Pro Glu Thr 375 380 385 gta aac agc tcg aac gtt act att aca gat gtt gaa act gga aaa cgc 1296 Val Asn Ser Ser Asn Val Thr Ile Thr Asp Val Glu Thr Gly Lys Arg 390 395 400 att cca gta att gca tct act tct ggt tct aca att act att acg tta 1344 Ile Pro Val Ile Ala Ser Thr Ser Gly Ser Thr Ile Thr Ile Thr Leu 405 410 415 aaa gaa gcg tta gta act ggt aaa caa tat aaa ctt gct atc aat aat 1392 Lys Glu Ala Leu Val Thr Gly Lys Gln Tyr Lys Leu Ala Ile Asn Asn 420 425 430 gtt aaa aca tta act ggt tac aat gca gaa gct tac gag tta gtg ttc 1440 Val Lys Thr Leu Thr Gly Tyr Asn Ala Glu Ala Tyr Glu Leu Val Phe 435 440 445 450 act gca aac gca tca gca cca act gtt gct acc gct cct act act tta 1488 Thr Ala Asn Ala Ser Ala Pro Thr Val Ala Thr Ala Pro Thr Thr Leu 455 460 465 ggt ggt aca act tta tct act ggt tct ctt aca aca aat gtt tgg ggt 1536 Gly Gly Thr Thr Leu Ser Thr Gly Ser Leu Thr Thr Asn Val Trp Gly 470 475 480 aaa ttg gct ggt ggt gtg aat gaa gct gga act tat tat cct ggt ctt 1584 Lys Leu Ala Gly Gly Val Asn Glu Ala Gly Thr Tyr Tyr Pro Gly Leu 485 490 495 caa ttc aca aca acg ttt gct act aag tta gac gaa tct act tta gct 1632 Gln Phe Thr Thr Thr Phe Ala Thr Lys Leu Asp Glu Ser Thr Leu Ala 500 505 510 gat aac ttt gta tta gtt gaa aaa gaa tct ggt aca gtt gtt gct tct 1680 Asp Asn Phe Val Leu Val Glu Lys Glu Ser Gly Thr Val Val Ala Ser 515 520 525 530 gaa cta aaa tat aat gca gac gct aaa atg gta act tta gtg cca aaa 1728 Glu Leu Lys Tyr Asn Ala Asp Ala Lys Met Val Thr Leu Val Pro Lys 535 540 545 gcg gac ctt aaa gaa aat aca atc tat caa atc aaa att aaa aaa ggc 1776 Ala Asp Leu Lys Glu Asn Thr Ile Tyr Gln Ile Lys Ile Lys Lys Gly 550 555 560 ttg aag tcc gat aaa ggt att gaa tta ggc act gtt aac gag aaa aca 1824 Leu Lys Ser Asp Lys Gly Ile Glu Leu Gly Thr Val Asn Glu Lys Thr 565 570 575 tat gag ttc aaa act caa gac tta act gct cct aca gtt att agc gta 1872 Tyr Glu Phe Lys Thr Gln Asp Leu Thr Ala Pro Thr Val Ile Ser Val 580 585 590 acg tct aaa aat ggc gac gct gga tta aaa gta act gaa gct caa gaa 1920 Thr Ser Lys Asn Gly Asp Ala Gly Leu Lys Val Thr Glu Ala Gln Glu 595 600 605 610 ttt act gtg aag ttc tca gag aat tta aat aca ttt aat gct aca acc 1968 Phe Thr Val Lys Phe Ser Glu Asn Leu Asn Thr Phe Asn Ala Thr Thr 615 620 625 gtt tcg ggt agc aca atc aca tac ggt caa gtt gct gta gta aaa gcg 2016 Val Ser Gly Ser Thr Ile Thr Tyr Gly Gln Val Ala Val Val Lys Ala 630 635 640 ggt gca aac tta tct gct ctt aca gca agt gac atc att cca gct agt 2064 Gly Ala Asn Leu Ser Ala Leu Thr Ala Ser Asp Ile Ile Pro Ala Ser 645 650 655 gtt gaa gcg gtt act ggt caa gat gga aca tac aaa gtg aaa gtt gct 2112 Val Glu Ala Val Thr Gly Gln Asp Gly Thr Tyr Lys Val Lys Val Ala 660 665 670 gct aac caa tta gaa cgt aac caa ggg tac aaa tta gta gtg ttc ggt 2160 Ala Asn Gln Leu Glu Arg Asn Gln Gly Tyr Lys Leu Val Val Phe Gly 675 680 685 690 aaa ggt gca aca gct cct gtt aaa gat gct gca aat gca aat act tta 2208 Lys Gly Ala Thr Ala Pro Val Lys Asp Ala Ala Asn Ala Asn Thr Leu 695 700 705 gca act aac tat atc tat aca ttt aca act gaa ggt caa gac gta aca 2256 Ala Thr Asn Tyr Ile Tyr Thr Phe Thr Thr Glu Gly Gln Asp Val Thr 710 715 720 gca cca acg gtt aca aaa gta ttc aaa ggt gat tct tta aaa gac gct 2304 Ala Pro Thr Val Thr Lys Val Phe Lys Gly Asp Ser Leu Lys Asp Ala 725 730 735 gat gca gtt act aca ctt acg aac gtt gat gca ggt caa aaa ttc act 2352 Asp Ala Val Thr Thr Leu Thr Asn Val Asp Ala Gly Gln Lys Phe Thr 740 745 750 atc caa ttt agc gaa gaa tta aaa act tct agt ggt tct tta gtg ggt 2400 Ile Gln Phe Ser Glu Glu Leu Lys Thr Ser Ser Gly Ser Leu Val Gly 755 760 765 770 ggc aaa gta act gtc gag aaa tta aca aac aac gga tgg gta gat gct 2448 Gly Lys Val Thr Val Glu Lys Leu Thr Asn Asn Gly Trp Val Asp Ala 775 780 785 ggt act gga aca act gta tca gtt gct cct aag aca gat gca aat ggt 2496 Gly Thr Gly Thr Thr Val Ser Val Ala Pro Lys Thr Asp Ala Asn Gly 790 795 800 aaa gta aca gct gct gtg gtt aca tta act ggt ctt gac aat aac gac 2544 Lys Val Thr Ala Ala Val Val Thr Leu Thr Gly Leu Asp Asn Asn Asp 805 810 815 aaa gat gcg aaa ttg cgt ctg gta gta gat aag tct tct act gat gga 2592 Lys Asp Ala Lys Leu Arg Leu Val Val Asp Lys Ser Ser Thr Asp Gly 820 825 830 att gct gat gta gct ggt aat gta att aag gaa aaa gat att tta att 2640 Ile Ala Asp Val Ala Gly Asn Val Ile Lys Glu Lys Asp Ile Leu Ile 835 840 845 850 cgt tac aac agc tgg aga cac act gta gct tct gtg aaa gct gct gct 2688 Arg Tyr Asn Ser Trp Arg His Thr Val Ala Ser Val Lys Ala Ala Ala 855 860 865 gac aaa gat ggt caa aac gct tct gct gca ttc cca aca agc act gca 2736 Asp Lys Asp Gly Gln Asn Ala Ser Ala Ala Phe Pro Thr Ser Thr Ala 870 875 880 att gat aca act aag agc tta tta gtt gaa ttc aat gaa act gat tta 2784 Ile Asp Thr Thr Lys Ser Leu Leu Val Glu Phe Asn Glu Thr Asp Leu 885 890 895 gcg gaa gtt aaa cct gag aac atc gtt gtt aaa gat gca gca ggt aat 2832 Ala Glu Val Lys Pro Glu Asn Ile Val Val Lys Asp Ala Ala Gly Asn 900 905 910 gcg gta gct ggt act gta aca gca tta gac ggt tct aca aat aaa ttt 2880 Ala Val Ala Gly Thr Val Thr Ala Leu Asp Gly Ser Thr Asn Lys Phe 915 920 925 930 gta ttc act cca tct caa gaa tta aaa gct ggt aca gtt tac tct gta 2928 Val Phe Thr Pro Ser Gln Glu Leu Lys Ala Gly Thr Val Tyr Ser Val 935 940 945 aca att gac ggt gtg aga gat aaa gta ggt aac aca atc tct aaa tac 2976 Thr Ile Asp Gly Val Arg Asp Lys Val Gly Asn Thr Ile Ser Lys Tyr 950 955 960 att act tcg ttc aag act gta tct gcg aat cca acg tta tct tca atc 3024 Ile Thr Ser Phe Lys Thr Val Ser Ala Asn Pro Thr Leu Ser Ser Ile 965 970 975 agc att gct gac ggt gca gtt aac gtt gac cgt tct aaa aca att aca 3072 Ser Ile Ala Asp Gly Ala Val Asn Val Asp Arg Ser Lys Thr Ile Thr 980 985 990 att gaa ttc agc gat tca gtt cca aac cca aca atc act ctt aag aag 3120 Ile Glu Phe Ser Asp Ser Val Pro Asn Pro Thr Ile Thr Leu Lys Lys 995 1000 1005 1010 gct gac gga act tca ttt act aat tac act tta gta aat gta aat aat 3168 Ala Asp Gly Thr Ser Phe Thr Asn Tyr Thr Leu Val Asn Val Asn Asn 1015 1020 1025 gaa aat aaa aca tac aaa att gta ttc cac aaa ggt gta aca ctt gac 3216 Glu Asn Lys Thr Tyr Lys Ile Val Phe His Lys Gly Val Thr Leu Asp 1030 1035 1040 gag ttt act caa tat gag tta gca gtt tca aaa gat ttt caa act ggt 3264 Glu Phe Thr Gln Tyr Glu Leu Ala Val Ser Lys Asp Phe Gln Thr Gly 1045 1050 1055 act gat att gat agc aaa gtt aca ttc atc aca ggt tct gtt gct act 3312 Thr Asp Ile Asp Ser Lys Val Thr Phe Ile Thr Gly Ser Val Ala Thr 1060 1065 1070 gac gaa gta aaa cct gct cta gta ggc gtt ggt tca tgg aat gga aca 3360 Asp Glu Val Lys Pro Ala Leu Val Gly Val Gly Ser Trp Asn Gly Thr 1075 1080 1085 1090 agc tat act cag gat gct gca gca aca cga ctt cgg tct gta gct gac 3408 Ser Tyr Thr Gln Asp Ala Ala Ala Thr Arg Leu Arg Ser Val Ala Asp 1095 1100 1105 ttc gtt gcg gag cca gtt gcc ctt caa ttc tca gaa ggt atc gat tta 3456 Phe Val Ala Glu Pro Val Ala Leu Gln Phe Ser Glu Gly Ile Asp Leu 1110 1115 1120 acg aat gca act gtg aca gta aca aat att act gat gat aaa act gtt 3504 Thr Asn Ala Thr Val Thr Val Thr Asn Ile Thr Asp Asp Lys Thr Val 1125 1130 1135 gaa gtt att tca aaa gag agt gta gac gca gac cat gat gca ggt gct 3552 Glu Val Ile Ser Lys Glu Ser Val Asp Ala Asp His Asp Ala Gly Ala 1140 1145 1150 act aag gag aca tta gta att aac aca gtt act cct tta gta ctt gat 3600 Thr Lys Glu Thr Leu Val Ile Asn Thr Val Thr Pro Leu Val Leu Asp 1155 1160 1165 1170 aac agc aag act tat aag att gtt gta agt gga gtt aaa gat gca gca 3648 Asn Ser Lys Thr Tyr Lys Ile Val Val Ser Gly Val Lys Asp Ala Ala 1175 1180 1185 ggt aat gtt gca gat act att aca ttc tat att aag taa 3687 Gly Asn Val Ala Asp Thr Ile Thr Phe Tyr Ile Lys 1190 1195 2 1228 PRT Bacillus stearothermophilus 2 Met Asp Arg Lys Lys Ala Val Lys Leu Ala Thr Ala Ser Ala Ile Ala -30 -25 -20 -15 Ala Ser Ala Phe Val Ala Ala Asn Pro Asn Ala Ser Glu Ala Ala Thr -10 -5 -1 1 Asp Val Ala Thr Val Val Ser Gln Ala Lys Ala Gln Phe Lys Lys Ala 5 10 15 Tyr Tyr Thr Tyr Ser His Thr Val Thr Glu Thr Gly Glu Phe Pro Asn 20 25 30 Ile Asn Asp Val Tyr Ala Glu Tyr Asn Lys Ala Lys Lys Arg Tyr Arg 35 40 45 50 Asp Ala Val Ala Leu Val Asn Lys Ala Gly Gly Ala Lys Lys Asp Ala 55 60 65 Tyr Leu Ala Asp Leu Gln Lys Glu Tyr Glu Thr Tyr Val Phe Lys Ala 70 75 80 Asn Pro Lys Ser Gly Glu Ala Arg Val Ala Thr Tyr Ile Asp Ala Tyr 85 90 95 Asn Tyr Ala Thr Lys Leu Asp Glu Met Arg Gln Glu Leu Glu Ala Ala 100 105 110 Val Gln Ala Lys Asp Leu Glu Lys Ala Glu Gln Tyr Tyr His Lys Ile 115 120 125 130 Pro Tyr Glu Ile Lys Thr Arg Thr Val Ile Leu Asp Arg Val Tyr Gly 135 140 145 Lys Thr Thr Arg Asp Leu Leu Arg Ser Thr Phe Lys Ala Lys Ala Gln 150 155 160 Glu Leu Arg Asp Ser Leu Ile Tyr Asp Ile Thr Val Ala Met Lys Ala 165 170 175 Arg Glu Val Gln Asp Ala Val Lys Ala Gly Asn Leu Asp Lys Ala Lys 180 185 190 Ala Ala Val Asp Gln Ile Asn Gln Tyr Leu Pro Lys Val Thr Asp Ala 195 200 205 210 Phe Lys Thr Glu Leu Thr Glu Val Ala Lys Lys Ala Leu Asp Ala Asp 215 220 225 Glu Ala Ala Leu Thr Pro Lys Val Glu Ser Val Ser Ala Ile Asn Thr 230 235 240 Gln Asn Lys Ala Val Glu Leu Thr Ala Val Pro Val Asn Gly Thr Leu 245 250 255 Lys Leu Gln Leu Ser Ala Ala Ala Asn Glu Asp Thr Val Asn Val Asn 260 265 270 Thr Val Arg Ile Tyr Lys Val Asp Gly Asn Ile Pro Phe Ala Leu Asn 275 280 285 290 Thr Ala Asp Val Ser Leu Ser Thr Asp Gly Lys Thr Ile Thr Val Asp 295 300 305 Ala Ser Thr Pro Phe Glu Asn Asn Thr Glu Tyr Lys Val Val Val Lys 310 315 320 Gly Ile Lys Asp Lys Asn Gly Lys Glu Phe Lys Glu Asp Ala Phe Thr 325 330 335 Phe Lys Leu Arg Asn Asp Ala Val Val Thr Gln Val Phe Gly Thr Asn 340 345 350 Val Thr Asn Asn Thr Ser Val Asn Leu Ala Ala Gly Thr Phe Asp Thr 355 360 365 370 Asp Asp Thr Leu Thr Val Val Phe Asp Lys Leu Leu Ala Pro Glu Thr 375 380 385 Val Asn Ser Ser Asn Val Thr Ile Thr Asp Val Glu Thr Gly Lys Arg 390 395 400 Ile Pro Val Ile Ala Ser Thr Ser Gly Ser Thr Ile Thr Ile Thr Leu 405 410 415 Lys Glu Ala Leu Val Thr Gly Lys Gln Tyr Lys Leu Ala Ile Asn Asn 420 425 430 Val Lys Thr Leu Thr Gly Tyr Asn Ala Glu Ala Tyr Glu Leu Val Phe 435 440 445 450 Thr Ala Asn Ala Ser Ala Pro Thr Val Ala Thr Ala Pro Thr Thr Leu 455 460 465 Gly Gly Thr Thr Leu Ser Thr Gly Ser Leu Thr Thr Asn Val Trp Gly 470 475 480 Lys Leu Ala Gly Gly Val Asn Glu Ala Gly Thr Tyr Tyr Pro Gly Leu 485 490 495 Gln Phe Thr Thr Thr Phe Ala Thr Lys Leu Asp Glu Ser Thr Leu Ala 500 505 510 Asp Asn Phe Val Leu Val Glu Lys Glu Ser Gly Thr Val Val Ala Ser 515 520 525 530 Glu Leu Lys Tyr Asn Ala Asp Ala Lys Met Val Thr Leu Val Pro Lys 535 540 545 Ala Asp Leu Lys Glu Asn Thr Ile Tyr Gln Ile Lys Ile Lys Lys Gly 550 555 560 Leu Lys Ser Asp Lys Gly Ile Glu Leu Gly Thr Val Asn Glu Lys Thr 565 570 575 Tyr Glu Phe Lys Thr Gln Asp Leu Thr Ala Pro Thr Val Ile Ser Val 580 585 590 Thr Ser Lys Asn Gly Asp Ala Gly Leu Lys Val Thr Glu Ala Gln Glu 595 600 605 610 Phe Thr Val Lys Phe Ser Glu Asn Leu Asn Thr Phe Asn Ala Thr Thr 615 620 625 Val Ser Gly Ser Thr Ile Thr Tyr Gly Gln Val Ala Val Val Lys Ala 630 635 640 Gly Ala Asn Leu Ser Ala Leu Thr Ala Ser Asp Ile Ile Pro Ala Ser 645 650 655 Val Glu Ala Val Thr Gly Gln Asp Gly Thr Tyr Lys Val Lys Val Ala 660 665 670 Ala Asn Gln Leu Glu Arg Asn Gln Gly Tyr Lys Leu Val Val Phe Gly 675 680 685 690 Lys Gly Ala Thr Ala Pro Val Lys Asp Ala Ala Asn Ala Asn Thr Leu 695 700 705 Ala Thr Asn Tyr Ile Tyr Thr Phe Thr Thr Glu Gly Gln Asp Val Thr 710 715 720 Ala Pro Thr Val Thr Lys Val Phe Lys Gly Asp Ser Leu Lys Asp Ala 725 730 735 Asp Ala Val Thr Thr Leu Thr Asn Val Asp Ala Gly Gln Lys Phe Thr 740 745 750 Ile Gln Phe Ser Glu Glu Leu Lys Thr Ser Ser Gly Ser Leu Val Gly 755 760 765 770 Gly Lys Val Thr Val Glu Lys Leu Thr Asn Asn Gly Trp Val Asp Ala 775 780 785 Gly Thr Gly Thr Thr Val Ser Val Ala Pro Lys Thr Asp Ala Asn Gly 790 795 800 Lys Val Thr Ala Ala Val Val Thr Leu Thr Gly Leu Asp Asn Asn Asp 805 810 815 Lys Asp Ala Lys Leu Arg Leu Val Val Asp Lys Ser Ser Thr Asp Gly 820 825 830 Ile Ala Asp Val Ala Gly Asn Val Ile Lys Glu Lys Asp Ile Leu Ile 835 840 845 850 Arg Tyr Asn Ser Trp Arg His Thr Val Ala Ser Val Lys Ala Ala Ala 855 860 865 Asp Lys Asp Gly Gln Asn Ala Ser Ala Ala Phe Pro Thr Ser Thr Ala 870 875 880 Ile Asp Thr Thr Lys Ser Leu Leu Val Glu Phe Asn Glu Thr Asp Leu 885 890 895 Ala Glu Val Lys Pro Glu Asn Ile Val Val Lys Asp Ala Ala Gly Asn 900 905 910 Ala Val Ala Gly Thr Val Thr Ala Leu Asp Gly Ser Thr Asn Lys Phe 915 920 925 930 Val Phe Thr Pro Ser Gln Glu Leu Lys Ala Gly Thr Val Tyr Ser Val 935 940 945 Thr Ile Asp Gly Val Arg Asp Lys Val Gly Asn Thr Ile Ser Lys Tyr 950 955 960 Ile Thr Ser Phe Lys Thr Val Ser Ala Asn Pro Thr Leu Ser Ser Ile 965 970 975 Ser Ile Ala Asp Gly Ala Val Asn Val Asp Arg Ser Lys Thr Ile Thr 980 985 990 Ile Glu Phe Ser Asp Ser Val Pro Asn Pro Thr Ile Thr Leu Lys Lys 995 1000 1005 1010 Ala Asp Gly Thr Ser Phe Thr Asn Tyr Thr Leu Val Asn Val Asn Asn 1015 1020 1025 Glu Asn Lys Thr Tyr Lys Ile Val Phe His Lys Gly Val Thr Leu Asp 1030 1035 1040 Glu Phe Thr Gln Tyr Glu Leu Ala Val Ser Lys Asp Phe Gln Thr Gly 1045 1050 1055 Thr Asp Ile Asp Ser Lys Val Thr Phe Ile Thr Gly Ser Val Ala Thr 1060 1065 1070 Asp Glu Val Lys Pro Ala Leu Val Gly Val Gly Ser Trp Asn Gly Thr 1075 1080 1085 1090 Ser Tyr Thr Gln Asp Ala Ala Ala Thr Arg Leu Arg Ser Val Ala Asp 1095 1100 1105 Phe Val Ala Glu Pro Val Ala Leu Gln Phe Ser Glu Gly Ile Asp Leu 1110 1115 1120 Thr Asn Ala Thr Val Thr Val Thr Asn Ile Thr Asp Asp Lys Thr Val 1125 1130 1135 Glu Val Ile Ser Lys Glu Ser Val Asp Ala Asp His Asp Ala Gly Ala 1140 1145 1150 Thr Lys Glu Thr Leu Val Ile Asn Thr Val Thr Pro Leu Val Leu Asp 1155 1160 1165 1170 Asn Ser Lys Thr Tyr Lys Ile Val Val Ser Gly Val Lys Asp Ala Ala 1175 1180 1185 Gly Asn Val Ala Asp Thr Ile Thr Phe Tyr Ile Lys 1190 1195 3 33 DNA Artificial Sequence Description of Artificial Sequence synthetic primer 3 ttaatcgatt ctagatggat aggaaaaaag ctg 33 4 37 DNA Artificial Sequence Description of Artificial Sequence synthetic primer 4 atacccgggg gtacggatcc gatacagatt tgagcaa 37 5 2766 DNA Bacillus stearothermophilus CDS (1)..(2763) sig_peptide (1)..(93) mat_peptide (94)..(2763) 5 atg gct tat caa cct aag tct ttt cgc aag ttt gtt gcg aca act gca 48 Met Ala Tyr Gln Pro Lys Ser Phe Arg Lys Phe Val Ala Thr Thr Ala -30 -25 -20 aca gct gcc att gta gca tct gcg gta gct cct gta gta tct gca gca 96 Thr Ala Ala Ile Val Ala Ser Ala Val Ala Pro Val Val Ser Ala Ala -15 -10 -5 -1 1 agc ttc aca gat gtt gcg ccg caa tat aaa gat gcg atc gat ttc tta 144 Ser Phe Thr Asp Val Ala Pro Gln Tyr Lys Asp Ala Ile Asp Phe Leu 5 10 15 gta tca act ggt gca aca aaa ggt aaa aca gaa aca aaa ttc ggc gtt 192 Val Ser Thr Gly Ala Thr Lys Gly Lys Thr Glu Thr Lys Phe Gly Val 20 25 30 tac gat gaa atc act cgt cta gat gcg gca gtt att ctt gca aga gta 240 Tyr Asp Glu Ile Thr Arg Leu Asp Ala Ala Val Ile Leu Ala Arg Val 35 40 45 tta aaa cta gac gtt gac aac gca aaa gac gca ggc ttc aca gat gtg 288 Leu Lys Leu Asp Val Asp Asn Ala Lys Asp Ala Gly Phe Thr Asp Val 50 55 60 65 cca aaa gac cgt gca aaa tac gtc aac gcg ctt gta gaa gct ggc gta 336 Pro Lys Asp Arg Ala Lys Tyr Val Asn Ala Leu Val Glu Ala Gly Val 70 75 80 tta aac ggt aaa gca cct ggc aaa ttt ggt gca tac gac cca tta act 384 Leu Asn Gly Lys Ala Pro Gly Lys Phe Gly Ala Tyr Asp Pro Leu Thr 85 90 95 cgc gtt gaa atg gca aaa atc atc gcg aac cgt tac aaa tta aaa gct 432 Arg Val Glu Met Ala Lys Ile Ile Ala Asn Arg Tyr Lys Leu Lys Ala 100 105 110 gac gat gta aaa ctt cca ttc act gat gta aac gat aca tgg gca cca 480 Asp Asp Val Lys Leu Pro Phe Thr Asp Val Asn Asp Thr Trp Ala Pro 115 120 125 tac gta aaa gcg ctt tat aaa tac gaa gta acc aaa agg tta aaa cac 528 Tyr Val Lys Ala Leu Tyr Lys Tyr Glu Val Thr Lys Arg Leu Lys His 130 135 140 145 caa caa gct tcg gtg cat acc aaa aac atc act ctg cgt gac ttt gcg 576 Gln Gln Ala Ser Val His Thr Lys Asn Ile Thr Leu Arg Asp Phe Ala 150 155 160 caa ttt gta tat aga gcg gtg aat att aat gca gtg cca gaa ata gtt 624 Gln Phe Val Tyr Arg Ala Val Asn Ile Asn Ala Val Pro Glu Ile Val 165 170 175 gaa gta act gcg gtt aat tcg act aca gtg aaa gta aca ttc aat acg 672 Glu Val Thr Ala Val Asn Ser Thr Thr Val Lys Val Thr Phe Asn Thr 180 185 190 caa att gct gat gtt gat ttc aca aat ttt gct atc gat aac ggt tta 720 Gln Ile Ala Asp Val Asp Phe Thr Asn Phe Ala Ile Asp Asn Gly Leu 195 200 205 act gtt act aaa gca act ctt tct cgt gat aaa aaa tcc gta gag gtt 768 Thr Val Thr Lys Ala Thr Leu Ser Arg Asp Lys Lys Ser Val Glu Val 210 215 220 225 gtg gta aat aaa ccg ttt act cgt aat cag gaa tat aca att aca gcg 816 Val Val Asn Lys Pro Phe Thr Arg Asn Gln Glu Tyr Thr Ile Thr Ala 230 235 240 aca ggc att aaa aat tta aaa ggc gag acc gct aag gaa tta act ggt 864 Thr Gly Ile Lys Asn Leu Lys Gly Glu Thr Ala Lys Glu Leu Thr Gly 245 250 255 aag ttt gtt tgg tct gtt caa gat gcg gta act gtt gca cta aat aat 912 Lys Phe Val Trp Ser Val Gln Asp Ala Val Thr Val Ala Leu Asn Asn 260 265 270 agt tcg ctt aaa gtt gga gag gaa tct ggt tta act gta aaa gat cag 960 Ser Ser Leu Lys Val Gly Glu Glu Ser Gly Leu Thr Val Lys Asp Gln 275 280 285 gat ggc aaa gat gtt gta ggt gct aaa gta gaa ctt act tct tct aat 1008 Asp Gly Lys Asp Val Val Gly Ala Lys Val Glu Leu Thr Ser Ser Asn 290 295 300 305 act aat att gtt gta gtt tca agt ggc gaa gta tca gta tct gct gct 1056 Thr Asn Ile Val Val Val Ser Ser Gly Glu Val Ser Val Ser Ala Ala 310 315 320 aaa gtt aca gct gta aaa ccg gga aca gct gat gtt act gca aaa gtt 1104 Lys Val Thr Ala Val Lys Pro Gly Thr Ala Asp Val Thr Ala Lys Val 325 330 335 aca tta cca gat ggt gtt gta cta aca aat aca ttt aaa gtg aca gtt 1152 Thr Leu Pro Asp Gly Val Val Leu Thr Asn Thr Phe Lys Val Thr Val 340 345 350 aca gaa gtg cct gtt caa gtc caa aat caa gga ttt act tta gtt gat 1200 Thr Glu Val Pro Val Gln Val Gln Asn Gln Gly Phe Thr Leu Val Asp 355 360 365 aat ctt tct aat gct cca cag aat aca gtt gca ttt aac aaa gct gag 1248 Asn Leu Ser Asn Ala Pro Gln Asn Thr Val Ala Phe Asn Lys Ala Glu 370 375 380 385 aaa gta act tca atg ttt gct gga gaa act aaa aca gtt gca atg tat 1296 Lys Val Thr Ser Met Phe Ala Gly Glu Thr Lys Thr Val Ala Met Tyr 390 395 400 gat act aaa aac ggt gat cct gaa act aaa cct gtt gat ttc aaa gat 1344 Asp Thr Lys Asn Gly Asp Pro Glu Thr Lys Pro Val Asp Phe Lys Asp 405 410 415 gca act gta cgt tca tta aat cca att att gca aca gct gct att aat 1392 Ala Thr Val Arg Ser Leu Asn Pro Ile Ile Ala Thr Ala Ala Ile Asn 420 425 430 ggt agt gag ctc ctt gtc aca gct aat gct ggc caa tct gga aaa gct 1440 Gly Ser Glu Leu Leu Val Thr Ala Asn Ala Gly Gln Ser Gly Lys Ala 435 440 445 tca ttt gaa gta aca tta aaa gat aat aca aaa aga aca ttt aca gtt 1488 Ser Phe Glu Val Thr Leu Lys Asp Asn Thr Lys Arg Thr Phe Thr Val 450 455 460 465 gat gta aaa aaa gac cct gta tta caa gat ata aaa gta gat gca act 1536 Asp Val Lys Lys Asp Pro Val Leu Gln Asp Ile Lys Val Asp Ala Thr 470 475 480 tct gtt aaa ctt tcc gat gaa gct gtt ggc ggc ggg gaa gtt gaa gga 1584 Ser Val Lys Leu Ser Asp Glu Ala Val Gly Gly Gly Glu Val Glu Gly 485 490 495 gtt aac caa aaa acg att aaa gta agt gca gtt gac caa tac ggt aaa 1632 Val Asn Gln Lys Thr Ile Lys Val Ser Ala Val Asp Gln Tyr Gly Lys 500 505 510 gaa att aaa ttt ggt aca aaa ggt aaa gtt act gtt aca act aat aca 1680 Glu Ile Lys Phe Gly Thr Lys Gly Lys Val Thr Val Thr Thr Asn Thr 515 520 525 gaa gga cta gtt att aaa aat gta aat agc gat aat aca att gac ttt 1728 Glu Gly Leu Val Ile Lys Asn Val Asn Ser Asp Asn Thr Ile Asp Phe 530 535 540 545 gat agc ggc aat agt gca act gac caa ttt gtt gtc gtt gca aca aaa 1776 Asp Ser Gly Asn Ser Ala Thr Asp Gln Phe Val Val Val Ala Thr Lys 550 555 560 gac aaa att gtc aat ggt aaa gta gaa gtt aaa tat ttc aaa aat gct 1824 Asp Lys Ile Val Asn Gly Lys Val Glu Val Lys Tyr Phe Lys Asn Ala 565 570 575 agt gac aca aca cca act tca act aaa aca att act gtt aat gta gta 1872 Ser Asp Thr Thr Pro Thr Ser Thr Lys Thr Ile Thr Val Asn Val Val 580 585 590 aat gta aaa gct gac gct aca cca gta gga tta gat att gta gca cct 1920 Asn Val Lys Ala Asp Ala Thr Pro Val Gly Leu Asp Ile Val Ala Pro 595 600 605 tct aaa att gat gta aat gct cca aac act gct tct act gca gat gtt 1968 Ser Lys Ile Asp Val Asn Ala Pro Asn Thr Ala Ser Thr Ala Asp Val 610 615 620 625 gat ttt ata aat ttc gaa agt gtt gag att tac aca ctc gat tca aat 2016 Asp Phe Ile Asn Phe Glu Ser Val Glu Ile Tyr Thr Leu Asp Ser Asn 630 635 640 ggt aga cgt caa aaa aaa gtt act cca act gca act aca ctt gta ggt 2064 Gly Arg Arg Gln Lys Lys Val Thr Pro Thr Ala Thr Thr Leu Val Gly 645 650 655 aca aaa aaa aaa aaa aaa gtt aat ggg aat gta tta caa ttc aag ggg 2112 Thr Lys Lys Lys Lys Lys Val Asn Gly Asn Val Leu Gln Phe Lys Gly 660 665 670 aac gaa gaa tta acg cta tca act tct tct agt aca gga aac gta gat 2160 Asn Glu Glu Leu Thr Leu Ser Thr Ser Ser Ser Thr Gly Asn Val Asp 675 680 685 gga aca gca gaa gga atg aca aaa cgt att cca ggg aaa tat atc aac 2208 Gly Thr Ala Glu Gly Met Thr Lys Arg Ile Pro Gly Lys Tyr Ile Asn 690 695 700 705 tct gca agt gta cct gcc agt gca aca gta gca aca agt cct gtt act 2256 Ser Ala Ser Val Pro Ala Ser Ala Thr Val Ala Thr Ser Pro Val Thr 710 715 720 gta aag ctt aat tca agt gat aat gat tta aca ttt gaa gaa tta ata 2304 Val Lys Leu Asn Ser Ser Asp Asn Asp Leu Thr Phe Glu Glu Leu Ile 725 730 735 ttc ggt gta att gac cct aca caa tta gtc aaa gat gaa gac atc aac 2352 Phe Gly Val Ile Asp Pro Thr Gln Leu Val Lys Asp Glu Asp Ile Asn 740 745 750 gaa ttt att gca gtt tca aaa gcg gct aaa aat gat gga tat ttg tat 2400 Glu Phe Ile Ala Val Ser Lys Ala Ala Lys Asn Asp Gly Tyr Leu Tyr 755 760 765 aat aaa ccg ctt gta acg gtt aaa gat gca tca gga aaa gtt att cca 2448 Asn Lys Pro Leu Val Thr Val Lys Asp Ala Ser Gly Lys Val Ile Pro 770 775 780 785 aca ggt gca aat gtt tac ggt cta aat cat gat gca act aac gga aac 2496 Thr Gly Ala Asn Val Tyr Gly Leu Asn His Asp Ala Thr Asn Gly Asn 790 795 800 att tgg ttt gat gag gaa caa gct ggc tta gct aaa aaa ttt agt gat 2544 Ile Trp Phe Asp Glu Glu Gln Ala Gly Leu Ala Lys Lys Phe Ser Asp 805 810 815 gta cat ttt gat gtt gat ttt tca tta act aac gtt gta aaa act ggt 2592 Val His Phe Asp Val Asp Phe Ser Leu Thr Asn Val Val Lys Thr Gly 820 825 830 agc ggt aca gtt tct tca tcg cca tca tta tct gac gca att caa ctt 2640 Ser Gly Thr Val Ser Ser Ser Pro Ser Leu Ser Asp Ala Ile Gln Leu 835 840 845 act aat tca ggc gat gca gta tcg ttt aca tta gtt atc aaa tca att 2688 Thr Asn Ser Gly Asp Ala Val Ser Phe Thr Leu Val Ile Lys Ser Ile 850 855 860 865 tat gtt aaa ggc gca gat aaa gat gat aat aac tta ctt gca gcc cct 2736 Tyr Val Lys Gly Ala Asp Lys Asp Asp Asn Asn Leu Leu Ala Ala Pro 870 875 880 gtt tct gtc aat gtg act gtg aca aaa taa 2766 Val Ser Val Asn Val Thr Val Thr Lys 885 890 6 921 PRT Bacillus stearothermophilus 6 Met Ala Tyr Gln Pro Lys Ser Phe Arg Lys Phe Val Ala Thr Thr Ala -30 -25 -20 Thr Ala Ala Ile Val Ala Ser Ala Val Ala Pro Val Val Ser Ala Ala -15 -10 -5 -1 1 Ser Phe Thr Asp Val Ala Pro Gln Tyr Lys Asp Ala Ile Asp Phe Leu 5 10 15 Val Ser Thr Gly Ala Thr Lys Gly Lys Thr Glu Thr Lys Phe Gly Val 20 25 30 Tyr Asp Glu Ile Thr Arg Leu Asp Ala Ala Val Ile Leu Ala Arg Val 35 40 45 Leu Lys Leu Asp Val Asp Asn Ala Lys Asp Ala Gly Phe Thr Asp Val 50 55 60 65 Pro Lys Asp Arg Ala Lys Tyr Val Asn Ala Leu Val Glu Ala Gly Val 70 75 80 Leu Asn Gly Lys Ala Pro Gly Lys Phe Gly Ala Tyr Asp Pro Leu Thr 85 90 95 Arg Val Glu Met Ala Lys Ile Ile Ala Asn Arg Tyr Lys Leu Lys Ala 100 105 110 Asp Asp Val Lys Leu Pro Phe Thr Asp Val Asn Asp Thr Trp Ala Pro 115 120 125 Tyr Val Lys Ala Leu Tyr Lys Tyr Glu Val Thr Lys Arg Leu Lys His 130 135 140 145 Gln Gln Ala Ser Val His Thr Lys Asn Ile Thr Leu Arg Asp Phe Ala 150 155 160 Gln Phe Val Tyr Arg Ala Val Asn Ile Asn Ala Val Pro Glu Ile Val 165 170 175 Glu Val Thr Ala Val Asn Ser Thr Thr Val Lys Val Thr Phe Asn Thr 180 185 190 Gln Ile Ala Asp Val Asp Phe Thr Asn Phe Ala Ile Asp Asn Gly Leu 195 200 205 Thr Val Thr Lys Ala Thr Leu Ser Arg Asp Lys Lys Ser Val Glu Val 210 215 220 225 Val Val Asn Lys Pro Phe Thr Arg Asn Gln Glu Tyr Thr Ile Thr Ala 230 235 240 Thr Gly Ile Lys Asn Leu Lys Gly Glu Thr Ala Lys Glu Leu Thr Gly 245 250 255 Lys Phe Val Trp Ser Val Gln Asp Ala Val Thr Val Ala Leu Asn Asn 260 265 270 Ser Ser Leu Lys Val Gly Glu Glu Ser Gly Leu Thr Val Lys Asp Gln 275 280 285 Asp Gly Lys Asp Val Val Gly Ala Lys Val Glu Leu Thr Ser Ser Asn 290 295 300 305 Thr Asn Ile Val Val Val Ser Ser Gly Glu Val Ser Val Ser Ala Ala 310 315 320 Lys Val Thr Ala Val Lys Pro Gly Thr Ala Asp Val Thr Ala Lys Val 325 330 335 Thr Leu Pro Asp Gly Val Val Leu Thr Asn Thr Phe Lys Val Thr Val 340 345 350 Thr Glu Val Pro Val Gln Val Gln Asn Gln Gly Phe Thr Leu Val Asp 355 360 365 Asn Leu Ser Asn Ala Pro Gln Asn Thr Val Ala Phe Asn Lys Ala Glu 370 375 380 385 Lys Val Thr Ser Met Phe Ala Gly Glu Thr Lys Thr Val Ala Met Tyr 390 395 400 Asp Thr Lys Asn Gly Asp Pro Glu Thr Lys Pro Val Asp Phe Lys Asp 405 410 415 Ala Thr Val Arg Ser Leu Asn Pro Ile Ile Ala Thr Ala Ala Ile Asn 420 425 430 Gly Ser Glu Leu Leu Val Thr Ala Asn Ala Gly Gln Ser Gly Lys Ala 435 440 445 Ser Phe Glu Val Thr Leu Lys Asp Asn Thr Lys Arg Thr Phe Thr Val 450 455 460 465 Asp Val Lys Lys Asp Pro Val Leu Gln Asp Ile Lys Val Asp Ala Thr 470 475 480 Ser Val Lys Leu Ser Asp Glu Ala Val Gly Gly Gly Glu Val Glu Gly 485 490 495 Val Asn Gln Lys Thr Ile Lys Val Ser Ala Val Asp Gln Tyr Gly Lys 500 505 510 Glu Ile Lys Phe Gly Thr Lys Gly Lys Val Thr Val Thr Thr Asn Thr 515 520 525 Glu Gly Leu Val Ile Lys Asn Val Asn Ser Asp Asn Thr Ile Asp Phe 530 535 540 545 Asp Ser Gly Asn Ser Ala Thr Asp Gln Phe Val Val Val Ala Thr Lys 550 555 560 Asp Lys Ile Val Asn Gly Lys Val Glu Val Lys Tyr Phe Lys Asn Ala 565 570 575 Ser Asp Thr Thr Pro Thr Ser Thr Lys Thr Ile Thr Val Asn Val Val 580 585 590 Asn Val Lys Ala Asp Ala Thr Pro Val Gly Leu Asp Ile Val Ala Pro 595 600 605 Ser Lys Ile Asp Val Asn Ala Pro Asn Thr Ala Ser Thr Ala Asp Val 610 615 620 625 Asp Phe Ile Asn Phe Glu Ser Val Glu Ile Tyr Thr Leu Asp Ser Asn 630 635 640 Gly Arg Arg Gln Lys Lys Val Thr Pro Thr Ala Thr Thr Leu Val Gly 645 650 655 Thr Lys Lys Lys Lys Lys Val Asn Gly Asn Val Leu Gln Phe Lys Gly 660 665 670 Asn Glu Glu Leu Thr Leu Ser Thr Ser Ser Ser Thr Gly Asn Val Asp 675 680 685 Gly Thr Ala Glu Gly Met Thr Lys Arg Ile Pro Gly Lys Tyr Ile Asn 690 695 700 705 Ser Ala Ser Val Pro Ala Ser Ala Thr Val Ala Thr Ser Pro Val Thr 710 715 720 Val Lys Leu Asn Ser Ser Asp Asn Asp Leu Thr Phe Glu Glu Leu Ile 725 730 735 Phe Gly Val Ile Asp Pro Thr Gln Leu Val Lys Asp Glu Asp Ile Asn 740 745 750 Glu Phe Ile Ala Val Ser Lys Ala Ala Lys Asn Asp Gly Tyr Leu Tyr 755 760 765 Asn Lys Pro Leu Val Thr Val Lys Asp Ala Ser Gly Lys Val Ile Pro 770 775 780 785 Thr Gly Ala Asn Val Tyr Gly Leu Asn His Asp Ala Thr Asn Gly Asn 790 795 800 Ile Trp Phe Asp Glu Glu Gln Ala Gly Leu Ala Lys Lys Phe Ser Asp 805 810 815 Val His Phe Asp Val Asp Phe Ser Leu Thr Asn Val Val Lys Thr Gly 820 825 830 Ser Gly Thr Val Ser Ser Ser Pro Ser Leu Ser Asp Ala Ile Gln Leu 835 840 845 Thr Asn Ser Gly Asp Ala Val Ser Phe Thr Leu Val Ile Lys Ser Ile 850 855 860 865 Tyr Val Lys Gly Ala Asp Lys Asp Asp Asn Asn Leu Leu Ala Ala Pro 870 875 880 Val Ser Val Asn Val Thr Val Thr Lys 885 890 7 498 DNA Unknown Organism Description of Unknown Organism streptavidin gene 7 cccatggacc cgtccaagga ctccaaagct caggtttctg cagccgaagc tggtatcact 60 ggcacctggt ataaccaact ggggtcgact ttcattgtga ccgctggtgc ggacggagct 120 ctgactggca cctacgaatc tgcggttggt aacgcagaat cccgctacgt actgactggc 180 cgttatgact ctgcacctgc caccgatggc tctggtaccg ctctgggctg gactgtggct 240 tggaaaaaca actatcgtaa tgcgcacagc gccactacgt ggtctggcca atacgttggc 300 ggtgctgagg ctcgtatcaa cactcagtgg ctgttaacat ccggcactac cgaagcgaat 360 gcatggaaat cgacactagt aggtcatgac acctttacca aagttaagcc ttctgctgct 420 agcattgatg ctgccaagaa agcaggcgta aacaacggta accctctaga cgctgttcag 480 caataataag gatccggg 498 8 29 DNA Artificial Sequence Description of Artificial Sequence synthetic primer 8 ttcatcgtaa acgccgaatt ttgtttctg 29 9 26 DNA Artificial Sequence Description of Artificial Sequence synthetic primer 9 agggaaatat atcaactctg caagtg 26 10 49 DNA Bacillus stearothermophilus 10 gaattcatcg atgtcgacca aggaggtcta gatggatccg gccaagctt 49
Claims (23)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/890,179 US20040265936A1 (en) | 1996-02-01 | 2004-07-14 | Recombinant expression of S-layer proteins |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE19603649.6 | 1996-02-01 | ||
| DE19603649A DE19603649A1 (en) | 1996-02-01 | 1996-02-01 | Recombinant expression of S-layer proteins |
| US09/117,447 US6777202B2 (en) | 1996-02-01 | 1997-01-31 | Recombinant expression of S-layer proteins |
| US10/890,179 US20040265936A1 (en) | 1996-02-01 | 2004-07-14 | Recombinant expression of S-layer proteins |
Related Parent Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/EP1997/000432 Division WO1997028263A1 (en) | 1996-02-01 | 1997-01-31 | Recombinant expression of s-layer proteins |
| US09/117,447 Division US6777202B2 (en) | 1996-02-01 | 1997-01-31 | Recombinant expression of S-layer proteins |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20040265936A1 true US20040265936A1 (en) | 2004-12-30 |
Family
ID=7784274
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US09/117,447 Expired - Fee Related US6777202B2 (en) | 1996-02-01 | 1997-01-31 | Recombinant expression of S-layer proteins |
| US10/890,179 Abandoned US20040265936A1 (en) | 1996-02-01 | 2004-07-14 | Recombinant expression of S-layer proteins |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US09/117,447 Expired - Fee Related US6777202B2 (en) | 1996-02-01 | 1997-01-31 | Recombinant expression of S-layer proteins |
Country Status (10)
| Country | Link |
|---|---|
| US (2) | US6777202B2 (en) |
| EP (1) | EP0882129B1 (en) |
| JP (1) | JP4077878B2 (en) |
| CN (1) | CN1213402A (en) |
| AT (1) | ATE316138T1 (en) |
| AU (1) | AU713999B2 (en) |
| CA (1) | CA2245584C (en) |
| DE (2) | DE19603649A1 (en) |
| NZ (1) | NZ331300A (en) |
| WO (1) | WO1997028263A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070065466A1 (en) * | 2001-02-09 | 2007-03-22 | The Provost, Fellows and Scholars of the College of the Holy and Undivided Trinity of Queen | Clostridium difficile vaccine |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE19603649A1 (en) * | 1996-02-01 | 1997-08-07 | Werner Lubitz | Recombinant expression of S-layer proteins |
| DE19903345A1 (en) * | 1999-01-28 | 2000-08-03 | Werner Lubitz | Compartmentalization of recombinant polypeptides in host cells |
| EP1163001A2 (en) * | 1999-03-24 | 2001-12-19 | The Secretary of State for Defence | Vaccine composition |
| GB9906695D0 (en) * | 1999-03-24 | 1999-05-19 | Secr Defence | Vaccine composition |
| EP1229045A1 (en) * | 2001-02-01 | 2002-08-07 | Institut Curie | Universal carrier for targeting molecules to Gb3 receptor expressing cells |
| WO2002101026A2 (en) | 2001-06-11 | 2002-12-19 | Applied Nanosystems B.V. | Methods for binding acma-type protein anchor fusions to cell-wall material of micro-organisms |
| ITMI20052187A1 (en) * | 2005-11-15 | 2007-05-16 | Sunbird Investiments Ltd | CONTROL DEVICE OF THE FLOW OF A FLUID |
| DE102009032646A1 (en) * | 2009-07-03 | 2011-01-05 | Forschungszentrum Dresden - Rossendorf E.V. | E. coli secretion system based on S-layer proteins |
| EP2485764A4 (en) * | 2009-10-09 | 2014-01-29 | Childrens Medical Center | WHOLE CELL VACCINE WITH SELECTIVE DISSOCIATION |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE4005874A1 (en) * | 1990-02-24 | 1991-11-07 | Boehringer Mannheim Gmbh | CARRIER-TIED RECOMBINANT PROTEINS, METHOD FOR THE PRODUCTION AND USE AS IMMUNOGENIC AND VACCINE |
| US5783441A (en) * | 1991-08-09 | 1998-07-21 | The United States Of America As Represented By The Secretary Of The Navy | Gene and protein applicable to the preparation of vaccines for rickettsia prowazekii and rickettsia typhi and the detection of both |
| CA2090549C (en) * | 1992-06-09 | 2008-01-15 | John Smit | Bacterial surface protein expression |
| US5976864A (en) * | 1992-06-09 | 1999-11-02 | The University Of British Columbia | Expression and secretion of heterologous polypeptides from caulobacter |
| WO1997034000A1 (en) * | 1992-06-09 | 1997-09-18 | The University Of British Columbia | Expression and secretion of heterologous polypeptides from caulobacter |
| US6210948B1 (en) * | 1992-06-09 | 2001-04-03 | The University Of British Columbia | Expression and secretion of heterologous polypeptides from caulobacter |
| GB9400650D0 (en) * | 1994-01-14 | 1994-03-09 | Smithkline Beecham Biolog | Expression of surface lazer proteins |
| DE4425527A1 (en) * | 1994-07-19 | 1996-01-25 | Vogelbusch Gmbh | Nucleic acid encoding signal peptide of Bacillus stearothermophilus S-layer protein |
| DE19603649A1 (en) * | 1996-02-01 | 1997-08-07 | Werner Lubitz | Recombinant expression of S-layer proteins |
| DE19732829A1 (en) * | 1997-07-30 | 1999-02-04 | Lubitz Werner Prof Dr | Secretion of carrier-bound proteins in the periplasm and in the extracellular space |
| CA2237704A1 (en) * | 1998-07-14 | 2000-01-14 | University Of British Columbia | Cleavage of caulobacter produced recombinant fusion proteins |
| DE19903345A1 (en) * | 1999-01-28 | 2000-08-03 | Werner Lubitz | Compartmentalization of recombinant polypeptides in host cells |
-
1996
- 1996-02-01 DE DE19603649A patent/DE19603649A1/en not_active Ceased
-
1997
- 1997-01-31 AU AU17203/97A patent/AU713999B2/en not_active Ceased
- 1997-01-31 JP JP52730797A patent/JP4077878B2/en not_active Expired - Fee Related
- 1997-01-31 WO PCT/EP1997/000432 patent/WO1997028263A1/en not_active Ceased
- 1997-01-31 EP EP97904360A patent/EP0882129B1/en not_active Expired - Lifetime
- 1997-01-31 CA CA002245584A patent/CA2245584C/en not_active Expired - Fee Related
- 1997-01-31 AT AT97904360T patent/ATE316138T1/en not_active IP Right Cessation
- 1997-01-31 CN CN97192940A patent/CN1213402A/en active Pending
- 1997-01-31 US US09/117,447 patent/US6777202B2/en not_active Expired - Fee Related
- 1997-01-31 DE DE59712554T patent/DE59712554D1/en not_active Expired - Lifetime
- 1997-01-31 NZ NZ331300A patent/NZ331300A/en not_active IP Right Cessation
-
2004
- 2004-07-14 US US10/890,179 patent/US20040265936A1/en not_active Abandoned
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070065466A1 (en) * | 2001-02-09 | 2007-03-22 | The Provost, Fellows and Scholars of the College of the Holy and Undivided Trinity of Queen | Clostridium difficile vaccine |
Also Published As
| Publication number | Publication date |
|---|---|
| JP4077878B2 (en) | 2008-04-23 |
| CA2245584C (en) | 2009-01-06 |
| JP2000503850A (en) | 2000-04-04 |
| DE59712554D1 (en) | 2006-04-06 |
| NZ331300A (en) | 1999-06-29 |
| DE19603649A1 (en) | 1997-08-07 |
| ATE316138T1 (en) | 2006-02-15 |
| US6777202B2 (en) | 2004-08-17 |
| EP0882129A1 (en) | 1998-12-09 |
| AU1720397A (en) | 1997-08-22 |
| WO1997028263A1 (en) | 1997-08-07 |
| AU713999B2 (en) | 1999-12-16 |
| CA2245584A1 (en) | 1997-08-07 |
| EP0882129B1 (en) | 2006-01-18 |
| US20020168728A1 (en) | 2002-11-14 |
| CN1213402A (en) | 1999-04-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR100316004B1 (en) | Method and plasmid for production of crm protein and diphtheria toxin | |
| EP0722461B1 (en) | Expression of fusion polypeptides transported out of the cytoplasm without leader sequences | |
| CA2168459C (en) | Fusion proteins containing the c-terminal of tetanus toxin linked to a heterologous protein | |
| CA2413045C (en) | Expression system | |
| US6777202B2 (en) | Recombinant expression of S-layer proteins | |
| KR100338894B1 (en) | Vaccine compositions | |
| JPH0376580A (en) | Escherichia coli manifestation vector and production of antiviral protein using the same | |
| JP2842693B2 (en) | Methods and DNA expression systems for overexpression of proteins in host cells | |
| EP1221317A1 (en) | Vaccines containing hybrid polypeptides consisting of at least two different allergenic proteins | |
| WO1991015587A1 (en) | Self-polymerising expression system based on modified potyvirus coat proteins | |
| AU747328B2 (en) | Secretion of carrier-bonded proteins into the periplasma and the extracellular space | |
| CA2359210C (en) | Compartmentalization of recombinant polypeptides in host cells | |
| EP0400973B1 (en) | Mycobacterial secretory expression vectors and transformants | |
| US6015696A (en) | Mycobacterial secretory expression vectors and transformants | |
| WO1996026282A1 (en) | Expression of gene products from genetically manipulated strains of bordetella | |
| Engel et al. | Enzymatic preparation of 1, 6-anhydro-muropeptides by immobilized murein hydrolases from Escherichia coli fused to staphylococcal protein A | |
| CN103214561A (en) | Human hepatitis c virus core antigen and preparation method and application thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: LUBITZ, WERNER,AUSTRIA Free format text: CHANGE OF ADDRESS OF ASSIGNEE;ASSIGNOR:LUBITZ, WERNER;REEL/FRAME:018231/0927 Effective date: 20060907 Owner name: LUBITZ, WERNER, AUSTRIA Free format text: CHANGE OF ADDRESS OF ASSIGNEE;ASSIGNOR:LUBITZ, WERNER;REEL/FRAME:018231/0927 Effective date: 20060907 |
|
| AS | Assignment |
Owner name: LUBITZ, WERNER, AUSTRIA Free format text: RE-RECORD FOR CORRECTION TO CHANGE OF ADDRESS OF ASSIGNEE AND ASSIGNEE ADDRESS IS SPELLED WRONG. R/F 018231/0927;ASSIGNOR:LUBITZ, WERNER;REEL/FRAME:019588/0401 Effective date: 20060907 Owner name: LUBITZ, WERNER,AUSTRIA Free format text: RE-RECORD FOR CORRECTION TO CHANGE OF ADDRESS OF ASSIGNEE AND ASSIGNEE ADDRESS IS SPELLED WRONG. R/F 018231/0927;ASSIGNOR:LUBITZ, WERNER;REEL/FRAME:019588/0401 Effective date: 20060907 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |